BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein
database searches with composition-based statistics and other
refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (378 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote...   763   0.0
UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ...   374   e-102
UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour...   322   1e-86
UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact...   299   1e-79
UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher...   285   2e-75
UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter...   284   3e-75
UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a...   276   1e-72
UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re...   275   2e-72
UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_...   266   6e-70
UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium...   253   8e-66
UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar...   244   5e-63
UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ...   238   3e-61
UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie...   229   2e-58
UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu...   228   4e-58
UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl...   223   8e-57
UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec...   222   2e-56
UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s...   221   4e-56
UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro...   220   5e-56
UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui...   217   4e-55
UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V...   217   5e-55
UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE   214   4e-54
UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c...   213   6e-54
UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma...   212   2e-53
UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep...   212   2e-53
UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C...   211   4e-53
UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep...   211   4e-53
UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G...   210   6e-53
UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4...   209   1e-52
UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot...   208   2e-52
UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re...   208   3e-52
UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V...   206   8e-52
UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi...   202   1e-50
UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre...   201   4e-50
UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu...   199   1e-49
UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C...   198   3e-49
UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi...   194   4e-48
UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c...   194   4e-48
UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5...   189   2e-46
UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme...   187   5e-46
UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor...   182   3e-44
UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1...   181   5e-44
UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep...   181   6e-44
UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B...   174   4e-42
UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI...   172   2e-41
UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re...   171   4e-41
UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta...   170   9e-41
UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca...   167   8e-40
UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID...   166   1e-39
UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo...   165   2e-39
UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi...   164   5e-39
UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q...   163   1e-38
UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot...   159   1e-37
UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo...   158   4e-37
UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7...   157   5e-37
UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv...   157   7e-37
UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot...   156   1e-36
UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte...   149   2e-34
UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re...   149   2e-34
UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX     147   5e-34
UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec...   144   6e-33
UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig...   143   1e-32
UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi...   143   1e-32
UniRef50_C3KML6 Putative transposase for insertion sequence NGRI...   140   8e-32
UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ...   138   3e-31
UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K...   137   5e-31
UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID...   133   1e-29
UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy...   132   2e-29
UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A...   126   2e-27
UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro...   121   3e-26
UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_...   117   5e-25
UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B...   115   2e-24
UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396...   112   2e-23
UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R...   112   3e-23
UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri...   110   8e-23
UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R...   106   1e-21
UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte...   103   9e-21
UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ...   102   2e-20
UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R...   102   3e-20
UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R...   101   4e-20
UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=...   100   9e-20
UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc...   100   2e-19
UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol...    98   5e-19
UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus...    97   1e-18
UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_...    96   2e-18
UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica...    96   2e-18
UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu...    95   5e-18
UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=...    94   1e-17
UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae...    93   1e-17
UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=...    92   3e-17
UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla...    92   4e-17
UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac...    91   9e-17
UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde...    90   1e-16
UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac...    90   1e-16
UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp...    86   2e-15
UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=...    84   8e-15
UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac...    83   1e-14
UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte...    83   2e-14
UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC   82   2e-14
UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C...    82   3e-14
UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon...    81   5e-14
UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione...    81   8e-14
UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi...    81   8e-14
UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto...    80   1e-13
UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B...    80   2e-13
UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane...    79   3e-13
UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp...    79   3e-13
UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla...    79   3e-13
UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX           78   5e-13
UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina...    77   7e-13
UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G...    77   1e-12
UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine...    76   2e-12
UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7...    75   4e-12
UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta...    74   1e-11
UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone...    74   1e-11
UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC   73   1e-11
UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica...    71   5e-11
UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob...    71   6e-11
UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta...    71   8e-11
UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith...    70   9e-11
UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=...    70   2e-10
UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T...    68   4e-10
UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco...    67   8e-10
UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte...    66   2e-09
UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx...    66   2e-09
UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob...    66   2e-09
UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane...    66   2e-09
UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax...    66   2e-09
UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane...    65   4e-09
UniRef50_Q7MY60 Similarities with the N-terminal region of H rep...    65   6e-09
UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp...    65   6e-09
UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ...    64   8e-09
UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop...    64   9e-09
UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon...    64   1e-08
UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari...    63   2e-08
UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ...    62   4e-08
UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione...    62   5e-08
UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ...    62   5e-08
UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ...    61   7e-08
UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral...    60   1e-07
UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD...    60   2e-07
UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap...    59   3e-07
UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald...    59   3e-07
UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex...    57   1e-06
UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe...    57   2e-06
UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-...    55   3e-06
UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr...    55   4e-06
UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ...    55   6e-06
UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re...    54   7e-06
UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob...    54   8e-06
UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo...    54   1e-05
UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig...    54   1e-05
UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus...    53   1e-05
UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R...    52   2e-05
UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe...    52   3e-05
UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ...    52   3e-05
UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria...    52   5e-05
UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia...    51   6e-05
UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat...    51   8e-05
UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone...    50   1e-04
UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane...    49   3e-04
UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo...    49   3e-04
UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia ...    49   3e-04
UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe...    49   4e-04
UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac...    47   0.001
UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto...    47   0.001
UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=...    46   0.002
UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S...    46   0.003
UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom...    45   0.003
UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus...    44   0.008
UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo...    43   0.021
UniRef50_C7YKI1 Putative uncharacterized protein n=1 Tax=Nectria...    42   0.036
UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=...    42   0.038
UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ...    42   0.039
UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ...    41   0.058
UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=...    41   0.061
UniRef50_UPI000023EBF2 hypothetical protein FG01150.1 n=1 Tax=Gi...    41   0.088

>UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI
          Length = 378

 Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/378 (97%), Positives = 371/378 (98%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 MELKKLM HISIIPDYRQ WK+EHKLSDILLLTICAVISGAEGWEDIEDFGETH DFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNP HDSYA+SEKSHGREEIRLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE EMTVRYYISSADLTAEKFATAIR Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 Query: 361 AAMDRNYLASVLTGSGLS 378 AAMDRNYLASVL GSGLS Sbjct: 361 AAMDRNYLASVLAGSGLS 378 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 374 bits (959), Expect = e-102, Method: Compositional matrix adjust. 
Identities = 180/372 (48%), Positives = 249/372 (66%), Gaps = 4/372 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ +SII D RQ K+ H L D+L L I AVISG EGWE+I+DFG D+L++Y F Sbjct: 6 LINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRKYLPFS 65 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 GIP DTI+R+ I P +F +CF WM+ C DVIAIDGKTLR S++K + Sbjct: 66 GGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKKDKSDT 125 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ IA+KI Sbjct: 126 IHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKIAKKIV 185 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 +GGDYL VKGNQ RL A + F ++ L P ++Y EK HGRE+ R+ +V D + Sbjct: 186 DKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMVADA-N 244 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHV 305 E+ D FEW GLK L AVSFR+ E+ + + V++YISSA L A+ A R HW V Sbjct: 245 EIGDLVFEWPGLKTLGYAVSFRT---EKDMQTTVAVKFYISSAKLDAKSLLEASRAHWTV 301 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++A Sbjct: 302 ENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQANRSD 361 Query: 366 NYLASVLTGSGL 377 +Y V++G L Sbjct: 362 SYRELVVSGLSL 373 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 322 bits (826), Expect = 1e-86, Method: Compositional matrix adjust. 
Identities = 172/375 (45%), Positives = 240/375 (64%), Gaps = 14/375 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G ++L++ G F+ Sbjct: 7 LVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFFK 66 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ A Sbjct: 67 KGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKSA 126 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH++SA++ + +V+GQ KTD+KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 127 IHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKIV 186 Query: 186 KQGGDYLFAVKGNQGRLNKA----FE--EKFPLKELNNPAHDSYAMSEKSHGREEIRLHI 239 + GDY+ AVK NQ +L++ FE +F K + HD + S K HGR E+R + Sbjct: 187 TKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVR---HDYFEESHKGHGRVELRRYW 243 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 + D+ L + W L+ + + S R I + E RY+I+S A+ FA A+ Sbjct: 244 ISDMLSTLGN-PERWASLQSIGMVESERYIDGKTTAE----TRYFITSIAPDAKIFANAV 298 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K Sbjct: 299 RKHWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRY 358 Query: 360 KAAMDRNYLASVLTG 374 KA + +Y VL G Sbjct: 359 KATLQPDYAQKVLNG 373 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 299 bits (765), Expect = 1e-79, Method: Compositional matrix adjust. 
Identities = 160/372 (43%), Positives = 230/372 (61%), Gaps = 7/372 (1%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M ++ H S I D+RQ+ K+ + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A I +GGDYL AVK NQG L KA + F + D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFS-PHRSAGLSDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 DFT W+ LK + + SFR++ + K + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDFT-HWEALKSIVMVESFRAV---KGKTASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E+ +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIES-MHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 285 bits (728), Expect = 2e-75, Method: Compositional matrix adjust. 
Identities = 155/361 (42%), Positives = 216/361 (59%), Gaps = 9/361 (2%) Query: 17 RQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIAR 76 R+ H DIL++ +CA+ISGA + +IE FG + ++ + + NGIP HDT Sbjct: 21 RETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQTFLALPNGIPSHDTFNN 80 Query: 77 VVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMH 136 V++ +SP +F CF+ W + IAID KTLR S DK + +H++SA++T Sbjct: 81 VLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKKNGKSPLHLVSAWATET 140 Query: 137 SLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVK 196 +LVIGQIKT+E SNEITAIPELLN LD+KG +++ DAMGCQ +IAEKI ++ DY+ A+K Sbjct: 141 ALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEIAEKIVEKDADYVLALK 200 Query: 197 GNQGRLNKAFEEKFPLKELNNPAH---DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFE 253 GNQ +L+++ E F L N D E S+GREEIR + +++I E Sbjct: 201 GNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRCAYATNEIEKIIA-NDE 259 Query: 254 WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 WK +K + + S R KKE E +RYYISSA L+AE +R HW +ENKLHW L Sbjct: 260 WKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLKVVRKHWEIENKLHWTL 314 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 DV ED+ +IR+ N AE + +R IA+N++ +K K G K A D YL +L Sbjct: 315 DVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATKRLMAGWDEKYLLKLLN 374 Query: 374 G 374 G Sbjct: 375 G 375 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 284 bits (727), Expect = 3e-75, Method: Compositional matrix adjust. 
Identities = 150/368 (40%), Positives = 220/368 (59%), Gaps = 20/368 (5%) Query: 23 EHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCIS 82 +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NGIP HDT RV S ++ Sbjct: 26 KHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNGIPSHDTFGRVFSLLN 85 Query: 83 PAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQ 142 P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ +ISA++T + LV+GQ Sbjct: 86 PEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQMISAWATTNGLVLGQ 145 Query: 143 IKTDEKSNEITAIPE---------------LLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 DEKSNEITAIP+ LL +L + G I+T DA+GCQK+I ++I +Q Sbjct: 146 SIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLDAIGCQKEIVKQITEQ 205 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA---HDSYAMSEKSHGREEIRLHIVCDVP 244 DY+ +K NQG L + E F ++N Y + ++ HGR+E+R + + Sbjct: 206 DADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEGHGRQEVRYYQMLSNV 265 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 E ID ++W L + R K LE RY+ISS + + FA+++R HW Sbjct: 266 AEEIDPDWQWLNLNSIGYVEYLRVENGTDKTSLER--RYFISSLNNNIKLFASSVREHWC 323 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K K G++ K +KA D Sbjct: 324 IENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKTLKVGVKAKRKKAGWD 383 Query: 365 RNYLASVL 372 NYL VL Sbjct: 384 ENYLLKVL 391 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 276 bits (705), Expect = 1e-72, Method: Compositional matrix adjust. 
Identities = 156/371 (42%), Positives = 221/371 (59%), Gaps = 10/371 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + + H S I D RQ K+ + L +ILLLT+CAV+SGA W I +G FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + Sbjct: 84 ADGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTV-TGVVAIDGKTSRRSLDKAGGKA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 143 AIHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKI 202 Query: 185 QKQGGDYLFAVKGNQGRLNK---AFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 + DY+ A+KGNQG L K F + + ++ + EKSHGR E R VC Sbjct: 203 ISKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVC 262 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 D L W GLK + V V + +I+ ++ + RYYISS AE A AIR+ Sbjct: 263 TDIDWL-KADHNWPGLKSI-VMVQYHAILQDKTRA---ETRYYISSMTSDAEHHAKAIRD 317 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K A Sbjct: 318 HWGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVK-GKHSLRSKRHIA 376 Query: 362 AMDRNYLASVL 372 + D ++LA ++ Sbjct: 377 SWDDDFLAEII 387 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 275 bits (704), Expect = 2e-72, Method: Compositional matrix adjust. 
Identities = 145/374 (38%), Positives = 218/374 (58%), Gaps = 7/374 (1%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ H S + D R A ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 6 FASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQWI 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 NG+P HDT V + + P + +CF+NW + + ++IAIDGKTLR + + Sbjct: 66 ALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGEQ 125 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 IH++SA+++ + LV+GQ DEKSNEITAIPELL +L+++G +++ DAMGCQ IAE Sbjct: 126 CSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIAE 185 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPAHDSYAMSEKSHGREEIRLHI 239 I + GDY+ A+KGNQG L + F + HDSY EK HGR E R + Sbjct: 186 TIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTYW 245 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 D L+ W LK + S R + + RYY+ S + A++FA A+ Sbjct: 246 TMGQTDYLLGAE-RWAQLKSIGCVESCRR---QPGHPGTLQRRYYLLSIESDAQRFADAV 301 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K Sbjct: 302 RSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKRL 361 Query: 360 KAAMDRNYLASVLT 373 KA D NYL +L+ Sbjct: 362 KAGWDDNYLVKILS 375 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 266 bits (681), Expect = 6e-70, Method: Compositional matrix adjust. 
Identities = 139/373 (37%), Positives = 217/373 (58%), Gaps = 12/373 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+G++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 31 LLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLELP 90 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + A Sbjct: 91 HGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQCA 150 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 ++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 151 LYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQIC 210 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELN---NPAHDSYAMSEKSHGREEIRLHIVCD 242 +Q DY+ +K N L ++ F + N HD Y K H R E R V Sbjct: 211 RQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRY--VWA 268 Query: 243 VPDELIDFTF---EWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 +P + + +W GL+ + V R + + + +++Y++S A+ AI Sbjct: 269 IPVAAMGELYQQQQWHGLQTIVVVERIRHLWNKTTHD----IQFYLTSLPPNAQFLCHAI 324 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM+ Sbjct: 325 RTHWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMK 384 Query: 360 KAAMDRNYLASVL 372 +AAM+ NY+ +VL Sbjct: 385 QAAMNNNYMMTVL 397 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 253 bits (646), Expect = 8e-66, Method: Compositional matrix adjust. 
Identities = 140/381 (36%), Positives = 225/381 (59%), Gaps = 18/381 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L+ H I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQGQSP 126 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 I +SA++ +SLV+GQI+ +K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 127 RVI--VSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLN---KAF--------EEKFPLKELNNPAHDSYAMSEKSHGR 232 I + +Y+ A+KGNQG+ + KA+ +++ P+ E N A +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPV-EKNAVALAYKETTEKDHGR 243 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTA 292 E R + L D +W GL+ + V S R + +Q +E RYY+SS ++ Sbjct: 244 LETRRYWQSGDVSWLADRQ-QWAGLRSVGVVESVRQV-GQQAPTVER--RYYLSSLNVDV 299 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA 352 EKFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K Sbjct: 300 EKFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKR 359 Query: 353 GLRRKMRKAAMDRNYLASVLT 373 G++ K A+ D +YL +L+ Sbjct: 360 GIKGKQLNASWDHDYLLRLLS 380 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 244 bits (622), Expect = 5e-63, Method: Compositional matrix adjust. 
Identities = 132/375 (35%), Positives = 221/375 (58%), Gaps = 14/375 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ + + + D R+ +H L D+L++ + AVI+GA+G I + E H ++LK + Sbjct: 13 ILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSRLELP 72 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINW---MRDCHSSDD--KDVIAIDGKTLRHSYDKS 120 +G+P HDTI R+++ + P F +CF W MR ++DD +++IAIDGKTLR S+D+ Sbjct: 73 SGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRSHDRG 132 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + G + + SA++ + +GQ+ +KSNEI PEL+ +D++ I+T DA GCQ+D+ Sbjct: 133 KGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGCQRDV 192 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH---DSYAMSEKSHGREEIRL 237 AEKI GDY+ A+K NQ RL++ + + N+ A + + K HGR + R Sbjct: 193 AEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRLDKRF 252 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 + +PDE + +W+GLK + VA+ I+++ RYYISS A++FA Sbjct: 253 YYQVKLPDE-VPAGEDWRGLKTIGVAIR----ISQENGRETCDTRYYISSLKPDAKQFAA 307 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K ++ + R+ Sbjct: 308 AVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKSKESVVMRR 367 Query: 358 MRKAAMDRNYLASVL 372 R A + N+LA +L Sbjct: 368 -RMAGWNVNFLAEIL 381 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 238 bits (607), Expect = 3e-61, Method: Compositional matrix adjust. 
Identities = 132/360 (36%), Positives = 205/360 (56%), Gaps = 10/360 (2%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ D+L+++ F+ Sbjct: 3 FITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAFK 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ A Sbjct: 63 EGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGD-RKTA 121 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 122 LHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAIN 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH---DSYAMSEKSHGREEIRLHIVCD 242 +GGDY+ VK NQG+L F + P +S ++ HGR E R ++ Sbjct: 182 AKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQLP 241 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 + L + W +K + R + + KE T YYISS ++ + A AIR+H Sbjct: 242 ITPWLTQ-SQGWTNIKPVIEVTRKRYL---KDKETSETA-YYISSLEVNLPQIAKAIRSH 296 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN HW LD+ EDD +IRRG+A E + R A+N L K ++ K+++AA Sbjct: 297 WSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMN-LARLSPIKDSMKGKLKQAA 355 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 229 bits (583), Expect = 2e-58, Method: Compositional matrix adjust. 
Identities = 139/379 (36%), Positives = 199/379 (52%), Gaps = 24/379 (6%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M K L+ ++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV--IAIDGKTLRHSYD 118 + GIP HDT R+ + + PA F W+ D DDK V +A+DGK LR + Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMG-DDKLVGQLAVDGKALRATA- 118 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R A+H+++ +ST + +GQ K +KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 119 KGRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQV 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL----KELNNPAHDSYAMSEKSHGREE 234 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 179 KIADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKE 238 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMT-----VRYYISSAD 289 R V V DE + +WK ++IIA Q + +E VR+YISS Sbjct: 239 HRRCWVLMV-DESMPVCQQWKA----------KTIIAVQAERIENGKGYDFVRFYISSRA 287 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 L A A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K Sbjct: 288 LDATSALKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKS 347 Query: 350 FKAGLRRKMRKAAMDRNYL 368 + K R ++ YL Sbjct: 348 RNLSMANKRRLCCLNEQYL 366 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 228 bits (580), Expect = 4e-58, Method: Compositional matrix adjust. 
Identities = 139/342 (40%), Positives = 192/342 (56%), Gaps = 13/342 (3%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISP 83 H ++L++ I AV+S + EDI +G D+L+Q+ NG+ +T R+ + P Sbjct: 28 HDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLVLLNGVASEETFLRIFRALDP 87 Query: 84 AKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 +F F W+ + + +DGKT+R S S AIH++SAF+T +V+GQ Sbjct: 88 KQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGGESAIHMVSAFATELGVVLGQE 144 Query: 144 KTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLN 203 K KSNEITAIPELL L I G ++T DAMGCQK+IA +I QGGDYL AVKGNQ L Sbjct: 145 KVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQITDQGGDYLLAVKGNQPTLL 204 Query: 204 KAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVA 263 A E +F + + + D + SHGR I I +P E I +W KK+ Sbjct: 205 DAIETEF-IDQYQSDDVDRHRQVHPSHGR--IVAQIASVLPAEGIVDLADWPECKKIARV 261 Query: 264 VSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCK 323 S R + + K + RYYISS +LTAE+ A A+R HW +EN+LHW LDV ED Sbjct: 262 DSLRKVGNHESK---LERRYYISSRELTAEQLAAAVRAHWGIENRLHWVLDVSFGEDAST 318 Query: 324 IRRGNAAELFSGIRHIAINIL---TNDKVFKAGLRRKMRKAA 362 IR+GNA + S ++ I +N++ T DK K LR K + AA Sbjct: 319 IRKGNAPQNLSLLKKIVLNLIRLDTADKT-KTSLRLKRKCAA 359 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 223 bits (568), Expect = 8e-57, Method: Compositional matrix adjust. 
Identities = 135/378 (35%), Positives = 207/378 (54%), Gaps = 14/378 (3%) Query: 1 MELKKLMGHISI---IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 M++ KL + + + D+R A + H+LS++L + +CAV+SGA+ +E+I +G + Sbjct: 1 MDIGKLADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPW 60 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHS 116 L+ + + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + Sbjct: 61 LRGFLRLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRT 120 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K+ +H++SAF+ +V+GQ T EKSNEITAIPELL +LDI+G I+T DAMG Sbjct: 121 TSKAAA-APLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGT 179 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRL--NKAFEEKFPLKELNNPAHDSYAMSEKSHGREE 234 Q IA I+++G Y+ VK N +L + F + P L + ++ + HGR E Sbjct: 180 QTKIARAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSS--THETTSTGHGRIE 237 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK 294 +R D D L WK + V R++ E YYISS AE+ Sbjct: 238 VRRCTAFDATDRLHKAE-AWKDVASFAVVERVRTVGERTSTERV----YYISSLPADAER 292 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A AIR+HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K + Sbjct: 293 IAVAIRSHWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSI 352 Query: 355 RRKMRKAAMDRNYLASVL 372 + K AA + A++L Sbjct: 353 KTKRLLAATSDEFRAALL 370 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 222 bits (566), Expect = 2e-56, Method: Compositional matrix adjust. 
Identities = 143/373 (38%), Positives = 205/373 (54%), Gaps = 18/373 (4%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L++ G I+ D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEIT---DPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFL 70 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ Sbjct: 71 ALPNGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 A+ ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA Sbjct: 131 VKALQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAA 190 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYA---MSEKSHGREEIRLHI 239 +I KQ DY+ A+KGNQ L K ++ F + + A Y E +H R E R Sbjct: 191 QIHKQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRR-- 248 Query: 240 VCDVPDELIDFT----FEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 V VP E + FT +W GL+ L V S R + + E RY++SS A F Sbjct: 249 VFQVPVEQV-FTPKQGRDWAGLRSLVVIQSQRCLWNKDTTE----TRYFLSSLSTDAATF 303 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A IR HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L Sbjct: 304 AHYIRAHWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSS-KGSLV 362 Query: 356 RKMRKAAMDRNYL 368 K +A +D ++ Sbjct: 363 MKRYRAGLDDQFM 375 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 221 bits (562), Expect = 4e-56, Method: Compositional matrix adjust. 
Identities = 140/348 (40%), Positives = 200/348 (57%), Gaps = 15/348 (4%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISP 83 H +IL++ I AV+S + EDI + T +L+++ +NGIP +T R++ + P Sbjct: 19 HDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLKNGIPSEETFLRILRALDP 78 Query: 84 AKFHECFINWMRDCHS--SDDKDV---IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 +F F W+ SDD + IAIDGKT+R S S AIH++SAF+T L Sbjct: 79 KQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GSGGESAIHMVSAFATELGL 136 Query: 139 VIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 V+GQ K KSNEITAIPELL L IKG ++T DAMGCQK IA++I + GDYL VKGN Sbjct: 137 VLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSIAKQIVAKKGDYLLMVKGN 196 Query: 199 QGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q +L +A E F + + + D + E+ HGR ++ V ++D +W Sbjct: 197 QPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASVLSAKG-IVD-PADWPKCV 253 Query: 259 KLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMN 318 + S R ++ +++ +LE RYYISS L+AE+ A A+R HW VEN+LHW LDV + Sbjct: 254 TIGRIDSMR-VVGDKQSDLER--RYYISSRALSAEQLAAAVRAHWGVENRLHWILDVSFS 310 Query: 319 EDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKAAMD 364 ED + + NA + S +R IA+ I+ DK K+ LR K + AA D Sbjct: 311 EDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKRKGAAWD 358 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 220 bits (561), Expect = 5e-56, Method: Compositional matrix adjust. 
Identities = 122/363 (33%), Positives = 189/363 (52%), Gaps = 9/363 (2%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L H+S++ D R H L D+L L + AV SG +GW +I+ FGE ++L+++ F Sbjct: 3 LFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPFA 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP TIAR++ + P C +W+ D ++ K +IAIDGKTLR + Sbjct: 63 NGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLG--CNT 120 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 121 LHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAIV 180 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 + GDY+ VK NQ L +A + ++ + ++ +A SEK HGR E R I +P Sbjct: 181 ARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIPS 238 Query: 246 EL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 +L +W +K L R I K +E + +Y+SS D+ E ATA+R HW Sbjct: 239 KLSPKLQEKWPSVKTLIAVERHRKI--GNKTSIETS--FYLSSHDIDPEYIATAVRGHWR 294 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 295 IENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLLS 354 Query: 365 RNY 367 Y Sbjct: 355 DEY 357 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 217 bits (553), Expect = 4e-55, Method: Compositional matrix adjust. 
Identities = 125/327 (38%), Positives = 185/327 (56%), Gaps = 7/327 (2%) Query: 18 QAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARV 77 +A+ +H DI+ L + AVISGA W +I+ FGE H D+L++Y FE GIPV DTIARV Sbjct: 14 RAYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPFECGIPVDDTIARV 73 Query: 78 VSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHS 137 + I P F+E F+N++ + + ++VIAIDGKTLRHS++ + A+H ++ +S Sbjct: 74 IKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFN-PETQSALHSVTVWSQSRG 132 Query: 138 LVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 L++ Q K+ K NE A+ E+++ +K +IT DAM QK IAEKI ++ GDY+ +K Sbjct: 133 LILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKIAEKIIEKKGDYVMPLKK 192 Query: 198 NQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKG 256 N + E F + P ++Y R + R + V D L EWKG Sbjct: 193 NHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYRKLKVSDWLSKAE-EWKG 251 Query: 257 LKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 +K + RS + KE + V +YISS D+ + A +R HW VENK HW LDVV Sbjct: 252 IKSVLEVCRKRS---DNGKESQEKV-FYISSLDVDIQILAKCVRGHWEVENKAHWVLDVV 307 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINI 343 ED+C + AE + +R +A+N+ Sbjct: 308 YKEDECAVTDEWGAENLAILRRLALNL 334 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 217 bits (553), Expect = 5e-55, Method: Compositional matrix adjust. 
Identities = 129/370 (34%), Positives = 198/370 (53%), Gaps = 16/370 (4%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 I D+R H L+DIL++ CA++ G + +E FG +L+ + NGIP HD Sbjct: 23 IDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLALPNGIPSHD 82 Query: 73 TIARVVSCISPAKFHECFINW----MRDCHS----SDDKDVIAIDGKTLRHSYDKSRRRG 124 T +V S + P +F E F W +R S S K VIAIDGK LR + DK + Sbjct: 83 TFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRGAVDKGQAPA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 I + A+++ SL +GQ+K +KSNEI A+PELL ML +KG I+T DAMGCQ+++A KI Sbjct: 143 VI--VGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMGCQREVARKI 200 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEK-SHGREEIRLHIVCDV 243 +Q GDY+ A+K NQ L++ E A ++ E HGR E+R V + Sbjct: 201 IQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHEVRRCWVSEE 260 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 + + +W GL+ + R++ + + RY+ISS A A ++R HW Sbjct: 261 VECWLQGAEKWAGLRSVAAVECERTVAGQTT----VQRRYFISSLKADAALIAASVRAHW 316 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLRRKMRKAA 362 +EN LHW LDV ED+ + RRG +AE + +R + ++ + K + ++ +A Sbjct: 317 GIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKSVNQRRFEAG 376 Query: 363 MDRNYLASVL 372 + +YL ++L Sbjct: 377 LSTDYLQTLL 386 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 214 bits (545), Expect = 4e-54, Method: Compositional matrix adjust. 
Identities = 145/383 (37%), Positives = 203/383 (53%), Gaps = 33/383 (8%) Query: 12 IIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVH 71 I+ D R +H+ S I+L+ I AVI GA+ W IEDFG++ F NGIP H Sbjct: 22 ILIDNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSH 81 Query: 72 DTIARVVSCISPAKFHECFINWMRD---CHSSDDKDVIAIDGKTLRHSYD-----KSRRR 123 DT R S + P KF E + W++ C+S IAIDGKT+R +Y+ + R++ Sbjct: 82 DTFNRFFSALDPLKFEESYRQWVQSILKCYSGH----IAIDGKTIRGAYESEQDKRHRKQ 137 Query: 124 GAI----------HVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 G + HVISAF+T + +GQ+ T EK NEI IPELL+ML IK IIT DA Sbjct: 138 GVLPDSNTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDA 197 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNK---AFEEKFPLKELNNPAHDSYAMSEKSH 230 +GCQ+ IAEK+ K GDY+F VK NQ +L + + E K D Y E+ H Sbjct: 198 LGCQRTIAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKG-TTVRFDKYETHEEGH 256 Query: 231 GREEIRLHIVCDVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD 289 GR E R+ C+ P L D +WK ++ + R+ K + R +ISS + Sbjct: 257 GRNESRICYCCNDPGFLGADIRKKWKNIQSFGYIENTRNT----NKGTTVEKRCFISSLE 312 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 A+K R HW +EN LHW+LDV +ED+ + RR +A FS + IA+ L N+K Sbjct: 313 PDAQKILKNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLRNNKR 371 Query: 350 FKAGLRRKMRKAAMDRNYLASVL 372 + + RK A D +L ++ Sbjct: 372 -EIPINRKRLIAGWDNEFLWELI 393 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust. 
Identities = 135/379 (35%), Positives = 202/379 (53%), Gaps = 19/379 (5%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L+ L+ H S I D R ++ H L +ILLL +C ++ + +E+I +G H FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPGR-ADFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD----IKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q +K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 IA I+ QG DYL AVK NQ L E F + + + HD +K HGR E R H Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHDL----DKGHGRVEER-H 245 Query: 239 IVCDVPDELIDFTFEWKGLKKL--CVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 + + + T + G +L A+ A RY+ISSA LTAE A Sbjct: 246 VSVIREVDWLSGTRRFPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHAA 305 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL--TND-KVFKAG 353 A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ ND K K Sbjct: 306 DAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQKSLKT- 364 Query: 354 LRRKMRKAAMDRNYLASVL 372 RRKM A +YLAS+L Sbjct: 365 -RRKM--AGWSDDYLASLL 380 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 212 bits (539), Expect = 2e-53, Method: Compositional matrix adjust. 
Identities = 129/353 (36%), Positives = 191/353 (54%), Gaps = 9/353 (2%) Query: 22 MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCI 81 + + L+++LL T+ +I A +++IE G D+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKEL--NNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + F +L HD + HGR E R V D L + W GL Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTCI---GHGRIEERTCQVADASAWLTEQHSGWAGLAS 237 Query: 260 LCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + ++ R+ ++ E+ R YISS + A R+HW VEN LHW+LDV E Sbjct: 238 IAAVIATRT--DKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFRE 295 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 D+C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 296 DECRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 212 bits (539), Expect = 2e-53, Method: Compositional matrix adjust. 
Identities = 133/364 (36%), Positives = 201/364 (55%), Gaps = 25/364 (6%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 I D+R + ++L DILL++ AVI + + ++ F + +L+ + DF +G P HD Sbjct: 12 IEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCDFRHGPPSHD 71 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDV------IAIDGKTLRHSYDKSRRRGAI 126 T +V+S + P E F WM + + K V +AIDGKT+ S S + A Sbjct: 72 TFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS--GSAEQNAS 129 Query: 127 HVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK 186 HV++AF++ LV+GQIKTDEKSNEITAIPELL + +K ++T DAMG QK+IA KI + Sbjct: 130 HVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQKNIAAKIIE 189 Query: 187 QGGDYLFAVKGNQGR--------LNKAFEEKFPLKELNNPAHDSYAMS-EKSHGREEIRL 237 +GGDY+ AVKGNQ + L+ +++ +EL A YA++ EK HGR E R Sbjct: 190 KGGDYVLAVKGNQKKLRDDIIWHLHSELQDR-STRELK--AKGQYAVTLEKDHGRIEKR- 245 Query: 238 HIVCDVPDELIDFTF--EWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 C + ++L F +W+G+ + + R + + K + S + A+ Sbjct: 246 --ECYLSNDLSWFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEAQAKDL 303 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L K G+R Sbjct: 304 LRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTCKCGMR 363 Query: 356 RKMR 359 K + Sbjct: 364 SKRK 367 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 211 bits (536), Expect = 4e-53, Method: Compositional matrix adjust. 
Identities = 124/365 (33%), Positives = 192/365 (52%), Gaps = 11/365 (3%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + G+P Sbjct: 23 IKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVNMRCGVPSTL 82 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T ARV S I P +F C WM D+I +DGK+L S + + + A H+++A+ Sbjct: 83 TFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQKATHIVNAY 142 Query: 133 STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 + +G+++ +KSNEI AIP LLN L+++G II+ DAMG QK IA I+ + DY+ Sbjct: 143 LPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANLIRLKQADYV 202 Query: 193 FAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEK---SHGREEIRLHIVCDVPDELI- 248 A+K N R + E F + + Y E HGR E R + C +P Sbjct: 203 LALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSY--CVLPMMYFH 260 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTA-EKFATAIRNHWHVEN 307 + W+ L+ + S R + E+E RYYI+S + + AIR HW +EN Sbjct: 261 KYKKYWRDLQAIVRVQSKR----HKGNEIETATRYYITSLPFAEHRRMSQAIRQHWAIEN 316 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNY 367 +LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K +AA+ Y Sbjct: 317 QLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRIQAALSTRY 376 Query: 368 LASVL 372 L V+ Sbjct: 377 LRKVV 381 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 211 bits (536), Expect = 4e-53, Method: Compositional matrix adjust. 
Identities = 140/378 (37%), Positives = 202/378 (53%), Gaps = 19/378 (5%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK-QYGDFENGIPVH 71 I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K + D E IP H Sbjct: 12 IEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPDLE-FIPSH 70 Query: 72 DTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH------SYDKSRRRGA 125 DT R S I P F F NW++ + K V+AIDGK +R + + Sbjct: 71 DTFNRFFSIIKPEYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTTGKEGFK 129 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+S ++ + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQKDI + I Sbjct: 130 LWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDITQTII 189 Query: 186 KQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCD 242 ++ +Y+ A+K N+ + L K + + ++ + HGR E R V Sbjct: 190 ERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEKRTCTVVS 249 Query: 243 VPDELIDFTFEWK--GLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT-AEKFATAI 299 +++ F+ K GLK + S R+I+A E VRYY++S D T E+ A+AI Sbjct: 250 Y-GSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPEEIASAI 306 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K + K Sbjct: 307 RQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGSMNLKRL 365 Query: 360 KAAMDRNYLASVLTGSGL 377 KA D YL+ +L + Sbjct: 366 KAGWDEKYLSQLLQNNNF 383 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 210 bits (535), Expect = 6e-53, Method: Compositional matrix adjust. 
Identities = 126/372 (33%), Positives = 191/372 (51%), Gaps = 19/372 (5%) Query: 13 IPDYR-QAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVH 71 +PD R + H L+DIL + CAVI+GAEGWEDI ++G + F +++ + +NG+P H Sbjct: 12 LPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLELKNGVPSH 71 Query: 72 DTIARVVSCISPAKFHECFINWMRD-CHSS-------DDKDVIAIDGKTLRHSYDKSRRR 123 DT RV + + P F + F W + C ++ D +A+DGK+ R S K Sbjct: 72 DTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRSA-KPTFS 130 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 G +H++ + +L++GQ E +EIT ++L LD+ G ++T DA GCQ + E Sbjct: 131 GCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGCQTETLEV 190 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFP-LKELNNPAHDSYAMSEKSHGREEIRLHIVCD 242 I+ +GG+Y+ VKGNQ L A F E D + +HGR E R V Sbjct: 191 IRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEERNVTVVH 250 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 PD L W G+ + + R + + K E T YY+SS + A + A IR H Sbjct: 251 DPDGL---PAGWAGVGSVALVCRDRQV---KGKANESTAHYYLSSLRVGAAELAGYIRGH 304 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 WH+E+ +HW LDV ED+ + R G+A IR +A+++L K + + +A Sbjct: 305 WHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAGK-KGSIHTRRLRAG 362 Query: 363 MDRNYLASVLTG 374 D Y+A VL G Sbjct: 363 WDDQYMAQVLQG 374 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 209 bits (532), Expect = 1e-52, Method: Compositional matrix adjust. 
Identities = 118/373 (31%), Positives = 201/373 (53%), Gaps = 14/373 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H++++ + R +H L D++ L I A++SGAEGW DIE +G++ D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMS--EKSHGREEIR--LHIVC 241 ++ + VK NQ +L +A + +F + L + + + E HGR+E R + Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQF--QSLFDAQKEKIVVEHKESGHGRQEERYVFQLKA 245 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 +P EL T +W ++ + RS A K ++ + YY+SS + IR Sbjct: 246 KLPPEL---TEKWPTIRSIIAVERHRS--ANGKGTVDTS--YYVSSLSPKHKLLGHYIRQ 298 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN H+ LDVV NED +I +A E + R +NI+ R K+++A Sbjct: 299 HWRIENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRA 358 Query: 362 AMDRNYLASVLTG 374 + +Y A + G Sbjct: 359 GWNDDYRAQLFFG 371 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. 
Identities = 115/341 (33%), Positives = 176/341 (51%), Gaps = 12/341 (3%) Query: 38 ISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F CF ++ Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 LL L + G I+T DAMGCQ IAE+I+ +G D L +K N G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 -----PAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 P D++ + HGR +R + D + W L ++ + R I Sbjct: 184 GAAGRPVFDAF----EGHGR-LVRRRVFVDAAATALAPLSGWPDLSRVLAVETLRGI--P 236 Query: 273 QKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 + +RY+++S IR HW VEN LHW L+V EDD ++R AA Sbjct: 237 GTGTVVADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARN 296 Query: 333 FSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 F+ +R IA+N++ D+ +A LR + +KAA D +Y+ ++ Sbjct: 297 FALVRKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIA 337 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 208 bits (529), Expect = 3e-52, Method: Compositional matrix adjust. 
Identities = 132/363 (36%), Positives = 200/363 (55%), Gaps = 29/363 (7%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + ++ + + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---------DKDVIAIDGKTL 113 ENGIP HDT+ RV + + P E W SD K ++AIDGKT+ Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 R + S ++ A+H+++A++T + GQ+ T+EKSNEITAIPELL+M+ +KG +++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGRE 233 MG QK IA+KI K+ DY AVK NQ L E+ P E++ A D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTL---LEDIVPFFEMSQEADDHYHTVEKAHGQI 240 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 E R + V L E+ ++ + A R + + +E E + RY+I S ++A+ Sbjct: 241 ETRAYEVIHDVSWLRKTHPEFGHIQSIGRA---RIHLDKNGQESEES-RYFILSCQVSAK 296 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + +R HW +E+ +HW LDVV ED K + +A N+ DK A Sbjct: 297 ELCDYVRGHWQIES-MHWLLDVVFREDANKTLN----------KQLAFNLNVMDKFCLAV 345 Query: 354 LRR 356 L++ Sbjct: 346 LKQ 348 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 206 bits (525), Expect = 8e-52, Method: Compositional matrix adjust. 
Identities = 125/373 (33%), Positives = 185/373 (49%), Gaps = 11/373 (2%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + I D RQA K+ H++ ++L++ C+ + E + D+ DF ++ +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDLEGRH----IAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI EKSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEKSHGREEIRLHI 239 +I G DY+ A+K N R ++ + F E L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ VA R + V Y++ S E+ A + Sbjct: 237 ITEELD-WYHKSWKWAGLQS--VAQVRRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHPA-KVSLRRKRK 352 Query: 360 KAAMDRNYLASVL 372 A MD + +L Sbjct: 353 LATMDPAFRLQML 365 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 202 bits (515), Expect = 1e-50, Method: Compositional matrix adjust. 
Identities = 124/363 (34%), Positives = 197/363 (54%), Gaps = 16/363 (4%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+ + I D RQ K+ H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDS---YAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 + + + + WKGLK + + R + ++ K L + RY+ISS E + Sbjct: 239 EYYQTE-KIKWLSQKKAWKGLKSIIME---RKTLEKEGKRL-IEYRYFISSLKEEIETVS 293 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV--FKAGL 354 A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V K + Sbjct: 294 RAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 355 RRK 357 R+K Sbjct: 353 RKK 355 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 201 bits (510), Expect = 4e-50, Method: Compositional matrix adjust. 
Identities = 117/373 (31%), Positives = 189/373 (50%), Gaps = 11/373 (2%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ H+ I D R EH + DI L + AVISGA+ W +FG ++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLK-GAKASASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC--DV 243 K+GGD + VKGNQ +L +A + +F NNP + + + K HGR E R+ C ++ Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 P E+ +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 PAEI---KMKWSQLKTLIAVERHRKV----GNKTSIDTHFYVSSAVLTSEAFGRAIRAHW 292 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 293 QTENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHPA-KTSQTQKFNRACW 351 Query: 364 DRNYLASVLTGSG 376 ++ ++ G+G Sbjct: 352 SDDFREEIIFGTG 364 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 199 bits (506), Expect = 1e-49, Method: Compositional matrix adjust. 
Identities = 125/375 (33%), Positives = 193/375 (51%), Gaps = 13/375 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY--GD 63 L+ S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVID-GVVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAHDSYAMSEKSHGREEIRLHI- 239 I +GGDY+ VK NQ L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I ++ + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLT 373 A + +Y S++ Sbjct: 357 LHADRNESYRESLIA 371 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 198 bits (503), Expect = 3e-49, Method: Compositional matrix adjust. 
Identities = 133/361 (36%), Positives = 183/361 (50%), Gaps = 15/361 (4%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +IIPD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K IP Sbjct: 8 AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKARLPGLVSIPS 67 Query: 71 HDTIARVVSCISPAKFHECFINWMRD-CHSSDDKDVIAIDGKTLRHSYDKSRR-----RG 124 HDT++R S + F ECF W+ D C V+AIDGK + + DKS R Sbjct: 68 HDTLSRFFSILDIDWFEECFRLWVDDICRRI--PGVVAIDGKAICDNPDKSSNSKNGVRS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++++SA+S + + +GQ K +EKSNE AIPEL+ LD++ IIT DA+GCQK I + I Sbjct: 126 KLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIGCQKSITKLI 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHIVCDV 243 + DY+ K N L E + H Y K HGR E R VC Sbjct: 186 IENKADYILCAKDNHEALRNIIEFNLSEESRYYLCHAKRYFEENKGHGRSEYR-ECVCIS 244 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 L F W G+K L + S R + KE M RYYISS + +IR HW Sbjct: 245 AKNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDPIIILKSIRPHW 301 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 VEN LHW LD+ EDD + + GNAA FS I +A+ +L + K G+ K + Sbjct: 302 EVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQSDI-KLGMAGKRKACGW 359 Query: 364 D 364 D Sbjct: 360 D 360 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 194 bits (494), Expect = 4e-48, Method: Compositional matrix adjust. 
Identities = 103/196 (52%), Positives = 133/196 (67%), Gaps = 13/196 (6%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L L H + + D RQA K+ +KL D+L L + AVISGAEGWE+IEDFG +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TDEKSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVK 196 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVK 183 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 194 bits (493), Expect = 4e-48, Method: Compositional matrix adjust. Identities = 118/372 (31%), Positives = 185/372 (49%), Gaps = 26/372 (6%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R H L D+L + + A I GAE D F +++ + G+P HD Sbjct: 12 LPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLPSHD 71 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T +RV + P F CF ++ D D V+AIDGKTLR S+D++ R A+HV+SAF Sbjct: 72 TFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVVSAF 130 Query: 133 STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 ++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GGD+L Sbjct: 131 ASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGGDWL 190 Query: 193 FAVKGNQGRLNKAFEEKF--PLKELNNPAHDSYAMSEKSHGREEIRLHIVC-DV------ 243 F +K N+ L E F P L P + ++ HGR E+R H V DV Sbjct: 191 FPLKDNRPALRAEVERYFADPATVLAVP----HVTTDADHGRIEVRRHWVSHDVAWLASD 246 Query: 244 ---PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 PDE + GLK L + + T Y+SSA L + A A+R Sbjct: 247 RRFPDEAV-----LPGLKILGLV---ERTVTSPDGRTTATRTLYLSSAALEPKTLARAVR 298 Query: 301 
NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E +HW LD +ED + R+ + E + +R +A+N++ + + +R + ++ Sbjct: 299 AHWSIEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANN-QDSIRLRRKR 357 Query: 361 AAMDRNYLASVL 372 A +Y ++L Sbjct: 358 AGWSDDYARTIL 369 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 189 bits (479), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 112/297 (37%), Positives = 166/297 (55%), Gaps = 9/297 (3%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R + H LS++L + +CAV+ GA + D+ +G+++ +L+++ + G+P HD Sbjct: 16 VPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKFLKLKAGVPSHD 75 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKSRRRGAIHVISA 131 T RV++ I PA F F+ W+ + D V+AIDGKT R S K G +H++SA Sbjct: 76 TFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKDTS-GPLHMVSA 134 Query: 132 FSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDY 191 F+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Q IA I+ +G DY Sbjct: 135 FAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAIARTIRSRGADY 194 Query: 192 LFAVKGNQGRLNKAFEEKFP-LKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDF 250 + VK N L + + E PA + K HGR E+R D +L Sbjct: 195 VLCVKDNPPTLTDSILLTLAGVAEKIAPA-SHFEEQTKGHGRVEVRRCWAYDAVSQLYK- 252 Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVEN 307 + +W GL+ + R++ + K +E YYISS A + A A+R+HW VE+ Sbjct: 253 SEQWAGLQSFALVERERTV--DGKTSVER--HYYISSLPADAARIAQAVRSHWAVES 305 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 187 bits (475), Expect = 5e-46, Method: Compositional matrix adjust. 
Identities = 112/282 (39%), Positives = 161/282 (57%), Gaps = 17/282 (6%)

Query: 12  IIPDYRQAWKME-HKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70
           +IPD R+A + H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NGIP
Sbjct: 20  LIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANGIPS 79

Query: 71  HDTIARVVSCISPAKFHECFINWMRDCH-SSDDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129
           HDT RV S I P F F +W D D +A+DGKT+R S+ S R A+H++
Sbjct: 80  HDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-ALHLL 138

Query: 130 SAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189
           A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK +A +I + GG
Sbjct: 139 HAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITEAGG 198

Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELID 249
           DY+ A+KGNQ L+ + +P + A+ EK HGR E R V D D L
Sbjct: 199 DYVLALKGNQSALHDDVRLFMETQADRHPQGQAEAV-EKDHGRIETRRIWVNDEIDWLTQ 257

Query: 250 FTFEWKGLKKLCVAVSFRSIIAEQKKELEMTV----RYYISS 287
           +W GLK L ++ E ++EL V R +I+S
Sbjct: 258 KP-DWPGLKTL--------VMVESRRELNGQVSCERRCFITS 290

>UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO
Length = 223

Score = 182 bits (461), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 89/209 (42%), Positives = 136/209 (65%), Gaps = 4/209 (1%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +LK + G I PD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + Sbjct: 6 KLKTIFGQI---PDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSF 62 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 D NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + Sbjct: 63 LDLPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGG 121 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 ++ +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA Sbjct: 122 KKSPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIA 181 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 + I K+ DY+ AVK NQ +L + E++F Sbjct: 182 KAIVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 181 bits (458), Expect = 5e-44, Method: Compositional matrix adjust. Identities = 125/370 (33%), Positives = 189/370 (51%), Gaps = 16/370 (4%) Query: 15 DYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 D RQ+WK+ + LS IL L ++G E +++EDF E + Y D G P HDT+ Sbjct: 19 DSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPSHDTL 78 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRHSYDKSRRRGAIHVISAFS 133 RV+S ++ + E + + + S D +I++DGKT+R + K+++ +H+++A+ Sbjct: 79 ERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRGNRGKNQK--PVHIVTAYD 136 Query: 134 TMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF 193 H L +GQ+ +EKSNEI AIP+LL +DI+ I+T DAMG Q I + I K DY Sbjct: 137 GGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCL 196 Query: 194 AVKGNQGRLNKAFEEKFP----LKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELID 249 AVKGNQ L F L+EL A Y EKS G+ E+R + V L Sbjct: 197 AVKGNQETLYDDIALYFSDVNLLEELQENAQ-YYQTVEKSRGQIEVREYWVSSDIKWLCQ 255 Query: 250 FTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKL 309 +W L+ + + R+ I ++ +L RY+I S FA +R HW +E+ + Sbjct: 256 
NHPKWHKLRGIGMT---RNTI-DKDGQLSQENRYFIFSFKPDVLTFANCVRGHWQIES-M 310 Query: 310 HWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL--RRKMRKAAMD-RN 366 HW LDVV +ED + AA + IR + + L K L RRK R ++ + Sbjct: 311 HWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKKDLSYRRKQRYISVHLED 370 Query: 367 YLASVLTGSG 376 YL + G Sbjct: 371 YLVQLFGERG 380 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 181 bits (458), Expect = 6e-44, Method: Compositional matrix adjust. Identities = 110/371 (29%), Positives = 192/371 (51%), Gaps = 11/371 (2%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG D+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + ++K +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIR--LHIVCDV 243 + D++ +KGNQ L A + F ++PA + HGR+E R + I ++ Sbjct: 182 SKKSDFVIQIKGNQPALLAAVKAAF-AACYDSPALAISEQTNTGHGRKECRRVMQIEGNL 240 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 P EL + +W ++ L S R++ + + R+Y+SS + + A IR HW Sbjct: 241 PPELSE---KWPHIRTLVEVASERTV----GNKTACSSRWYVSSLPVDTAQLADIIRAHW 293 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN+LHW LDVV ED+ + + A+ + A++++ + K L K + AA Sbjct: 294 AIENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQSAAW 353 Query: 364 DRNYLASVLTG 374 D + + +L G Sbjct: 354 DPAFRSELLFG 364 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 174 bits (442), Expect = 4e-42, Method: Compositional matrix adjust. 
Identities = 83/99 (83%), Positives = 89/99 (89%)

Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338
           MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH
Sbjct: 1   MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60

Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSGL 377
           IAINILTN++VFKA RRKMRKA MD+NYLASVL G+G
Sbjct: 61  IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99

>UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB
Length = 369

Score = 172 bits (435), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 174/360 (48%), Gaps = 17/360 (4%)

Query: 13  IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72
           +PD R A H L++IL + + A + GA D+ F + +NG+P HD
Sbjct: 15  LPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDVLVLKNGLPSHD 73

Query: 73  TIARVVSCISPAKFHECFINWMR----DCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHV 128
           T +RV + P F + F +M+ K VIA+DGK LR Y+ R +
Sbjct: 74  TFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGYESGRSHMPPVM 133

Query: 129 ISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188
           ++A++ + + ++ +NE +L+ +L +KG ++T DA+ C + +AE I+ +G
Sbjct: 134 VTAWAAQTRMALANVQA-PNNNEAAGALQLIELLQLKGCVVTADALHCHRGMAEAIKARG 192

Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELI 248
           GDY+ AVK NQ L + + K ++ S + HGR+E R +V VP
Sbjct: 193 GDYVLAVKDNQPALMR--DAKAAIRAATRQGKPSTITVDAGHGRKEKRRAVVAAVPQMAQ 250

Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308
           D F GLK + S R K +E RY++ S + +R HW +EN
Sbjct: 251 DHDFA--GLKAVARITSKRGT----DKTVE---RYFLMSQAYPPKDVLRIVRTHWTIENS 301

Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368
           LHW LDVV++ED + R+ NA + +R +A+N+ LR K+++A + +L
Sbjct: 302 LHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGKLKRAGWNDTFL 361

>UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5
Length = 255

Score = 171 bits (433), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 90/233 (38%), Positives = 139/233 (59%), Gaps = 3/233 (1%)

Query: 6   LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65
           L+ H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E
Sbjct: 9   LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68

Query: 66  NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125
           NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A
Sbjct: 69  NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128

Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185
           IH+++A+++ LV+ Q K E++ EITAIP L+ +L++ G ++T DAMG Q DIAE +
Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188

Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAHDSYAMSEKSHGREEI 235
           +G DY A+KGNQ L + +E F E H + EK R E+
Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEV 241

>UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT
Length = 247

Score = 170 bits (430), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 96/256 (37%), Positives = 150/256 (58%), Gaps = 15/256 (5%)

Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184
           ++H+++A+ + +L++GQ+K D+KSNEITAIP+LL ML ++G I+T DAMGCQK IA++I
Sbjct: 2   SLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQI 61

Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP-AHDSYAMSEKSHGREEIRLHIVCDV 243
           + DY+ AVK NQ L + + F ++N H + + HGR E R + V
Sbjct: 62  GSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYSTI-V 120

Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTV----RYYISSADLTAEKFATAI 299
           D+L+ W L + + E K+E+ T+ RY+I S + A++F A+
Sbjct: 121 GDDLLAGITGWDNLNAIG--------MVESKREVGNTISNEKRYFIMSINGHAQRFGDAV 172

Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359
           R HW +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK +
Sbjct: 173 REHWGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRK 231

Query: 360 KAAMDRNYLASVLTGS 375
           A D ++L VLTG+
Sbjct: 232 MAGWDNSFLIKVLTGN 247

>UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB
Length = 376

Score = 167 bits (422), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 178/363 (49%), Gaps = 15/363 (4%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R A + H L ++L++ +V+ G+ ++ FG F + + ++ IP HD Sbjct: 22 VPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRNFLKLKHAIPSHD 80 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSS-DDKDVIAIDGKTLRHSYDKSRRRGAIHVISA 131 T + V I P F + D D D+IAIDGK LR + D ++SA Sbjct: 81 TFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDPGESARTRMMVSA 140 Query: 132 FSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDY 191 +++ L + + D + E++A E L ++D++GK++T DA+ C + I GGD+ Sbjct: 141 YASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRRTVAAINAGGGDW 199 Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKS-HGREEIRLHIVCDVPDELIDF 250 A+KGNQ L F ++P A++E + HGR+E R +V V + + Sbjct: 200 CLALKGNQESLLSDARGCFSKGHKSDPT----AVTENTGHGRKETRKAVV--VSAKALAE 253 Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLH 310 E+ GLK + R E ++ RY+ S T E A+R+HW +EN LH Sbjct: 254 YHEFPGLKGFGRIEATR----ETGGKVTSETRYFALSWVPTPEVLLAAVRDHWAIENALH 309 Query: 311 WRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLAS 370 W+LDV ED + R+ N + +R A+++L D K L K+++A D +L S Sbjct: 310 WQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIKRAGWDTTFLRS 368 Query: 371 VLT 373 +L+ Sbjct: 369 ILS 371 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust. 
Identities = 129/400 (32%), Positives = 191/400 (47%), Gaps = 43/400 (10%) Query: 15 DYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 D R+ K+ + I+L+T+ V + W DI DF DFL+++ P HDT+ Sbjct: 29 DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTL 88 Query: 75 ARVVSCISPAKFHECFINW---MR-DCHSSDDKDV----------------IAIDGKTL- 113 R I + C+ W MR D S +D D IAIDGKT+ Sbjct: 89 RRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTIC 148 Query: 114 ---------RHSYDKSRRRGA----IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 + S K + A +H++SAF + SL +GQ + K NEI AIP+LL+ Sbjct: 149 GAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLD 208 Query: 161 MLDIK-GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA 219 +DI+ G ++T DA+G QK I EKI ++ DYL VK N +L + E ++ Sbjct: 209 DIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRE 268 Query: 220 HDSYAMSEKS---HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 +D +E++ HG R I C P L +WK L+ + + + IA E Sbjct: 269 NDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIA--TGE 326 Query: 277 LEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGI 336 ++ +ISS E R HW VEN LHW+LDV NEDD + + N+A+ FS + Sbjct: 327 IQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTL 385 Query: 337 RHIAINILTN--DKVFKAGLRRKMRKAAMDRNYLASVLTG 374 +A+ IL N D+ K + RK +KA YLA+++ Sbjct: 386 TKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLINN 425 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 165 bits (418), Expect = 2e-39, Method: Compositional matrix adjust. 
Identities = 104/343 (30%), Positives = 165/343 (48%), Gaps = 21/343 (6%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R +H L +IL + + AV+ GA ++E F + D L+Q+ E G P HD Sbjct: 10 VPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLERGAPSHD 68 Query: 73 TIARVVSCISPAKFHECFINWM----RDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 T +RV++ + P +E F+ +M K +A+DGK+LR +Y K R V Sbjct: 69 TFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGRSHMPPLV 128 Query: 129 ISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 ++ F + + Q E E+ A L +L +KG +T DA+ C + + + ++ G Sbjct: 129 VTVFGCDTFMSLAQTVAQE-GGEVQAAIAALELLSLKGLTVTADALHCHRRMTKTVRDGG 187 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPL-KELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDEL 247 G Y+ A+KGNQ +L A E L K A + E +HGR E+R V Sbjct: 188 GHYVIAIKGNQSKL--AAEANTALDKAAAGKATKFHQTEEDAHGRHEVRRAFVIPFAQ-- 243 Query: 248 IDFTFEWKGLKKLCV---AVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 T L LC S+R++ + + VR Y S + A + +R HW Sbjct: 244 ---TPGKNALVDLCAIGRVESWRTVEGKTTHK----VRCYALSRKMPAHELLATVRRHWS 296 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 +EN LHW+LDV++ ED + R+ N A + +R + +N+L D Sbjct: 297 IENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRAD 339 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 164 bits (415), Expect = 5e-39, Method: Compositional matrix adjust. 
Identities = 88/240 (36%), Positives = 137/240 (57%), Gaps = 8/240 (3%)

Query: 3   LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62
           +++L+ + I D RQ K+ H L DIL++ + A ++ A+ W ++ F E + D+L++Y
Sbjct: 1   MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60

Query: 63  DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119
           + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R +
Sbjct: 61  ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120

Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179
           + G H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q
Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178

Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDS---YAMSEKSHGREEIR 236
           IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R
Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238

>UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE
Length = 387

Score = 163 bits (412), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/385 (32%), Positives = 184/385 (47%), Gaps = 43/385 (11%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINW---MR-DCHSSDDKDV----------------IAIDGKTL----------RHSYDK 119 + W MR D S +D D IAIDGKT+ + S K Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 120 SRRRGA----IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK-GKIITTDAM 174 + A +H++SAF + SL +GQ + K NEI AIP+LL+ +DI+ G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKS---HG 231 G QK I EKI ++ DYL VK N +L + E ++ +D +E++ HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA E++ +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIA--TGEIQNEKHCFISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN--DKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL N D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVLTG 374 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLINN 382 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 159 bits (403), Expect = 1e-37, Method: Compositional matrix adjust. 
Identities = 81/195 (41%), Positives = 119/195 (61%), Gaps = 2/195 (1%) Query: 94 MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFT-P 119 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 P EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 120 HRRAPIDRDTCQIEKQKGRVEARTYHVLSASDLIRDFS-TWSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKELEMTVRYYISSA 288 + + + + + + S+ Sbjct: 179 RARVGVPLLHKVQSS 193 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 158 bits (399), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 89/278 (32%), Positives = 143/278 (51%), Gaps = 17/278 (6%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L+ + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD------VIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + D IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN-----PAHDSYAMSEKSHG 231 ++++A+ I +G YL +K NQ +++ F + P D++ + +HG Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAF---DDTHG 238 Query: 232 REEIRLHIVCDVPDELIDFTFE-WKGLKKLCVAVSFRS 268 R R C PD T W GL + + + R+ Sbjct: 239 RLVRRRVFAC--PDAGCFTTLRGWPGLTTVLASETIRA 274 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 
2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 157 bits (398), Expect = 5e-37, Method: Compositional matrix adjust. Identities = 107/351 (30%), Positives = 163/351 (46%), Gaps = 46/351 (13%) Query: 15 DYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 D RQ K+ H+ I++ + V + + W ++ DF DF++++ P HDT+ Sbjct: 29 DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFFPDIQKAPSHDTL 88 Query: 75 ARVVSCISPAKFHECFINW---MRDCHSSDDKD-----------------VIAIDGKTLR 114 R + P + W MR+ ++ ++ IAIDGKT++ Sbjct: 89 RRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKPFRQIAIDGKTIK 148 Query: 115 HSYDKSRRRGA--------------IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 + ++ RRR +H++SAFS L +GQ + D+K NEI AIP LL+ Sbjct: 149 KAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKKENEIVAIPRLLD 208 Query: 161 MLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL------NKAFEEKFPLK 213 LDI +G ++T DAMG QKDI +I K+ YL VK NQ L N E+ PL Sbjct: 209 DLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIAGNMRDFERIPLP 268 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 N + + E HG +R VC L +W+ L+ + + R + E Sbjct: 269 ---NEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIRTER--VDEA 323 Query: 274 KKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 324 TGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 157 bits (396), Expect = 7e-37, Method: Compositional matrix adjust. 
Identities = 105/345 (30%), Positives = 182/345 (52%), Gaps = 16/345 (4%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-- 67 I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ +G Sbjct: 9 IAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGKE 68 Query: 68 -----IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 +P HDT V I P +F E + ++ + + IAIDGKT R ++ Sbjct: 69 LKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPR-GIKQTAN 127 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G ++ E Sbjct: 128 SHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVIE 187 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIR-LHIVC 241 I +GG+++ VKGNQ +L + E++F N + D+ + HGR E R ++ + Sbjct: 188 MILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSADT--QEDIGHGRVEKRTVYCIT 245 Query: 242 DV-PDELIDFTFE-WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 ++ D+ ID + WKG+K L V R + + K + YYI++ + ++ AI Sbjct: 246 EIKTDDDIDGCMQKWKGVKTLVKIV--REVYKKADKSTRIETVYYITNL-IDPKEINRAI 302 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 R HW +EN LH LDV++NED + N E F + +A+ I+ Sbjct: 303 RAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFII 347 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 156 bits (395), Expect = 1e-36, Method: Compositional matrix adjust. 
Identities = 107/365 (29%), Positives = 170/365 (46%), Gaps = 17/365 (4%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R A H L ++L++ +V+ GA ++ FG + + ++ +P HD Sbjct: 44 VPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLKHAVPSHD 102 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSS-DDKDVIAIDGKTLRHSYDKSRRRGAIHVISA 131 T + V I P F + D + D DVIA+DGK LR + D ++SA Sbjct: 103 TFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGRTRMMVSA 162 Query: 132 FSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDY 191 ++ L + + D + E+ A E L ++ +KGK++T DA+ C + I GGD+ Sbjct: 163 YAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAINAGGGDW 221 Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFT 251 A+K NQ L F + AH S + HGR E R V V + + Sbjct: 222 CLALKANQDSLLSDARASFGAEP---DAHPSALSEDIGHGRTETRKATV--VSSKALAEH 276 Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKELEMT--VRYYISSADLTAEKFATAIRNHWHVENKL 309 E+ GLK +F + A +K T RY+ S T E +R HW +EN L Sbjct: 277 HEFPGLK------AFGRVEATRKTAEGTTSETRYFALSWVPTPEVLLATVRAHWAIENSL 330 Query: 310 HWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLA 369 HW+LDV ED + R+ N+ + +R A++++ D K L K+++A D ++L Sbjct: 331 HWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWDDDFLR 389 Query: 370 SVLTG 374 +VL G Sbjct: 390 NVLNG 394 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. 
Identities = 74/165 (44%), Positives = 106/165 (64%), Gaps = 3/165 (1%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R+ + H+L ++LL IC VISGAE W + + + D+L+ Y + +GI HD Sbjct: 15 LPDPRRR-ECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPYAHGIASHD 73 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T RV S + ++F CF+ W+ S + +AIDGK LR S+D + R IH++SA+ Sbjct: 74 TFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGA--RSPIHLVSAW 131 Query: 133 STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S+ +L +GQ++T +KSNEITAIPELL LDI+G IT DAMGC Sbjct: 132 SSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCH 176 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 111/350 (31%), Positives = 171/350 (48%), Gaps = 16/350 (4%) Query: 3 LKKLMGHISIIPDYRQAWK--MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL S IPD+R+A K + HKLSDI++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----DDKDVIAIDGKTLRH 115 NGIP T+ R+ I + H +++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + +EKSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 QKDI +KI+++ GD++ +K NQ L E+K +KEL +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDK--IKEL-SPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ + Sbjct: 327 
GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVFS 375 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 147 bits (372), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 73/87 (83%), Positives = 76/87 (87%) Query: 272 EQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 EQKKE EMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAAE Sbjct: 19 EQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAAE 78 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRKM 358 LFSGIR IAINILT DK+ KAG R KM Sbjct: 79 LFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 144 bits (362), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 83/197 (42%), Positives = 113/197 (57%), Gaps = 8/197 (4%) Query: 133 STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 S +LV+GQ K ++KSNEITAIP L+ ML+I+ IIT DAMGCQK+I I+K+ GDY+ Sbjct: 28 SLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYI 87 Query: 193 FAVKGNQGRLNKAFEEKFPL---KELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD-ELI 248 +K NQ L + +E F + +E + H Y E H R E R I V + Sbjct: 88 ITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSSLPCL 147 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W LK + + S R + + E VR+YISS + ++K ATAIR+HW +EN Sbjct: 148 HNQDLWTELKTVVMVKSERRLWNKTTTE----VRFYISSVEKNSQKIATAIRSHWEIENS 203 Query: 309 LHWRLDVVMNEDDCKIR 325 LHW LDV +ED +IR Sbjct: 204 LHWTLDVTFSEDKSRIR 220 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust. 
Identities = 92/237 (38%), Positives = 121/237 (51%), Gaps = 9/237 (3%)

Query: 143 IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202
           + T++KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK NQ +L
Sbjct: 1   MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60

Query: 203 NKAFE---EKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259
           A EK EL H +Y HGR + R H V VP EW +K
Sbjct: 61  AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPGFAA-KGEWPWIKA 119

Query: 260 LCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319
           + AV I VRYY+ S L+ ++F +R HW +E+ +HW LDV E
Sbjct: 120 IGTAV---RITTHADGTQSDEVRYYMLSRFLSGKRFGEVVRGHWGIES-MHWVLDVTFGE 175

Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSG 376
           D + R+ A S +R AI +L K +R KM + MD ++L VLT G
Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHPE-KDSIRGKMIRCLMDTSFLNEVLTLQG 231

>UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3
Length = 376

Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 122/372 (32%), Positives = 182/372 (48%), Gaps = 27/372 (7%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 5 LFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEML 64 Query: 66 NG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH---- 115 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 65 TGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRGVKKL 124 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 S+D HV+SAFS + Q+ D K+NEI AI +LL++LD+ G +++ DA+G Sbjct: 125 SFDTQS-----HVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIG 179 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPAHDSYAMSEKSHGREE 234 Q I E+I +GGDY+ VK NQ + E F PL + + + +E SHGR E Sbjct: 180 TQTAIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLDEQ---TELSHGRIE 236 Query: 235 IRLH--IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISS-ADLT 291 R + I+ + E + KGL+ + V R K E V YYISS D++ Sbjct: 237 TRRYESILNPLEIEANEVLTRRKGLRSIHKVVRKRRDKKSDKTSEE--VAYYISSLTDVS 294 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-F 350 + K AIR HW +ENKLH LDV D R N A++ I+ I + I+ K Sbjct: 295 SLK--QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKTNM 352 Query: 351 KAGLRRKMRKAA 362 K+ + R +K A Sbjct: 353 KSSIPRIQKKPA 364 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 140 bits (353), Expect = 8e-32, Method: Compositional matrix adjust. 
Identities = 105/341 (30%), Positives = 160/341 (46%), Gaps = 17/341 (4%) Query: 40 GAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISP---AKFHECFINWMRD 96 GA+ +I +F E LK+ +G P HDT +R+ I P A+ F+ +R Sbjct: 37 GAKNCVEIAEFVEGREAELKEIVTLRHGCPSHDTFSRIFRLIDPDELARALGAFLAALRQ 96 Query: 97 CHS--SDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITA 154 + V+A+DGK LR Y+K R ++S + L + K E S+E+ A Sbjct: 97 GLGLGPRPRGVVAVDGKALRRGYEKGRAFMPPVMVSVWDAETRLSVA-TKRAEGSDEVAA 155 Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 LL +D+KG I+T DA+ C+ D A+ + + Y A+K N+GRL E F + Sbjct: 156 TLALLKSIDLKGCIVTADALHCRPDTAKALIGRKAHYALALKANRGRLFACAEAGFVAAD 215 Query: 215 LNN--PAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 H++ E HGR E R V +P + + GLK + + R Sbjct: 216 AAGDLAFHET---RETGHGRLETRRASV--LPLKAFKQAPAFPGLKAIGRIQATRQ---G 267 Query: 273 QKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 +VRY S L K A +R HW +EN+LHW LDVV +EDD + R+ NA + Sbjct: 268 ADGRAVTSVRYIALSKVLAPHKLAEVVRAHWTIENQLHWSLDVVFHEDDARSRKDNAPQN 327 Query: 333 FSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 + IR +A +IL + K + KMR+ +R++ T Sbjct: 328 LAVIRRLARDILAAHPLDKP-IASKMRRVNWNRDFFHEFFT 367 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 138 bits (348), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 111/357 (31%), Positives = 179/357 (50%), Gaps = 32/357 (8%) Query: 3 LKKLMGHISIIPDYRQAWK--MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K L + +PDYR+ K ++KL DILLL I + DI FG+ + + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISP-------AKFHECFINWMRDCHSSDDKDVIAIDGKTL 113 G +G+P T+ R+ I ++F F + + C D++ IDGK + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAG----DILCIDGKAM 133 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 R + ++ R I +SA+S + + +EKSNEIT++P+LL+ +D+ G I+T DA Sbjct: 134 RGTVLENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADA 191 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSE-KSHGR 232 M QK I +KI+++GGD+L +K NQ L E+ EL P D Y+ HGR Sbjct: 192 MSFQKAIIDKIREKGGDFLIELKANQRTLRYGVEDNV---ELAEPV-DVYSEGPFLEHGR 247 Query: 233 EEIRLHIVCDV--PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTV--RYYISSA 288 E R VC + ++LI +W G L V V R+ E+K + + + R+Y+SS Sbjct: 248 IETR---VCRIFRGNDLITDREKWNG--NLTV-VEIRT-ATERKSDGQKSSERRFYVSSF 300 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 +A + T R HW +E+ +HW LD + +D + +A I+ + + IL+ Sbjct: 301 HGSARRLGTIARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAILS 356 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 137 bits (346), Expect = 5e-31, Method: Compositional matrix adjust. 
Identities = 79/195 (40%), Positives = 111/195 (56%), Gaps = 7/195 (3%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K IP HD Sbjct: 12 IEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPSLEFIPSHD 71 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLR--HSYDKSRRRGA----I 126 T R S I P F F NW++ + K V+AIDGK +R D RG + Sbjct: 72 TFNRFFSMIKPDYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTRGKEGFKL 130 Query: 127 HVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK 186 ++SA+S + + +GQ+K D+KS+EITAIP L+N L++ G I+T DAMGCQKDI + I Sbjct: 131 WMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQKDITQTIIG 190 Query: 187 QGGDYLFAVKGNQGR 201 +Y+ A+K N+ + Sbjct: 191 HDANYIIAIKENKKK 205 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 133 bits (334), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 90/250 (36%), Positives = 133/250 (53%), Gaps = 16/250 (6%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YDKSR 121 IP HDT R S I P F F NW++ + K V+AIDGK +R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 + I + +Y+ A+K N+ + L K + + K+ + HGR E R Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCDVPDELIDFTFEWK--GLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT-AEKF 295 V +++ F+ K GLK + S R+I+A E VRYY++S D T E+ Sbjct: 183 TVVSY-GSIMEKMFKKKLVGLKSIVGIKSERTIVA--TGEYTQEVRYYVTSLDNTKPEEI 239 Query: 296 ATAIRNHWHV 305 A+AIR HW + Sbjct: 240 ASAIRQHWSI 249 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 132 bits (333), Expect = 2e-29, 
Method: Compositional matrix adjust. Identities = 111/375 (29%), Positives = 179/375 (47%), Gaps = 36/375 (9%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDF-GETHPDFLKQ 60 E+ L+ ++ +PD R + H L+ +L LT CAV++GA + ++ E + L++ Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV--IAIDGKT 112 G + + P TI RV++ I W+ C D + +A+DGK+ Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWL-ACRQQDAGGLRALAVDGKS 156 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITT 171 LR + RR +H+++A + LV+ Q+ EK+NEIT LL+ L D+ G ++T+ Sbjct: 157 LRGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTS 214 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHG 231 DA+ Q D A ++ + Y+ VK N +L+ + P +++ P D + HG Sbjct: 215 DALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKS-LPWQQI--PLQDRTRTT--GHG 269 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 R EIR VC V + L + G ++ V R + ++ + Y ++S L Sbjct: 270 RCEIRRLKVCTVNNLL------FPGARQAVQIVRRR--VNRTTGKVSLKTIYAVTS--LA 319 Query: 292 AE-----KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 AE + A IR HW VE H R DV ED ++R GNA + + R++AI L Sbjct: 320 AEQAPPARVAQLIRGHWTVEALHHVR-DVTFAEDASQLRSGNAPQAMATYRNLAIGALRL 378 Query: 347 DKV--FKAGLRRKMR 359 V AGLRR R Sbjct: 379 AGVRNIAAGLRRTAR 393 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 67/134 (50%), Positives = 92/134 (68%), Gaps = 6/134 (4%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D +R IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHDGAR--SPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDS 222 G IT DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + L P Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFW-Q 117 Query: 223 YAMSEKSHGREEIR 236 ++ ++K+HGR E R Sbjct: 118 HSQTDKNHGRIETR 131 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 91/349 (26%), Positives = 170/349 (48%), Gaps = 22/349 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ P + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRHSYDKS 120 +P TI +V + + +D + +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T +KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQG-GDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHI 239 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSDPV---ERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK----- 294 + V L F + +++ + R ++ E+ Y I S L E+ Sbjct: 276 ILTVARGL-RFPYA----QQVIQIIRRRRVLGAGAWSTEVV--YAICS--LPCEQAPPKL 326 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 A+ IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 327 LASWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGL 375 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria 
RepID=B0JJ38_MICAN Length = 363 Score = 117 bits (294), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 95/346 (27%), Positives = 162/346 (46%), Gaps = 20/346 (5%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-YGDF 64 L+ + + D+R+ H L +L++ I + G G+ ++ +F + + L Q + Sbjct: 4 LIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEFNII 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSDDKDVIAIDGKTLRHSYDK--SR 121 +P + TI RV+ + + F W + + DD + + +DGK+L+++ + Sbjct: 64 PERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNPNNE 123 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ I +S FS LV+ + + +K +EI ++ ++ K+ T DA+ CQK Sbjct: 124 QQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQKKT 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEK--SHGREEIRLH 238 I K DY+ VKGNQ L K +++L+N + E+ SHGR+ R Sbjct: 184 ISLIAKTKNDYVITVKGNQKNLYKR------IQDLSNSSKPESCFLEQDNSHGRKISRKI 237 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATA 298 V V E +G + L + + K E T YYISS +A+ FA Sbjct: 238 EVFKVRKN------ERQGFENLRRVIKVERKGSRGDKTYEETA-YYISSLTESAQVFAKI 290 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 IR HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 291 IRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLF 336 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 78/201 (38%), Positives = 101/201 (50%), Gaps = 24/201 (11%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWF--- 57 Query: 214 ELNNPAHDS--------YAMSEKSHGREEIRLHIVCDVPDE---LIDFTFEWKGLKKLCV 262 AHD + +K HGR E R VC V ++ L W GL++L + Sbjct: 58 ---EAAHDGKLEGSYWEHTEHDKGHGRLETR---VCRVSEDVAWLASTGQHWAGLQRLVM 111 Query: 263 AVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 R I QK E YYISS + A + A IR HW +EN+LHW LDV ED Sbjct: 112 LERTRQI--GQKVTTERC--YYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDAS 167 Query: 323 KIRRGNAAELFSGIRHIAINI 343 IR AA + +R I +N+ Sbjct: 168 LIRDTVAARNMASLRKITLNL 188 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 59/132 (44%), Positives = 85/132 (64%), Gaps = 3/132 (2%) Query: 104 DVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD 163 D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT+EKSNE TAIP+L +L Sbjct: 8 DIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKLFTLLA 67 Query: 164 IKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAH 220 ++ +T DA+G Q+DIA++I + DYL VK NQ L++ + + K Sbjct: 68 LEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGFTEDFT 127 Query: 221 DSYAMSEKSHGR 232 DS HGR Sbjct: 128 DSVTEEGDKHGR 139 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust. 
Identities = 109/394 (27%), Positives = 180/394 (45%), Gaps = 47/394 (11%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDF 57 ++ L+ + I D R+A + LS +L + A ++GA G +I DFG+ D Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQ---DL 77 Query: 58 LKQYG---DFENG---IPVHDTIARVVSCISPAKFHECFINWM--RDCHSSDDKDVIAID 109 L + G D G P I + + A F W+ + V+A+D Sbjct: 78 LARLGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMD 137 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL-NMLDIKGKI 168 K LR ++ + +R + ++SA LV GQ++ + +NEIT + LL N+ DI G + Sbjct: 138 VKVLRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPV 195 Query: 169 ITT-DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL-NKAFEEKFPLKELNNPAHDSYAMS 226 + T DA+ Q + A + + G DY VKGNQ L K FE+ PL + P H+ + Sbjct: 196 VATLDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYRKTFEQTLPLLQ-KPPQHE---VE 251 Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELE--MTVRYY 284 E+ HGR + + +T E KG+ VA + ++I + +L+ R Y Sbjct: 252 ERGHGRIK-----------KWQAWTTEAKGIGFPEVATA--AVIRRDEFDLKGIRVSREY 298 Query: 285 ISSADLTAEKFATA------IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 A ATA IR HW +EN++H+ D ED + GN+ + R+ Sbjct: 299 AHILTSVAGNRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRN 358 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 +AI I+ + + K ++ + A DR+ + +L Sbjct: 359 LAIGIIRRNGIRK--IKETLEYIAGDRDRVLPLL 390 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 110 bits (275), Expect = 8e-23, Method: Compositional matrix adjust. 
Identities = 95/343 (27%), Positives = 145/343 (42%), Gaps = 58/343 (16%) Query: 52 ETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGK 111 +TH + L+++ + GI TI R++ I F+ W+ + S + +A+DGK Sbjct: 24 KTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALYAFMEWVGEIVDSRNTH-LAVDGK 82 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITT 171 L + +K++ +++ T+ L++ Q+ D K+NEIT IPELL +LDI G I+T Sbjct: 83 ALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSKTNEITVIPELLKLLDISGSIVTI 142 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA------------ 219 DA+G Q I E+I +QGG + VK NQ +A+EE + A Sbjct: 143 DAVGTQTAIMEQIHEQGGHFALTVKKNQ---PEAYEEIHTFMDKLEAADVQRKKGEVLDS 199 Query: 220 --------HDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR---- 267 ++ EK+ R E R +C L EW ++ + R Sbjct: 200 GMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQKEWPHVQSIGRIKQVRIPSE 259 Query: 268 ------------------------SIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 + AE+ ++ IS LTAE+ + R HW Sbjct: 260 KDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTALISDLILTAEELGSIKRMHW 319 Query: 304 HVENKLHWRLDVVMNED--DCKIRRGNAAELFSGIRHIAINIL 344 +EN+LH LD ED K R N S IR A NIL Sbjct: 320 SIENRLHHVLDDTFREDRSPAKKSRNN----LSLIRKYAYNIL 358 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. 
Identities = 90/325 (27%), Positives = 140/325 (43%), Gaps = 29/325 (8%) Query: 50 FGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ +T NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQ-QTAPGRNEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ-GRLNKAFEEKFPLKELNNPAHDSYAMSEK 228 T DA+ C+ D A I GGDY A+K NQ G L + ++ L +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLG-----VQTAAEN 204 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA 288 H R E R + V D IDF GL+ + S + L VRY++ S Sbjct: 205 DHDRCERRRACIVAVND--IDF----PGLQAIG---SVEATSRHADGRLTSHVRYFLLST 255 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 256 IMSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHP 315 Query: 349 VFKAGLRRKMRKAAMDRNYLASVLT 373 KA +RRK++ A D +L S++ Sbjct: 316 -DKASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 103 bits (257), Expect = 9e-21, Method: Compositional matrix adjust. 
Identities = 58/135 (42%), Positives = 82/135 (60%), Gaps = 3/135 (2%) Query: 105 VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDI 164 VIAI+GK+LR + + A+H +SA++ + L +GQ+ EKSNEITAI ELL L + Sbjct: 5 VIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLPTLAL 64 Query: 165 KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPAHDS- 222 +G ++T DA+GCQ +AE+I GGDY+ AVK NQ L A + F L +P + Sbjct: 65 EGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPVRQTC 124 Query: 223 -YAMSEKSHGREEIR 236 + +K HGR E R Sbjct: 125 VHETLDKGHGRIETR 139 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 95/368 (25%), Positives = 164/368 (44%), Gaps = 27/368 (7%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPD-FLKQYGDF 64 L+ ++ +PD R + H L +L + AV++GA + ++ P L + G F Sbjct: 29 LVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELGVF 88 Query: 65 ENGI------PVHDTIARVVSCISPAKFHECFINWMRDCH--SSDDKDVIAIDGKTLRHS 116 + P T R+++ + + W+ C ++ + V ++DGKTLR S Sbjct: 89 RDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLRGS 148 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 + +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 149 GPAGEQ---VHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADALHT 205 Query: 177 QKDIAE-KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 Q++ A + + Y+F VK NQ RL + + P ++ P D S + HGR +I Sbjct: 206 QREHARWLVDTKKAAYVFTVKKNQPRLYRQL-KTLPWTKI--PIQDE--TSTRGHGRYDI 260 Query: 236 R--LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 R + C P L DF ++ L + ++ + + + +S+A Sbjct: 261 RRLQAVTCTGPLAL-DFPH---AVQALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGPA 316 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI--LTNDKVFK 351 + A +R HW +E H R D ED ++R GNA + +R+ AIN+ LT Sbjct: 317 ELADWLRGHWAIETLHHIR-DTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGITTIA 375 Query: 352 AGLRRKMR 359 A LR R Sbjct: 376 AALRHNSR 383 
>UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 102 bits (253), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 97/326 (29%), Positives = 145/326 (44%), Gaps = 28/326 (8%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPD-FLKQYG-DFENGIPVHDTIARVVSCISPAKF 86 +L + + A +G G+ + T D L Q G F P T V+S + PA Sbjct: 3 LLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRR--PSEKTFRAVLSRLDPADL 60 Query: 87 HECFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 + ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 61 NARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQL 118 Query: 144 KTDEKSNEITAIPELLNMLDIKGK-IITTDAMGCQKDIAEKI-QKQGGDYLFAVKGNQGR 201 EKSNEI + LL +L + ++T DAM Q A+ I YL VK NQ + Sbjct: 119 AVAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAK 178 Query: 202 LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIR-LHIVCDVPDELIDFTFEWKGLKKL 260 + A P E+ A D + HGR E R L I+ I F + K++ Sbjct: 179 I-LARITALPWAEVPAAATDD----SRGHGRVETRTLQIITAARG--IGFPYA----KQI 227 Query: 261 CVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVVM 317 R I A ++ +E V Y I S + T +R H +EN LHW DV Sbjct: 228 IRITRERLITATDQRSVE--VVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTF 285 Query: 318 NEDDCKIRRGNAAELFSGIRHIAINI 343 +ED + GN A++ + +R+ AIN+ Sbjct: 286 DEDRQRAHTGNGAQVLATLRNTAINL 311 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust. 
Identities = 56/148 (37%), Positives = 84/148 (56%), Gaps = 3/148 (2%) Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 + NG P DT RV+ I P + C + ++ S + IAIDGK L+ S K+ Sbjct: 17 ELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHIAIDGKRLKGSKKKT-- 74 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 G+ H++SA+ L + Q EK NE+ AIPE+L+ LD+ G +I+ DAMG Q +IAE Sbjct: 75 -GSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSGAVISIDAMGTQTNIAE 133 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +I + DY+ ++KGNQ L + + F Sbjct: 134 QIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 100 bits (249), Expect = 9e-20, Method: Compositional matrix adjust. Identities = 60/189 (31%), Positives = 95/189 (50%), Gaps = 12/189 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDTAKNSP 61 Query: 215 LNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 LN +++ ++K HG E H + + +W GL++ +S R Sbjct: 62 LN-----AWSWTQKGHGHES---HCRLKIWEATESMKMQWAGLERF---ISIRRQGFRHH 110 Query: 275 KELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 K+ + T Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 111 KKFDSTT-YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINI 343 +R+IA N+ Sbjct: 170 ILRNIAFNL 178 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust. 
Identities = 59/150 (39%), Positives = 80/150 (53%), Gaps = 8/150 (5%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 G H++SA++T H + +G + T+EKSNEITAI LL L K ++T DAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFE---EKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 I GGD++ AV+ NQ +L A EK E H ++ HGR + R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 CDVPDELIDFTF--EWKGLKKLCVAVSFRS 268 VP DF EW +K + AV + Sbjct: 122 AQVPP---DFAAKGEWPWIKAIGTAVRITT 148 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 98.2 bits (243), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 57/171 (33%), Positives = 91/171 (53%), Gaps = 19/171 (11%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAF----EEKFPLKELNNPAHDSYAMSEKSHGRE 233 K + I + G DY+ AVKGNQ RL++ E++ P+ S ++ +S Sbjct: 3 KKTVQLIIEGGNDYVIAVKGNQKRLHEQIKLTTEQRLPVSLDITTERRSDRITTRS---- 58 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 V D+L +++W+GL++L F + + + + YYISS + A Sbjct: 59 -------VSVFDDLSGISYDWEGLQRLVKVERFGTRAGKPYHQ----IVYYISSLTINAA 107 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 +FA IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL Sbjct: 108 QFAQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTIL 158 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust. 
Identities = 63/208 (30%), Positives = 100/208 (48%), Gaps = 5/208 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H D + +A+DGK L S D Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRDGQVP- 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 122 -GTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +Q +GGD + K NQG L E F Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAF 208 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 65/178 (36%), Positives = 97/178 (54%), Gaps = 8/178 (4%) Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELI 248 GDYL VKGNQ +L +A E F + + + + D A+ E+ HGR ++ V + I Sbjct: 7 GDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLSA--KGI 63 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 +W + S R ++ E++ +LE YYI+S LTAE+ A ++R W VEN+ Sbjct: 64 INPGDWPNCVTIGRIDSMR-VVDEKESDLERC--YYITSRALTAEQLAASVRARWGVENR 120 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKAAMD 364 HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + AA D Sbjct: 121 FHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKGAARD 178 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. 
Identities = 43/90 (47%), Positives = 60/90 (66%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +A+N + +K A + RK + A M L Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVL 90 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 94.7 bits (234), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 64/211 (30%), Positives = 96/211 (45%), Gaps = 3/211 (1%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + L +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRDG 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 121 QVP--GQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +A + G DY+ K NQ L + E Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGL 209 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 93.6 bits (231), Expect = 1e-17, Method: Compositional matrix adjust. 
Identities = 59/148 (39%), Positives = 82/148 (55%), Gaps = 11/148 (7%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMS-----EK 228 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F +E N +SY + K Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYF--EEANEANFESYNIDFAETYNK 58 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA 288 SHGR E R V L D + W+GL+ + + S R++ K++ + RYYISS Sbjct: 59 SHGRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTL----KEKTTIEHRYYISST 114 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVV 316 TA + R HW +EN LHWRLD+ Sbjct: 115 MATAAYLLNSSREHWGIENSLHWRLDIA 142 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 60/168 (35%), Positives = 88/168 (52%), Gaps = 19/168 (11%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K E SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP------LKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 DY+ +K NQG L ++ E+ F +EL H +Y E HG EIR Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQ---HSTYKPEETGHGLHEIRNFGFQ 117 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD 289 PD + W LK +V I + + + RY+ISS D Sbjct: 118 LDPDSV------WSNLK----SVGMVEPIGQVDDKTTVETRYFISSLD 155 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 92.4 bits (228), Expect = 3e-17, Method: Compositional matrix adjust. 
Identities = 89/375 (23%), Positives = 160/375 (42%), Gaps = 47/375 (12%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHP-DFLKQ 60 E++ L ++ +PD R + H+L IL L+ AV +G + E+I + P L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHEC---FINWMRDCHSSDDKDVIAIDGK 111 G + + P DT+ RV+S + + F + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIKGK 167 TLR + R A H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGR--APHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRLNKAFEE-----KFPLKELNNPAHD 221 ++T DA+ + A+ I + G ++F VK N L+ + K P+ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI--------- 266 Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW--------KGLKKLCVAVSFRSIIAEQ 273 ++ ++HGR E R I E I + + +++ + R A Sbjct: 267 GHSAEGRAHGRFERRT-IQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGR---ARV 322 Query: 274 KKELEMTVRYYISSA----DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + + TV ++ ++ +T A R HW +ENK+HW DV ED ++R G Sbjct: 323 TRTIPSTVTVHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPL 382 Query: 330 AELFSGIRHIAINIL 344 + + +R++ I ++ Sbjct: 383 PRIMTTLRNLIIGLI 397 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 91.7 bits (226), Expect = 4e-17, Method: Composition-based stats. 
Identities = 43/113 (38%), Positives = 68/113 (60%), Gaps = 4/113 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W LK + + S I + + + RY+ISS D E+ A ++R+HW +EN LHW L Sbjct: 15 WSNLKSVGMVES----IGQVDDKTTVETRYFISSLDSNGEQLANSVRSHWAIENSLHWVL 70 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 DV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 71 DVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 90.5 bits (223), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 54/182 (29%), Positives = 91/182 (50%), Gaps = 4/182 (2%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + L+ + +PD R+A + L +L+ T+ A++SGA + I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 61 -YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHS 116 +G PV +T+ V+ + + F + + K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D+KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QK 178 QK Sbjct: 191 QK 192 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust. 
Identities = 66/190 (34%), Positives = 92/190 (48%), Gaps = 22/190 (11%) Query: 192 LFAVKGNQG----RLNKAFE--EKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 + AVK NQ R+ A + E F L + H +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREV---DKGHGRIETRRCLALDFPG 57 Query: 246 ELIDFTFE---WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 FE W GL+ + + S R I + RYY+SS A + A A+R H Sbjct: 58 P-----FEPDLWPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAH 108 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +E+ +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA Sbjct: 109 WGIES-MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAG 167 Query: 363 MDRNYLASVL 372 +Y A +L Sbjct: 168 ASDDYRAQLL 177 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 53/127 (41%), Positives = 69/127 (54%), Gaps = 1/127 (0%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + Y E+S GR E Sbjct: 12 VRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHES 71 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 72 RAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEEL 130 Query: 296 ATAIRNH 302 TA R H Sbjct: 131 LTASRLH 137 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust. 
Identities = 64/226 (28%), Positives = 108/226 (47%), Gaps = 17/226 (7%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ LM +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDF----ENGI---PVHDTIARVVSCISPAKFHECFINWMRD----CHSSDDKDVIAIDG 110 F E I P T+ R + I + W + C D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIIT 170 K +R + K++ IH ++AF +V+ Q DEK+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKI-QKQGGDYLFAVKGNQGRLNKAFE----EKFP 211 DA+ Q + A I + + DY+F VK NQ + + E E FP Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIESLPWEAFP 445 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 84.0 bits (206), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 84/356 (23%), Positives = 156/356 (43%), Gaps = 25/356 (7%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD+R + ++L+ +L L + I+G + + ++ P + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS--SDDKDVIAI--DGKTLRHSY 117 F +P TI R+V P + + W +D ++A+ DGK ++ + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRRGAIH---VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 + + G++ V+ A +G + +EI ++ L+N + ++TTD + Sbjct: 144 SRPPQ-GSVRQEAVVEAVRHDTGTALGHQRV-VAGDEIASVRRLVNRVCDHNTLVTTDCL 201 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGR-E 233 + +A I+ +GG +LF++KGNQ + +A P E N + EK+HGR E Sbjct: 202 HAHEPLARAIRAKGGHWLFSIKGNQPTV-RAKLAGLPWDEFGN----QHVTREKAHGRIE 256 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLC-VAVSFRSIIAEQKKELEMTVRYY----ISSA 288 E L + L+ F +G +++ +A + R T +Y +S+ Sbjct: 257 ERALKALTPSAPSLVGF----RGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTD 
312 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 + + A R HW VE H R D M+ED IR NAA ++ R I+ L Sbjct: 313 QASPAQLARWARGHWTVEAIHHVR-DRTMDEDRHTIRTKNAALNWAIARDTTISAL 367 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 56/170 (32%), Positives = 85/170 (50%), Gaps = 5/170 (2%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-YGDFENGIPVH 71 IPD+R+A L +LL +I A++SGA + I F TH L +G P + Sbjct: 11 IPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAFGCRWRRTPAY 70 Query: 72 DTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISA 131 +I + + F ++ VIA+DGKTLR S D+ R A V+SA Sbjct: 71 SSIRYALQGLDVQALAPHF--RAHAARLAEGAAVIALDGKTLRGSLDRFEDRKAAQVLSA 128 Query: 132 FSTMHSLVIGQIKTDE--KSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 F+T +V+GQI ++ K +EI A L+ L + G++ T DA+ QK+ Sbjct: 129 FATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 55/148 (37%), Positives = 73/148 (49%), Gaps = 6/148 (4%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFE--WKGLKKLCVAVSFRSIIAEQKKELEMTVRYY 284 +K HGR E R D L + WK + + S R I + E RY Sbjct: 137 DKGHGRIETRRCTAAGDLDWLATLGLKERWKKITSVAGIDSSRVI----GSKTETDRRYV 192 Query: 285 ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 ISS +E+ A+R HW +EN LHW LDV ED C IR NAA FS +R A+N+ Sbjct: 193 ISSLPADSERILHAVRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLF 252 Query: 345 TNDKVFKAGLRRKMRKAAMDRNYLASVL 372 D GL +K + AA + +YLA++L Sbjct: 253 RADHSRAMGLPKKRKAAAWNPDYLANIL 280 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 101/394 (25%), Positives = 159/394 (40%), Gaps = 85/394 (21%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAV-------ISGAEGW------EDIE 48 +++ L+ + D R A + +++S +L L +CA+ I+ A W E++ Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 49 DFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRD-----CHSSDD- 102 FG + +Y IP T+ V+ + P + + +R HS + Sbjct: 90 AFGLPYHPLRGRYR-----IPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPL 144 Query: 103 --------------------------KDVIAIDGKTLRHS--YDKSRRRGAIHVISAFST 134 + IA+DGK LR + D SR + V+SA Sbjct: 145 MPDGGIEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR----VFVLSAVRH 200 Query: 135 MHSLVIGQIKTDEKSNEITAIPEL------LNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 + + + K+NEI PE L+ D+KG ++T DA+ Q+D A + ++G Sbjct: 201 GDGITLASREIGAKTNEI---PEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERG 257 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELI 248 YL +K NQ R P KE+ D + HGR E RL V V L Sbjct: 258 AHYLLTIKNNQ-RGQARQLHALPWKEIPVIHRDD----ARGHGRHEQRLVQVVTVNGLLF 312 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATA-----IRNHW 303 + + + R + KK TV Y I+ DL AE+ + A R HW Sbjct: 313 PHAAQ-------VLRIQRRRRLYGAKKWSSETV-YAIT--DLPAEEASAAEIASWARGHW 362 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIR 337 VEN +HW DV NED ++R N + + +R Sbjct: 363 TVENTVHWCRDVTFNEDKSQVRTHNTPSVLAAVR 396 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. 
Identities = 62/213 (29%), Positives = 97/213 (45%), Gaps = 10/213 (4%) Query: 40 GAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS 99 GA+ ++ +F E + L++ +G P HDT +RV + P + F +M Sbjct: 37 GAKTCVEMAEFSEARQEELREIVALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRG 96 Query: 100 S----DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAI 155 + K V+AIDGK+LR YDK R ++S + I ++ +EI A Sbjct: 97 ALGLPAPKGVVAIDGKSLRRGYDKGRAFMPPLMVSVWDVETRPSIAAMRAP-GGDEIKAT 155 Query: 156 PELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKEL 215 +L L +KG +T DA+ C +A+ + Y +K N G L +A E F + Sbjct: 156 LSVLKALTLKGCTVTADALHCHPAMAQALLAAKAQYALGLKANHGPLFRAAEAGF--AAV 213 Query: 216 NNPAHDSYAMSEKSHGREEIRLHIVCDVPDELI 248 + A + E+ HGREE R V V D L+ Sbjct: 214 TDLA--VFETRERGHGREEQRRASVLPV-DRLV 243 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 81.3 bits (199), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 92/374 (24%), Positives = 150/374 (40%), Gaps = 49/374 (13%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG-ETHPDFLKQ 60 +++ L+ + +PD R+ + L +L L + AV GA G+ +I + + P+ Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMR--------------DCHSSDDKDVI 106 +G P T RV+ P E W + VI Sbjct: 91 FG-LVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVI 149 Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL--VIGQIKTDEKSN---EITAIPELLNM 161 + DGKT+R +RRR I+ + L G + E N EI A+ ++ Sbjct: 150 SADGKTMR----GARRRTGDGKIAQDQVVEILDHASGAVVACEPVNDGDEIGAVRTVMGR 205 Query: 162 L-----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 L + G ++ TDA Q + E++ GG +L VK NQ R+ A P ++ Sbjct: 206 LADRWGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRI-LAKVRALPWAQVR 264 Query: 217 NPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 A D+ K+HGR E R V P +D G ++ + ++ + Sbjct: 265 --AQDT--CRGKAHGRAETRTVRVVQAPTH-VDLALA--GTAQV-IKITRHTRRRPHPGA 316 Query: 277 
LEMTVR---YYISSADLTAE-----KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 + R Y ++S L AE A +R+HW +EN++HW D +ED R GN Sbjct: 317 PAASTRENAYLLTS--LPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGN 374 Query: 329 AAELFSGIRHIAIN 342 + +R+ AI Sbjct: 375 GPINLACLRNTAIT 388 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 80.9 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 37/85 (43%), Positives = 55/85 (64%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 S IPD R +H +I+ L + +V++GA+ + +IEDF E H D+LK Y + NGIP Sbjct: 10 SQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLPNGIPS 69 Query: 71 HDTIARVVSCISPAKFHECFINWMR 95 HDT +RV S I+PA F + F+ W++ Sbjct: 70 HDTFSRVFSAINPASFQDSFLIWLK 94 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 80.9 bits (198), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 60/200 (30%), Positives = 100/200 (50%), Gaps = 25/200 (12%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDFLKQYG-DFE 65 + + D R+A + H +LL+ + V++G +E I +D ++ L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQ---LRRLGCRWS 285 Query: 66 NG-----IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 G P TI R++S P + ++ HSS IAIDGKT+R S Sbjct: 286 PGKERFLPPSEPTIRRILSKADPVELDRILSQYIV-AHSSGR--AIAIDGKTIRSS---- 338 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q Sbjct: 339 ----SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNA 394 Query: 180 IAEKIQKQGGDYLFAVKGNQ 199 +A +I+++GGDY+F VK N+ Sbjct: 395 LASRIREKGGDYVFTVKDNR 414 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 97/390 (24%), Positives = 157/390 (40%), Gaps = 64/390 (16%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVIS-GAEGWEDIEDFGETHPDF----LKQ 60 L+ ++I D R H L+ IL + CA ++ G + IE + + P L Sbjct: 30 LIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLHI 89 Query: 61 YGDFENGI---PVHDTIARVVSCISPAKFHEC---FINWMRDCHSSDDKDVI-------- 106 + D G+ P TI RV++ + + C F+N +++ D + Sbjct: 90 WRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRRT 149 Query: 107 ----------------------AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 A+DGK L+ + G +H+IS + + + V Q + Sbjct: 150 EREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDG--GRVHLISLAAHLDATVHAQRQ 207 Query: 145 TDEKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAEK-IQKQGGDYLFAVKGNQG 200 KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK NQ Sbjct: 208 IPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQP 267 Query: 201 RLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKL 260 L+ + + A ++ + + HGR E R I+ P + IDF + + + L Sbjct: 268 TLHATAITALTGTDTDFAA-VTHRETHRGHGRTEYR--ILRTAPADGIDFPYAAQVFRVL 324 Query: 261 CVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK-----FATAIRNHWH-VENKLHWRLD 314 I KE V Y I+ DLTA + A +R HW +EN +H D Sbjct: 325 RHRGGLDGI--RHSKE----VCYGIT--DLTARQAGPAHLAAYVRGHWKAIENGVHHVRD 376 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 V ED C+ R + R++A L Sbjct: 377 VTFAEDACQARTATLPRALAAFRNLATGTL 406 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats. 
Identities = 49/109 (44%), Positives = 63/109 (57%), Gaps = 4/109 (3%) Query: 265 SFRSIIAEQKKELEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNEDDCK 323 S R+I+A E VRYY++S D T EK A+AIR HW + N LHW+LDV ED K Sbjct: 5 SERTIVA--IGEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFREDYSK 62 Query: 324 IRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 + NAA FS +A+ IL N+K K + K KA D NYL+ +L Sbjct: 63 -KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLL 110 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 38/73 (52%), Positives = 49/73 (67%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H + D R +H L DI+LL I AV+SG+EGWEDIE+FG D+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVV 78 GIP HDTIARV+ Sbjct: 67 AGIPRHDTIARVI 79 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 4/109 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW L Sbjct: 7 WEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHWCL 62 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 D+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 63 DIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust. 
Identities = 71/228 (31%), Positives = 107/228 (46%), Gaps = 12/228 (5%) Query: 100 SDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPEL 158 S +K + DGK LR S + ++RG V+ I Q D +K +EI + L Sbjct: 51 SQEKQWFSGDGKELRGSIESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRAL 109 Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 L+ D+ + IT DA+ E I K GG +L +K NQ L + + P Sbjct: 110 LSKDDLASQKITLDALHLCPSTTEMITKAGGVFLIGLKENQPTLLAH------MTDCALP 163 Query: 219 AHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELE 278 D + +HGR E R + + DV + D ++ K+L V V R+ I ++ ++ Sbjct: 164 PIDQKTTFDFNHGRVEQRKYWLYDVSKQGFDPRWDNTAFKRL-VKVQ-RTRINQKNAKIS 221 Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 V YYIS+ + E A+RNHW VE H R DV +NED K ++ Sbjct: 222 REVSYYISN-ETAKEGIFDAVRNHWSVEVNNHIR-DVTLNEDQLKSKK 267 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 37/48 (77%), Positives = 39/48 (81%) Query: 78 VSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +SCI KFHECFIN MR+CHSSDD DVIAIDGK L HS DKSRRR A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 77.4 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 43/105 (40%), Positives = 63/105 (60%) Query: 270 IAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E +IR+G+A Sbjct: 10 LVKQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQSRIRKGHA 69 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTG 374 FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 70 DINFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLG 114 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 49/176 (27%), Positives = 82/176 (46%), Gaps = 3/176 (1%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVVSCIS 82 H L +L L AV+ G + I FG + L F G P T+++ + I Sbjct: 6 HPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTLRRID 65 Query: 83 PAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQ 142 P + W+ + D + +A+DGK LR S D H ++A++ + V+GQ Sbjct: 66 PQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGDVP--GPHRVAAYAPHAAAVLGQ 123 Query: 143 IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 I+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ +G Sbjct: 124 IRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQ 179 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 75.9 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 34/75 (45%), Positives = 52/75 (69%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++++ + + D R A + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 75.1 bits (183), Expect = 4e-12, Method: Compositional matrix adjust. 
Identities = 52/171 (30%), Positives = 86/171 (50%), Gaps = 11/171 (6%) Query: 206 FEEKFPLKELNNPAHDSYAMSEKSHGREEIR-LHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F++ + L E +SY EK HGR+E+R ++++ E + +W +K + V Sbjct: 3 FQDYWALPE---DKQESYITEEKGHGRKEVREVYVLPAAFSEAL--RQKWCLVKSIVAVV 57 Query: 265 SFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 RS+ + E YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I Sbjct: 58 RDRSVKGKGSYE----TSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRI 113 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGS 375 G++A + R N+ + + RKM +AA +++Y VL S Sbjct: 114 YAGDSALNMACCRRFVQNLFRKSEG-NLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 53/163 (32%), Positives = 79/163 (48%), Gaps = 6/163 (3%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHIV 240 EKI ++ GDY+ +K N + E F + P +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + KE + V +YISS D+ + A +R Sbjct: 61 LKVSDWLSKAE-EWKGIKSVLEVCRKRS---DNGKESQEKV-FYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 HW VENK HW LDVV ED+C + AE + +R +A+N+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNL 158 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 50/165 (30%), Positives = 86/165 (52%), Gaps = 12/165 (7%) Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF--PLKELNNPAHDSYAMSEKSHGREEIRL 237 ++E+ ++ DY+ A+KGN + + ++ F P+ H ++ +K HGR E R+ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSPVTS-TRSVHTTF---DKGHGRIERRI 56 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 + + D + EWK L + S++ + KE +RY+I+S ++FA Sbjct: 57 YTL-DTNIGWFEDKKEWKHLAGFGMV---DSMVTRKGKECR-EIRYFITSV-TDVKQFAK 110 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 + +HW +EN LHW LDV+ +D+C + NAAE + IR I N Sbjct: 111 GVCSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYN 155 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 66/244 (27%), Positives = 106/244 (43%), Gaps = 24/244 (9%) Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKG 166 A+DGKT R + K +H++ + ++GQ + D KSNE T LL L++ G Sbjct: 151 AVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRALLAPLELAG 208 Query: 167 KIITTDAM-GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAM 225 ++ DA+ + ++ + ++ YL K NQ +L +AF P E+ P D Sbjct: 209 AFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKL-RAFLAALPWTEI--PTAD--LT 263 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYI 285 ++ HGREE R V V +DF A+ R ++ + Y I Sbjct: 264 RDRGHGREETRTLKVATV--THLDFPHA-------AQAIRIRRWRRQKGQPASHETIYAI 314 Query: 286 SSADLTAEKFATAI-----RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 + D TA++ + A+ R WH+E K H+ DV ED R G + + R Sbjct: 315 T--DATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLALFRATV 372 Query: 341 INIL 344 + L Sbjct: 373 ADTL 376 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust. 
Identities = 36/73 (49%), Positives = 47/73 (64%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H I D R +H L +I+LL I AV+SG+EGWE IE+FG D+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVV 78 GIP HDTIARV+ Sbjct: 67 AGIPRHDTIARVI 79 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 71.2 bits (173), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 42/124 (33%), Positives = 70/124 (56%), Gaps = 11/124 (8%) Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYI 285 SEK HGR E R + P ++ +WKGLK+ R++ + KK +E V Y I Sbjct: 4 SEKGHGRIEKR--TLETTP--IVTVGQKWKGLKQGLRITRERAV--KGKKTVE--VVYGI 55 Query: 286 SS---ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 +S A A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ ++ Sbjct: 56 TSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVVVH 115 Query: 343 ILTN 346 +L + Sbjct: 116 LLAS 119 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 70.9 bits (172), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 33/64 (51%), Positives = 44/64 (68%) Query: 18 QAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARV 77 +A+ +H DI+ L + AVISGA W +I+ FGE H D+L++Y FE GIPV DTIARV Sbjct: 14 RAYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPFECGIPVDDTIARV 73 Query: 78 VSCI 81 + I Sbjct: 74 IKRI 77 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 70.5 bits (171), Expect = 9e-11, Method: Compositional matrix adjust. 
Identities = 55/204 (26%), Positives = 94/204 (46%), Gaps = 16/204 (7%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGE--THPDFLKQYGDFENGI-- 68 +PD R +H L IL + + AV++ A+ + + ++ T + F Sbjct: 230 LPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKRIRARFNPRTQR 289 Query: 69 ---PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P T+ RV+ + W+ + +A+DGK L+ + R G+ Sbjct: 290 YVAPSEPTLRRVLQGANVTALDAAIGAWLLGIAGFE---AVAVDGKVLKGAV---REDGS 343 Query: 126 -IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE-K 183 +H++SAF I Q + K+NEI + LL +DI+ K++T DA+ Q+ A Sbjct: 344 QVHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADALHTQRKTARFL 403 Query: 184 IQKQGGDYLF-AVKGNQGRLNKAF 206 ++ + DYLF AVKGNQ +L + Sbjct: 404 VEDKKADYLFTAVKGNQRKLRNSL 427 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 60/256 (23%), Positives = 112/256 (43%), Gaps = 22/256 (8%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ ++ +PD R+ + ++ + +L + +CA++SGA + I ++ P + Sbjct: 51 LLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAGLGLT 110 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-------------DVIAIDGKT 112 +P TI RV+ + A W++ + D V+A+DGK Sbjct: 111 GRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAVDGKA 170 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITT 171 +R + + +H++ +V+ Q+ DEK+NEI +L+ + D+ +IT Sbjct: 171 MRATRHGTH---PVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDVLITV 227 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHG 231 DAM Q A+ + +G L VK NQ ++ + P K++ + + + HG Sbjct: 228 DAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRL-KTLPWKDVPV----GHTTTGRGHG 282 Query: 232 REEIRLHIVCDVPDEL 247 R E R VP L Sbjct: 283 RIETRTLKAVTVPAGL 298 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional 
matrix adjust. Identities = 38/65 (58%), Positives = 42/65 (64%), Gaps = 12/65 (18%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLM HISIIPDYRQAWK+EHKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 PDFLK 59 DFLK Sbjct: 55 LDFLK 59 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 67.4 bits (163), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 50/180 (27%), Positives = 88/180 (48%), Gaps = 6/180 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF- 64 L+ ++ +PD+R A + L +LLL I +S G+ +EDF H + L Sbjct: 5 LLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQLP 64 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS---YDKSR 121 P T RV+ I F NW+ ++D + +DGK+++ + YD++ Sbjct: 65 PTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA- 123 Query: 122 RRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + I+V+S FS + I Q +++ +EI + LL LD++G + T D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 63/215 (29%), Positives = 94/215 (43%), Gaps = 16/215 (7%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPD-FLKQYG-DFENGIPVHDTIARVVSCISPAKF 86 +L + + A + G+ + T D L Q G F P T V+S + PA Sbjct: 3 LLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRR--PSEKTFRAVLSRLDPADL 60 Query: 87 HECFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 + ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 61 NARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQL 118 Query: 144 KTDEKSNEITAIPELLNMLDIKGK-IITTDAMGCQKDIAEKI-QKQGGDYLFAVKGNQGR 201 EKSNEI + LL +L + ++T DAM Q A+ I YL VK NQ + Sbjct: 119 AVAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAK 178 Query: 202 LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIR 236 + A P E+ A D + HGR + R Sbjct: 179 I-LARITALPWAEVPAAATD----DSRGHGRVKTR 208 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 76/314 (24%), Positives = 129/314 (41%), Gaps = 47/314 (14%) Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-------DKDVIAIDGKTLRHSYDK 119 G P T+ R+++ SPA E ++D + V++ DGK D Sbjct: 98 GKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTWSRTDG 157 Query: 120 SRRRGAIHVI-----SAFSTMHSL-----------VIGQIKTDEKSNEITA----IPELL 159 + +GA S+ T +L +GQ K E TA +P + Sbjct: 158 EKVKGAQQSAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFRRLLPAIS 217 Query: 160 NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA 219 L + +I+T DA C ++ AE + G Y+F +K NQ L+ + +L P Sbjct: 218 EQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHD-IARDYGQYDLGTPL 276 Query: 220 HDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR--SIIAEQKKEL 277 + +E+ G +R DV + L +C + R I+A ++ Sbjct: 277 ART---AERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDRRGEIVAVEQ--- 330 Query: 278 EMTVRYYISS---ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD---CKIRRGNAAE 331 RY+++S LT ++ +R HW +EN HW +DV++ ED+ C+ R + E Sbjct: 331 ----RYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASRAS-IE 385 Query: 332 
LFSGIRHIAINILT 345 S +R I N ++ Sbjct: 386 TVSWLRLIGYNAVS 399 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 44/179 (24%), Positives = 77/179 (43%), Gaps = 11/179 (6%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK--------FPLKELNNPAHDSYAM 225 M Q D+ +Q++GGDY+ K NQG L E FP D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYI 285 E S G + + L ++ W G++++ R + + E+ + + Sbjct: 61 CEVSKGHGWVERRTMTST-IWLNEYLTRWPGVQQVFRLTRTRQVGGKTTVEVVYGIS-SL 118 Query: 286 SSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 SS + R HW +E++ H R D + ED C++RRG A + + +R++A+ +L Sbjct: 119 SSVAAAPDALLRYTRTHWGIESRHHIR-DATLGEDRCRVRRGAAPRVLAVLRNVAVYLL 176 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 3/107 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTS--DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 28/70 (40%), Positives = 43/70 (61%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ H + I D RQ+ K+ + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPV 70 G G+PV Sbjct: 72 KGILTEGVPV 81 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 31/60 (51%), Positives = 34/60 (56%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 11 LLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 28/82 (34%), Positives = 52/82 (63%), Gaps = 2/82 (2%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + ++ H S + D RQ+W++ + L +I LL +CA +SG E + +I +G+ +FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV--VSC 80 + +E G+P HDT+ + +SC Sbjct: 77 FLPYERGLPAHDTLKGLSGISC 98 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. 
Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 63.9 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 34/83 (40%), Positives = 50/83 (60%), Gaps = 1/83 (1%) Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE-K 183 A+H++SAF + +V+ Q+ EKSNEI A ELL LDI G +T DAM Q++ A Sbjct: 8 AVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARFA 67 Query: 184 IQKQGGDYLFAVKGNQGRLNKAF 206 ++ + D++ VK NQ L +A Sbjct: 68 VEDKRADFVMTVKDNQPELREAL 90 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 63.9 bits (154), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 11/131 (8%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN--- 217 M +KG ++T DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 218 --PAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKK 275 P HD + E SHGR R V + E + W ++ L V R A + Sbjct: 61 LKPDHDEF---EDSHGRTVRRRGWVLPLTPE-TKHSGSWPDIQALLVTEKIRQ--AHYSE 114 Query: 276 ELEMTVRYYIS 286 + RYY+S Sbjct: 115 TVTSDFRYYLS 125 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 58/198 (29%), Positives = 86/198 (43%), Gaps = 14/198 (7%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 IP +L +++ GK IT DA+ QK +AE I + YLF VK NQ L F+ K + Sbjct: 3 IP-ILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTL--YFDIKNYFEH 59 Query: 215 LNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 P D HGR + R +E ++F + + +S + Sbjct: 60 RKEP--DYCLQDPPGHGRIDTRSIWTTTELNEYLEFPHVGQAF-----CIHKKSYDPKTN 112 Query: 275 KELEMTVRYYISSADLTAEKFATAI---RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 K E T Y ++S A + R HW +EN H+ LD +ED +IR GN Sbjct: 113 KVCENTF-YGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKV 349 + +R AI +L + V Sbjct: 172 NTNRLRGFAIGLLKSKGV 189 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 52/150 (34%), Positives = 73/150 (48%), Gaps = 13/150 (8%) Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTF-EWKGLKK-LCVAVSFRSIIAEQKKELEMTVRY 283 S +S GREE R C E + EW+ ++ LCV + Q K T Y Sbjct: 7 SIQSRGREEHR----CIQVYEPVGIALQEWEAIRSVLCV----QRWGTRQGKAYHNTA-Y 57 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 YISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I INI Sbjct: 58 YISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVINI 117 Query: 344 LTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 L + L+ M K A + + S+LT Sbjct: 118 LRLNGY--QSLKTAMTKLANRVDIIFSLLT 145 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. 
Identities = 50/167 (29%), Positives = 81/167 (48%), Gaps = 13/167 (7%) Query: 3 LKKLMGHISIIPDYRQAWK--MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL S IPD+R+A K + HKL D+++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAK-------FHECFINWMRDCHSSDDKDVIAIDGKTL 113 NGIP T+ R+ I F E F + + ++++ IDGK Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCA--QEIVCIDGKAE 152 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 R + K+ R I +SA S + + +EKSNEI A+P L++ Sbjct: 153 RGTVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLID 197 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 38/120 (31%), Positives = 62/120 (51%), Gaps = 9/120 (7%) Query: 264 VSFRSIIAEQ---KKELEMTV----RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 V +SIIA + K E + RYY++S + +RNHW +EN+LHW LDV Sbjct: 20 VGIKSIIATETISSKTNETAISAEWRYYVTSHETEKSDLHLYVRNHWSIENELHWHLDVH 79 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAGLRRKMRKAAMDRNYLASVLTG 374 +N+D K R A FS I+ + ++++ K +R ++++ D YL S+L+ Sbjct: 80 LNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKRSVRSRLKQVGWDTEYLVSLLSA 139 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust. 
Identities = 51/176 (28%), Positives = 81/176 (46%), Gaps = 15/176 (8%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRL 237 K E + G D L +KGN +L A L + A SY + R E R Sbjct: 6 KKTVETVLATGNDLLVQLKGNHPKLLAAVRT---LCQSRAHAEQSYTVDLGRRNRIEQRT 62 Query: 238 HIVCDVP-----DELID-FTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 + +P D D F +G +++ V + E ++E + YY+++ + Sbjct: 63 VRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRF-EPRQE---SPAYYLATCTAS 118 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 119 AATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRNPG--VFALLRHFALNLLRHN 172 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 60.8 bits (146), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 39/119 (32%), Positives = 64/119 (53%), Gaps = 7/119 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMT-VRYYI 285 ++ HGR R + +P+EL + G+K C+AV I+ E K E + + YYI Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHNHALS--GIKS-CIAVE--RIVQEGKGEPKTSHFSYYI 88 Query: 286 SSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 ++ + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 89 TNHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLV 146 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 42/157 (26%), Positives = 75/157 (47%), Gaps = 10/157 (6%) Query: 99 SSDDKDVIAIDGKTLRHSYD-KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++ + IA+DGK L+ S S RR H++SA + + + +++ K+NE T Sbjct: 127 TAGPRRAIAVDGKALKASARLTSPRR---HLLSAVTHGRVVTLARVEVGAKTNETTHFKP 183 Query: 158 LLNMLDIKGKIITTDAM-GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 LL LD+ ++T DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 184 LLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIP 242 Query: 217 NPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFE 253 +A SE HGR E C +PDEL + Sbjct: 243 V----QHAASEVGHGRRESSSIKTCAIPDELGGIAYP 275 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 27/43 (62%), Positives = 38/43 (88%) Query: 109 DGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTDEKSNE Sbjct: 29 DGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNE 71 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 30/92 (32%), Positives = 51/92 (55%), Gaps = 1/92 (1%) Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 R+ ISS DL + A+R+HW VE+ +HW LD+ D+ +I R +F+ +R IA Sbjct: 54 TRWNISSLDLHVVQALNAVRSHWQVES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIA 112 Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 + + D + RK + A +D +Y +++L Sbjct: 113 MTLFKQDTTKLVSMARKKKMAGLDDDYRSNLL 144 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 47/159 (29%), Positives = 77/159 (48%), Gaps = 12/159 (7%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ L FE + + PA +++ + K R+E R V V D L EW Sbjct: 1 MKANQSNL---FETACAIAANDAPADTAFSRN-KGRSRQEDRTVEVFPVGDALAGT--EW 54 Query: 255 KGLKKLCVAVSFRSIIAEQKKEL---EMTVRYYISSA-DLTAEKFATAIRNHWHVENKLH 310 + K + V+ R+++ L V +Y+SSA + A +A AIR HW +EN+ H Sbjct: 55 QPFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNH 114 Query: 311 WRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + DV +ED +IR + + + R A+NI+ + + Sbjct: 115 YVRDVSCDEDKSRIR--DNPGIMARARSFALNIMRKNGI 151 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 74/333 (22%), Positives = 130/333 (39%), Gaps = 67/333 (20%) Query: 59 KQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYD 118 + G P ++T+ +++C+ WM + A DGK L Sbjct: 15 RPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVL----G 69 Query: 119 KSRRRGA--IHVISAFSTMHSLVIGQ---IKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 S+R GA +H + + + + Q + D + + + E + G++++ DA Sbjct: 70 GSKRAGAPALHGVELVTHTTGMALAQREAVGGDAAAALLALLTEA----PLDGRMVSMDA 125 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNK-----------------------AFEEKF 210 + + I ++ G+YL VKG+Q + A ++ Sbjct: 126 GFLNAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIA 185 Query: 211 PLKELNNP-----------AHDSYAMSEKSHGREEIRLHIVCDVPD--ELIDFTFEWK-- 255 P + P A D+ + E+S GR EIR V D D + + W+ Sbjct: 186 PPRRKRQPIGFRRELQPRRAPDAQTI-EQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQV 244 Query: 256 ----GLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHW 311 GL++ C R A+ E+TV +SS T +F +IRNHW +EN++H Sbjct: 245 TQIGGLRRWC-----RRRHADLWTVEEVTV---VSSRQRTPAQFLASIRNHWTIENQVHR 296 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 D M ED R + + R++ IN++ Sbjct: 297 PRDGSMQEDRLHGR--AIGVILAVCRNVVINLI 327 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum 
ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%) Query: 271 AEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + +KE + V +SS + + +R HW +EN+LHW D V ED C R GN A Sbjct: 38 GKTRKETALGV-TSLSSGQASPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGA 96 Query: 331 ELFSGIRHIAINIL 344 + + +R++ I++L Sbjct: 97 HVMATLRNMTISLL 110 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 26/85 (30%), Positives = 43/85 (50%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 27 VLKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLP 86 Query: 66 NGIPVHDTIARVVSCISPAKFHECF 90 GIP HDT RV+ + P + F Sbjct: 87 KGIPSHDTFGRVLRILEPKQLQSGF 111 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 33/95 (34%), Positives = 47/95 (49%), Gaps = 4/95 (4%) Query: 69 PVHDTIARVVSCISPAKFHECFINWM----RDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 PV+ ++ ++ I P F R C + IAIDGKTLR S+D Sbjct: 12 PVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFSDTK 71 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL 159 A +V+SAF+ H +++ DEKSNEI A L+ Sbjct: 72 AAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALI 106 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 54.7 bits (130), Expect = 6e-06, Method: Composition-based stats. 
Identities = 25/66 (37%), Positives = 41/66 (62%) Query: 307 NKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K A M+ + Sbjct: 23 HQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKRLLACMEDD 82 Query: 367 YLASVL 372 + +L Sbjct: 83 FREELL 88 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 54.3 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 4/81 (4%) Query: 267 RSIIAEQKKELE--MTVR--YYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 RSI E+ +EL +TV+ +Y+SS + +A + IR HW VEN++H+ DV ED Sbjct: 15 RSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGEDRS 74 Query: 323 KIRRGNAAELFSGIRHIAINI 343 +IR +++S R A+N+ Sbjct: 75 RIRTLPLVQVWSVARSFALNL 95 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 54.3 bits (129), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 17/123 (13%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKK---ELEMTVRYY 284 K HGR E R L ++ W G++++ FR + Q++ + + V Y Sbjct: 3 KGHGRVERR---SITTTTWLNEYLTRWPGVQQV-----FR--LERQRRADGKTTVEVVYG 52 Query: 285 ISSADLTAEKFATAI---RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAI 341 ISS A T + R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ Sbjct: 53 ISSLSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAV 111 Query: 342 NIL 344 +L Sbjct: 112 YLL 114 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 13/95 (13%) Query: 266 FRSIIAEQKKELEMTVR-----------YYISSADLTAEKFATAIRNHWHVENKLHWRLD 314 FR++I Q+ R YY+ L A +F+ AIRNHW VEN+ H+ D Sbjct: 70 FRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRNHWRVENRAHYVRD 129 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ED +IRR F+ +R A+N++ ++V Sbjct: 130 TRFQEDASRIRRNPCT--FALLRSFALNLMRFNRV 162 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 53.5 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 23/71 (32%), Positives = 42/71 (59%), Gaps = 5/71 (7%) Query: 278 EMTVRYYISSADLTAEKFATA-----IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 + TV + L+AEK A +R HW +EN+LH+ DV + ED C++R G+A ++ Sbjct: 16 QTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRVRMGHAPQV 75 Query: 333 FSGIRHIAINI 343 + +R+ +++ Sbjct: 76 LAALRNAVVHL 86 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 44/183 (24%), Positives = 84/183 (45%), Gaps = 13/183 (7%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 +S IPD R A ++ L +L L + A +S + +E F +P L G + P Sbjct: 7 LSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLGLRKP--P 63 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 H + ++ + P K E + +D +V+ +DGK L+ S + + ++ Sbjct: 64 GHTILTLLLHRLDPEKLQEALLQVF---PGADLGEVLVVDGKHLKGSGKGKSPQ--VRLV 118 Query: 130 SAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD---IKGKIITTDAMGCQKDIAEKIQK 186 + + Q K + + ++ A+ ELL+ L +KGK++ DA ++A K+ + Sbjct: 119 EVLALHLLTTLAQAKAEGREDQ--ALLELLDRLGAEGLKGKVVVGDAGYLYPELAGKVVQ 176 Query: 187 QGG 189 +GG Sbjct: 177 KGG 179 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 47/184 (25%), Positives = 86/184 (46%), Gaps = 15/184 (8%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 +S +PD R A + L +L L + A +S + +E F +P L G + P Sbjct: 7 LSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLGLRKA--P 63 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS-YDKSRRRGAIHV 128 H I ++ + P K + +D +V+ +DGK LR S KS + + V Sbjct: 64 GHTAITLLLHRLDPEKLQAALGQVFPE---ADLGEVLVVDGKHLRGSGKGKSPQVKLVEV 120 Query: 129 ISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD---IKGKIITTDAMGCQKDIAEKIQ 185 ++ +H+ + Q + + E A ELL+ L+ ++GK++ DA ++A +++ Sbjct: 121 LALH--LHT-TLAQARAE--GREEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAARVR 175 Query: 186 KQGG 189 K+GG Sbjct: 176 KKGG 179 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 30/108 (27%), Positives = 55/108 (50%), Gaps = 5/108 (4%) Query: 267 RSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 R ++ + EL Y ++S A++ R HW VEN+LH + D V+ ED + R+ Sbjct: 15 RRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKRDTVLGEDASRSRK 74 Query: 327 GNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTG 374 G A ++ +R + +N+L + + + R +RK + D L ++ G Sbjct: 75 GAAGLMY--LRDVILNLL---HLKRWPVLRSVRKFSADPKVLLRLIRG 117 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 47/189 (24%), Positives = 80/189 (42%), Gaps = 6/189 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H+ IPD R + +LL+ + ++S E D+E F H L + E Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 66 NGIPVHDTIARVVSC-ISPAKFHECFINWM--RDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 P D+ R + A +W + + D D + DGKTLR S + + Sbjct: 73 LKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTSG 132 Query: 123 RGAIHV--ISAFSTMHSLVIGQ-IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 GA + ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 133 GGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQA 192 Query: 180 IAEKIQKQG 188 Q +G Sbjct: 193 FFGSSQSRG 201 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 4/55 (7%) Query: 265 SFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 +FR +I + L + RYYISS +LTAE+ A + HW +E+ +HW LDV MNE Sbjct: 4 NFRFVIGNK---LVLEYRYYISSKELTAEQAANTVSEHWGIES-MHWVLDVSMNE 54 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI--LTNDKVFKAGLRRKMR 359 HW +EN+LHW DV +ED + R GNA ++ + +R++AI I LT K LR R Sbjct: 100 HWAIENRLHWVRDVTYDEDRHRARTGNAPQVMTSLRNLAITILRLTGAKNIAKALRHHAR 159 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 50.8 bits (120), Expect = 8e-05, Method: Compositional matrix adjust. 
Identities = 34/102 (33%), Positives = 54/102 (52%), Gaps = 14/102 (13%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVI----SGAEGWEDIEDFGETHPDF 57 +LKKL+ S IPD R+A ++H+L+ +LL + + + S E D+ + P F Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDM-----SRPAF 133 Query: 58 LKQ----YGDFENGIPVHDTIARVVSCISPAKFHECFINWMR 95 L+ + + E +P DT+ARV+ I P K E FI +R Sbjct: 134 LQALQGLFPELET-LPHGDTLARVLERIEPQKLEESFIRLLR 174 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 28/55 (50%), Positives = 31/55 (56%), Gaps = 13/55 (23%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF----SG---------IRHIAINILTNDKVF 350 +HWRLDV MNEDDC+IRRGN F SG +R I INIL VF Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILKCTLVF 55 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 48/203 (23%), Positives = 79/203 (38%), Gaps = 52/203 (25%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGN----QGRLNKAFEEKFPLKELNNPAHDSYAMSEKS 229 MGCQK+IA+ I KQ DY+ A+KG+ QG L +A+ K + D + + Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGEL-EAWWHKCQREGFTADNFDEHTTIDSG 59 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD 289 HGR E R V ++ ++W GLK + I Sbjct: 60 HGRIETRRCQQVLVNKSWLNNKYQWVGLKSI------------------------IKVTS 95 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 EK T + +IR+G F+ +R IA+ + ++ Sbjct: 96 DVHEKTTT-----------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQT 132 Query: 350 FKAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 133 KRASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 36/115 (31%), Positives = 57/115 (49%), Gaps = 8/115 (6%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +L ++ IPD+R+A + L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 2 QLKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQL 61 Query: 65 E-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV----IAIDGKTLR 114 PVH +I + + AK E + + R D + IA+DGKTLR Sbjct: 62 HWKRAPVHTSIRYALQGLD-AKAGE--LAFHRHASGLDGEGAQHASIAMDGKTLR 113 >UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia coli O157:H7 RepID=Q8X3B6_ECO57 Length = 50 Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 24/36 (66%), Positives = 26/36 (72%) Query: 343 ILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSGLS 378 I ND VFKAGL KMRKA MDRN+LAS + GLS Sbjct: 15 ISDNDNVFKAGLSCKMRKAVMDRNFLASGIAACGLS 50 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 48/206 (23%), Positives = 90/206 (43%), Gaps = 14/206 (6%) Query: 7 MGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK-----QY 61 + +++ IPD R+ K +H+ D+LL+ + AV SG + + + FL + Sbjct: 10 LPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDEVHIRT 69 Query: 62 GDFENGIPVHDTIARVVSCISP--AKFHECFINWMRDC-----HSSDDKDVIAIDGKTLR 114 E +P T+ R+ +S + ++W R+ D+ +A+DGK LR Sbjct: 70 RRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVDGKHLR 129 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 + R A+ +SA L +G Q D ++ + + L + ++T DA Sbjct: 130 GTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVD-WVLTGDA 188 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQ 199 C +++A + +Q G A KG + Sbjct: 189 ALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 39/118 (33%), Positives = 50/118 (42%), Gaps = 8/118 (6%) Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEL---EMTVRYYIS 286 HGR+E R V DV L W GL V+ + + K L Y Sbjct: 12 HGRQEHRWVEVFDVSGRLGP---TWDGLIAAVARVTRLTWHKDTKSGLWHKTQETALYAC 68 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 +L A TAIR HW VE + H+ DV ED +IR F+ +R A+NIL Sbjct: 69 QINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIR--TKPGHFARLRSFALNIL 124 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 36/125 (28%), Positives = 52/125 (41%), Gaps = 9/125 (7%) Query: 223 YAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVR 282 + S HGR E R C + DEL F +L + V R + E Sbjct: 25 HTASSAGHGRRESRSIKTCGIADELGGIAFPHG---RLALRVHRRRKQTGGCESRETV-- 79 Query: 283 YYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHI 339 Y ++S D T + A A+R HW VE H R DV E+ + G A + R++ Sbjct: 80 YAVTSLDAHETTPAELAAAVRGHWTVEALRHVR-DVTYAEEASTLHTGTAPRAMATFRNL 138 Query: 340 AINIL 344 A+ +L Sbjct: 139 AVGLL 143 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 16/118 (13%) Query: 31 LLTIC--AVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVVSCISPAKFH 87 LLT+C AV++G E I FG P L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAI---HVISAF----STMHSL 138 W+ D H D D IA+DGK L S D GA+ H+++A+ S MH L Sbjct: 63 RIIGAWLGDRH-PDGWDHIALDGKRLCGSRD-----GAVPGTHLLAAYAPQASAMHIL 114 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats. 
Identities = 30/82 (36%), Positives = 42/82 (51%), Gaps = 8/82 (9%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMS----- 226 D +GCQK IA+ I +Q DYL AVK NQ L++A F +E N Y + Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYF--EEANKARFAGYNIDYDEKI 65 Query: 227 EKSHGR-EEIRLHIVCDVPDEL 247 K GR E+ R + ++PD + Sbjct: 66 NKGPGRLEQRRCWVGYEIPDTI 87 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. Identities = 77/348 (22%), Positives = 131/348 (37%), Gaps = 49/348 (14%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 + +PD R A + L++IL + +++GA + E+ + ++ +P Sbjct: 22 LEAVPDVR-AREGRWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCISP-----AKFHECF-INWMRDCHSSDDKDV--IAIDGK-----TLRHS 116 DT AR C P A H W R + D V +A+DGK TL H Sbjct: 81 --DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHP 138 Query: 117 Y------DKSRRRGAIHVISA--FSTMHSLVIGQIKTDEKSNEITAIPELL-NMLDIKG- 166 D G ++ S I + ++NE +L +++ G Sbjct: 139 LIQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGA 198 Query: 167 --KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYA 224 +++T DA + + G DY+FA+K + + K E E+ D Sbjct: 199 LFQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARRED--V 256 Query: 225 MSEKSHGREEIRLHIVCDV--------PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 + + EI++ V P+E + W + + S + Sbjct: 257 LDNATTATREIQILAVDPSHGYGAGKGPEESV-----WSHARTF---LRVTSTVRRSGVV 308 Query: 277 LEMTVRYYISS--AD-LTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 +E R ++SS AD LT +++ +R HW VEN H LD ED+ Sbjct: 309 IERDSRLFVSSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDE 356 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust. 
Identities = 29/103 (28%), Positives = 55/103 (53%), Gaps = 15/103 (14%) Query: 267 RSIIAEQKKELEMTVRYYISS-----ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 R ++ + E+ TV Y ++S AD A + + + W VEN+ W D +++ED Sbjct: 51 REVVRKGTGEVRRTVSYALTSLGPEVAD--ARRLGELLLSRWEVENRSFWVRDFLLHEDA 108 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 C++ RG A++ + +R +++L + G+R K KAA++ Sbjct: 109 CQV-RGVGAQVLAALRAFLVSLL-----HRQGVREK--KAALE 143 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 42.7 bits (99), Expect = 0.021, Method: Composition-based stats. Identities = 20/54 (37%), Positives = 35/54 (64%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL 58 +L ++S IPD+R+A + L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 2 QLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERL 55 >UniRef50_C7YKI1 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YKI1_NECH7 Length = 684 Score = 42.0 bits (97), Expect = 0.036, Method: Compositional matrix adjust. Identities = 32/115 (27%), Positives = 45/115 (39%), Gaps = 10/115 (8%) Query: 54 HPD-FLKQYGDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV 105 HPD F K + G+ P A C SP ++W+RD H DD D Sbjct: 63 HPDEFYKAHSSTAEGVACHRKQGPAGTYGAETTDCDSPLALVNATLDWIRD-HVKDDVDF 121 Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 + G T RH D+ R + HV+ + +I +D +I IP N Sbjct: 122 VVWTGDTARHDSDEKLPRNSGHVLGTNRQVAQKIIDTF-SDNGQLDIPVIPTFGN 175 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 42.0 bits (97), Expect = 0.038, Method: Compositional matrix adjust. 
Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 6/74 (8%) Query: 103 KDVIAIDGKTLRHS--YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 + +A+DGKT RH+ D S+ +H++ S ++ Q++ + K+NE LL Sbjct: 153 ESAVALDGKTSRHAKRADGSK----VHLVGVASHGDGRLLAQVEVEAKTNETAVFRRLLR 208 Query: 161 MLDIKGKIITTDAM 174 LD+ ++T DA+ Sbjct: 209 PLDLTNVLVTADAL 222 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 42.0 bits (97), Expect = 0.039, Method: Compositional matrix adjust. Identities = 31/112 (27%), Positives = 51/112 (45%), Gaps = 9/112 (8%) Query: 9 HISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGET-HPDFLKQYGDFENG 67 H++ +PD R + H L IL + + A+ SGAE + + ++ T + L++ G E+ Sbjct: 19 HLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGCQESP 78 Query: 68 ------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTL 113 P T+ RV+ I NW+ S +A+DGKTL Sbjct: 79 SRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLGLS--PAALAVDGKTL 128 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 41.2 bits (95), Expect = 0.058, Method: Compositional matrix adjust. Identities = 39/161 (24%), Positives = 66/161 (40%), Gaps = 19/161 (11%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHP-DFLKQYGDFENGIPVHDTIARVVSCISPA 84 L+ +L L V++G + + + ++ P + L +G GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFG-LTRGIPSERTTRRLVEGCDPV 106 Query: 85 KFHECFINWMRDCHSSDDKDV--IAIDGKTLR--HSYDKSRRRGAIHVISA------FST 134 E W+ + D +A DGKTL+ S+ ++ V+ A + Sbjct: 107 ALDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITA 166 Query: 135 MHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 H V+G +EI A+ L LD+ ++TT G Sbjct: 167 GHQRVVG-------GDEIAALEALAGRLDLTDVLVTTAEKG 200 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 41.2 bits (95), Expect = 0.061, Method: Composition-based stats. 
Identities = 18/74 (24%), Positives = 36/74 (48%) Query: 13 IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 + D R +H+ DI+++ +C V+ G +G I + ++L+ + + NG+P D Sbjct: 18 MTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQGFLELPNGLPSRD 77 Query: 73 TIARVVSCISPAKF 86 I + + P F Sbjct: 78 CIRNWLMALQPDAF 91 >UniRef50_UPI000023EBF2 hypothetical protein FG01150.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023EBF2 Length = 712 Score = 40.8 bits (94), Expect = 0.088, Method: Compositional matrix adjust. Identities = 33/115 (28%), Positives = 46/115 (40%), Gaps = 10/115 (8%) Query: 54 HPD-FLKQYGDFENGIPVHDTI-------ARVVSCISPAKFHECFINWMRDCHSSDDKDV 105 HPD F K + GI H A C SP + ++W+R+ H DD D Sbjct: 55 HPDEFYKAHASTAQGIACHRGEGPAGIYGAETTDCDSPLELVNSTLDWIRN-HVKDDIDF 113 Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 + G T RH D+ R R A V+ + V+ +D+ I IP N Sbjct: 114 VVWTGDTARHDSDEKRPRSASQVLEMNRRVAKKVVKTF-SDDGVLTIPVIPTFGN 167 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 508 e-142 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 424 e-117 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 420 e-116 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 416 e-115 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 414 e-114 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 413 e-114 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 407 e-112 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 403 e-111 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 392 e-108 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 385 e-105 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 
373 e-102
UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V...  373   e-102
UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec...  369   e-100
UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu...  365   2e-99
UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl...  364   2e-99
UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ...  358   2e-97
UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar...  358   2e-97
UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie...  357   4e-97
UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s...  355   1e-96
UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma...  354   3e-96
UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu...  352   9e-96
UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui...  347   5e-94
UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c...  346   9e-94
UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4...  344   3e-93
UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V...  344   3e-93
UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C...  344   5e-93
UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep...  343   6e-93
UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G...  342   9e-93
UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C...  342   2e-92
UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro...  337   5e-91
UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi...  337   5e-91
UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE   336   7e-91
UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI...  335   1e-90
UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre...  331   2e-89
UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re...  331   3e-89
UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c...  330   4e-89
UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca...  330   4e-89
UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot...  323   6e-87
UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1...  317   4e-85
UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep...  316   7e-85
UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID...  312   2e-83
UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5...  312   2e-83
UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot...  311   3e-83
UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo...  305   2e-81
UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q...  295   1e-78
UniRef50_C3KML6 Putative transposase for insertion sequence NGRI...  285   2e-75
UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi...  282   1e-74
UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv...  279   2e-73
UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme...  275   2e-72
UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta...  274   4e-72
UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re...  272   1e-71
UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy...  271   4e-71
UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re...  264   5e-69
UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7...  263   7e-69
UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro...  262   1e-68
UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_...  262   2e-68
UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor...  256   9e-67
UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ...  256   1e-66
UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi...  248   3e-64
UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo...  247   5e-64
UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R...  241   3e-62
UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri...  241   3e-62
UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R...  241   4e-62
UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon...  234   4e-60
UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=...  233   7e-60
UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=...  227   6e-58
UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R...  222   2e-56
UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID...  221   4e-56
UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ...  220   6e-56
UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi...  220   7e-56
UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K...  220   7e-56
UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto...  212   1e-53
UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig...  206   9e-52
UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC  203   1e-50
UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC  199   2e-49
UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte...  198   3e-49
UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec...  193   8e-48
UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B...  193   1e-47
UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu...  186   1e-45
UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus...  186   2e-45
UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot...  185   2e-45
UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp...  179   2e-43
UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C...  176   1e-42
UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=...  176   1e-42
UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac...  168   3e-40
UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi...  166   2e-39
UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde...  164   6e-39
UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta...  163   8e-39
UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith...  163   9e-39
UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R...  163   1e-38
UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_...  162   1e-38
UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla...  162   2e-38
UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco...  154   7e-36
UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G...  152   2e-35
UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte...  151   5e-35
UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A...  150   7e-35
UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte...  149   1e-34
UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol...  147   6e-34
UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone...  146   2e-33
UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex...  145   2e-33
UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte...  144   4e-33
UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac...  144   5e-33
UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx...  143   9e-33
UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae...  142   3e-32
UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=...  140   1e-31
UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396...  138   3e-31
UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon...  138   3e-31
UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ...  137   4e-31
UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R...  135   2e-30
UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=...  135   2e-30
UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7...
134 4e-30 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 134 7e-30 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 133 1e-29 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 132 2e-29 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 132 2e-29 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 132 2e-29 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 130 5e-29 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 125 2e-27 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 125 2e-27 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 125 3e-27 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 125 3e-27 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 117 1e-24 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 115 2e-24 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 115 2e-24 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 114 4e-24 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 112 2e-23 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 111 5e-23 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 109 2e-22 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 107 5e-22 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 106 1e-21 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 102 2e-20 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 102 2e-20 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 101 5e-20 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 100 6e-20 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 99 2e-19 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 
99 2e-19 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 98 4e-19 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 98 6e-19 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 97 8e-19 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 97 1e-18 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 96 2e-18 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 95 4e-18 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 95 4e-18 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 91 5e-17 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 91 6e-17 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 91 8e-17 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 89 3e-16 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 89 4e-16 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 89 4e-16 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 86 2e-15 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 84 1e-14 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 82 2e-14 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 82 4e-14 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 81 6e-14 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 81 6e-14 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 79 2e-13 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 74 6e-12 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 72 2e-11 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 66 2e-09 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 63 1e-08 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 
61 5e-08
UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T...  59 3e-07
UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX  48 6e-04
UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone...  47 0.002

Sequences not found previously or not previously below threshold:

UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ...  128 3e-28
UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914...  99 2e-19
UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom...  99 3e-19
UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo...  93 2e-17
UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba...  91 5e-17
UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis...  91 1e-16
UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=...  89 2e-16
UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis...  86 2e-15
UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=...  86 2e-15
UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=...  86 3e-15
UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ...  84 6e-15
UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo...  82 4e-14
UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re...  82 4e-14
UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ...  79 4e-13
UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S...  75 4e-12
UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_...  74 6e-12
UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferro...  72 3e-11
UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m...  70 1e-10
UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco...  70 1e-10
UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho...  70 2e-10
UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa...  68 4e-10
UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitroso...
67 9e-10 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 67 1e-09 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 66 3e-09 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 65 4e-09 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 64 7e-09 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 64 9e-09 UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 62 3e-08 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 60 2e-07 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 59 2e-07 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 58 7e-07 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 57 1e-06 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 57 1e-06 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 57 2e-06 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 57 2e-06 UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitroco... 56 2e-06 UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=... 56 2e-06 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 56 3e-06 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 55 5e-06 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 54 8e-06 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 53 2e-05 UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=... 53 2e-05 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 52 3e-05 UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=... 52 5e-05 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 49 2e-04 UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillace... 49 2e-04 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 
49 2e-04 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 49 4e-04 UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata ob... 48 5e-04 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 47 8e-04 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 47 0.001 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 47 0.001 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 46 0.002 UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodoco... 45 0.003 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 45 0.003 UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus p... 45 0.003 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 45 0.005 UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromon... 45 0.005 UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodosp... 44 0.007 UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aro... 44 0.007 UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewane... 44 0.011 UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia ... 44 0.012 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 44 0.014 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 42 0.033 UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia ... 42 0.042 UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b... 42 0.046 UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus... 41 0.078 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 508 bits (1309), Expect = e-142, Method: Composition-based stats. 
 Identities = 369/378 (97%), Positives = 371/378 (98%)

Query: 1   MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60
           MELKKLM HISIIPDYRQ WK+EHKLSDILLLTICAVISGAEGWEDIEDFGETH DFLKQ
Sbjct: 1   MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60

Query: 61  YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120
           YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS
Sbjct: 61  YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120

Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180
           RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI
Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180

Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240
           AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNP HDSYA+SEKSHGREEIRLHIV
Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240

Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300
           CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE EMTVRYYISSADLTAEKFATAIR
Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300

Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360
           NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK
Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360

Query: 361 AAMDRNYLASVLTGSGLS 378
           AAMDRNYLASVL GSGLS
Sbjct: 361 AAMDRNYLASVLAGSGLS 378

>UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3
          Length = 378

 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats.
Identities = 157/377 (41%), Positives = 221/377 (58%), Gaps = 10/377 (2%) Query: 2 ELKKLMGHISIIPDYRQA-WKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K + + D R+ H DIL++ +CA+ISGA + +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 + NGIP HDT V++ +SP +F CF+ W + IAID KTLR S DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + +H++SA++T +LVIGQIKT+E SNEITAIPELLN LD+KG +++ DAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH---DSYAMSEKSHGREEIRL 237 AEKI ++ DY+ A+KGNQ +L+++ E F L N D E S+GREEIR Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 + +++I EWK +K + + S R KKE E +RYYISSA L+AE Sbjct: 245 AYATNEIEKIIAN-DEWKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLK 298 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +ENKLHW LDV ED+ +IR+ N AE + +R IA+N++ +K K G K Sbjct: 299 VVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATK 358 Query: 358 MRKAAMDRNYLASVLTG 374 A D YL +L G Sbjct: 359 RLMAGWDEKYLLKLLNG 375 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats. 
Identities = 181/377 (48%), Positives = 250/377 (66%), Gaps = 4/377 (1%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ +SII D RQ K+ H L D+L L I AVISG EGWE+I+DFG D+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Y F GIP DTI+R+ I P +F +CF WM+ C DVIAIDGKTLR S++K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A+KI +GGDYL VKGNQ RL A + F ++ L P ++Y EK HGRE+ R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 D +E+ D FEW GLK L AVSFR+ E+ + + V++YISSA L A+ A R Sbjct: 241 ADA-NEIGDLVFEWPGLKTLGYAVSFRT---EKDMQTTVAVKFYISSAKLDAKSLLEASR 296 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VEN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++ Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQ 356 Query: 361 AAMDRNYLASVLTGSGL 377 A +Y V++G L Sbjct: 357 ANRSDSYRELVVSGLSL 373 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 416 bits (1069), Expect = e-115, Method: Composition-based stats. 
Identities = 168/373 (45%), Positives = 234/373 (62%), Gaps = 8/373 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G ++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 + GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH++SA++ + +V+GQ KTD+KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEKSHGREEIRLHIVC 241 + GDY+ AVK NQ +L++ + F HD + S K HGR E+R + + Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 D+ L W L+ + + S R I + RY+I+S A+ FA A+R Sbjct: 246 DMLSTL-GNPERWASLQSIGMVESERYI----DGKTTAETRYFITSIAPDAKIFANAVRK 300 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K KA Sbjct: 301 HWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRYKA 360 Query: 362 AMDRNYLASVLTG 374 + +Y VL G Sbjct: 361 TLQPDYAQKVLNG 373 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 414 bits (1063), Expect = e-114, Method: Composition-based stats. 
Identities = 149/385 (38%), Positives = 224/385 (58%), Gaps = 20/385 (5%) Query: 8 GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 + + D R +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIH 127 IP HDT RV S ++P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPE---------------LLNMLDIKGKIITTD 172 +ISA++T + LV+GQ DEKSNEITAIP+ LL +L + G I+T D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA---HDSYAMSEKS 229 A+GCQK+I ++I +Q DY+ +K NQG L + E F ++N Y + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD 289 HGR+E+R + + E ID ++W L + R + + + RY+ISS + Sbjct: 251 HGRQEVRYYQMLSNVAEEIDPDWQWLNLNSIGYVEYLR--VENGTDKTSLERRYFISSLN 308 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + FA+++R HW +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K Sbjct: 309 NNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKT 368 Query: 350 FKAGLRRKMRKAAMDRNYLASVLTG 374 K G++ K +KA D NYL VL Sbjct: 369 LKVGVKAKRKKAGWDENYLLKVLRN 393 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. 
Identities = 145/375 (38%), Positives = 217/375 (57%), Gaps = 7/375 (1%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ H S + D R A ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 NG+P HDT V + + P + +CF+NW + + ++IAIDGKTLR + Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + IH++SA+++ + LV+GQ DEKSNEITAIPELL +L+++G +++ DAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPAHDSYAMSEKSHGREEIRLH 238 E I + GDY+ A+KGNQG L + F + HDSY EK HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATA 298 D L+ W LK + S R + RYY+ S + A++FA A Sbjct: 245 WTMGQTDYLLG-AERWAQLKSIGCVESCRRQPGHPG---TLQRRYYLLSIESDAQRFADA 300 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R+HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 Query: 359 RKAAMDRNYLASVLT 373 KA D NYL +L+ Sbjct: 361 LKAGWDDNYLVKILS 375 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 407 bits (1046), Expect = e-112, Method: Composition-based stats. 
Identities = 137/373 (36%), Positives = 214/373 (57%), Gaps = 8/373 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+G++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEKSHGREEIRLHIVC 241 +Q DY+ +K N L ++ F + + HD Y K H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 DVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 V + +W GL+ + V R + + +++ +Y++S A+ AIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWNKTTHDIQ----FYLTSLPPNAQFLCHAIR 325 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM++ Sbjct: 326 THWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMKQ 385 Query: 361 AAMDRNYLASVLT 373 AAM+ NY+ +VL Sbjct: 386 AAMNNNYMMTVLN 398 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 403 bits (1035), Expect = e-111, Method: Composition-based stats. 
Identities = 160/372 (43%), Positives = 230/372 (61%), Gaps = 7/372 (1%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M ++ H S I D+RQ+ K+ + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A I +GGDYL AVK NQG L KA + F + D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFS-PHRSAGLSDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 DFT W+ LK + + SFR++ + K + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDFTH-WEALKSIVMVESFRAV---KGKTASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E+ +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIES-MHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 392 bits (1008), Expect = e-108, Method: Composition-based stats. 
Identities = 154/372 (41%), Positives = 217/372 (58%), Gaps = 10/372 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + + H S I D RQ K+ + L +ILLLT+CAV+SGA W I +G FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + Sbjct: 84 ADGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 143 AIHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKI 202 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPAHDSYAMSEKSHGREEIRLHIVC 241 + DY+ A+KGNQG L K E + + ++ + EKSHGR E R VC Sbjct: 203 ISKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVC 262 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 D L W GLK + + + + + E RYYISS AE A AIR+ Sbjct: 263 TDIDWL-KADHNWPGLKSIVMVQYHAILQDKTRAET----RYYISSMTSDAEHHAKAIRD 317 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K A Sbjct: 318 HWGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVK-GKHSLRSKRHIA 376 Query: 362 AMDRNYLASVLT 373 + D ++LA ++ Sbjct: 377 SWDDDFLAEIIN 388 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. 
Identities = 134/380 (35%), Positives = 217/380 (57%), Gaps = 16/380 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L+ H I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQG--Q 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++SA++ +SLV+GQI+ +K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 125 SPRVIVSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLK----------ELNNPAHDSYAMSEKSHGRE 233 I + +Y+ A+KGNQG+ ++ + E N A +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 E R + L D +W GL+ + V S R + + + RYY+SS ++ E Sbjct: 245 ETRRYWQSGDVSWLAD-RQQWAGLRSVGVVESVRQVGQQA---PTVERRYYLSSLNVDVE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 KFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K G Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 LRRKMRKAAMDRNYLASVLT 373 ++ K A+ D +YL +L+ Sbjct: 361 IKGKQLNASWDHDYLLRLLS 380 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. 
Identities = 137/384 (35%), Positives = 197/384 (51%), Gaps = 15/384 (3%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 ++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ + K V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQ-EVKGVVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + ++SA+S ++ + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 DI + I ++ +Y+ A+K N+ + L K + + ++ + HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT-AE 293 R V F + GLK + S R+I+A E VRYY++S D T E Sbjct: 243 RTCTVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVA--TGEYTQEVRYYVTSLDNTKPE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A+AIR HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K Sbjct: 301 EIASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGS 359 Query: 354 LRRKMRKAAMDRNYLASVLTGSGL 377 + K KA D YL+ +L + Sbjct: 360 MNLKRLKAGWDEKYLSQLLQNNNF 383 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. 
Identities = 126/380 (33%), Positives = 196/380 (51%), Gaps = 16/380 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L I D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCH--------SSDDKDVIAIDGKTLRH 115 NGIP HDT +V S + P +F E F W + S K VIAIDGK LR Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + DK + ++ A+++ SL +GQ+K +KSNEI A+PELL ML +KG I+T DAMG Sbjct: 134 AVDKG--QAPAVIVGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE-LNNPAHDSYAMSEKSHGREE 234 CQ+++A KI +Q GDY+ A+K NQ L++ E L + + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK 294 +R V + + + +W GL+ + R++ + + RY+ISS A Sbjct: 252 VRRCWVSEEVECWLQGAEKWAGLRSVAAVECERTVAGQ----TTVQRRYFISSLKADAAL 307 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAG 353 A ++R HW +EN LHW LDV ED+ + RRG +AE + +R + ++ + K Sbjct: 308 IAASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKS 367 Query: 354 LRRKMRKAAMDRNYLASVLT 373 + ++ +A + +YL ++L Sbjct: 368 VNQRRFEAGLSTDYLQTLLG 387 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. 
Identities = 134/369 (36%), Positives = 194/369 (52%), Gaps = 9/369 (2%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L + I D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELN---NPAHDSYAMSEKSHGREEIRLHIVCD 242 KQ DY+ A+KGNQ L K ++ F + + + E +H R E R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VPDELIDFT-FEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 V +W GL+ L V S R + + RY++SS A FA IR Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNKD----TTETRYFLSSLSTDAATFAHYIRA 309 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L K +A Sbjct: 310 HWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSS-KGSLVMKRYRA 368 Query: 362 AMDRNYLAS 370 +D ++ Sbjct: 369 GLDDQFMMQ 377 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 365 bits (936), Expect = 2e-99, Method: Composition-based stats. 
Identities = 142/376 (37%), Positives = 202/376 (53%), Gaps = 12/376 (3%) Query: 5 KLMGHISIIPDYR-QAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 LM + D R ++ H ++L++ I AV+S + EDI +G D+L+Q+ Sbjct: 8 SLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLV 67 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NG+ +T R+ + P +F F W+ + + +DGKT+R S S Sbjct: 68 LLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGGE 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 AIH++SAF+T +V+GQ K KSNEITAIPELL L I G ++T DAMGCQK+IA + Sbjct: 125 SAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQ 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDV 243 I QGGDYL AVKGNQ L A E +F + + + D + SHGR ++ V Sbjct: 185 ITDQGGDYLLAVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVAQIASVL-- 241 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 P E I +W KK+ S R + + K + RYYISS +LTAE+ A A+R HW Sbjct: 242 PAEGIVDLADWPECKKIARVDSLRKVGNHESK---LERRYYISSRELTAEQLAAAVRAHW 298 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV--FKAGLRRKMRKA 361 +EN+LHW LDV ED IR+GNA + S ++ I +N++ D K LR K + A Sbjct: 299 GIENRLHWVLDVSFGEDASTIRKGNAPQNLSLLKKIVLNLIRLDTADKTKTSLRLKRKCA 358 Query: 362 AMDRNYLASVLTGSGL 377 A + +L + L Sbjct: 359 AWTDDVRMRILGFTSL 374 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 364 bits (935), Expect = 2e-99, Method: Composition-based stats. 
Identities = 129/373 (34%), Positives = 199/373 (53%), Gaps = 7/373 (1%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +L ++ + D+R A + H+LS++L + +CAV+SGA+ +E+I +G +L+ + Sbjct: 5 KLADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKS 120 + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + K+ Sbjct: 65 LRLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKA 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 +H++SAF+ +V+GQ T EKSNEITAIPELL +LDI+G I+T DAMG Q I Sbjct: 125 A-AAPLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A I+++G Y+ VK N +L + ++ + HGR E+R Sbjct: 184 ARAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 D D L WK + V R++ + YYISS AE+ A AIR Sbjct: 244 FDATDRLHK-AEAWKDVASFAVVERVRTV----GERTSTERVYYISSLPADAERIAVAIR 298 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 +HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K ++ K Sbjct: 299 SHWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSIKTKRLL 358 Query: 361 AAMDRNYLASVLT 373 AA + A++L Sbjct: 359 AATSDEFRAALLG 371 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 358 bits (919), Expect = 2e-97, Method: Composition-based stats. 
Identities = 130/373 (34%), Positives = 205/373 (54%), Gaps = 10/373 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ D+L+++ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 + G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGDRK-T 120 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A+H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH---DSYAMSEKSHGREEIRLHIVC 241 +GGDY+ VK NQG+L F + P +S ++ HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 + L + W +K + R + K + YYISS ++ + A AIR+ Sbjct: 241 PITPWLTQ-SQGWTNIKPVIEVTRKRYL----KDKETSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN HW LD+ EDD +IRRG+A E + R A+N+ K ++ K+++A Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLARLSP-IKDSMKGKLKQA 354 Query: 362 AMDRNYLASVLTG 374 A +L Sbjct: 355 AWSDEVREKLLFA 367 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 358 bits (918), Expect = 2e-97, Method: Composition-based stats. 
Identities = 129/380 (33%), Positives = 218/380 (57%), Gaps = 14/380 (3%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ ++ + + + D R+ +H L D+L++ + AVI+GA+G I + E H ++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----DDKDVIAIDGKTLRHS 116 + +G+P HDTI R+++ + P F +CF W+ + D +++IAIDGKTLR S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ + G + + SA++ + +GQ+ +KSNEI PEL+ +D++ I+T DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH---DSYAMSEKSHGRE 233 Q+D+AEKI GDY+ A+K NQ RL++ + + N+ A + + K HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 + R + +PDE + +W+GLK + VA+ I+++ RYYISS A+ Sbjct: 249 DKRFYYQVKLPDE-VPAGEDWRGLKTIGVAIR----ISQENGRETCDTRYYISSLKPDAK 303 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +FA A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K K Sbjct: 304 QFAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKS-KES 362 Query: 354 LRRKMRKAAMDRNYLASVLT 373 + + R A + N+LA +L Sbjct: 363 VVMRRRMAGWNVNFLAEILG 382 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 357 bits (916), Expect = 4e-97, Method: Composition-based stats. 
Identities = 133/375 (35%), Positives = 192/375 (51%), Gaps = 12/375 (3%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M K L+ ++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-DVIAIDGKTLRHSYDK 119 + GIP HDT R+ + + PA F W+ D D +A+DGK LR + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A+H+++ +ST + +GQ K +KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK----ELNNPAHDSYAMSEKSHGREEI 235 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R V V DE + +WK K + + R + VR+YISS L A Sbjct: 240 RRCWVLMV-DESMPVCQQWKA-KTIIAVQAERIENGKGYD----FVRFYISSRALDATSA 293 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K + Sbjct: 294 LKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKSRNLSMA 353 Query: 356 RKMRKAAMDRNYLAS 370 K R ++ YL Sbjct: 354 NKRRLCCLNEQYLFE 368 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 355 bits (912), Expect = 1e-96, Method: Composition-based stats. 
Identities = 142/375 (37%), Positives = 201/375 (53%), Gaps = 16/375 (4%) Query: 7 MGHISIIPDYRQA-WKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 M + I D R+ H +IL++ I AV+S + EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-----IAIDGKTLRHSYDKS 120 NGIP +T R++ + P +F F W+ + D IAIDGKT+R S S Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GS 118 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 AIH++SAF+T LV+GQ K KSNEITAIPELL L IKG ++T DAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A++I + GDYL VKGNQ +L +A E F + + + D + E+ HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASV 237 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 I +W + S R + K+ ++ RYYISS L+AE+ A A+R Sbjct: 238 LSAKG--IVDPADWPKCVTIGRIDSMRVV---GDKQSDLERRYYISSRALSAEQLAAAVR 292 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKM 358 HW VEN+LHW LDV +ED + + NA + S +R IA+ I+ DK K+ LR K Sbjct: 293 AHWGVENRLHWILDVSFSEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKR 352 Query: 359 RKAAMDRNYLASVLT 373 + AA D +L Sbjct: 353 KGAAWDDGVRERMLG 367 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 354 bits (908), Expect = 3e-96, Method: Composition-based stats. 
Identities = 127/351 (36%), Positives = 188/351 (53%), Gaps = 5/351 (1%) Query: 22 MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCI 81 + + L+++LL T+ +I A +++IE G D+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L+ + F +L HGR E R V D L + W GL + Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTC-IGHGRIEERTCQVADASAWLTEQHSGWAGLASIA 239 Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 ++ R+ ++ E+ R YISS + A R+HW VEN LHW+LDV ED+ Sbjct: 240 AVIATRT--DKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFREDE 297 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 298 CRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 352 bits (904), Expect = 9e-96, Method: Composition-based stats. 
Identities = 125/375 (33%), Positives = 193/375 (51%), Gaps = 13/375 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY--GD 63 L+ S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAHDSYAMSEKSHGREEIRLH-I 239 I +GGDY+ VK NQ L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I ++ + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLT 373 A + +Y S++ Sbjct: 357 LHADRNESYRESLIA 371 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 347 bits (889), Expect = 5e-94, Method: Composition-based stats. 
Identities = 129/371 (34%), Positives = 194/371 (52%), Gaps = 9/371 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + + D R A+ +H DI+ L + AVISGA W +I+ FGE H D+L++Y F Sbjct: 2 SVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPF 60 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 E GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGKTLRHS++ + Sbjct: 61 ECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNPET-QS 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A+H ++ +S L++ Q K+ K NE A+ E+++ +K +IT DAM QK IAEKI Sbjct: 120 ALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKIAEKI 179 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHIVCDV 243 ++ GDY+ +K N + E F + P ++Y R + R + V Sbjct: 180 IEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYRKLKV 239 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 D L EWKG+K + RS + +YISS D+ + A +R HW Sbjct: 240 SDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDIQILAKCVRGHW 294 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ A Sbjct: 295 EVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLTAAGW 353 Query: 364 DRNYLASVLTG 374 + +L G Sbjct: 354 SDEFRDELLLG 364 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 346 bits (887), Expect = 9e-94, Method: Composition-based stats. 
Identities = 131/377 (34%), Positives = 198/377 (52%), Gaps = 13/377 (3%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L+ L+ H S I D R ++ H L +ILLL +C ++ + +E+I +G H FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFP-GRADFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q +K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 IA I+ QG DYL AVK NQ L E F + + + HD +K HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEERHV 246 Query: 239 IVCDVPDELIDFTFEWKGLKKL--CVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 V D L T + G +L A+ A RY+ISSA LTAE A Sbjct: 247 SVIREVDWL-SGTRRFPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHAA 305 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ K L+ Sbjct: 306 DAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQK-SLKT 364 Query: 357 KMRKAAMDRNYLASVLT 373 + + A +YLAS+L Sbjct: 365 RRKMAGWSDDYLASLLN 381 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats. 
Identities = 112/369 (30%), Positives = 189/369 (51%), Gaps = 6/369 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H++++ + R +H L D++ L I A++SGAEGW DIE +G++ D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 ++ + VK NQ +L +A + +F E HGR+E R Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHV 305 + T +W ++ + RS + + YY+SS + IR HW + Sbjct: 248 PP-ELTEKWPTIRSIIAVERHRSANGKG----TVDTSYYVSSLSPKHKLLGHYIRQHWRI 302 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN H+ LDVV NED +I +A E + R +NI+ R K+++A + Sbjct: 303 ENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGWND 362 Query: 366 NYLASVLTG 374 +Y A + G Sbjct: 363 DYRAQLFFG 371 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 344 bits (882), Expect = 3e-93, Method: Composition-based stats. 
Identities = 123/374 (32%), Positives = 185/374 (49%), Gaps = 11/374 (2%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + I D RQA K+ H++ ++L++ C+ + E + D+ DF ++ +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D + IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDL----EGRHIAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI EKSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEKSHGREEIRLHI 239 +I G DY+ A+K N R ++ + F E L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ + R + V Y++ S E+ A + Sbjct: 237 ITEELDWYHK-SWKWAGLQSVAQV--RRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHP-AKVSLRRKRK 352 Query: 360 KAAMDRNYLASVLT 373 A MD + +L Sbjct: 353 LATMDPAFRLQMLG 366 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 344 bits (881), Expect = 5e-93, Method: Composition-based stats. 
Identities = 124/374 (33%), Positives = 193/374 (51%), Gaps = 9/374 (2%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 G+P T ARV S I P +F C WM D+I +DGK+L S + + + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 A H+++A+ + +G+++ +KSNEI AIP LLN L+++G II+ DAMG QK IA Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEK---SHGREEIRLHIV 240 I+ + DY+ A+K N R + E F + + Y E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD-LTAEKFATAI 299 + + W+ L+ + S R + E+E RYYI+S + + AI Sbjct: 254 LPM-MYFHKYKKYWRDLQAIVRVQSKR----HKGNEIETATRYYITSLPFAEHRRMSQAI 308 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN+LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K Sbjct: 309 RQHWAIENQLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRI 368 Query: 360 KAAMDRNYLASVLT 373 +AA+ YL V+ Sbjct: 369 QAALSTRYLRKVVG 382 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 343 bits (880), Expect = 6e-93, Method: Composition-based stats. 
Identities = 125/372 (33%), Positives = 194/372 (52%), Gaps = 15/372 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + + ++ I D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV------IAIDGKTLRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGKT+ S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S + A HV++AF++ LV+GQIKTDEKSNEITAIPELL + +K ++T DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD------SYAMSEKSHG 231 K+IA KI ++GGDY+ AVKGNQ +L + + + EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 R E R + + + +W+G+ + + R + + K + S + Sbjct: 241 RIEKRECYLSNDLS-WFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEAQ 299 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFK 351 A+ R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L K Sbjct: 300 AKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTCK 359 Query: 352 AGLRRKMRKAAM 363 G+R K + + Sbjct: 360 CGMRSKRKLCGL 371 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 342 bits (878), Expect = 9e-93, Method: Composition-based stats. 
Identities = 125/381 (32%), Positives = 190/381 (49%), Gaps = 19/381 (4%) Query: 6 LMGHISIIPDYR-QAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L + +PD R + H L+DIL + CAVI+GAEGWEDI ++G + F +++ + Sbjct: 5 LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLEL 64 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------DDKDVIAIDGKTLRHS 116 +NG+P HDT RV + + P F + F W + + D +A+DGK+ R S Sbjct: 65 KNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRS 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K G +H++ + +L++GQ E +EIT ++L LD+ G ++T DA GC Sbjct: 125 A-KPTFSGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGC 183 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP-LKELNNPAHDSYAMSEKSHGREEI 235 Q + E I+ +GG+Y+ VKGNQ L A F E D + +HGR E Sbjct: 184 QTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEE 243 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R V PD L W G+ + + R + + E T YY+SS + A + Sbjct: 244 RNVTVVHDPDGL---PAGWAGVGSVALVCRDRQVKGKAN---ESTAHYYLSSLRVGAAEL 297 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A IR HWH+E+ +HW LDV ED+ + R G+A IR +A+++L K + Sbjct: 298 AGYIRGHWHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAG-KKGSIH 355 Query: 356 RKMRKAAMDRNYLASVLTGSG 376 + +A D Y+A VL G Sbjct: 356 TRRLRAGWDDQYMAQVLQGLS 376 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 342 bits (876), Expect = 2e-92, Method: Composition-based stats. 
Identities = 133/369 (36%), Positives = 183/369 (49%), Gaps = 13/369 (3%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +IIPD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K IP Sbjct: 8 AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKARLPGLVSIPS 67 Query: 71 HDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR-----RGA 125 HDT++R S + F ECF W+ D V+AIDGK + + DKS R Sbjct: 68 HDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKSSNSKNGVRSK 126 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 ++++SA+S + + +GQ K +EKSNE AIPEL+ LD++ IIT DA+GCQK I + I Sbjct: 127 LYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIGCQKSITKLII 186 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHIVCDVP 244 + DY+ K N L E + H Y K HGR E R VC Sbjct: 187 ENKADYILCAKDNHEALRNIIEFNLSEESRYYLCHAKRYFEENKGHGRSEYREC-VCISA 245 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 L F W G+K L + S R + KE M RYYISS + +IR HW Sbjct: 246 KNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDPIIILKSIRPHWE 302 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 VEN LHW LD+ EDD + + GNAA FS I +A+ +L K G+ K + D Sbjct: 303 VENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQSD-IKLGMAGKRKACGWD 360 Query: 365 RNYLASVLT 373 V+ Sbjct: 361 EKIRDKVIG 369 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 337 bits (864), Expect = 5e-91, Method: Composition-based stats. 
Identities = 120/371 (32%), Positives = 189/371 (50%), Gaps = 9/371 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L H+S++ D R H L D+L L + AV SG +GW +I+ FGE ++L+++ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 NGIP TIAR++ + P C +W+ D ++ K +IAIDGKTLR + Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLGC--N 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVP 244 + GDY+ VK NQ L +A + ++ + ++ +A SEK HGR E R I +P Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIP 237 Query: 245 DELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 +L +W +K L R I + + +Y+SS D+ E ATA+R HW Sbjct: 238 SKLSPKLQEKWPSVKTLIAVERHRKIGNK----TSIETSFYLSSHDIDPEYIATAVRGHW 293 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 294 RIENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLL 353 Query: 364 DRNYLASVLTG 374 Y ++ Sbjct: 354 SDEYRELMIFA 364 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 337 bits (863), Expect = 5e-91, Method: Composition-based stats. 
Identities = 127/379 (33%), Positives = 201/379 (53%), Gaps = 17/379 (4%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+ + I D RQ K+ H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD---SYAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 + + + WKGLK + + R + ++ K L + RY+ISS E + Sbjct: 239 EYYQT-EKIKWLSQKKAWKGLKSIIM---ERKTLEKEGKRL-IEYRYFISSLKEEIETVS 293 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF--KAGL 354 A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V K + Sbjct: 294 RAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 355 RRKMRKAAMDR-NYLASVL 372 R+K + +L VL Sbjct: 353 RKKRYVIGLRPIKHLEEVL 371 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 336 bits (862), Expect = 7e-91, Method: Composition-based stats. 
Identities = 139/376 (36%), Positives = 194/376 (51%), Gaps = 25/376 (6%) Query: 15 DYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 D R +H+ S I+L+ I AVI GA+ W IEDFG++ F NGIP HDT Sbjct: 25 DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSHDTF 84 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSY---------------DK 119 R S + P KF E + W++ IAIDGKT+R +Y D Sbjct: 85 NRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQDKRHRKQGVLPDS 143 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + + +HVISAF+T + +GQ+ T EK NEI IPELL+ML IK IIT DA+GCQ+ Sbjct: 144 NTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDALGCQRT 203 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP--LKELNNPAHDSYAMSEKSHGREEIRL 237 IAEK+ K GDY+F VK NQ +L + + + D Y E+ HGR E R+ Sbjct: 204 IAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYETHEEGHGRNESRI 263 Query: 238 HIVCDVPDELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 C+ P L D +WK ++ + R+ K + R +ISS + A+K Sbjct: 264 CYCCNDPGFLGADIRKKWKNIQSFGYIENTRN----TNKGTTVEKRCFISSLEPDAQKIL 319 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 R HW +EN LHW+LDV +ED+ + RR +A FS + IA+ L N+K + + R Sbjct: 320 KNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLRNNK-REIPINR 377 Query: 357 KMRKAAMDRNYLASVL 372 K A D +L ++ Sbjct: 378 KRLIAGWDNEFLWELI 393 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 335 bits (860), Expect = 1e-90, Method: Composition-based stats. 
Identities = 103/376 (27%), Positives = 173/376 (46%), Gaps = 17/376 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + + +PD R A H L++IL + + A + GA D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDGKTLRHSY 117 +NG+P HDT +RV + P F + F +M+ K VIA+DGK LR Y Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + R +++A++ + + ++ NE +L+ +L +KG ++T DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRL 237 + +AE I+ +GGDY+ AVK NQ L + + S + HGR+E R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQG--KPSTITVDAGHGRKEKRR 239 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 +V VP D ++ GLK + S R + RY++ S + Sbjct: 240 AVVAAVPQMAQD--HDFAGLKAVARITSKR-------GTDKTVERYFLMSQAYPPKDVLR 290 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN LHW LDVV++ED + R+ NA + +R +A+N+ LR K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 MRKAAMDRNYLASVLT 373 +++A + +L ++ Sbjct: 351 LKRAGWNDTFLFELIQ 366 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. 
Identities = 115/371 (30%), Positives = 187/371 (50%), Gaps = 7/371 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ H+ I D R EH + DI L + AVISGA+ W +FG ++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ + S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKGAKA-SASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 K+GGD + VKGNQ +L +A + +F NNP + + + K HGR E R+ C + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHV 305 + +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 PA-EIKMKWSQLKTLIAVERHRKVGNK----TSIDTHFYVSSAVLTSEAFGRAIRAHWQT 294 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 295 ENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHP-AKTSQTQKFNRACWSD 353 Query: 366 NYLASVLTGSG 376 ++ ++ G+G Sbjct: 354 DFREEIIFGTG 364 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 331 bits (848), Expect = 3e-89, Method: Composition-based stats. 
Identities = 133/381 (34%), Positives = 203/381 (53%), Gaps = 21/381 (5%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + ++ + + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---------DKDVIAIDGKTL 113 ENGIP HDT+ RV + + P E W SD K ++AIDGKT+ Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 R + S ++ A+H+++A++T + GQ+ T+EKSNEITAIPELL+M+ +KG +++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGRE 233 MG QK IA+KI K+ DY AVK NQ L E+ P E++ A D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTL---LEDIVPFFEMSQEADDHYHTVEKAHGQI 240 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 E R + V L E+ ++ + A I ++ + RY+I S ++A+ Sbjct: 241 ETRAYEVIHDVSWLRKTHPEFGHIQSIGRA----RIHLDKNGQESEESRYFILSCQVSAK 296 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKA 352 + +R HW +E+ +HW LDVV ED K A + + + +L K Sbjct: 297 ELCDYVRGHWQIES-MHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLKQLDFGKKM 355 Query: 353 GLRRKMRKAAMD-RNYLASVL 372 +RRK ++ YL +L Sbjct: 356 SMRRKKYALSLSFDKYLKQLL 376 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 330 bits (847), Expect = 4e-89, Method: Composition-based stats. 
Identities = 110/369 (29%), Positives = 177/369 (47%), Gaps = 12/369 (3%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 +PD R H L D+L + + A I GAE D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 HDT +RV + P F CF ++ D V+AIDGKTLR S+D++ R A+HV+ Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFLDHL-GEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189 SAF++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GG Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELID 249 D+LF +K N+ L E F + + ++ HGR E+R H V L Sbjct: 188 DWLFPLKDNRPALRAEVERYFA--DPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWLAS 245 Query: 250 FTF-----EWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 GLK L + + T Y+SSA L + A A+R HW Sbjct: 246 DRRFPDEAVLPGLKILGLVER---TVTSPDGRTTATRTLYLSSAALEPKTLARAVRAHWS 302 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +E +HW LD +ED + R+ + E + +R +A+N++ + + +R + ++A Sbjct: 303 IEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSAN-NQDSIRLRRKRAGWS 361 Query: 365 RNYLASVLT 373 +Y ++L Sbjct: 362 DDYARTILG 370 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 330 bits (847), Expect = 4e-89, Method: Composition-based stats. 
Identities = 105/374 (28%), Positives = 176/374 (47%), Gaps = 13/374 (3%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + + + +PD R A + H L ++L++ +V+ G+ ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-DDKDVIAIDGKTLRHSYDK 119 + ++ IP HDT + V I P F + D D D+IAIDGK LR + D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 ++SA+++ L + + D + E++A E L ++D++GK++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHI 239 I GGD+ A+KGNQ L F ++P + HGR+E R + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSDP---TAVTENTGHGRKETRKAV 244 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 V + + E+ GLK + R E ++ RY+ S T E A+ Sbjct: 245 VVSA--KALAEYHEFPGLKGFGRIEATR----ETGGKVTSETRYFALSWVPTPEVLLAAV 298 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN LHW+LDV ED + R+ N + +R A+++L D K L K++ Sbjct: 299 RDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIK 357 Query: 360 KAAMDRNYLASVLT 373 +A D +L S+L+ Sbjct: 358 RAGWDTTFLRSILS 371 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 323 bits (828), Expect = 6e-87, Method: Composition-based stats. 
Identities = 103/370 (27%), Positives = 168/370 (45%), Gaps = 13/370 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ +PD R A H L ++L++ +V+ GA ++ FG + + + Sbjct: 37 ILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-DDKDVIAIDGKTLRHSYDKSRRRG 124 + +P HDT + V I P F + D + D DVIA+DGK LR + D Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++SA++ L + + D + E+ A E L ++ +KGK++T DA+ C + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVP 244 GGD+ A+K NQ L F + AH S + HGR E R V Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEP---DAHPSALSEDIGHGRTETRKATVVS-- 269 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 + + E+ GLK + R + + RY+ S T E +R HW Sbjct: 270 SKALAEHHEFPGLKAFGRVEATR----KTAEGTTSETRYFALSWVPTPEVLLATVRAHWA 325 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW+LDV ED + R+ N+ + +R A++++ D K L K+++A D Sbjct: 326 IENSLHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWD 384 Query: 365 RNYLASVLTG 374 ++L +VL G Sbjct: 385 DDFLRNVLNG 394 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 317 bits (813), Expect = 4e-85, Method: Composition-based stats. 
Identities = 122/387 (31%), Positives = 188/387 (48%), Gaps = 20/387 (5%) Query: 3 LKKLMGHISIIPD------YRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPD 56 + ++ I I D RQ+WK+ + LS IL L ++G E +++EDF E + Sbjct: 1 MTTMIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEP 60 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRH 115 Y D G P HDT+ RV+S ++ + E + + + S D +I++DGKT+R Sbjct: 61 LFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG 120 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 ++ + + +H+++A+ H L +GQ+ +EKSNEI AIP+LL +DI+ I+T DAMG Sbjct: 121 --NRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMG 178 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPAHDSYAMSEKSHGR 232 Q I + I K DY AVKGNQ L F L E Y EKS G+ Sbjct: 179 TQTAIVDTIIKGKADYCLAVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQ 238 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTA 292 E+R + V L +W L+ + + + ++ +L RY+I S Sbjct: 239 IEVREYWVSSDIKWLCQNHPKWHKLRGIGMTRN----TIDKDGQLSQENRYFIFSFKPDV 294 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA 352 FA +R HW +E+ +HW LDVV +ED + AA + IR + + L K Sbjct: 295 LTFANCVRGHWQIES-MHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK 353 Query: 353 GL--RRKMRKAAMD-RNYLASVLTGSG 376 L RRK R ++ +YL + G Sbjct: 354 DLSYRRKQRYISVHLEDYLVQLFGERG 380 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 316 bits (810), Expect = 7e-85, Method: Composition-based stats. 
Identities = 104/369 (28%), Positives = 186/369 (50%), Gaps = 7/369 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG D+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + ++K +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 + D++ +KGNQ A + ++PA + HGR+E R + + Sbjct: 182 SKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRRV-MQIEGN 239 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHV 305 + + +W ++ L S R++ + + R+Y+SS + + A IR HW + Sbjct: 240 LPPELSEKWPHIRTLVEVASERTVGNK----TACSSRWYVSSLPVDTAQLADIIRAHWAI 295 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN+LHW LDVV ED+ + + A+ + A++++ + K L K + AA D Sbjct: 296 ENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQSAAWDP 355 Query: 366 NYLASVLTG 374 + + +L G Sbjct: 356 AFRSELLFG 364 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 312 bits (798), Expect = 2e-83, Method: Composition-based stats. 
Identities = 121/412 (29%), Positives = 185/412 (44%), Gaps = 44/412 (10%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + I I D R+ K+ + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK------------------- 103 P HDT+ R I + C+ W + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 104 -DVIAIDGKTLRHSYDKSR--------------RRGAIHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT+ + + + +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP+LL+ +DI +G ++T DA+G QK I EKI ++ DYL VK N +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPAHDSYAMSE---KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 ++ +D +E + HG R I C P L +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIK 316 Query: 265 SFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + IA E++ +ISS E R HW VEN LHW+LDV NEDD + Sbjct: 317 TEKINIA--TGEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAELFSGIRHIAINILT--NDKVFKAGLRRKMRKAAMDRNYLASVLTG 374 + N+A+ FS + +A+ IL D+ K + RK +KA YLA+++ Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLINN 425 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 312 bits (798), Expect = 2e-83, Method: Composition-based stats. 
Identities = 110/311 (35%), Positives = 165/311 (53%), Gaps = 10/311 (3%) Query: 1 MELKKLM---GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 M+ +LM +PD R + H LS++L + +CAV+ GA + D+ +G+++ + Sbjct: 1 MKTGQLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAW 60 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHS 116 L+++ + G+P HDT RV++ I PA F F+ W+ + D V+AIDGKT R S Sbjct: 61 LRKFLKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRS 120 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K G +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Sbjct: 121 GGKDT-SGPLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGT 179 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIR 236 Q IA I+ +G DY+ VK N L + + K HGR E+R Sbjct: 180 QAAIARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVR 239 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 D +L + +W GL+ + R++ + + YYISS A + A Sbjct: 240 RCWAYDAVSQLYK-SEQWAGLQSFALVERERTV----DGKTSVERHYYISSLPADAARIA 294 Query: 297 TAIRNHWHVEN 307 A+R+HW VE+ Sbjct: 295 QAVRSHWAVES 305 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 311 bits (796), Expect = 3e-83, Method: Composition-based stats. 
Identities = 115/339 (33%), Positives = 172/339 (50%), Gaps = 4/339 (1%) Query: 38 ISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F CF ++ Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 LL L + G I+T DAMGCQ IAE+I+ +G D L +K N G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 PAHDSYAMSE-KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 A + HGR R V D + W L ++ + R I Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFV-DAAATALAPLSGWPDLSRVLAVETLRGIPG--TGT 240 Query: 277 LEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGI 336 + +RY+++S IR HW VEN LHW L+V EDD ++R AA F+ + Sbjct: 241 VVADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNFALV 300 Query: 337 RHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGS 375 R IA+N++ D+ +A LR + +KAA D +Y+ ++ Sbjct: 301 RKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIANQ 339 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. 
Identities = 105/372 (28%), Positives = 169/372 (45%), Gaps = 14/372 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + +PD R +H L +IL + + AV+ GA ++E F + D L+Q+ E Sbjct: 3 FLDVFGEVPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDGKTLRHSYDKSR 121 G P HDT +RV++ + P +E F+ +M K +A+DGK+LR +Y K R Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 V++ F + + Q E E+ A L +L +KG +T DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQTVAQEGG-EVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 + ++ GG Y+ A+KGNQ +L A + E +HGR E+R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKAA-AGKATKFHQTEEDAHGRHEVRRAFVI 239 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 L + S+R++ + + VR Y S + A + +R Sbjct: 240 PFAQTPGKNAL--VDLCAIGRVESWRTVEGKTTHK----VRCYALSRKMPAHELLATVRR 293 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW+LDV++ ED + R+ N A + +R + +N+L D K L K KA Sbjct: 294 HWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRADP-EKIPLSHKRLKA 352 Query: 362 AMDRNYLASVLT 373 L S+ T Sbjct: 353 RWADQDLLSLFT 364 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 295 bits (756), Expect = 1e-78, Method: Composition-based stats. 
Identities = 115/385 (29%), Positives = 174/385 (45%), Gaps = 43/385 (11%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWMRDCHSSDDK--------------------DVIAIDGKTLRHSYDKSR-------- 121 + W + IAIDGKT+ + + + Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDI-KGKIITTDAM 174 +H++SAF + SL +GQ + K NEI AIP+LL+ +DI +G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSE---KSHG 231 G QK I EKI ++ DYL VK N +L + E ++ +D +E + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA E++ +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIA--TGEIQNEKHCFISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT--NDKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVLTG 374 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLINN 382 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 285 bits (730), Expect = 2e-75, Method: Composition-based stats. 
Identities = 107/369 (28%), Positives = 168/369 (45%), Gaps = 14/369 (3%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 + I D R H L+++L L + A + GA+ +I +F E LK+ +G P Sbjct: 8 LREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRHGCP 66 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCH-----SSDDKDVIAIDGKTLRHSYDKSRRRG 124 HDT +R+ I P + ++ + V+A+DGK LR Y+K R Sbjct: 67 SHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGRAFM 126 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++S + L + + + S+E+ A LL +D+KG I+T DA+ C+ D A+ + Sbjct: 127 PPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCRPDTAKAL 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVP 244 + Y A+K N+GRL E F + + E HGR E R V P Sbjct: 186 IGRKAHYALALKANRGRLFACAEAGFVAADAAGDLA-FHETRETGHGRLETRRASVL--P 242 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 + + GLK + + R +VRY S L K A +R HW Sbjct: 243 LKAFKQAPAFPGLKAIGRIQATRQ---GADGRAVTSVRYIALSKVLAPHKLAEVVRAHWT 299 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN+LHW LDVV +EDD + R+ NA + + IR +A +IL + K + KMR+ + Sbjct: 300 IENQLHWSLDVVFHEDDARSRKDNAPQNLAVIRRLARDILAAHPLDK-PIASKMRRVNWN 358 Query: 365 RNYLASVLT 373 R++ T Sbjct: 359 RDFFHEFFT 367 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 282 bits (722), Expect = 1e-74, Method: Composition-based stats. 
Identities = 116/367 (31%), Positives = 173/367 (47%), Gaps = 15/367 (4%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYD 118 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-VK 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K HV+SAFS + Q+ D K+NEI AI +LL++LD+ G +++ DA+G Q Sbjct: 123 KLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQT 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 I E+I +GGDY+ VK NQ + E F + D +E SHGR E R + Sbjct: 183 AIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRRY 240 Query: 239 --IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 I+ + E + KGL+ + V R ++ + V YYISS Sbjct: 241 ESILNPLEIEANEVLTRRKGLRSIHKVVRKRR--DKKSDKTSEEVAYYISSLT-DVSSLK 297 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLR 355 AIR HW +ENKLH LDV D R N A++ I+ I + I+ K K+ + Sbjct: 298 QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKTNMKSSIP 357 Query: 356 RKMRKAA 362 R +K A Sbjct: 358 RIQKKPA 364 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 279 bits (713), Expect = 2e-73, Method: Composition-based stats. 
Identities = 103/367 (28%), Positives = 179/367 (48%), Gaps = 17/367 (4%) Query: 9 HISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG- 67 I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ +G Sbjct: 8 AIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGK 67 Query: 68 ------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 +P HDT V I P +F E + ++ + + IAIDGKT R ++ Sbjct: 68 ELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG-IKQTA 126 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G ++ Sbjct: 127 NSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVI 186 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 E I +GG+++ VKGNQ +L + E++F +E + + HGR E R Sbjct: 187 EMILSKGGNFVLPVKGNQKKLLEFIEKEF--REYRGNTVSADTQEDIGHGRVEKRTVYCI 244 Query: 242 DVP---DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATA 298 D++ +WKG+K L V R + + K + YYI++ + ++ A Sbjct: 245 TEIKTDDDIDGCMQKWKGVKTLVKIV--REVYKKADKSTRIETVYYITNL-IDPKEINRA 301 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA-GLRRK 357 IR HW +EN LH LDV++NED + N E F + +A+ I+ + + R Sbjct: 302 IRAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRGISMNRT 361 Query: 358 MRKAAMD 364 + Sbjct: 362 RKLCGYS 368 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 275 bits (704), Expect = 2e-72, Method: Composition-based stats. 
Identities = 109/286 (38%), Positives = 156/286 (54%), Gaps = 9/286 (3%) Query: 9 HISIIPDYRQAW-KMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 +IPD R+A H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCH-SSDDKDVIAIDGKTLRHSYDKSRRRGAI 126 IP HDT RV S I P F F +W D D +A+DGKT+R S+ S R A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-AL 135 Query: 127 HVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK 186 H++ A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 QGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDE 246 GGDY+ A+KGNQ L+ + +P + EK HGR E R V D D Sbjct: 196 AGGDYVLALKGNQSALHDDVRLFMETQADRHPQGQA-EAVEKDHGRIETRRIWVNDEIDW 254 Query: 247 LIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTA 292 L +W GLK L + S R + ++ R +I+S Sbjct: 255 LTQKP-DWPGLKTLVMVESRREL----NGQVSCERRCFITSHTADP 295 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 274 bits (700), Expect = 4e-72, Method: Composition-based stats. 
Identities = 94/253 (37%), Positives = 147/253 (58%), Gaps = 7/253 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++H+++A+ + +L++GQ+K D+KSNEITAIP+LL ML ++G I+T DAMGCQK IA++ Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP-AHDSYAMSEKSHGREEIRLHIVCD 242 I + DY+ AVK NQ L + + F ++N H + + HGR E R + Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYS-TI 119 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 V D+L+ W L + + S R + + RY+I S + A++F A+R H Sbjct: 120 VGDDLLAGITGWDNLNAIGMVESKREV----GNTISNEKRYFIMSINGHAQRFGDAVREH 175 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK + A Sbjct: 176 WGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRKMAG 234 Query: 363 MDRNYLASVLTGS 375 D ++L VLTG+ Sbjct: 235 WDNSFLIKVLTGN 247 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. 
Identities = 108/350 (30%), Positives = 166/350 (47%), Gaps = 16/350 (4%) Query: 3 LKKLMGHISIIPDYRQAWK--MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL S IPD+R+A K + HKLSDI++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H ++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + +EKSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 QKDI +KI+++ GD++ +K NQ L E+K +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKE---LSPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ + Sbjct: 327 GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVFS 375 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 271 bits (692), Expect = 4e-71, Method: Composition-based stats. 
Identities = 103/372 (27%), Positives = 167/372 (44%), Gaps = 30/372 (8%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 E+ L+ ++ +PD R + H L+ +L LT CAV++GA + ++ P+ L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFE-------NGIPVHDTIARVVSCISPAKFHECFINWMR-DCHSSDDKDVIAIDGKTL 113 P TI RV++ I W+ + +A+DGK+L Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITTD 172 R + RR +H+++A + LV+ Q+ EK+NEIT LL+ L D+ G ++T+D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGR 232 A+ Q D A ++ + Y+ VK N +L+ + P +++ HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLK-SLPWQQIPL----QDRTRTTGHGR 270 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA---D 289 EIR VC V + L + G ++ V R + ++ + Y ++S Sbjct: 271 CEIRRLKVCTVNNLL------FPGARQAVQIVRRR--VNRTTGKVSLKTIYAVTSLAAEQ 322 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + A IR HW VE LH DV ED ++R GNA + + R++AI L V Sbjct: 323 APPARVAQLIRGHWTVEA-LHHVRDVTFAEDASQLRSGNAPQAMATYRNLAIGALRLAGV 381 Query: 350 --FKAGLRRKMR 359 AGLRR R Sbjct: 382 RNIAAGLRRTAR 393 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 264 bits (674), Expect = 5e-69, Method: Composition-based stats. 
Identities = 90/247 (36%), Positives = 140/247 (56%), Gaps = 3/247 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K E++ EITAIP L+ +L++ G ++T DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAHDSYAMSEKSHGREEIRLHIVCD 242 +G DY A+KGNQ L + +E F E H + EK R E+ + Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAYRTE 248 Query: 243 VPDELID 249 Sbjct: 249 QERLWSH 255 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 263 bits (672), Expect = 7e-69, Method: Composition-based stats. 
Identities = 105/360 (29%), Positives = 161/360 (44%), Gaps = 41/360 (11%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L+ + + I D RQ K+ H+ I++ + V + + W ++ DF DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINW--------------------MRDCHSSDD 102 P HDT+ R + P + W + + Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 KDVIAIDGKTLRHSYDKSRRRGA--------------IHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT++ + ++ RRR +H++SAFS L +GQ + D+K Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAF- 206 NEI AIP LL+ LDI +G ++T DAMG QKDI +I K+ YL VK NQ L + Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F L N + + E HG +R VC L +W+ L+ + Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIR 316 Query: 265 SFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + R + E E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 317 TER--VDEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. 
Identities = 89/365 (24%), Positives = 168/365 (46%), Gaps = 18/365 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ P + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-IAIDGKTLRHSYDKS 120 +P TI +V + + +D + +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T +KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQGG-DYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHI 239 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSD---PVERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA---DLTAEKFA 296 + V L + +++ + R ++ V Y I S + A Sbjct: 276 ILTVARGL-----RFPYAQQVIQIIRRRRVLG--AGAWSTEVVYAICSLPCEQAPPKLLA 328 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 + IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 329 SWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHRRAGHSNIARAC 388 Query: 357 KMRKA 361 + A Sbjct: 389 RRLAA 393 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. 
Identities = 91/348 (26%), Positives = 159/348 (45%), Gaps = 16/348 (4%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-Y 61 + L+ + + D+R+ H L +L++ I + G G+ ++ +F + + L Q + Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSDDKDVIAIDGKTLRHSYDK- 119 +P + TI RV+ + + F W + + DD + + +DGK+L+++ Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -SRRRGAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + ++ I +S FS LV+ + + +K +EI ++ ++ K+ T DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRL 237 K I K DY+ VKGNQ L K ++ ++ + + SHGR+ R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDL----SNSSKPESCFLEQDNSHGRKISRK 236 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 V V E +G + L + + K E YYISS +A+ FA Sbjct: 237 IEVFKVRKN------ERQGFENLRRVIKVERKGSRGDKTYE-ETAYYISSLTESAQVFAK 289 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 IR HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 290 IIRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFR 337 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 256 bits (654), Expect = 9e-67, Method: Composition-based stats. 
Identities = 88/210 (41%), Positives = 134/210 (63%), Gaps = 1/210 (0%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + ++ Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 I K+ DY+ AVK NQ +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEFRFG 213 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 256 bits (653), Expect = 1e-66, Method: Composition-based stats. Identities = 106/363 (29%), Positives = 169/363 (46%), Gaps = 18/363 (4%) Query: 3 LKKLMGHISIIPDYRQA--WKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K L + +PDYR+ ++KL DILLL I + DI FG+ + + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---DKDVIAIDGKTLRHSY 117 G +G+P T+ R+ I E + H D++ IDGK +R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 ++ R I +SA+S + + +EKSNEIT++P+LL+ +D+ G I+T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEK-SHGREEIR 236 K I +KI+++GGD+L +K NQ L E+ L E D Y+ HGR E R Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEP----VDVYSEGPFLEHGRIETR 251 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 + + D LI +W G L V + + + R+Y+SS +A + Sbjct: 252 VCRIFRGND-LITDREKWNG--NLTVVEIRTATERKSDGQKSSERRFYVSSFHGSARRLG 308 Query: 297 
TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT--NDKVFKAGL 354 T R HW +E+ +HW LD + +D + +A I+ + + IL+ K K Sbjct: 309 TIARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAILSIWKGKRKKPSE 367 Query: 355 RRK 357 + K Sbjct: 368 KAK 370 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. Identities = 88/241 (36%), Positives = 137/241 (56%), Gaps = 8/241 (3%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+ + I D RQ K+ H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD---SYAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 L 237 Sbjct: 239 E 239 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 247 bits (630), Expect = 5e-64, Method: Composition-based stats. 
Identities = 85/273 (31%), Positives = 136/273 (49%), Gaps = 9/273 (3%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L+ + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD------VIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + D IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMS--EKSHGREE 234 ++++A+ I +G YL +K NQ +++ F + A + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR 267 R C W GL + + + R Sbjct: 242 RRRVFACPDAG-CFTTLRGWPGLTTVLASETIR 273 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. 
Identities = 84/324 (25%), Positives = 134/324 (41%), Gaps = 27/324 (8%) Query: 50 FGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ + NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKS 229 T DA+ C+ D A I GGDY A+K NQ L + E +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD 289 H R E R + V D ++ GL+ + + L VRY++ S Sbjct: 206 HDRCERRRACIVAVND------IDFPGLQAIGSVE---ATSRHADGRLTSHVRYFLLSTI 256 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 257 MSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHP- 315 Query: 350 FKAGLRRKMRKAAMDRNYLASVLT 373 KA +RRK++ A D +L S++ Sbjct: 316 DKASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. 
Identities = 91/362 (25%), Positives = 146/362 (40%), Gaps = 48/362 (13%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L+ + G + +TH + L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 CFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEK 148 F+ W+ + S + +A+DGK L + +K++ +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRNT-HLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEE 208 +NEIT IPELL +LDI G I+T DA+G Q I E+I +QGG + VK NQ + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPLKELNNPA-----------------HDSYAMSEKSHGREEIRLHIVCDVPDELIDFT 251 E + ++ EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQ 239 Query: 252 FEWKGLKKLCVAVSFR----------------------------SIIAEQKKELEMTVRY 283 EW ++ + R + AE+ ++ Sbjct: 240 KEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTA 299 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 IS LTAE+ + R HW +EN+LH LD ED ++ S IR A NI Sbjct: 300 LISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSR--NNLSLIRKYAYNI 357 Query: 344 LT 345 L Sbjct: 358 LR 359 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 241 bits (614), Expect = 4e-62, Method: Composition-based stats. 
Identities = 95/387 (24%), Positives = 164/387 (42%), Gaps = 31/387 (8%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDF 57 ++ L+ + I D R+A + LS +L + A ++GA G +I DFG+ Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQYGDFENGI---PVHDTIARVVSCISPAKFHECFINWM--RDCHSSDDKDVIAIDGKT 112 L D G P I + + A F W+ + V+A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKII-T 170 LR ++ + +R + ++SA LV GQ++ + +NEIT + LL L DI G ++ T Sbjct: 141 LRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 TDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNK-AFEEKFPLKELNNPAHDSYAMSEKS 229 DA+ Q + A + + G DY VKGNQ L + FE+ PL + + + E+ Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYRKTFEQTLPLLQKP----PQHEVEERG 254 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD 289 HGR + + + + V R + + + ++S Sbjct: 255 HGRIKKWQAWTTEAKG------IGFPEVATAAVI--RRDEFDLKGIRVSREYAHILTSVA 306 Query: 290 ---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 TA IR HW +EN++H+ D ED + GN+ + R++AI I+ Sbjct: 307 GNRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGIIRR 366 Query: 347 DKVFKAGLRRKMRKAAMDRNYLASVLT 373 + + K ++ + A DR+ + +L Sbjct: 367 NGIRK--IKETLEYIAGDRDRVLPLLA 391 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 234 bits (597), Expect = 4e-60, Method: Composition-based stats. 
Identities = 78/373 (20%), Positives = 130/373 (34%), Gaps = 35/373 (9%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L+ + +PD R+ + L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH--------------SSDDKDVIA 107 P T RV+ P E W + VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKTLRHSYDKSRRRG--AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML--- 162 DGKT+R + ++ V+ V+ ++ +EI A+ ++ L Sbjct: 151 ADGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVACEPVND-GDEIGAVRTVMGRLADR 209 Query: 163 --DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH 220 + G ++ TDA Q + E++ GG +L VK NQ R+ A P ++ Sbjct: 210 WGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRIL-AKVRALPWAQVRA--- 265 Query: 221 DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEL--E 278 K+HGR E R V P G ++ Sbjct: 266 -QDTCRGKAHGRAETRTVRVVQAP---THVDLALAGTAQVIKITRHTRRRPHPGAPAAST 321 Query: 279 MTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSG 335 Y ++S A +R+HW +EN++HW D +ED R GN + Sbjct: 322 RENAYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPINLAC 381 Query: 336 IRHIAINILTNDK 348 +R+ AI Sbjct: 382 LRNTAITRHRAHG 394 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 233 bits (595), Expect = 7e-60, Method: Composition-based stats. 
Identities = 88/397 (22%), Positives = 162/397 (40%), Gaps = 35/397 (8%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHP-DFLKQ 60 E++ L ++ +PD R + H+L IL L+ AV +G + E+I + P L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD---VIAIDGK 111 G + + P DT+ RV+S + + + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIKGK 167 TLR + R H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGRA--PHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQ-KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMS 226 ++T DA+ + A+ I + G ++F VK N L+ + ++ ++ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI----GHSAE 271 Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMT-----V 281 ++HGR E R + + + + + ++ V + + T Sbjct: 272 GRAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIPSTVT 331 Query: 282 RYYISSADLTA---EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 + ++S L A A R HW +ENK+HW DV ED ++R G + + +R+ Sbjct: 332 VHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRN 391 Query: 339 IAINILTN--DKVFKAGLRRKMRKAAMDRNYLASVLT 373 + I ++ +RR D L ++LT Sbjct: 392 LIIGLIRLAGHNRIAPTIRRIRH----DNALLLAILT 424 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 227 bits (578), Expect = 6e-58, Method: Composition-based stats. 
Identities = 80/383 (20%), Positives = 149/383 (38%), Gaps = 17/383 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD+R + ++L+ +L L + I+G + + ++ P + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS----SDDKDVIAIDGKTLRHSY 117 F +P TI R+V P + + W +A DGK ++ + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRRGAIH--VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + + V+ A +G + +EI ++ L+N + ++TTD + Sbjct: 144 SRPPQGSVRQEAVVEAVRHDTGTALGHQRV-VAGDEIASVRRLVNRVCDHNTLVTTDCLH 202 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 + +A I+ +GG +LF++KGNQ + A P E N + EK+HGR E Sbjct: 203 AHEPLARAIRAKGGHWLFSIKGNQPTVR-AKLAGLPWDEFGN----QHVTREKAHGRIEE 257 Query: 236 RLHIV-CDVPDELIDFTFEWKGLK-KLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 R L+ F + +K + E + +S+ + Sbjct: 258 RALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTDQASPA 317 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A R HW VE +H D M+ED IR NAA ++ R I+ L Sbjct: 318 QLARWARGHWTVEA-IHHVRDRTMDEDRHTIRTKNAALNWAIARDTTISALRLAGYKN-- 374 Query: 354 LRRKMRKAAMDRNYLASVLTGSG 376 +R+ R D + ++ + Sbjct: 375 IRQARRATIRDPGLVLQIIALTS 397 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 222 bits (565), Expect = 2e-56, Method: Composition-based stats. 
Identities = 88/328 (26%), Positives = 138/328 (42%), Gaps = 22/328 (6%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L + + A +G G+ + T D + P T V+S + PA + Sbjct: 3 LLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLNA 62 Query: 89 CFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKT 145 ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 63 RMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLAV 120 Query: 146 DEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRLN 203 EKSNEI + LL +L ++T DAM Q A+ I YL VK NQ ++ Sbjct: 121 AEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKIL 180 Query: 204 KAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVA 263 A P E+ A D + HGR E R + + + K++ Sbjct: 181 -ARITALPWAEVPAAATD----DSRGHGRVETRTLQIITAARGIG-----FPYAKQIIRI 230 Query: 264 VSFRSIIAEQKKELEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNED 320 R I A ++ + V Y I S + T +R H +EN LHW DV +ED Sbjct: 231 TRERLITATDQR--SVEVVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTFDED 288 Query: 321 DCKIRRGNAAELFSGIRHIAINILTNDK 348 + GN A++ + +R+ AIN+ + Sbjct: 289 RQRAHTGNGAQVLATLRNTAINLHRLNG 316 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 221 bits (562), Expect = 4e-56, Method: Composition-based stats. 
Identities = 89/249 (35%), Positives = 129/249 (51%), Gaps = 14/249 (5%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YDKSR 121 IP HDT R S I P F F NW++ + K V+AIDGK +R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQ-EVKGVVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 + I + +Y+ A+K N+ + L K + + K+ + HGR E R Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT-AEKFA 296 V F + GLK + S R+I+A E VRYY++S D T E+ A Sbjct: 183 TVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVA--TGEYTQEVRYYVTSLDNTKPEEIA 240 Query: 297 TAIRNHWHV 305 +AIR HW + Sbjct: 241 SAIRQHWSI 249 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 220 bits (561), Expect = 6e-56, Method: Composition-based stats. 
Identities = 90/369 (24%), Positives = 157/369 (42%), Gaps = 25/369 (6%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDF-LKQYG 62 L+ ++ +PD R + H L +L + AV++GA + ++ P L + G Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 63 DFE------NGIPVHDTIARVVSCISPAKFHECFINWMRDCHS--SDDKDVIAIDGKTLR 114 F + P T R+++ + + W+ C + + V ++DGKTLR Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 S +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 147 GS---GPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIAE-KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGRE 233 Q++ A + + Y+F VK NQ RL + + P ++ S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKT-LPWTKIPI----QDETSTRGHGRY 258 Query: 234 EIRLHIVCDVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTA 292 +IR L +DF ++ L + ++ + + + +S+A Sbjct: 259 DIRRLQAVTCTGPLALDFPHA---VQALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGP 315 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VF 350 + A +R HW +E LH D ED ++R GNA + +R+ AIN+L Sbjct: 316 AELADWLRGHWAIET-LHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGITTI 374 Query: 351 KAGLRRKMR 359 A LR R Sbjct: 375 AAALRHNSR 383 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 220 bits (560), Expect = 7e-56, Method: Composition-based stats. 
Identities = 103/196 (52%), Positives = 133/196 (67%), Gaps = 13/196 (6%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L L H + + D RQA K+ +KL D+L L + AVISGAEGWE+IEDFG +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TDEKSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVK 196 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVK 183 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 220 bits (560), Expect = 7e-56, Method: Composition-based stats. Identities = 77/236 (32%), Positives = 120/236 (50%), Gaps = 7/236 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 ++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ + K V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQ-EVKGVVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + + ++SA+S + + +GQ+K D+KS+EITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREE 234 DI + I +Y+ A+K N+ + + ++ + + + R Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRDEIINRVIRHVSEKCRTW 238 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 212 bits (540), Expect = 1e-53, Method: Composition-based stats. 
Identities = 90/418 (21%), Positives = 151/418 (36%), Gaps = 62/418 (14%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVIS-GAEGWEDIEDFGETHPDF----LK 59 L+ ++I D R H L+ IL + CA ++ G + IE + + P L Sbjct: 29 DLIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLH 88 Query: 60 QYGDFENGI---PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD------------ 104 + D G+ P TI RV++ + + C ++ + Sbjct: 89 IWRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRR 148 Query: 105 ---------------------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 A+DGK L+ + R +H+IS + + + V Q Sbjct: 149 TEREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQR 206 Query: 144 KTDEKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAE-KIQKQGGDYLFAVKGNQ 199 + KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK NQ Sbjct: 207 QIPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQ 266 Query: 200 GRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + + A ++ + + HGR E R I+ P + IDF + + + Sbjct: 267 PTLHATAITALTGTDTDFAAV-THRETHRGHGRTEYR--ILRTAPADGIDFPYAAQVFRV 323 Query: 260 LCVAVSFRSIIAEQKKELEMTVRYYISSA---DLTAEKFATAIRNHWH-VENKLHWRLDV 315 L R V Y I+ A +R HW +EN +H DV Sbjct: 324 L------RHRGGLDGIRHSKEVCYGITDLTARQAGPAHLAAYVRGHWKAIENGVHHVRDV 377 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 ED C+ R + R++A L + R+ D + + Sbjct: 378 TFAEDACQARTATLPRALAAFRNLATGTLRRAGHVN--IAHARREHGYDHQRVLDLFN 433 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 206 bits (525), Expect = 9e-52, Method: Composition-based stats. 
Identities = 92/237 (38%), Positives = 121/237 (51%), Gaps = 9/237 (3%) Query: 143 IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 + T++KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK NQ +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 A EK EL H +Y HGR + R H V VP EW +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPG-FAAKGEWPWIKA 119 Query: 260 LCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + AV I VRYY+ S L+ ++F +R HW +E+ +HW LDV E Sbjct: 120 IGTAVR---ITTHADGTQSDEVRYYMLSRFLSGKRFGEVVRGHWGIES-MHWVLDVTFGE 175 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSG 376 D + R+ A S +R AI +L K +R KM + MD ++L VLT G Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHP-EKDSIRGKMIRCLMDTSFLNEVLTLQG 231 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 203 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 89/404 (22%), Positives = 148/404 (36%), Gaps = 61/404 (15%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVI-SGAEGWEDIEDFGETH-PDFLK 59 +++ L+ + D R A + +++S +L L +CA+ +G + ++ P+ L Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 60 QY------GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS------------- 100 + IP T+ V+ + P + + +R S+ Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGG 149 Query: 101 -------------------DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 + IA+DGK LR + R + V+SA + + Sbjct: 150 IEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLA 207 Query: 142 QIKTDEKSNEITAIPELLNMLDI---KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + K+NEI LL+ LD KG ++T DA+ Q+D A + ++G YL +K N Sbjct: 208 SREIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNN 267 Query: 199 QGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q R P KE+ D + HGR E RL V V L + Sbjct: 268 Q-RGQARQLHALPWKEIPVIHRD----DARGHGRHEQRLVQVVTVNGLL------FPHAA 
316 Query: 259 KLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT---AIRNHWHVENKLHWRLDV 315 ++ R + +K Y I+ A R HW VEN +HW DV Sbjct: 317 QVLRIQRRRRLYGAKKW--SSETVYAITDLPAEEASAAEIASWARGHWTVENTVHWCRDV 374 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 NED ++R N + + +R + L R+ Sbjct: 375 TFNEDKSQVRTHNTPSVLAAVRDLIRGALKLAGYVNTAAGRRAH 418 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. Identities = 80/388 (20%), Positives = 142/388 (36%), Gaps = 46/388 (11%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAV-ISGAEGWEDIEDFGETHPDFLKQYGD 63 + ++ IPD+R A + + L + + +CAV +G + + ++ + Sbjct: 23 GIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLR 82 Query: 64 FE------NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI----------- 106 + +P TI R ++ + ++ +D D + Sbjct: 83 LPWNPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAGD 142 Query: 107 --------AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPEL 158 A+DGKT R + K +H++ + ++GQ + D KSNE T L Sbjct: 143 QAVPVRAYAVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRAL 200 Query: 159 LNMLDIKGKIITTDAMGC-QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 L L++ G ++ DA+ + ++ + ++ YL K NQ +L AF P E+ Sbjct: 201 LAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLR-AFLAALPWTEIPT 259 Query: 218 PAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEL 277 ++ HGREE R V V +DF + ++ R ++ + Sbjct: 260 ADL----TRDRGHGREETRTLKVATVTH--LDFPHAAQAIR-------IRRWRRQKGQPA 306 Query: 278 EMTVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 Y I+ A A R WH+E K H+ DV ED R G + + Sbjct: 307 SHETIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLA 366 Query: 335 GIRHIAINILTNDKVFKAGLRRKMRKAA 362 R + L R+ K A Sbjct: 367 LFRATVADTLRRAGHRSVPACRRAHKTA 394 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. 
Identities = 76/178 (42%), Positives = 109/178 (61%), Gaps = 3/178 (1%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ +PD R+ + H+L ++LL IC VISGAE W + + + D+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +GI HDT RV S + ++F CF+ W+ S + +AIDGK LR S+D + R Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGA--RS 123 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+G IT DAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPAR 181 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 193 bits (491), Expect = 8e-48, Method: Composition-based stats. Identities = 85/224 (37%), Positives = 116/224 (51%), Gaps = 11/224 (4%) Query: 111 KTLRHSYDKSRRRGAIHVISAF---STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGK 167 K + S + S S +LV+GQ K ++KSNEITAIP L+ ML+I+ Sbjct: 3 KGFQRSVKTEEKHKPSQKKSQVLKDSLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESS 62 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAHDSYA 224 IIT DAMGCQK+I I+K+ GDY+ +K NQ L + +E F +E + H Y Sbjct: 63 IITIDAMGCQKEITSLIRKKKGDYIITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQ 122 Query: 225 MSEKSHGREEIRLHIVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRY 283 E H R E R I V + W LK + + S R + + VR+ Sbjct: 123 EIETGHHRIEKREVIAVSVSSLPCLHNQDLWTELKTVVMVKSERRLWNK----TTTEVRF 178 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRG 327 YISS + ++K ATAIR+HW +EN LHW LDV +ED +IR Sbjct: 179 YISSVEKNSQKIATAIRSHWEIENSLHWTLDVTFSEDKSRIRTR 222 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 193 bits (489), Expect = 1e-47, Method: Composition-based stats. 
Identities = 77/225 (34%), Positives = 106/225 (47%), Gaps = 9/225 (4%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPAHDS--YAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIA 271 + +K HGR E R+ V + L W GL++L + R I Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWLASTGQHWAGLQRLVMLERTRQI-- 118 Query: 272 EQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 +++ YYISS + A + A IR HW +EN+LHW LDV ED IR AA Sbjct: 119 --GQKVTTERCYYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 LFSGIRHIAINILT---NDKVFKAGLRRKMRKAAMDRNYLASVLT 373 + +R I +N+ N + K L+ AA D +L Sbjct: 177 NMASLRKITLNLARLAQNRQPKKVSLKNIRNLAAWDTAMRDDILG 221 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 64/215 (29%), Positives = 99/215 (46%), Gaps = 3/215 (1%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + L +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRDG 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 121 --QVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +A + G DY+ K NQ L + E ++ Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFED 213 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 186 bits (471), Expect = 2e-45, Method: Composition-based stats. 
Identities = 64/229 (27%), Positives = 104/229 (45%), Gaps = 5/229 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H D + +A+DGK L S D + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRDG--QV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD-IKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHG 231 +Q +GGD + K NQG L E F + + G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRG 229 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 81/195 (41%), Positives = 119/195 (61%), Gaps = 2/195 (1%) Query: 94 MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFT-P 119 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 P EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 120 HRRAPIDRDTCQIEKQKGRVEARTYHVLSASDLIRDFST-WSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKELEMTVRYYISSA 288 + + + + + + S+ Sbjct: 179 RARVGVPLLHKVQSS 193 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 179 bits (453), Expect = 2e-43, Method: Composition-based stats. 
Identities = 59/227 (25%), Positives = 104/227 (45%), Gaps = 14/227 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ LM +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDG 110 F P T+ R + I + W + D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIIT 170 K +R + K++ IH ++AF +V+ Q DEK+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKIQK-QGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 DA+ Q + A I + + DY+F VK NQ + + E P + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIE-SLPWEAFP 445 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 76/284 (26%), Positives = 121/284 (42%), Gaps = 15/284 (5%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L+ + + D R H L ++L L + A + GA+ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD----KDVIAIDGKTLRHSYD 118 +G P HDT +RV + P + F +M + K V+AIDGK+LR YD Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R ++S + I ++ +EI A +L L +KG +T DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 +A+ + Y +K N G L +A E F + E+ HGREE R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGFAA----VTDLAVFETRERGHGREEQRRA 234 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVR 282 V V D L+ GLK + + R+ + E VR Sbjct: 235 SVLPV-DRLVKRPS-LPGLKAIGRIEAVRT---GANGKPEQAVR 273 >UniRef50_Q2JGT3 Transposase, 
IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 60/266 (22%), Positives = 113/266 (42%), Gaps = 22/266 (8%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + L+ ++ +PD R+ + ++ + +L + +CA++SGA + I ++ P + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-------------KDVIAI 108 +P TI RV+ + A W++ + D + V+A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGK 167 DGK +R + +H++ +V+ Q+ DEK+NEI +L+ + D+ Sbjct: 167 DGKAMRAT---RHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSE 227 +IT DAM Q A+ + +G L VK NQ ++ + P K++ + + Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKT-LPWKDVPV----GHTTTG 278 Query: 228 KSHGREEIRLHIVCDVPDELIDFTFE 253 + HGR E R VP L Sbjct: 279 RGHGRIETRTLKAVTVPAGLGFPHAA 304 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 168 bits (426), Expect = 3e-40, Method: Composition-based stats. 
Identities = 54/187 (28%), Positives = 93/187 (49%), Gaps = 4/187 (2%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL-K 59 + L+ + +PD R+A + L +L+ T+ A++SGA + I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHS 116 +G PV +T+ V+ + + F + + K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D+KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIAEK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 166 bits (419), Expect = 2e-39, Method: Composition-based stats. Identities = 58/223 (26%), Positives = 99/223 (44%), Gaps = 19/223 (8%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETH-PDFLKQYGDFENG-- 67 + + D R+A + H +LL+ + V++G +E I + + L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 68 ----IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 P TI R++S P + ++ + IAIDGKT+R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYI---VAHSSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTDE-KSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAM 225 +I+++GGDY+F VK N+ L +P D Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDDLLRT 440 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 164 bits (414), Expect = 6e-39, Method: Composition-based stats. 
Identities = 62/189 (32%), Positives = 87/189 (46%), Gaps = 10/189 (5%) Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPAHDS---YAMSEKSHGREEIRLHIVCDVPDELI 248 + AVK NQ L E + S + +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFE 60 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W GL+ + + S R I + RYY+SS A + A A+R HW +E+ Sbjct: 61 PDL--WPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAHWGIES- 113 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA +Y Sbjct: 114 MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAGASDDYR 173 Query: 369 ASVLTGSGL 377 A +L L Sbjct: 174 AQLLGLKTL 182 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 163 bits (413), Expect = 8e-39, Method: Composition-based stats. Identities = 55/194 (28%), Positives = 85/194 (43%), Gaps = 7/194 (3%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHIV 240 EKI ++ GDY+ +K N + E F + P +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + +YISS D+ + A +R Sbjct: 61 LKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLTA 174 Query: 361 AAMDRNYLASVLTG 374 A + +L G Sbjct: 175 AGWSDEFRDELLLG 188 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 163 bits (412), Expect = 9e-39, Method: Composition-based stats. 
Identities = 56/224 (25%), Positives = 100/224 (44%), Gaps = 15/224 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG-ETHPDFLKQ 60 +++ L +PD R +H L IL + + AV++ A+ + + ++ LK+ Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLR 114 N P T+ RV+ + W+ + +A+DGK L+ Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAWLLGIAGFEA---VAVDGKVLK 335 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 + + + +H++SAF I Q + K+NEI + LL +DI+ K++T DA+ Sbjct: 336 GAVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADAL 393 Query: 175 GCQKDIAEKIQK-QGGDYLF-AVKGNQGRLNKAFEEKFPLKELN 216 Q+ A + + + DYLF AVKGNQ +L + P + Sbjct: 394 HTQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLI-CLPWGDFP 436 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 57/165 (34%), Positives = 88/165 (53%), Gaps = 3/165 (1%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI 106 + + L+ + NG P DT RV+ I P + C + ++ S + I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKG 166 AIDGK L+ S K+ G+ H++SA+ L + Q EK NE+ AIPE+L+ LD+ G Sbjct: 61 AIDGKRLKGSKKKT---GSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP 211 +I+ DAMG Q +IAE+I + DY+ ++KGNQ L + + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCFT 162 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. 
Identities = 65/189 (34%), Positives = 96/189 (50%), Gaps = 8/189 (4%) Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDV 243 I + GDYL VKGNQ +L +A E F + + + + D A+ E+ HGR ++ V Sbjct: 2 IIAKKGDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLSA 60 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 I +W + S R + +KE ++ YYI+S LTAE+ A ++R W Sbjct: 61 KG--IINPGDWPNCVTIGRIDSMRVV---DEKESDLERCYYITSRALTAEQLAASVRARW 115 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKA 361 VEN+ HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + A Sbjct: 116 GVENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKGA 175 Query: 362 AMDRNYLAS 370 A D Sbjct: 176 ARDDGVREP 184 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. Identities = 72/273 (26%), Positives = 111/273 (40%), Gaps = 13/273 (4%) Query: 58 LKQYGDF-ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 L + D E + + ++ + F S +K + DGK LR S Sbjct: 8 LCAFLDIPETTVVSRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGS 67 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDE-KSNEITAIPELLNMLDIKGKIITTDAMG 175 + ++RG V+ I Q D K +EI + LL+ D+ + IT DA+ Sbjct: 68 IESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALH 126 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 E I K GG +L +K NQ L + + P D + +HGR E Sbjct: 127 LCPSTTEMITKAGGVFLIGLKENQPTLLA------HMTDCALPPIDQKTTFDFNHGRVEQ 180 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R + + DV + D ++ K+L R I ++ ++ V YYIS+ E Sbjct: 181 RKYWLYDVSKQGFDPRWDNTAFKRLVKVQRTR--INQKNAKISREVSYYISNETA-KEGI 237 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 A+RNHW VE H DV +NED K ++ Sbjct: 238 FDAVRNHWSVEVNNH-IRDVTLNEDQLKSKKRQ 269 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 154 
bits (388), Expect = 7e-36, Method: Composition-based stats. Identities = 48/180 (26%), Positives = 84/180 (46%), Gaps = 4/180 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ ++ +PD+R A + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 EN-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK--SR 121 P T RV+ I F NW+ ++D + +DGK+++ + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 122 RRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + I+V+S FS + I Q +++ +EI + LL LD++G + T D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 152 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 49/180 (27%), Positives = 82/180 (45%), Gaps = 3/180 (1%) Query: 20 WKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVV 78 H L +L L AV+ G + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 I P + W+ + D + +A+DGK LR S D H ++A++ + Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGDV--PGPHRVAAYAPHAAA 119 Query: 139 VIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 V+GQI+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ +G Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQ 179 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 151 bits (380), Expect = 5e-35, Method: Composition-based stats. 
Identities = 58/142 (40%), Positives = 82/142 (57%), Gaps = 3/142 (2%) Query: 101 DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 VIAI+GK+LR + + A+H +SA++ + L +GQ+ EKSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPA 219 L ++G ++T DA+GCQ +AE+I GGDY+ AVK NQ L A + F L +P Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 220 HDS--YAMSEKSHGREEIRLHI 239 + + +K HGR E R Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 65/142 (45%), Positives = 92/142 (64%), Gaps = 4/142 (2%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D + R IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHDGA--RSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDS--Y 223 G IT DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AMSEKSHGREEIRLHIVCDVPD 245 + ++K+HGR E R + + Sbjct: 119 SQTDKNHGRIETRRCVATNDVA 140 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. 
Identities = 61/244 (25%), Positives = 97/244 (39%), Gaps = 17/244 (6%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L + + A + G+ + T D + P T V+S + PA + Sbjct: 3 LLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLNA 62 Query: 89 CFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKT 145 ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 63 RMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLAV 120 Query: 146 DEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRLN 203 EKSNEI + LL +L ++T DAM Q A+ I YL VK NQ ++ Sbjct: 121 AEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKIL 180 Query: 204 KAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVA 263 A P E+ A D + HGR + R + + + K++ Sbjct: 181 -ARITALPWAEVPAAATD----DSRGHGRVKTRTLQIITAARGI-----GFPYAKQIIRI 230 Query: 264 VSFR 267 R Sbjct: 231 TRER 234 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 147 bits (371), Expect = 6e-34, Method: Composition-based stats. Identities = 59/199 (29%), Positives = 100/199 (50%), Gaps = 13/199 (6%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 K + I + G DY+ AVKGNQ RL++ K ++ + D +E+ R Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQI--KLTTEQRLPVSLD--ITTERRSDRITT 56 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R V D+L +++W+GL++L F + + + YYISS + A +F Sbjct: 57 RS---VSVFDDLSGISYDWEGLQRLVKVERFGTRAGKPYH----QIVYYISSLTINAAQF 109 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL + + + Sbjct: 110 AQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILRYNGY--SSIT 167 Query: 356 RKMRKAAMDRNYLASVLTG 374 +R + + + ++ Sbjct: 168 TGIRLISHNLEQIFQLIRN 186 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 146 bits (367), Expect = 2e-33, Method: Composition-based stats. 
Identities = 51/196 (26%), Positives = 88/196 (44%), Gaps = 9/196 (4%) Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHI 239 ++E+ ++ DY+ A+KGN + + ++ F + +K HGR E R++ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSP--VTSTRSVHTTFDKGHGRIERRIYT 58 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 D + EWK L + S + ++ +E +RY+I+S ++FA + Sbjct: 59 -LDTNIGWFEDKKEWKHLAGFGMVDSMVTRKGKECRE----IRYFITSVT-DVKQFAKGV 112 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 +HW +EN LHW LDV+ +D+C + NAAE + IR I N + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKR- 171 Query: 360 KAAMDRNYLASVLTGS 375 D + A +L Sbjct: 172 ACIYDDEFRAQILFSC 187 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 65/325 (20%), Positives = 117/325 (36%), Gaps = 43/325 (13%) Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSY 117 + G P ++T+ +++C+ WM + A DGK L S Sbjct: 14 WRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVLGGS- 71 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 K A+H + + + + Q + + A+ LL + G++++ DA Sbjct: 72 -KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFLN 129 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP-------------------------- 211 + + I ++ G+YL VKG+Q ++ P Sbjct: 130 AAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPRR 189 Query: 212 -------LKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD--ELIDFTFEWKGLKKLCV 262 +EL E+S GR EIR V D D + + W+ + ++ Sbjct: 190 KRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVTQIGG 249 Query: 263 AVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 + +E +SS T +F +IRNHW +EN++H D M ED Sbjct: 250 LRRWCRRRHADLWTVEEVTV--VSSRQRTPAQFLASIRNHWTIENQVHRPRDGSMQEDRL 307 Query: 323 KIRRGNAAELFSGIRHIAINILTND 347 R + + R++ IN++ Sbjct: 308 HGR--AIGVILAVCRNVVINLIRRH 
330 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. Identities = 60/194 (30%), Positives = 84/194 (43%), Gaps = 11/194 (5%) Query: 186 KQGGDYLF--AVKGNQGRLNKAFEEKFPLKELNNPAHDS---YAMSEKSHGREEIRLHIV 240 +G + +G L A + F + + +K HGR E R Sbjct: 91 DRGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTA 150 Query: 241 CDVPDEL--IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATA 298 D L + WK + + S R I + E RY ISS +E+ A Sbjct: 151 AGDLDWLATLGLKERWKKITSVAGIDSSRVI----GSKTETDRRYVISSLPADSERILHA 206 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R HW +EN LHW LDV ED C IR NAA FS +R A+N+ D GL +K Sbjct: 207 VRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFRADHSRAMGLPKKR 266 Query: 359 RKAAMDRNYLASVL 372 + AA + +YLA++L Sbjct: 267 KAAAWNPDYLANIL 280 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 57/180 (31%), Positives = 87/180 (48%), Gaps = 5/180 (2%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK-QY 61 + L + IPD+R+A L +LL +I A++SGA + I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 G P + +I + + F ++ VIA+DGKTLR S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARL--AEGAAVIALDGKTLRGSLDRFE 118 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD--EKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A V+SAF+T +V+GQI + K +EI A L+ L + G++ T DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 143 bits (361), Expect = 9e-33, Method: Composition-based stats. 
Identities = 74/318 (23%), Positives = 123/318 (38%), Gaps = 45/318 (14%) Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS-------SDDKDVIAIDGKTLR 114 G P T+ R+++ SPA E ++D + V++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 HSYDKSRRRGAIHVISAFSTMHS------------------LVIGQIKTDEKSNEITAIP 156 D + +GA SA+ S +GQ K E TA Sbjct: 153 SRTDGEKVKGAQQ--SAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFR 210 Query: 157 ELL----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL 212 LL L + +I+T DA C ++ AE + G Y+F +K NQ L+ + Sbjct: 211 RLLPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHDIARDY-GQ 269 Query: 213 KELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 +L P +E+ G +R DV + L +C + R Sbjct: 270 YDLGTPLA---RTAERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDRR---- 322 Query: 273 QKKELEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + + + RY+++S LT ++ +R HW +EN HW +DV++ ED+ + + Sbjct: 323 -GEIVAVEQRYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASR 381 Query: 330 A--ELFSGIRHIAINILT 345 A E S +R I N ++ Sbjct: 382 ASIETVSWLRLIGYNAVS 399 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 142 bits (357), Expect = 3e-32, Method: Composition-based stats. Identities = 57/167 (34%), Positives = 85/167 (50%), Gaps = 13/167 (7%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K E SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPAHDSYAMSEKSHGREEIRLHIVCDVP 244 DY+ +K NQG L ++ E+ F H +Y E HG EIR P Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIRNFGFQLDP 120 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 D + W LK + + I + + + RY+ISS D Sbjct: 121 DSV------WSNLKSVGMVE----PIGQVDDKTTVETRYFISSLDSN 157 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. 
PS RepID=A7C6J6_9GAMM Length = 193 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 62/201 (30%), Positives = 95/201 (47%), Gaps = 13/201 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDTAKNSP 61 Query: 215 LNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 LN ++ ++K HG H + + +W GL++ S R Sbjct: 62 LNA-----WSWTQKGHGH---ESHCRLKIWEATESMKMQWAGLERFI---SIRRQGFRHH 110 Query: 275 KELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 K+ + T Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 111 KKFDSTT-YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINILTNDKVFKAGLR 355 +R+IA N L V L+ Sbjct: 170 ILRNIAFN-LRLGTVSNPSLK 189 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 61/158 (38%), Positives = 90/158 (56%), Gaps = 3/158 (1%) Query: 99 SSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPEL 158 + D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT+EKSNE TAIP+L Sbjct: 3 ARIPGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKL 62 Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---L 215 +L ++ +T DA+G Q+DIA++I + DYL VK NQ L++ + + E Sbjct: 63 FTLLALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGF 122 Query: 216 NNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFE 253 DS HGR + V L + Sbjct: 123 TEDFTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALADK 160 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. 
Identities = 56/208 (26%), Positives = 87/208 (41%), Gaps = 15/208 (7%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +L +++ GK IT DA+ QK +AE I + YLF VK NQ L + F + Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYF--EH 59 Query: 215 LNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 P D HGR + R +E ++F + + +S + Sbjct: 60 RKEP--DYCLQDPPGHGRIDTRSIWTTTELNEYLEFPHVGQAF-----CIHKKSYDPKTN 112 Query: 275 KELEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 K E T Y ++S + R HW +EN H+ LD +ED +IR GN Sbjct: 113 KVCENTF-YGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRKMR 359 + +R AI +L + V + +K+R Sbjct: 172 NTNRLRGFAIGLLKSKGVK--DIAQKVR 197 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 47/190 (24%), Positives = 80/190 (42%), Gaps = 6/190 (3%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ H+ IPD R + +LL+ + ++S E D+E F H L + Sbjct: 12 DLISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGI 71 Query: 65 ENGIPVHDTIARVVSC-ISPAKFHECFINW--MRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 E P D+ R + A +W + + D D + DGKTLR S + + Sbjct: 72 ELKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTS 131 Query: 122 RRGAIHV--ISAFSTMHSLVIGQ-IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 GA + ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 132 GGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 Query: 179 DIAEKIQKQG 188 Q +G Sbjct: 192 AFFGSSQSRG 201 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. 
Identities = 44/187 (23%), Positives = 81/187 (43%), Gaps = 13/187 (6%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L +S +PD R A + L +L L + A +S + +E F +P L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H I ++ + P K + D +V+ +DGK LR S + Sbjct: 60 RKAPGHTAITLLLHRLDPEKLQAALGQVFPEA---DLGEVLVVDGKHLRGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q + + + E A ELL+ L +++GK++ DA ++A Sbjct: 115 VKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAA 172 Query: 183 KIQKQGG 189 +++K+GG Sbjct: 173 RVRKKGG 179 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 57/146 (39%), Positives = 77/146 (52%), Gaps = 7/146 (4%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA---HDSYAMSEKSH 230 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F N D KSH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL 290 GR E R V L D + W+GL+ + + S R++ K++ + RYYISS Sbjct: 61 GRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTL----KEKTTIEHRYYISSTMA 116 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVV 316 TA + R HW +EN LHWRLD+ Sbjct: 117 TAAYLLNSSREHWGIENSLHWRLDIA 142 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 134 bits (338), Expect = 4e-30, Method: Composition-based stats. 
Identities = 50/162 (30%), Positives = 76/162 (46%), Gaps = 6/162 (3%) Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 L +SY EK HGR+E+R V +W +K + V RS+ + Sbjct: 8 ALPEDKQESYITEEKGHGRKEVREVYVLPAAFS-EALRQKWCLVKSIVAVVRDRSVKGKG 66 Query: 274 KKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELF 333 E YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I G++A Sbjct: 67 SYETS----YYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRIYAGDSALNM 122 Query: 334 SGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGS 375 + R N+ + + RKM +AA +++Y VL S Sbjct: 123 ACCRRFVQNLFRKSE-GNLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 134 bits (336), Expect = 7e-30, Method: Composition-based stats. Identities = 48/167 (28%), Positives = 79/167 (47%), Gaps = 9/167 (5%) Query: 3 LKKLMGHISIIPDYRQAWK--MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL S IPD+R+A K + HKL D+++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-----DKDVIAIDGKTLRH 115 NGIP T+ R+ I + H ++++ IDGK R Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML 162 + K+ R I +SA S + + +EKSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. 
Identities = 60/162 (37%), Positives = 81/162 (50%), Gaps = 7/162 (4%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 G H++SA++T H + +G + T+EKSNEITAI LL L K ++T DAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFE---EKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 I GGD++ AV+ NQ +L A EK E H ++ HGR + R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVR 282 VP + EW +K + AV I VR Sbjct: 122 AQVPPD-FAAKGEWPWIKAIGTAVR---ITTHPDGTQTDEVR 159 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. Identities = 83/99 (83%), Positives = 89/99 (89%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSGL 377 IAINILTN++VFKA RRKMRKA MD+NYLASVL G+G Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 4/109 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW L Sbjct: 7 WEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHWCL 62 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 D+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 63 DIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. 
Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 4/119 (3%) Query: 248 IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVEN 307 +D W LK + + S I + + + RY+ISS D E+ A ++R+HW +EN Sbjct: 9 LDPDSVWSNLKSVGMVES----IGQVDDKTTVETRYFISSLDSNGEQLANSVRSHWAIEN 64 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 65 SLHWVLDVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 130 bits (328), Expect = 5e-29, Method: Composition-based stats. Identities = 46/187 (24%), Positives = 83/187 (44%), Gaps = 13/187 (6%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L +S IPD R A ++ L +L L + A +S + +E F +P L G Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H + ++ + P K E + +D +V+ +DGK L+ S + Sbjct: 60 RKPPGHTILTLLLHRLDPEKLQEALLQVFP---GADLGEVLVVDGKHLKGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q K + + E A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGR--EDQALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. 
Identities = 66/359 (18%), Positives = 113/359 (31%), Gaps = 72/359 (20%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 L+ +L L V++G + + + ++ P L GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FHECFINWMRDCHSSDDKDV--IAIDGKTLRH--SYDKSRRRGAIHVISAFSTMHSLVIG 141 E W+ + D +A DGKTL+ S+ ++ V+ A + G Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITAG 167 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 + +EI A+ L LD+ ++TT ++G Sbjct: 168 HQRVVG-GDEIAALEALAGRLDLTDVLVTT-------------AEKG------------- 200 Query: 202 LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 HGR E+R V + + G K++ Sbjct: 201 ----------------------------HGRVEVRSLKALTVTTPKLVGFW---GTKQVI 229 Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT-------AIRNHWHVENKLHWRLD 314 ++ + L AE+ R HW VE +H D Sbjct: 230 ELRRRTRRKKTVTAAPTVSEEVFYLVTSLPAEQAHPRDLAARARARGHWTVEA-IHHVRD 288 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 V++ED R NA ++ R AI+ L + + +R A + + Sbjct: 289 RVLDEDRHTARTANAPLAWAIARDTAISALRL--TGHRSIAKALRTTARQPERVLQTIA 345 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. 
Identities = 46/187 (24%), Positives = 76/187 (40%), Gaps = 17/187 (9%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK--------FPLKELNNPAHDSYAM 225 M Q D+ +Q++GGDY+ K NQG L E FP D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYI 285 E S G + + L ++ W G++++ R + + + V Y I Sbjct: 61 CEVSKGHGWVERRTMTS-TIWLNEYLTRWPGVQQVFRLTRTRQV----GGKTTVEVVYGI 115 Query: 286 SSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS A R HW +E++ H D + ED C++RRG A + + +R++A+ Sbjct: 116 SSLSSVAAAPDALLRYTRTHWGIESRHH-IRDATLGEDRCRVRRGAAPRVLAVLRNVAVY 174 Query: 343 ILTNDKV 349 +L Sbjct: 175 LLRRLGT 181 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 64/118 (54%), Gaps = 4/118 (3%) Query: 261 CVAVSFRSIIAEQKKELEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R+I+A E VRYY++S D T EK A+AIR HW + N LHW+LDV E Sbjct: 1 VRIKSERTIVA--IGEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFRE 58 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSGL 377 D K + NAA FS +A+ IL N+K K + K KA D NYL+ +L + Sbjct: 59 DYSK-KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLLQDNNF 115 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 125 bits (314), Expect = 3e-27, Method: Composition-based stats. 
Identities = 47/210 (22%), Positives = 91/210 (43%), Gaps = 14/210 (6%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK--- 59 + + +++ IPD R+ K +H+ D+LL+ + AV SG + + + FL Sbjct: 6 IPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDEV 65 Query: 60 --QYGDFENGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKD-----VIAIDG 110 + E +P T+ R+ + + ++W R+ + K+ +A+DG Sbjct: 66 HIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVDG 125 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKII 169 K LR + R A+ +SA L +G Q D ++ + + L + ++ Sbjct: 126 KHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVD-WVL 184 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 T DA C +++A + +Q G A KG + Sbjct: 185 TGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 125 bits (313), Expect = 3e-27, Method: Composition-based stats. Identities = 46/202 (22%), Positives = 77/202 (38%), Gaps = 50/202 (24%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPAHDSYAMSEKSH 230 MGCQK+IA+ I KQ DY+ A+KG+ L +A+ K + D + + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL 290 GR E R V ++ ++W GLK + S Sbjct: 61 GRIETRRCQQVLVNKSWLNNKYQWVGLKSIIKVTS------------------------D 96 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 EK T + +IR+G F+ +R IA+ + ++ Sbjct: 97 VHEKTTT-----------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQTK 133 Query: 351 KAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 134 RASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 117 bits (292), Expect = 1e-24, Method: Composition-based stats. 
Identities = 33/128 (25%), Positives = 61/128 (47%), Gaps = 3/128 (2%) Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 + + + GLK + + + + R+ ISS DL + A+R+HW Sbjct: 20 KKWLAKAYRRSGLKSIIKV--HTQVHDKSTGKDTAETRWNISSLDLHVVQALNAVRSHWQ 77 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 VE+ +HW LD+ D+ +I R +F+ +R IA+ + D + RK + A +D Sbjct: 78 VES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIAMTLFKQDTTKLVSMARKKKMAGLD 136 Query: 365 RNYLASVL 372 +Y +++L Sbjct: 137 DDYRSNLL 144 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 115 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 43/96 (44%), Positives = 62/96 (64%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTG 374 +A+N + +K A + RK + A M L ++ Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVLDLIVNA 96 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. 
Identities = 46/177 (25%), Positives = 76/177 (42%), Gaps = 15/177 (8%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRL 237 K E + G D L +KGN +L A + A SY + R E R Sbjct: 6 KKTVETVLATGNDLLVQLKGNHPKLLAAVRTLC---QSRAHAEQSYTVDLGRRNRIEQRT 62 Query: 238 HIVCDVPD------ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 + +P F +G +++ V + + + + YY+++ + Sbjct: 63 VRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPR----QESPAYYLATCTAS 118 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 119 AATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRN--PGVFALLRHFALNLLRHNG 173 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 38/96 (39%), Positives = 58/96 (60%), Gaps = 1/96 (1%) Query: 3 LKKLM-GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L S IPD R +H +I+ L + +V++GA+ + +IEDF E H D+LK Y Sbjct: 1 MEGLFVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTY 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 + NGIP HDT +RV S I+PA F + F+ W++ Sbjct: 61 FNLPNGIPSHDTFSRVFSAINPASFQDSFLIWLKAI 96 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 112 bits (280), Expect = 2e-23, Method: Composition-based stats. 
Identities = 49/205 (23%), Positives = 85/205 (41%), Gaps = 18/205 (8%) Query: 100 SDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL 159 + + IA+DGK L+ S + R H++SA + + + +++ K+NE T LL Sbjct: 128 AGPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKPLL 185 Query: 160 NMLDIKGKIITTDAMG-CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 LD+ ++T DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 186 APLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIPV- 243 Query: 219 AHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELE 278 +A SE HGR E C +PDEL + L A+ K Sbjct: 244 ---QHAASEVGHGRRESSSIKTCAIPDELGGIAYPHARL-----AIRVHRRCQPTGKRES 295 Query: 279 MTVRYYISSADLTAEKFATAIRNHW 303 Y ++S D A R W Sbjct: 296 RESVYAVTSLDAH-----QATRPIW 315 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 73/88 (82%), Positives = 76/88 (86%) Query: 271 AEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 EQKKE EMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAA Sbjct: 18 TEQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAA 77 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKM 358 ELFSGIR IAINILT DK+ KAG R KM Sbjct: 78 ELFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. 
Identities = 44/112 (39%), Positives = 65/112 (58%) Query: 263 AVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 A+ + +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E Sbjct: 3 AIGMTINLVKQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQS 62 Query: 323 KIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTG 374 +IR+G+A FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 63 RIRKGHADINFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLG 114 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 107 bits (268), Expect = 5e-22, Method: Composition-based stats. Identities = 53/128 (41%), Positives = 69/128 (53%), Gaps = 1/128 (0%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREE 234 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + Y E+S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK 294 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 71 SRAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEE 129 Query: 295 FATAIRNH 302 TA R H Sbjct: 130 LLTASRLH 137 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 34/112 (30%), Positives = 59/112 (52%), Gaps = 3/112 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTS--DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ L Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQTNLDRL 111 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. 
Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 9/152 (5%) Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTV 281 + S +S GREE R V + + EW+ ++ + + + + Sbjct: 3 EHTHSIQSRGREEHRCIQVY---EPVGIALQEWEAIRSVLCVQRWGTRQGKAYHNTA--- 56 Query: 282 RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAI 341 YYISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I I Sbjct: 57 -YYISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVI 115 Query: 342 NILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 NIL + L+ M K A + + S+LT Sbjct: 116 NILRLNGY--QSLKTAMTKLANRVDIIFSLLT 145 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 44/172 (25%), Positives = 66/172 (38%), Gaps = 10/172 (5%) Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSH-GREEIRLHIVCDV 243 G L +K NQ L+ A E +P D + E R E R V + Sbjct: 2 IATGNHLLVQLKRNQPLLHDAMVEYT----RGHPFVDEHHTHEIGRRNRIEKRAVHVWHL 57 Query: 244 PDELIDFTFEWKGLKKLCVAVS--FRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 L + + L R + + YY+ L A +F+ AIRN Sbjct: 58 HPSLGSAPWY-DHFRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRN 116 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 HW VEN+ H+ D ED +IRR F+ +R A+N++ ++V Sbjct: 117 HWRVENRAHYVRDTRFQEDASRIRRN--PCTFALLRSFALNLMRFNRVENIS 166 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. 
Identities = 37/125 (29%), Positives = 63/125 (50%), Gaps = 11/125 (8%) Query: 224 AMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRY 283 SEK HGR E R + +WKGLK+ R++ ++ + V Y Sbjct: 2 TTSEKGHGRIEKRTLETTPIVT----VGQKWKGLKQGLRITRERAVKGKK----TVEVVY 53 Query: 284 YISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 I+S A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ Sbjct: 54 GITSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVV 113 Query: 341 INILT 345 +++L Sbjct: 114 VHLLA 118 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 100 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 37/142 (26%), Positives = 66/142 (46%), Gaps = 6/142 (4%) Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK 294 R + +P + + G+K + + S E + RYY++S + Sbjct: 2 RRRYFAYRLPKTINTGSL--VGIKSIIATETISSKTNET--AISAEWRYYVTSHETEKSD 57 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKA 352 +RNHW +EN+LHW LDV +N+D K R A FS I+ + ++++ K Sbjct: 58 LHLYVRNHWSIENELHWHLDVHLNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKR 117 Query: 353 GLRRKMRKAAMDRNYLASVLTG 374 +R ++++ D YL S+L+ Sbjct: 118 SVRSRLKQVGWDTEYLVSLLSA 139 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 99.4 bits (246), Expect = 2e-19, Method: Composition-based stats. 
Identities = 29/131 (22%), Positives = 55/131 (41%), Gaps = 6/131 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L ++ +PD R + L IL + +CAV++GA + I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRD------CHSSDDKDVIAIDGKTLRHSY 117 F + +P T+ R++ I + W+R VIA+DGK +R + Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKSRRRGAIHV 128 ++ A+ + Sbjct: 149 LRAAGPSALGL 159 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 99.4 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 4/120 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H I D R +H L +I+LL I AV+SG+EGWE IE+FG D+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKDVIAIDG--KTLRHSYDKSR 121 GIP HDTIARV+ + + + + D + + G + H + Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREG 126 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 3/79 (3%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPAHDSYAMSEKSHGREE 234 K+IA+ I KQ DY+ A+KG+ L +A+ K + D + + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 IRLHIVCDVPDELIDFTFE 253 R V ++ + Sbjct: 147 TRRCQQVLVNKSWLNNKYR 165 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 99.0 bits (245), Expect = 2e-19, Method: Composition-based stats. 
Identities = 26/85 (30%), Positives = 42/85 (49%) Query: 7 MGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFEN 66 + H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSCISPAKFHECFI 91 GIP HDT RV+ + P + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 65/371 (17%), Positives = 120/371 (32%), Gaps = 45/371 (12%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 + +PD R L++IL + +++GA + E+ + ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSD-------DKDVIAIDGK-----TLRHSY 117 T + + +R V+A+DGK TL H Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 118 DKSRRRG--------AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIK 165 +++ + S I + ++NE +L L Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ---GRLNKAFEEKFPLKELNNPAHDS 222 +++T DA + + G DY+FA+K + +L + + D+ Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARREDVLDN 259 Query: 223 YAMSEKSHGREEIRLHIVCDVPDELIDFTFE---WKGLKKLCVAVSFRSIIAEQKKELEM 279 + + EI++ V E W + S + +E Sbjct: 260 ATTATR-----EIQILAVDPSHGYGAGKGPEESVWSHARTFLRVTS---TVRRSGVVIER 311 Query: 280 TVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCK--IRRGNAAELFS 334 R ++SS LT +++ +R HW VEN H LD ED+ N Sbjct: 312 DSRLFVSSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDERPWIAADANGMLAVL 371 Query: 335 GIRHIAINILT 345 +R IA +L Sbjct: 372 LLRRIAYTLLA 382 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 98.2 bits (243), Expect = 4e-19, Method: Composition-based stats. 
Identities = 38/80 (47%), Positives = 51/80 (63%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H + D R +H L DI+LL I AV+SG+EGWEDIE+FG D+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSCISPAK 85 GIP HDTIARV+ + + Sbjct: 67 AGIPRHDTIARVICRLKADE 86 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 97.9 bits (242), Expect = 6e-19, Method: Composition-based stats. Identities = 34/75 (45%), Positives = 52/75 (69%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++++ + + D R A + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 97.5 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 34/122 (27%), Positives = 57/122 (46%), Gaps = 11/122 (9%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISS 287 K HGR E R L ++ W G++++ R + + V Y ISS Sbjct: 3 KGHGRVERRSITTTT---WLNEYLTRWPGVQQVFRLERQR----RADGKTTVEVVYGISS 55 Query: 288 ADLTAE---KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 A R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ +L Sbjct: 56 LSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYLL 114 Query: 345 TN 346 Sbjct: 115 RR 116 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 97.1 bits (240), Expect = 1e-18, Method: Composition-based stats. 
Identities = 35/88 (39%), Positives = 51/88 (57%), Gaps = 1/88 (1%) Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 S A+H++SAF + +V+ Q+ EKSNEI A ELL LDI G +T DAM Q+ Sbjct: 2 ASETVKAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQR 61 Query: 179 DIAE-KIQKQGGDYLFAVKGNQGRLNKA 205 + A ++ + D++ VK NQ L +A Sbjct: 62 EHARFAVEDKRADFVMTVKDNQPELREA 89 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 95.9 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 49/77 (63%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + ++ H S + D RQ+W++ + L +I LL +CA +SG E + +I +G+ +FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV 77 + +E G+P HDT+ + Sbjct: 77 FLPYERGLPAHDTLKGL 93 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 95.2 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 42/157 (26%), Positives = 70/157 (44%), Gaps = 10/157 (6%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ L FE + + PA +++ K R+E R V V D L ++ Sbjct: 1 MKANQSNL---FETACAIAANDAPADTAFSR-NKGRSRQEDRTVEVFPVGDALAGTEWQ- 55 Query: 255 KGLKKLCVAVSFRSIIAEQKK--ELEMTVRYYISSA-DLTAEKFATAIRNHWHVENKLHW 311 +K + + + + V +Y+SSA + A +A AIR HW +EN+ H+ Sbjct: 56 PFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNHY 115 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 DV +ED +IR + + R A+NI+ + Sbjct: 116 VRDVSCDEDKSRIRDN--PGIMARARSFALNIMRKNG 150 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 94.8 bits (234), Expect = 4e-18, Method: Composition-based stats. 
Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 7/132 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYIS 286 ++ HGR R + +P+EL + G+K R + + + YYI+ Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHNHAL--SGIKSCIAVE--RIVQEGKGEPKTSHFSYYIT 89 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 + + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 90 NHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLVKA 148 Query: 347 DK--VFKAGLRR 356 K ++ Sbjct: 149 KDWAGKKKSVKS 160 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 92.9 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 60/148 (40%), Gaps = 9/148 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHPD 56 +++ L + + D R+ H++S +L + A + G +G++ I + + Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + + IP I V+ P + + D + +A DGKT++++ Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNED--QGLEDTCLAFDGKTMKNA 331 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIK 144 D++ R+ H+ S Q K Sbjct: 332 IDENARQT--HIASVVGHESKTTHTQKK 357 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 91.3 bits (225), Expect = 5e-17, Method: Composition-based stats. 
Identities = 39/133 (29%), Positives = 51/133 (38%), Gaps = 8/133 (6%) Query: 224 AMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEL---EMT 280 HGR+E R V DV L W GL V+ + + K L Sbjct: 6 TTDRGRHGRQEHRWVEVFDVSGRLGPT---WDGLIAAVARVTRLTWHKDTKSGLWHKTQE 62 Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 Y +L A TAIR HW VE + H+ DV ED +IR F+ +R A Sbjct: 63 TALYACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIRTK--PGHFARLRSFA 120 Query: 341 INILTNDKVFKAG 353 +NIL + Sbjct: 121 LNILRANGTNNIS 133 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 91.3 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 29/115 (25%), Positives = 48/115 (41%), Gaps = 6/115 (5%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF- 64 L+ S I D R+ + L+ +LL T+ A+++GA + ++ F TH D L D Sbjct: 5 LLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFDLS 64 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCH-----SSDDKDVIAIDGKTLR 114 P + T+ ++ I + F + + IAIDGKT Sbjct: 65 LRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWC 119 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 90.9 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 51/120 (42%), Gaps = 5/120 (4%) Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTV-RYYISSADL 290 R E + V L+ ++ L+++ + K E + +SS Sbjct: 1 RIETQTIRVSS----LLKGYSDFPHLEQVFRIDRVTRFKKKGKTRKETALGVTSLSSGQA 56 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 + + +R HW +EN+LHW D V ED C R GN A + + +R++ I++L Sbjct: 57 SPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGAHVMATLRNMTISLLRVAGSK 116 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 90.5 bits (223), Expect = 8e-17, Method: Composition-based stats. 
Identities = 35/130 (26%), Positives = 56/130 (43%), Gaps = 2/130 (1%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +L ++ IPD+R+A + L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 2 QLKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQL 61 Query: 65 -ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-IAIDGKTLRHSYDKSRR 122 PVH +I + + F + IA+DGKTLR + + R Sbjct: 62 HWKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSR 121 Query: 123 RGAIHVISAF 132 SA Sbjct: 122 TARPLRYSAH 131 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 90.5 bits (223), Expect = 1e-16, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 59/150 (39%), Gaps = 9/150 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHPD 56 +++ L + + +PD +A H+L +L L A + G +G++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + +P I + + P W + ++ +A+DGK ++ Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAW--QAAQLNSEEALAMDGKIMKGG 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTD 146 D + + H++S + Q K+ Sbjct: 125 VDHTGAQT--HIVSLIGHESKHCVAQKKSA 152 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. 
Identities = 30/212 (14%), Positives = 68/212 (32%), Gaps = 34/212 (16%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICA-VISGAEGWEDIEDF-----GETHPDF 57 + + + + D R + + + +C+ +G + + G Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 58 LKQYGDFEN-GIPVHDTIARVVSCISPAKFHECFINWM---------------------- 94 + F +P TI + + + ++ Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAA 141 Query: 95 ---RDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 + +A+DGKT RH+ K +H++ S ++ Q++ + K+NE Sbjct: 142 PVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAKTNE 199 Query: 152 ITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 LL LD+ ++T DA+ + + Sbjct: 200 TAVFRRLLRPLDLTNVLVTADALHTVRANLDT 231 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 88.6 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 1/77 (1%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + + D R A+ +H DI+ L + AVISGA W +I+ FGE H D+L++Y F Sbjct: 2 SVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPF 60 Query: 65 ENGIPVHDTIARVVSCI 81 E GIPV DTIARV+ I Sbjct: 61 ECGIPVDDTIARVIKRI 77 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 88.6 bits (218), Expect = 4e-16, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 43/75 (57%) Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K Sbjct: 15 VRLPRPTRHQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKR 74 Query: 359 RKAAMDRNYLASVLT 373 A M+ ++ +L Sbjct: 75 LLACMEDDFREELLG 89 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 88.6 bits (218), Expect = 4e-16, Method: Composition-based stats. 
Identities = 38/131 (29%), Positives = 62/131 (47%), Gaps = 5/131 (3%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH 220 M +KG ++T DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELE 278 + E SHGR R V + E + W ++ L V R A + + Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLTPE-TKHSGSWPDIQALLVTEKIRQ--AHYSETVT 117 Query: 279 MTVRYYISSAD 289 RYY+S Sbjct: 118 SDFRYYLSRCQ 128 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 53/129 (41%), Gaps = 6/129 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHPD 56 +++ L + + +PD R+A H+L + LT A + G +G++ + ++ Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + +P I + + P W +S D + +A+DGK ++ Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGG 177 Query: 117 YDKSRRRGA 125 D + + Sbjct: 178 VDHTGAQTQ 186 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats. 
Identities = 37/154 (24%), Positives = 55/154 (35%), Gaps = 14/154 (9%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ + P + + S HGR E R C + DEL F Sbjct: 2 IKRNQPTTYRQL-AALPWPDSAV----QHTASSAGHGRRESRSIKTCGIADELGGIAFPH 56 Query: 255 KGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHW 311 L A+ + Y ++S D T + A A+R HW VE H Sbjct: 57 GRL-----ALRVHRRRKQTGGCESRETVYAVTSLDAHETTPAELAAAVRGHWTVEALRH- 110 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 DV E+ + G A + R++A+ +L Sbjct: 111 VRDVTYAEEASTLHTGTAPRAMATFRNLAVGLLK 144 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 41/86 (47%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + ++ + + + D R +H+ DI+++ +C V+ G +G I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSCISPAKF 86 + + NG+P D I + + P F Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 85.5 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 33/108 (30%), Positives = 51/108 (47%), Gaps = 4/108 (3%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVVSCISPAKFH 87 +L L + AV++G E I FG P L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTM 135 W+ D H D D IA+DGK L S D + H+++A++ Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGSRDGAV--PGTHLLAAYAPQ 107 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 84.4 bits (207), Expect = 6e-15, Method: Composition-based stats. 
Identities = 30/118 (25%), Positives = 47/118 (39%), Gaps = 9/118 (7%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L H++ +PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH 115 + P T+ RV+ I NW+ +A+DGKTL Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSL--GLSPAALAVDGKTLAG 130 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 27/69 (39%), Positives = 42/69 (60%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ H + I D RQ+ K+ + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIP 69 G G+P Sbjct: 72 KGILTEGVP 80 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 47/106 (44%), Gaps = 1/106 (0%) Query: 261 CVAVSFR-SIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R E + + + +Y+SS + +A + IR HW VEN++H+ DV E Sbjct: 12 GRTRSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGE 71 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 D +IR +++S R A+N+ + + ++ + Sbjct: 72 DRSRIRTLPLVQVWSVARSFALNLYRSLLMANRAQAQRRCMFGLST 117 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 82.1 bits (201), Expect = 4e-14, Method: Composition-based stats. 
Identities = 28/113 (24%), Positives = 47/113 (41%), Gaps = 6/113 (5%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +L ++S IPD+R+A + L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 2 QLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQL 61 Query: 65 -ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-----DKDVIAIDGK 111 P H +I + + F D VI + K Sbjct: 62 HRKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 46/129 (35%), Gaps = 13/129 (10%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L ++ + D R+ H +LL+ AV++GA + I ++ P + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH 115 P TI RV+ P + H D +AIDGK+ R Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SYDKSRRRG 124 S S R Sbjct: 115 SRLGSTRPP 123 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats. 
Identities = 45/169 (26%), Positives = 70/169 (41%), Gaps = 34/169 (20%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVI----SGAEGWEDIED--FGETHP 55 +LKKL+ S IPD R+A ++H+L+ +LL + + + S E D+ F + Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSRPAFLQALQ 138 Query: 56 DFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD--------VIA 107 + +G DT+ARV+ I P K E FI +R IA Sbjct: 139 GLFPELETLPHG----DTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIA 194 Query: 108 IDG--KTLR-------------HSYDKSRRRGAIHVISA-FSTMHSLVI 140 IDG K +R + D + + I+V+ A F + L I Sbjct: 195 IDGTQKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTI 243 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 80.9 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 32/122 (26%), Positives = 59/122 (48%), Gaps = 7/122 (5%) Query: 253 EWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWR 312 W+G + R ++ + EL Y ++S A++ R HW VEN+LH + Sbjct: 3 GWRGSRMALRM--RRRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHK 60 Query: 313 LDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 D V+ ED + R+G A ++ +R + +N+L + + + R +RK + D L ++ Sbjct: 61 RDTVLGEDASRSRKGAAGLMY--LRDVILNLL---HLKRWPVLRSVRKFSADPKVLLRLI 115 Query: 373 TG 374 G Sbjct: 116 RG 117 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. 
Identities = 37/129 (28%), Positives = 52/129 (40%), Gaps = 13/129 (10%) Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRD----CHSSDDKDVIAIDGKTLRHSYDKS 120 PV+ ++ ++ I P F C + IAIDGKTLR S+D Sbjct: 8 LRRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAF 67 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL---------NMLDIKGKIITT 171 A +V+SAF+ H +++ DEKSNEI A L+ I + Sbjct: 68 SDTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVML 127 Query: 172 DAMGCQKDI 180 DAM I Sbjct: 128 DAMTFAPAI 136 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 35/88 (39%), Gaps = 3/88 (3%) Query: 266 FRSIIAEQKKELEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 R + + + Y I+S + + R HW +EN LH+ D ED Sbjct: 33 HRIFTKVKTGKKTEEIVYGITSLTQQKASPKTILKFSRGHWSIENGLHYVRDTAFREDHS 92 Query: 323 KIRRGNAAELFSGIRHIAINILTNDKVF 350 +IR NA + ++++ + + V Sbjct: 93 QIRTQNAPRAMASLKNLVVGLFHFLNVP 120 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 30/96 (31%), Positives = 40/96 (41%), Gaps = 4/96 (4%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL---KELNNPAHDSYAMSEK 228 D +GCQK IA+ I +Q DYL AVK NQ L++A F D K Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKINK 67 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 GR E R V + I + W L+ + + Sbjct: 68 GPGRLEQRRCWVGYEIPDTI-NSQNWAKLETIVMVE 102 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 74.4 bits (181), Expect = 6e-12, Method: Composition-based stats. 
Identities = 21/84 (25%), Positives = 43/84 (51%), Gaps = 3/84 (3%) Query: 268 SIIAEQKKELEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + + V + I+S A +R HW +EN+LH+ DV + ED C++ Sbjct: 8 TRERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRV 67 Query: 325 RRGNAAELFSGIRHIAINILTNDK 348 R G+A ++ + +R+ +++ K Sbjct: 68 RMGHAPQVLAALRNAVVHLWREVK 91 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 74.4 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 50/138 (36%), Gaps = 11/138 (7%) Query: 42 EGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRD---CH 98 + + P G + P I R++ I P W+ Sbjct: 221 RATSALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAP 280 Query: 99 SSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE- 157 + + IA+DGKTLR S + HV++A +V+ D K+NEIT Sbjct: 281 APGSRRAIAVDGKTLRGSRTRDSAAR--HVLAAADQHTGIVLASTDVDTKTNEITRFTAS 338 Query: 158 -----LLNMLDIKGKIIT 170 LL+ I+ +++ Sbjct: 339 GSHADLLSSRCIRSGVVS 356 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 72.4 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 25/107 (23%), Positives = 45/107 (42%), Gaps = 5/107 (4%) Query: 273 QKKELEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + ++S + A + HW +EN+LHW DV +ED + R GNA Sbjct: 68 PGGPATAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNA 127 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSG 376 ++ + +R++AI IL + + +R A + +G Sbjct: 128 PQVMTSLRNLAITILRL--TGAKNIAKALRHHARHPERPLETIKKAG 172 >UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK19_ACIF5 Length = 104 Score = 72.1 bits (175), Expect = 3e-11, Method: Composition-based stats. 
Identities = 22/93 (23%), Positives = 38/93 (40%), Gaps = 3/93 (3%) Query: 273 QKKELEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + L + ++S T E R HW +EN+ H D +ED +IR N Sbjct: 2 KDGTLREDCAFGLTSLTKDRTTPENLLGIARGHWEIENRNHHVRDTTYHEDLSQIRTENG 61 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + + +R +A++IL V + A+ Sbjct: 62 PHMMATLRGLAMSILRLIGVKNIAQAGRDFAAS 94 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 1/64 (1%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + +PD R H L+ IL + I A++ GAE D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFEN 66 Sbjct: 60 PLPY 63 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 28/81 (34%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +PD R + H+ S IL + A +GA + I ++ P +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSCISPAKFHECFI 91 T R ++ + + Sbjct: 109 ESTSRRFLAGVDATALDQVLG 129 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats. 
Identities = 32/153 (20%), Positives = 58/153 (37%), Gaps = 12/153 (7%) Query: 197 GNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKG 256 G+Q L + ++ K + E HGR+ + + W G Sbjct: 8 GDQKTLYRQIADQLLGKRHIPLMATDH---EIGHGRD---ILWTLRAKEAPQHIKANWHG 61 Query: 257 LKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 + ++ + + K +I+S T + +R W VE+ HW D Sbjct: 62 TSWIAEVIATGTRDRKPFKATHR----FITSLRTTPDALLRLVRERWSVESW-HWIRDTQ 116 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++EDD + R GN A + + +R A+N+L Sbjct: 117 LHEDDHRYR-GNGAGVMAALRTAAMNLLRLTGF 148 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 68.2 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 47/228 (20%), Positives = 80/228 (35%), Gaps = 37/228 (16%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L I+ + D R ++ +S I + + + + +E + K+ Sbjct: 18 HLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKALPK 73 Query: 65 ENGIPVHDTIARVVSCISPAKFHE--------CFINWMRDCHSSDDKDVIAIDGKTLRHS 116 + +P DTI RV+S +E N + + D V+AIDG L S Sbjct: 74 KTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELFES 133 Query: 117 YDKSRRR--------------GAIHVISAFSTMHSLVIGQIKTDEKSN-------EITAI 155 K V S + L++GQ + K + EITA Sbjct: 134 TKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEITAG 193 Query: 156 PELLNMLDIKGK----IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 L+ L + II DA+ C+ +++ G D + VK + Sbjct: 194 KRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKDER 241 >UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6C2C4_9GAMM Length = 77 Score = 67.4 bits (163), Expect = 9e-10, Method: Composition-based stats. 
Identities = 19/57 (33%), Positives = 31/57 (54%) Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 ED+C++ A F+ +R IAI++L D+ K LR + RK A D +Y+ + Sbjct: 21 SFREDECRVHDPMAGGNFALLRKIAISLLVRDRSNKTSLRGRCRKVAWDNDYMRQLF 77 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 67.0 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 56/143 (39%), Gaps = 9/143 (6%) Query: 213 KELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 +E P + G E + + G ++ R ++ + Sbjct: 2 EERRLPGETEAVWNLVRDGEVWTYRVWASPYLPEEM---RAFPGCGQVVRME--REVVRK 56 Query: 273 QKKELEMTVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 E+ TV Y ++S A + + + W VEN+ W D +++ED C++R G Sbjct: 57 GTGEVRRTVSYALTSLGPEVADARRLGELLLSRWEVENRSFWVRDFLLHEDACQVR-GVG 115 Query: 330 AELFSGIRHIAINILTNDKVFKA 352 A++ + +R +++L V + Sbjct: 116 AQVLAALRAFLVSLLHRQGVREK 138 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 31/60 (51%), Positives = 34/60 (56%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 11 LLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 65.5 bits (158), Expect = 3e-09, Method: Composition-based stats. 
Identities = 24/100 (24%), Positives = 41/100 (41%), Gaps = 2/100 (2%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W GL + + R + VR+ + S+ +E A AIR H + W L Sbjct: 7 WPGLTTVLATETLR--GGNGTDSVPAQVRHSLGSSTAPSEVLAQAIRRHGALATGEPWVL 64 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +V E+ ++R AA + +R +A++ D A Sbjct: 65 EVSFGEERSRVRERCAARHLALLRRVALDRRRADASLTAS 104 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 65.1 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 46/106 (43%), Gaps = 3/106 (2%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 ++ +L +CAV++GA + D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FHECFINWMRD---CHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 + +W+ + VIA+DGK +R + R A+ + Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWM 106 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 1/63 (1%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + D R+ +H+L IL++ +CAVI+ AE +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFE 65 Sbjct: 61 PLP 63 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 64.0 bits (154), Expect = 9e-09, Method: Composition-based stats. 
Identities = 28/109 (25%), Positives = 42/109 (38%), Gaps = 11/109 (10%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYIS 286 + HGR E R L+ W GLK R++ K + V + I+ Sbjct: 2 DPGHGRIETRTVRATP----LLTCHDRWTGLKHGFRITRTRTV----KGVTTVEVVHGIT 53 Query: 287 SAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 S A +R+HW +EN+ H DV + ED+ + R A Sbjct: 54 SRPVERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAGRA 102 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 63.2 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 27/48 (56%), Positives = 40/48 (83%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 ++ DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTDEKSNE Sbjct: 26 LSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RIX6_9ACTO Length = 90 Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 24/59 (40%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 R WH+EN+LHW DV E + R G + + +R+ AI + Sbjct: 11 AQPADLQQWARLEWHIENRLHWVRDVTFGEGTHRARTGTGPAVAAVLRNTAIGFHRGNG 69 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 61.3 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 26/58 (44%), Positives = 37/58 (63%), Gaps = 4/58 (6%) Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + +FR +I +L + RYYISS +LTAE+ A + HW +E+ +HW LDV MNE Sbjct: 1 MVENFRFVIG---NKLVLEYRYYISSKELTAEQAANTVSEHWGIES-MHWVLDVSMNE 54 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats. 
Identities = 16/98 (16%), Positives = 32/98 (32%), Gaps = 1/98 (1%) Query: 8 GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-YGDFEN 66 S + D R+A + L +L + +++SG+ ++ F E L + +G Sbjct: 10 DVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTSWR 69 Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD 104 P I + + + F S Sbjct: 70 KAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 48/149 (32%), Gaps = 8/149 (5%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 + + L L + + P L A + + G + R Sbjct: 51 RLVTEGDQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTA----GSRQTRALKAV 106 Query: 242 DVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT---AEKFAT 297 VP L + L + ++ + E K+ Y I + + AT Sbjct: 107 TVPAGLGFPHAAQAIQLTRTSRPINKNTKKTEGKRRQRRETVYAICTLPAHDALPAELAT 166 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 IR HW +E +L W DV + ED + R Sbjct: 167 WIRGHWSIEVRLRWVRDVTLGEDLHQART 195 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 8/35 (22%), Positives = 21/35 (60%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVIS 39 L+ ++ +PD R+ + H + +L + +CA+++ Sbjct: 60 ALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLT 94 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 38/65 (58%), Positives = 42/65 (64%), Gaps = 12/65 (18%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLM HISIIPDYRQAWK+EHKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 PDFLK 59 DFLK Sbjct: 55 LDFLK 59 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. 
EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 29/62 (46%) Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 +TA T +R +W +EN++H+ D ED GN + R++AI ++ + Sbjct: 88 SVTAAYLHTHVRGNWGIENEVHYTRDAAWREDANPTYTGNTNHALASFRNLAIGVIGLNG 147 Query: 349 VF 350 Sbjct: 148 TR 149 Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 24/68 (35%), Gaps = 1/68 (1%) Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 + P T+ + I F W+ + + +AIDGK LR ++ Sbjct: 28 HFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCAQI-ARGRVALAIDGKVLRGAWSGD 86 Query: 121 RRRGAIHV 128 A ++ Sbjct: 87 ESVTAAYL 94 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 42/93 (45%), Gaps = 4/93 (4%) Query: 84 AKFHECFINWMRDCHSSDDK-DVIAIDGKTLRHSYDKSRRRGAIHV--ISAFSTMHSLVI 140 F + WM + D D + DGKTLR S D+ A + +S +S + I Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQ-IKTDEKSNEITAIPELLNMLDIKGKIITTD 172 Q ++S+E ++ LL+ +++ ++ D Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQAD 94 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. 
Identities = 58/376 (15%), Positives = 110/376 (29%), Gaps = 44/376 (11%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + D R+ +++ I + E E ++ + +Q Sbjct: 40 GFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLVPK 95 Query: 65 ENGIPVHDTIARVVSCIS---PAKFHECFINWMRDCHSSD-----DKDVIAIDGKTL--- 113 +P HDT+ + + + H C I ++ V AIDG L Sbjct: 96 NIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELFHT 155 Query: 114 ----------RHSYDKSRRRGAIHVISAFSTMHSLVIG-------QIKTDEKSNEITAIP 156 R DK+ V++ ++ +I Q D+ E T Sbjct: 156 KAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTVAQ 215 Query: 157 ELLNML-DIKGK---IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL 212 L+ + + GK + T DA+ + + G + +K + R+ K F Sbjct: 216 RLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACFA- 274 Query: 213 KELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 N DS G + + + +K + + Sbjct: 275 ----NRLPDSTWEERDGKGNTVYVQAWDEEGLAQWPQVRVPMRIVKIIRHTNKTVIEANK 330 Query: 273 QKKELEMTVRYYI---SSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + ++ R+ SS + A W +EN L D C + A Sbjct: 331 EVFVTDVVERWIATTCSSEKADTQTIAQIAAARWDIENIGFRNLKTFNALDHCFVHDSVA 390 Query: 330 AELFSGIRHIAINILT 345 + G + +A N+ Sbjct: 391 IKAMIGFQVLAFNLKR 406 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 12/46 (26%), Positives = 18/46 (39%), Gaps = 1/46 (2%) Query: 8 GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGET 53 IPD R + H+L +L L AV+ G G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAA 114 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats. 
Identities = 49/336 (14%), Positives = 85/336 (25%), Gaps = 54/336 (16%) Query: 8 GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 IPD R L D+L+ + A F D ++ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLA---FQRRTLDHNLRHVFGLTG 81 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------DDKDVIAIDG--------- 110 P + V+ + P F + ++ D V+A+DG Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 111 -----KTLRHSYDKSRRRGAIHVISAFSTMHSLVIG------QIKTDEKSN--EITAIPE 157 T RH+ + + S V+ Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 L ++ DA +QK +L VK F Sbjct: 202 WLGRFREEHPDLAVLVVEDARSSNAPHVRDLQKARCHFLLGVKA------ADHAHLFAHV 255 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 H ++ + E + R +R + L + + + + Sbjct: 256 CARQDQH-AFEVVEDADPRTGLRRSYLWIADLPLNESNDD-------VRVNFVHLVELDP 307 Query: 274 KKELEMTVRYY-ISSADLTAEKFATAIRNHWHVENK 308 ++ + A A R W +EN+ Sbjct: 308 DGTPREWTWVADMAVTGANVRQLARAGRARWRIENE 343 >UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQC4_9GAMM Length = 96 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 5/84 (5%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +T ++ R HW + LH+ D NED +IR G+ + + AI +L + Sbjct: 1 MTPQQVLAINRGHWSI-ASLHYISDWNYNEDRGQIRTGHGPANVTRLCRFAIGVLKHFPK 59 Query: 350 FKAGLRRKMRKAAMDR----NYLA 369 + MR+ A +YL Sbjct: 60 PGQYIPEMMRQLARRPRQVLDYLR 83 >UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3536 Length = 130 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. 
Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 7/71 (9%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT---AEKFATAIRNHWHVENK 308 +WKGLK+ R++ + V + I+S A + +R+HW +EN+ Sbjct: 9 QDWKGLKQGFQITRERTV----NGVTTVEVVHGITSLSADRANAGALLSLLRDHWRIENQ 64 Query: 309 LHWRLDVVMNE 319 LH+ DV + E Sbjct: 65 LHYVPDVTLGE 75 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 59/354 (16%), Positives = 115/354 (32%), Gaps = 57/354 (16%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHK----LSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 EL L+G + IPD R K HK L LL+ + S E ++ Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD--------VIAID 109 L++ +P DT+ R++ I A + ++ +R IAID Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 110 G------------KTLRHSYDKSRRRGAIHVI----SAFSTMHSLV-----------IGQ 142 G + L+ K R + + ++ + LV +G Sbjct: 193 GSQKLAGDTLWAEELLQRHVGKDETRHTQYFVYVLEASLVFHNGLVIPLLSEFLEHALGD 252 Query: 143 IKTDEKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + ++ E+ L + L ++ D + + ++ + ++ +K Sbjct: 253 SEAQKQDCELRGFARLSDRLKRLFPRLPILLLLDGLYANGPVMQRCLRAHWQFMIVLKD- 311 Query: 199 QGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 L +EE L+ P ++ GR + V D+ L Sbjct: 312 -KDLPTVWEEFRALQPRQLP------TLQQDWGRRQQHFSWVNDIEYAYGSNGRCRLKLH 364 Query: 259 KLCVAVSFRSIIAEQKKELEMTVRYYISSADLT----AEKFATAIRNHWHVENK 308 + ++ + E + E ++SS L+ E+ R+ W +E Sbjct: 365 VVVCEERWQGVDQEARIVTETARHAWLSSQPLSRENVHERCNLGARHRWGIEAG 418 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats. 
Identities = 14/42 (33%), Positives = 26/42 (61%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDI 47 L+ SI+PD R + L +++++T+ AV+ GA+ W D+ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDV 43 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 54.3 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 30/56 (53%) Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 L+M D+ + DA+G Q IAE+I + G DY+ A+K NQ +A F E Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAE 72 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 36/111 (32%), Gaps = 7/111 (6%) Query: 41 AEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS 100 + +E F +P L G ++ + P K E + Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLL--PLLLHRLDPKKLQEALHQVFPEA--- 55 Query: 101 DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 D V+ +DGK LR S + + ++ + + Q + + K E Sbjct: 56 DLGGVLVVDGKHLRGS--GKGKSPQVRLVEVLALHLKTTLAQARVEGKVVE 104 >UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C435B Length = 133 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. 
Identities = 27/137 (19%), Positives = 42/137 (30%), Gaps = 18/137 (13%) Query: 192 LFAVKGNQGRLNKAFEEKFPLKE-----------LNNPAHDSYAMSEKSHGREEIRLHIV 240 + K NQ L E ++ L P + G R+ Sbjct: 1 MLTAKDNQPGLVADIEAGLGFEDAARGLAAATSPLTGPDARATGAPGHVGGPGHGRIETR 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD---LTAEKFAT 297 L+ W GLK R++ K + V + I+S A Sbjct: 61 TVRATPLLTCHDRWTGLKHGSRITRARTV----KGVTTVEVLHGITSLTVERADARALLG 116 Query: 298 AIRNHWHVENKLHWRLD 314 +R+HW +EN+ H D Sbjct: 117 LVRSHWRIENQRHDVRD 133 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 52.0 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 1/56 (1%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 EL++L + + D R HKL +++L+ +CAVI+GA+G IE + Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE-WLAGRLQL 73 >UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7B Length = 481 Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats. 
Identities = 49/285 (17%), Positives = 93/285 (32%), Gaps = 55/285 (19%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI 106 IE G L+++ ++G H+ + I P E F+ + D ++ +V+ Sbjct: 81 IEHQGSGRQAHLRRHRQPDDG--CHEAFYGKLRRI-PRGLSEAFLRDVTDRFTALFPEVV 137 Query: 107 A--------------IDGKTLR----HSYDKSRRRGAIH---VISAFSTMHSLVIG-QIK 144 A +DGK+L+ D G + ++ A+ LV+ Sbjct: 138 AHRLPTSFDRLEVLILDGKSLKKVAKRLVDTRGTPGKLLGGKLLVAYRPRDGLVLDMAAD 197 Query: 145 TDEKSNEITAIPELLNMLDIKG---KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 D ++NE IP+L+ + +G K++ D + C + K G ++ Sbjct: 198 LDGETNEAKLIPDLMPRVHARGGPAKLVVGDRLFCASKHFAEFTKDNGHFVV-------- 249 Query: 202 LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L +P + ++ S V +E L++ Sbjct: 250 -----RYARTLSFEPDPKRPAVTTADPSQR----------AVVEEWGWAGKPKDKLRRYV 294 Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVE 306 R +A E + + SA A R W +E Sbjct: 295 R----RITVARPVGEAITILTDLLDSAPYPATDLLDLYRIRWTIE 335 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 25/48 (52%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWM 94 + F + + ++ D + G P DT+ RV + I P KF E F +W+ Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWI 48 >UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillaceae RepID=C5D2E6_GEOSW Length = 437 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. 
Identities = 57/351 (16%), Positives = 112/351 (31%), Gaps = 74/351 (21%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDI-EDFGETH-PDFLKQ 60 K L+ + + D R + + IL + + G + + E F + + ++ Sbjct: 28 FKDLVDQLKKVKDKRHQSYITYGPETILYTILLKSVFGIKSMRSMTELFNKDECIENIRV 87 Query: 61 YGDFE--NGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSDDKDV-IAID 109 + N +P +DTI ++ + P + F + +K I D Sbjct: 88 VLGLKELNELPHYDTINDFLAKLEPKELETIRIYLIKKLFEKRCLESFRILNKYWPIVFD 147 Query: 110 GK------------TLRHSY-DKSRRRGAI----HVISA--FSTMHSLVIGQIKTDEKSN 150 G LR Y DK + HV+ A L I + +S Sbjct: 148 GTGIHTFKEKHCEHCLRREYKDKETGETKVVYMHHVLEAKLVVGDMVLSIATEFIENESE 207 Query: 151 -------EITAIPELLNMLD-----IKGKIITTDAMGCQKDIAEKIQKQGGD-YLFAVKG 197 E+ A L++ L + +I D++ + + E I + Y+F K Sbjct: 208 NVPKQDCELKAFMRLVDKLKKTFKRLPICLI-ADSLYACEPVFE-ICDKHNWKYIFRFKE 265 Query: 198 NQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGL 257 ++ + E N + + + V D+ Sbjct: 266 DRIKTVSQEFRAIQSLETNGKSSEYF---------------WVNDIAYND---------- 300 Query: 258 KKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + + + + E+K+E + I+ + AE A R W +EN+ Sbjct: 301 RLVNLVEKVKVTENEKKQEFLFITNFRIT--ERNAEILVQAGRRRWKIENE 349 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 14/45 (31%), Positives = 25/45 (55%) Query: 134 TMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 T + + Q++ E +NEIT LL+ D++ +T DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. 
Identities = 20/84 (23%), Positives = 33/84 (39%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L G +S IPD R+ + L +L L + AV+ GA I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIARVVSCISPAKFHE 88 + P T+ + + + E Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36C2 Length = 109 Score = 48.2 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 26/69 (37%), Gaps = 5/69 (7%) Query: 268 SIIAEQKKELEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + V Y I+S A R HW +EN LH+ DV + ED C + Sbjct: 5 ERRRKANGKATVEVVYGITSLSRLAADAAALLGYSRRHWGIENGLHYTRDVTLGEDRCPV 64 Query: 325 --RRGNAAE 331 R Sbjct: 65 GARSRPTPR 73 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 47.8 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 37/48 (77%), Positives = 39/48 (81%) Query: 78 VSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +SCI KFHECFIN MR+CHSSDD DVIAIDGK L HS DKSRRR A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 47.4 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 24/63 (38%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L + +PD + H+L+ +L+ ICAV + I ++ P G Sbjct: 14 GLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGGH 73 Query: 65 ENG 67 G Sbjct: 74 RPG 76 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. 
Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 1/55 (1%) Query: 8 GHISIIPDYRQAW-KMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 H +PD R+ + HK DIL++ ICA+I GA+ W + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 40/108 (37%), Gaps = 6/108 (5%) Query: 66 NGIPVHDTIARVVSCISPAKFHECFIN-WMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 G P + + + P + + V+ +DG T+R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITT 171 +H+ + +++ Q+ DEK+NE + L + D+ G +IT Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLITA 134 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 27/50 (54%), Gaps = 13/50 (26%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF-------------SGIRHIAINILT 345 +HWRLDV MNEDDC+IRRGN F +R I INIL Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILK 50 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 11/59 (18%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L I D RQ K H L +L++TI +I + D+L+QY Sbjct: 35 LADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 >UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW00_RHOSR Length = 98 Score = 45.5 bits (106), Expect = 0.003, Method: Composition-based stats. 
Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 2/48 (4%) Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 GKT R + D S H+++A +V+ Q+ + NEI + + Sbjct: 18 GKTWRGAKDGSG--HLTHLLAAVDHDAGVVLRQVAVGARINEIPLLLD 63 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 45.5 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 48/333 (14%), Positives = 100/333 (30%), Gaps = 53/333 (15%) Query: 5 KLMGHISI-IPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 KL ++ I D R ++ H L+DIL I A+ G E D++ P F G Sbjct: 45 KLAEKLAAAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDLDRL-RNDPAFKLACGR 103 Query: 64 FENG---IPVHDTIARVVSCIS---PAKFHECFIN-WMRDCHSSDDKDVIAID------- 109 + + T +R+ + + ++ W+ + + ID Sbjct: 104 LPDSGQDLCSQPTCSRLENLPDLRTVIRLGRVLVDLWLSSYPAPPKSVTLDIDDTLDVVH 163 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML------- 162 G ++ I + + I K+ I L L Sbjct: 164 GHQQLSLFNGHHDERCFLPIHIYDAATGRPVAMILRPGKTPSGKEIRGHLRRLARCIRAR 223 Query: 163 -DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD 221 ++ D+ + ++ ++ DY+F + GN K + + Sbjct: 224 WPDTRILVRGDSHYGRVEVMAWCEENAIDYVFGLAGN-----KVLKRLVDASADDIRTRR 278 Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTV 281 + G E R WK +++ + + + + + Sbjct: 279 ALEQKPVLRGYVETRY------------EAKSWKAERRVAARI--------EATTMGLDI 318 Query: 282 RYYISSA-DLTAEKFATAI---RNHWHVENKLH 310 R+ +++ +AE + R K+H Sbjct: 319 RFVVTNLAKGSAEHIYDVVYCARGQAENLIKMH 351 >UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus pneumoniae RepID=B2ISL1_STRPS Length = 535 Score = 45.5 bits (106), Expect = 0.003, Method: Composition-based stats. 
Identities = 44/253 (17%), Positives = 80/253 (31%), Gaps = 34/253 (13%) Query: 18 QAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG--IPVHDTIA 75 Q + SDIL+ + +++G D+ + G + T++ Sbjct: 142 QRRYCRYSDSDILVQFLFQLLTGYGT-----DYACKELSADAYFPKLLEGGQLASQPTLS 196 Query: 76 RVVSCISPA----------KFHECFINWMR--DCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 R +S + E F+ + + D GK +Y+ R Sbjct: 197 RFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTYGKQEGVAYNAHYRA 256 Query: 124 GAIHVISAFSTMHSLVI-GQIK------TDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 H + AF Q++ ++E + IT + E N L + D+ Sbjct: 257 HGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQL-----LFRMDSGFA 311 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQ--GRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGR-E 233 + + I+K G YL +K N RL ++L H +Y+ + G Sbjct: 312 TPKLYDLIEKTGQYYLIKLKKNTVLSRLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWS 371 Query: 234 EIRLHIVCDVPDE 246 R E Sbjct: 372 HKRRVCQFSERKE 384 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 16/30 (53%), Positives = 24/30 (80%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 +++A +T + + IGQ+K D KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 >UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromonas gingivalis ATCC 33277 RepID=B2RI66_PORG3 Length = 87 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 23/47 (48%) Query: 17 RQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 R K + L + L+ + +SG W +IED+ E + + LK + Sbjct: 23 RIESKEVYPLDFLFLIVFLSTLSGDTSWYEIEDYAEEYEEVLKSRYE 69 >UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RR82_RHORT Length = 84 Score = 44.3 bits (103), Expect = 0.007, Method: Composition-based stats. 
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 1/46 (2%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 A E++ LD+ G++ T DA+ CQ E ++ G L K NQ Sbjct: 36 ATQEMIAPLDLTGRLFTLDALHCQ-KTFEIARQAGNHLLVQAKINQ 80 >UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P672_AZOSE Length = 47 Score = 44.3 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 15/31 (48%), Positives = 18/31 (58%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 HW VEN LHW L+V NED ++R A Sbjct: 1 HWGVENWLHWCLNVQFNEDRSRVRSAYAVNN 31 >UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TLQ7_SHEHH Length = 74 Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats. Identities = 18/44 (40%), Positives = 24/44 (54%) Query: 7 MGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDF 50 H+SII R + EH DI+ L A+ S EGW DI++F Sbjct: 4 FEHLSIIKAPRSSINHEHDPVDIMFLVNSAIASDCEGWLDIDEF 47 >UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia coli O157:H7 RepID=Q8X3B6_ECO57 Length = 50 Score = 43.6 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 24/38 (63%), Positives = 27/38 (71%) Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSGLS 378 + I ND VFKAGL KMRKA MDRN+LAS + GLS Sbjct: 13 LLISDNDNVFKAGLSCKMRKAVMDRNFLASGIAACGLS 50 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 43.6 bits (101), Expect = 0.014, Method: Composition-based stats. 
Identities = 35/205 (17%), Positives = 67/205 (32%), Gaps = 19/205 (9%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 I+PD R +++H L +L I A+ +G E D D G H L+ + + Sbjct: 49 KIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLND-HD-GLRHDYALQTAVNRLQPLAG 106 Query: 71 HDTIARVVSCISPAKFHEC-------FINWMRDCHSSDDKDVIAID----GKTLRHSYDK 119 T+ R+ + FI + D A D G + Sbjct: 107 KSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGDQEGRFFHG 166 Query: 120 SRRRGAIHVISAFSTMHSLV--IGQIKTDEKSNE---ITAIPELLNMLDIKGKII-TTDA 173 + F H LV + D + + + + + + +I+ D Sbjct: 167 YYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPETRIVFRGDG 226 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGN 198 C+ + + ++ DY+ + N Sbjct: 227 GFCRHRMLDWCDRKQVDYVVGLARN 251 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 42.0 bits (97), Expect = 0.033, Method: Composition-based stats. Identities = 42/278 (15%), Positives = 91/278 (32%), Gaps = 25/278 (8%) Query: 51 GETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMR---DCHSSDDKDVIA 107 G + L + + + + I P F + R +S D ++A Sbjct: 3 GNSLSKELYDWLGYSSETATASAFVQQRDKIRPEALKLLFHEFTRLTVSENSLQDYRLLA 62 Query: 108 IDGKTLR------------HSYDKSRRRGAIHVISAFSTMHSL-VIGQIKTDEKSNEITA 154 +DG LR + + S+ +H+ + + M + V +++ + NE A Sbjct: 63 VDGSDLRLPSNSKDGFSSIRNSEDSKNYNLVHLDAMYDLMGKVYVDASVQSKKGMNEHKA 122 Query: 155 IPELLNMLDIKGKIITT-DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 + +++ +I G +I D + Q++ Y+ K + G + L Sbjct: 123 LVSMVDQSEINGNVIAIMDRGYESFNNIAHFQEKSWYYIIRAKESYG-----IISRLSLP 177 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK---KLCVAVSFRSII 270 + + + +E + L I + +K + FR++ Sbjct: 178 DYPEYDEEIMLTLTRRQTKETLPLLKAYPHRYRWIQPHTTFDFIKPKDSKFYDLHFRAVR 237 Query: 271 AEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + TV +++ D EK W +E Sbjct: 238 FAIADGVYETVYTNLNAEDFPPEKLKQLYNLRWGIETS 275 >UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUW9_9BACT Length = 61 Score = 41.6 bits (96), Expect = 0.042, Method: 
Composition-based stats. Identities = 11/62 (17%), Positives = 23/62 (37%), Gaps = 2/62 (3%) Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASV 371 D ED +IR NA + ++++ + + V + + +R A + Sbjct: 1 MRDTSFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPN--IAKTLRNFAARPFLALQM 58 Query: 372 LT 373 L Sbjct: 59 LR 60 >UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b RepID=A6FLE0_9RHOB Length = 136 Score = 41.6 bits (96), Expect = 0.046, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 4/83 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +L L+ +PD R+ K+ H L DI+ I + +G E D D + L Sbjct: 32 DLAGLVA--KCLPDPREPGKVRHSLEDIIRFRIMMIAAGYEDGNDAGDLRDDPAFKLALE 89 Query: 62 GDFENGIP--VHDTIARVVSCIS 82 D E G TI+R+ + Sbjct: 90 RDPETGAALCSQPTISRMENMAD 112 >UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z8_THET2 Length = 77 Score = 40.9 bits (94), Expect = 0.078, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN+ W DV++ E+ C++R G A++ + +R +++L V + R++ KAA+ Sbjct: 1 MENRSFWVRDVLLYEEACQVR-GVGAQVLAALRAFLVSLLHRRGVREKVTRQRTLKAAL 58 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 435 e-120 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 404 e-111 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 393 e-108 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 393 e-108 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 393 e-108 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 
391 e-107 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 388 e-106 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 379 e-103 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 373 e-102 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 364 4e-99 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 361 3e-98 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 355 2e-96 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 352 8e-96 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 352 1e-95 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 347 3e-94 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 347 5e-94 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 346 8e-94 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 345 2e-93 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 342 1e-92 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 342 2e-92 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 339 1e-91 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 339 1e-91 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 336 8e-91 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 332 2e-89 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 331 4e-89 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 327 3e-88 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 325 2e-87 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 324 4e-87 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 323 7e-87 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 322 1e-86 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 
322 1e-86 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 321 2e-86 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 321 2e-86 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 320 4e-86 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 320 5e-86 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 315 1e-84 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 314 3e-84 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 308 2e-82 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 303 6e-81 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 300 4e-80 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 300 7e-80 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 297 3e-79 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 297 3e-79 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 296 1e-78 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 289 1e-76 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 284 3e-75 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 270 5e-71 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 267 4e-70 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 267 6e-70 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 266 8e-70 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 266 9e-70 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 265 2e-69 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 262 1e-68 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 260 5e-68 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 256 1e-66 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 
255 2e-66 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 244 4e-63 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 243 9e-63 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 240 9e-62 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 238 2e-61 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 234 5e-60 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 233 8e-60 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 230 7e-59 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 228 2e-58 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 228 3e-58 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 225 1e-57 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 224 4e-57 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 223 1e-56 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 215 2e-54 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 215 2e-54 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 214 3e-54 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 211 2e-53 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 207 5e-52 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 203 6e-51 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 201 2e-50 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 201 5e-50 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 193 1e-47 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 178 2e-43 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 176 1e-42 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 173 1e-41 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 
171 3e-41 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 170 9e-41 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 169 2e-40 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 169 2e-40 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 168 3e-40 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 168 4e-40 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 163 8e-39 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 163 1e-38 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 159 2e-37 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 158 3e-37 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 158 3e-37 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 156 2e-36 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 154 7e-36 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 151 4e-35 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 151 4e-35 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 150 1e-34 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 146 9e-34 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 146 9e-34 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 144 7e-33 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 143 2e-32 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 142 3e-32 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 141 4e-32 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 141 5e-32 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 141 5e-32 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 139 1e-31 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 
139 2e-31 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 138 3e-31 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 136 2e-30 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 135 2e-30 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 135 3e-30 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 134 4e-30 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 133 8e-30 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 133 1e-29 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 131 6e-29 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 128 3e-28 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 126 1e-27 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 126 2e-27 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 123 1e-26 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 122 3e-26 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 121 6e-26 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 121 6e-26 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 119 2e-25 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 119 2e-25 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 115 3e-24 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 114 4e-24 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 114 4e-24 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 111 3e-23 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 111 3e-23 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 111 4e-23 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 111 5e-23 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 
111 6e-23 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 110 8e-23 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 109 2e-22 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 108 4e-22 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 105 3e-21 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 105 3e-21 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 104 4e-21 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 104 4e-21 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 103 1e-20 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 101 3e-20 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 101 6e-20 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 100 7e-20 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 100 8e-20 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 100 8e-20 UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillace... 100 1e-19 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 100 1e-19 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 100 1e-19 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 100 2e-19 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 99 2e-19 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 99 3e-19 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 96 1e-18 UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ... 94 8e-18 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 93 2e-17 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 92 3e-17 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 92 4e-17 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 
91 5e-17 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 91 9e-17 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 90 1e-16 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 90 1e-16 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 89 2e-16 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 89 2e-16 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 89 3e-16 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 89 3e-16 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 87 1e-15 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 86 1e-15 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 86 2e-15 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 85 4e-15 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 84 5e-15 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 84 6e-15 UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferro... 84 7e-15 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 84 9e-15 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 84 9e-15 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 83 2e-14 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 81 5e-14 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 81 7e-14 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 81 7e-14 UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=... 79 3e-13 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 78 5e-13 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 78 5e-13 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 76 1e-12 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 
76 2e-12 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 76 2e-12 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 76 3e-12 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 75 3e-12 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 74 6e-12 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 71 5e-11 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 71 5e-11 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 71 7e-11 UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 70 1e-10 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 70 1e-10 UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitroco... 68 7e-10 UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitroso... 68 8e-10 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 68 8e-10 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 64 6e-09 UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=... 64 8e-09 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 63 1e-08 UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=... 63 2e-08 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 59 4e-07 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 58 5e-07 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 58 7e-07 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 57 1e-06 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 56 2e-06 UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata ob... 56 3e-06 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 54 7e-06 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 54 1e-05 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 
52 4e-05 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 52 4e-05 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 51 8e-05 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 51 8e-05 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 50 2e-04 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 49 2e-04 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 48 8e-04 Sequences not found previously or not previously below threshold: UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanoba... 72 3e-11 UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltapr... 69 4e-10 UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkalip... 61 6e-08 UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM 59 4e-07 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 53 2e-05 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 50 2e-04 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 49 2e-04 UniRef50_C7GHC1 Transposase, IS4 family (Fragment) n=6 Tax=Roseb... 48 4e-04 UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteoba... 48 7e-04 UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium... 48 8e-04 UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aro... 47 0.001 UniRef50_C7G6U9 Putative uncharacterized protein (Fragment) n=7 ... 46 0.002 UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus... 46 0.002 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 46 0.002 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 46 0.003 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 46 0.003 UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone... 45 0.004 UniRef50_B0JNZ6 Transposase n=20 Tax=Cyanobacteria RepID=B0JNZ6_... 45 0.004 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 
45 0.004 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 45 0.004 UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia ... 45 0.005 UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synecho... 44 0.006 UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitroco... 44 0.008 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 44 0.009 UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia R... 43 0.015 UniRef50_A5FU21 Transposase, IS4 family protein n=11 Tax=Alphapr... 43 0.017 UniRef50_D2KXE5 Putative transposase n=1 Tax=Lactobacillus ferme... 43 0.017 UniRef50_A7JYJ5 Putative uncharacterized protein n=1 Tax=Vibrio ... 43 0.019 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 42 0.028 UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001... 42 0.034 UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID... 42 0.046 UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=St... 41 0.081 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. 
Identities = 369/378 (97%), Positives = 371/378 (98%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 MELKKLM HISIIPDYRQ WK+EHKLSDILLLTICAVISGAEGWEDIEDFGETH DFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNP HDSYA+SEKSHGREEIRLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE EMTVRYYISSADLTAEKFATAIR Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 Query: 361 AAMDRNYLASVLTGSGLS 378 AAMDRNYLASVL GSGLS Sbjct: 361 AAMDRNYLASVLAGSGLS 378 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. 
Identities = 156/377 (41%), Positives = 220/377 (58%), Gaps = 10/377 (2%) Query: 2 ELKKLMGHISIIPDYRQAW-KMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K + + D R+ H DIL++ +CA+ISGA + +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 + NGIP HDT V++ +SP +F CF+ W + IAID KTLR S DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + +H++SA++T +LVIGQIKT+E SNEITAIPELLN LD+KG +++ DAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAHDSYAMSEKSHGREEIRL 237 AEKI ++ DY+ A+KGNQ +L+++ E F E D E S+GREEIR Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 + +++I EWK +K + + S R KKE E +RYYISSA L+AE Sbjct: 245 AYATNEIEKIIAN-DEWKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLK 298 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +ENKLHW LDV ED+ +IR+ N AE + +R IA+N++ +K K G K Sbjct: 299 VVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATK 358 Query: 358 MRKAAMDRNYLASVLTG 374 A D YL +L G Sbjct: 359 RLMAGWDEKYLLKLLNG 375 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 393 bits (1010), Expect = e-108, Method: Composition-based stats. 
Identities = 149/385 (38%), Positives = 223/385 (57%), Gaps = 20/385 (5%) Query: 8 GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 + + D R +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIH 127 IP HDT RV S ++P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPE---------------LLNMLDIKGKIITTD 172 +ISA++T + LV+GQ DEKSNEITAIP+ LL +L + G I+T D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA---HDSYAMSEKS 229 A+GCQK+I ++I +Q DY+ +K NQG L + E F ++N Y + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD 289 HGR+E+R + + E ID ++W L + R + + RY+ISS + Sbjct: 251 HGRQEVRYYQMLSNVAEEIDPDWQWLNLNSIGYVEYLRVENG--TDKTSLERRYFISSLN 308 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + FA+++R HW +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K Sbjct: 309 NNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKT 368 Query: 350 FKAGLRRKMRKAAMDRNYLASVLTG 374 K G++ K +KA D NYL VL Sbjct: 369 LKVGVKAKRKKAGWDENYLLKVLRN 393 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 393 bits (1010), Expect = e-108, Method: Composition-based stats. 
Identities = 168/373 (45%), Positives = 235/373 (63%), Gaps = 8/373 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G ++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 + GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH++SA++ + +V+GQ KTD+KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 + GDY+ AVK NQ +L++ + F + HD + S K HGR E+R + + Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 D+ L W L+ + + S R I + RY+I+S A+ FA A+R Sbjct: 246 DMLSTLG-NPERWASLQSIGMVESERYI----DGKTTAETRYFITSIAPDAKIFANAVRK 300 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K KA Sbjct: 301 HWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRYKA 360 Query: 362 AMDRNYLASVLTG 374 + +Y VL G Sbjct: 361 TLQPDYAQKVLNG 373 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 393 bits (1008), Expect = e-108, Method: Composition-based stats. 
Identities = 145/375 (38%), Positives = 216/375 (57%), Gaps = 7/375 (1%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ H S + D R A ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 NG+P HDT V + + P + +CF+NW + + ++IAIDGKTLR + Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + IH++SA+++ + LV+GQ DEKSNEITAIPELL +L+++G +++ DAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEKSHGREEIRLH 238 E I + GDY+ A+KGNQG L + F HDSY EK HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATA 298 D L+ W LK + S R + RYY+ S + A++FA A Sbjct: 245 WTMGQTDYLLG-AERWAQLKSIGCVESCRRQPGHPG---TLQRRYYLLSIESDAQRFADA 300 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R+HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 Query: 359 RKAAMDRNYLASVLT 373 KA D NYL +L+ Sbjct: 361 LKAGWDDNYLVKILS 375 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. 
Identities = 181/377 (48%), Positives = 249/377 (66%), Gaps = 4/377 (1%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ +SII D RQ K+ H L D+L L I AVISG EGWE+I+DFG D+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Y F GIP DTI+R+ I P +F +CF WM+ C DVIAIDGKTLR S++K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A+KI +GGDYL VKGNQ RL A + F ++ L P ++Y EK HGRE+ R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 D E+ D FEW GLK L AVSFR+ E+ + + V++YISSA L A+ A R Sbjct: 241 ADAN-EIGDLVFEWPGLKTLGYAVSFRT---EKDMQTTVAVKFYISSAKLDAKSLLEASR 296 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VEN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++ Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQ 356 Query: 361 AAMDRNYLASVLTGSGL 377 A +Y V++G L Sbjct: 357 ANRSDSYRELVVSGLSL 373 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. 
Identities = 137/373 (36%), Positives = 213/373 (57%), Gaps = 8/373 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+G++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEKSHGREEIRLHIVC 241 +Q DY+ +K N L ++ F + + HD Y K H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 DVPDE-LIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 V + +W GL+ + V R + + +++Y++S A+ AIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWNK----TTHDIQFYLTSLPPNAQFLCHAIR 325 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM++ Sbjct: 326 THWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMKQ 385 Query: 361 AAMDRNYLASVLT 373 AAM+ NY+ +VL Sbjct: 386 AAMNNNYMMTVLN 398 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 379 bits (972), Expect = e-103, Method: Composition-based stats. 
Identities = 158/372 (42%), Positives = 228/372 (61%), Gaps = 7/372 (1%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M ++ H S I D+RQ+ K+ + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A I +GGDYL AVK NQG L KA + F + D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFS-PHRSAGLSDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 DF W+ LK + + SFR++ + + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDF-THWEALKSIVMVESFRAVKGK---TASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E+ +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIES-MHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. 
Identities = 153/371 (41%), Positives = 214/371 (57%), Gaps = 10/371 (2%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H S I D RQ K+ + L +ILLLT+CAV+SGA W I +G FLK++ F Sbjct: 25 FLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPFA 84 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + A Sbjct: 85 DGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKAA 143 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 144 IHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKII 203 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPAHDSYAMSEKSHGREEIRLHIVCD 242 + DY+ A+KGNQG L K E + + ++ + EKSHGR E R VC Sbjct: 204 SKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVCT 263 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 D L W GLK + + + + RYYISS AE A AIR+H Sbjct: 264 DIDWL-KADHNWPGLKSIVMVQYHAILQDK----TRAETRYYISSMTSDAEHHAKAIRDH 318 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K A+ Sbjct: 319 WGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVK-GKHSLRSKRHIAS 377 Query: 363 MDRNYLASVLT 373 D ++LA ++ Sbjct: 378 WDDDFLAEIIN 388 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 364 bits (933), Expect = 4e-99, Method: Composition-based stats. 
Identities = 136/384 (35%), Positives = 195/384 (50%), Gaps = 15/384 (3%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 ++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + ++SA+S ++ + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 DI + I ++ +Y+ A+K N+ + L K + + ++ + HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVCDV-PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT-AE 293 R V F + GLK + S R+I+A E VRYY++S D T E Sbjct: 243 RTCTVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVA--TGEYTQEVRYYVTSLDNTKPE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A+AIR HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K Sbjct: 301 EIASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGS 359 Query: 354 LRRKMRKAAMDRNYLASVLTGSGL 377 + K KA D YL+ +L + Sbjct: 360 MNLKRLKAGWDEKYLSQLLQNNNF 383 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 361 bits (925), Expect = 3e-98, Method: Composition-based stats. 
Identities = 131/380 (34%), Positives = 215/380 (56%), Gaps = 16/380 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L+ H I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQ--GQ 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++SA++ +SLV+GQI+ +K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 125 SPRVIVSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD----------SYAMSEKSHGRE 233 I + +Y+ A+KGNQG+ ++ + + +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 E R + L D +W GL+ + V S R + + + RYY+SS ++ E Sbjct: 245 ETRRYWQSGDVSWLAD-RQQWAGLRSVGVVESVRQVGQQA---PTVERRYYLSSLNVDVE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 KFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K G Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 LRRKMRKAAMDRNYLASVLT 373 ++ K A+ D +YL +L+ Sbjct: 361 IKGKQLNASWDHDYLLRLLS 380 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 355 bits (911), Expect = 2e-96, Method: Composition-based stats. 
Identities = 124/380 (32%), Positives = 196/380 (51%), Gaps = 16/380 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L I D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD--------KDVIAIDGKTLRH 115 NGIP HDT +V S + P +F E F W + K VIAIDGK LR Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + DK + ++ A+++ SL +GQ+K +KSNEI A+PELL ML +KG I+T DAMG Sbjct: 134 AVDK--GQAPAVIVGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL-KELNNPAHDSYAMSEKSHGREE 234 CQ+++A KI +Q GDY+ A+K NQ L++ ++L + + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK 294 +R V + + + +W GL+ + R++ + + RY+ISS A Sbjct: 252 VRRCWVSEEVECWLQGAEKWAGLRSVAAVECERTVA----GQTTVQRRYFISSLKADAAL 307 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN-DKVFKAG 353 A ++R HW +EN LHW LDV ED+ + RRG +AE + +R + ++ + K Sbjct: 308 IAASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKS 367 Query: 354 LRRKMRKAAMDRNYLASVLT 373 + ++ +A + +YL ++L Sbjct: 368 VNQRRFEAGLSTDYLQTLLG 387 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 352 bits (904), Expect = 8e-96, Method: Composition-based stats. 
Identities = 129/373 (34%), Positives = 199/373 (53%), Gaps = 7/373 (1%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +L ++ + D+R A + H+LS++L + +CAV+SGA+ +E+I +G +L+ + Sbjct: 5 KLADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKS 120 + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + K+ Sbjct: 65 LRLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKA 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 +H++SAF+ +V+GQ T EKSNEITAIPELL +LDI+G I+T DAMG Q I Sbjct: 125 AAA-PLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A I+++G Y+ VK N +L + ++ + HGR E+R Sbjct: 184 ARAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 D D L WK + V R++ + YYISS AE+ A AIR Sbjct: 244 FDATDRLHK-AEAWKDVASFAVVERVRTV----GERTSTERVYYISSLPADAERIAVAIR 298 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 +HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K ++ K Sbjct: 299 SHWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSIKTKRLL 358 Query: 361 AAMDRNYLASVLT 373 AA + A++L Sbjct: 359 AATSDEFRAALLG 371 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. 
Identities = 141/376 (37%), Positives = 200/376 (53%), Gaps = 12/376 (3%) Query: 5 KLMGHISIIPDYRQAW-KMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 LM + D R+ H ++L++ I AV+S + EDI +G D+L+Q+ Sbjct: 8 SLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLV 67 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NG+ +T R+ + P +F F W+ + + +DGKT+R S S Sbjct: 68 LLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGGE 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 AIH++SAF+T +V+GQ K KSNEITAIPELL L I G ++T DAMGCQK+IA + Sbjct: 125 SAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQ 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDV 243 I QGGDYL AVKGNQ L A E +F + + + D + SHGR ++ V Sbjct: 185 ITDQGGDYLLAVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVAQIASVLPA 243 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 E I +W KK+ S R + + K + RYYISS +LTAE+ A A+R HW Sbjct: 244 --EGIVDLADWPECKKIARVDSLRKVGNHESK---LERRYYISSRELTAEQLAAAVRAHW 298 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAGLRRKMRKA 361 +EN+LHW LDV ED IR+GNA + S ++ I +N++ D K LR K + A Sbjct: 299 GIENRLHWVLDVSFGEDASTIRKGNAPQNLSLLKKIVLNLIRLDTADKTKTSLRLKRKCA 358 Query: 362 AMDRNYLASVLTGSGL 377 A + +L + L Sbjct: 359 AWTDDVRMRILGFTSL 374 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 347 bits (890), Expect = 3e-94, Method: Composition-based stats. 
Identities = 134/369 (36%), Positives = 194/369 (52%), Gaps = 9/369 (2%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L + I D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEKSHGREEIRLHIVCD 242 KQ DY+ A+KGNQ L K ++ F + + + E +H R E R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VPDELIDFT-FEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 V +W GL+ L V S R + + RY++SS A FA IR Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNKD----TTETRYFLSSLSTDAATFAHYIRA 309 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L K +A Sbjct: 310 HWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRD-SSKGSLVMKRYRA 368 Query: 362 AMDRNYLAS 370 +D ++ Sbjct: 369 GLDDQFMMQ 377 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 347 bits (889), Expect = 5e-94, Method: Composition-based stats. 
Identities = 132/375 (35%), Positives = 192/375 (51%), Gaps = 12/375 (3%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M K L+ ++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRHSYDK 119 + GIP HDT R+ + + PA F W+ D D +A+DGK LR + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A+H+++ +ST + +GQ K +KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK----ELNNPAHDSYAMSEKSHGREEI 235 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R V + DE + +WK K + + R + VR+YISS L A Sbjct: 240 RRCWVL-MVDESMPVCQQWKA-KTIIAVQAERIENGKGYD----FVRFYISSRALDATSA 293 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K + Sbjct: 294 LKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKSRNLSMA 353 Query: 356 RKMRKAAMDRNYLAS 370 K R ++ YL Sbjct: 354 NKRRLCCLNEQYLFE 368 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 346 bits (887), Expect = 8e-94, Method: Composition-based stats. 
Identities = 142/375 (37%), Positives = 201/375 (53%), Gaps = 16/375 (4%) Query: 7 MGHISIIPDYRQAW-KMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 M + I D R+ H +IL++ I AV+S + EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-----IAIDGKTLRHSYDKS 120 NGIP +T R++ + P +F F W+ + D IAIDGKT+R S S Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GS 118 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 AIH++SAF+T LV+GQ K KSNEITAIPELL L IKG ++T DAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A++I + GDYL VKGNQ +L +A E F + + + D + E+ HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASV 237 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 I +W + S R + K+ ++ RYYISS L+AE+ A A+R Sbjct: 238 LSAKG--IVDPADWPKCVTIGRIDSMRVV---GDKQSDLERRYYISSRALSAEQLAAAVR 292 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKM 358 HW VEN+LHW LDV +ED + + NA + S +R IA+ I+ DK K+ LR K Sbjct: 293 AHWGVENRLHWILDVSFSEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKR 352 Query: 359 RKAAMDRNYLASVLT 373 + AA D +L Sbjct: 353 KGAAWDDGVRERMLG 367 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. 
Identities = 129/373 (34%), Positives = 205/373 (54%), Gaps = 10/373 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ D+L+++ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 + G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGD-RKT 120 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A+H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH---DSYAMSEKSHGREEIRLHIVC 241 +GGDY+ VK NQG+L F + P +S ++ HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 + L + W +K + R + ++ YYISS ++ + A AIR+ Sbjct: 241 PITPWLTQ-SQGWTNIKPVIEVTRKRYLKDKE----TSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN HW LD+ EDD +IRRG+A E + R A+N+ K ++ K+++A Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLARLSP-IKDSMKGKLKQA 354 Query: 362 AMDRNYLASVLTG 374 A +L Sbjct: 355 AWSDEVREKLLFA 367 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 342 bits (877), Expect = 1e-92, Method: Composition-based stats. 
Identities = 128/380 (33%), Positives = 217/380 (57%), Gaps = 14/380 (3%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ ++ + + + D R+ +H L D+L++ + AVI+GA+G I + E H ++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----DDKDVIAIDGKTLRHS 116 + +G+P HDTI R+++ + P F +CF W+ + D +++IAIDGKTLR S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ + G + + SA++ + +GQ+ +KSNEI PEL+ +D++ I+T DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA---HDSYAMSEKSHGRE 233 Q+D+AEKI GDY+ A+K NQ RL++ + + N+ A + + K HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 + R + +PDE + +W+GLK + VA+ +++ RYYISS A+ Sbjct: 249 DKRFYYQVKLPDE-VPAGEDWRGLKTIGVAIRI----SQENGRETCDTRYYISSLKPDAK 303 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +FA A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K K Sbjct: 304 QFAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKS-KES 362 Query: 354 LRRKMRKAAMDRNYLASVLT 373 + + R A + N+LA +L Sbjct: 363 VVMRRRMAGWNVNFLAEILG 382 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 342 bits (876), Expect = 2e-92, Method: Composition-based stats. 
Identities = 133/374 (35%), Positives = 183/374 (48%), Gaps = 14/374 (3%) Query: 7 MGHISII-PDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + S I PD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K Sbjct: 3 IQAFSAIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKARLPGL 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR--- 122 IP HDT++R S + F ECF W+ D V+AIDGK + + DKS Sbjct: 63 VSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKSSNSKN 121 Query: 123 --RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R ++++SA+S + + +GQ K +EKSNE AIPEL+ LD++ IIT DA+GCQK I Sbjct: 122 GVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIGCQKSI 181 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHI 239 + I + DY+ K N L E + H Y K HGR E R Sbjct: 182 TKLIIENKADYILCAKDNHEALRNIIEFNLSEESRYYLCHAKRYFEENKGHGRSEYREC- 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 VC L F W G+K L + S R + KE M RYYISS + +I Sbjct: 241 VCISAKNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDPIIILKSI 297 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN LHW LD+ EDD + + GNAA FS I +A+ +L K G+ K + Sbjct: 298 RPHWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQSD-IKLGMAGKRK 355 Query: 360 KAAMDRNYLASVLT 373 D V+ Sbjct: 356 ACGWDEKIRDKVIG 369 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. 
Identities = 127/351 (36%), Positives = 188/351 (53%), Gaps = 5/351 (1%) Query: 22 MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCI 81 + + L+++LL T+ +I A +++IE G D+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L+ + F +L HGR E R V D L + W GL + Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTC-IGHGRIEERTCQVADASAWLTEQHSGWAGLASIA 239 Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 ++ R+ ++ E+ R YISS + A R+HW VEN LHW+LDV ED+ Sbjct: 240 AVIATRT--DKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFREDE 297 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 298 CRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 339 bits (868), Expect = 1e-91, Method: Composition-based stats. 
Identities = 129/371 (34%), Positives = 194/371 (52%), Gaps = 9/371 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + + D R A+ +H DI+ L + AVISGA W +I+ FGE H D+L++Y F Sbjct: 2 SVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPF 60 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 E GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGKTLRHS++ + Sbjct: 61 ECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNPET-QS 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A+H ++ +S L++ Q K+ K NE A+ E+++ +K +IT DAM QK IAEKI Sbjct: 120 ALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKIAEKI 179 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHIVCDV 243 ++ GDY+ +K N + E F + P ++Y R + R + V Sbjct: 180 IEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYRKLKV 239 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHW 303 D L EWKG+K + RS + +YISS D+ + A +R HW Sbjct: 240 SDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDIQILAKCVRGHW 294 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ A Sbjct: 295 EVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLTAAGW 353 Query: 364 DRNYLASVLTG 374 + +L G Sbjct: 354 SDEFRDELLLG 364 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 336 bits (861), Expect = 8e-91, Method: Composition-based stats. 
Identities = 125/377 (33%), Positives = 194/377 (51%), Gaps = 13/377 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG--D 63 L+ S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGS-N 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 121 ELLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPAHDSYAMSEKSHGREEIRLH-I 239 I +GGDY+ VK NQ L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I ++ + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLTGS 375 A + +Y S++ + Sbjct: 357 LHADRNESYRESLIALA 373 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 332 bits (850), Expect = 2e-89, Method: Composition-based stats. 
Identities = 124/374 (33%), Positives = 193/374 (51%), Gaps = 9/374 (2%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 G+P T ARV S I P +F C WM D+I +DGK+L S + + + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 A H+++A+ + +G+++ +KSNEI AIP LLN L+++G II+ DAMG QK IA Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEK---SHGREEIRLHIV 240 I+ + DY+ A+K N R + E F + + Y E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD-LTAEKFATAI 299 + + W+ L+ + S R + E+E RYYI+S + + AI Sbjct: 254 LPM-MYFHKYKKYWRDLQAIVRVQSKR----HKGNEIETATRYYITSLPFAEHRRMSQAI 308 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN+LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K Sbjct: 309 RQHWAIENQLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRI 368 Query: 360 KAAMDRNYLASVLT 373 +AA+ YL V+ Sbjct: 369 QAALSTRYLRKVVG 382 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 331 bits (847), Expect = 4e-89, Method: Composition-based stats. 
Identities = 117/370 (31%), Positives = 185/370 (50%), Gaps = 7/370 (1%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L H+S++ D R H L D+L L + AV SG +GW +I+ FGE ++L+++ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 NGIP TIAR++ + P C +W+ D ++ K +IAIDGKTLR + Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLGC--N 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVP 244 + GDY+ VK NQ L +A + ++ + ++ +A SEK HGR E R+ Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQRITFQIPSK 239 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 +W +K L R I + + +Y+SS D+ E ATA+R HW Sbjct: 240 LSP-KLQEKWPSVKTLIAVERHRKIGNK----TSIETSFYLSSHDIDPEYIATAVRGHWR 294 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 295 IENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLLS 354 Query: 365 RNYLASVLTG 374 Y ++ Sbjct: 355 DEYRELMIFA 364 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 327 bits (839), Expect = 3e-88, Method: Composition-based stats. 
Identities = 112/369 (30%), Positives = 189/369 (51%), Gaps = 6/369 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H++++ + R +H L D++ L I A++SGAEGW DIE +G++ D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 ++ + VK NQ +L +A + +F E HGR+E R Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHV 305 + T +W ++ + RS + + YY+SS + IR HW + Sbjct: 248 PP-ELTEKWPTIRSIIAVERHRSANGKG----TVDTSYYVSSLSPKHKLLGHYIRQHWRI 302 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN H+ LDVV NED +I +A E + R +NI+ R K+++A + Sbjct: 303 ENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGWND 362 Query: 366 NYLASVLTG 374 +Y A + G Sbjct: 363 DYRAQLFFG 371 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 325 bits (832), Expect = 2e-87, Method: Composition-based stats. 
Identities = 127/382 (33%), Positives = 194/382 (50%), Gaps = 15/382 (3%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + + ++ I D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV------IAIDGKTLRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGKT+ S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S + A HV++AF++ LV+GQIKTDEKSNEITAIPELL + +K ++T DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD------SYAMSEKSHG 231 K+IA KI ++GGDY+ AVKGNQ +L + + + EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 R E R + +W+G+ + + R + + K + S + Sbjct: 241 RIEKRECY-LSNDLSWFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEAQ 299 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFK 351 A+ R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L K Sbjct: 300 AKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTCK 359 Query: 352 AGLRRKMRKAAMDRNYLASVLT 373 G+R K + + VL Sbjct: 360 CGMRSKRKLCGLGIPTALQVLG 381 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 324 bits (829), Expect = 4e-87, Method: Composition-based stats. 
Identities = 126/386 (32%), Positives = 191/386 (49%), Gaps = 20/386 (5%) Query: 1 MELKKLMGHISIIPDYRQA-WKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK 59 M L L + +PD R H L+DIL + CAVI+GAEGWEDI ++G + F + Sbjct: 1 MALP-LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFR 59 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------DDKDVIAIDGK 111 ++ + +NG+P HDT RV + + P F + F W + + D +A+DGK Sbjct: 60 RFLELKNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGK 119 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITT 171 + R S + G +H++ + +L++GQ E +EIT ++L LD+ G ++T Sbjct: 120 SARRSAKPT-FSGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTL 178 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA-HDSYAMSEKSH 230 DA GCQ + E I+ +GG+Y+ VKGNQ L A F A D + +H Sbjct: 179 DAAGCQTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAH 238 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL 290 GR E R V PD L W G+ + + R + + E T YY+SS + Sbjct: 239 GRHEERNVTVVHDPDGL---PAGWAGVGSVALVCRDRQVKGKAN---ESTAHYYLSSLRV 292 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 A + A IR HWH+E+ +HW LDV ED+ + R G+A IR +A+++L Sbjct: 293 GAAELAGYIRGHWHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAG-K 350 Query: 351 KAGLRRKMRKAAMDRNYLASVLTGSG 376 K + + +A D Y+A VL G Sbjct: 351 KGSIHTRRLRAGWDDQYMAQVLQGLS 376 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 323 bits (827), Expect = 7e-87, Method: Composition-based stats. 
Identities = 141/392 (35%), Positives = 198/392 (50%), Gaps = 27/392 (6%) Query: 1 MELKKLMGHISIIP--DYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL 58 +E+ L + D R +H+ S I+L+ I AVI GA+ W IEDFG++ F Sbjct: 9 IEISNLHEFADSLILIDNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFF 68 Query: 59 KQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSY- 117 NGIP HDT R S + P KF E + W++ IAIDGKT+R +Y Sbjct: 69 AAKLSNFNGIPSHDTFNRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYE 127 Query: 118 --------------DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD 163 D + + +HVISAF+T + +GQ+ T EK NEI IPELL+ML Sbjct: 128 SEQDKRHRKQGVLPDSNTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLC 187 Query: 164 IKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL--KELNNPAHD 221 IK IIT DA+GCQ+ IAEK+ K GDY+F VK NQ +L + + D Sbjct: 188 IKDCIITIDALGCQRTIAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFD 247 Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMT 280 Y E+ HGR E R+ C+ P L D +WK ++ + R+ K + Sbjct: 248 KYETHEEGHGRNESRICYCCNDPGFLGADIRKKWKNIQSFGYIENTRNT----NKGTTVE 303 Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 R +ISS + A+K R HW +EN LHW+LDV +ED+ + RR +A FS + IA Sbjct: 304 KRCFISSLEPDAQKILKNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIA 362 Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 + L N+K + + RK A D +L ++ Sbjct: 363 LATLRNNK-REIPINRKRLIAGWDNEFLWELI 393 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. 
Identities = 129/377 (34%), Positives = 195/377 (51%), Gaps = 13/377 (3%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L+ L+ H S I D R ++ H L +ILLL +C ++ + +E+I +G H FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPGRA-DFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD----IKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q +K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 IA I+ QG DYL AVK NQ L E F + D + +K HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAV----GDGADHHHDLDKGHGRVEERHV 246 Query: 239 IVCDVPDELIDFTFEWKGLKKL--CVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 V D L + G +L A+ A RY+ISSA LTAE A Sbjct: 247 SVIREVDWLSGTRR-FPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHAA 305 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ K L+ Sbjct: 306 DAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQK-SLKT 364 Query: 357 KMRKAAMDRNYLASVLT 373 + + A +YLAS+L Sbjct: 365 RRKMAGWSDDYLASLLN 381 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 322 bits (825), Expect = 1e-86, Method: Composition-based stats. 
Identities = 128/379 (33%), Positives = 200/379 (52%), Gaps = 17/379 (4%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+ + I D RQ K+ H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD---SYAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 + + L WKGLK + + R + ++ K L + RY+ISS E + Sbjct: 239 EYYQTEKIKWLSQ-KKAWKGLKSIIM---ERKTLEKEGKRL-IEYRYFISSLKEEIETVS 293 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF--KAGL 354 A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V K + Sbjct: 294 RAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 355 RRKMRKAAMDR-NYLASVL 372 R+K + +L VL Sbjct: 353 RKKRYVIGLRPIKHLEEVL 371 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. 
Identities = 121/412 (29%), Positives = 185/412 (44%), Gaps = 44/412 (10%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + I I D R+ K+ + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK------------------- 103 P HDT+ R I + C+ W + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 104 -DVIAIDGKTLRHSYDKSR--------------RRGAIHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT+ + + + +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDIK-GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP+LL+ +DI+ G ++T DA+G QK I EKI ++ DYL VK N +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPAHDSYAMSE---KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 ++ +D +E + HG R I C P L +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIK 316 Query: 265 SFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + IA E++ +ISS E R HW VEN LHW+LDV NEDD + Sbjct: 317 TEKINIA--TGEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAELFSGIRHIAINILT--NDKVFKAGLRRKMRKAAMDRNYLASVLTG 374 + N+A+ FS + +A+ IL D+ K + RK +KA YLA+++ Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLINN 425 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. 
Identities = 121/374 (32%), Positives = 185/374 (49%), Gaps = 11/374 (2%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + I D RQA K+ H++ ++L++ C+ + E + D+ DF ++ +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D + IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDL----EGRHIAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI EKSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSY---AMSEKSHGREEIRLHI 239 +I G DY+ A+K N R ++ + F E + + + E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ + R + V Y++ S E+ A + Sbjct: 237 ITEELDWYHK-SWKWAGLQSVAQV--RRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHP-AKVSLRRKRK 352 Query: 360 KAAMDRNYLASVLT 373 A MD + +L Sbjct: 353 LATMDPAFRLQMLG 366 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 320 bits (820), Expect = 4e-86, Method: Composition-based stats. 
Identities = 131/381 (34%), Positives = 201/381 (52%), Gaps = 21/381 (5%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + ++ + + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---------DKDVIAIDGKTL 113 ENGIP HDT+ RV + + P E W SD K ++AIDGKT+ Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 R + S ++ A+H+++A++T + GQ+ T+EKSNEITAIPELL+M+ +KG +++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGRE 233 MG QK IA+KI K+ DY AVK NQ L + F ++ A D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFE---MSQEADDHYHTVEKAHGQI 240 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 E R + V L E+ ++ + A I ++ + RY+I S ++A+ Sbjct: 241 ETRAYEVIHDVSWLRKTHPEFGHIQSIGRA----RIHLDKNGQESEESRYFILSCQVSAK 296 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKA 352 + +R HW +E+ +HW LDVV ED K A + + + +L K Sbjct: 297 ELCDYVRGHWQIES-MHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLKQLDFGKKM 355 Query: 353 GLRRKMRKAAMD-RNYLASVL 372 +RRK ++ YL +L Sbjct: 356 SMRRKKYALSLSFDKYLKQLL 376 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 320 bits (820), Expect = 5e-86, Method: Composition-based stats. 
Identities = 114/371 (30%), Positives = 187/371 (50%), Gaps = 7/371 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ H+ I D R EH + DI L + AVISGA+ W +FG ++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ + + A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKGAKASAS-SAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 K+GGD + VKGNQ +L +A + +F NNP + + + K HGR E R+ C + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHV 305 + +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 P-AEIKMKWSQLKTLIAVERHRKVGNK----TSIDTHFYVSSAVLTSEAFGRAIRAHWQT 294 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 295 ENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHP-AKTSQTQKFNRACWSD 353 Query: 366 NYLASVLTGSG 376 ++ ++ G+G Sbjct: 354 DFREEIIFGTG 364 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 315 bits (807), Expect = 1e-84, Method: Composition-based stats. 
Identities = 111/369 (30%), Positives = 177/369 (47%), Gaps = 12/369 (3%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 +PD R H L D+L + + A I GAE D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 HDT +RV + P F CF ++ D D V+AIDGKTLR S+D++ R A+HV+ Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189 SAF++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GG Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELID 249 D+LF +K N+ L E F + ++ HGR E+R H V L Sbjct: 188 DWLFPLKDNRPALRAEVERYFADP--ATVLAVPHVTTDADHGRIEVRRHWVSHDVAWLAS 245 Query: 250 FTF-----EWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 GLK L + + T Y+SSA L + A A+R HW Sbjct: 246 DRRFPDEAVLPGLKILGLVER---TVTSPDGRTTATRTLYLSSAALEPKTLARAVRAHWS 302 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +E +HW LD +ED + R+ + E + +R +A+N++ + + +R + ++A Sbjct: 303 IEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSAN-NQDSIRLRRKRAGWS 361 Query: 365 RNYLASVLT 373 +Y ++L Sbjct: 362 DDYARTILG 370 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 314 bits (805), Expect = 3e-84, Method: Composition-based stats. 
Identities = 103/376 (27%), Positives = 173/376 (46%), Gaps = 17/376 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + + +PD R A H L++IL + + A + GA D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDGKTLRHSY 117 +NG+P HDT +RV + P F + F +M+ K VIA+DGK LR Y Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + R +++A++ + + ++ NE +L+ +L +KG ++T DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRL 237 + +AE I+ +GGDY+ AVK NQ L + + S + HGR+E R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQGK--PSTITVDAGHGRKEKRR 239 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 +V VP D ++ GLK + S R + RY++ S + Sbjct: 240 AVVAAVPQMAQD--HDFAGLKAVARITSKR-------GTDKTVERYFLMSQAYPPKDVLR 290 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN LHW LDVV++ED + R+ NA + +R +A+N+ LR K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 MRKAAMDRNYLASVLT 373 +++A + +L ++ Sbjct: 351 LKRAGWNDTFLFELIQ 366 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 308 bits (789), Expect = 2e-82, Method: Composition-based stats. 
Identities = 105/374 (28%), Positives = 174/374 (46%), Gaps = 13/374 (3%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + + + +PD R A + H L ++L++ +V+ G+ ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-DKDVIAIDGKTLRHSYDK 119 + ++ IP HDT + V I P F + D D D+IAIDGK LR + D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 ++SA+++ L + + D + E++A E L ++D++GK++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHI 239 I GGD+ A+KGNQ L F ++P HGR+E R + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSDPTA---VTENTGHGRKETRKAV 244 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 V + E+ GLK + R E ++ RY+ S T E A+ Sbjct: 245 VVSAKA--LAEYHEFPGLKGFGRIEATR----ETGGKVTSETRYFALSWVPTPEVLLAAV 298 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN LHW+LDV ED + R+ N + +R A+++L D K L K++ Sbjct: 299 RDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIK 357 Query: 360 KAAMDRNYLASVLT 373 +A D +L S+L+ Sbjct: 358 RAGWDTTFLRSILS 371 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 303 bits (776), Expect = 6e-81, Method: Composition-based stats. 
Identities = 103/370 (27%), Positives = 167/370 (45%), Gaps = 13/370 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ +PD R A H L ++L++ +V+ GA ++ FG + + + Sbjct: 37 ILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-DKDVIAIDGKTLRHSYDKSRRRG 124 + +P HDT + V I P F + D + D DVIA+DGK LR + D Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++SA++ L + + D + E+ A E L ++ +KGK++T DA+ C + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVP 244 GGD+ A+K NQ L F + AH S + HGR E R V Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEP---DAHPSALSEDIGHGRTETRKATVVSSK 271 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWH 304 + E+ GLK + R + + RY+ S T E +R HW Sbjct: 272 A--LAEHHEFPGLKAFGRVEATR----KTAEGTTSETRYFALSWVPTPEVLLATVRAHWA 325 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW+LDV ED + R+ N+ + +R A++++ D K L K+++A D Sbjct: 326 IENSLHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWD 384 Query: 365 RNYLASVLTG 374 ++L +VL G Sbjct: 385 DDFLRNVLNG 394 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 300 bits (769), Expect = 4e-80, Method: Composition-based stats. 
Identities = 115/385 (29%), Positives = 174/385 (45%), Gaps = 43/385 (11%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWMRDCHSSDDK--------------------DVIAIDGKTLRHSYDKSR-------- 121 + W + IAIDGKT+ + + + Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK-GKIITTDAM 174 +H++SAF + SL +GQ + K NEI AIP+LL+ +DI+ G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSE---KSHG 231 G QK I EKI ++ DYL VK N +L + E ++ +D +E + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA E++ +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIA--TGEIQNEKHCFISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT--NDKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVLTG 374 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLINN 382 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 300 bits (767), Expect = 7e-80, Method: Composition-based stats. 
Identities = 104/369 (28%), Positives = 186/369 (50%), Gaps = 7/369 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG D+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + ++K +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPD 245 + D++ +KGNQ A + ++PA + HGR+E R + + Sbjct: 182 SKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRRVMQIEGNL 240 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHV 305 + + +W ++ L S R++ + + R+Y+SS + + A IR HW + Sbjct: 241 PP-ELSEKWPHIRTLVEVASERTVGNK----TACSSRWYVSSLPVDTAQLADIIRAHWAI 295 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN+LHW LDVV ED+ + + A+ + A++++ + K L K + AA D Sbjct: 296 ENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQSAAWDP 355 Query: 366 NYLASVLTG 374 + + +L G Sbjct: 356 AFRSELLFG 364 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 297 bits (761), Expect = 3e-79, Method: Composition-based stats. 
Identities = 119/387 (30%), Positives = 182/387 (47%), Gaps = 20/387 (5%) Query: 3 LKKLMGHISIIPD------YRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPD 56 + ++ I I D RQ+WK+ + LS IL L ++G E +++EDF E + Sbjct: 1 MTTMIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEP 60 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRH 115 Y D G P HDT+ RV+S ++ + E + + + S D +I++DGKT+R Sbjct: 61 LFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG 120 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ + +H+++A+ H L +GQ+ +EKSNEI AIP+LL +DI+ I+T DAMG Sbjct: 121 NRGKN--QKPVHIVTAYDGGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMG 178 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEKSHGR 232 Q I + I K DY AVKGNQ L F Y EKS G+ Sbjct: 179 TQTAIVDTIIKGKADYCLAVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQ 238 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTA 292 E+R + V L +W L+ + ++ +L RY+I S Sbjct: 239 IEVREYWVSSDIKWLCQNHPKWHKLRGIG----MTRNTIDKDGQLSQENRYFIFSFKPDV 294 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVF 350 FA +R HW +E+ +HW LDVV +ED + AA + IR + + L Sbjct: 295 LTFANCVRGHWQIES-MHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK 353 Query: 351 KAGLRRKMRKAAMD-RNYLASVLTGSG 376 RRK R ++ +YL + G Sbjct: 354 DLSYRRKQRYISVHLEDYLVQLFGERG 380 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 297 bits (761), Expect = 3e-79, Method: Composition-based stats. 
Identities = 107/307 (34%), Positives = 162/307 (52%), Gaps = 7/307 (2%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +L + +PD R + H LS++L + +CAV+ GA + D+ +G+++ +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKS 120 + G+P HDT RV++ I PA F F+ W+ + D V+AIDGKT R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Q I Sbjct: 125 TSG-PLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A I+ +G DY+ VK N L + + K HGR E+R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 D +L + +W GL+ + R++ + + YYISS A + A A+R Sbjct: 244 YDAVSQLYK-SEQWAGLQSFALVERERTV----DGKTSVERHYYISSLPADAARIAQAVR 298 Query: 301 NHWHVEN 307 +HW VE+ Sbjct: 299 SHWAVES 305 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 296 bits (757), Expect = 1e-78, Method: Composition-based stats. 
Identities = 115/339 (33%), Positives = 172/339 (50%), Gaps = 4/339 (1%) Query: 38 ISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F CF ++ Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 LL L + G I+T DAMGCQ IAE+I+ +G D L +K N G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 PAHDSYAMSE-KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 A + HGR R V L + W L ++ + R I Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFVDAAATALAPLSG-WPDLSRVLAVETLRGIPG--TGT 240 Query: 277 LEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGI 336 + +RY+++S IR HW VEN LHW L+V EDD ++R AA F+ + Sbjct: 241 VVADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNFALV 300 Query: 337 RHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGS 375 R IA+N++ D+ +A LR + +KAA D +Y+ ++ Sbjct: 301 RKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIANQ 339 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 289 bits (740), Expect = 1e-76, Method: Composition-based stats. 
Identities = 104/372 (27%), Positives = 169/372 (45%), Gaps = 14/372 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + +PD R +H L +IL + + AV+ GA ++E F + D L+Q+ E Sbjct: 3 FLDVFGEVPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD----DKDVIAIDGKTLRHSYDKSR 121 G P HDT +RV++ + P +E F+ +M K +A+DGK+LR +Y K R Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 V++ F + + Q ++ E+ A L +L +KG +T DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 + ++ GG Y+ A+KGNQ +L A + E +HGR E+R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKAA-AGKATKFHQTEEDAHGRHEVRRAFVI 239 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 L + S+R++ + VR Y S + A + +R Sbjct: 240 PFAQTPGKNALV--DLCAIGRVESWRTVE----GKTTHKVRCYALSRKMPAHELLATVRR 293 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW+LDV++ ED + R+ N A + +R + +N+L D K L K KA Sbjct: 294 HWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRADP-EKIPLSHKRLKA 352 Query: 362 AMDRNYLASVLT 373 L S+ T Sbjct: 353 RWADQDLLSLFT 364 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 284 bits (726), Expect = 3e-75, Method: Composition-based stats. 
Identities = 116/368 (31%), Positives = 173/368 (47%), Gaps = 15/368 (4%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYD 118 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-VK 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K HV+SAFS + Q+ D K+NEI AI +LL++LD+ G +++ DA+G Q Sbjct: 123 KLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQT 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 I E+I +GGDY+ VK NQ + E F + D +E SHGR E R + Sbjct: 183 AIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRRY 240 Query: 239 --IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 I+ + E + KGL+ + V R ++ + V YYISS Sbjct: 241 ESILNPLEIEANEVLTRRKGLRSIHKVVRKRR--DKKSDKTSEEVAYYISSLT-DVSSLK 297 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLR 355 AIR HW +ENKLH LDV D R N A++ I+ I + I+ K K+ + Sbjct: 298 QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKTNMKSSIP 357 Query: 356 RKMRKAAM 363 R +K A Sbjct: 358 RIQKKPAR 365 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 270 bits (691), Expect = 5e-71, Method: Composition-based stats. 
Identities = 103/370 (27%), Positives = 178/370 (48%), Gaps = 17/370 (4%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ Sbjct: 5 IWNAIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKL 64 Query: 66 NG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYD 118 +G +P HDT V I P +F E + ++ + + IAIDGKT R Sbjct: 65 SGKELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG-IK 123 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 ++ +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G Sbjct: 124 QTANSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYV 183 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 ++ E I +GG+++ VKGNQ +L + E++F N + D + HGR E R Sbjct: 184 EVIEMILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSAD--TQEDIGHGRVEKRTV 241 Query: 239 IVCDVP---DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 D++ +WKG+K L R + + K + YYI++ ++ Sbjct: 242 YCITEIKTDDDIDGCMQKWKGVKTLVKI--VREVYKKADKSTRIETVYYITNLI-DPKEI 298 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA-GL 354 AIR HW +EN LH LDV++NED + N E F + +A+ I+ + + Sbjct: 299 NRAIRAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRGISM 358 Query: 355 RRKMRKAAMD 364 R + Sbjct: 359 NRTRKLCGYS 368 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 267 bits (683), Expect = 4e-70, Method: Composition-based stats. 
Identities = 106/376 (28%), Positives = 169/376 (44%), Gaps = 14/376 (3%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + + I D R H L+++L L + A + GA+ +I +F E LK+ Sbjct: 1 MPFALSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIV 59 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH-----SSDDKDVIAIDGKTLRHSY 117 +G P HDT +R+ I P + ++ + V+A+DGK LR Y Sbjct: 60 TLRHGCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGY 119 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 +K R ++S + L + + + S+E+ A LL +D+KG I+T DA+ C+ Sbjct: 120 EKGRAFMPPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCR 178 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRL 237 D A+ + + Y A+K N+GRL E F + + E HGR E R Sbjct: 179 PDTAKALIGRKAHYALALKANRGRLFACAEAGFVAADAAGDLA-FHETRETGHGRLETRR 237 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 V + + GLK + + R +VRY S L K A Sbjct: 238 ASVLPLKA--FKQAPAFPGLKAIGRIQATRQ---GADGRAVTSVRYIALSKVLAPHKLAE 292 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN+LHW LDVV +EDD + R+ NA + + IR +A +IL + K + K Sbjct: 293 VVRAHWTIENQLHWSLDVVFHEDDARSRKDNAPQNLAVIRRLARDILAAHPLDK-PIASK 351 Query: 358 MRKAAMDRNYLASVLT 373 MR+ +R++ T Sbjct: 352 MRRVNWNRDFFHEFFT 367 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 267 bits (681), Expect = 6e-70, Method: Composition-based stats. 
Identities = 101/386 (26%), Positives = 170/386 (44%), Gaps = 30/386 (7%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 E+ L+ ++ +PD R + H L+ +L LT CAV++GA + ++ P+ L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFE-------NGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSDDKDVIAIDGKTL 113 P TI RV++ I W + +A+DGK+L Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITTD 172 R + RR +H+++A + LV+ Q+ EK+NEIT LL+ L D+ G ++T+D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGR 232 A+ Q D A ++ + Y+ VK N +L+ + P +++ HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLK-SLPWQQIPL----QDRTRTTGHGR 270 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA---D 289 EIR VC V + L + G ++ V R + ++ + Y ++S Sbjct: 271 CEIRRLKVCTVNNLL------FPGARQAVQIVRRR--VNRTTGKVSLKTIYAVTSLAAEQ 322 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + A IR HW VE LH DV ED ++R GNA + + R++AI L V Sbjct: 323 APPARVAQLIRGHWTVEA-LHHVRDVTFAEDASQLRSGNAPQAMATYRNLAIGALRLAGV 381 Query: 350 FKAGLRRKMRKAAMDRNYLASVLTGS 375 + +R+ A D+ + L + Sbjct: 382 RN--IAAGLRRTARDQTRTLTHLGLT 405 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 266 bits (680), Expect = 8e-70, Method: Composition-based stats. 
Identities = 89/365 (24%), Positives = 167/365 (45%), Gaps = 18/365 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ P + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKS 120 +P TI +V + + +D +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T +KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQ-GGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHI 239 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSD---PVERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA---DLTAEKFA 296 + V L + +++ + R ++ V Y I S + A Sbjct: 276 ILTVARGL-----RFPYAQQVIQIIRRRRVLGA--GAWSTEVVYAICSLPCEQAPPKLLA 328 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 + IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 329 SWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHRRAGHSNIARAC 388 Query: 357 KMRKA 361 + A Sbjct: 389 RRLAA 393 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 266 bits (680), Expect = 9e-70, Method: Composition-based stats. 
Identities = 87/352 (24%), Positives = 158/352 (44%), Gaps = 16/352 (4%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL-KQY 61 + L+ + + D+R+ H L +L++ I + G G+ ++ +F + + L +++ Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWM-RDCHSSDDKDVIAIDGKTLRHSYDK- 119 +P + TI RV+ + + F W + DD + + +DGK+L+++ Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -SRRRGAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + ++ I +S FS LV+ + + +K +EI ++ ++ K+ T DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRL 237 K I K DY+ VKGNQ L K ++ ++ + + SHGR+ R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDL----SNSSKPESCFLEQDNSHGRKISRK 236 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT 297 V V ++ L+++ S + YYISS +A+ FA Sbjct: 237 IEVFKVRKNE---RQGFENLRRVIKVERKGSRGDKTY----EETAYYISSLTESAQVFAK 289 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 IR HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 290 IIRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFRGLGF 341 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. 
Identities = 105/360 (29%), Positives = 161/360 (44%), Gaps = 41/360 (11%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L+ + + I D RQ K+ H+ I++ + V + + W ++ DF DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD--------------------D 102 P HDT+ R + P + W + + Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 KDVIAIDGKTLRHSYDKSRRR--------------GAIHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT++ + ++ RRR +H++SAFS L +GQ + D+K Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAF- 206 NEI AIP LL+ LDI +G ++T DAMG QKDI +I K+ YL VK NQ L + Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F L N + + E HG +R VC L +W+ L+ + Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIR 316 Query: 265 SFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + R + E E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 317 TER--VDEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. 
Identities = 94/253 (37%), Positives = 147/253 (58%), Gaps = 7/253 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++H+++A+ + +L++GQ+K D+KSNEITAIP+LL ML ++G I+T DAMGCQK IA++ Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP-AHDSYAMSEKSHGREEIRLHIVCD 242 I + DY+ AVK NQ L + + F ++N H + + HGR E R + Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYS-TI 119 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 V D+L+ W L + + S R + + RY+I S + A++F A+R H Sbjct: 120 VGDDLLAGITGWDNLNAIGMVESKREVGN----TISNEKRYFIMSINGHAQRFGDAVREH 175 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK + A Sbjct: 176 WGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRKMAG 234 Query: 363 MDRNYLASVLTGS 375 D ++L VLTG+ Sbjct: 235 WDNSFLIKVLTGN 247 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. 
Identities = 108/349 (30%), Positives = 165/349 (47%), Gaps = 16/349 (4%) Query: 3 LKKLMGHISIIPDYRQAWK--MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL S IPD+R+A K + HKLSDI++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H ++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + +EKSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 QKDI +KI+++ GD++ +K NQ L E+K +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKEL---SPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ Sbjct: 327 GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVF 374 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 256 bits (653), Expect = 1e-66, Method: Composition-based stats. 
Identities = 110/292 (37%), Positives = 157/292 (53%), Gaps = 9/292 (3%) Query: 3 LKKLMGHISIIPDYRQA-WKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 L +IPD R+A H LSDIL + +CAV+SG + WE + +FG T +L+Q+ Sbjct: 11 LPNPKPFFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQF 70 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMR-DCHSSDDKDVIAIDGKTLRHSYDKS 120 NGIP HDT RV S I P F F +W D D +A+DGKT+R S+ S Sbjct: 71 LPLANGIPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGS 130 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R A+H++ A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK + Sbjct: 131 AGR-ALHLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAV 189 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 A +I + GGDY+ A+KGNQ L+ + +P + EK HGR E R V Sbjct: 190 ARQITEAGGDYVLALKGNQSALHDDVRLFMETQADRHPQGQA-EAVEKDHGRIETRRIWV 248 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTA 292 D D L +W GLK L + S R + ++ R +I+S Sbjct: 249 NDEIDWLTQ-KPDWPGLKTLVMVESRREL----NGQVSCERRCFITSHTADP 295 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 255 bits (651), Expect = 2e-66, Method: Composition-based stats. 
Identities = 105/363 (28%), Positives = 168/363 (46%), Gaps = 18/363 (4%) Query: 3 LKKLMGHISIIPDYRQ--AWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K L + +PDYR+ ++KL DILLL I + DI FG+ + + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---DKDVIAIDGKTLRHSY 117 G +G+P T+ R+ I E + H D++ IDGK +R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 ++ R I +SA+S + + +EKSNEIT++P+LL+ +D+ G I+T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEK-SHGREEIR 236 K I +KI+++GGD+L +K NQ L E+ L D Y+ HGR E R Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVEL----AEPVDVYSEGPFLEHGRIETR 251 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 + + D LI +W G L V + + + R+Y+SS +A + Sbjct: 252 VCRIFRGND-LITDREKWNG--NLTVVEIRTATERKSDGQKSSERRFYVSSFHGSARRLG 308 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT--NDKVFKAGL 354 T R HW +E+ +HW LD + +D + +A I+ + + IL+ K K Sbjct: 309 TIARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAILSIWKGKRKKPSE 367 Query: 355 RRK 357 + K Sbjct: 368 KAK 370 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 244 bits (622), Expect = 4e-63, Method: Composition-based stats. 
Identities = 90/388 (23%), Positives = 157/388 (40%), Gaps = 29/388 (7%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDF 57 ++ L+ + I D R+A + LS +L + A ++GA G +I DFG+ Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQ---YGDFENGIPVHDTIARVVSCISPAKFHECFINW--MRDCHSSDDKDVIAIDGKT 112 L + P I + + A F W + V+A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKII-T 170 LR ++ + +R + +SA LV GQ++ + +NEIT + LL L DI G ++ T Sbjct: 141 LRGAWSEGNKRVTL--LSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 TDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSH 230 DA+ Q + A + + G DY VKGNQ L + F + + E+ H Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLY---RKTFEQTLPLLQKPPQHEVEERGH 255 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA-- 288 GR + + + + V R + + + ++S Sbjct: 256 GRIKKWQAWTTEAK------GIGFPEVATAAVI--RRDEFDLKGIRVSREYAHILTSVAG 307 Query: 289 -DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 TA IR HW +EN++H+ D ED + GN+ + R++AI I+ + Sbjct: 308 NRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGIIRRN 367 Query: 348 KVFKAGLRRKMRKAAMDRNYLASVLTGS 375 + K ++ + A DR+ + +L + Sbjct: 368 GIRK--IKETLEYIAGDRDRVLPLLATA 393 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 243 bits (619), Expect = 9e-63, Method: Composition-based stats. 
Identities = 77/378 (20%), Positives = 129/378 (34%), Gaps = 35/378 (9%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L+ + +PD R+ + L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH--------------SSDDKDVIA 107 P T RV+ P E W + VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKTLRHSYDKSRRRG--AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML--- 162 DGKT+R + ++ V+ V+ ++ +EI A+ ++ L Sbjct: 151 ADGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVACEPVND-GDEIGAVRTVMGRLADR 209 Query: 163 --DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH 220 + G ++ TDA Q + E++ GG +L VK NQ R+ P ++ Sbjct: 210 WGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRILAKVR-ALPWAQVRA--- 265 Query: 221 DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEL--E 278 K+HGR E R V P G ++ Sbjct: 266 -QDTCRGKAHGRAETRTVRVVQAP---THVDLALAGTAQVIKITRHTRRRPHPGAPAAST 321 Query: 279 MTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSG 335 Y ++S A +R+HW +EN++HW D +ED R GN + Sbjct: 322 RENAYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPINLAC 381 Query: 336 IRHIAINILTNDKVFKAG 353 +R+ AI Sbjct: 382 LRNTAITRHRAHGASNIA 399 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 240 bits (611), Expect = 9e-62, Method: Composition-based stats. 
Identities = 90/247 (36%), Positives = 140/247 (56%), Gaps = 3/247 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K E++ EITAIP L+ +L++ G ++T DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAHDSYAMSEKSHGREEIRLHIVCD 242 +G DY A+KGNQ L + +E F E H + EK R E+ + Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAYRTE 248 Query: 243 VPDELID 249 Sbjct: 249 QERLWSH 255 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. 
Identities = 89/363 (24%), Positives = 143/363 (39%), Gaps = 48/363 (13%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L+ + G + +TH + L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 CFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEK 148 F+ W+ + S + +A+DGK L + +K++ +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRN-THLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEE 208 +NEIT IPELL +LDI G I+T DA+G Q I E+I +QGG + VK NQ + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPLKELNNPA-----------------HDSYAMSEKSHGREEIRLHIVCDVPDELIDFT 251 E + ++ EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQ 239 Query: 252 FEWKGLKKLCVAVSFRSIIAEQKK----------------------------ELEMTVRY 283 EW ++ + R + ++ Sbjct: 240 KEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTA 299 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 IS LTAE+ + R HW +EN+LH LD ED ++ S IR A NI Sbjct: 300 LISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSR--NNLSLIRKYAYNI 357 Query: 344 LTN 346 L Sbjct: 358 LRL 360 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 234 bits (596), Expect = 5e-60, Method: Composition-based stats. 
Identities = 85/395 (21%), Positives = 161/395 (40%), Gaps = 31/395 (7%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 E++ L ++ +PD R + H+L IL L+ AV +G + E+I + P + Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD---VIAIDGK 111 P DT+ RV+S + + + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIKGK 167 TLR + R A H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGR--APHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQ-KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMS 226 ++T DA+ + A+ I + G ++F VK N L+ + ++ ++ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPIG----HSAE 271 Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMT-----V 281 ++HGR E R + + + + + ++ V + + T Sbjct: 272 GRAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIPSTVT 331 Query: 282 RYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 + ++S L A R HW +ENK+HW DV ED ++R G + + +R+ Sbjct: 332 VHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRN 391 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 + I ++ + + +R+ D L ++LT Sbjct: 392 LIIGLIRLAGHNR--IAPTIRRIRHDNALLLAILT 424 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 233 bits (594), Expect = 8e-60, Method: Composition-based stats. 
Identities = 88/207 (42%), Positives = 134/207 (64%), Gaps = 1/207 (0%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + ++ Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF 210 I K+ DY+ AVK NQ +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 230 bits (586), Expect = 7e-59, Method: Composition-based stats. Identities = 88/240 (36%), Positives = 136/240 (56%), Gaps = 8/240 (3%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+ + I D RQ K+ H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD---SYAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. 
Identities = 79/383 (20%), Positives = 148/383 (38%), Gaps = 17/383 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD+R + ++L+ +L L + I+G + + ++ P + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS----SDDKDVIAIDGKTLRHSY 117 F +P TI R+V P + + W +A DGK ++ + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRRGAIH--VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + + V+ A +G + +EI ++ L+N + ++TTD + Sbjct: 144 SRPPQGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLVTTDCLH 202 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 + +A I+ +GG +LF++KGNQ + P E N + EK+HGR E Sbjct: 203 AHEPLARAIRAKGGHWLFSIKGNQPTVRAKL-AGLPWDEFGN----QHVTREKAHGRIEE 257 Query: 236 RLHIV-CDVPDELIDFTFEWKGLK-KLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 R L+ F + +K + E + +S+ + Sbjct: 258 RALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTDQASPA 317 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A R HW VE +H D M+ED IR NAA ++ R I+ L Sbjct: 318 QLARWARGHWTVEA-IHHVRDRTMDEDRHTIRTKNAALNWAIARDTTISALRLAGYKN-- 374 Query: 354 LRRKMRKAAMDRNYLASVLTGSG 376 +R+ R D + ++ + Sbjct: 375 IRQARRATIRDPGLVLQIIALTS 397 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. 
Identities = 83/380 (21%), Positives = 153/380 (40%), Gaps = 23/380 (6%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIE-------DFGETHPD 56 L+ ++ +PD R + H L +L + AV++GA + T Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--DDKDVIAIDGKTLR 114 + + P T R+++ + + W+ C + + V ++DGKTLR Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 S +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 147 GS---GPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIA-EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGRE 233 Q++ A + + Y+F VK NQ RL + + P ++ S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKT-LPWTKIPI----QDETSTRGHGRY 258 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE 293 +IR L ++ L + ++ + + + +S+A Sbjct: 259 DIRRLQAVTCTGPLALDFPHA--VQALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGPA 316 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A +R HW +E LH D ED ++R GNA + +R+ AIN+L + Sbjct: 317 ELADWLRGHWAIET-LHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGI--TT 373 Query: 354 LRRKMRKAAMDRNYLASVLT 373 + +R + + +L Sbjct: 374 IAAALRHNSRNPYRPLQLLG 393 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 225 bits (574), Expect = 1e-57, Method: Composition-based stats. 
Identities = 84/273 (30%), Positives = 136/273 (49%), Gaps = 9/273 (3%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L+ + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD------KDVIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + + IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMS--EKSHGREE 234 ++++A+ I +G YL +K NQ +++ F + A + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR 267 R C W GL + + + R Sbjct: 242 RRRVFACPDAGCFTTLRG-WPGLTTVLASETIR 273 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 224 bits (571), Expect = 4e-57, Method: Composition-based stats. 
Identities = 87/338 (25%), Positives = 137/338 (40%), Gaps = 22/338 (6%) Query: 28 DILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFH 87 +L + + A +G G+ + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ECFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRL 202 EKSNEI + LL +L ++T DAM Q A+ I YL VK NQ ++ Sbjct: 120 VAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 P E+ A D + HGR E R + + + K++ Sbjct: 180 LARI-TALPWAEVPAAATD----DSRGHGRVETRTLQIITAARGIG-----FPYAKQIIR 229 Query: 263 AVSFRSIIAEQKKELEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNE 319 R I A + + V Y I S + T +R H +EN LHW DV +E Sbjct: 230 ITRERLITA--TDQRSVEVVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTFDE 287 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 D + GN A++ + +R+ AIN+ + + Sbjct: 288 DRQRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACR 325 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 223 bits (567), Expect = 1e-56, Method: Composition-based stats. 
Identities = 84/324 (25%), Positives = 134/324 (41%), Gaps = 27/324 (8%) Query: 50 FGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ + NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKS 229 T DA+ C+ D A I GGDY A+K NQ L + E +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSAD 289 H R E R + V D ++ GL+ + + L VRY++ S Sbjct: 206 HDRCERRRACIVAVND------IDFPGLQAIGSVE---ATSRHADGRLTSHVRYFLLSTI 256 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 257 MSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHPD 316 Query: 350 FKAGLRRKMRKAAMDRNYLASVLT 373 KA +RRK++ A D +L S++ Sbjct: 317 -KASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. 
Identities = 81/418 (19%), Positives = 142/418 (33%), Gaps = 62/418 (14%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVIS-GAEGWEDIEDFGETHPDF------ 57 L+ ++I D R H L+ IL + CA ++ G + IE + + P Sbjct: 29 DLIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLH 88 Query: 58 -LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD------------ 104 + + P TI RV++ + + C ++ + Sbjct: 89 IWRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRR 148 Query: 105 ---------------------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 A+DGK L+ + R +H+IS + + + V Q Sbjct: 149 TEREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQR 206 Query: 144 KTDEKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAEK-IQKQGGDYLFAVKGNQ 199 + KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK NQ Sbjct: 207 QIPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQ 266 Query: 200 GRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + + A + + + HGR E R+ ++ + Sbjct: 267 PTLHATAITALTGTDTDFAAVT-HRETHRGHGRTEYRILRTAPA------DGIDFPYAAQ 319 Query: 260 LCVAVSFRSIIAEQKKELEMTVRYYISSA---DLTAEKFATAIRNHW-HVENKLHWRLDV 315 + + R V Y I+ A +R HW +EN +H DV Sbjct: 320 VFRVLRHR--GGLDGIRHSKEVCYGITDLTARQAGPAHLAAYVRGHWKAIENGVHHVRDV 377 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 ED C+ R + R++A L + R+ D + + Sbjct: 378 TFAEDACQARTATLPRALAAFRNLATGTLRRAGHVN--IAHARREHGYDHQRVLDLFN 433 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. 
Identities = 76/237 (32%), Positives = 118/237 (49%), Gaps = 7/237 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 ++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + + ++SA+S + + +GQ+K D+KS+EITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 DI + I +Y+ A+K N+ + + ++ + + + R Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRDEIINRVIRHVSEKCRTWK 239 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 214 bits (545), Expect = 3e-54, Method: Composition-based stats. Identities = 88/249 (35%), Positives = 127/249 (51%), Gaps = 14/249 (5%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YDKSR 121 IP HDT R S I P F F NW++ V+AIDGK +R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 + I + +Y+ A+K N+ + L K + + K+ + HGR E R Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCDV-PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT-AEKFA 296 V F + GLK + S R+I+A E VRYY++S D T E+ A Sbjct: 183 TVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVA--TGEYTQEVRYYVTSLDNTKPEEIA 240 Query: 297 TAIRNHWHV 305 +AIR HW + Sbjct: 241 SAIRQHWSI 249 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 
211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 77/388 (19%), Positives = 138/388 (35%), Gaps = 46/388 (11%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAV-ISGAEGWEDIEDFGETHPDFLKQYGD 63 + ++ IPD+R A + + L + + +CAV +G + + ++ + Sbjct: 23 GIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLR 82 Query: 64 FE------NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI----------- 106 + +P TI R ++ + ++ +D D + Sbjct: 83 LPWNPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAGD 142 Query: 107 --------AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPEL 158 A+DGKT R + + +H++ + ++GQ + D KSNE T L Sbjct: 143 QAVPVRAYAVDGKTSRGAKRADGSQ--VHLLGVAAHGAGALLGQREIDAKSNETTEFRAL 200 Query: 159 LNMLDIKGKIITTDAMGC-QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 L L++ G ++ DA+ + ++ + ++ YL K NQ +L AF P E+ Sbjct: 201 LAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLR-AFLAALPWTEIPT 259 Query: 218 PAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEL 277 ++ HGREE R V V ++ + +R + Sbjct: 260 ADLTR----DRGHGREETRTLKVATVT------HLDFPHAAQAIRIRRWRRQKGQP---A 306 Query: 278 EMTVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 Y I+ A A R WH+E K H+ DV ED R G + + Sbjct: 307 SHETIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLA 366 Query: 335 GIRHIAINILTNDKVFKAGLRRKMRKAA 362 R + L R+ K A Sbjct: 367 LFRATVADTLRRAGHRSVPACRRAHKTA 394 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 207 bits (526), Expect = 5e-52, Method: Composition-based stats. 
Identities = 84/404 (20%), Positives = 144/404 (35%), Gaps = 61/404 (15%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVI-SGAEGWEDIEDFGETHPDFLKQ 60 +++ L+ + D R A + +++S +L L +CA+ +G + ++ Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 61 YGDFEN-------GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD----------- 102 IP T+ V+ + P + + +R S+ Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGG 149 Query: 103 ---------------------KDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 + IA+DGK LR + R + V+SA + + Sbjct: 150 IEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLA 207 Query: 142 QIKTDEKSNEITAIP---ELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + K+NEI + L+ D+KG ++T DA+ Q+D A + ++G YL +K N Sbjct: 208 SREIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNN 267 Query: 199 QGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q + P KE+ D + HGR E RL V V L + Sbjct: 268 QRGQARQL-HALPWKEIPVIHRD----DARGHGRHEQRLVQVVTVNGLL------FPHAA 316 Query: 259 KLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFAT---AIRNHWHVENKLHWRLDV 315 ++ R + +K Y I+ A R HW VEN +HW DV Sbjct: 317 QVLRIQRRRRLYGAKKW--SSETVYAITDLPAEEASAAEIASWARGHWTVENTVHWCRDV 374 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 NED ++R N + + +R + L R+ Sbjct: 375 TFNEDKSQVRTHNTPSVLAAVRDLIRGALKLAGYVNTAAGRRAH 418 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. 
Identities = 76/227 (33%), Positives = 106/227 (46%), Gaps = 9/227 (3%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPAHDS--YAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIA 271 + +K HGR E R+ V + L W GL++L + R I Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWLASTGQHWAGLQRLVMLERTRQI-- 118 Query: 272 EQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 +++ YYISS + A + A IR HW +EN+LHW LDV ED IR AA Sbjct: 119 --GQKVTTERCYYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 LFSGIRHIAINILTND---KVFKAGLRRKMRKAAMDRNYLASVLTGS 375 + +R I +N+ + K L+ AA D +L + Sbjct: 177 NMASLRKITLNLARLAQNRQPKKVSLKNIRNLAAWDTAMRDDILGLA 223 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 201 bits (512), Expect = 2e-50, Method: Composition-based stats. Identities = 91/237 (38%), Positives = 121/237 (51%), Gaps = 9/237 (3%) Query: 143 IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 + T++KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK NQ +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 A EK EL H +Y HGR + R H V VP EW +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVP-PGFAAKGEWPWIKA 119 Query: 260 LCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + AV + VRYY+ S L+ ++F +R HW +E+ +HW LDV E Sbjct: 120 IGTAVRITT---HADGTQSDEVRYYMLSRFLSGKRFGEVVRGHWGIES-MHWVLDVTFGE 175 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSG 376 D + R+ A S +R AI +L K +R KM + MD ++L VLT G Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHP-EKDSIRGKMIRCLMDTSFLNEVLTLQG 231 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. 
PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 201 bits (510), Expect = 5e-50, Method: Composition-based stats. Identities = 85/224 (37%), Positives = 116/224 (51%), Gaps = 11/224 (4%) Query: 111 KTLRHSYDKSRRRGAIHVISAF---STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGK 167 K + S + S S +LV+GQ K ++KSNEITAIP L+ ML+I+ Sbjct: 3 KGFQRSVKTEEKHKPSQKKSQVLKDSLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESS 62 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPAHDSYA 224 IIT DAMGCQK+I I+K+ GDY+ +K NQ L + +E F +E + H Y Sbjct: 63 IITIDAMGCQKEITSLIRKKKGDYIITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQ 122 Query: 225 MSEKSHGREEIRLHIVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRY 283 E H R E R I V + W LK + + S R + + VR+ Sbjct: 123 EIETGHHRIEKREVIAVSVSSLPCLHNQDLWTELKTVVMVKSERRLWNK----TTTEVRF 178 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRG 327 YISS + ++K ATAIR+HW +EN LHW LDV +ED +IR Sbjct: 179 YISSVEKNSQKIATAIRSHWEIENSLHWTLDVTFSEDKSRIRTR 222 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 193 bits (490), Expect = 1e-47, Method: Composition-based stats. 
Identities = 103/197 (52%), Positives = 133/197 (67%), Gaps = 13/197 (6%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L L H + + D RQA K+ +KL D+L L + AVISGAEGWE+IEDFG +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TDEKSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVKG 197 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVKK 184 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 76/179 (42%), Positives = 108/179 (60%), Gaps = 3/179 (1%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ +PD R+ + H+L ++LL IC VISGAE W + + + D+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +GI HDT RV S + ++F CF+ W+ S + +AIDGK LR S+D R Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHD--GARS 123 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+G IT DAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPARH 182 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 176 bits (445), Expect = 1e-42, Method: Composition-based stats. 
Identities = 64/231 (27%), Positives = 104/231 (45%), Gaps = 5/231 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H D + +A+DGK L S D + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRD--GQV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD-IKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGRE 233 +Q +GGD + K NQG L E F + + G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRGNG 231 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 173 bits (437), Expect = 1e-41, Method: Composition-based stats. Identities = 55/194 (28%), Positives = 85/194 (43%), Gaps = 7/194 (3%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH-DSYAMSEKSHGREEIRLHIV 240 EKI ++ GDY+ +K N + E F + P +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + +YISS D+ + A +R Sbjct: 61 LKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLTA 174 Query: 361 AAMDRNYLASVLTG 374 A + +L G Sbjct: 175 AGWSDEFRDELLLG 188 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. 
Identities = 74/284 (26%), Positives = 119/284 (41%), Gaps = 15/284 (5%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L+ + + D R H L ++L L + A + GA+ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD----KDVIAIDGKTLRHSYD 118 +G P HDT +RV + P + F +M + K V+AIDGK+LR YD Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R ++S + I ++ +EI A +L L +KG +T DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLH 238 +A+ + Y +K N G L +A E F + E+ HGREE R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGFAAVTDLAV----FETRERGHGREEQRRA 234 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVR 282 V V + GLK + + R+ + E VR Sbjct: 235 SVLPVDR--LVKRPSLPGLKAIGRIEAVRT---GANGKPEQAVR 273 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 170 bits (430), Expect = 9e-41, Method: Composition-based stats. Identities = 64/215 (29%), Positives = 99/215 (46%), Gaps = 3/215 (1%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + L +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRD- 119 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 120 -GQVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +A + G DY+ K NQ L + E ++ Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFED 213 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. 
B4 RepID=B4Y365_9RHOO Length = 220 Score = 169 bits (428), Expect = 2e-40, Method: Composition-based stats. Identities = 64/189 (33%), Positives = 95/189 (50%), Gaps = 8/189 (4%) Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCD 242 I + GDYL VKGNQ +L +A E F + + + + D A+ E+ HGR ++ V Sbjct: 1 MIIAKKGDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLS 59 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 I +W + S R + + E ++ YYI+S LTAE+ A ++R Sbjct: 60 AKG--IINPGDWPNCVTIGRIDSMRVVDEK---ESDLERCYYITSRALTAEQLAASVRAR 114 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRK 360 W VEN+ HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + Sbjct: 115 WGVENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKG 174 Query: 361 AAMDRNYLA 369 AA D Sbjct: 175 AARDDGVRE 183 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 169 bits (427), Expect = 2e-40, Method: Composition-based stats. 
Identities = 70/273 (25%), Positives = 108/273 (39%), Gaps = 13/273 (4%) Query: 58 LKQYGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 L + D + + ++ + F S +K + DGK LR S Sbjct: 8 LCAFLDIPETTVVSRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGS 67 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDE-KSNEITAIPELLNMLDIKGKIITTDAMG 175 + ++RG V+ I Q D K +EI + LL+ D+ + IT DA+ Sbjct: 68 IESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALH 126 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 E I K GG +L +K NQ L + P D + +HGR E Sbjct: 127 LCPSTTEMITKAGGVFLIGLKENQPTLLAHMTDC------ALPPIDQKTTFDFNHGRVEQ 180 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R + + DV + D ++ K+L R ++ ++ V YYIS+ E Sbjct: 181 RKYWLYDVSKQGFDPRWDNTAFKRLVKVQRTRI--NQKNAKISREVSYYISNETA-KEGI 237 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 A+RNHW VE H DV +NED K ++ Sbjct: 238 FDAVRNHWSVEVNNH-IRDVTLNEDQLKSKKRQ 269 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. 
Identities = 58/227 (25%), Positives = 102/227 (44%), Gaps = 14/227 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ LM +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDG 110 F P T+ R + I + W + D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIIT 170 K +R + + IH ++AF +V+ Q DEK+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGASKAKGGQ-KIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKIQK-QGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 DA+ Q + A I + + DY+F VK NQ + + E P + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIE-SLPWEAFP 445 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 168 bits (424), Expect = 4e-40, Method: Composition-based stats. Identities = 81/194 (41%), Positives = 118/194 (60%), Gaps = 2/194 (1%) Query: 94 MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFT-P 119 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 P EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 120 HRRAPIDRDTCQIEKQKGRVEARTYHVLSASDLIRDFS-TWSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKELEMTVRYYISS 287 + + + + + + S Sbjct: 179 RARVGVPLLHKVQS 192 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 163 bits (413), Expect = 8e-39, Method: Composition-based stats. 
Identities = 60/264 (22%), Positives = 113/264 (42%), Gaps = 22/264 (8%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + L+ ++ +PD R+ + ++ + +L + +CA++SGA + I ++ P + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-------------KDVIAI 108 +P TI RV+ + A W++ + D + V+A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGK 167 DGK +R + +H++ +V+ Q+ DEK+NEI +L+ + D+ Sbjct: 167 DGKAMRATR---HGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSE 227 +IT DAM Q A+ + +G L VK NQ ++ + P K++ + + Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKT-LPWKDVPVG----HTTTG 278 Query: 228 KSHGREEIRLHIVCDVPDELIDFT 251 + HGR E R VP L Sbjct: 279 RGHGRIETRTLKAVTVPAGLGFPH 302 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 163 bits (411), Expect = 1e-38, Method: Composition-based stats. Identities = 62/189 (32%), Positives = 87/189 (46%), Gaps = 10/189 (5%) Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPAHDS---YAMSEKSHGREEIRLHIVCDVPDELI 248 + AVK NQ L E + S + +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFE 60 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W GL+ + + S R I + RYY+SS A + A A+R HW +E+ Sbjct: 61 PDL--WPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAHWGIES- 113 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA +Y Sbjct: 114 MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAGASDDYR 173 Query: 369 ASVLTGSGL 377 A +L L Sbjct: 174 AQLLGLKTL 182 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. 
Identities = 56/165 (33%), Positives = 86/165 (52%), Gaps = 3/165 (1%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI 106 + + L+ + NG P DT RV+ I P + C + ++ S + I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKG 166 AIDGK L+ S K+ H++SA+ L + Q EK NE+ AIPE+L+ LD+ G Sbjct: 61 AIDGKRLKGSKKKTGS---THILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP 211 +I+ DAMG Q +IAE+I + DY+ ++KGNQ L + + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCFT 162 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 158 bits (400), Expect = 3e-37, Method: Composition-based stats. Identities = 58/219 (26%), Positives = 98/219 (44%), Gaps = 19/219 (8%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETH-PDFLKQYGDFENGI- 68 + + D R+A + H +LL+ + V++G +E I + + L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 69 -----PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 P TI R++S P + + + IAIDGKT+R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQY---IVAHSSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTDE-KSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD 221 +I+++GGDY+F VK N+ L +P D Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDD 436 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. 
Identities = 53/187 (28%), Positives = 93/187 (49%), Gaps = 4/187 (2%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + L+ + +PD R+A + L +L+ T+ A++SGA + I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 61 YGDFE-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHS 116 + + PV +T+ V+ + + F + + K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D+KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIAEK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 156 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 58/199 (29%), Positives = 96/199 (48%), Gaps = 13/199 (6%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 K + I + G DY+ AVKGNQ RL++ + L +E+ R Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQIK----LTTEQRLPVSLDITTERRSDRITT 56 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 R V D L +++W+GL++L F + + ++ YYISS + A +F Sbjct: 57 RSVSVFDD---LSGISYDWEGLQRLVKVERFGTRAGKPYHQI----VYYISSLTINAAQF 109 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL + + Sbjct: 110 AQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILRYNGYS--SIT 167 Query: 356 RKMRKAAMDRNYLASVLTG 374 +R + + + ++ Sbjct: 168 TGIRLISHNLEQIFQLIRN 186 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 154 bits (388), Expect = 7e-36, Method: Composition-based stats. 
Identities = 50/196 (25%), Positives = 84/196 (42%), Gaps = 9/196 (4%) Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHI 239 ++E+ ++ DY+ A+KGN + + ++ F + +K HGR E R Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFF--LSPVTSTRSVHTTFDKGHGRIE-RRIY 57 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAI 299 D + EWK L + S + ++ +RY+I+S ++FA + Sbjct: 58 TLDTNIGWFEDKKEWKHLAGFGMVDSMVTRKGKEC----REIRYFITS-VTDVKQFAKGV 112 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 +HW +EN LHW LDV+ +D+C + NAAE + IR I N + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKR- 171 Query: 360 KAAMDRNYLASVLTGS 375 D + A +L Sbjct: 172 ACIYDDEFRAQILFSC 187 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 53/224 (23%), Positives = 99/224 (44%), Gaps = 15/224 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG----ETHPDF 57 +++ L +PD R +H L IL + + AV++ A+ + + ++ + Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 58 LKQYGDFE---NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLR 114 ++ + P T+ RV+ + W + +A+DGK L+ Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAW---LLGIAGFEAVAVDGKVLK 335 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 + + + +H++SAF I Q + K+NEI + LL +DI+ K++T DA+ Sbjct: 336 GAVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADAL 393 Query: 175 GCQKDIAEKIQK-QGGDYLF-AVKGNQGRLNKAFEEKFPLKELN 216 Q+ A + + + DYLF AVKGNQ +L + P + Sbjct: 394 HTQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLI-CLPWGDFP 436 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. 
Identities = 53/206 (25%), Positives = 84/206 (40%), Gaps = 13/206 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +L +++ GK IT DA+ QK +AE I + YLF VK NQ L + F ++ Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEHRK 61 Query: 215 LNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 D HGR + R +E ++F + + +S + Sbjct: 62 EP----DYCLQDPPGHGRIDTRSIWTTTELNEYLEFPHVGQAF-----CIHKKSYDPKTN 112 Query: 275 KELEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 K E T Y ++S + R HW +EN H+ LD +ED +IR GN Sbjct: 113 KVCENTF-YGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRK 357 + +R AI +L + V + + Sbjct: 172 NTNRLRGFAIGLLKSKGVKDIAQKVR 197 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 150 bits (378), Expect = 1e-34, Method: Composition-based stats. Identities = 58/377 (15%), Positives = 110/377 (29%), Gaps = 44/377 (11%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + D R+ +++ I + E E ++ + +Q Sbjct: 40 GFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLVPK 95 Query: 65 ENGIPVHDTIARVVSCIS---PAKFHECFINWMRDCHSSD-----DKDVIAIDGKTL--- 113 +P HDT+ + + + H C I ++ V AIDG L Sbjct: 96 NIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELFHT 155 Query: 114 ----------RHSYDKSRRRGAIHVISAFSTMHSLVIG-------QIKTDEKSNEITAIP 156 R DK+ V++ ++ +I Q D+ E T Sbjct: 156 KAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTVAQ 215 Query: 157 ELLNML-DIKGK---IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL 212 L+ + + GK + T DA+ + + G + +K + R+ K F Sbjct: 216 RLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACFA- 274 Query: 213 KELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 N DS G + + + +K + + Sbjct: 275 ----NRLPDSTWEERDGKGNTVYVQAWDEEGLAQWPQVRVPMRIVKIIRHTNKTVIEANK 330 Query: 273 QKKELEMTVRYYI---SSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + ++ R+ SS + A W 
+EN L D C + A Sbjct: 331 EVFVTDVVERWIATTCSSEKADTQTIAQIAAARWDIENIGFRNLKTFNALDHCFVHDSVA 390 Query: 330 AELFSGIRHIAINILTN 346 + G + +A N+ Sbjct: 391 IKAMIGFQVLAFNLKRL 407 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. Identities = 62/326 (19%), Positives = 114/326 (34%), Gaps = 43/326 (13%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 + G P ++T+ +++C+ WM + A DGK L S Sbjct: 13 RWRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVLGGS 71 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K A+H + + + + Q + + A+ LL + G++++ DA Sbjct: 72 --KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFL 128 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKEL--------------------- 215 + + I ++ G+YL VKG+Q ++ P + Sbjct: 129 NAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPR 188 Query: 216 ------------NNPAHDSYAMSEKSHGREEIRLHIVCDV--PDELIDFTFEWKGLKKLC 261 E+S GR EIR V D + + W+ + ++ Sbjct: 189 RKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVTQIG 248 Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 + +E +SS T +F +IRNHW +EN++H D M ED Sbjct: 249 GLRRWCRRRHADLWTVEEVTV--VSSRQRTPAQFLASIRNHWTIENQVHRPRDGSMQEDR 306 Query: 322 CKIRRGNAAELFSGIRHIAINILTND 347 R + + R++ IN++ Sbjct: 307 LHGRA--IGVILAVCRNVVINLIRRH 330 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. 
Identities = 60/194 (30%), Positives = 85/194 (43%), Gaps = 11/194 (5%) Query: 186 KQGGDYLF--AVKGNQGRLNKAFEEKFPLKELNNPAHDS---YAMSEKSHGREEIRLHIV 240 +G + +G L A + F + + +K HGR E R Sbjct: 91 DRGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTA 150 Query: 241 CDVPDEL--IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATA 298 D L + WK + + S R I ++ E RY ISS +E+ A Sbjct: 151 AGDLDWLATLGLKERWKKITSVAGIDSSRVIGSK----TETDRRYVISSLPADSERILHA 206 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R HW +EN LHW LDV ED C IR NAA FS +R A+N+ D GL +K Sbjct: 207 VRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFRADHSRAMGLPKKR 266 Query: 359 RKAAMDRNYLASVL 372 + AA + +YLA++L Sbjct: 267 KAAAWNPDYLANIL 280 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 144 bits (362), Expect = 7e-33, Method: Composition-based stats. Identities = 56/142 (39%), Positives = 78/142 (54%), Gaps = 3/142 (2%) Query: 101 DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 VIAI+GK+LR + + A+H +SA++ + L +GQ+ EKSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH 220 L ++G ++T DA+GCQ +AE+I GGDY+ AVK NQ L A + F Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 221 DS---YAMSEKSHGREEIRLHI 239 + +K HGR E R Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 143 bits (359), Expect = 2e-32, Method: Composition-based stats. 
Identities = 60/245 (24%), Positives = 96/245 (39%), Gaps = 17/245 (6%) Query: 28 DILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFH 87 +L + + A + G+ + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ECFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRL 202 EKSNEI + LL +L ++T DAM Q A+ I YL VK NQ ++ Sbjct: 120 VAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 P E+ A D + HGR + R + + + K++ Sbjct: 180 LARI-TALPWAEVPAAATD----DSRGHGRVKTRTLQIITAARGIG-----FPYAKQIIR 229 Query: 263 AVSFR 267 R Sbjct: 230 ITRER 234 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 142 bits (357), Expect = 3e-32, Method: Composition-based stats. 
Identities = 63/362 (17%), Positives = 108/362 (29%), Gaps = 72/362 (19%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 L+ +L L V++G + + + ++ P L GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FHECFINWMRDCH--SSDDKDVIAIDGKTLRH--SYDKSRRRGAIHVISAFSTMHSLVIG 141 E W+ +A DGKTL+ S+ ++ V+ A + G Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITAG 167 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 + +EI A+ L LD+ ++T + G Sbjct: 168 HQRVVG-GDEIAALEALAGRLDLTDVLVT--------------TAEKGH----------- 201 Query: 202 LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 GR E+R V + + G K++ Sbjct: 202 -----------------------------GRVEVRSLKALTVTTPKLVGFW---GTKQVI 229 Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA-------TAIRNHWHVENKLHWRLD 314 ++ + L AE+ R HW VE +H D Sbjct: 230 ELRRRTRRKKTVTAAPTVSEEVFYLVTSLPAEQAHPRDLAARARARGHWTVEA-IHHVRD 288 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTG 374 V++ED R NA ++ R AI+ L + + +R A + + Sbjct: 289 RVLDEDRHTARTANAPLAWAIARDTAISALRLTGHR--SIAKALRTTARQPERVLQTIAL 346 Query: 375 SG 376 Sbjct: 347 IS 348 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 56/145 (38%), Positives = 75/145 (51%), Gaps = 7/145 (4%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPA---HDSYAMSEKSH 230 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F N D KSH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL 290 GR E R V L D + W+GL+ + + S R++ + + RYYISS Sbjct: 61 GRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTLKEK----TTIEHRYYISSTMA 116 Query: 291 TAEKFATAIRNHWHVENKLHWRLDV 315 TA + R HW +EN LHWRLD+ Sbjct: 117 TAAYLLNSSREHWGIENSLHWRLDI 141 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. 
Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 141 bits (354), Expect = 5e-32, Method: Composition-based stats. Identities = 70/315 (22%), Positives = 118/315 (37%), Gaps = 41/315 (13%) Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH-------SSDDKDVIAIDGKTLR 114 G P T+ R+++ SPA E ++D V++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 HSYDKSRRRGAIH----------------VISAFSTMHSLVIGQIKTDEKSNEITAIPEL 158 D + +GA + S+ +GQ K E TA L Sbjct: 153 SRTDGEKVKGAQQSAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFRRL 212 Query: 159 L----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 L L + +I+T DA C ++ AE + G Y+F +K NQ L+ + Sbjct: 213 LPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHDIARDY----G 268 Query: 215 LNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 + +E+ G +R DV + L +C + R Sbjct: 269 QYDLGTPLARTAERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDRR-----G 323 Query: 275 KELEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA- 330 + + + RY+++S LT ++ +R HW +EN HW +DV++ ED+ + + A Sbjct: 324 EIVAVEQRYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASRAS 383 Query: 331 -ELFSGIRHIAINIL 344 E S +R I N + Sbjct: 384 IETVSWLRLIGYNAV 398 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 141 bits (354), Expect = 5e-32, Method: Composition-based stats. 
Identities = 48/180 (26%), Positives = 84/180 (46%), Gaps = 4/180 (2%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ ++ +PD+R A + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 EN-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK--SR 121 P T RV+ I F NW+ ++D + +DGK+++ + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 122 RRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + I+V+S FS + I Q +++ +EI + LL LD++G + T D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 49/161 (30%), Positives = 75/161 (46%), Gaps = 6/161 (3%) Query: 215 LNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 L +SY EK HGR+E+R V +W +K + V RS+ + Sbjct: 9 LPEDKQESYITEEKGHGRKEVREVYVLPAAFS-EALRQKWCLVKSIVAVVRDRSVKGKG- 66 Query: 275 KELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I G++A + Sbjct: 67 ---SYETSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRIYAGDSALNMA 123 Query: 335 GIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGS 375 R N+ + + RKM +AA +++Y VL S Sbjct: 124 CCRRFVQNLFRKSE-GNLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. 
Identities = 52/214 (24%), Positives = 90/214 (42%), Gaps = 3/214 (1%) Query: 20 WKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGI-PVHDTIARVV 78 H L +L L AV+ G + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 I P + W+ + D + +A+DGK LR S D H ++A++ + Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGDV--PGPHRVAAYAPHAAA 119 Query: 139 VIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 V+GQI+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ +G Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQ 179 Query: 199 QGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGR 232 R + ++ A + +S Sbjct: 180 PTRPGGRHRGRVGVRGRRPRARGGHVPLSRSRRP 213 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 138 bits (347), Expect = 3e-31, Method: Composition-based stats. Identities = 57/201 (28%), Positives = 92/201 (45%), Gaps = 13/201 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIED-----T 56 Query: 215 LNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 N ++++ ++K HG H + + +W GL++ ++ Sbjct: 57 AKNSPLNAWSWTQKGHGH---ESHCRLKIWEATESMKMQWAGLERFISIRRQGFRHHKKF 113 Query: 275 KELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 114 DSTT----YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINILTNDKVFKAGLR 355 +R+IA N L V L+ Sbjct: 170 ILRNIAFN-LRLGTVSNPSLK 189 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 136 bits (341), Expect = 2e-30, Method: Composition-based stats. 
Identities = 42/113 (37%), Positives = 67/113 (59%), Gaps = 4/113 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W LK + + S + + + RY+ISS D E+ A ++R+HW +EN LHW L Sbjct: 15 WSNLKSVGMVESIGQV----DDKTTVETRYFISSLDSNGEQLANSVRSHWAIENSLHWVL 70 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 DV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 71 DVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 57/180 (31%), Positives = 85/180 (47%), Gaps = 5/180 (2%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK-QY 61 + L + IPD+R+A L +LL +I A++SGA + I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 G P + +I + + F VIA+DGKTLR S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARLAE--GAAVIALDGKTLRGSLDRFE 118 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD--EKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A V+SAF+T +V+GQI + K +EI A L+ L + G++ T DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 135 bits (339), Expect = 3e-30, Method: Composition-based stats. 
Identities = 42/111 (37%), Positives = 61/111 (54%), Gaps = 4/111 (3%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHW 311 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW Sbjct: 5 ENWEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHW 60 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 LD+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 61 CLDIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 134 bits (338), Expect = 4e-30, Method: Composition-based stats. Identities = 65/141 (46%), Positives = 91/141 (64%), Gaps = 4/141 (2%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D R IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHD--GARSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHD--SY 223 G IT DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AMSEKSHGREEIRLHIVCDVP 244 + ++K+HGR E R + + Sbjct: 119 SQTDKNHGRIETRRCVATNDV 139 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 133 bits (335), Expect = 8e-30, Method: Composition-based stats. 
Identities = 56/167 (33%), Positives = 83/167 (49%), Gaps = 13/167 (7%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K E SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPAHDSYAMSEKSHGREEIRLHIVCDVP 244 DY+ +K NQG L ++ E+ F H +Y E HG EIR P Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIRNFGFQLDP 120 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT 291 D W LK + + + + + RY+ISS D Sbjct: 121 DS------VWSNLKSVGMVEPIGQV----DDKTTVETRYFISSLDSN 157 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 133 bits (333), Expect = 1e-29, Method: Composition-based stats. Identities = 61/160 (38%), Positives = 90/160 (56%), Gaps = 3/160 (1%) Query: 97 CHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIP 156 + D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT+EKSNE TAIP Sbjct: 1 MAARIPGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIP 60 Query: 157 ELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLK 213 +L +L ++ +T DA+G Q+DIA++I + DYL VK NQ L++ + + K Sbjct: 61 KLFTLLALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAK 120 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFE 253 DS HGR + V L + Sbjct: 121 GFTEDFTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALADK 160 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 131 bits (328), Expect = 6e-29, Method: Composition-based stats. 
Identities = 45/190 (23%), Positives = 74/190 (38%), Gaps = 6/190 (3%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ H+ IPD R + +LL+ + ++S E D+E F H L + Sbjct: 12 DLISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGI 71 Query: 65 E-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK--DVIAIDGKTLRHSYDKSR 121 E P + A +W D + DGKTLR S + + Sbjct: 72 ELKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTS 131 Query: 122 RRGA--IHVISAFSTMHSLVIGQ-IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 GA I ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 132 GGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 Query: 179 DIAEKIQKQG 188 Q +G Sbjct: 192 AFFGSSQSRG 201 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 76/187 (40%), Gaps = 17/187 (9%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK--------FPLKELNNPAHDSYAM 225 M Q D+ +Q++GGDY+ K NQG L E FP D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYI 285 E S G + + L ++ W G++++ R + + + V Y I Sbjct: 61 CEVSKGHGWVERRTMTS-TIWLNEYLTRWPGVQQVFRLTRTRQV----GGKTTVEVVYGI 115 Query: 286 SSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS + R HW +E++ H D + ED C++RRG A + + +R++A+ Sbjct: 116 SSLSSVAAAPDALLRYTRTHWGIESR-HHIRDATLGEDRCRVRRGAAPRVLAVLRNVAVY 174 Query: 343 ILTNDKV 349 +L Sbjct: 175 LLRRLGT 181 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. 
Identities = 49/118 (41%), Positives = 64/118 (54%), Gaps = 4/118 (3%) Query: 261 CVAVSFRSIIAEQKKELEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R+I+A E VRYY++S D T EK A+AIR HW + N LHW+LDV E Sbjct: 1 VRIKSERTIVA--IGEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFRE 58 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSGL 377 D K + NAA FS +A+ IL N+K K + K KA D NYL+ +L + Sbjct: 59 DYSK-KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLLQDNNF 115 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 126 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 62/370 (16%), Positives = 117/370 (31%), Gaps = 45/370 (12%) Query: 10 ISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 + +PD R L++IL + +++GA + E+ + ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSS-------DDKDVIAIDGK-----TLRHSY 117 T + + +R V+A+DGK TL H Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 118 DKSRRRG--------AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE----LLNMLDIK 165 +++ + S I + ++NE L+ Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ---GRLNKAFEEKFPLKELNNPAHDS 222 +++T DA + + G DY+FA+K + +L + + D+ Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARREDVLDN 259 Query: 223 YAMSEKSHGREEIRLHIVCDVPDELIDFTFE---WKGLKKLCVAVSFRSIIAEQKKELEM 279 + + EI++ V E W + S + +E Sbjct: 260 ATTATR-----EIQILAVDPSHGYGAGKGPEESVWSHARTFLRVTS---TVRRSGVVIER 311 Query: 280 TVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCK--IRRGNAAELFS 334 R ++SS +++ +R HW VEN H LD ED+ N Sbjct: 312 DSRLFVSSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDERPWIAADANGMLAVL 371 Query: 335 GIRHIAINIL 344 +R IA +L Sbjct: 372 LLRRIAYTLL 381 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 123 bits 
(307), Expect = 1e-26, Method: Composition-based stats. Identities = 57/161 (35%), Positives = 78/161 (48%), Gaps = 7/161 (4%) Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 H++SA++T H + +G + T+EKSNEITAI LL L K ++T DAMGCQKDIA I Sbjct: 3 PRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARNI 62 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEE---KFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 GGD++ AV+ NQ +L A K E H ++ HGR + R + Sbjct: 63 VAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWGA 122 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVR 282 VP EW +K + AV + VR Sbjct: 123 QVP-PDFAAKGEWPWIKAIGTAVRITT---HPDGTQTDEVR 159 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 122 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 48/167 (28%), Positives = 78/167 (46%), Gaps = 9/167 (5%) Query: 3 LKKLMGHISIIPDYRQAWK--MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL S IPD+R+A K + HKL D+++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H +++ IDGK R Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML 162 + K+ R I +SA S + + +EKSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 121 bits (302), Expect = 6e-26, Method: Composition-based stats. 
Identities = 44/187 (23%), Positives = 82/187 (43%), Gaps = 13/187 (6%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L +S +PD R A + L +L L + A +S + +E F +P L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H I ++ + P K + +D +V+ +DGK LR S + Sbjct: 60 RKAPGHTAITLLLHRLDPEKLQAALGQVFPE---ADLGEVLVVDGKHLRGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q + + + E A ELL+ L +++GK++ DA ++A Sbjct: 115 VKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAA 172 Query: 183 KIQKQGG 189 +++K+GG Sbjct: 173 RVRKKGG 179 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 121 bits (302), Expect = 6e-26, Method: Composition-based stats. Identities = 83/99 (83%), Positives = 89/99 (89%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSGL 377 IAINILTN++VFKA RRKMRKA MD+NYLASVL G+G Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 119 bits (298), Expect = 2e-25, Method: Composition-based stats. 
Identities = 43/202 (21%), Positives = 74/202 (36%), Gaps = 50/202 (24%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPAHDSYAMSEKSH 230 MGCQK+IA+ I KQ DY+ A+KG+ L +A+ K + D + + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADL 290 GR E R V ++ ++W GLK + S Sbjct: 61 GRIETRRCQQVLVNKSWLNNKYQWVGLKSIIKVTSDVHEKTTT----------------- 103 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 + +IR+G F+ +R IA+ + ++ Sbjct: 104 ------------------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQTK 133 Query: 351 KAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 134 RASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 119 bits (298), Expect = 2e-25, Method: Composition-based stats. Identities = 34/136 (25%), Positives = 65/136 (47%), Gaps = 3/136 (2%) Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFA 296 +H+ + + + + GLK + + + + R+ ISS DL + Sbjct: 12 IHLRTLIDKKWLAKAYRRSGLKSIIKV--HTQVHDKSTGKDTAETRWNISSLDLHVVQAL 69 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R+HW VE+ +HW LD+ D+ +I R +F+ +R IA+ + D + R Sbjct: 70 NAVRSHWQVES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIAMTLFKQDTTKLVSMAR 128 Query: 357 KMRKAAMDRNYLASVL 372 K + A +D +Y +++L Sbjct: 129 KKKMAGLDDDYRSNLL 144 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 115 bits (287), Expect = 3e-24, Method: Composition-based stats. 
Identities = 45/184 (24%), Positives = 75/184 (40%), Gaps = 15/184 (8%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREE 234 K E + G D L +KGN +L A + SY + R E Sbjct: 3 STFKKTVETVLATGNDLLVQLKGNHPKLLAAVRTLCQSRAHAE---QSYTVDLGRRNRIE 59 Query: 235 IRLHIVCDVPD------ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA 288 R + +P F +G +++ V + + + + YY+++ Sbjct: 60 QRTVRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPR----QESPAYYLATC 115 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 +A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 116 TASAATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRN--PGVFALLRHFALNLLRHNG 173 Query: 349 VFKA 352 Sbjct: 174 QANI 177 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 46/187 (24%), Positives = 83/187 (44%), Gaps = 13/187 (6%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L +S IPD R A ++ L +L L + A +S + +E F +P L G Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H + ++ + P K E + +D +V+ +DGK L+ S + Sbjct: 60 RKPPGHTILTLLLHRLDPEKLQEALLQVFP---GADLGEVLVVDGKHLKGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q K + + E A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGR--EDQALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. 
Identities = 43/96 (44%), Positives = 62/96 (64%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTG 374 +A+N + +K A + RK + A M L ++ Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVLDLIVNA 96 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 54/365 (14%), Positives = 93/365 (25%), Gaps = 58/365 (15%) Query: 8 GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 IPD R L D+L+ + A F D ++ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLA---FQRRTLDHNLRHVFGLTG 81 Query: 68 IPVHDTIARVVSCISPAKFHECFIN--------WMRDCHSSDDKDVIAIDG--------- 110 P + V+ + P F + + D + D V+A+DG Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 111 -----KTLRHSYDKSRRRGAIHVISAFSTMHSLVIG------QIKTDEKSN--EITAIPE 157 T RH+ + + S V+ Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 L ++ DA +QK +L VK F Sbjct: 202 WLGRFREEHPDLAVLVVEDARSSNAPHVRDLQKARCHFLLGVKA------ADHAHLFAHV 255 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 H ++ + E + R +R + L + + + + Sbjct: 256 CARQDQH-AFEVVEDADPRTGLRRSYLWIADLPLNESNDD-------VRVNFVHLVELDP 307 Query: 274 KKELEMTVRYY-ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR-GNAAE 331 ++ + A A R W +EN+ L N+ G+ Sbjct: 308 DGTPREWTWVADMAVTGANVRQLARAGRARWRIENETFNTLK---NQGYHFAHNFGHGDN 364 Query: 332 LFSGI 336 S + Sbjct: 365 NLSVV 369 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. 
Identities = 49/208 (23%), Positives = 86/208 (41%), Gaps = 18/208 (8%) Query: 98 HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++ + IA+DGK L+ S + R H++SA + + + +++ K+NE T Sbjct: 126 ATAGPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKP 183 Query: 158 LLNMLDIKGKIITTDAMG-CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 LL LD+ ++T DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 184 LLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIP 242 Query: 217 NPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 +A SE HGR E C +PDEL + L A+ K Sbjct: 243 V----QHAASEVGHGRRESSSIKTCAIPDELGGIAYPHARL-----AIRVHRRCQPTGKR 293 Query: 277 LEMTVRYYISSADLTAEKFATAIRNHWH 304 Y ++S D A R W Sbjct: 294 ESRESVYAVTSLDAH-----QATRPIWP 316 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 111 bits (278), Expect = 4e-23, Method: Composition-based stats. Identities = 47/210 (22%), Positives = 91/210 (43%), Gaps = 14/210 (6%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK--- 59 + + +++ IPD R+ K +H+ D+LL+ + AV SG + + + FL Sbjct: 6 IPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDEV 65 Query: 60 --QYGDFENGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKD-----VIAIDG 110 + E +P T+ R+ + + ++W R+ + K+ +A+DG Sbjct: 66 HIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVDG 125 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKII 169 K LR + R A+ +SA L +G Q D ++ + + L + ++ Sbjct: 126 KHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGV-DWVL 184 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 T DA C +++A + +Q G A KG + Sbjct: 185 TGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 111 bits (276), Expect = 5e-23, Method: Composition-based stats. 
Identities = 38/153 (24%), Positives = 69/153 (45%), Gaps = 13/153 (8%) Query: 224 AMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRY 283 SEK HGR E R + +WKGLK+ R++ ++ + V Y Sbjct: 2 TTSEKGHGRIEKRTLETTPIVT----VGQKWKGLKQGLRITRERAVKGKK----TVEVVY 53 Query: 284 YISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 I+S A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ Sbjct: 54 GITSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVV 113 Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 +++L V + + +++ Sbjct: 114 VHLL--ASVEAKSRPEAIELLQLHPENARNLIG 144 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 111 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 9/152 (5%) Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTV 281 + S +S GREE R V + + EW+ ++ + + + + Sbjct: 3 EHTHSIQSRGREEHRCIQVYE---PVGIALQEWEAIRSVLCVQRWGTRQGKAYHN----T 55 Query: 282 RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAI 341 YYISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I I Sbjct: 56 AYYISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVI 115 Query: 342 NILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 NIL + L+ M K A + + S+LT Sbjct: 116 NILRLNGYQ--SLKTAMTKLANRVDIIFSLLT 145 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 110 bits (275), Expect = 8e-23, Method: Composition-based stats. 
Identities = 43/104 (41%), Positives = 62/104 (59%) Query: 272 EQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E +IR+G+A Sbjct: 12 KQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQSRIRKGHADI 71 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGS 375 FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 72 NFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLGK 115 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 44/173 (25%), Positives = 66/173 (38%), Gaps = 10/173 (5%) Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSH-GREEIRLHIVCDV 243 G L +K NQ L+ A E +P D + E R E R V + Sbjct: 2 IATGNHLLVQLKRNQPLLHDAMVEY----TRGHPFVDEHHTHEIGRRNRIEKRAVHVWHL 57 Query: 244 PDELIDFTFEWKGLKKLCVAVSF--RSIIAEQKKELEMTVRYYISSADLTAEKFATAIRN 301 L + + L R + + YY+ L A +F+ AIRN Sbjct: 58 HPSLGSAPWY-DHFRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRN 116 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 HW VEN+ H+ D ED +IRR F+ +R A+N++ ++V Sbjct: 117 HWRVENRAHYVRDTRFQEDASRIRRN--PCTFALLRSFALNLMRFNRVENISQ 167 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 108 bits (269), Expect = 4e-22, Method: Composition-based stats. 
Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 4/121 (3%) Query: 256 GLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDV 315 G+K + + S E + RYY++S + +RNHW +EN+LHW LDV Sbjct: 21 GIKSIIATETISSKTNET--AISAEWRYYVTSHETEKSDLHLYVRNHWSIENELHWHLDV 78 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAGLRRKMRKAAMDRNYLASVLT 373 +N+D K R A FS I+ + ++++ K +R ++++ D YL S+L+ Sbjct: 79 HLNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKRSVRSRLKQVGWDTEYLVSLLS 138 Query: 374 G 374 Sbjct: 139 A 139 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 59/150 (39%), Gaps = 9/150 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHPD 56 +++ L + + +PD +A H+L +L L A + G +G++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + +P I + + P W + ++ +A+DGK ++ Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAWQAA--QLNSEEALAMDGKIMKGG 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTD 146 D + + H++S + Q K+ Sbjct: 125 VDHTGAQ--THIVSLIGHESKHCVAQKKSA 152 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 105 bits (261), Expect = 3e-21, Method: Composition-based stats. 
Identities = 62/386 (16%), Positives = 123/386 (31%), Gaps = 57/386 (14%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHK----LSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 EL L+G + IPD R K HK L LL+ + S E ++ Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD--------VIAID 109 L++ +P DT+ R++ I A + ++ +R IAID Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 110 G------------KTLRHSYDKSRRRGAIHVI----SAFSTMHSLV-----------IGQ 142 G + L+ K R + + ++ + LV +G Sbjct: 193 GSQKLAGDTLWAEELLQRHVGKDETRHTQYFVYVLEASLVFHNGLVIPLLSEFLEHALGD 252 Query: 143 IKTDEKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + ++ E+ L + L ++ D + + ++ + ++ +K Sbjct: 253 SEAQKQDCELRGFARLSDRLKRLFPRLPILLLLDGLYANGPVMQRCLRAHWQFMIVLKD- 311 Query: 199 QGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 L +EE L+ P ++ GR + V D+ L Sbjct: 312 -KDLPTVWEEFRALQPRQLP------TLQQDWGRRQQHFSWVNDIEYAYGSNGRCRLKLH 364 Query: 259 KLCVAVSFRSIIAEQKKELEMTVRYYISSADLT----AEKFATAIRNHWHVENKLHWRLD 314 + ++ + E + E ++SS L+ E+ R+ W +E Sbjct: 365 VVVCEERWQGVDQEARIVTETARHAWLSSQPLSRENVHERCNLGARHRWGIEAGFLVEKH 424 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIA 340 + + NA + + +A Sbjct: 425 QGYHYEHAFALDWNAMRGYHLLMRLA 450 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. 
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 11/124 (8%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISS 287 K HGR E R L ++ W G++++ R + + V Y ISS Sbjct: 3 KGHGRVERRSITTTT---WLNEYLTRWPGVQQVFRLERQRR----ADGKTTVEVVYGISS 55 Query: 288 AD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 + R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ +L Sbjct: 56 LSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYLL 114 Query: 345 TNDK 348 Sbjct: 115 RRLG 118 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 38/99 (38%), Positives = 60/99 (60%), Gaps = 1/99 (1%) Query: 3 LKKLM-GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L S IPD R +H +I+ L + +V++GA+ + +IEDF E H D+LK Y Sbjct: 1 MEGLFVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTY 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS 100 + NGIP HDT +RV S I+PA F + F+ W++ + + Sbjct: 61 FNLPNGIPSHDTFSRVFSAINPASFQDSFLIWLKAINDA 99 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 73/88 (82%), Positives = 76/88 (86%) Query: 271 AEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 EQKKE EMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAA Sbjct: 18 TEQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAA 77 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKM 358 ELFSGIR IAINILT DK+ KAG R KM Sbjct: 78 ELFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. 
Identities = 28/148 (18%), Positives = 60/148 (40%), Gaps = 9/148 (6%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHPD 56 +++ L + + D R+ H++S +L + A + G +G++ I + + Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + + IP I V+ P + + D + +A DGKT++++ Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNED--QGLEDTCLAFDGKTMKNA 331 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIK 144 D++ R+ H+ S Q K Sbjct: 332 IDENARQ--THIASVVGHESKTTHTQKK 357 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 101 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 47/228 (20%), Positives = 80/228 (35%), Gaps = 37/228 (16%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L I+ + D R ++ +S I + + + + +E + K+ Sbjct: 18 HLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKALPK 73 Query: 65 ENGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSDDKDVIAIDGKTLRHS 116 + +P DTI RV+S +E N + + D V+AIDG L S Sbjct: 74 KTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELFES 133 Query: 117 YDKSRRRG--------------AIHVISAFSTMHSLVIGQIKTDEKSN-------EITAI 155 K V S + L++GQ + K + EITA Sbjct: 134 TKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEITAG 193 Query: 156 PELLNMLDIKGK----IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 L+ L + II DA+ C+ +++ G D + VK + Sbjct: 194 KRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKDER 241 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. 
Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 7/132 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYIS 286 ++ HGR R + +P+EL + G+K R + + + YYI+ Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHNHALS--GIKSCIAVE--RIVQEGKGEPKTSHFSYYIT 89 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 + + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 90 NHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLVKA 148 Query: 347 DK--VFKAGLRR 356 K ++ Sbjct: 149 KDWAGKKKSVKS 160 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. Identities = 53/128 (41%), Positives = 69/128 (53%), Gaps = 1/128 (0%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREE 234 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + Y E+S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK 294 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 71 SRAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEE 129 Query: 295 FATAIRNH 302 TA R H Sbjct: 130 LLTASRLH 137 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. 
Identities = 29/120 (24%), Positives = 51/120 (42%), Gaps = 5/120 (4%) Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTV-RYYISSADL 290 R E + V L+ ++ L+++ + K E + +SS Sbjct: 1 RIETQTIRVSS----LLKGYSDFPHLEQVFRIDRVTRFKKKGKTRKETALGVTSLSSGQA 56 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 + + +R HW +EN+LHW D V ED C R GN A + + +R++ I++L Sbjct: 57 SPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGAHVMATLRNMTISLLRVAGSK 116 >UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillaceae RepID=C5D2E6_GEOSW Length = 437 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 64/416 (15%), Positives = 126/416 (30%), Gaps = 83/416 (19%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDI-EDFGETH-PDFLKQ 60 K L+ + + D R + + IL + + G + + E F + + ++ Sbjct: 28 FKDLVDQLKKVKDKRHQSYITYGPETILYTILLKSVFGIKSMRSMTELFNKDECIENIRV 87 Query: 61 YGDFE--NGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSDDKDV-IAID 109 + N +P +DTI ++ + P + F + +K I D Sbjct: 88 VLGLKELNELPHYDTINDFLAKLEPKELETIRIYLIKKLFEKRCLESFRILNKYWPIVFD 147 Query: 110 GKTL-------------RHSYDKSRRRGAI----HVISA--FSTMHSLVIGQIKTDEKS- 149 G + R DK + HV+ A L I + +S Sbjct: 148 GTGIHTFKEKHCEHCLRREYKDKETGETKVVYMHHVLEAKLVVGDMVLSIATEFIENESE 207 Query: 150 ------NEITAIPELLNMLD-----IKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 E+ A L++ L + +I D++ + + E K Y+F K + Sbjct: 208 NVPKQDCELKAFMRLVDKLKKTFKRLPICLI-ADSLYACEPVFEICDKHNWKYIFRFKED 266 Query: 199 QGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 + + E N + + + V D+ + Sbjct: 267 RIKTVSQEFRAIQSLETNGKSSEYF---------------WVNDIAYND----------R 301 Query: 259 KLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMN 318 + + + E+K+E + I+ + AE A R W +EN+ Sbjct: 302 LVNLVEKVKVTENEKKQEFLFITNFRIT--ERNAEILVQAGRRRWKIENEGFNNQKNGWY 359 Query: 319 EDDC-KIRRGNAAELFSGIRHIA----------INILTNDKVFKAGLRRKMRKAAM 363 E + NA + + IA +L K + K+ +A Sbjct: 360 EIEHVNCHNYNALKNHYLLVQIADILVQLYKYGSKLLKQLKKSAKEISSKLLEAIR 415 >UniRef50_Q2JDM9 Transposase, IS4 n=1 
Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 66/212 (31%), Gaps = 34/212 (16%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICA-VISGAEGWEDIEDFGETHPDFLKQYG 62 + + + + D R + + + +C+ +G + + + Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 63 DFEN------GIPVHDTIARVVSCISPAKFHECFINWM---------------------- 94 +P TI + + + ++ Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAA 141 Query: 95 ---RDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 + +A+DGKT RH+ K +H++ S ++ Q++ + K+NE Sbjct: 142 PVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAKTNE 199 Query: 152 ITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 LL LD+ ++T DA+ + + Sbjct: 200 TAVFRRLLRPLDLTNVLVTADALHTVRANLDT 231 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 99.9 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 53/129 (41%), Gaps = 6/129 (4%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHPD 56 +++ L + + +PD R+A H+L + LT A + G +G++ + ++ Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + +P I + + P W +S D + +A+DGK ++ Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGG 177 Query: 117 YDKSRRRGA 125 D + + Sbjct: 178 VDHTGAQTQ 186 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. 
Identities = 29/129 (22%), Positives = 54/129 (41%), Gaps = 6/129 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L ++ +PD R + L IL + +CAV++GA + I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRD------CHSSDDKDVIAIDGKTLRHSY 117 F + +P T+ R++ I + W+R VIA+DGK +R + Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKSRRRGAI 126 ++ A+ Sbjct: 149 LRAAGPSAL 157 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 67/164 (40%), Gaps = 10/164 (6%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ L + N+ D+ K R+E R V V D L ++ Sbjct: 1 MKANQSNLFETA----CAIAANDAPADTAFSRNKGRSRQEDRTVEVFPVGDALAGTEWQ- 55 Query: 255 KGLKKLCVAVSFRSIIAEQKK--ELEMTVRYYISSA-DLTAEKFATAIRNHWHVENKLHW 311 +K + + + + V +Y+SSA + A +A AIR HW +EN+ H+ Sbjct: 56 PFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNHY 115 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 DV +ED +IR + + R A+NI+ + + Sbjct: 116 VRDVSCDEDKSRIRDN--PGIMARARSFALNIMRKNGIANVAQA 157 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 98.7 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 3/107 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTSD--VHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. 
ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 96.4 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 40/186 (21%), Positives = 61/186 (32%), Gaps = 18/186 (9%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ + P + + S HGR E R C + DEL F Sbjct: 2 IKRNQPTTYRQL-AALPWPDSAV----QHTASSAGHGRRESRSIKTCGIADELGGIAFPH 56 Query: 255 KGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT---AEKFATAIRNHWHVENKLHW 311 L A+ + Y ++S D + A A+R HW VE H Sbjct: 57 GRL-----ALRVHRRRKQTGGCESRETVYAVTSLDAHETTPAELAAAVRGHWTVEALRH- 110 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD-RNYLAS 370 DV E+ + G A + R++A+ +L K +A D Sbjct: 111 VRDVTYAEEASTLHTGTAPRAMATFRNLAVGLLKTLGAINI---AKTTRAIRDQPERALP 167 Query: 371 VLTGSG 376 +L + Sbjct: 168 LLGITN 173 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 94.1 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 5/105 (4%) Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA---DLTAEKFATAIRNHWHVEN 307 + +K++ R + + + Y I+S + + R HW +EN Sbjct: 20 SACRSWVKQVFCI--HRIFTKVKTGKKTEEIVYGITSLTQQKASPKTILKFSRGHWSIEN 77 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA 352 LH+ D ED +IR NA + ++++ + + V Sbjct: 78 GLHYVRDTAFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPNI 122 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 92.6 bits (228), Expect = 2e-17, Method: Composition-based stats. 
Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 4/120 (3%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H I D R +H L +I+LL I AV+SG+EGWE IE+FG D+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKDVIAIDG--KTLRHSYDKSR 121 GIP HDTIARV+ + + + + D + + G + H + Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREG 126 Score = 57.5 bits (137), Expect = 8e-07, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 3/79 (3%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPAHDSYAMSEKSHGREE 234 K+IA+ I KQ DY+ A+KG+ L +A+ K + D + + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 IRLHIVCDVPDELIDFTFE 253 R V ++ + Sbjct: 147 TRRCQQVLVNKSWLNNKYR 165 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 92.2 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 38/133 (28%), Positives = 50/133 (37%), Gaps = 8/133 (6%) Query: 224 AMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK---KELEMT 280 HGR+E R V DV L W GL V+ + + K Sbjct: 6 TTDRGRHGRQEHRWVEVFDVSGRLGP---TWDGLIAAVARVTRLTWHKDTKSGLWHKTQE 62 Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 Y +L A TAIR HW VE + H+ DV ED +IR F+ +R A Sbjct: 63 TALYACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIRTK--PGHFARLRSFA 120 Query: 341 INILTNDKVFKAG 353 +NIL + Sbjct: 121 LNILRANGTNNIS 133 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 91.8 bits (226), Expect = 4e-17, Method: Composition-based stats. 
Identities = 29/121 (23%), Positives = 49/121 (40%), Gaps = 6/121 (4%) Query: 4 KKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L+ S I D R+ + L+ +LL T+ A+++GA + ++ F TH D L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 FE-NGIPVHDTIARVVSCISPAKFHECFINWMRDCH-----SSDDKDVIAIDGKTLRHSY 117 P + T+ ++ I + F + + IAIDGKT Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWCCHV 122 Query: 118 D 118 + Sbjct: 123 N 123 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 91.4 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 34/84 (40%), Positives = 50/84 (59%), Gaps = 1/84 (1%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE- 182 A+H++SAF + +V+ Q+ EKSNEI A ELL LDI G +T DAM Q++ A Sbjct: 7 KAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARF 66 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAF 206 ++ + D++ VK NQ L +A Sbjct: 67 AVEDKRADFVMTVKDNQPELREAL 90 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 90.6 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 41/117 (35%), Positives = 60/117 (51%), Gaps = 2/117 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H + D R +H L DI+LL I AV+SG+EGWEDIE+FG D+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 GIP HDTIARV+ + + + + D + + G+ + K Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELKQDVVSKC 123 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. 
Identities = 26/77 (33%), Positives = 44/77 (57%) Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K Sbjct: 15 VRLPRPTRHQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKR 74 Query: 359 RKAAMDRNYLASVLTGS 375 A M+ ++ +L + Sbjct: 75 LLACMEDDFREELLGLA 91 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 89.9 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 25/107 (23%), Positives = 45/107 (42%), Gaps = 5/107 (4%) Query: 273 QKKELEMTVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + ++S + A + HW +EN+LHW DV +ED + R GNA Sbjct: 68 PGGPATAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNA 127 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLTGSG 376 ++ + +R++AI IL + + +R A + +G Sbjct: 128 PQVMTSLRNLAITILRLTGAKN--IAKALRHHARHPERPLETIKKAG 172 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 89.5 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 38/141 (26%), Positives = 62/141 (43%), Gaps = 5/141 (3%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAH 220 M +KG ++T DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELE 278 + E SHGR R V + E + W ++ L V R A + + Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLTPE-TKHSGSWPDIQALLVTEKIRQ--AHYSETVT 117 Query: 279 MTVRYYISSADLTAEKFATAI 299 RYY+S Sbjct: 118 SDFRYYLSRCQEARPDIGHTT 138 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. 
Identities = 30/118 (25%), Positives = 47/118 (39%), Gaps = 9/118 (7%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L H++ +PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH 115 + P T+ RV+ I NW+ +A+DGKTL Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLG--LSPAALAVDGKTLAG 130 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 88.7 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 42/85 (49%) Query: 7 MGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFEN 66 + H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSCISPAKFHECFI 91 GIP HDT RV+ + P + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 88.7 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 47/106 (44%), Gaps = 1/106 (0%) Query: 261 CVAVSFR-SIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R E + + + +Y+SS + +A + IR HW VEN++H+ DV E Sbjct: 12 GRTRSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGE 71 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 D +IR +++S R A+N+ + + ++ + Sbjct: 72 DRSRIRTLPLVQVWSVARSFALNLYRSLLMANRAQAQRRCMFGLST 117 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. 
Identities = 32/153 (20%), Positives = 58/153 (37%), Gaps = 12/153 (7%) Query: 197 GNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKG 256 G+Q L + ++ K + E HGR+ + + W G Sbjct: 8 GDQKTLYRQIADQLLGKRHIPLMATDH---EIGHGRD---ILWTLRAKEAPQHIKANWHG 61 Query: 257 LKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 + ++ + + K +I+S T + +R W VE+ HW D Sbjct: 62 TSWIAEVIATGTRDRKPFKATHR----FITSLRTTPDALLRLVRERWSVESW-HWIRDTQ 116 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++EDD + R GN A + + +R A+N+L Sbjct: 117 LHEDDHRYR-GNGAGVMAALRTAAMNLLRLTGF 148 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 86.4 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 35/131 (26%), Positives = 55/131 (41%), Gaps = 2/131 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF- 64 L ++ IPD+R+A + L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 3 LKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQLH 62 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-IAIDGKTLRHSYDKSRRR 123 PVH +I + + F + IA+DGKTLR + + R Sbjct: 63 WKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSRT 122 Query: 124 GAIHVISAFST 134 SA Sbjct: 123 ARPLRYSAHWP 133 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 86.0 bits (211), Expect = 2e-15, Method: Composition-based stats. 
Identities = 23/109 (21%), Positives = 49/109 (44%), Gaps = 5/109 (4%) Query: 268 SIIAEQKKELEMTVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + + V + I+S A +R HW +EN+LH+ DV + ED C++ Sbjct: 8 TRERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRV 67 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 R G+A ++ + +R+ +++ K + + MD ++ Sbjct: 68 RMGHAPQVLAALRNAVVHLWREVKAVSCPEAIERLQ--MDPAMAKGLIG 114 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 41/86 (47%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + ++ + + + D R +H+ DI+++ +C V+ G +G I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSCISPAKF 86 + + NG+P D I + + P F Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 84.5 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 49/77 (63%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + ++ H S + D RQ+W++ + L +I LL +CA +SG E + +I +G+ +FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV 77 + +E G+P HDT+ + Sbjct: 77 FLPYERGLPAHDTLKGL 93 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 84.5 bits (207), Expect = 6e-15, Method: Composition-based stats. 
Identities = 34/75 (45%), Positives = 52/75 (69%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++++ + + D R A + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK19_ACIF5 Length = 104 Score = 84.1 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 3/99 (3%) Query: 273 QKKELEMTVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + L + ++S E R HW +EN+ H D +ED +IR N Sbjct: 2 KDGTLREDCAFGLTSLTKDRTTPENLLGIARGHWEIENRNHHVRDTTYHEDLSQIRTENG 61 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 + + +R +A++IL V + A+ + Sbjct: 62 PHMMATLRGLAMSILRLIGVKNIAQAGRDFAASARKTLR 100 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 84.1 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 7/121 (5%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+G + R ++ + EL Y ++S A++ R HW VEN+LH + Sbjct: 4 WRGSRMALRM--RRRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKR 61 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLT 373 D V+ ED + R+G A +R + +N+L + + R +RK + D L ++ Sbjct: 62 DTVLGEDASRSRKGAAG--LMYLRDVILNLLHL---KRWPVLRSVRKFSADPKVLLRLIR 116 Query: 374 G 374 G Sbjct: 117 G 117 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 84.1 bits (206), Expect = 9e-15, Method: Composition-based stats. 
Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 82.9 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 56/143 (39%), Gaps = 9/143 (6%) Query: 213 KELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 +E P + G L + + G ++ R ++ + Sbjct: 2 EERRLPGETEAVWNLVRDGEVWTYRVWASP---YLPEEMRAFPGCGQVVRME--REVVRK 56 Query: 273 QKKELEMTVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 E+ TV Y ++S A + + + W VEN+ W D +++ED C++R G Sbjct: 57 GTGEVRRTVSYALTSLGPEVADARRLGELLLSRWEVENRSFWVRDFLLHEDACQVR-GVG 115 Query: 330 AELFSGIRHIAINILTNDKVFKA 352 A++ + +R +++L V + Sbjct: 116 AQVLAALRAFLVSLLHRQGVREK 138 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 81.4 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 36/128 (28%), Positives = 51/128 (39%), Gaps = 13/128 (10%) Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDC----HSSDDKDVIAIDGKTLRHSYDKSR 121 PV+ ++ ++ I P F + IAIDGKTLR S+D Sbjct: 9 RRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFS 68 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL---------NMLDIKGKIITTD 172 A +V+SAF+ H +++ DEKSNEI A L+ I + D Sbjct: 69 DTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVMLD 128 Query: 173 AMGCQKDI 180 AM I Sbjct: 129 AMTFAPAI 136 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 81.0 bits (198), Expect = 7e-14, Method: Composition-based stats. 
Identities = 29/129 (22%), Positives = 46/129 (35%), Gaps = 13/129 (10%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L ++ + D R+ H +LL+ AV++GA + I ++ P + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH 115 P TI RV+ P + H D +AIDGK+ R Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SYDKSRRRG 124 S S R Sbjct: 115 SRLGSTRPP 123 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 81.0 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 32/141 (22%), Positives = 51/141 (36%), Gaps = 11/141 (7%) Query: 42 EGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS-- 99 + + P G + P I R++ I P W+ Sbjct: 221 RATSALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAP 280 Query: 100 -SDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE- 157 + IA+DGKTLR S + A HV++A +V+ D K+NEIT Sbjct: 281 APGSRRAIAVDGKTLRGSRTRDSA--ARHVLAAADQHTGIVLASTDVDTKTNEITRFTAS 338 Query: 158 -----LLNMLDIKGKIITTDA 173 LL+ I+ +++ A Sbjct: 339 GSHADLLSSRCIRSGVVSPAA 359 >UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7B Length = 481 Score = 78.7 bits (192), Expect = 3e-13, Method: Composition-based stats. 
Identities = 49/285 (17%), Positives = 93/285 (32%), Gaps = 55/285 (19%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI 106 IE G L+++ ++G H+ + I P E F+ + D ++ +V+ Sbjct: 81 IEHQGSGRQAHLRRHRQPDDG--CHEAFYGKLRRI-PRGLSEAFLRDVTDRFTALFPEVV 137 Query: 107 A--------------IDGKTLR----HSYDKSRRRGAIH---VISAFSTMHSLVIG-QIK 144 A +DGK+L+ D G + ++ A+ LV+ Sbjct: 138 AHRLPTSFDRLEVLILDGKSLKKVAKRLVDTRGTPGKLLGGKLLVAYRPRDGLVLDMAAD 197 Query: 145 TDEKSNEITAIPELLNMLDIKG---KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 D ++NE IP+L+ + +G K++ D + C + K G ++ Sbjct: 198 LDGETNEAKLIPDLMPRVHARGGPAKLVVGDRLFCASKHFAEFTKDNGHFV--------- 248 Query: 202 LNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L +P + ++ S V +E L++ Sbjct: 249 ----VRYARTLSFEPDPKRPAVTTADPSQR----------AVVEEWGWAGKPKDKLRRYV 294 Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVE 306 R +A E + + SA A R W +E Sbjct: 295 R----RITVARPVGEAITILTDLLDSAPYPATDLLDLYRIRWTIE 335 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 77.9 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 48/198 (24%), Positives = 74/198 (37%), Gaps = 37/198 (18%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFG--ETHPDFLK 59 +LKKL+ S IPD R+A ++H+L+ +LL + + + + L+ Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSRPAFLQALQ 138 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD--------DKDVIAIDGK 111 +P DT+ARV+ I P K E FI +R IAIDG Sbjct: 139 GLFPELETLPHGDTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIAIDGT 198 Query: 112 -------------TLRH--SYDKSRRRGAIHVISA-FSTMHSLV-------IGQIKTDEK 148 RH + D + + I+V+ A F + L + + D K Sbjct: 199 QKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTIPIMSEFLSYSEDDSK 258 Query: 149 S----NEITAIPELLNML 162 EI A L + L Sbjct: 259 EVKQDCEIKAFKRLSHRL 276 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 77.9 bits (190), Expect = 5e-13, Method: Composition-based stats. 
Identities = 29/96 (30%), Positives = 38/96 (39%), Gaps = 4/96 (4%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPAHDSYAMSEK 228 D +GCQK IA+ I +Q DYL AVK NQ L++A F D K Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKINK 67 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 GR E R V + W L+ + + Sbjct: 68 GPGRLEQRRCWV-GYEIPDTINSQNWAKLETIVMVE 102 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 4/108 (3%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVVSCISPAKFH 87 +L L + AV++G E I FG P L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTM 135 W+ D H D D IA+DGK L S D H+++A++ Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGSRD--GAVPGTHLLAAYAPQ 107 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 76.4 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 1/77 (1%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + + D R A+ +H DI+ L + AVISGA W +I+ FGE H D+L++Y F Sbjct: 2 SVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPF 60 Query: 65 ENGIPVHDTIARVVSCI 81 E GIPV DTIARV+ I Sbjct: 61 ECGIPVDDTIARVIKRI 77 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. 
Identities = 15/82 (18%), Positives = 28/82 (34%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +PD R + H+ S IL + A +GA + I ++ P +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSCISPAKFHECFIN 92 T R ++ + + Sbjct: 109 ESTSRRFLAGVDATALDQVLGM 130 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 75.6 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 28/112 (25%), Positives = 46/112 (41%), Gaps = 6/112 (5%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L ++S IPD+R+A + L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 3 LKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQLH 62 Query: 66 N-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-----DKDVIAIDGK 111 P H +I + + F D VI + K Sbjct: 63 RKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 28/109 (25%), Positives = 42/109 (38%), Gaps = 11/109 (10%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYIS 286 + HGR E R L+ W GLK R++ K + V + I+ Sbjct: 2 DPGHGRIETRTVRATP----LLTCHDRWTGLKHGFRITRTRTV----KGVTTVEVVHGIT 53 Query: 287 SAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 S A +R+HW +EN+ H DV + ED+ + R A Sbjct: 54 SRPVERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAGRA 102 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 74.5 bits (181), Expect = 6e-12, Method: Composition-based stats. 
Identities = 23/107 (21%), Positives = 46/107 (42%), Gaps = 3/107 (2%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 ++ +L +CAV++GA + D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FHECFINWMRD---CHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 + +W+ + VIA+DGK +R + R A+ + Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWMP 107 >UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanobacteria RepID=B2IT45_NOSP7 Length = 435 Score = 72.2 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 48/391 (12%), Positives = 110/391 (28%), Gaps = 68/391 (17%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIED-FGETHPDFLKQY 61 ++ + +PD R +++SD L + + + + + Q Sbjct: 11 VQYFQSILKDLPDKRTGKNKRYQMSDAALSAFSIFFTQSPSFLAHQRSMAHSKGHNNAQS 70 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECF---------INWMRDCHSSDDKDVIAIDGKT 112 + IP + I ++ I P F + S + +IA+DG Sbjct: 71 LFGVHQIPSDNHIRDLLDEIEPTVVFPVFTKIFKALENGKHLSKFRSFKNNLLIALDGTE 130 Query: 113 LRHSYD-----------KSRRRGAIHVIS--AFSTMHS---------LVIGQIKTDEKSN 150 S + K+ H + + V+ Q ++ Sbjct: 131 YFCSNEIHCEHCSSRTFKNGTTQYFHTVVTPVIVCPSNSQVIPLIPEFVVPQDGYQKQDC 190 Query: 151 EITAIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQKQGGDYLF-AVKGNQGRLNKA 205 E A + + G I D + C + + E + ++ +++ + L + Sbjct: 191 ENAAAKRWIQKYAKQYASLGITILGDDLYCHQPLCELLLQEKLNFILVCRSKSHKTLYEW 250 Query: 206 FE----EKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 E + F +K ++ Y R + + W L Sbjct: 251 LEGMPLDTFSVKHWKGKVYEIYT----------YRYVNQIPLRNSEDALLVNWCELA--- 297 Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLH-------WRLD 314 + + I+ E + R+ W +EN+ + + L+ Sbjct: 298 ----ITRSDGTIIYKNTFATNHRITDI--NVEAIVSDGRSRWKIENENNNTLKTKGYNLE 351 Query: 315 VVMNEDDCKIRRGNAAEL-FSGIRHIAINIL 344 + A + + H ++I+ Sbjct: 352 HNFGHGKTHLSSLLATFNILAFLFHTLLDII 382 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 71.4 bits (173), Expect = 5e-11, Method: 
Composition-based stats. Identities = 28/71 (39%), Positives = 43/71 (60%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ H + I D RQ+ K+ + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPVH 71 G G+PV Sbjct: 72 KGILTEGVPVR 82 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 71.4 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 43/113 (38%), Gaps = 2/113 (1%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHW 311 W GL + + R + VR+ + S+ +E A AIR H + W Sbjct: 5 RTWPGLTTVLATETLR--GGNGTDSVPAQVRHSLGSSTAPSEVLAQAIRRHGALATGEPW 62 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 L+V E+ ++R AA + +R +A++ D A + R Sbjct: 63 VLEVSFGEERSRVRERCAARHLALLRRVALDRRRADASLTASRPAQDRGLGRR 115 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 71.0 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 48/149 (32%), Gaps = 8/149 (5%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC 241 + + L L + + P L A + + G + R Sbjct: 51 RLVTEGDQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTA----GSRQTRALKAV 106 Query: 242 DVPDELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT---AEKFAT 297 VP L + L + ++ + E K+ Y I + + AT Sbjct: 107 TVPAGLGFPHAAQAIQLTRTSRPINKNTKKTEGKRRQRRETVYAICTLPAHDALPAELAT 166 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 IR HW +E +L W DV + ED + R Sbjct: 167 WIRGHWSIEVRLRWVRDVTLGEDLHQART 195 Score = 44.0 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 8/35 (22%), Positives = 21/35 (60%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVIS 39 L+ ++ +PD R+ + H + +L + +CA+++ Sbjct: 60 ALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLT 94 >UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. 
ATCC 39149 RepID=C4RIX6_9ACTO Length = 90 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 17/65 (26%), Positives = 25/65 (38%) Query: 288 ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 A R WH+EN+LHW DV E + R G + + +R+ AI + Sbjct: 9 AYAQPADLQQWARLEWHIENRLHWVRDVTFGEGTHRARTGTGPAVAAVLRNTAIGFHRGN 68 Query: 348 KVFKA 352 Sbjct: 69 GETNI 73 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 32/100 (32%), Gaps = 1/100 (1%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-YGDF 64 S + D R+A + L +L + +++SG+ ++ F E L + +G Sbjct: 8 FGDVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTS 67 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD 104 P I + + + F S Sbjct: 68 WRKAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltaproteobacteria RepID=A5GAF0_GEOUR Length = 439 Score = 68.7 bits (166), Expect = 4e-10, Method: Composition-based stats. 
Identities = 50/378 (13%), Positives = 100/378 (26%), Gaps = 53/378 (14%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +L L + IPD R K+ L+D+L+ ++ Q Sbjct: 13 QLGVLRCCLEHIPDQRDGAKI--SLADVLMSGYAMFDLKDPSLLAFDE-RRCRDAANLQR 69 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIA---------IDGKT 112 + + V+ + PA F + +A +DG Sbjct: 70 IYGIGKVACDTQLRTVIDPVDPAGLRPGFKTIVATLQRGKALQQLAYYEGYYLLSLDGTG 129 Query: 113 LRHSYDKSRRRG--------------AIHVISAFSTMHSLVIG--------QIKTDEKSN 150 S + S + + +VI Q + Sbjct: 130 SFGSENLSSASCLVKNKSNGKKLYYQQVLGAALVHPDSRVVIPLAPEMIIPQDGATKNDC 189 Query: 151 EITAIPELL----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVK-GNQGRLNKA 205 E A L I+ D + +Q+ ++ K G+ L + Sbjct: 190 ERNASKRFLPNFREDFPRLPVIVVEDGLSSNGPHIRDLQQHNMRFILGAKPGDHPLLFEN 249 Query: 206 FEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS 265 + K ++A + + + + D P LK + Sbjct: 250 LTDAIKKKT-----ATTFAQIDPKNPQIMHSYCFLNDTPLN-----QANPDLKVNFLVYE 299 Query: 266 FRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLD--VVMNEDDCK 323 A+ K + + + A R+ W +EN+ L E + Sbjct: 300 EH--NAKTGKTQRFSWVTDLPITEENAYILMRGGRSRWKIENETFNTLKNQGYNLEHNYG 357 Query: 324 IRRGNAAELFSGIRHIAI 341 + + + +E F + +A Sbjct: 358 LGKEHLSENFVMLMMLAF 375 >UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQC4_9GAMM Length = 96 Score = 67.5 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 37/89 (41%), Gaps = 1/89 (1%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +T ++ R HW + + LH+ D NED +IR G+ + + AI +L + Sbjct: 1 MTPQQVLAINRGHWSIAS-LHYISDWNYNEDRGQIRTGHGPANVTRLCRFAIGVLKHFPK 59 Query: 350 FKAGLRRKMRKAAMDRNYLASVLTGSGLS 378 + MR+ A + L + S Sbjct: 60 PGQYIPEMMRQLARRPRQVLDYLRLTAHS 88 >UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6C2C4_9GAMM Length = 77 Score = 67.5 bits (163), Expect = 8e-10, Method: Composition-based stats. 
Identities = 19/57 (33%), Positives = 31/57 (54%) Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 ED+C++ A F+ +R IAI++L D+ K LR + RK A D +Y+ + Sbjct: 21 SFREDECRVHDPMAGGNFALLRKIAISLLVRDRSNKTSLRGRCRKVAWDNDYMRQLF 77 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 67.5 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 29/74 (39%), Gaps = 1/74 (1%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + +PD R H L+ IL + I A++ GAE D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFENGIPVHDTIAR 76 I + Sbjct: 60 PLPYASRCWRDIRK 73 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 64.5 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 16/63 (25%), Positives = 29/63 (46%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +TA T +R +W +EN++H+ D ED GN + R++AI ++ + Sbjct: 89 VTAAYLHTHVRGNWGIENEVHYTRDAAWREDANPTYTGNTNHALASFRNLAIGVIGLNGT 148 Query: 350 FKA 352 Sbjct: 149 RNI 151 Score = 44.4 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 13/86 (15%), Positives = 26/86 (30%), Gaps = 1/86 (1%) Query: 53 THPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKT 112 + P T+ + I F W+ + + +AIDGK Sbjct: 20 ARLGAPLDHFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCAQI-ARGRVALAIDGKV 78 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSL 138 LR ++ A ++ + + Sbjct: 79 LRGAWSGDESVTAAYLHTHVRGNWGI 104 >UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C435B Length = 133 Score = 64.1 bits (154), Expect = 8e-09, Method: Composition-based stats. 
Identities = 27/137 (19%), Positives = 40/137 (29%), Gaps = 18/137 (13%) Query: 192 LFAVKGNQGRLNKAFEEKF-----------PLKELNNPAHDSYAMSEKSHGREEIRLHIV 240 + K NQ L E L P + G R+ Sbjct: 1 MLTAKDNQPGLVADIEAGLGFEDAARGLAAATSPLTGPDARATGAPGHVGGPGHGRIETR 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSA---DLTAEKFAT 297 L+ W GLK R++ K + V + I+S A Sbjct: 61 TVRATPLLTCHDRWTGLKHGSRITRARTV----KGVTTVEVLHGITSLTVERADARALLG 116 Query: 298 AIRNHWHVENKLHWRLD 314 +R+HW +EN+ H D Sbjct: 117 LVRSHWRIENQRHDVRD 133 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 63.3 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 52/189 (27%), Gaps = 12/189 (6%) Query: 66 NGIPVHDTIARVVSCISPAKFHECFIN-WMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 G P + + + P + + V+ +DG T+R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITTDAMGCQKDIAEK 183 +H+ + +++ Q+ DEK+NE + L + D+ G +IT A Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLITAFPAPPSHAQAAG 146 Query: 184 IQKQGGDYLFAVKGNQGRLNKAF------EEKFPLKELNNPAHDSYAMSEKSHGREEIRL 237 + L + L E D + E R Sbjct: 147 SNGHQLNTLEPGEPCPQSLADLLRPPPYGEVVLDEVPQLRVTDDLAGVRETGRTSVHSRQ 206 Query: 238 HIVCDVPDE 246 H V Sbjct: 207 HFVVRCRGP 215 >UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3536 Length = 130 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. 
Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 7/71 (9%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAE---KFATAIRNHWHVENK 308 +WKGLK+ R++ + V + I+S + +R+HW +EN+ Sbjct: 9 QDWKGLKQGFQITRERTV----NGVTTVEVVHGITSLSADRANAGALLSLLRDHWRIENQ 64 Query: 309 LHWRLDVVMNE 319 LH+ DV + E Sbjct: 65 LHYVPDVTLGE 75 >UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MIZ4_ALKOO Length = 218 Score = 61.4 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 58/187 (31%), Gaps = 33/187 (17%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + I+ + D R ++ +S I + + + + + + E K+ Sbjct: 18 DIGEKINTLKDKRVKSPVK--VSTISFVVLFGFMLQIRSFNRLNHWIE--KGKFKKVVPK 73 Query: 65 ENGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSDDKDVIAIDGKTLRHS 116 + +P D++ R ++ N + + D V AIDG L S Sbjct: 74 KTKMPCIDSVRRFLADFDLHGLKNMHSHIVKTSIKNKVFRSGTVDGLKVAAIDGVELFES 133 Query: 117 YDKSRRRG--------------AIHVISAFSTMHSLVIGQIKTDEKSN-------EITAI 155 K + S + L++GQ + K + E+T Sbjct: 134 TKKCCNNCLTRVHKDEITHYFHRSVICSTVGSDPHLILGQEMLEPKRDGSNKDEGEVTGG 193 Query: 156 PELLNML 162 L+ L Sbjct: 194 KRLIKKL 200 >UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM Length = 437 Score = 58.7 bits (140), Expect = 4e-07, Method: Composition-based stats. 
Identities = 46/358 (12%), Positives = 95/358 (26%), Gaps = 65/358 (18%) Query: 1 MELKKLMG----HISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHP- 55 + + L+ + IP K LSD L+ + F Sbjct: 9 LSMPGLLSEIKNYFEKIPSPVVKQKDSISLSDCLMSGLAIFSLKYPSLL---QFDNDKRT 65 Query: 56 ---DFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-------- 104 + + IP + + + ++ + +R + Sbjct: 66 PVVEHNLKSLYKIGIIPSDTYMRERLDELPTSELRGAYTTLIRQAQRGKVLEKFTYYNDY 125 Query: 105 -VIAIDGKTLRHSYD--------------KSRRRGAIHVISAFSTMHSLVI--------G 141 ++++DG S+D K + I+ H V+ Sbjct: 126 YLVSMDGTGYFSSHDIHCDQCCEKHHRNGKITYHHQMLGIALVHPNHHHVLPLAPEPIIK 185 Query: 142 QIKTDEKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVK- 196 Q ++ E A LL L IIT D + + ++ Y+ K Sbjct: 186 QDGVEKNDCERNAGKRLLTQLRKEYPKMKMIITEDGLASNGPHIKLLKSLNMSYILGAKP 245 Query: 197 GNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKG 256 + L + N+ Y + + R + + D + Sbjct: 246 KDHTYLFDRIK--------NSSQTKFYQTQDDDGTIHKYRYVNQVPLNESHFDLNVNFLI 297 Query: 257 LKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLD 314 +++ + + K + I ++ T E R W +EN+ L Sbjct: 298 YQEI----------SPKGKVTNFSWVTDILLSEQTLEIVMKGGRARWRIENETFNTLK 345 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 58.7 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 27/48 (56%), Positives = 40/48 (83%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 ++ DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTDEKSNE Sbjct: 26 LSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 58.3 bits (139), Expect = 5e-07, Method: Composition-based stats. 
Identities = 17/64 (26%), Positives = 33/64 (51%), Gaps = 1/64 (1%) Query: 3 LKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + D R+ +H+L IL++ +CAVI+ AE +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFEN 66 Sbjct: 61 PLPC 64 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 57.5 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 4/96 (4%) Query: 84 AKFHECFINWM-RDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA--IHVISAFSTMHSLVI 140 F + WM + +D D + DGKTLR S D+ A I +S +S + I Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQ-IKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 Q ++S+E ++ LL+ +++ ++ D +G Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQADEVG 97 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 56.8 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 26/58 (44%), Positives = 36/58 (62%), Gaps = 4/58 (6%) Query: 262 VAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + +FR +I +L + RYYISS +LTAE+ A + HW +E +HW LDV MNE Sbjct: 1 MVENFRFVIGN---KLVLEYRYYISSKELTAEQAANTVSEHWGIE-SMHWVLDVSMNE 54 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 56.0 bits (133), Expect = 2e-06, Method: Composition-based stats. 
Identities = 20/84 (23%), Positives = 33/84 (39%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L G +S IPD R+ + L +L L + AV+ GA I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIARVVSCISPAKFHE 88 + P T+ + + + E Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36C2 Length = 109 Score = 55.6 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 25/60 (41%), Gaps = 3/60 (5%) Query: 268 SIIAEQKKELEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + V Y I+S A R HW +EN LH+ DV + ED C + Sbjct: 5 ERRRKANGKATVEVVYGITSLSRLAADAAALLGYSRRHWGIENGLHYTRDVTLGEDRCPV 64 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 54.4 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 12/46 (26%), Positives = 18/46 (39%), Gaps = 1/46 (2%) Query: 8 GHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGET 53 IPD R + H+L +L L AV+ G G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAA 114 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 53.7 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 20/54 (37%), Positives = 29/54 (53%) Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL 212 L+M D+ + DA+G Q IAE+I + G DY+ A+K NQ +A F Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKE 70 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 52.9 bits (125), Expect = 2e-05, Method: Composition-based stats. 
Identities = 42/278 (15%), Positives = 89/278 (32%), Gaps = 25/278 (8%) Query: 51 GETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMR---DCHSSDDKDVIA 107 G + L + + + + I P F + R +S D ++A Sbjct: 3 GNSLSKELYDWLGYSSETATASAFVQQRDKIRPEALKLLFHEFTRLTVSENSLQDYRLLA 62 Query: 108 IDGKTLR------------HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKS-NEITA 154 +DG LR + + S+ +H+ + + M + + +K NE A Sbjct: 63 VDGSDLRLPSNSKDGFSSIRNSEDSKNYNLVHLDAMYDLMGKVYVDASVQSKKGMNEHKA 122 Query: 155 IPELLNMLDIKGKIITT-DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 + +++ +I G +I D + Q++ Y+ K + G + L Sbjct: 123 LVSMVDQSEINGNVIAIMDRGYESFNNIAHFQEKSWYYIIRAKESYG-----IISRLSLP 177 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK---KLCVAVSFRSII 270 + + + +E + L I + +K + FR++ Sbjct: 178 DYPEYDEEIMLTLTRRQTKETLPLLKAYPHRYRWIQPHTTFDFIKPKDSKFYDLHFRAVR 237 Query: 271 AEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + TV +++ D EK W +E Sbjct: 238 FAIADGVYETVYTNLNAEDFPPEKLKQLYNLRWGIETS 275 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 14/42 (33%), Positives = 26/42 (61%) Query: 6 LMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDI 47 L+ SI+PD R + L +++++T+ AV+ GA+ W D+ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDV 43 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 51.7 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 14/64 (21%), Positives = 24/64 (37%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L + +PD + H+L+ +L+ ICAV + I ++ P G Sbjct: 14 GLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGGH 73 Query: 65 ENGI 68 G Sbjct: 74 RPGP 77 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 50.6 bits (119), Expect = 8e-05, Method: Composition-based stats. 
Identities = 36/59 (61%), Positives = 38/59 (64%) Query: 1 MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK 59 MELKKLM HISIIPDYRQAWK+EHKL DIL + FGETH DFLK Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRGFGETHLDFLK 59 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 50.6 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 14/45 (31%), Positives = 25/45 (55%) Query: 134 TMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 T + + Q++ E +NEIT LL+ D++ +T DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 40/275 (14%), Positives = 84/275 (30%), Gaps = 23/275 (8%) Query: 51 GETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVI 106 G T L + DF+ P + + I P F F + + + + ++ Sbjct: 54 GCTLNKELLDFFDFDVNAPTVSAYTQQRAKILPEAFEYLFHAFTEENAQTKNLYEGYQLL 113 Query: 107 AIDG------------KTLRHSYDKSRRRGAIHVISAFSTMHSLVI-GQIKTDEKSNEIT 153 A DG +TL S +H+ + + ++ I ++T E Sbjct: 114 ACDGSNLTIAPNLNDPETLWKSNQLGATGNHLHLNALYDVLNRTYIDALVQTASTYQEHR 173 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 A +++ + + I+ D +I ++G +L +K L Sbjct: 174 ACIQMIERVTLDKVILIADRGYENYNIMSHAIEKGWKFLIRIKDVHSN---GIASGLELP 230 Query: 214 ELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + D + ++ + + + + K +SFR + Sbjct: 231 QTAVFDMDINLILTRNQTKSKKQAGYKF---MPTVQTFDYLPIGSKEDYPISFRIARFKI 287 Query: 274 KKELEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + TV + +AEK W +E Sbjct: 288 ADDSYETVITNLDRFCFSAEKLKELYHLRWGIETS 322 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. 
Identities = 31/59 (52%), Positives = 34/59 (57%) Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 12 LKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 1/55 (1%) Query: 2 ELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPD 56 EL++L + + D R HKL +++L+ +CAVI+GA+G IE + Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE-WLAGRLQ 72 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 37/355 (10%), Positives = 100/355 (28%), Gaps = 33/355 (9%) Query: 19 AWKMEHKLSDILLLTICAVISGAEG--------WEDIEDFGETHPDFLKQYGDFENGIPV 70 + + + + +G++ + ++ D + + + + Sbjct: 36 ERERKFDIVALFYTLSFGFAAGSDRSLQAFLERYVEMADCDDLSYAAFHDWFEPGFVALL 95 Query: 71 HDTIARVVSCISP--AKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 + + + + A + + + D + + + + +H+ Sbjct: 96 REILDDAIENLDTGRADLSGRLERFRDVLIADATIVSLYQDAADVYAATGEDQAELKLHL 155 Query: 129 ISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 I + ST TD ++E + +P + +I D + ++I + G Sbjct: 156 IESLSTGLPTRF--RTTDGTTHERSQLP---TGEWVADALILLDLGFYDFWLFDRIDQNG 210 Query: 189 GDYLFAVKGNQG-RLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDEL 247 G ++ VK N + + + + ++R+ Sbjct: 211 GWFVSRVKDNANFEIVEELRTWRGNSIPLEGESLQAVLDDLQRQEIDVRI---------- 260 Query: 248 IDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT---AEKFATAIRNHWH 304 T ++ + + + + + E Y+++ A A R W Sbjct: 261 ---TLSFERKRGSGASATRTFRLVGLRNEETEEYHLYLTNLGNDDYSAPDIAQLYRARWE 317 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 VE L L D+ E + I++ + L + R Sbjct: 318 VE-LLFKELKSRFGLDEINTTDAYIIEALIIMAAISLMMSRVIVDELRSLEARQR 371 
>UniRef50_C7GHC1 Transposase, IS4 family (Fragment) n=6 Tax=Roseburia intestinalis L1-82 RepID=C7GHC1_9FIRM Length = 232 Score = 48.3 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 55/201 (27%), Gaps = 27/201 (13%) Query: 137 SLVIGQIK------TDEKSNEITAIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQK 186 +++GQ + + E+T L+ L + +I DA+ +++ Sbjct: 8 HVILGQEMLKPRDGSGKDEGELTGGKRLIERLKKRHGHFADVIVADALYLNAPFINTLKE 67 Query: 187 QGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDE 246 G + + +K + + + E F E + Sbjct: 68 NGLEGVIRLKDERRMIFQDAERLF-------------KQDEGKKASFWKGK----KKIEV 110 Query: 247 LIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVE 306 F+ +G V + E KE E + + + W +E Sbjct: 111 WDLSGFKMEGCPYKLRVVRYHEQWEENGKETERFMWLVTTLEAADYRVLWEMMHRRWDIE 170 Query: 307 NKLHWRLDVVMNEDDCKIRRG 327 +L + C R Sbjct: 171 ENGFHQLKTYYHAKHCYCRDA 191 >UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteobacteria RepID=A6X872_OCHA4 Length = 330 Score = 47.5 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 74/231 (32%), Gaps = 32/231 (13%) Query: 17 RQAWK--MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 R+ + I IC + + E L + + E +P H T Sbjct: 53 RKTRGGQCRYSDLAIETTLICGKV-----FNQPLRQTEGLMASLLRLLNVELPVPDHTTF 107 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSDDKDVIAID-----------GKTLRHSYDKSRRR 123 +R + + + C +D+ + +D +H +R+ Sbjct: 108 SRRCANLVVSSLTRCTRR-----DGTDEPLHVIVDSTGMKIYEAGQWLEEKHGAKSARKW 162 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+ A + VI + TD+ +++++ +P+LL+M+D D + Sbjct: 163 LKLHL--AIDADSNQVIAETLTDQNTSDLSQVPDLLDMIDRPIACFMADGAYDSDQTYQA 220 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREE 234 ++ + L +A D ++ + GR E Sbjct: 221 LRSHSPGVSIIIPPRIRDLQEA-------SYGPPDQRDWHSRTNAQRGRME 264 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. 
PS RepID=A7BWL6_9GAMM Length = 94 Score = 47.5 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 1/55 (1%) Query: 8 GHISIIPDYRQAW-KMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 H +PD R+ + HK DIL++ ICA+I GA+ W + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XCY0_9BACT Length = 481 Score = 47.5 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 40/319 (12%), Positives = 78/319 (24%), Gaps = 34/319 (10%) Query: 45 EDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFIN-WMRDCHSSDDK 103 + + R + + + Sbjct: 74 TACREVVRQLLSDWQAQAGRTRAQAGTAAYCRARQRLPLERLQAILQATLGPEPPRWRGH 133 Query: 104 DVIAIDGKTLR--------------HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKS 149 V +DG T + + V++ FS L + + + Sbjct: 134 AVKLVDGTTFSLPDTAANQKKFPQSGAQKPGCGFPTLKVVALFSLASGLALNWARGSLRV 193 Query: 150 NEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK 209 +EI +L + L + II D + +G D LF + Sbjct: 194 HEIPLFRKLWSGLRRRDLII-GDRGFSSYTNLALLLGRGVDCLFRL-------------- 238 Query: 210 FPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSI 269 K++ +P +K R+ + EW + F I Sbjct: 239 HQGKKVRHPRRSRLQRKQKLGPRQWLVQ-WKKPYQKPEYMRPKEWAAVPSEMQVRVFEVI 297 Query: 270 IAEQKKELE--MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRG 327 + + M V + E+ A W +E L + + + + Sbjct: 298 VCTRGMRTRKLMLVTTLLDPVRYPVEELAELYLRRWEIELS-FRDLKTTLGLEVLRCQSP 356 Query: 328 NAAELFSGIRHIAINILTN 346 E + IA N+L Sbjct: 357 AMVEKEVWMHLIAFNLLRR 375 >UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P672_AZOSE Length = 47 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. 
Identities = 15/31 (48%), Positives = 18/31 (58%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 HW VEN LHW L+V NED ++R A Sbjct: 1 HWGVENWLHWCLNVQFNEDRSRVRSAYAVNN 31 >UniRef50_C7G6U9 Putative uncharacterized protein (Fragment) n=7 Tax=Clostridiales RepID=C7G6U9_9FIRM Length = 212 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 63/211 (29%), Gaps = 36/211 (17%) Query: 5 KLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 K+ I + D R+ + L +I++ + ++ E + I F + Sbjct: 6 KIPQKIKCLTDERKRKSI--PLFNIVMPVLLFLMLQYESFHTI--FSAPESMSKRLKNCI 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECF--------INWMRDCHSSDDKDVIAIDGKTLRHS 116 IP D + ++S I+P + N + + V +DG L S Sbjct: 62 SGRIPKVDAVRDLLSRINPDEIRSIHEEMIDIIKRNRVFREGTIGGYVVAGLDGVELFSS 121 Query: 117 YDKSRRRG--------------AIHVISAFSTMHSLVIGQIK------TDEKSNEITAIP 156 KS V +++GQ + + E+T Sbjct: 122 TKKSCPNCLSRKKHTGETEYFYRSVVCMIIGKSPHVILGQEMLKPRDGSGKDEGELTGGK 181 Query: 157 ELLNMLDIK----GKIITTDAMGCQKDIAEK 183 L+ L + +I DA+ Sbjct: 182 RLIERLKKRHGHFADVIVADALYLNAPFINT 212 >UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z8_THET2 Length = 77 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN+ W DV++ E+ C++R G A++ + +R +++L V + R++ KAA+ Sbjct: 1 MENRSFWVRDVLLYEEACQVR-GVGAQVLAALRAFLVSLLHRRGVREKVTRQRTLKAAL 58 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. 
Identities = 38/313 (12%), Positives = 91/313 (29%), Gaps = 42/313 (13%) Query: 19 AWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVV 78 + + +++ + + ++ + +GIP + + Sbjct: 35 SRNRKLPFEEVIRFLLPLQGQCMD-----QELFRHFSKKPLFFSTDYSGIPHSSAMIQAR 89 Query: 79 SCISPAKFHECFINWMRDCHS---SDDKDVIAIDG------------KTLRHSYDKSRRR 123 +S + F ++ C ++AIDG R + S+ R Sbjct: 90 QKLSDSAMPALFHSFTETCKKGALFQGYQLLAIDGSQFSVPENLKEPLCWRKIPNISKGR 149 Query: 124 GAIHVISAFSTMHSL---VIGQIKTDEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKD 179 IH+ + + + V+ Q + NE A+ ++++ I D + Sbjct: 150 NVIHLNAMYHLQSGIFEDVVFQPICE--CNEHKALAQMVDRRSSAFPAIFMADRGYESYN 207 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSH-----GREE 234 I+++G Y+ + + + P E + + Y + S R+ Sbjct: 208 TFAHIEQKGDKYVVRGRESGTGICSGLN--LPDTEEYDIEKELYICKKHSKKVKTNPRKY 265 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEK 294 R+ D L R + + + + +S +A+ Sbjct: 266 KRIRSDATFDFFTDDCEEYRLNL---------RIVKIKLSETTTEVLFTNLSKEKFSADD 316 Query: 295 FATAIRNHWHVEN 307 W +E Sbjct: 317 LKRLYHMRWGIET 329 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 46.0 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 67/176 (38%), Gaps = 21/176 (11%) Query: 43 GWEDIEDF-----GETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 +E++ F G+ D L +Y +F+N P + + + + I P F F + + Sbjct: 40 SFEEVMKFMLTMEGKALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFLFQEFTKSF 99 Query: 98 HSS---DDKDVIAIDGKTLRHSYDK------------SRRRGAIHVISAFS-TMHSLVIG 141 + + +IA DG L +++ + +H+ + + Sbjct: 100 TDNVTYNGLRLIACDGSDLCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDA 159 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 I+ +NE A+ E+++ + I D +I ++ +G YL VK Sbjct: 160 IIQPSRLANERRAMCEMIDRYNDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKD 215 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. 
Identities = 15/53 (28%), Positives = 25/53 (47%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS 99 + F + + ++ D + G P DT+ RV + I P KF E F +W+ Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWILFLMQ 53 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 27/50 (54%), Gaps = 13/50 (26%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF-------------SGIRHIAINILT 345 +HWRLDV MNEDDC+IRRGN F +R I INIL Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILK 50 >UniRef50_B0JNZ6 Transposase n=20 Tax=Cyanobacteria RepID=B0JNZ6_MICAN Length = 382 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 47/310 (15%), Positives = 83/310 (26%), Gaps = 57/310 (18%) Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI------------AID 109 IP + I ++ P F ++ + VI A+D Sbjct: 13 LFGIKKIPGDNQIRNLL---DPIPAATIFGSFQQVYQWLKKPGVIKKFFYLDEEILIALD 69 Query: 110 GKTLRHSYD-----------KSRRRGAIHVIS---AFSTMHSLVIG--------QIKTDE 147 G S ++ H S VI Q + Sbjct: 70 GTEYFSSKKISCPHCNCRNPRNGTTTYFHGCVTPIVVSPEQKQVINLEPEFIKKQDGQQK 129 Query: 148 KSNEITAIPELLNMLDIK--GKIITT--DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLN 203 + E A+ L+ K G +T D + ++ I E KQG +++F + Sbjct: 130 QDCENAAVKRWLDKNHQKKYGYPVTLLGDDLYSRQPICELALKQGYNFIFVCLETSHKTL 189 Query: 204 KAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVA 263 + E R V VP ++ + E + + Sbjct: 190 YEWREFLEKSGEVKTVEKKQW---DGRKNLIYRYRYVSRVPLREVESSLEVNWCEVTVIN 246 Query: 264 VSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIRNHWHVENKL------H-WRLDVV 316 + II + + + EK A R+ W VEN+ H + L+ Sbjct: 247 EKTQKIIYQNNWITNHQI------TENNVEKIVKAGRSRWKVENEGNNVLKNHGYNLEHN 300 Query: 317 MNEDDCKIRR 326 + Sbjct: 301 FGHGQSHLCE 310 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. 
Identities = 34/207 (16%), Positives = 69/207 (33%), Gaps = 19/207 (9%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 I+PD R +++H L +L I A+ +G E D + G H L+ + + Sbjct: 49 KIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLNDHD--GLRHDYALQTAVNRLQPLAG 106 Query: 71 HDTIARVVSCISPAKFHECFI----NWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR---- 122 T+ R+ + +++ + + V+ D + D+ R Sbjct: 107 KSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGDQEGRFFHG 166 Query: 123 ---RGAIHVISAFSTMHSLVIGQIKTDEKSNEIT-AIPELLNMLDIKGK-----IITTDA 173 + F H LV ++ + AI LL + + D Sbjct: 167 YYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPETRIVFRGDG 226 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQG 200 C+ + + ++ DY+ + N Sbjct: 227 GFCRHRMLDWCDRKQVDYVVGLARNTR 253 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 35/111 (31%), Gaps = 7/111 (6%) Query: 41 AEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS 100 + +E F +P L G + P K E + + Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLLPLLLHRLD--PKKLQEALHQVFPE---A 55 Query: 101 DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 D V+ +DGK LR S + + ++ + + Q + + K E Sbjct: 56 DLGGVLVVDGKHLRGS--GKGKSPQVRLVEVLALHLKTTLAQARVEGKVVE 104 >UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUW9_9BACT Length = 61 Score = 44.8 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 11/62 (17%), Positives = 23/62 (37%), Gaps = 2/62 (3%) Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASV 371 D ED +IR NA + ++++ + + V + + +R A + Sbjct: 1 MRDTSFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPN--IAKTLRNFAARPFLALQM 58 Query: 372 LT 373 L Sbjct: 59 LR 60 >UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3YV03_9SYNE Length = 113 Score = 44.4 bits (103), Expect = 0.006, Method: Composition-based stats. 
Identities = 12/66 (18%), Positives = 27/66 (40%), Gaps = 2/66 (3%) Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 +++S T + +R+ W +EN H+ + ++E + N A + + Sbjct: 17 THLFLTSLSSTPKTLLQLVRDRWSIENW-HFFRNTQLHESAH-GYQDNGACAMTTQKTGT 74 Query: 341 INILTN 346 N+L Sbjct: 75 QNLLRL 80 >UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVT6_9GAMM Length = 120 Score = 44.0 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 11/94 (11%), Positives = 27/94 (28%), Gaps = 4/94 (4%) Query: 3 LKKLMGHISIIPDYRQAWK-MEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 L+ + + D R + L+D L + + +D + + Sbjct: 14 LRTVRACFEALDDPRSRPNSTRYTLADALSSALAMFLLKYPSLLQFDDSARAADEVTRHN 73 Query: 62 GDFENGI---PVHDTIARVVSCISPAKFHECFIN 92 G+ P + ++ + P+ F Sbjct: 74 LGTLYGVEQVPCDTQMRAILDPLKPSTLRGAFRA 107 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 44.0 bits (102), Expect = 0.009, Method: Composition-based stats. 
Identities = 42/272 (15%), Positives = 80/272 (29%), Gaps = 25/272 (9%) Query: 56 DFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---DKDVIAIDGKT 112 D L ++ DF P + S I P F F + + ++AIDG Sbjct: 21 DELLKFNDFSITTPSASAFVQARSKIKPEAFRTLFDGFNKKTFKKKLYHGYRLLAIDGSE 80 Query: 113 LR-------HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDE-------KSNEITAIPEL 158 L R SA+ S + + D+ K +E A +L Sbjct: 81 LPIDNTIFDDETTVLRHGTLAKTFSAYHLNASYDLMERTYDDIIIQGEAKRDEHGAFCQL 140 Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 ++ D + I D + E + G YL V+ + P Sbjct: 141 VDRYDGQKAIFIADRGYESYNGFEHVVHSGHKYLIRVRD------IESQSSITKSLGPFP 194 Query: 219 AHDSYAMSEKSHGREEIRLHIVCDVPDELID--FTFEWKGLKKLCVAVSFRSIIAEQKKE 276 + + ++ ++ C + + F++ + + R + + + Sbjct: 195 DGEFDVDVSRMLTLKQTKMIKACPDVYKFVPKNMRFDFMNKQNPWYEFNCRVVRLKITEN 254 Query: 277 LEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 TV +S + + E W E Sbjct: 255 TYETVITNLSRNEFSMEDICEIYNMRWGEETS 286 >UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LF21_FRASN Length = 420 Score = 43.3 bits (100), Expect = 0.015, Method: Composition-based stats. 
Identities = 40/315 (12%), Positives = 89/315 (28%), Gaps = 43/315 (13%) Query: 18 QAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARV 77 + K ++ T+ + + ++++ +L + +P I++ Sbjct: 47 EQRKRLLPARVVVYFTMAMCLFFDDDYDEVMRRLVGTLRWLGSWKGDW-KVPSTGAISQA 105 Query: 78 VSCISPAKFHECFINWMRDCHSSD-------DKDVIAIDGKTL--------------RHS 116 + + P F + ++A+DG L Sbjct: 106 RTRLGPEPLKLLFERVAVPVAGLGTKGAWLGSRRLVAVDGVHLDTADTPENADAFGRFSH 165 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K+ +HV++ V S+E + L + + G ++T D Sbjct: 166 GPKTAAFPQVHVVALAECGTHAVFAAAIGAYTSDERSLAATLFDACE-PGMLLTADRNFY 224 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIR 236 + ++ G D L+ V N L P + D + G+ Sbjct: 225 GYGLWQQALATGADLLWRVNAN---LTLPVIRALPDGSYLSLLID-PKIPVARRGQL--- 277 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT----A 292 + D T ++ + +V + I++ A Sbjct: 278 ---IADARAGHAPPTESALPVRVIEYSVPDHEENG------TSELICLITNILDPTDVAA 328 Query: 293 EKFATAIRNHWHVEN 307 + ATA W +E+ Sbjct: 329 IELATAYHERWEIES 343 >UniRef50_A5FU21 Transposase, IS4 family protein n=11 Tax=Alphaproteobacteria RepID=A5FU21_ACICJ Length = 448 Score = 43.3 bits (100), Expect = 0.017, Method: Composition-based stats. 
Identities = 41/323 (12%), Positives = 99/323 (30%), Gaps = 56/323 (17%) Query: 11 SIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDFGET-HPDFLKQYGDFENGIP 69 + I D R +++H L +I+ + + +G E D + + + Sbjct: 55 ACIDDPRTPERVQHGLDEIIRFRMLMIAAGYEDGNDADRLRNDPMFKLAMERLPEAGDLC 114 Query: 70 VHDTIARVVSCISPAKFHE----CFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 TI+R + P ++ + ++ V+ ID ++D + Sbjct: 115 SQATISRTENLPGPRALLRMGLAMVEHYCASFRTIPNRVVLDID-----DTFDAAHGAQQ 169 Query: 126 IHVISAFSTM-----------HSLVIGQI---KTDEKSNEI-TAIPELLNML----DIKG 166 + + +A ++ + K ++I + L++ + Sbjct: 170 LCLFNAHHDEYGFQPIVVFDGDGRMLAAVLRPACRPKGSQIVKWLRRLIDAIRSHWPRTA 229 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMS 226 ++ D+ C ++ + + DY+F V L K + A + Sbjct: 230 IMLRGDSHYCTPEVLRFCRARRLDYIFGV-APTTTLRKHVIAL---------EASTTARA 279 Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYIS 286 +++ G + R E D W + R I + + + R+ ++ Sbjct: 280 QQAPGEKIRRF-------KEFNDGAASWDRV--------ERIIARVEAGPMGVDTRFIVT 324 Query: 287 SADLTAEKFA--TAIRNHWHVEN 307 S + + EN Sbjct: 325 SLKAGSPRTLYQEIYCARGQAEN 347 >UniRef50_D2KXE5 Putative transposase n=1 Tax=Lactobacillus fermentum RepID=D2KXE5_LACFE Length = 373 Score = 42.9 bits (99), Expect = 0.017, Method: Composition-based stats. 
Identities = 50/356 (14%), Positives = 94/356 (26%), Gaps = 57/356 (16%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 + DI L + ++G D+ D + + + +++R + A Sbjct: 1 MIDIFLEKLLLDVAGYLHDSSANDW---QRDPVLAATLGSSRLVSQPSLSRFFKRLEDAD 57 Query: 86 FHECFIN--WM----RDCHSSDDKDVIAID-------GKTLRHSYDKSRRRGAIHVISAF 132 + F W S + V+ ID G R +++ H F Sbjct: 58 LDD-FRKLIWQVAALAFRLSGQSRFVLDIDSTHCDTFGNQERSAFNAHYMAYGYHPQVVF 116 Query: 133 STMHSLVIGQIKTDEK---SNEI-----TAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 L++ + E T L + II D+ E Sbjct: 117 DQESGLLLDALMRPGNEYIGKEADKFVETTFSRLSALPQSTSVIIRGDSGFAAPKFYEMC 176 Query: 185 QKQGGDYLFAVKGNQG--RLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCD 242 G D+L +K N +L + P K + Sbjct: 177 DTHGVDFLVRLKANSKLGKLAETALVDCPPKYEESKC----------------------- 213 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLT-AEKFATAIRN 301 V E W +++ + + + + ++S E R Sbjct: 214 VYHEFKYQAASWGKARRVIICSTHTADELVPWN-----HAFVVTSLASEAPETLFKTYRQ 268 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 + EN++ L D + I IA N++ K R+ Sbjct: 269 RGNAENQIK-ELKCGFGFDKTDSSTFARNTARALITGIAYNLVQLFKQLFVSEDRR 323 >UniRef50_A7JYJ5 Putative uncharacterized protein n=1 Tax=Vibrio sp. Ex25 RepID=A7JYJ5_VIBSE Length = 47 Score = 42.9 bits (99), Expect = 0.019, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 1/48 (2%) Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + + EDD + R AE S IR +N++ + K L + ++A Sbjct: 1 MNLKEDDLRNRVAGGAENVSVIRRFTLNLVRL-QSKKYSLGAEAKQAG 47 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 42.5 bits (98), Expect = 0.028, Method: Composition-based stats. 
Identities = 16/30 (53%), Positives = 24/30 (80%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 +++A +T + + IGQ+K D KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 >UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001905F7C Length = 367 Score = 42.1 bits (97), Expect = 0.034, Method: Composition-based stats. Identities = 35/285 (12%), Positives = 81/285 (28%), Gaps = 21/285 (7%) Query: 27 SDILLLTICAVISGAEGWEDIEDFGETH-PDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 +D+L L V+ G + + + + + D + +VS + + Sbjct: 43 ADLLRLCFAYVLGGF-SLRTLAAWADQRGLASMSDVAMLKRLKASADWVGYLVSELLAER 101 Query: 86 FHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKT 145 E F D ++A+D + ++ + L + ++ Sbjct: 102 CPEAFAGVHSDLR------LMAVDATV----VAPPGPKRDYWMVHTVFDLSRLKLSSVEV 151 Query: 146 DEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKA 205 ++ E + + +++ D + + K G D+L N RL Sbjct: 152 TDRR-EAERLSRGVKAGELRI----ADRAHAKATDLAAVVKAGADFLVRAPSNYPRLLDG 206 Query: 206 FEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVC-DVPDELIDFTFEWKGLKKLCVAV 264 + L A D + ++ V V + K + Sbjct: 207 DGQLLERLALCREAGDKGVLDRSVRIQDGKSKVEVAARVVILPLPPEAAAKARRAARRLA 266 Query: 265 SFRSIIAEQKKELEMTVRYYISSADLT---AEKFATAIRNHWHVE 306 + + ++S + E+ A+ R W +E Sbjct: 267 AKARYKPSEAGIEMAGYLVLLTSLNADDWPPERLASTYRLRWQIE 311 >UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID=Q877V8_PSEPK Length = 433 Score = 41.7 bits (96), Expect = 0.046, Method: Composition-based stats. 
Identities = 52/366 (14%), Positives = 103/366 (28%), Gaps = 50/366 (13%) Query: 7 MGHISII-PDYRQAWKMEHKLSDILLLTICAVISG-AEGWEDIEDFGETHPDFLKQYGDF 64 + + ++RQ L ++ + V G + P L D Sbjct: 12 PEWVDQVFEEHRQRQYSRELLFSTIIKLMSLVSLGLKPSLHAAARQLDDLPVSLAALYDK 71 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK----- 119 + + R + + D V +DG L + + Sbjct: 72 ISR--TEPALLRALVTGCAQRLAPTIHELGCSAMLPD-WQVRVVDGSHLASTEKRLGALR 128 Query: 120 --SRRRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 + + VI Q D ++E + LL I D + C Sbjct: 129 QERGAARPGFSVVVYDPDLDQVIDLQPCEDAYASERVCVLPLLAEAKTNQVWI-ADRLYC 187 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIR 236 + E ++ ++ + RL + E + P+ + E G R Sbjct: 188 TLPVMEACEQVKTSFVIRQQAKHPRLIQEGEWQAPMPVATGTVREQ--SIEVKGGHRWRR 245 Query: 237 LHIVCDVPDELIDFT-FEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKF 295 + + P++ D + W L ++A++ Sbjct: 246 VELTLHSPNDSGDNSLMFWSNLP-----------------------------ESISAQQI 276 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A R W +E + RL+ ++ + + AA L +A N+L K + Sbjct: 277 ADFYRRRWSIE-GMFQRLEAILESEIETLGSPRAALLGFTTAVLAYNVLALL---KRSVE 332 Query: 356 RKMRKA 361 + R A Sbjct: 333 QAHRDA 338 >UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4D726 Length = 464 Score = 41.0 bits (94), Expect = 0.081, Method: Composition-based stats. 
Identities = 34/258 (13%), Positives = 80/258 (31%), Gaps = 26/258 (10%) Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + + + + + ++G + E+ LL +L G ++ D Sbjct: 138 ATSRPAGYPQVRLTALVECGTRALMGAVFGPMHDKELPQARRLLPVL-RPGILLLADRGY 196 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEI 235 + G D L+ V+ GRL + L + +H S + +S R Sbjct: 197 DGYEAIRDAASTGADLLWRVQS--GRLLPVI------QPLPDGSHLSQILDRRSGDRLAA 248 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTA--- 292 P + + ++ + + I++ A Sbjct: 249 WQRRKRPTPPPALTA--------MAVRVIRYQVTVTTADGRQHSSTVRLITTLLDPARHP 300 Query: 293 -EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFK 351 + A W +E ++ L V + ++ R + + G+ +LT ++ + Sbjct: 301 AAELAELYHQRWEIETA-YYGLKVTLR-GSDRVLRSHTVQ---GVEQEIYALLTVFQLTR 355 Query: 352 AGLRRKMRKAAMDRNYLA 369 + A +D + L+ Sbjct: 356 TAIHNTAHIAGLDPDRLS 373

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.309    0.121    0.310

Lambda     K      H
   0.267   0.0376    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,900,413,746
Number of Sequences: 3077464
Number of extensions: 67731745
Number of successful extensions: 211656
Number of sequences better than 1.0e-01: 250
Number of HSP's better than 0.1 without gapping: 565
Number of HSP's successfully gapped in prelim test: 111
Number of HSP's that attempted gapping in prelim test: 209752
Number of HSP's gapped (non-prelim): 728
length of query: 378
length of database: 1,040,396,356
effective HSP length: 130
effective length of query: 248
effective length of database: 640,326,036
effective search space: 158800856928
effective search space used: 158800856928
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 93 (40.6 bits)
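
The figures in the statistics footer above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the BLAST output itself. It assumes the usual Karlin-Altschul relations: effective lengths are the raw lengths minus the effective HSP length (once for the query, once per database sequence), a bit score is (Lambda * S - ln K) / ln 2, and the expected number of chance hits is roughly search space * 2^(-bits). It also assumes that the first Lambda/K pair is the ungapped set and the second the gapped set; the report does not label them. The function names are illustrative only. Because this search used composition-based statistics, the E-values printed per hit will differ somewhat from the simple estimate.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 378
DB_LETTERS = 1_040_396_356
DB_SEQUENCES = 3_077_464
EFFECTIVE_HSP_LENGTH = 130

# Assumption: first Lambda/K pair is ungapped, second is gapped.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.309, 0.121
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0376


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len  # one correction per sequence
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_from_bits(bits, search_space):
    """Approximate E-value for a bit score, ignoring composition adjustment."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", space)
print("S1 = 41 in bits (ungapped):  ", round(bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K), 1))
print("S2 = 93 in bits (gapped):    ", round(bit_score(93, GAPPED_LAMBDA, GAPPED_K), 1))
print("approx. E for 41.0 bits:     ", expect_from_bits(41.0, space))
```

The first three printed values should agree with the "effective length of query", "effective length of database" and "effective search space" lines in the footer, and the two bit-score conversions should land close to the values shown in parentheses for S1 and S2.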