BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (378 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 766 0.0 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 375 e-102 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 322 1e-86 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 291 2e-77 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 278 2e-73 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 278 3e-73 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 276 8e-73 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 275 2e-72 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 260 6e-68 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 253 1e-65 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 238 2e-61 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 236 8e-61 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 228 3e-58 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 226 1e-57 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 221 4e-56 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 219 1e-55 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 218 3e-55 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 218 3e-55 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 218 4e-55 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 216 1e-54 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 215 2e-54 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 213 6e-54 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 211 4e-53 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 211 4e-53 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 211 4e-53 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 210 5e-53 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 210 6e-53 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 207 8e-52 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 206 9e-52 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 204 3e-51 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 204 6e-51 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 202 1e-50 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 197 5e-49 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 197 7e-49 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 195 2e-48 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 192 2e-47 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 190 6e-47 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 186 1e-45 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 181 5e-44 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 181 5e-44 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 179 2e-43 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 176 1e-42 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 174 6e-42 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 172 2e-41 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 169 1e-40 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 168 4e-40 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 167 4e-40 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 166 1e-39 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 166 1e-39 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 165 3e-39 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 165 3e-39 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 164 4e-39 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 160 1e-37 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 159 2e-37 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 157 6e-37 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 154 3e-36 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 152 2e-35 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 151 4e-35 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 149 1e-34 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 148 3e-34 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 145 3e-33 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 141 3e-32 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 141 3e-32 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 139 1e-31 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 138 3e-31 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 132 3e-29 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 129 1e-28 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 127 5e-28 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 118 5e-25 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 117 5e-25 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 114 5e-24 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 113 1e-23 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 110 7e-23 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 110 1e-22 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 104 5e-21 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 104 5e-21 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 102 2e-20 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 102 3e-20 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 102 3e-20 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 101 4e-20 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 100 1e-19 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 99 3e-19 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 99 3e-19 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 96 2e-18 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 96 3e-18 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 95 4e-18 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 94 9e-18 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 93 1e-17 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 92 2e-17 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 92 4e-17 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 92 4e-17 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 92 4e-17 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 90 1e-16 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 87 7e-16 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 85 4e-15 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 85 6e-15 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 83 2e-14 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 83 2e-14 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 83 2e-14 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 82 3e-14 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 82 4e-14 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 81 7e-14 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 79 3e-13 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 79 3e-13 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 78 5e-13 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 77 1e-12 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 76 2e-12 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 76 2e-12 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 76 2e-12 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 76 2e-12 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 75 4e-12 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 74 1e-11 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 73 2e-11 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 72 2e-11 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 72 3e-11 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 72 4e-11 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 71 8e-11 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 70 9e-11 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 70 1e-10 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 69 3e-10 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 69 3e-10 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 67 8e-10 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 67 1e-09 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 66 2e-09 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 66 2e-09 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 66 2e-09 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 65 3e-09 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 65 5e-09 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 65 5e-09 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 64 8e-09 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 64 1e-08 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 63 1e-08 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 63 2e-08 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 62 3e-08 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 62 3e-08 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 61 6e-08 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 61 8e-08 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 60 1e-07 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 60 1e-07 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 59 3e-07 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 57 1e-06 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 57 1e-06 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 57 1e-06 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 56 2e-06 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 56 2e-06 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 55 6e-06 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 54 7e-06 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 54 9e-06 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 54 9e-06 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 54 9e-06 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 54 1e-05 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 54 1e-05 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 53 2e-05 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 51 6e-05 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 51 6e-05 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 51 8e-05 UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia ... 51 9e-05 UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone... 50 1e-04 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 49 2e-04 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 49 2e-04 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 49 3e-04 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 49 3e-04 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 48 6e-04 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 47 0.001 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 47 0.001 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 46 0.002 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 46 0.003 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 44 0.008 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 44 0.009 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 43 0.020 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 42 0.037 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 41 0.060 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 40 0.096 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 766 bits (1978), Expect = 0.0, Method: Compositional matrix adjust. Identities = 370/378 (97%), Positives = 373/378 (98%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 MELKKLMEHISIIPDYRQ WKVEHKLS ILLLTI AVISGAE WEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS+DKDVIAIDGKTLRHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 RRRGAIHVISAFSTMHSLVIGQIKTD+KSNEITAIPELLNMLDIKGKIITTDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNPEHDSYA+SEKSHGREEIRLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 Query: 361 AAMDRNYLASVLAGSGLS 378 AAMDRNYLASVLAGSGLS Sbjct: 361 AAMDRNYLASVLAGSGLS 378 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 375 bits (962), Expect = e-102, Method: Compositional matrix adjust. Identities = 181/372 (48%), Positives = 250/372 (67%), Gaps = 4/372 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+ +SII D RQ KV H L +L L I AVISG E WE+I+DFG LD+L++Y F Sbjct: 6 LINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRKYLPFS 65 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 GIP DTI+R+ I P +F +CF WM+ C + DVIAIDGKTLR S++K + Sbjct: 66 GGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKKDKSDT 125 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ IA+KI Sbjct: 126 IHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKIAKKIV 185 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 +GGDYL VKGNQ RL A + F ++ L PE ++Y EK HGRE+ R+ +V D + Sbjct: 186 DKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMVADA-N 244 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 E+ D FEW GLK L AVSFR+ E+ + + V++YISSA L A+ A R HW V Sbjct: 245 EIGDLVFEWPGLKTLGYAVSFRT---EKDMQTTVAVKFYISSAKLDAKSLLEASRAHWTV 301 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++A Sbjct: 302 ENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQANRSD 361 Query: 366 NYLASVLAGSGL 377 +Y V++G L Sbjct: 362 SYRELVVSGLSL 373 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 322 bits (826), Expect = 1e-86, Method: Compositional matrix adjust. Identities = 169/372 (45%), Positives = 236/372 (63%), Gaps = 8/372 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+E SII D RQ K++H+L IL L + AVI GAE W+DIE+ G L++L++ G F+ Sbjct: 7 LVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFFK 66 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ A Sbjct: 67 KGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKSA 126 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH++SA++ + +V+GQ KTD KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 127 IHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKIV 186 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 + GDY+ AVK NQ +L++ + F + HD + S K HGR E+R + + D Sbjct: 187 TKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWISD 246 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 + L + W L+ + + S R I + E RY+I+S A+ FA A+R H Sbjct: 247 MLSTLGN-PERWASLQSIGMVESERYIDGKTTAE----TRYFITSIAPDAKIFANAVRKH 301 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K KA Sbjct: 302 WAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRYKAT 361 Query: 363 MDRNYLASVLAG 374 + +Y VL G Sbjct: 362 LQPDYAQKVLNG 373 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 291 bits (746), Expect = 2e-77, Method: Compositional matrix adjust. Identities = 158/372 (42%), Positives = 228/372 (61%), Gaps = 7/372 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M ++ +H S I D+RQ+ KV + L +L ++ AVI+ + W +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A I +GGDYL AVK NQG L KA + F + D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFS-PHRSAGLSDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 DFT W+ LK + + SFR++ + K + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDFT-HWEALKSIVMVESFRAV---KGKTASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E+ +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIES-MHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 278 bits (712), Expect = 2e-73, Method: Compositional matrix adjust. Identities = 146/368 (39%), Positives = 220/368 (59%), Gaps = 20/368 (5%) Query: 23 EHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCIS 82 +HKL I+ +TI AVI GA+SW DIE FG+ +LK++ + NGIP HDT RV S ++ Sbjct: 26 KHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNGIPSHDTFGRVFSLLN 85 Query: 83 PAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQ 142 P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ +ISA++T + LV+GQ Sbjct: 86 PEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQMISAWATTNGLVLGQ 145 Query: 143 IKTDKKSNEITAIPE---------------LLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 D+KSNEITAIP+ LL +L + G I+T DA+GCQK+I ++I +Q Sbjct: 146 SIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLDAIGCQKEIVKQITEQ 205 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE---HDSYAMSEKSHGREEIRLHIVCDVP 244 DY+ +K NQG L + E F ++N E Y + ++ HGR+E+R + + Sbjct: 206 DADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEGHGRQEVRYYQMLSNV 265 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 E ID ++W L + R + + + RY+ISS + + FA+++R HW Sbjct: 266 AEEIDPDWQWLNLNSIGYVEYLR--VENGTDKTSLERRYFISSLNNNIKLFASSVREHWC 323 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K K G++ K +KA D Sbjct: 324 IENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKTLKVGVKAKRKKAGWD 383 Query: 365 RNYLASVL 372 NYL VL Sbjct: 384 ENYLLKVL 391 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 278 bits (710), Expect = 3e-73, Method: Compositional matrix adjust. Identities = 156/376 (41%), Positives = 222/376 (59%), Gaps = 10/376 (2%) Query: 3 LKKLMEHISIIPD-YRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +K E+ + D R+ H IL++ + A+ISGA ++ +IE FG + ++ + + Sbjct: 6 VKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQTF 65 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSR 121 NGIP HDT V++ +SP +F CF+ W + IAID KTLR S DK Sbjct: 66 LALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKKN 125 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + +H++SA++T +LVIGQIKT++ SNEITAIPELLN LD+KG +++ DAMGCQ +IA Sbjct: 126 GKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEIA 185 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH---DSYAMSEKSHGREEIRLH 238 EKI ++ DY+ A+KGNQ +L+++ E F L N E D E S+GREEIR Sbjct: 186 EKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRCA 245 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 + +++I EWK +K + + S R KKE E +RYYISSA L+AE Sbjct: 246 YATNEIEKIIA-NDEWKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLKV 299 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R HW +ENKLHW LDV ED+ +IR+ N AE + +R IA+N++ +K K G K Sbjct: 300 VRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATKR 359 Query: 359 RKAAMDRNYLASVLAG 374 A D YL +L G Sbjct: 360 LMAGWDEKYLLKLLNG 375 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 276 bits (707), Expect = 8e-73, Method: Compositional matrix adjust. Identities = 158/373 (42%), Positives = 223/373 (59%), Gaps = 14/373 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 + + H S I D RQ KV + L ILLLT+ AV+SGA W I +G L FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + Sbjct: 84 ADGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTV-TGVVAIDGKTSRRSLDKAGGKA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 143 AIHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKI 202 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMS-----EKSHGREEIRLHI 239 + DY+ A+KGNQG L K + + + E ++D ++ EKSHGR E R Sbjct: 203 ISKEADYILALKGNQGSLRK--DTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVT 260 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 VC D L W GLK + V V + +I+ ++ + RYYISS AE A AI Sbjct: 261 VCTDIDWL-KADHNWPGLKSI-VMVQYHAILQDKTRAE---TRYYISSMTSDAEHHAKAI 315 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K Sbjct: 316 RDHWGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVK-GKHSLRSKRH 374 Query: 360 KAAMDRNYLASVL 372 A+ D ++LA ++ Sbjct: 375 IASWDDDFLAEII 387 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 275 bits (703), Expect = 2e-72, Method: Compositional matrix adjust. Identities = 146/372 (39%), Positives = 221/372 (59%), Gaps = 9/372 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++EH S + D R A ++E+ L I+++T+ AV+ GA++W ++ ++G + +LKQ+ Sbjct: 9 IIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQWIALP 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NG+P HDT V + + P + +CF+NW + + + ++IAIDGKTLR + + Sbjct: 69 NGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGEQCSL 128 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH++SA+++ + LV+GQ D+KSNEITAIPELL +L+++G +++ DAMGCQ IAE I Sbjct: 129 IHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIAETII 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 + GDY+ A+KGNQG L + F + EHDSY EK HGR E R + Sbjct: 189 EGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTYWTMG 248 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEP-EMTVRYYISSADLTAEKFATAIRN 301 D L+ W LK + S R Q P + RYY+ S + A++FA A+R+ Sbjct: 249 QTDYLLGAE-RWAQLKSIGCVESCR----RQPGHPGTLQRRYYLLSIESDAQRFADAVRS 303 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K KA Sbjct: 304 HWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKRLKA 363 Query: 362 AMDRNYLASVLA 373 D NYL +L+ Sbjct: 364 GWDDNYLVKILS 375 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 260 bits (664), Expect = 6e-68, Method: Compositional matrix adjust. Identities = 137/373 (36%), Positives = 215/373 (57%), Gaps = 12/373 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+ ++ I D R +H L +L + I AVI+G++ WED+E++G ++L ++ + Sbjct: 31 LLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLELP 90 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + A Sbjct: 91 HGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQCA 150 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 ++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 151 LYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQIC 210 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELN---NPEHDSYAMSEKSHGREEIRLHIVCD 242 +Q DY+ +K N L ++ F + N EHD Y K H R E R V Sbjct: 211 RQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRY--VWA 268 Query: 243 VPDELIDFTF---EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 +P + + +W GL+ + V R + + + +++Y++S A+ AI Sbjct: 269 IPVAAMGELYQQQQWHGLQTIVVVERIRHLWNKTTHD----IQFYLTSLPPNAQFLCHAI 324 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM+ Sbjct: 325 RTHWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMK 384 Query: 360 KAAMDRNYLASVL 372 +AAM+ NY+ +VL Sbjct: 385 QAAMNNNYMMTVL 397 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 253 bits (645), Expect = 1e-65, Method: Compositional matrix adjust. Identities = 138/380 (36%), Positives = 225/380 (59%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + L+EH I D R + +H+L +L++ + ++ G E++ D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQGQSP 126 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 I +SA++ +SLV+GQI+ K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 127 RVI--VSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLN---KAF-EEKFPLKELNNP-EHDSYAM-----SEKSHGRE 233 I + +Y+ A+KGNQG+ + KA+ E+ + P E ++ A+ +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + L D +W GL+ + V S R + ++ P + RYY+SS ++ E Sbjct: 245 ETRRYWQSGDVSWLADRQ-QWAGLRSVGVVESVRQV---GQQAPTVERRYYLSSLNVDVE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 KFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K G Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 LRRKMRKAAMDRNYLASVLA 373 ++ K A+ D +YL +L+ Sbjct: 361 IKGKQLNASWDHDYLLRLLS 380 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 238 bits (608), Expect = 2e-61, Method: Compositional matrix adjust. Identities = 129/376 (34%), Positives = 220/376 (58%), Gaps = 14/376 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++E+ + + D R+ +H L +L++ + AVI+GA+ I + E H+++LK + Sbjct: 13 ILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSRLELP 72 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINW---MRDCHSSND--KDVIAIDGKTLRHSYDKS 120 +G+P HDTI R+++ + P F +CF W MR +++D +++IAIDGKTLR S+D+ Sbjct: 73 SGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRSHDRG 132 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + G + + SA++ + +GQ+ KSNEI PEL+ +D++ I+T DA GCQ+D+ Sbjct: 133 KGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGCQRDV 192 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN---PEHDSYAMSEKSHGREEIRL 237 AEKI GDY+ A+K NQ RL++ + + N+ + + + K HGR + R Sbjct: 193 AEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRLDKRF 252 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + +PDE + +W+GLK + VA+ I+++ RYYISS A++FA Sbjct: 253 YYQVKLPDE-VPAGEDWRGLKTIGVAIR----ISQENGRETCDTRYYISSLKPDAKQFAA 307 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K ++ + R+ Sbjct: 308 AVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKSKESVVMRR 367 Query: 358 MRKAAMDRNYLASVLA 373 R A + N+LA +L Sbjct: 368 -RMAGWNVNFLAEILG 382 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 236 bits (603), Expect = 8e-61, Method: Compositional matrix adjust. Identities = 131/360 (36%), Positives = 205/360 (56%), Gaps = 10/360 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 + H + D R +H L ++ LT+ A++SGAE W+DI+ FG++ LD+L+++ F+ Sbjct: 3 FITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAFK 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ A Sbjct: 63 EGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGD-RKTA 121 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 122 LHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAIN 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH---DSYAMSEKSHGREEIRLHIVCD 242 +GGDY+ VK NQG+L F + P+ +S ++ HGR E R ++ Sbjct: 182 AKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQLP 241 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 + L + W +K + R + + KE T YYISS ++ + A AIR+H Sbjct: 242 ITPWLTQ-SQGWTNIKPVIEVTRKRYL---KDKETSETA-YYISSLEVNLPQIAKAIRSH 296 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN HW LD+ EDD +IRRG+A E + R A+N L K ++ K+++AA Sbjct: 297 WSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMN-LARLSPIKDSMKGKLKQAA 355 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 228 bits (581), Expect = 3e-58, Method: Compositional matrix adjust. Identities = 139/342 (40%), Positives = 193/342 (56%), Gaps = 13/342 (3%) Query: 24 HKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISP 83 H +L++ I AV+S ++ EDI +G D+L+Q+ NG+ +T R+ + P Sbjct: 28 HDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLVLLNGVASEETFLRIFRALDP 87 Query: 84 AKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 +F F W+ + + +DGKT+R S S AIH++SAF+T +V+GQ Sbjct: 88 KQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGGESAIHMVSAFATELGVVLGQE 144 Query: 144 KTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLN 203 K KSNEITAIPELL L I G ++T DAMGCQK+IA +I QGGDYL AVKGNQ L Sbjct: 145 KVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQITDQGGDYLLAVKGNQPTLL 204 Query: 204 KAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVA 263 A E +F + + + + D + SHGR I I +P E I +W KK+ Sbjct: 205 DAIETEF-IDQYQSDDVDRHRQVHPSHGR--IVAQIASVLPAEGIVDLADWPECKKIARV 261 Query: 264 VSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCK 323 S R + E ++ RYYISS +LTAE+ A A+R HW +EN+LHW LDV ED Sbjct: 262 DSLRKV---GNHESKLERRYYISSRELTAEQLAAAVRAHWGIENRLHWVLDVSFGEDAST 318 Query: 324 IRRGNAAELFSGIRHIAINIL---TNDKVFKAGLRRKMRKAA 362 IR+GNA + S ++ I +N++ T DK K LR K + AA Sbjct: 319 IRKGNAPQNLSLLKKIVLNLIRLDTADKT-KTSLRLKRKCAA 359 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 226 bits (576), Expect = 1e-57, Method: Compositional matrix adjust. Identities = 143/370 (38%), Positives = 204/370 (55%), Gaps = 15/370 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L ++ I D R H+L I+ + +FAV++GA+SW IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGNQGRLNKAFE---EKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 KQ DY+ A+KGNQ L K + E+F E+ + E +H R E R V Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRR--VFQ 251 Query: 243 VPDELIDFT----FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 VP E + FT +W GL+ L V S R + + E RY++SS A FA Sbjct: 252 VPVEQV-FTPKQGRDWAGLRSLVVIQSQRCLWNKDTTE----TRYFLSSLSTDAATFAHY 306 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 IR HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L K Sbjct: 307 IRAHWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSS-KGSLVMKR 365 Query: 359 RKAAMDRNYL 368 +A +D ++ Sbjct: 366 YRAGLDDQFM 375 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 221 bits (562), Expect = 4e-56, Method: Compositional matrix adjust. Identities = 131/344 (38%), Positives = 193/344 (56%), Gaps = 11/344 (3%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + + E++S Y Q +H I+ L + AVISGA SW +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLSDPRAYNQ----KHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGKTLRHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFN-P 115 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +IT DAM QK I Sbjct: 116 ETQSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH-DSYAMSEKSHGREEIRLHI 239 AEKI ++ GDY+ +K N + E F + PE ++Y R + R + Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYR 235 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V D L EWKG+K + RS + KE + V +YISS D+ + A + Sbjct: 236 KLKVSDWLSKAE-EWKGIKSVLEVCRKRS---DNGKESQEKV-FYISSLDVDIQILAKCV 290 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 R HW VENK HW LDVV ED+C + AE + +R +A+N+ Sbjct: 291 RGHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNL 334 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 219 bits (559), Expect = 1e-55, Method: Compositional matrix adjust. Identities = 137/379 (36%), Positives = 197/379 (51%), Gaps = 24/379 (6%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M K L++++ IPD R K H LS ++ + I A++ G ++W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV--IAIDGKTLRHSYD 118 + GIP HDT R+ + + PA F W+ D +DK V +A+DGK LR + Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMG-DDKLVGQLAVDGKALRATA- 118 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R A+H+++ +ST + +GQ K KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 119 KGRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQV 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL----KELNNPEHDSYAMSEKSHGREE 234 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 179 KIADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKE 238 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT-----VRYYISSAD 289 R V V DE + +WK ++IIA Q + E VR+YISS Sbjct: 239 HRRCWVLMV-DESMPVCQQWKA----------KTIIAVQAERIENGKGYDFVRFYISSRA 287 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 L A A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K Sbjct: 288 LDATSALKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKS 347 Query: 350 FKAGLRRKMRKAAMDRNYL 368 + K R ++ YL Sbjct: 348 RNLSMANKRRLCCLNEQYL 366 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 218 bits (555), Expect = 3e-55, Method: Compositional matrix adjust. Identities = 133/379 (35%), Positives = 206/379 (54%), Gaps = 14/379 (3%) Query: 1 MELKKLMEHISI---IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDF 57 M++ KL + + + + D+R A + H+LS +L + + AV+SGA+ +E+I +G + + Sbjct: 1 MDIGKLADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPW 60 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD-VIAIDGKTLRHS 116 L+ + + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + Sbjct: 61 LRGFLRLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRT 120 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K+ +H++SAF+ +V+GQ T +KSNEITAIPELL +LDI+G I+T DAMG Sbjct: 121 TSKAAA-APLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGT 179 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRL--NKAFEEKFPLKELNNPEHDSYAMSEKSHGREE 234 Q IA I+++G Y+ VK N +L + F + P L ++ + HGR E Sbjct: 180 QTKIARAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLT--PSSTHETTSTGHGRIE 237 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 +R D D L WK + V R++ E YYISS AE+ Sbjct: 238 VRRCTAFDATDRLHKAE-AWKDVASFAVVERVRTVGERTSTERV----YYISSLPADAER 292 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A AIR+HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K + Sbjct: 293 IAVAIRSHWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSI 352 Query: 355 RRKMRKAAMDRNYLASVLA 373 + K AA + A++L Sbjct: 353 KTKRLLAATSDEFRAALLG 371 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 218 bits (555), Expect = 3e-55, Method: Compositional matrix adjust. Identities = 120/363 (33%), Positives = 189/363 (52%), Gaps = 9/363 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L +H+S++ D R H L +L L + AV SG + W +I+ FGE L++L+++ F Sbjct: 3 LFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPFA 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP TIAR++ + P C +W+ D +++ K +IAIDGKTLR + Sbjct: 63 NGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLG--CNT 120 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 121 LHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAIV 180 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 + GDY+ VK NQ L +A + ++ + ++ + +A SEK HGR E R I +P Sbjct: 181 ARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIPS 238 Query: 246 ELIDFTFE-WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 +L E W +K L R I + + +Y+SS D+ E ATA+R HW Sbjct: 239 KLSPKLQEKWPSVKTLIAVERHRKI----GNKTSIETSFYLSSHDIDPEYIATAVRGHWR 294 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 295 IENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLLS 354 Query: 365 RNY 367 Y Sbjct: 355 DEY 357 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 218 bits (554), Expect = 4e-55, Method: Compositional matrix adjust. Identities = 138/349 (39%), Positives = 195/349 (55%), Gaps = 17/349 (4%) Query: 24 HKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISP 83 H IL++ I AV+S ++ EDI + T +L+++ +NGIP +T R++ + P Sbjct: 19 HDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLKNGIPSEETFLRILRALDP 78 Query: 84 AKFHECFINWMRDCHSSNDKD-----VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 +F F W+ + D IAIDGKT+R S S AIH++SAF+T L Sbjct: 79 KQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GSGGESAIHMVSAFATELGL 136 Query: 139 VIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 V+GQ K KSNEITAIPELL L IKG ++T DAMGCQK IA++I + GDYL VKGN Sbjct: 137 VLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSIAKQIVAKKGDYLLMVKGN 196 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q +L +A E F + + D + E+ HGR ++ V ++D +W Sbjct: 197 QPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASVLSAKG-IVD-PADWPK-- 251 Query: 259 KLCVAVS-FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVM 317 CV + S+ K+ ++ RYYISS L+AE+ A A+R HW VEN+LHW LDV Sbjct: 252 --CVTIGRIDSMRVVGDKQSDLERRYYISSRALSAEQLAAAVRAHWGVENRLHWILDVSF 309 Query: 318 NEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKAAMD 364 +ED + + NA + S +R IA+ I+ DK K+ LR K + AA D Sbjct: 310 SEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKRKGAAWD 358 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 216 bits (549), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 133/355 (37%), Positives = 192/355 (54%), Gaps = 13/355 (3%) Query: 22 VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+ +LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKEL--NNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + F +L HD + HGR E R V D L + W GL Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTCI---GHGRIEERTCQVADASAWLTEQHSGWAGLAS 237 Query: 260 LCVAVSFRSIIAEQKKEPEMT--VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVM 317 + ++ R+ KK E++ R YISS + A R+HW VEN LHW+LDV Sbjct: 238 IAAVIATRT----DKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTF 293 Query: 318 NEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 ED+C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 294 REDECRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 215 bits (547), Expect = 2e-54, Method: Compositional matrix adjust. Identities = 138/382 (36%), Positives = 206/382 (53%), Gaps = 25/382 (6%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 L+ L+EH S I D R ++ H L ILLL + ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPGR-ADFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD----IKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 IA I+ QG DYL AVK NQ L E F + + + HD +K HGR E R H Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHDL----DKGHGRVEER-H 245 Query: 239 IVCDVPDELIDFTFEWKGLKKL-----CVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 + + + T + G +L V V + IA++ + RY+ISSA LTAE Sbjct: 246 VSVIREVDWLSGTRRFPGEMRLPDVAAIVRVHTTAHIADRTR---TDTRYFISSAPLTAE 302 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL--TND-KVF 350 A A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ ND K Sbjct: 303 HAADAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQKSL 362 Query: 351 KAGLRRKMRKAAMDRNYLASVL 372 K RRKM A +YLAS+L Sbjct: 363 KT--RRKM--AGWSDDYLASLL 380 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust. Identities = 144/374 (38%), Positives = 202/374 (54%), Gaps = 37/374 (9%) Query: 23 EHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCIS 82 +H+ S I+L+ I AVI GA++W IEDFG++ F NGIP HDT R S + Sbjct: 33 KHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSHDTFNRFFSALD 92 Query: 83 PAKFHECFINWMRD---CHSSNDKDVIAIDGKTLRHSYD-----KSRRRGAI-------- 126 P KF E + W++ C+S + IAIDGKT+R +Y+ + R++G + Sbjct: 93 PLKFEESYRQWVQSILKCYSGH----IAIDGKTIRGAYESEQDKRHRKQGVLPDSNTGKY 148 Query: 127 --HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 HVISAF+T + +GQ+ T +K NEI IPELL+ML IK IIT DA+GCQ+ IAEK+ Sbjct: 149 KLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDALGCQRTIAEKV 208 Query: 185 QKQGGDYLFAVKGNQGRLNK---AFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 K GDY+F VK NQ +L + + E K D Y E+ HGR E R+ C Sbjct: 209 IKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKG-TTVRFDKYETHEEGHGRNESRICYCC 267 Query: 242 DVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV--RYYISSADLTAEKFATA 298 + P L D +WK ++ SF I + TV R +ISS + A+K Sbjct: 268 NDPGFLGADIRKKWKNIQ------SFGYIENTRNTNKGTTVEKRCFISSLEPDAQKILKN 321 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R HW +EN LHW+LDV +ED+ + RR +A FS + IA+ L N+K + + RK Sbjct: 322 SREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLRNNKR-EIPINRKR 379 Query: 359 RKAAMDRNYLASVL 372 A D +L ++ Sbjct: 380 LIAGWDNEFLWELI 393 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 211 bits (537), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 142/379 (37%), Positives = 204/379 (53%), Gaps = 21/379 (5%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK-QYGDFENGIPVH 71 I D R K HK+ I+ ++I AVI GA+SW +IE+FG + F K + D E IP H Sbjct: 12 IEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPDLE-FIPSH 70 Query: 72 DTIARVVSCISPAKFHECFINWMRD-CHSSNDKDVIAIDGKTLRHS------YDKSRRRG 124 DT R S I P F F NW++ C K V+AIDGK +R + + Sbjct: 71 DTFNRFFSIIKPEYFELIFRNWVKQVCQEV--KGVVAIDGKLMRGPSQCDGEHTTGKEGF 128 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 + ++SA+S ++ + +GQ+K D KSNEITAIP L+N L++ G I+T DAMGCQKDI + I Sbjct: 129 KLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDITQTI 188 Query: 185 QKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 ++ +Y+ A+K N+ + L K + + ++ + HGR E R V Sbjct: 189 IERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEKRTCTVV 248 Query: 242 DVPDELIDFTFEWK--GLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFATA 298 +++ F+ K GLK + S R+I+A + E VRYY++S D T E+ A+A Sbjct: 249 SY-GSIMEKMFKKKLVGLKSIVGIKSERTIVATGEYTQE--VRYYVTSLDNTKPEEIASA 305 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 IR HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K + K Sbjct: 306 IRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGSMNLKR 364 Query: 359 RKAAMDRNYLASVLAGSGL 377 KA D YL+ +L + Sbjct: 365 LKAGWDEKYLSQLLQNNNF 383 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 211 bits (536), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 131/374 (35%), Positives = 205/374 (54%), Gaps = 19/374 (5%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + +E ++ I D+R + ++L ILL++ AVI +++ ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV------IAIDGKTLRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGKT+ S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S + A HV++AF++ LV+GQIKTD+KSNEITAIPELL + +K ++T DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNK--AFEEKFPLKELNNPE---HDSYAMS-EKSHG 231 K+IA KI ++GGDY+ AVKGNQ +L + L++ + E YA++ EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REEIRLHIVCDVPDELIDFTF--EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 R E R C + ++L F +W+G+ + + R + + K + S + Sbjct: 241 RIEKR---ECYLSNDLSWFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKE 297 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 A+ R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L Sbjct: 298 AQAKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDT 357 Query: 350 FKAGLRRKMRKAAM 363 K G+R K + + Sbjct: 358 CKCGMRSKRKLCGL 371 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 211 bits (536), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 129/378 (34%), Positives = 197/378 (52%), Gaps = 16/378 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L E I D+R H L+ IL++ A++ G + +E FG +L+ + Sbjct: 16 LREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLALP 75 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINW----MRDCHS----SNDKDVIAIDGKTLRHSY 117 NGIP HDT +V S + P +F E F W +R S S K VIAIDGK LR + Sbjct: 76 NGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRGAV 135 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 DK + I + A+++ SL +GQ+K KSNEI A+PELL ML +KG I+T DAMGCQ Sbjct: 136 DKGQAPAVI--VGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMGCQ 193 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE-LNNPEHDSYAMSEKSHGREEIR 236 +++A KI +Q GDY+ A+K NQ L++ E L E + + HGR E+R Sbjct: 194 REVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHEVR 253 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 V + + + +W GL+ + R++ + + RY+ISS A A Sbjct: 254 RCWVSEEVECWLQGAEKWAGLRSVAAVECERTVAGQTTVQR----RYFISSLKADAALIA 309 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLR 355 ++R HW +EN LHW LDV ED+ + RRG +AE + +R + ++ + K + Sbjct: 310 ASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKSVN 369 Query: 356 RKMRKAAMDRNYLASVLA 373 ++ +A + +YL ++L Sbjct: 370 QRRFEAGLSTDYLQTLLG 387 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 210 bits (535), Expect = 5e-53, Method: Compositional matrix adjust. Identities = 117/341 (34%), Positives = 178/341 (52%), Gaps = 12/341 (3%) Query: 38 ISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 ++ AESWEDIE +G + +L+ + NGIP HDT RV + F CF ++ Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 LL L + G I+T DAMGCQ IAE+I+ +G D L +K N G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 -----PEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 P D++ + HGR +R + D + W L ++ + R I Sbjct: 184 GAAGRPVFDAF----EGHGR-LVRRRVFVDAAATALAPLSGWPDLSRVLAVETLRGIPGT 238 Query: 273 QKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 + +RY+++S IR HW VEN LHW L+V EDD ++R AA Sbjct: 239 GTVVAD--IRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARN 296 Query: 333 FSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 F+ +R IA+N++ D+ +A LR + +KAA D +Y+ ++A Sbjct: 297 FALVRKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIA 337 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 210 bits (535), Expect = 6e-53, Method: Compositional matrix adjust. Identities = 127/374 (33%), Positives = 187/374 (50%), Gaps = 11/374 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 ++ L + I D RQA KV H++ +L++ + + ES+ D+ DF ++ L +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D + IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDLEGRH----IAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI +KSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPEHDSYAMSEKSHGREEIRLHI 239 +I G DY+ A+K N R ++ + F E L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ VA R + P V Y++ S E+ A + Sbjct: 237 ITEELD-WYHKSWKWAGLQS--VAQVRRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHPA-KVSLRRKRK 352 Query: 360 KAAMDRNYLASVLA 373 A MD + +L Sbjct: 353 LATMDPAFRLQMLG 366 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 207 bits (526), Expect = 8e-52, Method: Compositional matrix adjust. Identities = 115/371 (30%), Positives = 194/371 (52%), Gaps = 10/371 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+EH++++ + R +H L ++ L I A++SGAE W DIE +G++ +D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIR--LHIVCDV 243 ++ + VK NQ +L +A + +F E E HGR+E R + + Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 P EL T +W ++ + RS + + YY+SS + IR HW Sbjct: 248 PPEL---TEKWPTIRSIIAVERHRSA----NGKGTVDTSYYVSSLSPKHKLLGHYIRQHW 300 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN H+ LDVV NED +I +A E + R +NI+ R K+++A Sbjct: 301 RIENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGW 360 Query: 364 DRNYLASVLAG 374 + +Y A + G Sbjct: 361 NDDYRAQLFFG 371 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 206 bits (525), Expect = 9e-52, Method: Compositional matrix adjust. Identities = 123/366 (33%), Positives = 190/366 (51%), Gaps = 11/366 (3%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 I D R + + L ILL+T+ A+I G ++W+ I DFG+ +L Q+ + G+P Sbjct: 23 IKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVNMRCGVPSTL 82 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T ARV S I P +F C WM D+I +DGK+L S + + + A H+++A+ Sbjct: 83 TFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQKATHIVNAY 142 Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 + +G+++ KSNEI AIP LLN L+++G II+ DAMG QK IA I+ + DY+ Sbjct: 143 LPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANLIRLKQADYV 202 Query: 193 FAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEK---SHGREEIRLHIVCDVPDELI- 248 A+K N R + E F + + + Y E HGR E R + C +P Sbjct: 203 LALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSY--CVLPMMYFH 260 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA-EKFATAIRNHWHVEN 307 + W+ L+ + S R + E E RYYI+S + + AIR HW +EN Sbjct: 261 KYKKYWRDLQAIVRVQSKR----HKGNEIETATRYYITSLPFAEHRRMSQAIRQHWAIEN 316 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNY 367 +LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K +AA+ Y Sbjct: 317 QLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRIQAALSTRY 376 Query: 368 LASVLA 373 L V+ Sbjct: 377 LRKVVG 382 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 204 bits (520), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 131/353 (37%), Positives = 196/353 (55%), Gaps = 29/353 (8%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 + D R+ WK++H LS I+LL FA +SGAE W++IE FG+ + LK ENGIP HD Sbjct: 16 VKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVLQLENGIPSHD 75 Query: 73 TIARVVSCISPAKFHECFINWM-----RDCHSSN----DKDVIAIDGKTLRHSYDKSRRR 123 T+ RV + + P E W D S N K ++AIDGKT+R + S ++ Sbjct: 76 TLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTIRG--NGSAKQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 A+H+++A++T + GQ+ T++KSNEITAIPELL+M+ +KG +++ DAMG QK IA+K Sbjct: 134 KALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDAMGTQKAIADK 193 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDV 243 I K+ DY AVK NQ L E+ P E++ D Y EK+HG+ E R + V Sbjct: 194 IIKKKADYCLAVKENQKTL---LEDIVPFFEMSQEADDHYHTVEKAHGQIETRAYEVIHD 250 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 L E+ ++ + A R + + +E E + RY+I S ++A++ +R HW Sbjct: 251 VSWLRKTHPEFGHIQSIGRA---RIHLDKNGQESEES-RYFILSCQVSAKELCDYVRGHW 306 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 +E+ +HW LDVV ED K + +A N+ DK A L++ Sbjct: 307 QIES-MHWLLDVVFREDANKTLN----------KQLAFNLNVMDKFCLAVLKQ 348 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 204 bits (518), Expect = 6e-51, Method: Compositional matrix adjust. Identities = 118/373 (31%), Positives = 192/373 (51%), Gaps = 11/373 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 +++H+ I D R EH + I L + AVISGA+SW +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLK-GAKASASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC--DV 243 K+GGD + VKGNQ +L +A + +F NNP+ + + + K HGR E R+ C ++ Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 P E+ +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 PAEI---KMKWSQLKTLIAVERHRKV----GNKTSIDTHFYVSSAVLTSEAFGRAIRAHW 292 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 293 QTENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHPA-KTSQTQKFNRACW 351 Query: 364 DRNYLASVLAGSG 376 ++ ++ G+G Sbjct: 352 SDDFREEIIFGTG 364 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 202 bits (515), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 131/368 (35%), Positives = 200/368 (54%), Gaps = 26/368 (7%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L IL++ +FA ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + + K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ +KSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN--PEHDSYAMS-EKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E E Y + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK---KEPEMTV--RYYISSADLT 291 + + + + WKGLK SII E+K KE + + RY+ISS Sbjct: 239 EYYQTE-KIKWLSQKKAWKGLK---------SIIMERKTLEKEGKRLIEYRYFISSLKEE 288 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-- 349 E + A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V Sbjct: 289 IETVSRAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSR 347 Query: 350 FKAGLRRK 357 K +R+K Sbjct: 348 HKLSMRKK 355 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 197 bits (502), Expect = 5e-49, Method: Compositional matrix adjust. Identities = 126/375 (33%), Positives = 194/375 (51%), Gaps = 13/375 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY--GD 63 L+E S +PD R+ + L+ IL++ + A++ GA++W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVID-GVVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPEHDSYAMSEKSHGREEIRLHI- 239 I +GGDY+ VK NQ L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I + + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLA 373 A + +Y S++A Sbjct: 357 LHADRNESYRESLIA 371 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 197 bits (500), Expect = 7e-49, Method: Compositional matrix adjust. Identities = 124/373 (33%), Positives = 191/373 (51%), Gaps = 21/373 (5%) Query: 13 IPDYRQ--AWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPV 70 +PD R A K+ H L+ IL + AVI+GAE WEDI ++G + F +++ + +NG+P Sbjct: 12 LPDPRTETANKI-HTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLELKNGVPS 70 Query: 71 HDTIARVVSCISPAKFHECFINWMRD-CHSSN--DKDV-----IAIDGKTLRHSYDKSRR 122 HDT RV + + P F + F W + C ++ D+ +A+DGK+ R S K Sbjct: 71 HDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRSA-KPTF 129 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 G +H++ + +L++GQ + +EIT ++L LD+ G ++T DA GCQ + E Sbjct: 130 SGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGCQTETLE 189 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFP-LKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 I+ +GG+Y+ VKGNQ L A F E D + +HGR E R V Sbjct: 190 VIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEERNVTVV 249 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 PD L W G+ + + R + + K E T YY+SS + A + A IR Sbjct: 250 HDPDGL---PAGWAGVGSVALVCRDRQV---KGKANESTAHYYLSSLRVGAAELAGYIRG 303 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HWH+E+ +HW LDV ED+ + R G+A IR +A+++L K + + +A Sbjct: 304 HWHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAGK-KGSIHTRRLRA 361 Query: 362 AMDRNYLASVLAG 374 D Y+A VL G Sbjct: 362 GWDDQYMAQVLQG 374 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 195 bits (495), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 134/363 (36%), Positives = 185/363 (50%), Gaps = 19/363 (5%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPV 70 +IIPD R ++ + I+ + + AVI GA++W +IE FG+TH + K IP Sbjct: 8 AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKARLPGLVSIPS 67 Query: 71 HDTIARVVSCISPAKFHECFINWMRD-CHSSNDKDVIAIDGKTLRHSYDKSRR-----RG 124 HDT++R S + F ECF W+ D C V+AIDGK + + DKS R Sbjct: 68 HDTLSRFFSILDIDWFEECFRLWVDDICRRI--PGVVAIDGKAICDNPDKSSNSKNGVRS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++++SA+S + + +GQ K ++KSNE AIPEL+ LD++ IIT DA+GCQK I + I Sbjct: 126 KLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIGCQKSITKLI 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN---PEHDSYAMSEKSHGREEIRLHIVC 241 + DY+ K N L E F L E + Y K HGR E R VC Sbjct: 186 IENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKRYFEENKGHGRSEYR-ECVC 242 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 L F W G+K L + S R + KE M RYYISS + +IR Sbjct: 243 ISAKNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDPIIILKSIRP 299 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW VEN LHW LD+ EDD + + GNAA FS I +A+ +L + K G+ K + Sbjct: 300 HWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQSDI-KLGMAGKRKAC 357 Query: 362 AMD 364 D Sbjct: 358 GWD 360 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 192 bits (488), Expect = 2e-47, Method: Compositional matrix adjust. Identities = 102/196 (52%), Positives = 133/196 (67%), Gaps = 13/196 (6%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M L L +H + + D RQA KV +KL +L L + AVISGAE WE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TD+KSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVK 196 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVK 183 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 190 bits (483), Expect = 6e-47, Method: Compositional matrix adjust. Identities = 117/373 (31%), Positives = 185/373 (49%), Gaps = 26/373 (6%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R H L +L + + A I GAES D F +++ + G+P HD Sbjct: 12 LPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLPSHD 71 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T +RV + P F CF ++ D + V+AIDGKTLR S+D++ R A+HV+SAF Sbjct: 72 TFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVVSAF 130 Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 ++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GGD+L Sbjct: 131 ASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGGDWL 190 Query: 193 FAVKGNQGRLNKAFEEKF--PLKELNNPEHDSYAMSEKSHGREEIRLHIVC-DV------ 243 F +K N+ L E F P L P + ++ HGR E+R H V DV Sbjct: 191 FPLKDNRPALRAEVERYFADPATVLAVP----HVTTDADHGRIEVRRHWVSHDVAWLASD 246 Query: 244 ---PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 PDE + GLK L + + T Y+SSA L + A A+R Sbjct: 247 RRFPDEAV-----LPGLKILGL---VERTVTSPDGRTTATRTLYLSSAALEPKTLARAVR 298 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E +HW LD +ED + R+ + E + +R +A+N++ + + +R + ++ Sbjct: 299 AHWSIEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANN-QDSIRLRRKR 357 Query: 361 AAMDRNYLASVLA 373 A +Y ++L Sbjct: 358 AGWSDDYARTILG 370 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 186 bits (472), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 110/307 (35%), Positives = 163/307 (53%), Gaps = 7/307 (2%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +L + E +PD R + H LS +L + + AV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD-VIAIDGKTLRHSYDKS 120 + G+P HDT RV++ I PA F F+ W+ + D V+AIDGKT R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 G +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Q I Sbjct: 125 TS-GPLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A I+ +G DY+ VK N L + + K HGR E+R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D +L + +W GL+ + R++ + E YYISS A + A A+R Sbjct: 244 YDAVSQLYK-SEQWAGLQSFALVERERTVDGKTSVE----RHYYISSLPADAARIAQAVR 298 Query: 301 NHWHVEN 307 +HW VE+ Sbjct: 299 SHWAVES 305 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 181 bits (458), Expect = 5e-44, Method: Compositional matrix adjust. Identities = 89/207 (42%), Positives = 135/207 (65%), Gaps = 1/207 (0%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+SW ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + ++ Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF 210 I K+ DY+ AVK NQ +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 181 bits (458), Expect = 5e-44, Method: Compositional matrix adjust. Identities = 105/260 (40%), Positives = 151/260 (58%), Gaps = 5/260 (1%) Query: 12 IIPDYRQAWKVE-HKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPV 70 +IPD R+A + H LS IL + + AV+SG + WE + +FG T +L+Q+ NGIP Sbjct: 20 LIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANGIPS 79 Query: 71 HDTIARVVSCISPAKFHECFINWMRDCHSSNDK-DVIAIDGKTLRHSYDKSRRRGAIHVI 129 HDT RV S I P F F +W D D +A+DGKT+R S+ S R A+H++ Sbjct: 80 HDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-ALHLL 138 Query: 130 SAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189 A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK +A +I + GG Sbjct: 139 HAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITEAGG 198 Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELID 249 DY+ A+KGNQ L+ + +P+ + A+ EK HGR E R V D D L Sbjct: 199 DYVLALKGNQSALHDDVRLFMETQADRHPQGQAEAV-EKDHGRIETRRIWVNDEIDWLTQ 257 Query: 250 FTFEWKGLKKLCVAVSFRSI 269 +W GLK L + S R + Sbjct: 258 KP-DWPGLKTLVMVESRREL 276 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 179 bits (453), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 112/374 (29%), Positives = 197/374 (52%), Gaps = 17/374 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L++H+ II D R ++H L ++ LT+ A++SGA W+ IE FG LD+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + +KK +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKS---HGREEIR--LHIV 240 + D++ +KGNQ L A + F ++P + A+SE++ HGR+E R + I Sbjct: 182 SKKSDFVIQIKGNQPALLAAVKAAF-AACYDSP---ALAISEQTNTGHGRKECRRVMQIE 237 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 ++P EL + +W ++ L S R++ + + R+Y+SS + + A IR Sbjct: 238 GNLPPELSE---KWPHIRTLVEVASERTV----GNKTACSSRWYVSSLPVDTAQLADIIR 290 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN+LHW LDVV ED+ + + A+ + A++++ + K L K + Sbjct: 291 AHWAIENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQS 350 Query: 361 AAMDRNYLASVLAG 374 AA D + + +L G Sbjct: 351 AAWDPAFRSELLFG 364 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 176 bits (446), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 84/99 (84%), Positives = 90/99 (90%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 IAINILTN++VFKA RRKMRKA MD+NYLASVLAG+G Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 174 bits (440), Expect = 6e-42, Method: Compositional matrix adjust. Identities = 122/371 (32%), Positives = 191/371 (51%), Gaps = 18/371 (4%) Query: 15 DYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQ+WK+ + LS IL L ++G E+ +++EDF E + Y D G P HDT+ Sbjct: 19 DSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPSHDTL 78 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSND-KDVIAIDGKTLRHSYDKSRRRGAIHVISAFS 133 RV+S ++ + E + + + S + +I++DGKT+R + K+++ +H+++A+ Sbjct: 79 ERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRGNRGKNQK--PVHIVTAYD 136 Query: 134 TMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF 193 H L +GQ+ ++KSNEI AIP+LL +DI+ I+T DAMG Q I + I K DY Sbjct: 137 GGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCL 196 Query: 194 AVKGNQGRLNKAFEEKFP----LKELN-NPEHDSYAMSEKSHGREEIRLHIVCDVPDELI 248 AVKGNQ L F L+EL N ++ Y EKS G+ E+R + V L Sbjct: 197 AVKGNQETLYDDIALYFSDVNLLEELQENAQY--YQTVEKSRGQIEVREYWVSSDIKWLC 254 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 +W L+ + + R+ I ++ + RY+I S FA +R HW +E+ Sbjct: 255 QNHPKWHKLRGIGMT---RNTI-DKDGQLSQENRYFIFSFKPDVLTFANCVRGHWQIES- 309 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL--RRKMRKAAMD-R 365 +HW LDVV +ED + AA + IR + + L K L RRK R ++ Sbjct: 310 MHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKKDLSYRRKQRYISVHLE 369 Query: 366 NYLASVLAGSG 376 +YL + G Sbjct: 370 DYLVQLFGERG 380 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 172 bits (436), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 109/371 (29%), Positives = 178/371 (47%), Gaps = 19/371 (5%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + E +PD R A H L+ IL + + A + GA S D+ F + Sbjct: 5 MDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDVL 63 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMR----DCHSSNDKDVIAIDGKTLRHSYD 118 +NG+P HDT +RV + P F + F +M+ K VIA+DGK LR Y+ Sbjct: 64 VLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGYE 123 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 R +++A++ + + ++ +NE +L+ +L +KG ++T DA+ C + Sbjct: 124 SGRSHMPPVMVTAWAAQTRMALANVQA-PNNNEAAGALQLIELLQLKGCVVTADALHCHR 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 +AE I+ +GGDY+ AVK NQ L + + K ++ S + HGR+E R Sbjct: 183 GMAEAIKARGGDYVLAVKDNQPALMR--DAKAAIRAATRQGKPSTITVDAGHGRKEKRRA 240 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV-RYYISSADLTAEKFAT 297 +V VP D F GLK + S K+ + TV RY++ S + Sbjct: 241 VVAAVPQMAQDHDFA--GLKAVARITS--------KRGTDKTVERYFLMSQAYPPKDVLR 290 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN LHW LDVV++ED + R+ NA + +R +A+N+ LR K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 MRKAAMDRNYL 368 +++A + +L Sbjct: 351 LKRAGWNDTFL 361 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 169 bits (429), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 134/410 (32%), Positives = 197/410 (48%), Gaps = 44/410 (10%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V +SW DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINW----------MRDCHSS-------NDKDV 105 P HDT+ R I + C+ W + DC S ND Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 106 ---IAIDGKTL----------RHSYDKSRRRGA----IHVISAFSTMHSLVIGQIKTDKK 148 IAIDGKT+ + S K + A +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDIK-GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP+LL+ +DI+ G ++T DA+G QK I EKI ++ DYL VK N +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPEHDSYAMSEKS---HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 ++ E+D +E++ HG R I C P L +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIK 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + IA + + E +ISS E R HW VEN LHW+LDV NEDD + Sbjct: 317 TEKINIATGEIQNEKHC--FISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAELFSGIRHIAINILTN--DKVFKAGLRRKMRKAAMDRNYLASVL 372 + N+A+ FS + +A+ IL N D+ K + RK +KA YLA+++ Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLI 423 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 168 bits (425), Expect = 4e-40, Method: Compositional matrix adjust. Identities = 112/365 (30%), Positives = 181/365 (49%), Gaps = 19/365 (5%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R A V H L +L++ +V+ G+ S ++ FG F + + ++ IP HD Sbjct: 22 VPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRNFLKLKHAIPSHD 80 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSS-NDKDVIAIDGKTLRHSYDKSRRRGAIHVISA 131 T + V I P F + D D D+IAIDGK LR + D ++SA Sbjct: 81 TFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDPGESARTRMMVSA 140 Query: 132 FSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDY 191 +++ L + + D+ E++A E L ++D++GK++T DA+ C + I GGD+ Sbjct: 141 YASRLRLTLATVPADR-GTELSAAIEALGLIDLRGKVVTGDALHCNRRTVAAINAGGGDW 199 Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKS-HGREEIRLHIVCDVPDELIDF 250 A+KGNQ L F ++P A++E + HGR+E R +V V + + Sbjct: 200 CLALKGNQESLLSDARGCFSKGHKSDPT----AVTENTGHGRKETRKAVV--VSAKALAE 253 Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKEPEMT--VRYYISSADLTAEKFATAIRNHWHVENK 308 E+ GLK F I A ++ ++T RY+ S T E A+R+HW +EN Sbjct: 254 YHEFPGLK------GFGRIEATRETGGKVTSETRYFALSWVPTPEVLLAAVRDHWAIENA 307 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 LHW+LDV ED + R+ N + +R A+++L D K L K+++A D +L Sbjct: 308 LHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIKRAGWDTTFL 366 Query: 369 ASVLA 373 S+L+ Sbjct: 367 RSILS 371 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 167 bits (424), Expect = 4e-40, Method: Compositional matrix adjust. Identities = 89/233 (38%), Positives = 140/233 (60%), Gaps = 3/233 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L++H + D R +HKL I+++ + A+I GA+S+ +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K +++ EITAIP L+ +L++ G ++T DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPEHDSYAMSEKSHGREEI 235 +G DY A+KGNQ L + +E F E EH + EK R E+ Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEV 241 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 166 bits (421), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 91/240 (37%), Positives = 139/240 (57%), Gaps = 8/240 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L IL++ +FA ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + + K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ +KSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN--PEHDSYAMS-EKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E E Y + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 166 bits (421), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 94/252 (37%), Positives = 145/252 (57%), Gaps = 7/252 (2%) Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++H+++A+ + +L++GQ+K D KSNEITAIP+LL ML ++G I+T DAMGCQK IA++I Sbjct: 2 SLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQI 61 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP-EHDSYAMSEKSHGREEIRLHIVCDV 243 + DY+ AVK NQ L + + F ++N H + + HGR E R + V Sbjct: 62 GSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYSTI-V 120 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 D+L+ W L + + S R + E RY+I S + A++F A+R HW Sbjct: 121 GDDLLAGITGWDNLNAIGMVESKREVGNTISNEK----RYFIMSINGHAQRFGDAVREHW 176 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK + A Sbjct: 177 GIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRKMAGW 235 Query: 364 DRNYLASVLAGS 375 D ++L VL G+ Sbjct: 236 DNSFLIKVLTGN 247 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 165 bits (417), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 126/383 (32%), Positives = 185/383 (48%), Gaps = 43/383 (11%) Query: 30 LLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V +SW DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINW----------MRDCHSS-------NDKDV---IAIDGKTL----------RHSYDK 119 + W + DC S ND IAIDGKT+ + S K Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 120 SRRRGA----IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK-GKIITTDAM 174 + A +H++SAF + SL +GQ + K NEI AIP+LL+ +DI+ G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKS---HG 231 G QK I EKI ++ DYL VK N +L + E ++ E+D +E++ HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA + + E +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIATGEIQNEKHC--FISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN--DKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL N D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVL 372 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLI 380 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 165 bits (417), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 102/343 (29%), Positives = 168/343 (48%), Gaps = 21/343 (6%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R +H L IL + + AV+ GA ++E F + LD L+Q+ E G P HD Sbjct: 10 VPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLERGAPSHD 68 Query: 73 TIARVVSCISPAKFHECFINWM----RDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 T +RV++ + P +E F+ +M K +A+DGK+LR +Y K R V Sbjct: 69 TFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGRSHMPPLV 128 Query: 129 ISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 ++ F + + Q ++ E+ A L +L +KG +T DA+ C + + + ++ G Sbjct: 129 VTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMTKTVRDGG 187 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEK-SHGREEIRLHIVCDVPDEL 247 G Y+ A+KGNQ +L A E L + + + +E+ +HGR E+R V Sbjct: 188 GHYVIAIKGNQSKL--AAEANTALDKAAAGKATKFHQTEEDAHGRHEVRRAFVIPFAQ-- 243 Query: 248 IDFTFEWKGLKKLCV---AVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 T L LC S+R++ + + VR Y S + A + +R HW Sbjct: 244 ---TPGKNALVDLCAIGRVESWRTVEGKTTHK----VRCYALSRKMPAHELLATVRRHWS 296 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 +EN LHW+LDV++ ED + R+ N A + +R + +N+L D Sbjct: 297 IENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRAD 339 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 164 bits (416), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 110/351 (31%), Positives = 166/351 (47%), Gaps = 46/351 (13%) Query: 15 DYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQ KV H+ I++ + V + +SW ++ DF +DF++++ P HDT+ Sbjct: 29 DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFFPDIQKAPSHDTL 88 Query: 75 ARVVSCISPAKFHECFINW---MRDCHSSNDKD-----------------VIAIDGKTLR 114 R + P + W MR+ +++ ++ IAIDGKT++ Sbjct: 89 RRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKPFRQIAIDGKTIK 148 Query: 115 HSYDKSRRRGA--------------IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 + ++ RRR +H++SAFS L +GQ + DKK NEI AIP LL+ Sbjct: 149 KAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKKENEIVAIPRLLD 208 Query: 161 MLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL------NKAFEEKFPLK 213 LDI +G ++T DAMG QKDI +I K+ YL VK NQ L N E+ PL Sbjct: 209 DLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIAGNMRDFERIPLP 268 Query: 214 ELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 N + + E HG +R VC L +W+ L+ + + R + E Sbjct: 269 ---NEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIRTER--VDEA 323 Query: 274 KKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 324 TGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 160 bits (404), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 81/195 (41%), Positives = 121/195 (62%), Gaps = 2/195 (1%) Query: 94 MRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 ELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + D+ + EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 121 RRAPIDRDTCQI-EKQKGRVEARTYHVLSASDLIRDFS-TWSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKEPEMTVRYYISSA 288 + + + + + S+ Sbjct: 179 RARVGVPLLHKVQSS 193 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 159 bits (402), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 105/345 (30%), Positives = 182/345 (52%), Gaps = 16/345 (4%) Query: 10 ISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG-- 67 I+++ D R ++++ L ILL++++A ISG + WE IED+ H + L+ +G Sbjct: 9 IAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGKE 68 Query: 68 -----IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 +P HDT V I P +F E + ++ + + IAIDGKT R ++ Sbjct: 69 LKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPR-GIKQTAN 127 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G ++ E Sbjct: 128 SHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVIE 187 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIR-LHIVC 241 I +GG+++ VKGNQ +L + E++F N D+ + HGR E R ++ + Sbjct: 188 MILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSADT--QEDIGHGRVEKRTVYCIT 245 Query: 242 DV-PDELIDFTFE-WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 ++ D+ ID + WKG+K L V R + + K + YYI++ + ++ AI Sbjct: 246 EIKTDDDIDGCMQKWKGVKTLVKIV--REVYKKADKSTRIETVYYITNL-IDPKEINRAI 302 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 R HW +EN LH LDV++NED + N E F + +A+ I+ Sbjct: 303 RAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFII 347 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 157 bits (397), Expect = 6e-37, Method: Compositional matrix adjust. Identities = 109/366 (29%), Positives = 173/366 (47%), Gaps = 19/366 (5%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R A H L +L++ +V+ GA S ++ FG + + ++ +P HD Sbjct: 44 VPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLKHAVPSHD 102 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSS-NDKDVIAIDGKTLRHSYDKSRRRGAIHVISA 131 T + V I P F + D + D DVIA+DGK LR + D ++SA Sbjct: 103 TFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGRTRMMVSA 162 Query: 132 FSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDY 191 ++ L + + D+ + E+ A E L ++ +KGK++T DA+ C + I GGD+ Sbjct: 163 YAARLRLTLASVPADRGT-ELEAAIEALGLIALKGKVVTADALHCNRRTVAAINAGGGDW 221 Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEK-SHGREEIRLHIVCDVPDELIDF 250 A+K NQ L F + P+ A+SE HGR E R V V + + Sbjct: 222 CLALKANQDSLLSDARASFGAE----PDAHPSALSEDIGHGRTETRKATV--VSSKALAE 275 Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKEPEMT--VRYYISSADLTAEKFATAIRNHWHVENK 308 E+ GLK +F + A +K T RY+ S T E +R HW +EN Sbjct: 276 HHEFPGLK------AFGRVEATRKTAEGTTSETRYFALSWVPTPEVLLATVRAHWAIENS 329 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 LHW+LDV ED + R+ N+ + +R A++++ D K L K+++A D ++L Sbjct: 330 LHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWDDDFL 388 Query: 369 ASVLAG 374 +VL G Sbjct: 389 RNVLNG 394 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 154 bits (390), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 88/278 (31%), Positives = 143/278 (51%), Gaps = 17/278 (6%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 ++ L+E + + D R K+EH+L IL++ + AV++ AE++EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD------VIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + D IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN-----PEHDSYAMSEKSHG 231 ++++A+ I +G YL +K NQ +++ F + P D++ + +HG Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAF---DDTHG 238 Query: 232 REEIRLHIVCDVPDELIDFTFE-WKGLKKLCVAVSFRS 268 R R C PD T W GL + + + R+ Sbjct: 239 RLVRRRVFAC--PDAGCFTTLRGWPGLTTVLASETIRA 274 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 74/88 (84%), Positives = 77/88 (87%) Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 EQKKEPEMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAA Sbjct: 18 TEQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAA 77 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKM 358 ELFSGIR IAINILT DK+ KAG R KM Sbjct: 78 ELFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 151 bits (382), Expect = 4e-35, Method: Compositional matrix adjust. Identities = 112/350 (32%), Positives = 173/350 (49%), Gaps = 16/350 (4%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+A K + HKLS I++L I +S S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----NDKDVIAIDGKTLRH 115 NGIP T+ R+ I + H +++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + ++KSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 QKDI +KI+++ GD++ +K NQ L E+K +KEL +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDK--IKEL-SPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ + Sbjct: 327 GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVFS 375 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 149 bits (377), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 126/374 (33%), Positives = 185/374 (49%), Gaps = 31/374 (8%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L E + ++P R K + L +LL+ + +SG SW +IED+ E + + LK + Sbjct: 5 LFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEML 64 Query: 66 NG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRH---- 115 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 65 TGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRGVKKL 124 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 S+D HV+SAFS + Q+ D+K+NEI AI +LL++LD+ G +++ DA+G Sbjct: 125 SFDTQS-----HVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIG 179 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PL--KELNNPEHDSYAMSEKSHGR 232 Q I E+I +GGDY+ VK NQ + E F PL K + E +E SHGR Sbjct: 180 TQTAIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLDEQ-----TELSHGR 234 Query: 233 EEIRLH--IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISS-AD 289 E R + I+ + E + KGL+ + V R K E V YYISS D Sbjct: 235 IETRRYESILNPLEIEANEVLTRRKGLRSIHKVVRKRRDKKSDKTSEE--VAYYISSLTD 292 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +++ K AIR HW +ENKLH LDV D R N A++ I+ I + I+ K Sbjct: 293 VSSLK--QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT 350 Query: 350 -FKAGLRRKMRKAA 362 K+ + R +K A Sbjct: 351 NMKSSIPRIQKKPA 364 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 148 bits (374), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 75/165 (45%), Positives = 105/165 (63%), Gaps = 3/165 (1%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R+ + H+L +LL I VISGAESW + + + LD+L+ Y + +GI HD Sbjct: 15 LPDPRRR-ECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPYAHGIASHD 73 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T RV S + ++F CF+ W+ S + +AIDGK LR S+D + R IH++SA+ Sbjct: 74 TFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGA--RSPIHLVSAW 131 Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S+ +L +GQ++T KSNEITAIPELL LDI+G IT DAMGC Sbjct: 132 SSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCH 176 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 145 bits (365), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 84/197 (42%), Positives = 113/197 (57%), Gaps = 8/197 (4%) Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 S +LV+GQ K + KSNEITAIP L+ ML+I+ IIT DAMGCQK+I I+K+ GDY+ Sbjct: 28 SLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYI 87 Query: 193 FAVKGNQGRLNKAFEEKFPL---KELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD-ELI 248 +K NQ L + +E F + +E + EH Y E H R E R I V + Sbjct: 88 ITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSSLPCL 147 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W LK + + S R + + E VR+YISS + ++K ATAIR+HW +EN Sbjct: 148 HNQDLWTELKTVVMVKSERRLWNKTTTE----VRFYISSVEKNSQKIATAIRSHWEIENS 203 Query: 309 LHWRLDVVMNEDDCKIR 325 LHW LDV +ED +IR Sbjct: 204 LHWTLDVTFSEDKSRIR 220 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 110/357 (30%), Positives = 180/357 (50%), Gaps = 32/357 (8%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 +K L E + +PDYR+ K ++KL ILLL I + + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISP-------AKFHECFINWMRDCHSSNDKDVIAIDGKTL 113 G +G+P T+ R+ I ++F F + + C D++ IDGK + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAG----DILCIDGKAM 133 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDA 173 R + ++ R I +SA+S + + ++KSNEIT++P+LL+ +D+ G I+T DA Sbjct: 134 RGTVLENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADA 191 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSE-KSHGR 232 M QK I +KI+++GGD+L +K NQ L E+ L E D Y+ HGR Sbjct: 192 MSFQKAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAE----PVDVYSEGPFLEHGR 247 Query: 233 EEIRLHIVCDV--PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV--RYYISSA 288 E R VC + ++LI +W G L V V R+ E+K + + + R+Y+SS Sbjct: 248 IETR---VCRIFRGNDLITDREKWNG--NLTV-VEIRT-ATERKSDGQKSSERRFYVSSF 300 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 +A + T R HW +E+ +HW LD + +D + +A I+ + + IL+ Sbjct: 301 HGSARRLGTIARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAILS 356 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 92/237 (38%), Positives = 123/237 (51%), Gaps = 9/237 (3%) Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 + T+ KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK NQ +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 A EK EL H +Y HGR + R H V VP EW +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPGFAA-KGEWPWIKA 119 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + AV + A+ + E VRYY+ S L+ ++F +R HW +E+ +HW LDV E Sbjct: 120 IGTAVRI-TTHADGTQSDE--VRYYMLSRFLSGKRFGEVVRGHWGIES-MHWVLDVTFGE 175 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 D + R+ A S +R AI +L K +R KM + MD ++L VL G Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHPE-KDSIRGKMIRCLMDTSFLNEVLTLQG 231 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 139 bits (351), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 102/334 (30%), Positives = 159/334 (47%), Gaps = 13/334 (3%) Query: 40 GAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISP---AKFHECFINWMRD 96 GA++ +I +F E LK+ +G P HDT +R+ I P A+ F+ +R Sbjct: 37 GAKNCVEIAEFVEGREAELKEIVTLRHGCPSHDTFSRIFRLIDPDELARALGAFLAALRQ 96 Query: 97 CHS--SNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITA 154 + V+A+DGK LR Y+K R ++S + L + K + S+E+ A Sbjct: 97 GLGLGPRPRGVVAVDGKALRRGYEKGRAFMPPVMVSVWDAETRLSVA-TKRAEGSDEVAA 155 Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 LL +D+KG I+T DA+ C+ D A+ + + Y A+K N+GRL E F + Sbjct: 156 TLALLKSIDLKGCIVTADALHCRPDTAKALIGRKAHYALALKANRGRLFACAEAGFVAAD 215 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 + + E HGR E R V +P + + GLK + + R Sbjct: 216 AAG-DLAFHETRETGHGRLETRRASV--LPLKAFKQAPAFPGLKAIGRIQATRQ---GAD 269 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 +VRY S L K A +R HW +EN+LHW LDVV +EDD + R+ NA + + Sbjct: 270 GRAVTSVRYIALSKVLAPHKLAEVVRAHWTIENQLHWSLDVVFHEDDARSRKDNAPQNLA 329 Query: 335 GIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 IR +A +IL + K + KMR+ +R++ Sbjct: 330 VIRRLARDILAAHPLDKP-IASKMRRVNWNRDFF 362 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 138 bits (348), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 81/196 (41%), Positives = 112/196 (57%), Gaps = 9/196 (4%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 I D R K HK+ I+ ++I AVI GA+SW +IE+FG + F K IP HD Sbjct: 12 IEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPSLEFIPSHD 71 Query: 73 TIARVVSCISPAKFHECFINWMRD-CHSSNDKDVIAIDGKTLR--HSYDKSRRRGA---- 125 T R S I P F F NW++ C K V+AIDGK +R D RG Sbjct: 72 TFNRFFSMIKPDYFELIFRNWVKQVCQEV--KGVVAIDGKLMRGPSQCDGEHTRGKEGFK 129 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+S + + +GQ+K D KS+EITAIP L+N L++ G I+T DAMGCQKDI + I Sbjct: 130 LWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQKDITQTII 189 Query: 186 KQGGDYLFAVKGNQGR 201 +Y+ A+K N+ + Sbjct: 190 GHDANYIIAIKENKKK 205 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 132 bits (331), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 91/251 (36%), Positives = 133/251 (52%), Gaps = 18/251 (7%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRD-CHSSNDKDVIAIDGKTLRHS------YDKS 120 IP HDT R S I P F F NW++ C K V+AIDGK +R + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEV--KGVVAIDGKLMRGPSQCDGEHTTG 61 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + + ++SA+S + + +GQ+K D KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 62 KEGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDI 121 Query: 181 AEKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRL 237 + I + +Y+ A+K N+ + L K + + K+ + HGR E R Sbjct: 122 TQTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRT 181 Query: 238 HIVCDVPDELIDFTFEWK--GLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEK 294 V +++ F+ K GLK + S R+I+A + E VRYY++S D T E+ Sbjct: 182 CTVVSY-GSIMEKMFKKKLVGLKSIVGIKSERTIVATGEYTQE--VRYYVTSLDNTKPEE 238 Query: 295 FATAIRNHWHV 305 A+AIR HW + Sbjct: 239 IASAIRQHWSI 249 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 113/375 (30%), Positives = 181/375 (48%), Gaps = 36/375 (9%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDF-GETHLDFLKQ 60 E+ L+E ++ +PD R V H L+ +L LT AV++GA S + ++ E + L++ Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV--IAIDGKT 112 G + + P TI RV++ I W+ C + + +A+DGK+ Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWL-ACRQQDAGGLRALAVDGKS 156 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIITT 171 LR + RR +H+++A + LV+ Q+ +K+NEIT LL+ L D+ G ++T+ Sbjct: 157 LRGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTS 214 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHG 231 DA+ Q D A ++ + Y+ VK N +L+ + P +++ P D + HG Sbjct: 215 DALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKS-LPWQQI--PLQDRTRTT--GHG 269 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R EIR VC V + L + G ++ V + R + K T+ Y ++S L Sbjct: 270 RCEIRRLKVCTVNNLL------FPGARQ-AVQIVRRRVNRTTGKVSLKTI-YAVTS--LA 319 Query: 292 AEKFATA-----IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 AE+ A IR HW VE H R DV ED ++R GNA + + R++AI L Sbjct: 320 AEQAPPARVAQLIRGHWTVEALHHVR-DVTFAEDASQLRSGNAPQAMATYRNLAIGALRL 378 Query: 347 DKV--FKAGLRRKMR 359 V AGLRR R Sbjct: 379 AGVRNIAAGLRRTAR 393 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 127 bits (320), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 66/133 (49%), Positives = 90/133 (67%), Gaps = 4/133 (3%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D +R IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHDGAR--SPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD--SY 223 G IT DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + E + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AMSEKSHGREEIR 236 + ++K+HGR E R Sbjct: 119 SQTDKNHGRIETR 131 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 118 bits (295), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 77/195 (39%), Positives = 100/195 (51%), Gaps = 12/195 (6%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPEHDSYAMSE--KSHGREEIRLHIVCDVPDE---LIDFTFEWKGLKKLCVAVSFRS 268 E + +E K HGR E R VC V ++ L W GL++L + R Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETR---VCRVSEDVAWLASTGQHWAGLQRLVMLERTRQ 117 Query: 269 IIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 I QK E YYISS + A + A IR HW +EN+LHW LDV ED IR Sbjct: 118 I--GQKVTTERC--YYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTV 173 Query: 329 AAELFSGIRHIAINI 343 AA + +R I +N+ Sbjct: 174 AARNMASLRKITLNL 188 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 117 bits (294), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 95/344 (27%), Positives = 161/344 (46%), Gaps = 16/344 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ-YGDF 64 L+E + + D+R+ H L +L++ I + G + ++ +F + + L Q + Sbjct: 4 LIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEFNII 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSNDKDVIAIDGKTLRHSYDK--SR 121 +P + TI RV+ + + F W + + +D + + +DGK+L+++ + Sbjct: 64 PERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNPNNE 123 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD-KKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ I +S FS LV+ + + KK +EI ++ ++ K+ T DA+ CQK Sbjct: 124 QQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQKKT 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 I K DY+ VKGNQ L K ++ L + PE + + SHGR+ R V Sbjct: 184 ISLIAKTKNDYVITVKGNQKNLYKRIQD---LSNSSKPE-SCFLEQDNSHGRKISRKIEV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V E +G + L + + K E T YYISS +A+ FA IR Sbjct: 240 FKVRKN------ERQGFENLRRVIKVERKGSRGDKTYEETA-YYISSLTESAQVFAKIIR 292 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 293 GHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLF 336 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 114 bits (286), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 96/363 (26%), Positives = 153/363 (42%), Gaps = 52/363 (14%) Query: 29 ILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L+ + G + + +THL+ L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 CFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKK 148 F+ W+ + S + +A+DGK L + +K++ +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRNTH-LAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNK---A 205 +NEIT IPELL +LDI G I+T DA+G Q I E+I +QGG + VK NQ + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 206 FEEKFPLKELNN--------------PEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFT 251 F +K ++ +++ EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQ 239 Query: 252 FEWKGLKKLCVAVSFR----------------------------SIIAEQKKEPEMTVRY 283 EW ++ + R + AE+ ++ Sbjct: 240 KEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTA 299 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNED--DCKIRRGNAAELFSGIRHIAI 341 IS LTAE+ + R HW +EN+LH LD ED K R N S IR A Sbjct: 300 LISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSRNN----LSLIRKYAY 355 Query: 342 NIL 344 NIL Sbjct: 356 NIL 358 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 89/349 (25%), Positives = 168/349 (48%), Gaps = 22/349 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD R ++L ++ + + AV +GA S+ I D+ + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND-KDVIAIDGKTLRHSYDKS 120 +P TI +V + + ++ + +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQG-GDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHI 239 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSDPV---ERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK----- 294 + V L F + +++ + R ++ E+ Y I S L E+ Sbjct: 276 ILTVARGL-RFPYA----QQVIQIIRRRRVLGAGAWSTEVV--YAICS--LPCEQAPPKL 326 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 A+ IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 327 LASWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGL 375 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 110 bits (276), Expect = 7e-23, Method: Compositional matrix adjust. Identities = 58/132 (43%), Positives = 85/132 (64%), Gaps = 3/132 (2%) Query: 104 DVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD 163 D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT++KSNE TAIP+L +L Sbjct: 8 DIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKLFTLLA 67 Query: 164 IKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPEH 220 ++ +T DA+G Q+DIA++I + DYL VK NQ L++ + + K Sbjct: 68 LEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGFTEDFT 127 Query: 221 DSYAMSEKSHGR 232 DS HGR Sbjct: 128 DSVTEEGDKHGR 139 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 107/390 (27%), Positives = 173/390 (44%), Gaps = 37/390 (9%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDI----EDFGETHLDF 57 ++ L+ + I D R+A + LS +L + A ++GA +I DFG+ L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQYGDFENG---IPVHDTIARVVSCISPAKFHECFINWM--RDCHSSNDKDVIAIDGKT 112 L D G P I + + A F W+ + V+A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL-NMLDIKGKIITT 171 LR ++ + +R + ++SA LV GQ++ +NEIT + LL N+ DI G ++ T Sbjct: 141 LRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 172 -DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL-NKAFEEKFPLKELNNPEHDSYAMSEKS 229 DA+ Q + A + + G DY VKGNQ L K FE+ PL + P+H+ + E+ Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYRKTFEQTLPLLQ-KPPQHE---VEERG 254 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR + + +T E KG+ VA + E + R Y Sbjct: 255 HGRIK-----------KWQAWTTEAKGIGFPEVATAAVIRRDEFDLKGIRVSREYAHILT 303 Query: 290 LTAEKFATA------IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 A ATA IR HW +EN++H+ D ED + GN+ + R++AI I Sbjct: 304 SVAGNRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGI 363 Query: 344 LTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 + + + K ++ + A DR+ + +LA Sbjct: 364 IRRNGIRK--IKETLEYIAGDRDRVLPLLA 391 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 92/325 (28%), Positives = 144/325 (44%), Gaps = 29/325 (8%) Query: 50 FGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ +T NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQ-QTAPGRNEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ-GRLNKAFEEKFPLKELNNPEHDSYAMSEK 228 T DA+ C+ D A I GGDY A+K NQ G L + ++ L +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLG-----VQTAAEN 204 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA 288 H R E R + V D IDF GL+ + +V S A+ + VRY++ S Sbjct: 205 DHDRCERRRACIVAVND--IDF----PGLQAIG-SVEATSRHADGRLTSH--VRYFLLST 255 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 256 IMSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHP 315 Query: 349 VFKAGLRRKMRKAAMDRNYLASVLA 373 KA +RRK++ A D +L S++A Sbjct: 316 -DKASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 59/171 (34%), Positives = 92/171 (53%), Gaps = 19/171 (11%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAF----EEKFPLKELNNPEHDSYAMSEKSHGRE 233 K + I + G DY+ AVKGNQ RL++ E++ P+ E S ++ +S Sbjct: 3 KKTVQLIIEGGNDYVIAVKGNQKRLHEQIKLTTEQRLPVSLDITTERRSDRITTRS---- 58 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 V D+L +++W+GL++L F + +P + YYISS + A Sbjct: 59 -------VSVFDDLSGISYDWEGLQRLVKVERF----GTRAGKPYHQIVYYISSLTINAA 107 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 +FA IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL Sbjct: 108 QFAQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTIL 158 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 101/372 (27%), Positives = 164/372 (44%), Gaps = 35/372 (9%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDF----GETHLDFLKQY 61 L+ ++ +PD R V H L +L + AV++GA S + ++ + L L + Sbjct: 29 LVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELGVF 88 Query: 62 GDFENGI---PVHDTIARVVSCISPAKFHECFINWMRDCH--SSNDKDVIAIDGKTLRHS 116 D G+ P T R+++ + + W+ C ++ + V ++DGKTLR S Sbjct: 89 RDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLRGS 148 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 + +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 149 GPAGEQ---VHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADALHT 205 Query: 177 QKDIAE-KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 Q++ A + + Y+F VK NQ RL + + P ++ P D S + HGR +I Sbjct: 206 QREHARWLVDTKKAAYVFTVKKNQPRLYRQL-KTLPWTKI--PIQDE--TSTRGHGRYDI 260 Query: 236 R--LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYY----ISSAD 289 R + C P L DF + A+ R TV Y +S+A Sbjct: 261 RRLQAVTCTGPLAL-DFPHAVQ-------ALRIRRRRLNLATGRWSTVTVYAITNLSAAQ 312 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI--LTND 347 + A +R HW +E H R D ED ++R GNA + +R+ AIN+ LT Sbjct: 313 AGPAELADWLRGHWAIETLHHIR-DTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGI 371 Query: 348 KVFKAGLRRKMR 359 A LR R Sbjct: 372 TTIAAALRHNSR 383 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 102 bits (253), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 64/208 (30%), Positives = 103/208 (49%), Gaps = 5/208 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD R ++ L G+L L + AV+ G + E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H + + +A+DGK L S D Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRDGQVP- 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 122 -GTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +Q +GGD + K NQG L E F Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAF 208 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 102 bits (253), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 66/211 (31%), Positives = 98/211 (46%), Gaps = 3/211 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + L E +S IPD R + H L +L L A++ G S + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRDG 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 121 QVP--GQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +A + G DY+ K NQ L + E Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGL 209 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 57/135 (42%), Positives = 82/135 (60%), Gaps = 3/135 (2%) Query: 105 VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDI 164 VIAI+GK+LR + + A+H +SA++ + L +GQ+ +KSNEITAI ELL L + Sbjct: 5 VIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLPTLAL 64 Query: 165 KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPEHDS- 222 +G ++T DA+GCQ +AE+I GGDY+ AVK NQ L A + F L +P + Sbjct: 65 EGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPVRQTC 124 Query: 223 -YAMSEKSHGREEIR 236 + +K HGR E R Sbjct: 125 VHETLDKGHGRIETR 139 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 55/148 (37%), Positives = 84/148 (56%), Gaps = 3/148 (2%) Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 + NG P DT RV+ I P + C + ++ S + IAIDGK L+ S K+ Sbjct: 17 ELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHIAIDGKRLKGSKKKT-- 74 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 G+ H++SA+ L + Q +K NE+ AIPE+L+ LD+ G +I+ DAMG Q +IAE Sbjct: 75 -GSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSGAVISIDAMGTQTNIAE 133 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +I + DY+ ++KGNQ L + + F Sbjct: 134 QIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 58/150 (38%), Positives = 80/150 (53%), Gaps = 8/150 (5%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 G H++SA++T H + +G + T++KSNEITAI LL L K ++T DAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFE---EKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 I GGD++ AV+ NQ +L A EK E H ++ HGR + R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 CDVPDELIDFTF--EWKGLKKLCVAVSFRS 268 VP DF EW +K + AV + Sbjct: 122 AQVPP---DFAAKGEWPWIKAIGTAVRITT 148 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 60/189 (31%), Positives = 95/189 (50%), Gaps = 12/189 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDTAKNSP 61 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 LN +++ ++K HG E H + + +W GL++ +S R Sbjct: 62 LN-----AWSWTQKGHGHES---HCRLKIWEATESMKMQWAGLERF---ISIRRQGFRHH 110 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 K+ + T Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 111 KKFDSTT-YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINI 343 +R+IA N+ Sbjct: 170 ILRNIAFNL 178 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 43/90 (47%), Positives = 60/90 (66%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +A+N + +K A + RK + A M L Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVL 90 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 95.5 bits (236), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 65/179 (36%), Positives = 95/179 (53%), Gaps = 10/179 (5%) Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELI 248 GDYL VKGNQ +L +A E F + + + D A+ E+ HGR ++ V + I Sbjct: 7 GDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLSA--KGI 63 Query: 249 DFTFEWKGLKKLCVAVS-FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVEN 307 +W CV + S+ +KE ++ YYI+S LTAE+ A ++R W VEN Sbjct: 64 INPGDWPN----CVTIGRIDSMRVVDEKESDLERCYYITSRALTAEQLAASVRARWGVEN 119 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKAAMD 364 + HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + AA D Sbjct: 120 RFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKGAARD 178 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 95.1 bits (235), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 94/327 (28%), Positives = 143/327 (43%), Gaps = 28/327 (8%) Query: 28 GILLLTIFAVISGAESWEDIEDFGETHLD-FLKQYG-DFENGIPVHDTIARVVSCISPAK 85 +L + + A +G + + T D L Q G F P T V+S + PA Sbjct: 2 ALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRR--PSEKTFRAVLSRLDPAD 59 Query: 86 FHECFINWMRDCHSSNDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQ 142 + ++ +S+D IA+DGK LR + + A H++S F+ LV+GQ Sbjct: 60 LNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQ 117 Query: 143 IKTDKKSNEITAIPELLNMLDIKGK-IITTDAMGCQKDIAEKI-QKQGGDYLFAVKGNQG 200 + +KSNEI + LL +L + ++T DAM Q A+ I YL VK NQ Sbjct: 118 LAVAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQA 177 Query: 201 RLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIR-LHIVCDVPDELIDFTFEWKGLKK 259 ++ A P E+ D + HGR E R L I+ I F + K+ Sbjct: 178 KI-LARITALPWAEVPAAATDD----SRGHGRVETRTLQIITAARG--IGFPYA----KQ 226 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVV 316 + R I A ++ E V Y I S + T +R H +EN LHW DV Sbjct: 227 IIRITRERLITATDQRSVE--VVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVT 284 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINI 343 +ED + GN A++ + +R+ AIN+ Sbjct: 285 FDEDRQRAHTGNGAQVLATLRNTAINL 311 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 94.0 bits (232), Expect = 9e-18, Method: Compositional matrix adjust. Identities = 59/148 (39%), Positives = 82/148 (55%), Gaps = 11/148 (7%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMS-----EK 228 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F +E N +SY + K Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYF--EEANEANFESYNIDFAETYNK 58 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA 288 SHGR E R V L D + W+GL+ + + S R++ K++ + RYYISS Sbjct: 59 SHGRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTL----KEKTTIEHRYYISST 114 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVV 316 TA + R HW +EN LHWRLD+ Sbjct: 115 MATAAYLLNSSREHWGIENSLHWRLDIA 142 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 89/374 (23%), Positives = 160/374 (42%), Gaps = 45/374 (12%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDF-GETHLDFLKQ 60 E++ L + ++ +PD R + H+L IL L+ AV +G +S E+I + L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHEC---FINWMRDCHSSNDKDVIAIDGK 111 G + + P DT+ RV+S + + F + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGK 167 TLR + R A H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGR--APHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRLNKAFEE-----KFPLKELNNPEHD 221 ++T DA+ + A+ I + G ++F VK N L+ + K P+ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI--------- 266 Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW--------KGLKKLCVAVSFRSIIAEQ 273 ++ ++HGR E R I E I + + +++ + R+ + Sbjct: 267 GHSAEGRAHGRFERRT-IQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARV--T 323 Query: 274 KKEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + P + ++S L T A R HW +ENK+HW DV ED ++R G Sbjct: 324 RTIPSTVTVHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLP 383 Query: 331 ELFSGIRHIAINIL 344 + + +R++ I ++ Sbjct: 384 RIMTTLRNLIIGLI 397 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 92.4 bits (228), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 59/168 (35%), Positives = 89/168 (52%), Gaps = 19/168 (11%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K + SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP------LKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 DY+ +K NQG L ++ E+ F +EL +H +Y E HG EIR Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQEL---QHSTYKPEETGHGLHEIRNFGFQ 117 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 PD + W LK +V I + + + RY+ISS D Sbjct: 118 LDPDSV------WSNLK----SVGMVEPIGQVDDKTTVETRYFISSLD 155 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 53/127 (41%), Positives = 70/127 (55%), Gaps = 1/127 (0%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + + Y E+S GR E Sbjct: 12 VRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHES 71 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 72 RAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEEL 130 Query: 296 ATAIRNH 302 TA R H Sbjct: 131 LTASRLH 137 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 91.7 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 43/113 (38%), Positives = 68/113 (60%), Gaps = 4/113 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W LK + + S I + + + RY+ISS D E+ A ++R+HW +EN LHW L Sbjct: 15 WSNLKSVGMVES----IGQVDDKTTVETRYFISSLDSNGEQLANSVRSHWAIENSLHWVL 70 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 DV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 71 DVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 55/182 (30%), Positives = 91/182 (50%), Gaps = 4/182 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 + L+ + +PD R+A + L +L+ T+ A++SGA S+ I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 61 -YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND---KDVIAIDGKTLRHS 116 +G PV +T+ V+ + + F + + K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QK 178 QK Sbjct: 191 QK 192 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 90.1 bits (222), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 66/191 (34%), Positives = 92/191 (48%), Gaps = 22/191 (11%) Query: 192 LFAVKGNQG----RLNKAFE--EKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 + AVK NQ R+ A + E F L + +H +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREV---DKGHGRIETRRCLALDFPG 57 Query: 246 ELIDFTFE---WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 FE W GL+ + + S R I RYY+SS A + A A+R H Sbjct: 58 P-----FEPDLWPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAH 108 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +E+ +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA Sbjct: 109 WGIES-MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAG 167 Query: 363 MDRNYLASVLA 373 +Y A +L Sbjct: 168 ASDDYRAQLLG 178 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 87.4 bits (215), Expect = 7e-16, Method: Composition-based stats. Identities = 40/90 (44%), Positives = 59/90 (65%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 +E S IPD R +H I+ L +F+V++GA+S+ +IEDF E H+D+LK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMR 95 NGIP HDT +RV S I+PA F + F+ W++ Sbjct: 65 NGIPSHDTFSRVFSAINPASFQDSFLIWLK 94 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 85.1 bits (209), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 69/247 (27%), Positives = 108/247 (43%), Gaps = 14/247 (5%) Query: 40 GAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS 99 GA++ ++ +F E + L++ +G P HDT +RV + P + F +M Sbjct: 37 GAKTCVEMAEFSEARQEELREIVALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRG 96 Query: 100 S----NDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAI 155 + K V+AIDGK+LR YDK R ++S + I ++ +EI A Sbjct: 97 ALGLPAPKGVVAIDGKSLRRGYDKGRAFMPPLMVSVWDVETRPSIAAMRA-PGGDEIKAT 155 Query: 156 PELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKEL 215 +L L +KG +T DA+ C +A+ + Y +K N G L +A E F Sbjct: 156 LSVLKALTLKGCTVTADALHCHPAMAQALLAAKAQYALGLKANHGPLFRAAEAGFAAVT- 214 Query: 216 NNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKK 275 + + E+ HGREE R V V D L+ GLK + + R+ Sbjct: 215 ---DLAVFETRERGHGREEQRRASVLPV-DRLVKRP-SLPGLKAIGRIEAVRT---GANG 266 Query: 276 EPEMTVR 282 +PE VR Sbjct: 267 KPEQAVR 273 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 61/197 (30%), Positives = 98/197 (49%), Gaps = 19/197 (9%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDI----EDFGETHLDFLK---QYGD 63 + + D R+A + H +LL+ + V++G S+E I +D ++ L L G Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 P TI R++S P + ++ HSS IAIDGKT+R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYIV-AHSSGR--AIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTD-KKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGNQ 199 +I+++GGDY+F VK N+ Sbjct: 398 RIREKGGDYVFTVKDNR 414 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 58/180 (32%), Positives = 88/180 (48%), Gaps = 5/180 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ-Y 61 + L + + IPD+R+A L +LL +I A++SGA S+ I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSR 121 G P + +I + + F + VIA+DGKTLR S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHF--RAHAARLAEGAAVIALDGKTLRGSLDRFE 118 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD--KKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A V+SAF+T +V+GQI + K +EI A L+ L + G++ T DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 55/148 (37%), Positives = 74/148 (50%), Gaps = 6/148 (4%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFE--WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYY 284 +K HGR E R D L + WK + + S R I ++ E RY Sbjct: 137 DKGHGRIETRRCTAAGDLDWLATLGLKERWKKITSVAGIDSSRVIGSKT----ETDRRYV 192 Query: 285 ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 ISS +E+ A+R HW +EN LHW LDV ED C IR NAA FS +R A+N+ Sbjct: 193 ISSLPADSERILHAVRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLF 252 Query: 345 TNDKVFKAGLRRKMRKAAMDRNYLASVL 372 D GL +K + AA + +YLA++L Sbjct: 253 RADHSRAMGLPKKRKAAAWNPDYLANIL 280 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 91/374 (24%), Positives = 146/374 (39%), Gaps = 49/374 (13%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +++ L+E + +PD R+ V L +L L + AV GA + +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMR--------------DCHSSNDKDVIA 107 P T RV+ P E W + VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKTLRHSYDKSRRR---GAI---HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNM 161 DGKT+R +RRR G I V+ V+ + +EI A+ ++ Sbjct: 151 ADGKTMR----GARRRTGDGKIAQDQVVEILDHASGAVVA-CEPVNDGDEIGAVRTVMGR 205 Query: 162 L-----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 L + G ++ TDA Q + E++ GG +L VK NQ R+ A P ++ Sbjct: 206 LADRWGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRI-LAKVRALPWAQVR 264 Query: 217 NPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 D+ K+HGR E R V P +D G ++ + ++ + Sbjct: 265 --AQDT--CRGKAHGRAETRTVRVVQAPTH-VDLALA--GTAQV-IKITRHTRRRPHPGA 316 Query: 277 PEMTVR---YYISSADLTAE-----KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 P + R Y ++S L AE A +R+HW +EN++HW D +ED R GN Sbjct: 317 PAASTRENAYLLTS--LPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGN 374 Query: 329 AAELFSGIRHIAIN 342 + +R+ AI Sbjct: 375 GPINLACLRNTAIT 388 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 62/226 (27%), Positives = 108/226 (47%), Gaps = 17/226 (7%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +++ LM+ +S D R+ + H ++ + A++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDF----ENGI---PVHDTIARVVSCISPAKFHECFINWMRD----CHSSNDKDVIAIDG 110 F E I P T+ R + I + W + C D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIT 170 K +R + K++ IH ++AF +V+ Q D+K+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKI-QKQGGDYLFAVKGNQGRLNKAFE----EKFP 211 DA+ Q + A I + + DY+F VK NQ + + E E FP Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIESLPWEAFP 445 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 81.6 bits (200), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 85/357 (23%), Positives = 154/357 (43%), Gaps = 27/357 (7%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG-ETHLDFLKQ 60 ++ L+ + +PD+R V ++L+ +L L + I+G ++ + ++ + L Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD----VIAIDGKTLRHS 116 G F +P TI R+V P + + W +A DGK ++ + Sbjct: 84 LG-FPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGA 142 Query: 117 YDKSRRRGAIH---VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDA 173 + + G++ V+ A +G + +EI ++ L+N + ++TTD Sbjct: 143 RSRPPQ-GSVRQEAVVEAVRHDTGTALGHQRV-VAGDEIASVRRLVNRVCDHNTLVTTDC 200 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGR- 232 + + +A I+ +GG +LF++KGNQ + +A P E N + EK+HGR Sbjct: 201 LHAHEPLARAIRAKGGHWLFSIKGNQPTV-RAKLAGLPWDEFGN----QHVTREKAHGRI 255 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLC-VAVSFRSIIAEQKKEPEMTVRYY----ISS 287 EE L + L+ F +G +++ +A + R T +Y +S+ Sbjct: 256 EERALKALTPSAPSLVGF----RGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLST 311 Query: 288 ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 + + A R HW VE H R D M+ED IR NAA ++ R I+ L Sbjct: 312 DQASPAQLARWARGHWTVEAIHHVR-DRTMDEDRHTIRTKNAALNWAIARDTTISAL 367 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 80.9 bits (198), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 101/394 (25%), Positives = 159/394 (40%), Gaps = 85/394 (21%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAV-------ISGAESW------EDIE 48 +++ L+ + D R A V +++S +L L + A+ I+ A W E++ Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 49 DFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-------- 100 FG L + G + IP T+ V+ + P + + +R S+ Sbjct: 90 AFG---LPYHPLRGRYR--IPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPL 144 Query: 101 ------------------------NDKDVIAIDGKTLRHS--YDKSRRRGAIHVISAFST 134 + + IA+DGK LR + D SR + V+SA Sbjct: 145 MPDGGIEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR----VFVLSAVRH 200 Query: 135 MHSLVIGQIKTDKKSNEITAIPEL------LNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 + + + K+NEI PE L+ D+KG ++T DA+ Q+D A + ++G Sbjct: 201 GDGITLASREIGAKTNEI---PEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERG 257 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELI 248 YL +K NQ R P KE+ D + HGR E RL V V L Sbjct: 258 AHYLLTIKNNQ-RGQARQLHALPWKEIPVIHRDD----ARGHGRHEQRLVQVVTVNGLLF 312 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA-----IRNHW 303 + + + R + KK TV Y I+ DL AE+ + A R HW Sbjct: 313 PHAAQ-------VLRIQRRRRLYGAKKWSSETV-YAIT--DLPAEEASAAEIASWARGHW 362 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIR 337 VEN +HW DV NED ++R N + + +R Sbjct: 363 TVENTVHWCRDVTFNEDKSQVRTHNTPSVLAAVR 396 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 49/109 (44%), Positives = 64/109 (58%), Gaps = 4/109 (3%) Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNEDDCK 323 S R+I+A + E VRYY++S D T EK A+AIR HW + N LHW+LDV ED K Sbjct: 5 SERTIVAIGEYTQE--VRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFREDYSK 62 Query: 324 IRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 + NAA FS +A+ IL N+K K + K KA D NYL+ +L Sbjct: 63 -KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLL 110 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 4/109 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW L Sbjct: 7 WEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHWCL 62 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 D+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 63 DIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 43/105 (40%), Positives = 63/105 (60%) Query: 270 IAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E +IR+G+A Sbjct: 10 LVKQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQSRIRKGHA 69 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 70 DINFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLG 114 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 54/163 (33%), Positives = 80/163 (49%), Gaps = 6/163 (3%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH-DSYAMSEKSHGREEIRLHIV 240 EKI ++ GDY+ +K N + E F + PE +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + KE + V +YISS D+ + A +R Sbjct: 61 LKVSDWLSKAE-EWKGIKSVLEVCRKRS---DNGKESQEKV-FYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 HW VENK HW LDVV ED+C + AE + +R +A+N+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNL 158 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 36/48 (75%), Positives = 39/48 (81%) Query: 78 VSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 +SCI KFHECFIN MR+CHSS+D DVIAIDGK L HS DKSRRR A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 71/228 (31%), Positives = 106/228 (46%), Gaps = 12/228 (5%) Query: 100 SNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTD-KKSNEITAIPEL 158 S +K + DGK LR S + ++RG V+ I Q D +K +EI + L Sbjct: 51 SQEKQWFSGDGKELRGSIESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRAL 109 Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 L+ D+ + IT DA+ E I K GG +L +K NQ L + + P Sbjct: 110 LSKDDLASQKITLDALHLCPSTTEMITKAGGVFLIGLKENQPTLLAH------MTDCALP 163 Query: 219 EHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 D + +HGR E R + + DV + D ++ K+L V V R+ I ++ + Sbjct: 164 PIDQKTTFDFNHGRVEQRKYWLYDVSKQGFDPRWDNTAFKRL-VKVQ-RTRINQKNAKIS 221 Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 V YYIS+ + E A+RNHW VE H R DV +NED K ++ Sbjct: 222 REVSYYISN-ETAKEGIFDAVRNHWSVEVNNHIR-DVTLNEDQLKSKK 267 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 37/73 (50%), Positives = 49/73 (67%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++H + D R +H L I+LL I AV+SG+E WEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVV 78 GIP HDTIARV+ Sbjct: 67 AGIPRHDTIARVI 79 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 11/171 (6%) Query: 206 FEEKFPLKELNNPEHDSYAMSEKSHGREEIR-LHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F++ + L E + +SY EK HGR+E+R ++++ E + +W +K + V Sbjct: 3 FQDYWALPE---DKQESYITEEKGHGRKEVREVYVLPAAFSEAL--RQKWCLVKSIVAVV 57 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 RS+ K + YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I Sbjct: 58 RDRSV----KGKGSYETSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRI 113 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 G++A + R N+ + + RKM +AA +++Y VL S Sbjct: 114 YAGDSALNMACCRRFVQNLFRKSEG-NLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 75.1 bits (183), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 98/394 (24%), Positives = 157/394 (39%), Gaps = 72/394 (18%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVIS-GAESWEDIEDFGETH----LDFLKQ 60 L++ ++I D R H L+ IL + A ++ G + IE + + L L Sbjct: 30 LIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLHI 89 Query: 61 YGDFENGI---PVHDTIARVVSCISPAKFHEC---FINWMRDCHSSNDKDVI-------- 106 + D G+ P TI RV++ + + C F+N ++ D + Sbjct: 90 WRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRRT 149 Query: 107 ----------------------AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 A+DGK L+ + G +H+IS + + + V Q + Sbjct: 150 EREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDG--GRVHLISLAAHLDATVHAQRQ 207 Query: 145 TDKKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAEK-IQKQGGDYLFAVKGNQG 200 KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK NQ Sbjct: 208 IPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQP 267 Query: 201 RLNKAFEEKFPLKELNNPEHDSYAMSEK----SHGREEIRLHIVCDVPDELIDFTFEWKG 256 L+ + L + D A++ + HGR E R I+ P + IDF + + Sbjct: 268 TLHATA-----ITALTGTDTDFAAVTHRETHRGHGRTEYR--ILRTAPADGIDFPYAAQV 320 Query: 257 LKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK-----FATAIRNHWH-VENKLH 310 + L I KE V Y I+ DLTA + A +R HW +EN +H Sbjct: 321 FRVLRHRGGLDGI--RHSKE----VCYGIT--DLTARQAGPAHLAAYVRGHWKAIENGVH 372 Query: 311 WRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 DV ED C+ R + R++A L Sbjct: 373 HVRDVTFAEDACQARTATLPRALAAFRNLATGTL 406 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 47/176 (26%), Positives = 81/176 (46%), Gaps = 3/176 (1%) Query: 24 HKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVVSCIS 82 H L +L L AV+ + I FG + L F G P T+++ + I Sbjct: 6 HPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTLRRID 65 Query: 83 PAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQ 142 P + W+ + + + +A+DGK LR S D H ++A++ + V+GQ Sbjct: 66 PQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGDVP--GPHRVAAYAPHAAAVLGQ 123 Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 I+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ +G Sbjct: 124 IRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQ 179 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 66/244 (27%), Positives = 106/244 (43%), Gaps = 24/244 (9%) Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKG 166 A+DGKT R + K +H++ + ++GQ + D KSNE T LL L++ G Sbjct: 151 AVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRALLAPLELAG 208 Query: 167 KIITTDAM-GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAM 225 ++ DA+ + ++ + ++ YL K NQ +L +AF P E+ P D Sbjct: 209 AFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKL-RAFLAALPWTEI--PTAD--LT 263 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 ++ HGREE R V V +DF A+ R ++ + Y I Sbjct: 264 RDRGHGREETRTLKVATV--THLDFPHA-------AQAIRIRRWRRQKGQPASHETIYAI 314 Query: 286 SSADLTAEKFATAI-----RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 + D TA++ + A+ R WH+E K H+ DV ED R G + + R Sbjct: 315 T--DATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLALFRATV 372 Query: 341 INIL 344 + L Sbjct: 373 ADTL 376 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 50/165 (30%), Positives = 87/165 (52%), Gaps = 12/165 (7%) Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF--PLKELNNPEHDSYAMSEKSHGREEIRL 237 ++E+ ++ DY+ A+KGN + + ++ F P+ + H ++ +K HGR E R+ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSPVTSTRSV-HTTF---DKGHGRIERRI 56 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + + D + EWK L + S++ + KE +RY+I+S ++FA Sbjct: 57 YTL-DTNIGWFEDKKEWKHLAGFGMV---DSMVTRKGKECR-EIRYFITSV-TDVKQFAK 110 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 + +HW +EN LHW LDV+ +D+C + NAAE + IR I N Sbjct: 111 GVCSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYN 155 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 56/204 (27%), Positives = 96/204 (47%), Gaps = 16/204 (7%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGE--THLDFLKQYGDFENGI-- 68 +PD R +H L IL + + AV++ A+S+ + ++ T + F Sbjct: 230 LPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKRIRARFNPRTQR 289 Query: 69 ---PVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 P T+ RV+ + W+ + +A+DGK L+ + R G+ Sbjct: 290 YVAPSEPTLRRVLQGANVTALDAAIGAWLLGIAGF---EAVAVDGKVLKGAV---REDGS 343 Query: 126 -IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE-K 183 +H++SAF I Q + +K+NEI + LL +DI+ K++T DA+ Q+ A Sbjct: 344 QVHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADALHTQRKTARFL 403 Query: 184 IQKQGGDYLF-AVKGNQGRLNKAF 206 ++ + DYLF AVKGNQ +L + Sbjct: 404 VEDKKADYLFTAVKGNQRKLRNSL 427 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 38/59 (64%), Positives = 39/59 (66%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK 59 MELKKLMEHISIIPDYRQAWKVEHKL IL + FGETHLDFLK Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRGFGETHLDFLK 59 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 70.9 bits (172), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 40/124 (32%), Positives = 68/124 (54%), Gaps = 11/124 (8%) Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 SEK HGR E R + P ++ +WKGLK+ R++ K + + V Y I Sbjct: 4 SEKGHGRIEKR--TLETTP--IVTVGQKWKGLKQGLRITRERAV----KGKKTVEVVYGI 55 Query: 286 SS---ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 +S A A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ ++ Sbjct: 56 TSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVVVH 115 Query: 343 ILTN 346 +L + Sbjct: 116 LLAS 119 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 70.5 bits (171), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 38/81 (46%), Positives = 51/81 (62%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + + E++S Y Q +H I+ L + AVISGA SW +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLSDPRAYNQ----KHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCI 81 Y FE GIPV DTIARV+ I Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 36/73 (49%), Positives = 47/73 (64%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++H I D R +H L I+LL I AV+SG+E WE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVV 78 GIP HDTIARV+ Sbjct: 67 AGIPRHDTIARVI 79 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 51/180 (28%), Positives = 88/180 (48%), Gaps = 6/180 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+E ++ +PD+R A + L +LLL I +S + +EDF H + L Sbjct: 5 LLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQLP 64 Query: 66 -NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS---YDKSR 121 P T RV+ I F NW+ ++D + +DGK+++ + YD++ Sbjct: 65 PTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQAY 124 Query: 122 RRGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + I+V+S FS + I Q +K+ +EI + LL LD++G + T D++ CQK + Sbjct: 125 Q-DFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 68.9 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 32/75 (42%), Positives = 50/75 (66%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++++E + + D R A + +H L IL+L + AV+SGA+ W+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 67.4 bits (163), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 48/182 (26%), Positives = 79/182 (43%), Gaps = 17/182 (9%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK--------FPLKELNNPEHDSYAM 225 M Q D+ +Q++GGDY+ K NQG L E FP + D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 E S G + + L ++ W G++++ R + + E V Y I Sbjct: 61 CEVSKGHGWVERRTMTST-IWLNEYLTRWPGVQQVFRLTRTRQVGGKTTVE----VVYGI 115 Query: 286 SSADLTA---EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS A + R HW +E++ H R D + ED C++RRG A + + +R++A+ Sbjct: 116 SSLSSVAAAPDALLRYTRTHWGIESRHHIR-DATLGEDRCRVRRGAAPRVLAVLRNVAVY 174 Query: 343 IL 344 +L Sbjct: 175 LL 176 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 78/314 (24%), Positives = 131/314 (41%), Gaps = 47/314 (14%) Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----NDK---DVIAIDGKTLRHSYDK 119 G P T+ R+++ SPA E ++D + ND V++ DGK D Sbjct: 98 GKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTWSRTDG 157 Query: 120 SRRRGAIHVI-----SAFSTMHSL-----------VIGQIKTDKKSNEITA----IPELL 159 + +GA S+ T +L +GQ K E TA +P + Sbjct: 158 EKVKGAQQSAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFRRLLPAIS 217 Query: 160 NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE 219 L + +I+T DA C ++ AE + G Y+F +K NQ L+ + +L P Sbjct: 218 EQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHD-IARDYGQYDLGTPL 276 Query: 220 HDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR--SIIAEQKKEP 277 + +E+ G +R DV + L +C + R I+A ++ Sbjct: 277 ART---AERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDRRGEIVAVEQ--- 330 Query: 278 EMTVRYYISS---ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD---CKIRRGNAAE 331 RY+++S LT ++ +R HW +EN HW +DV++ ED+ C+ R + E Sbjct: 331 ----RYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASRAS-IE 385 Query: 332 LFSGIRHIAINILT 345 S +R I N ++ Sbjct: 386 TVSWLRLIGYNAVS 399 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 3/107 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTS--DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 51/167 (30%), Positives = 83/167 (49%), Gaps = 13/167 (7%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+A K + HKL +++L I +S S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAK-------FHECFINWMRDCHSSNDKDVIAIDGKTL 113 NGIP T+ R+ I F E F + + ++++ IDGK Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCA--QEIVCIDGKAE 152 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 R + K+ R I +SA S + + ++KSNEI A+P L++ Sbjct: 153 RGTVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLID 197 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 60/256 (23%), Positives = 112/256 (43%), Gaps = 22/256 (8%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+E ++ +PD R+ V ++ + +L + + A++SGA S+ I ++ + Sbjct: 51 LLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAGLGLT 110 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK-------------DVIAIDGKT 112 +P TI RV+ + A W++ + D V+A+DGK Sbjct: 111 GRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAVDGKA 170 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIITT 171 +R + + +H++ +V+ Q+ D+K+NEI +L+ + D+ +IT Sbjct: 171 MRATRHGTH---PVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDVLITV 227 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHG 231 DAM Q A+ + +G L VK NQ ++ + P K++ + + + HG Sbjct: 228 DAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRL-KTLPWKDVPV----GHTTTGRGHG 282 Query: 232 REEIRLHIVCDVPDEL 247 R E R VP L Sbjct: 283 RIETRTLKAVTVPAGL 298 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 7/119 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT-VRYYI 285 ++ HGR R + +P+EL + G+K C+AV I+ E K EP+ + YYI Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHNHALS--GIKS-CIAVE--RIVQEGKGEPKTSHFSYYI 88 Query: 286 SSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 ++ + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 89 TNHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLV 146 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 64.7 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 31/60 (51%), Positives = 34/60 (56%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 11 LLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 64.3 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 29/82 (35%), Positives = 52/82 (63%), Gaps = 2/82 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + +++H S + D RQ+W+V + L I LL + A +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV--VSC 80 + +E G+P HDT+ + +SC Sbjct: 77 FLPYERGLPAHDTLKGLSGISC 98 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 11/131 (8%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN--- 217 M +KG ++T DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 218 --PEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKK 275 P+HD + E SHGR R V + E + W ++ L V R A + Sbjct: 61 LKPDHDEF---EDSHGRTVRRRGWVLPLTPE-TKHSGSWPDIQALLVTEKIRQ--AHYSE 114 Query: 276 EPEMTVRYYIS 286 RYY+S Sbjct: 115 TVTSDFRYYLS 125 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 57/198 (28%), Positives = 86/198 (43%), Gaps = 14/198 (7%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 IP +L +++ GK IT DA+ QK +AE I + YLF VK NQ L + F ++ Sbjct: 3 IP-ILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEHRK 61 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 E D HGR + R +E ++F + + +S + Sbjct: 62 ----EPDYCLQDPPGHGRIDTRSIWTTTELNEYLEFPHVGQAF-----CIHKKSYDPKTN 112 Query: 275 KEPEMTVRYYISSADLTAEKFATAI---RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 K E T Y ++S A + R HW +EN H+ LD +ED +IR GN Sbjct: 113 KVCENTF-YGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKV 349 + +R AI +L + V Sbjct: 172 NTNRLRGFAIGLLKSKGV 189 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 49/176 (27%), Positives = 79/176 (44%), Gaps = 15/176 (8%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRL 237 K E + G D L +KGN +L A L + SY + R E R Sbjct: 6 KKTVETVLATGNDLLVQLKGNHPKLLAAVRT---LCQSRAHAEQSYTVDLGRRNRIEQRT 62 Query: 238 HIVCDVP-----DELID-FTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 + +P D D F +G +++ V + +++ P YY+++ + Sbjct: 63 VRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPRQESPA----YYLATCTAS 118 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 119 AATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRNPG--VFALLRHFALNLLRHN 172 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 38/120 (31%), Positives = 62/120 (51%), Gaps = 9/120 (7%) Query: 264 VSFRSIIAEQ---KKEPEMTV----RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 V +SIIA + K E + RYY++S + +RNHW +EN+LHW LDV Sbjct: 20 VGIKSIIATETISSKTNETAISAEWRYYVTSHETEKSDLHLYVRNHWSIENELHWHLDVH 79 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAGLRRKMRKAAMDRNYLASVLAG 374 +N+D K R A FS I+ + ++++ K +R ++++ D YL S+L+ Sbjct: 80 LNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKRSVRSRLKQVGWDTEYLVSLLSA 139 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 33/83 (39%), Positives = 50/83 (60%), Gaps = 1/83 (1%) Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE-K 183 A+H++SAF + +V+ Q+ +KSNEI A ELL LDI G +T DAM Q++ A Sbjct: 8 AVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARFA 67 Query: 184 IQKQGGDYLFAVKGNQGRLNKAF 206 ++ + D++ VK NQ L +A Sbjct: 68 VEDKRADFVMTVKDNQPELREAL 90 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 61.2 bits (147), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 45/121 (37%), Positives = 61/121 (50%), Gaps = 11/121 (9%) Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTF-EWKGLKK-LCVAVSFRSIIAEQKKEPEMTVRY 283 S +S GREE R C E + EW+ ++ LCV + Q K T Y Sbjct: 7 SIQSRGREEHR----CIQVYEPVGIALQEWEAIRSVLCV----QRWGTRQGKAYHNTA-Y 57 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 YISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I INI Sbjct: 58 YISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVINI 117 Query: 344 L 344 L Sbjct: 118 L 118 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 60.8 bits (146), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 15/187 (8%) Query: 56 DFLKQYG-DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD---VIAIDGK 111 D L Q G F P T V+S + PA + ++ +S+D IA+DGK Sbjct: 31 DVLAQLGVRFRR--PSEKTFRAVLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGK 88 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGK-IIT 170 LR + + A H++S F+ LV+GQ+ +KSNEI + LL +L + ++T Sbjct: 89 MLRGALRA--KATATHLVSVFAHRARLVLGQLAVAEKSNEIPCVRALLTLLPDNLRWLVT 146 Query: 171 TDAMGCQKDIAEKI-QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKS 229 DAM Q A+ I YL VK NQ ++ A P E+ D + Sbjct: 147 VDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI-LARITALPWAEVPAAATD----DSRG 201 Query: 230 HGREEIR 236 HGR + R Sbjct: 202 HGRVKTR 208 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 26/46 (56%), Positives = 40/46 (86%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNE 151 ++ DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTD+KSNE Sbjct: 26 LSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNE 71 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 42/157 (26%), Positives = 75/157 (47%), Gaps = 10/157 (6%) Query: 99 SSNDKDVIAIDGKTLRHSYD-KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 ++ + IA+DGK L+ S S RR H++SA + + + +++ K+NE T Sbjct: 127 TAGPRRAIAVDGKALKASARLTSPRR---HLLSAVTHGRVVTLARVEVGAKTNETTHFKP 183 Query: 158 LLNMLDIKGKIITTDAM-GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 LL LD+ ++T DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 184 LLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIP 242 Query: 217 NPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFE 253 +A SE HGR E C +PDEL + Sbjct: 243 V----QHAASEVGHGRRESSSIKTCAIPDELGGIAYP 275 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 30/92 (32%), Positives = 51/92 (55%), Gaps = 1/92 (1%) Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 R+ ISS DL + A+R+HW VE+ +HW LD+ D+ +I R +F+ +R IA Sbjct: 54 TRWNISSLDLHVVQALNAVRSHWQVES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIA 112 Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 + + D + RK + A +D +Y +++L Sbjct: 113 MTLFKQDTTKLVSMARKKKMAGLDDDYRSNLL 144 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 6/189 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+ H+ IPD R V +LL+ + ++S ES D+E F H L + E Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 66 NGIPVHDTIARVVSC-ISPAKFHECFINWM--RDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 P D+ R + A +W + + D D + DGKTLR S + + Sbjct: 73 LKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTSG 132 Query: 123 RGA--IHVISAFSTMHSLVIGQ-IKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 GA I ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 133 GGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQA 192 Query: 180 IAEKIQKQG 188 Q +G Sbjct: 193 FFGSSQSRG 201 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%) Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + +KE + V +SS + + +R HW +EN+LHW D V ED C R GN A Sbjct: 38 GKTRKETALGV-TSLSSGQASPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGA 96 Query: 331 ELFSGIRHIAINIL 344 + + +R++ I++L Sbjct: 97 HVMATLRNMTISLL 110 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 46/159 (28%), Positives = 77/159 (48%), Gaps = 12/159 (7%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ L FE + + P +++ + K R+E R V V D L EW Sbjct: 1 MKANQSNL---FETACAIAANDAPADTAFSRN-KGRSRQEDRTVEVFPVGDALAGT--EW 54 Query: 255 KGLKKLCVAVSFRSII---AEQKKEPEMTVRYYISSA-DLTAEKFATAIRNHWHVENKLH 310 + K + V+ R+++ A + V +Y+SSA + A +A AIR HW +EN+ H Sbjct: 55 QPFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNH 114 Query: 311 WRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + DV +ED +IR + + + R A+NI+ + + Sbjct: 115 YVRDVSCDEDKSRIR--DNPGIMARARSFALNIMRKNGI 151 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 26/70 (37%), Positives = 40/70 (57%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M L+ H + I D RQ+ KV + L +L +T+ VI+GAE W +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPV 70 G G+PV Sbjct: 72 KGILTEGVPV 81 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 72/332 (21%), Positives = 127/332 (38%), Gaps = 65/332 (19%) Query: 59 KQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYD 118 + G P ++T+ +++C+ WM + A DGK L Sbjct: 15 RPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVL----G 69 Query: 119 KSRRRGA--IHVISAFSTMHSLVIGQ---IKTDKKSNEITAIPELLNMLDIKGKIITTDA 173 S+R GA +H + + + + Q + D + + + E + G++++ DA Sbjct: 70 GSKRAGAPALHGVELVTHTTGMALAQREAVGGDAAAALLALLTEA----PLDGRMVSMDA 125 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP-----------------LKELN 216 + + I ++ G+YL VKG+Q ++ P L ++ Sbjct: 126 GFLNAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIA 185 Query: 217 NPEHDSYAMS----------------EKSHGREEIRLHIVCDVPD--ELIDFTFEWK--- 255 P + E+S GR EIR V D D + + W+ Sbjct: 186 PPRRKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVT 245 Query: 256 ---GLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWR 312 GL++ C R A+ E+TV +SS T +F +IRNHW +EN++H Sbjct: 246 QIGGLRRWC-----RRRHADLWTVEEVTV---VSSRQRTPAQFLASIRNHWTIENQVHRP 297 Query: 313 LDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 D M ED R + + R++ IN++ Sbjct: 298 RDGSMQEDRLHGR--AIGVILAVCRNVVINLI 327 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 54.7 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 25/67 (37%), Positives = 41/67 (61%) Query: 307 NKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K A M+ + Sbjct: 23 HQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKRLLACMEDD 82 Query: 367 YLASVLA 373 + +L Sbjct: 83 FREELLG 89 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 54.3 bits (129), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 13/95 (13%) Query: 266 FRSIIAEQKKEPEMTVR-----------YYISSADLTAEKFATAIRNHWHVENKLHWRLD 314 FR++I Q+ R YY+ L A +F+ AIRNHW VEN+ H+ D Sbjct: 70 FRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRNHWRVENRAHYVRD 129 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ED +IRR F+ +R A+N++ ++V Sbjct: 130 TRFQEDASRIRRNPCT--FALLRSFALNLMRFNRV 162 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 53.9 bits (128), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 43/125 (34%), Positives = 62/125 (49%), Gaps = 22/125 (17%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLL----TIFAVISGAESWEDIEDFGETHLDF 57 +LKKL+E S IPD R+A V+H+L+ +LL +F + S E+ D+ + F Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDM-----SRPAF 133 Query: 58 LKQ----YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC--------HSSNDKDV 105 L+ + + E +P DT+ARV+ I P K E FI +R H N Sbjct: 134 LQALQGLFPELET-LPHGDTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYP 192 Query: 106 IAIDG 110 IAIDG Sbjct: 193 IAIDG 197 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 53.9 bits (128), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 4/95 (4%) Query: 69 PVHDTIARVVSCISPAKFHECFINWM----RDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 PV+ ++ ++ I P F R C + IAIDGKTLR S+D Sbjct: 12 PVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFSDTK 71 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL 159 A +V+SAF+ H +++ D+KSNEI A L+ Sbjct: 72 AAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALI 106 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 53.9 bits (128), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 37/122 (30%), Positives = 65/122 (53%), Gaps = 15/122 (12%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVR--YYI 285 K HGR E R L ++ W G++++ FR + +++ + + TV Y I Sbjct: 3 KGHGRVERR---SITTTTWLNEYLTRWPGVQQV-----FR-LERQRRADGKTTVEVVYGI 53 Query: 286 SSADLTAEKFATAI---RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS A T + R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ Sbjct: 54 SSLSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVY 112 Query: 343 IL 344 +L Sbjct: 113 LL 114 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 44/77 (57%), Gaps = 5/77 (6%) Query: 272 EQKKEPEMTVRYYISSADLTAEKFATA-----IRNHWHVENKLHWRLDVVMNEDDCKIRR 326 E+ + TV + L+AEK A +R HW +EN+LH+ DV + ED C++R Sbjct: 10 ERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRVRM 69 Query: 327 GNAAELFSGIRHIAINI 343 G+A ++ + +R+ +++ Sbjct: 70 GHAPQVLAALRNAVVHL 86 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 25/85 (29%), Positives = 43/85 (50%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 +++H + D R L I+ + I AV++GA+ + IE +G+ +L+ + D Sbjct: 27 VLKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLP 86 Query: 66 NGIPVHDTIARVVSCISPAKFHECF 90 GIP HDT RV+ + P + F Sbjct: 87 KGIPSHDTFGRVLRILEPKQLQSGF 111 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 46/187 (24%), Positives = 87/187 (46%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L E +S IPD R A ++ L G+L L + A +S +S +E F + L G + Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLGLRK 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 P H + ++ + P K E + ++ +V+ +DGK L+ S + Sbjct: 62 P--PGHTILTLLLHRLDPEKLQEALLQVF---PGADLGEVLVVDGKHLKGSGKGKSPQ-- 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD---IKGKIITTDAMGCQKDIAE 182 + ++ + + Q K + + ++ A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGREDQ--ALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 51.2 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 4/81 (4%) Query: 267 RSIIAEQKKEPE--MTVR--YYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 RSI E+ +E +TV+ +Y+SS + +A + IR HW VEN++H+ DV ED Sbjct: 15 RSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGEDRS 74 Query: 323 KIRRGNAAELFSGIRHIAINI 343 +IR +++S R A+N+ Sbjct: 75 RIRTLPLVQVWSVARSFALNL 95 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI--LTNDKVFKAGLRRKMR 359 HW +EN+LHW DV +ED + R GNA ++ + +R++AI I LT K LR R Sbjct: 100 HWAIENRLHWVRDVTYDEDRHRARTGNAPQVMTSLRNLAITILRLTGAKNIAKALRHHAR 159 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 50.8 bits (120), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 48/188 (25%), Positives = 90/188 (47%), Gaps = 15/188 (7%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L + +S +PD R A + L G+L L + A +S +S +E F + L G + Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLGLRK 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS-YDKSRRRG 124 P H I ++ + P K + ++ +V+ +DGK LR S KS + Sbjct: 62 A--PGHTAITLLLHRLDPEKLQAALGQVFPE---ADLGEVLVVDGKHLRGSGKGKSPQVK 116 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD---IKGKIITTDAMGCQKDIA 181 + V++ +H+ + Q + + + E A ELL+ L+ ++GK++ DA ++A Sbjct: 117 LVEVLALH--LHT-TLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVA 171 Query: 182 EKIQKQGG 189 +++K+GG Sbjct: 172 ARVRKKGG 179 >UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia coli O157:H7 RepID=Q8X3B6_ECO57 Length = 50 Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats. Identities = 25/36 (69%), Positives = 27/36 (75%) Query: 343 ILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGLS 378 I ND VFKAGL KMRKA MDRN+LAS +A GLS Sbjct: 15 ISDNDNVFKAGLSCKMRKAVMDRNFLASGIAACGLS 50 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 28/55 (50%), Positives = 31/55 (56%), Gaps = 13/55 (23%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF----SG---------IRHIAINILTNDKVF 350 +HWRLDV MNEDDC+IRRGN F SG +R I INIL VF Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILKCTLVF 55 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 29/108 (26%), Positives = 54/108 (50%), Gaps = 5/108 (4%) Query: 267 RSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 R ++ + E Y ++S A++ R HW VEN+LH + D V+ ED + R+ Sbjct: 15 RRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKRDTVLGEDASRSRK 74 Query: 327 GNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 G A ++ +R + +N+L + + + R +RK + D L ++ G Sbjct: 75 GAAGLMY--LRDVILNLL---HLKRWPVLRSVRKFSADPKVLLRLIRG 117 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 48/203 (23%), Positives = 79/203 (38%), Gaps = 52/203 (25%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGN----QGRLNKAFEEKFPLKELNNPEHDSYAMSEKS 229 MGCQK+IA+ I KQ DY+ A+KG+ QG L +A+ K + D + + Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGEL-EAWWHKCQREGFTADNFDEHTTIDSG 59 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR E R V ++ ++W GLK + I Sbjct: 60 HGRIETRRCQQVLVNKSWLNNKYQWVGLKSI------------------------IKVTS 95 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 EK T + +IR+G F+ +R IA+ + ++ Sbjct: 96 DVHEKTTT-----------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQT 132 Query: 350 FKAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 133 KRASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 12/117 (10%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +L ++ IPD+R+A + L+ +LL +I AV+SGA S+ I+ F + H + L Sbjct: 2 QLKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQL 61 Query: 65 E-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN------DKDVIAIDGKTLR 114 PVH +I + + AK E + H+S IA+DGKTLR Sbjct: 62 HWKRAPVHTSIRYALQGLD-AKAGELAFHR----HASGLDGEGAQHASIAMDGKTLR 113 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 22/38 (57%), Positives = 28/38 (73%), Gaps = 1/38 (2%) Query: 282 RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 RYYISS +LTAE+ A + HW +E+ +HW LDV MNE Sbjct: 18 RYYISSKELTAEQAANTVSEHWGIES-MHWVLDVSMNE 54 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 79/354 (22%), Positives = 134/354 (37%), Gaps = 61/354 (17%) Query: 10 ISIIPDYRQA---WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFEN 66 + +PD R W + L+G+LL +++GA S + E+ + ++ Sbjct: 22 LEAVPDVRAREGRWSLAEILTGVLL----GIVAGARSLAEAEELTDGMSPAARRLASVPR 77 Query: 67 GIPVHDTIARVVSCISP-----AKFHECF-INWMRDCHSSNDKDV--IAIDGK-----TL 113 +P DT AR C P A H W R + D V +A+DGK TL Sbjct: 78 RLP--DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTL 135 Query: 114 RHSY------DKSRRRGAIHVISA--FSTMHSLVIGQIKTDKKSNEITAIPELL-NMLDI 164 H D G ++ S I + ++NE +L +++ Sbjct: 136 NHPLIQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVET 195 Query: 165 KG---KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD 221 G +++T DA + + G DY+FA+K + + K E E+ D Sbjct: 196 YGALFQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARRED 255 Query: 222 SYAMSEKSHGREEIRLHIVCDV--------PDELIDFTFEW---KGLKKLCVAVSFRSII 270 + + EI++ V P+E + W + ++ V ++ Sbjct: 256 --VLDNATTATREIQILAVDPSHGYGAGKGPEESV-----WSHARTFLRVTSTVRRSGVV 308 Query: 271 AEQKKEPEMTVRYYISS--AD-LTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 E+ R ++SS AD LT +++ +R HW VEN H LD ED+ Sbjct: 309 IERDS------RLFVSSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDE 356 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 38/125 (30%), Positives = 54/125 (43%), Gaps = 9/125 (7%) Query: 223 YAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVR 282 + S HGR E R C + DEL F G L V + + +E TV Sbjct: 25 HTASSAGHGRRESRSIKTCGIADELGGIAFP-HGRLALRVHRRRKQTGGCESRE---TV- 79 Query: 283 YYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHI 339 Y ++S D T + A A+R HW VE H R DV E+ + G A + R++ Sbjct: 80 YAVTSLDAHETTPAELAAAVRGHWTVEALRHVR-DVTYAEEASTLHTGTAPRAMATFRNL 138 Query: 340 AINIL 344 A+ +L Sbjct: 139 AVGLL 143 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 47/206 (22%), Positives = 91/206 (44%), Gaps = 14/206 (6%) Query: 7 MEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK-----QY 61 + +++ IPD R+ K +H+ +LL+ + AV SG + + + + FL + Sbjct: 10 LPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDEVHIRT 69 Query: 62 GDFENGIPVHDTIARVVSCISP--AKFHECFINWMRDCHSSNDKD-----VIAIDGKTLR 114 E +P T+ R+ +S + ++W R+ + K+ +A+DGK LR Sbjct: 70 RRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVDGKHLR 129 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKIITTDA 173 + R A+ +SA L +G Q D ++ + + L + ++T DA Sbjct: 130 GTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVD-WVLTGDA 188 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQ 199 C +++A + +Q G A KG + Sbjct: 189 ALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 45.8 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 30/82 (36%), Positives = 42/82 (51%), Gaps = 8/82 (9%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMS----- 226 D +GCQK IA+ I +Q DYL AVK NQ L++A F +E N Y + Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYF--EEANKARFAGYNIDYDEKI 65 Query: 227 EKSHGR-EEIRLHIVCDVPDEL 247 K GR E+ R + ++PD + Sbjct: 66 NKGPGRLEQRRCWVGYEIPDTI 87 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 53/122 (43%), Gaps = 16/122 (13%) Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS-------FRSIIAEQKKEPEMTVR 282 HGR+E R V DV L W GL V+ +S + + +E + Sbjct: 12 HGRQEHRWVEVFDVSGRLGP---TWDGLIAAVARVTRLTWHKDTKSGLWHKTQETAL--- 65 Query: 283 YYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 Y +L A TAIR HW VE + H+ DV ED +IR F+ +R A+N Sbjct: 66 -YACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIR--TKPGHFARLRSFALN 122 Query: 343 IL 344 IL Sbjct: 123 IL 124 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust. Identities = 33/115 (28%), Positives = 53/115 (46%), Gaps = 9/115 (7%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGET-HLDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T + L++ G Sbjct: 16 LWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGCQ 75 Query: 65 ENG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTL 113 E+ P T+ RV+ I NW+ S +A+DGKTL Sbjct: 76 ESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLGLS--PAALAVDGKTL 128 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 43.9 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 21/54 (38%), Positives = 36/54 (66%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFL 58 +L ++S IPD+R+A + L+ +LL +I A++SGA S+ I+ F +TH + L Sbjct: 2 QLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERL 55 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 42.7 bits (99), Expect = 0.020, Method: Compositional matrix adjust. Identities = 29/103 (28%), Positives = 54/103 (52%), Gaps = 15/103 (14%) Query: 267 RSIIAEQKKEPEMTVRYYISS-----ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 R ++ + E TV Y ++S AD A + + + W VEN+ W D +++ED Sbjct: 51 REVVRKGTGEVRRTVSYALTSLGPEVAD--ARRLGELLLSRWEVENRSFWVRDFLLHEDA 108 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 C++ RG A++ + +R +++L + G+R K KAA++ Sbjct: 109 CQV-RGVGAQVLAALRAFLVSLL-----HRQGVREK--KAALE 143 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 42.0 bits (97), Expect = 0.037, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 6/74 (8%) Query: 103 KDVIAIDGKTLRHS--YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 + +A+DGKT RH+ D S+ +H++ S ++ Q++ + K+NE LL Sbjct: 153 ESAVALDGKTSRHAKRADGSK----VHLVGVASHGDGRLLAQVEVEAKTNETAVFRRLLR 208 Query: 161 MLDIKGKIITTDAM 174 LD+ ++T DA+ Sbjct: 209 PLDLTNVLVTADAL 222 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 41.2 bits (95), Expect = 0.060, Method: Compositional matrix adjust. Identities = 29/118 (24%), Positives = 52/118 (44%), Gaps = 9/118 (7%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 E HGR+ + + P + W G + ++ + ++P +I+ Sbjct: 35 EIGHGRDILWTLRAKEAPQHI---KANWHGTSWIAEVIA----TGTRDRKPFKATHRFIT 87 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 S T + +R W VE+ HW D ++EDD + RGN A + + +R A+N+L Sbjct: 88 SLRTTPDALLRLVRERWSVES-WHWIRDTQLHEDDHRY-RGNGAGVMAALRTAAMNLL 143 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 40.4 bits (93), Expect = 0.096, Method: Compositional matrix adjust. Identities = 38/161 (23%), Positives = 67/161 (41%), Gaps = 19/161 (11%) Query: 26 LSGILLLTIFAVISGAESWEDIEDFG-ETHLDFLKQYGDFENGIPVHDTIARVVSCISPA 84 L+ +L L V++G +++ + ++ + + L +G GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFG-LTRGIPSERTTRRLVEGCDPV 106 Query: 85 KFHECFINWMRDCHSSNDKDV--IAIDGKTLR--HSYDKSRRRGAIHVISA------FST 134 E W+ + D +A DGKTL+ S+ ++ V+ A + Sbjct: 107 ALDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITA 166 Query: 135 MHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 H V+G +EI A+ L LD+ ++TT G Sbjct: 167 GHQRVVG-------GDEIAALEALAGRLDLTDVLVTTAEKG 200 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 506 e-142 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 423 e-117 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 416 e-115 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 415 e-114 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 410 e-113 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 408 e-112 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 404 e-111 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 399 e-109 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 387 e-106 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 378 e-103 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 371 e-101 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 368 e-100 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 368 e-100 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 362 1e-98 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 361 2e-98 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 359 1e-97 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 355 2e-96 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 354 4e-96 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 353 8e-96 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 349 1e-94 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 347 3e-94 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 347 3e-94 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 345 2e-93 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 345 2e-93 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 341 3e-92 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 340 4e-92 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 340 7e-92 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 338 2e-91 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 337 4e-91 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 334 3e-90 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 333 9e-90 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 332 1e-89 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 331 2e-89 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 330 6e-89 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 329 9e-89 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 325 1e-87 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 323 4e-87 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 318 2e-85 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 317 3e-85 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 314 3e-84 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 310 4e-83 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 309 1e-82 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 306 7e-82 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 300 5e-80 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 298 2e-79 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 283 1e-74 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 279 1e-73 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 275 3e-72 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 270 5e-71 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 269 2e-70 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 268 2e-70 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 266 1e-69 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 265 2e-69 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 261 4e-68 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 261 4e-68 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 260 5e-68 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 257 4e-67 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 254 5e-66 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 245 2e-63 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 244 4e-63 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 244 4e-63 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 243 6e-63 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 238 3e-61 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 237 7e-61 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 233 1e-59 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 221 5e-56 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 219 2e-55 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 218 3e-55 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 218 3e-55 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 217 4e-55 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 216 2e-54 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 207 7e-52 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 202 2e-50 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 201 4e-50 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 200 7e-50 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 194 6e-48 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 193 7e-48 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 189 1e-46 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 184 4e-45 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 182 2e-44 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 182 2e-44 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 176 1e-42 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 176 2e-42 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 175 2e-42 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 165 3e-39 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 164 3e-39 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 164 6e-39 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 163 9e-39 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 162 1e-38 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 162 2e-38 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 161 4e-38 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 161 5e-38 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 151 5e-35 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 150 7e-35 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 150 1e-34 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 149 2e-34 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 147 5e-34 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 147 7e-34 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 146 1e-33 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 146 1e-33 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 144 4e-33 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 143 1e-32 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 142 1e-32 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 140 8e-32 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 139 2e-31 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 138 3e-31 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 138 4e-31 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 138 4e-31 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 136 1e-30 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 134 6e-30 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 132 1e-29 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 131 3e-29 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 130 8e-29 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 130 1e-28 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 129 1e-28 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 128 3e-28 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 128 3e-28 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 127 6e-28 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 124 4e-27 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 124 6e-27 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 122 2e-26 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 122 3e-26 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 114 4e-24 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 114 4e-24 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 114 5e-24 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 114 7e-24 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 113 1e-23 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 109 3e-22 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 107 5e-22 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 105 3e-21 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 103 9e-21 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 101 4e-20 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 101 5e-20 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 99 2e-19 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 97 7e-19 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 97 1e-18 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 96 1e-18 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 96 2e-18 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 96 3e-18 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 96 3e-18 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 96 3e-18 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 95 4e-18 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 95 5e-18 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 94 9e-18 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 93 1e-17 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 89 3e-16 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 88 4e-16 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 88 4e-16 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 88 4e-16 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 87 1e-15 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 82 3e-14 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 82 4e-14 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 82 4e-14 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 81 8e-14 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 79 3e-13 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 79 3e-13 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 74 7e-12 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 72 5e-11 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 67 1e-09 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 67 1e-09 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 64 7e-09 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 57 1e-06 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 47 0.001 UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone... 46 0.002 Sequences not found previously or not previously below threshold: UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 129 1e-28 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 95 3e-18 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 91 6e-17 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 91 1e-16 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 88 4e-16 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 86 3e-15 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 85 5e-15 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 84 9e-15 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 84 9e-15 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 82 3e-14 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 81 6e-14 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 80 1e-13 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 80 1e-13 UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ... 79 2e-13 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 77 1e-12 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 73 2e-11 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 72 3e-11 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 72 3e-11 UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferro... 71 5e-11 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 68 5e-10 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 68 6e-10 UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitroso... 67 2e-09 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 65 6e-09 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 64 1e-08 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 63 2e-08 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 63 2e-08 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 62 4e-08 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 62 5e-08 UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 61 6e-08 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 60 9e-08 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 59 4e-07 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 58 5e-07 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 57 1e-06 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 56 2e-06 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 55 3e-06 UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=... 55 5e-06 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 54 7e-06 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 54 8e-06 UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitroco... 54 1e-05 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 54 1e-05 UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=... 51 6e-05 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 51 7e-05 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 50 9e-05 UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillace... 50 1e-04 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 50 1e-04 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 49 3e-04 UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata ob... 48 6e-04 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 47 0.001 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 47 0.001 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 47 0.001 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 46 0.002 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 46 0.002 UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodoco... 45 0.004 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 45 0.004 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 45 0.006 UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewane... 44 0.007 UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodosp... 44 0.008 UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aro... 44 0.010 UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia ... 44 0.010 UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus p... 44 0.011 UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteoba... 43 0.014 UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromon... 43 0.018 UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b... 43 0.023 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 41 0.064 UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia ... 41 0.079 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 506 bits (1303), Expect = e-142, Method: Composition-based stats. Identities = 370/378 (97%), Positives = 373/378 (98%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 MELKKLMEHISIIPDYRQ WKVEHKLS ILLLTI AVISGAE WEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS+DKDVIAIDGKTLRHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 RRRGAIHVISAFSTMHSLVIGQIKTD+KSNEITAIPELLNMLDIKGKIITTDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNPEHDSYA+SEKSHGREEIRLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 Query: 361 AAMDRNYLASVLAGSGLS 378 AAMDRNYLASVLAGSGLS Sbjct: 361 AAMDRNYLASVLAGSGLS 378 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 423 bits (1087), Expect = e-117, Method: Composition-based stats. Identities = 156/377 (41%), Positives = 222/377 (58%), Gaps = 10/377 (2%) Query: 2 ELKKLMEHISIIPDYRQA-WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 +K E+ + D R+ H IL++ + A+ISGA ++ +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 + NGIP HDT V++ +SP +F CF+ W + IAID KTLR S DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + +H++SA++T +LVIGQIKT++ SNEITAIPELLN LD+KG +++ DAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH---DSYAMSEKSHGREEIRL 237 AEKI ++ DY+ A+KGNQ +L+++ E F L N E D E S+GREEIR Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + +++I EWK +K + + S R KKE E +RYYISSA L+AE Sbjct: 245 AYATNEIEKIIAN-DEWKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLK 298 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +ENKLHW LDV ED+ +IR+ N AE + +R IA+N++ +K K G K Sbjct: 299 VVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATK 358 Query: 358 MRKAAMDRNYLASVLAG 374 A D YL +L G Sbjct: 359 RLMAGWDEKYLLKLLNG 375 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 416 bits (1069), Expect = e-115, Method: Composition-based stats. Identities = 182/377 (48%), Positives = 250/377 (66%), Gaps = 4/377 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M L+ +SII D RQ KV H L +L L I AVISG E WE+I+DFG LD+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 Y F GIP DTI+R+ I P +F +CF WM+ C + DVIAIDGKTLR S++K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A+KI +GGDYL VKGNQ RL A + F ++ L PE ++Y EK HGRE+ R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D +E+ D FEW GLK L AVSFR E+ + + V++YISSA L A+ A R Sbjct: 241 ADA-NEIGDLVFEWPGLKTLGYAVSFR---TEKDMQTTVAVKFYISSAKLDAKSLLEASR 296 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VEN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++ Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQ 356 Query: 361 AAMDRNYLASVLAGSGL 377 A +Y V++G L Sbjct: 357 ANRSDSYRELVVSGLSL 373 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. Identities = 168/373 (45%), Positives = 234/373 (62%), Gaps = 8/373 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+E SII D RQ K++H+L IL L + AVI GAE W+DIE+ G L++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 + GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH++SA++ + +V+GQ KTD KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPEHDSYAMSEKSHGREEIRLHIVC 241 + GDY+ AVK NQ +L++ + F + HD + S K HGR E+R + + Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D+ L W L+ + + S R I + RY+I+S A+ FA A+R Sbjct: 246 DMLSTLG-NPERWASLQSIGMVESERYI----DGKTTAETRYFITSIAPDAKIFANAVRK 300 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K KA Sbjct: 301 HWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRYKA 360 Query: 362 AMDRNYLASVLAG 374 + +Y VL G Sbjct: 361 TLQPDYAQKVLNG 373 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 410 bits (1053), Expect = e-113, Method: Composition-based stats. Identities = 148/385 (38%), Positives = 225/385 (58%), Gaps = 20/385 (5%) Query: 8 EHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG 67 ++ + D R +HKL I+ +TI AVI GA+SW DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIH 127 IP HDT RV S ++P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPE---------------LLNMLDIKGKIITTD 172 +ISA++T + LV+GQ D+KSNEITAIP+ LL +L + G I+T D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE---HDSYAMSEKS 229 A+GCQK+I ++I +Q DY+ +K NQG L + E F ++N E Y + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR+E+R + + E ID ++W L + R + + + RY+ISS + Sbjct: 251 HGRQEVRYYQMLSNVAEEIDPDWQWLNLNSIGYVEYLR--VENGTDKTSLERRYFISSLN 308 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + FA+++R HW +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K Sbjct: 309 NNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKT 368 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAG 374 K G++ K +KA D NYL VL Sbjct: 369 LKVGVKAKRKKAGWDENYLLKVLRN 393 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 408 bits (1048), Expect = e-112, Method: Composition-based stats. Identities = 144/375 (38%), Positives = 220/375 (58%), Gaps = 7/375 (1%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++EH S + D R A ++E+ L I+++T+ AV+ GA++W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSR 121 NG+P HDT V + + P + +CF+NW + + + ++IAIDGKTLR + Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + IH++SA+++ + LV+GQ D+KSNEITAIPELL +L+++G +++ DAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPEHDSYAMSEKSHGREEIRLH 238 E I + GDY+ A+KGNQG L + F + EHDSY EK HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D L+ W LK + S R + + RYY+ S + A++FA A Sbjct: 245 WTMGQTDYLLG-AERWAQLKSIGCVESCRR---QPGHPGTLQRRYYLLSIESDAQRFADA 300 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R+HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 Query: 359 RKAAMDRNYLASVLA 373 KA D NYL +L+ Sbjct: 361 LKAGWDDNYLVKILS 375 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 404 bits (1037), Expect = e-111, Method: Composition-based stats. Identities = 136/373 (36%), Positives = 211/373 (56%), Gaps = 8/373 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+ ++ I D R +H L +L + I AVI+G++ WED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---EHDSYAMSEKSHGREEIRLHIVC 241 +Q DY+ +K N L ++ F + N EHD Y K H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 DVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V + +W GL+ + V R + + +++Y++S A+ AIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWN----KTTHDIQFYLTSLPPNAQFLCHAIR 325 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM++ Sbjct: 326 THWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMKQ 385 Query: 361 AAMDRNYLASVLA 373 AAM+ NY+ +VL Sbjct: 386 AAMNNNYMMTVLN 398 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 399 bits (1025), Expect = e-109, Method: Composition-based stats. Identities = 158/372 (42%), Positives = 228/372 (61%), Gaps = 7/372 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M ++ +H S I D+RQ+ KV + L +L ++ AVI+ + W +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A I +GGDYL AVK NQG L KA + F + D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFS-PHRSAGLSDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 DFT W+ LK + + SFR++ + K + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDFTH-WEALKSIVMVESFRAV---KGKTASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E+ +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIES-MHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 387 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 155/372 (41%), Positives = 214/372 (57%), Gaps = 10/372 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 + + H S I D RQ KV + L ILLLT+ AV+SGA W I +G L FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + Sbjct: 84 ADGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 143 AIHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKI 202 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 + DY+ A+KGNQG L K E + ++ + EKSHGR E R VC Sbjct: 203 ISKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVC 262 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D L W GLK + + A + + RYYISS AE A AIR+ Sbjct: 263 TDIDWL-KADHNWPGLKSIVMVQY----HAILQDKTRAETRYYISSMTSDAEHHAKAIRD 317 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K A Sbjct: 318 HWGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVK-GKHSLRSKRHIA 376 Query: 362 AMDRNYLASVLA 373 + D ++LA ++ Sbjct: 377 SWDDDFLAEIIN 388 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 133/380 (35%), Positives = 217/380 (57%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + L+EH I D R + +H+L +L++ + ++ G E++ D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQG--Q 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++SA++ +SLV+GQI+ K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 125 SPRVIVSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLK----------ELNNPEHDSYAMSEKSHGRE 233 I + +Y+ A+KGNQG+ ++ + E N +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + L D +W GL+ + V S R + ++ P + RYY+SS ++ E Sbjct: 245 ETRRYWQSGDVSWLAD-RQQWAGLRSVGVVESVRQV---GQQAPTVERRYYLSSLNVDVE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 KFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K G Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 LRRKMRKAAMDRNYLASVLA 373 ++ K A+ D +YL +L+ Sbjct: 361 IKGKQLNASWDHDYLLRLLS 380 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 126/380 (33%), Positives = 195/380 (51%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 L E I D+R H L+ IL++ A++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCH--------SSNDKDVIAIDGKTLRH 115 NGIP HDT +V S + P +F E F W + S K VIAIDGK LR Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 + DK + ++ A+++ SL +GQ+K KSNEI A+PELL ML +KG I+T DAMG Sbjct: 134 AVDKG--QAPAVIVGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE-LNNPEHDSYAMSEKSHGREE 234 CQ+++A KI +Q GDY+ A+K NQ L++ E L E + + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 +R V + + + +W GL+ + R++ + + RY+ISS A Sbjct: 252 VRRCWVSEEVECWLQGAEKWAGLRSVAAVECERTV----AGQTTVQRRYFISSLKADAAL 307 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAG 353 A ++R HW +EN LHW LDV ED+ + RRG +AE + +R + ++ + K Sbjct: 308 IAASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKS 367 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + ++ +A + +YL ++L Sbjct: 368 VNQRRFEAGLSTDYLQTLLG 387 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 368 bits (945), Expect = e-100, Method: Composition-based stats. Identities = 137/384 (35%), Positives = 197/384 (51%), Gaps = 15/384 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+SW +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + ++SA+S ++ + +GQ+K D KSNEITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 DI + I ++ +Y+ A+K N+ + L K + + ++ + HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AE 293 R V F + GLK + S R+I+A E VRYY++S D T E Sbjct: 243 RTCTVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A+AIR HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K Sbjct: 301 EIASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGS 359 Query: 354 LRRKMRKAAMDRNYLASVLAGSGL 377 + K KA D YL+ +L + Sbjct: 360 MNLKRLKAGWDEKYLSQLLQNNNF 383 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 368 bits (944), Expect = e-100, Method: Composition-based stats. Identities = 137/369 (37%), Positives = 197/369 (53%), Gaps = 9/369 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L ++ I D R H+L I+ + +FAV++GA+SW IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGNQGRLNK---AFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 KQ DY+ A+KGNQ L K + E+F E+ + E +H R E R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VPDELIDFT-FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 V +W GL+ L V S R + + RY++SS A FA IR Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNKD----TTETRYFLSSLSTDAATFAHYIRA 309 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L K +A Sbjct: 310 HWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSS-KGSLVMKRYRA 368 Query: 362 AMDRNYLAS 370 +D ++ Sbjct: 369 GLDDQFMMQ 377 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 362 bits (928), Expect = 1e-98, Method: Composition-based stats. Identities = 128/372 (34%), Positives = 198/372 (53%), Gaps = 7/372 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 L ++E + D+R A + H+LS +L + + AV+SGA+ +E+I +G + +L+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD-VIAIDGKTLRHSYDKSR 121 + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + K+ Sbjct: 66 RLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKAA 125 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 +H++SAF+ +V+GQ T +KSNEITAIPELL +LDI+G I+T DAMG Q IA Sbjct: 126 -AAPLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 I+++G Y+ VK N +L + ++ + HGR E+R Sbjct: 185 RAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTAF 244 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D D L WK + V R++ + YYISS AE+ A AIR+ Sbjct: 245 DATDRLHK-AEAWKDVASFAVVERVRTV----GERTSTERVYYISSLPADAERIAVAIRS 299 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K ++ K A Sbjct: 300 HWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSIKTKRLLA 359 Query: 362 AMDRNYLASVLA 373 A + A++L Sbjct: 360 ATSDEFRAALLG 371 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 361 bits (927), Expect = 2e-98, Method: Composition-based stats. Identities = 142/376 (37%), Positives = 202/376 (53%), Gaps = 12/376 (3%) Query: 5 KLMEHISIIPDYRQA-WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 LM + D R+ H +L++ I AV+S ++ EDI +G D+L+Q+ Sbjct: 8 SLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLV 67 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 NG+ +T R+ + P +F F W+ + + +DGKT+R S S Sbjct: 68 LLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGGE 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 AIH++SAF+T +V+GQ K KSNEITAIPELL L I G ++T DAMGCQK+IA + Sbjct: 125 SAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQ 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDV 243 I QGGDYL AVKGNQ L A E +F + + + + D + SHGR ++ V Sbjct: 185 ITDQGGDYLLAVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVAQIASVL-- 241 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 P E I +W KK+ S R + E ++ RYYISS +LTAE+ A A+R HW Sbjct: 242 PAEGIVDLADWPECKKIARVDSLRKV---GNHESKLERRYYISSRELTAEQLAAAVRAHW 298 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV--FKAGLRRKMRKA 361 +EN+LHW LDV ED IR+GNA + S ++ I +N++ D K LR K + A Sbjct: 299 GIENRLHWVLDVSFGEDASTIRKGNAPQNLSLLKKIVLNLIRLDTADKTKTSLRLKRKCA 358 Query: 362 AMDRNYLASVLAGSGL 377 A + +L + L Sbjct: 359 AWTDDVRMRILGFTSL 374 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 359 bits (921), Expect = 1e-97, Method: Composition-based stats. Identities = 129/373 (34%), Positives = 205/373 (54%), Gaps = 10/373 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 + H + D R +H L ++ LT+ A++SGAE W+DI+ FG++ LD+L+++ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 + G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGDRK-T 120 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A+H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE---HDSYAMSEKSHGREEIRLHIVC 241 +GGDY+ VK NQG+L F + P+ +S ++ HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 + L + W +K + R + K + YYISS ++ + A AIR+ Sbjct: 241 PITPWLTQ-SQGWTNIKPVIEVTRKRYL----KDKETSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN HW LD+ EDD +IRRG+A E + R A+N+ K ++ K+++A Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLARLSP-IKDSMKGKLKQA 354 Query: 362 AMDRNYLASVLAG 374 A +L Sbjct: 355 AWSDEVREKLLFA 367 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 355 bits (910), Expect = 2e-96, Method: Composition-based stats. Identities = 125/380 (32%), Positives = 216/380 (56%), Gaps = 14/380 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++ ++E+ + + D R+ +H L +L++ + AVI+GA+ I + E H+++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----NDKDVIAIDGKTLRHS 116 + +G+P HDTI R+++ + P F +CF W+ + + +++IAIDGKTLR S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ + G + + SA++ + +GQ+ KSNEI PEL+ +D++ I+T DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPEHDSYAMSEKSHGRE 233 Q+D+AEKI GDY+ A+K NQ RL++ + + + + + + K HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 + R + +PDE + +W+GLK + VA+ I+++ RYYISS A+ Sbjct: 249 DKRFYYQVKLPDE-VPAGEDWRGLKTIGVAIR----ISQENGRETCDTRYYISSLKPDAK 303 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +FA A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K K Sbjct: 304 QFAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKS-KES 362 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + + R A + N+LA +L Sbjct: 363 VVMRRRMAGWNVNFLAEILG 382 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 354 bits (908), Expect = 4e-96, Method: Composition-based stats. Identities = 131/378 (34%), Positives = 193/378 (51%), Gaps = 12/378 (3%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M K L++++ IPD R K H LS ++ + I A++ G ++W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK-DVIAIDGKTLRHSYDK 119 + GIP HDT R+ + + PA F W+ D + +A+DGK LR + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A+H+++ +ST + +GQ K KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK----ELNNPEHDSYAMSEKSHGREEI 235 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V V DE + +WK K + + R + + VR+YISS L A Sbjct: 240 RRCWVLMV-DESMPVCQQWKA-KTIIAVQAERI----ENGKGYDFVRFYISSRALDATSA 293 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K + Sbjct: 294 LKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKSRNLSMA 353 Query: 356 RKMRKAAMDRNYLASVLA 373 K R ++ YL + Sbjct: 354 NKRRLCCLNEQYLFECMG 371 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 353 bits (905), Expect = 8e-96, Method: Composition-based stats. Identities = 142/375 (37%), Positives = 200/375 (53%), Gaps = 16/375 (4%) Query: 7 MEHISIIPDYRQA-WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 M + I D R+ H IL++ I AV+S ++ EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV-----IAIDGKTLRHSYDKS 120 NGIP +T R++ + P +F F W+ + D IAIDGKT+R S S Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GS 118 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 AIH++SAF+T LV+GQ K KSNEITAIPELL L IKG ++T DAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A++I + GDYL VKGNQ +L +A E F + + D + E+ HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASV 237 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 I +W + S R + K+ ++ RYYISS L+AE+ A A+R Sbjct: 238 LSAKG--IVDPADWPKCVTIGRIDSMRVV---GDKQSDLERRYYISSRALSAEQLAAAVR 292 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKM 358 HW VEN+LHW LDV +ED + + NA + S +R IA+ I+ DK K+ LR K Sbjct: 293 AHWGVENRLHWILDVSFSEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKR 352 Query: 359 RKAAMDRNYLASVLA 373 + AA D +L Sbjct: 353 KGAAWDDGVRERMLG 367 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats. Identities = 129/351 (36%), Positives = 187/351 (53%), Gaps = 5/351 (1%) Query: 22 VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+ +LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L+ + F +L HGR E R V D L + W GL + Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTC-IGHGRIEERTCQVADASAWLTEQHSGWAGLASIA 239 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 ++ R+ ++ E R YISS + A R+HW VEN LHW+LDV ED+ Sbjct: 240 AVIATRT--DKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFREDE 297 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 298 CRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 347 bits (891), Expect = 3e-94, Method: Composition-based stats. Identities = 112/369 (30%), Positives = 190/369 (51%), Gaps = 6/369 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+EH++++ + R +H L ++ L I A++SGAE W DIE +G++ +D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 ++ + VK NQ +L +A + +F E E HGR+E R Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + T +W ++ + RS + + YY+SS + IR HW + Sbjct: 248 PP-ELTEKWPTIRSIIAVERHRS----ANGKGTVDTSYYVSSLSPKHKLLGHYIRQHWRI 302 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN H+ LDVV NED +I +A E + R +NI+ R K+++A + Sbjct: 303 ENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGWND 362 Query: 366 NYLASVLAG 374 +Y A + G Sbjct: 363 DYRAQLFFG 371 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 347 bits (891), Expect = 3e-94, Method: Composition-based stats. Identities = 126/375 (33%), Positives = 194/375 (51%), Gaps = 13/375 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG--D 63 L+E S +PD R+ + L+ IL++ + A++ GA++W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH---DSYAMSEKSHGREEIRLH-I 239 I +GGDY+ VK NQ L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I + + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLA 373 A + +Y S++A Sbjct: 357 LHADRNESYRESLIA 371 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 345 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 128/382 (33%), Positives = 200/382 (52%), Gaps = 15/382 (3%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + +E ++ I D+R + ++L ILL++ AVI +++ ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV------IAIDGKTLRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGKT+ S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S + A HV++AF++ LV+GQIKTD+KSNEITAIPELL + +K ++T DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKA--FEEKFPLKELNNPE----HDSYAMSEKSHG 231 K+IA KI ++GGDY+ AVKGNQ +L + L++ + E EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R E R + + + +W+G+ + + R + + K + S + Sbjct: 241 RIEKRECYLSNDLS-WFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEAQ 299 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFK 351 A+ R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L K Sbjct: 300 AKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTCK 359 Query: 352 AGLRRKMRKAAMDRNYLASVLA 373 G+R K + + VL Sbjct: 360 CGMRSKRKLCGLGIPTALQVLG 381 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 345 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 134/375 (35%), Positives = 201/375 (53%), Gaps = 12/375 (3%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + + E++S D R A+ +H I+ L + AVISGA SW +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGKTLRHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNPE 116 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +IT DAM QK I Sbjct: 117 T-QSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH-DSYAMSEKSHGREEIRLHI 239 AEKI ++ GDY+ +K N + E F + PE ++Y R + R + Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYR 235 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V D L EWKG+K + RS + +YISS D+ + A + Sbjct: 236 KLKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDIQILAKCV 290 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 291 RGHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLT 349 Query: 360 KAAMDRNYLASVLAG 374 A + +L G Sbjct: 350 AAGWSDEFRDELLLG 364 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 341 bits (874), Expect = 3e-92, Method: Composition-based stats. Identities = 123/374 (32%), Positives = 191/374 (51%), Gaps = 9/374 (2%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + L I D R + + L ILL+T+ A+I G ++W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 G+P T ARV S I P +F C WM D+I +DGK+L S + + + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 A H+++A+ + +G+++ KSNEI AIP LLN L+++G II+ DAMG QK IA Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEK---SHGREEIRLHIV 240 I+ + DY+ A+K N R + E F + + + Y E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD-LTAEKFATAI 299 + + W+ L+ + S R + E E RYYI+S + + AI Sbjct: 254 LPM-MYFHKYKKYWRDLQAIVRVQSKRH----KGNEIETATRYYITSLPFAEHRRMSQAI 308 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN+LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K Sbjct: 309 RQHWAIENQLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRI 368 Query: 360 KAAMDRNYLASVLA 373 +AA+ YL V+ Sbjct: 369 QAALSTRYLRKVVG 382 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 340 bits (873), Expect = 4e-92, Method: Composition-based stats. Identities = 125/374 (33%), Positives = 186/374 (49%), Gaps = 11/374 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 ++ L + I D RQA KV H++ +L++ + + ES+ D+ DF ++ L +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D + IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDL----EGRHIAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI +KSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPEHDSYAMSEKSHGREEIRLHI 239 +I G DY+ A+K N R ++ + F E L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ + R + P V Y++ S E+ A + Sbjct: 237 ITEELDWYHK-SWKWAGLQSVAQV--RRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHP-AKVSLRRKRK 352 Query: 360 KAAMDRNYLASVLA 373 A MD + +L Sbjct: 353 LATMDPAFRLQMLG 366 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 340 bits (871), Expect = 7e-92, Method: Composition-based stats. Identities = 130/377 (34%), Positives = 196/377 (51%), Gaps = 13/377 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 L+ L+EH S I D R ++ H L ILLL + ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFP-GRADFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 IA I+ QG DYL AVK NQ L E F + + + HD +K HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEERHV 246 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ--KKEPEMTVRYYISSADLTAEKFA 296 V D L T + G +L + + RY+ISSA LTAE A Sbjct: 247 SVIREVDWL-SGTRRFPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHAA 305 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ K L+ Sbjct: 306 DAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQK-SLKT 364 Query: 357 KMRKAAMDRNYLASVLA 373 + + A +YLAS+L Sbjct: 365 RRKMAGWSDDYLASLLN 381 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 121/381 (31%), Positives = 187/381 (49%), Gaps = 19/381 (4%) Query: 6 LMEHISIIPDYRQA-WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L + +PD R H L+ IL + AVI+GAE WEDI ++G + F +++ + Sbjct: 5 LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLEL 64 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------NDKDVIAIDGKTLRHS 116 +NG+P HDT RV + + P F + F W + + + +A+DGK+ R S Sbjct: 65 KNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRS 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K G +H++ + +L++GQ + +EIT ++L LD+ G ++T DA GC Sbjct: 125 A-KPTFSGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGC 183 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP-LKELNNPEHDSYAMSEKSHGREEI 235 Q + E I+ +GG+Y+ VKGNQ L A F E D + +HGR E Sbjct: 184 QTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEE 243 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V PD L W G+ + + R + + K E T YY+SS + A + Sbjct: 244 RNVTVVHDPDGL---PAGWAGVGSVALVCRDRQV---KGKANESTAHYYLSSLRVGAAEL 297 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A IR HWH+E+ +HW LDV ED+ + R G+A IR +A+++L K + Sbjct: 298 AGYIRGHWHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAG-KKGSIH 355 Query: 356 RKMRKAAMDRNYLASVLAGSG 376 + +A D Y+A VL G Sbjct: 356 TRRLRAGWDDQYMAQVLQGLS 376 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 337 bits (864), Expect = 4e-91, Method: Composition-based stats. Identities = 135/381 (35%), Positives = 188/381 (49%), Gaps = 20/381 (5%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M ++ +IIPD R ++ + I+ + + AVI GA++W +IE FG+TH + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 IP HDT++R S + F ECF W+ D V+AIDGK + + DKS Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKS 116 Query: 121 RR-----RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 R ++++SA+S + + +GQ K ++KSNE AIPEL+ LD++ IIT DA+G Sbjct: 117 SNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIG 176 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN---PEHDSYAMSEKSHGR 232 CQK I + I + DY+ K N L E F L E + Y K HGR Sbjct: 177 CQKSITKLIIENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKRYFEENKGHGR 234 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 E R VC L F W G+K L + S R + KE M RYYISS + Sbjct: 235 SEYREC-VCISAKNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDP 290 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA 352 +IR HW VEN LHW LD+ EDD + + GNAA FS I +A+ +L K Sbjct: 291 IIILKSIRPHWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQ-SDIKL 348 Query: 353 GLRRKMRKAAMDRNYLASVLA 373 G+ K + D V+ Sbjct: 349 GMAGKRKACGWDEKIRDKVIG 369 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. Identities = 119/371 (32%), Positives = 191/371 (51%), Gaps = 9/371 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L +H+S++ D R H L +L L + AV SG + W +I+ FGE L++L+++ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 NGIP TIAR++ + P C +W+ D +++ K +IAIDGKTLR + Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASK--LGCN 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVP 244 + GDY+ VK NQ L +A + ++ + ++ + +A SEK HGR E R I +P Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIP 237 Query: 245 DELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 +L +W +K L R I + + +Y+SS D+ E ATA+R HW Sbjct: 238 SKLSPKLQEKWPSVKTLIAVERHRKI----GNKTSIETSFYLSSHDIDPEYIATAVRGHW 293 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 294 RIENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLL 353 Query: 364 DRNYLASVLAG 374 Y ++ Sbjct: 354 SDEYRELMIFA 364 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 333 bits (853), Expect = 9e-90, Method: Composition-based stats. Identities = 138/376 (36%), Positives = 194/376 (51%), Gaps = 25/376 (6%) Query: 15 DYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D R +H+ S I+L+ I AVI GA++W IEDFG++ F NGIP HDT Sbjct: 25 DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSHDTF 84 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSY---------------DK 119 R S + P KF E + W++ IAIDGKT+R +Y D Sbjct: 85 NRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQDKRHRKQGVLPDS 143 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + + +HVISAF+T + +GQ+ T +K NEI IPELL+ML IK IIT DA+GCQ+ Sbjct: 144 NTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDALGCQRT 203 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP--LKELNNPEHDSYAMSEKSHGREEIRL 237 IAEK+ K GDY+F VK NQ +L + + + D Y E+ HGR E R+ Sbjct: 204 IAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYETHEEGHGRNESRI 263 Query: 238 HIVCDVPDELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 C+ P L D +WK ++ + R K + R +ISS + A+K Sbjct: 264 CYCCNDPGFLGADIRKKWKNIQSFGYIENTR----NTNKGTTVEKRCFISSLEPDAQKIL 319 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 R HW +EN LHW+LDV +ED+ + RR +A FS + IA+ L N+K + + R Sbjct: 320 KNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLRNNK-REIPINR 377 Query: 357 KMRKAAMDRNYLASVL 372 K A D +L ++ Sbjct: 378 KRLIAGWDNEFLWELI 393 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 332 bits (851), Expect = 1e-89, Method: Composition-based stats. Identities = 105/375 (28%), Positives = 174/375 (46%), Gaps = 17/375 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 + + E +PD R A H L+ IL + + A + GA S D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----NDKDVIAIDGKTLRHSY 117 +NG+P HDT +RV + P F + F +M+ K VIA+DGK LR Y Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + R +++A++ + + ++ NE +L+ +L +KG ++T DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRL 237 + +AE I+ +GGDY+ AVK NQ L + + S + HGR+E R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQGKP--STITVDAGHGRKEKRR 239 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 +V VP D ++ GLK + S R + RY++ S + Sbjct: 240 AVVAAVPQMAQD--HDFAGLKAVARITSKRGTDKTVE-------RYFLMSQAYPPKDVLR 290 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN LHW LDVV++ED + R+ NA + +R +A+N+ LR K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 MRKAAMDRNYLASVL 372 +++A + +L ++ Sbjct: 351 LKRAGWNDTFLFELI 365 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 126/379 (33%), Positives = 199/379 (52%), Gaps = 17/379 (4%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L IL++ +FA ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + + K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ +KSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD---SYAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 + + + WKGLK + + E++ + + RY+ISS E + Sbjct: 239 EYYQT-EKIKWLSQKKAWKGLKSIIM----ERKTLEKEGKRLIEYRYFISSLKEEIETVS 293 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF--KAGL 354 A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V K + Sbjct: 294 RAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 355 RRKMRKAAMDR-NYLASVL 372 R+K + +L VL Sbjct: 353 RKKRYVIGLRPIKHLEEVL 371 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 330 bits (845), Expect = 6e-89, Method: Composition-based stats. Identities = 116/371 (31%), Positives = 189/371 (50%), Gaps = 7/371 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 +++H+ I D R EH + I L + AVISGA+SW +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKG-AKASASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 K+GGD + VKGNQ +L +A + +F NNP+ + + + K HGR E R+ C + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 PA-EIKMKWSQLKTLIAVERHRKV----GNKTSIDTHFYVSSAVLTSEAFGRAIRAHWQT 294 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 295 ENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHP-AKTSQTQKFNRACWSD 353 Query: 366 NYLASVLAGSG 376 ++ ++ G+G Sbjct: 354 DFREEIIFGTG 364 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 329 bits (844), Expect = 9e-89, Method: Composition-based stats. Identities = 110/369 (29%), Positives = 178/369 (48%), Gaps = 12/369 (3%) Query: 10 ISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIP 69 +PD R H L +L + + A I GAES D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 HDT +RV + P F CF ++ D + V+AIDGKTLR S+D++ R A+HV+ Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189 SAF++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GG Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELID 249 D+LF +K N+ L E F + + ++ HGR E+R H V L Sbjct: 188 DWLFPLKDNRPALRAEVERYF--ADPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWLAS 245 Query: 250 FTF-----EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 GLK L + + T Y+SSA L + A A+R HW Sbjct: 246 DRRFPDEAVLPGLKILGLVER---TVTSPDGRTTATRTLYLSSAALEPKTLARAVRAHWS 302 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +E +HW LD +ED + R+ + E + +R +A+N++ + + +R + ++A Sbjct: 303 IEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANN-QDSIRLRRKRAGWS 361 Query: 365 RNYLASVLA 373 +Y ++L Sbjct: 362 DDYARTILG 370 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 107/374 (28%), Positives = 175/374 (46%), Gaps = 13/374 (3%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 + + + +PD R A V H L +L++ +V+ G+ S ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN-DKDVIAIDGKTLRHSYDK 119 + ++ IP HDT + V I P F + D D D+IAIDGK LR + D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 ++SA+++ L + + D + E++A E L ++D++GK++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHI 239 I GGD+ A+KGNQ L F ++P + HGR+E R + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSDP---TAVTENTGHGRKETRKAV 244 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V + + E+ GLK + R E + RY+ S T E A+ Sbjct: 245 VVSA--KALAEYHEFPGLKGFGRIEATR----ETGGKVTSETRYFALSWVPTPEVLLAAV 298 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN LHW+LDV ED + R+ N + +R A+++L D K L K++ Sbjct: 299 RDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIK 357 Query: 360 KAAMDRNYLASVLA 373 +A D +L S+L+ Sbjct: 358 RAGWDTTFLRSILS 371 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 323 bits (829), Expect = 4e-87, Method: Composition-based stats. Identities = 128/381 (33%), Positives = 201/381 (52%), Gaps = 21/381 (5%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + ++ + + D R+ WK++H LS I+LL FA +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN---------DKDVIAIDGKTL 113 ENGIP HDT+ RV + + P E W S+ K ++AIDGKT+ Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDA 173 R + S ++ A+H+++A++T + GQ+ T++KSNEITAIPELL+M+ +KG +++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGRE 233 MG QK IA+KI K+ DY AVK NQ L + F + + + D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFEMSQEAD---DHYHTVEKAHGQI 240 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + V L E+ ++ + A I ++ + RY+I S ++A+ Sbjct: 241 ETRAYEVIHDVSWLRKTHPEFGHIQSIGRA----RIHLDKNGQESEESRYFILSCQVSAK 296 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKA 352 + +R HW +E+ +HW LDVV ED K A + + + +L K Sbjct: 297 ELCDYVRGHWQIES-MHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLKQLDFGKKM 355 Query: 353 GLRRKMRKAAMD-RNYLASVL 372 +RRK ++ YL +L Sbjct: 356 SMRRKKYALSLSFDKYLKQLL 376 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 318 bits (814), Expect = 2e-85, Method: Composition-based stats. Identities = 126/412 (30%), Positives = 189/412 (45%), Gaps = 44/412 (10%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V +SW DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK------------------- 103 P HDT+ R I + C+ W + + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 104 -DVIAIDGKTLRHSYDKSR--------------RRGAIHVISAFSTMHSLVIGQIKTDKK 148 IAIDGKT+ + + + +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP+LL+ +DI +G ++T DA+G QK I EKI ++ DYL VK N +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPEHDSYAMSE---KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 ++ E+D +E + HG R I C P L +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIK 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + IA E + +ISS E R HW VEN LHW+LDV NEDD + Sbjct: 317 TEKINIAT--GEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAELFSGIRHIAINILTN--DKVFKAGLRRKMRKAAMDRNYLASVLAG 374 + N+A+ FS + +A+ IL N D+ K + RK +KA YLA+++ Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLINN 425 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 317 bits (813), Expect = 3e-85, Method: Composition-based stats. Identities = 103/370 (27%), Positives = 168/370 (45%), Gaps = 13/370 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++ +PD R A H L +L++ +V+ GA S ++ FG + + + Sbjct: 37 ILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-NDKDVIAIDGKTLRHSYDKSRRRG 124 + +P HDT + V I P F + D + D DVIA+DGK LR + D Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++SA++ L + + D + E+ A E L ++ +KGK++T DA+ C + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVP 244 GGD+ A+K NQ L F + +P S + HGR E R V Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEPDAHP---SALSEDIGHGRTETRKATVVS-- 269 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 + + E+ GLK + R + + RY+ S T E +R HW Sbjct: 270 SKALAEHHEFPGLKAFGRVEATR----KTAEGTTSETRYFALSWVPTPEVLLATVRAHWA 325 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW+LDV ED + R+ N+ + +R A++++ D K L K+++A D Sbjct: 326 IENSLHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWD 384 Query: 365 RNYLASVLAG 374 ++L +VL G Sbjct: 385 DDFLRNVLNG 394 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 314 bits (805), Expect = 3e-84, Method: Composition-based stats. Identities = 117/369 (31%), Positives = 182/369 (49%), Gaps = 14/369 (3%) Query: 15 DYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQ+WK+ + LS IL L ++G E+ +++EDF E + Y D G P HDT+ Sbjct: 19 DSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPSHDTL 78 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSND-KDVIAIDGKTLRHSYDKSRRRGAIHVISAFS 133 RV+S ++ + E + + + S + +I++DGKT+R ++ + + +H+++A+ Sbjct: 79 ERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG--NRGKNQKPVHIVTAYD 136 Query: 134 TMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF 193 H L +GQ+ ++KSNEI AIP+LL +DI+ I+T DAMG Q I + I K DY Sbjct: 137 GGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCL 196 Query: 194 AVKGNQGRLNKAFEEKFP---LKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDF 250 AVKGNQ L F L E Y EKS G+ E+R + V L Sbjct: 197 AVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWLCQN 256 Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLH 310 +W L+ + + ++ + RY+I S FA +R HW +E+ +H Sbjct: 257 HPKWHKLRGIGMT----RNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRGHWQIES-MH 311 Query: 311 WRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL--RRKMRKAAMD-RNY 367 W LDVV +ED + AA + IR + + L K L RRK R ++ +Y Sbjct: 312 WLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKKDLSYRRKQRYISVHLEDY 371 Query: 368 LASVLAGSG 376 L + G Sbjct: 372 LVQLFGERG 380 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 310 bits (795), Expect = 4e-83, Method: Composition-based stats. Identities = 109/307 (35%), Positives = 163/307 (53%), Gaps = 7/307 (2%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +L + E +PD R + H LS +L + + AV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD-VIAIDGKTLRHSYDKS 120 + G+P HDT RV++ I PA F F+ W+ + D V+AIDGKT R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 G +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Q I Sbjct: 125 T-SGPLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A I+ +G DY+ VK N L + + K HGR E+R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D +L + +W GL+ + R++ + + YYISS A + A A+R Sbjct: 244 YDAVSQLYK-SEQWAGLQSFALVERERTV----DGKTSVERHYYISSLPADAARIAQAVR 298 Query: 301 NHWHVEN 307 +HW VE+ Sbjct: 299 SHWAVES 305 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 103/369 (27%), Positives = 185/369 (50%), Gaps = 7/369 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L++H+ II D R ++H L ++ LT+ A++SGA W+ IE FG LD+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + +KK +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 + D++ +KGNQ A + ++P + HGR+E R + + Sbjct: 182 SKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRRVMQIEGNL 240 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + + +W ++ L S R++ + + R+Y+SS + + A IR HW + Sbjct: 241 PP-ELSEKWPHIRTLVEVASERTV----GNKTACSSRWYVSSLPVDTAQLADIIRAHWAI 295 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN+LHW LDVV ED+ + + A+ + A++++ + K L K + AA D Sbjct: 296 ENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQSAAWDP 355 Query: 366 NYLASVLAG 374 + + +L G Sbjct: 356 AFRSELLFG 364 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 306 bits (784), Expect = 7e-82, Method: Composition-based stats. Identities = 116/342 (33%), Positives = 172/342 (50%), Gaps = 4/342 (1%) Query: 35 FAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWM 94 ++ AESWEDIE +G + +L+ + NGIP HDT RV + F CF + Sbjct: 1 MRRVACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCV 60 Query: 95 RDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITA 154 + ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI A Sbjct: 61 QFRAGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRA 120 Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 IPELL L + G I+T DAMGCQ IAE+I+ +G D L +K N G +A F Sbjct: 121 IPELLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTC 180 Query: 215 LNNPEHDSYAMSE-KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 L + + HGR R V D + W L ++ + R I Sbjct: 181 LGSGAAGRPVFDAFEGHGRLVRRRVFV-DAAATALAPLSGWPDLSRVLAVETLRGI--PG 237 Query: 274 KKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELF 333 +RY+++S IR HW VEN LHW L+V EDD ++R AA F Sbjct: 238 TGTVVADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNF 297 Query: 334 SGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 + +R IA+N++ D+ +A LR + +KAA D +Y+ ++A Sbjct: 298 ALVRKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIANQ 339 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 300 bits (769), Expect = 5e-80, Method: Composition-based stats. Identities = 118/385 (30%), Positives = 177/385 (45%), Gaps = 43/385 (11%) Query: 30 LLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V +SW DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWMRDCHSSNDK--------------------DVIAIDGKTLRHSYDKSR-------- 121 + W + + IAIDGKT+ + + + Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDI-KGKIITTDAM 174 +H++SAF + SL +GQ + K NEI AIP+LL+ +DI +G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSE---KSHG 231 G QK I EKI ++ DYL VK N +L + E ++ E+D +E + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA E + +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIAT--GEIQNEKHCFISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN--DKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL N D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAG 374 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLINN 382 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 298 bits (764), Expect = 2e-79, Method: Composition-based stats. Identities = 103/372 (27%), Positives = 169/372 (45%), Gaps = 14/372 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++ +PD R +H L IL + + AV+ GA ++E F + LD L+Q+ E Sbjct: 3 FLDVFGEVPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----NDKDVIAIDGKTLRHSYDKSR 121 G P HDT +RV++ + P +E F+ +M K +A+DGK+LR +Y K R Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 V++ F + + Q ++ E+ A L +L +KG +T DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 + ++ GG Y+ A+KGNQ +L + E +HGR E+R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKAA-AGKATKFHQTEEDAHGRHEVRRAFVI 239 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 L + S+R++ + + VR Y S + A + +R Sbjct: 240 PFAQTPGKNAL--VDLCAIGRVESWRTV----EGKTTHKVRCYALSRKMPAHELLATVRR 293 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW+LDV++ ED + R+ N A + +R + +N+L D K L K KA Sbjct: 294 HWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRADP-EKIPLSHKRLKA 352 Query: 362 AMDRNYLASVLA 373 L S+ Sbjct: 353 RWADQDLLSLFT 364 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 283 bits (723), Expect = 1e-74, Method: Composition-based stats. Identities = 118/367 (32%), Positives = 176/367 (47%), Gaps = 15/367 (4%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L E + ++P R K + L +LL+ + +SG SW +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYD 118 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-VK 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K HV+SAFS + Q+ D+K+NEI AI +LL++LD+ G +++ DA+G Q Sbjct: 123 KLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQT 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 I E+I +GGDY+ VK NQ + E F + D +E SHGR E R + Sbjct: 183 AIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRRY 240 Query: 239 --IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 I+ + E + KGL+ + V R ++ + V YYISS Sbjct: 241 ESILNPLEIEANEVLTRRKGLRSIHKVVRKRR--DKKSDKTSEEVAYYISSLT-DVSSLK 297 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLR 355 AIR HW +ENKLH LDV D R N A++ I+ I + I+ K K+ + Sbjct: 298 QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKTNMKSSIP 357 Query: 356 RKMRKAA 362 R +K A Sbjct: 358 RIQKKPA 364 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 279 bits (714), Expect = 1e-73, Method: Composition-based stats. Identities = 106/372 (28%), Positives = 168/372 (45%), Gaps = 14/372 (3%) Query: 7 MEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFEN 66 + + I D R H L+ +L L + A + GA++ +I +F E LK+ + Sbjct: 5 LSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCH-----SSNDKDVIAIDGKTLRHSYDKSR 121 G P HDT +R+ I P + ++ + V+A+DGK LR Y+K R Sbjct: 64 GCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGR 123 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 ++S + L + + + S+E+ A LL +D+KG I+T DA+ C+ D A Sbjct: 124 AFMPPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCRPDTA 182 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 + + + Y A+K N+GRL E F + + E HGR E R V Sbjct: 183 KALIGRKAHYALALKANRGRLFACAEAGFVAADAAGDLA-FHETRETGHGRLETRRASVL 241 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 P + + GLK + + R +VRY S L K A +R Sbjct: 242 --PLKAFKQAPAFPGLKAIGRIQATRQ---GADGRAVTSVRYIALSKVLAPHKLAEVVRA 296 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV +EDD + R+ NA + + IR +A +IL + K + KMR+ Sbjct: 297 HWTIENQLHWSLDVVFHEDDARSRKDNAPQNLAVIRRLARDILAAHPLDK-PIASKMRRV 355 Query: 362 AMDRNYLASVLA 373 +R++ Sbjct: 356 NWNRDFFHEFFT 367 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 275 bits (702), Expect = 3e-72, Method: Composition-based stats. Identities = 104/367 (28%), Positives = 179/367 (48%), Gaps = 17/367 (4%) Query: 9 HISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG- 67 I+++ D R ++++ L ILL++++A ISG + WE IED+ H + L+ +G Sbjct: 8 AIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGK 67 Query: 68 ------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSR 121 +P HDT V I P +F E + ++ + + IAIDGKT R ++ Sbjct: 68 ELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG-IKQTA 126 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G ++ Sbjct: 127 NSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVI 186 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 E I +GG+++ VKGNQ +L + E++F N D + HGR E R Sbjct: 187 EMILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSAD--TQEDIGHGRVEKRTVYCI 244 Query: 242 DVP---DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D++ +WKG+K L V R + + K + YYI++ + ++ A Sbjct: 245 TEIKTDDDIDGCMQKWKGVKTLVKIV--REVYKKADKSTRIETVYYITNL-IDPKEINRA 301 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA-GLRRK 357 IR HW +EN LH LDV++NED + N E F + +A+ I+ + + R Sbjct: 302 IRAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRGISMNRT 361 Query: 358 MRKAAMD 364 + Sbjct: 362 RKLCGYS 368 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 270 bits (691), Expect = 5e-71, Method: Composition-based stats. Identities = 107/286 (37%), Positives = 154/286 (53%), Gaps = 9/286 (3%) Query: 9 HISIIPDYRQAW-KVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG 67 +IPD R+A H LS IL + + AV+SG + WE + +FG T +L+Q+ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSND-KDVIAIDGKTLRHSYDKSRRRGAI 126 IP HDT RV S I P F F +W D D +A+DGKT+R S+ S R A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-AL 135 Query: 127 HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK 186 H++ A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 QGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDE 246 GGDY+ A+KGNQ L+ + +P+ + EK HGR E R V D D Sbjct: 196 AGGDYVLALKGNQSALHDDVRLFMETQADRHPQGQA-EAVEKDHGRIETRRIWVNDEIDW 254 Query: 247 LIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 L +W GLK L + S R + + R +I+S Sbjct: 255 LTQKP-DWPGLKTLVMVESRREL----NGQVSCERRCFITSHTADP 295 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 269 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 93/253 (36%), Positives = 144/253 (56%), Gaps = 7/253 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++H+++A+ + +L++GQ+K D KSNEITAIP+LL ML ++G I+T DAMGCQK IA++ Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP-EHDSYAMSEKSHGREEIRLHIVCD 242 I + DY+ AVK NQ L + + F ++N H + + HGR E R + Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYS-TI 119 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V D+L+ W L + + S R + RY+I S + A++F A+R H Sbjct: 120 VGDDLLAGITGWDNLNAIGMVESKREVGNT----ISNEKRYFIMSINGHAQRFGDAVREH 175 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK + A Sbjct: 176 WGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRKMAG 234 Query: 363 MDRNYLASVLAGS 375 D ++L VL G+ Sbjct: 235 WDNSFLIKVLTGN 247 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. Identities = 109/350 (31%), Positives = 168/350 (48%), Gaps = 16/350 (4%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+A K + HKLS I++L I +S S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H ++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + ++KSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 QKDI +KI+++ GD++ +K NQ L E+K +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKEL---SPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ + Sbjct: 327 GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVFS 375 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 103/372 (27%), Positives = 166/372 (44%), Gaps = 30/372 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 E+ L+E ++ +PD R V H L+ +L LT AV++GA S + ++ + L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFE-------NGIPVHDTIARVVSCISPAKFHECFINWMR-DCHSSNDKDVIAIDGKTL 113 P TI RV++ I W+ + +A+DGK+L Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIITTD 172 R + RR +H+++A + LV+ Q+ +K+NEIT LL+ L D+ G ++T+D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGR 232 A+ Q D A ++ + Y+ VK N +L+ + P +++ HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLK-SLPWQQIPL----QDRTRTTGHGR 270 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---D 289 EIR VC V + L + G ++ V R + + + Y ++S Sbjct: 271 CEIRRLKVCTVNNLL------FPGARQAVQIVRRR--VNRTTGKVSLKTIYAVTSLAAEQ 322 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + A IR HW VE LH DV ED ++R GNA + + R++AI L V Sbjct: 323 APPARVAQLIRGHWTVEA-LHHVRDVTFAEDASQLRSGNAPQAMATYRNLAIGALRLAGV 381 Query: 350 --FKAGLRRKMR 359 AGLRR R Sbjct: 382 RNIAAGLRRTAR 393 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 86/348 (24%), Positives = 158/348 (45%), Gaps = 16/348 (4%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ-Y 61 + L+E + + D+R+ H L +L++ I + G + ++ +F + + L Q + Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSNDKDVIAIDGKTLRHSYDK- 119 +P + TI RV+ + + F W + + +D + + +DGK+L+++ Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -SRRRGAIHVISAFSTMHSLVIGQIKTDKK-SNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + ++ I +S FS LV+ + + K +EI ++ ++ K+ T DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRL 237 K I K DY+ VKGNQ L K ++ ++ + + SHGR+ R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDLS----NSSKPESCFLEQDNSHGRKISRK 236 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 V V + ++ L+++ + + YYISS +A+ FA Sbjct: 237 IEVFKVRK---NERQGFENLRRVIKVER----KGSRGDKTYEETAYYISSLTESAQVFAK 289 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 IR HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 290 IIRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFR 337 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 261 bits (666), Expect = 4e-68, Method: Composition-based stats. Identities = 108/360 (30%), Positives = 163/360 (45%), Gaps = 41/360 (11%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 L+ + + I D RQ KV H+ I++ + V + +SW ++ DF +DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINW--------------------MRDCHSSND 102 P HDT+ R + P + W + + Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 KDVIAIDGKTLRHSYDKSRRRGA--------------IHVISAFSTMHSLVIGQIKTDKK 148 IAIDGKT++ + ++ RRR +H++SAFS L +GQ + DKK Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAF- 206 NEI AIP LL+ LDI +G ++T DAMG QKDI +I K+ YL VK NQ L + Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F L N + + E HG +R VC L +W+ L+ + Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIR 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + R + E E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 317 TER--VDEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 261 bits (666), Expect = 4e-68, Method: Composition-based stats. Identities = 87/365 (23%), Positives = 166/365 (45%), Gaps = 18/365 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD R ++L ++ + + AV +GA S+ I D+ + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV-IAIDGKTLRHSYDKS 120 +P TI +V + + ++ + +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQGG-DYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHI 239 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSD---PVERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---DLTAEKFA 296 + V L + +++ + R ++ V Y I S + A Sbjct: 276 ILTVARGL-----RFPYAQQVIQIIRRRRVLGAGAW--STEVVYAICSLPCEQAPPKLLA 328 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 + IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 329 SWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHRRAGHSNIARAC 388 Query: 357 KMRKA 361 + A Sbjct: 389 RRLAA 393 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. Identities = 89/247 (36%), Positives = 141/247 (57%), Gaps = 3/247 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L++H + D R +HKL I+++ + A+I GA+S+ +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K +++ EITAIP L+ +L++ G ++T DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 +G DY A+KGNQ L + +E F E EH + EK R E+ + Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAYRTE 248 Query: 243 VPDELID 249 Sbjct: 249 QERLWSH 255 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 107/382 (28%), Positives = 176/382 (46%), Gaps = 25/382 (6%) Query: 3 LKKLMEHISIIPDYRQ--AWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 +K L E + +PDYR+ ++KL ILLL I + + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN---DKDVIAIDGKTLRHSY 117 G +G+P T+ R+ I E + H D++ IDGK +R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 ++ R I +SA+S + + ++KSNEIT++P+LL+ +D+ G I+T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEK-SHGREEIR 236 K I +KI+++GGD+L +K NQ L E+ L E D Y+ HGR E R Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAE----PVDVYSEGPFLEHGRIETR 251 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 + + D LI +W G L V + + + R+Y+SS +A + Sbjct: 252 VCRIFRGND-LITDREKWNG--NLTVVEIRTATERKSDGQKSSERRFYVSSFHGSARRLG 308 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL-R 355 T R HW +E+ +HW LD + +D + +A I+ + + IL + + Sbjct: 309 TIARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAIL--------SIWK 359 Query: 356 RKMRKAAMDRNYLASVLAGSGL 377 K +K + A ++ L Sbjct: 360 GKRKKPSEKAKGTAELIGELSL 381 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 254 bits (648), Expect = 5e-66, Method: Composition-based stats. Identities = 89/207 (42%), Positives = 135/207 (65%), Gaps = 1/207 (0%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+SW ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + ++ Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF 210 I K+ DY+ AVK NQ +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 89/241 (36%), Positives = 137/241 (56%), Gaps = 8/241 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L IL++ +FA ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + + K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ +KSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD---SYAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 L 237 Sbjct: 239 E 239 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 103/390 (26%), Positives = 166/390 (42%), Gaps = 37/390 (9%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDI----EDFGETHLDF 57 ++ L+ + I D R+A + LS +L + A ++GA +I DFG+ L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQYGDFENGI---PVHDTIARVVSCISPAKFHECFINWM--RDCHSSNDKDVIAIDGKT 112 L D G P I + + A F W+ + V+A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKII-T 170 LR ++ + +R + ++SA LV GQ++ +NEIT + LL L DI G ++ T Sbjct: 141 LRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 TDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLN-KAFEEKFPLKELNNPEHDSYAMSEKS 229 DA+ Q + A + + G DY VKGNQ L K FE+ PL + + + E+ Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYRKTFEQTLPLLQKPP----QHEVEERG 254 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR + +T E KG+ VA + E + R Y Sbjct: 255 HGRI-----------KKWQAWTTEAKGIGFPEVATAAVIRRDEFDLKGIRVSREYAHILT 303 Query: 290 ------LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 TA IR HW +EN++H+ D ED + GN+ + R++AI I Sbjct: 304 SVAGNRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGI 363 Query: 344 LTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 + + + K ++ + A DR+ + +LA Sbjct: 364 IRRNGIRK--IKETLEYIAGDRDRVLPLLA 391 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 92/362 (25%), Positives = 148/362 (40%), Gaps = 48/362 (13%) Query: 29 ILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L+ + G + + +THL+ L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 CFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKK 148 F+ W+ + S + +A+DGK L + +K++ +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRN-THLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEE 208 +NEIT IPELL +LDI G I+T DA+G Q I E+I +QGG + VK NQ + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPLKELNNPEH-----------------DSYAMSEKSHGREEIRLHIVCDVPDELIDFT 251 E + + + EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQ 239 Query: 252 FEWKGLKKLCVAVSFR----------------------------SIIAEQKKEPEMTVRY 283 EW ++ + R + AE+ ++ Sbjct: 240 KEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTA 299 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 IS LTAE+ + R HW +EN+LH LD ED ++ S IR A NI Sbjct: 300 LISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSR--NNLSLIRKYAYNI 357 Query: 344 LT 345 L Sbjct: 358 LR 359 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 243 bits (621), Expect = 6e-63, Method: Composition-based stats. Identities = 83/273 (30%), Positives = 135/273 (49%), Gaps = 9/273 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 ++ L+E + + D R K+EH+L IL++ + AV++ AE++EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD------VIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + D IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE--HDSYAMSEKSHGREE 234 ++++A+ I +G YL +K NQ +++ F + + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR 267 R C W GL + + + R Sbjct: 242 RRRVFACPDAG-CFTTLRGWPGLTTVLASETIR 273 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 84/324 (25%), Positives = 135/324 (41%), Gaps = 27/324 (8%) Query: 50 FGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ + NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKS 229 T DA+ C+ D A I GGDY A+K NQ L + E + +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 H R E R + V D ++ GL+ + + VRY++ S Sbjct: 206 HDRCERRRACIVAVND------IDFPGLQAIGSVEATSRH---ADGRLTSHVRYFLLSTI 256 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 257 MSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHP- 315 Query: 350 FKAGLRRKMRKAAMDRNYLASVLA 373 KA +RRK++ A D +L S++A Sbjct: 316 DKASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 237 bits (604), Expect = 7e-61, Method: Composition-based stats. Identities = 79/372 (21%), Positives = 129/372 (34%), Gaps = 35/372 (9%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +++ L+E + +PD R+ V L +L L + AV GA + +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC--------------HSSNDKDVIA 107 P T RV+ P E W + VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKTLRHSYDKSRRRGAI--HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML--- 162 DGKT+R + ++ V+ V+ + +EI A+ ++ L Sbjct: 151 ADGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVA-CEPVNDGDEIGAVRTVMGRLADR 209 Query: 163 --DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH 220 + G ++ TDA Q + E++ GG +L VK NQ R+ P ++ Sbjct: 210 WGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRILAKVR-ALPWAQVRA--- 265 Query: 221 DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS--FRSIIAEQKKEPE 278 K+HGR E R V P G ++ R Sbjct: 266 -QDTCRGKAHGRAETRTVRVVQAP---THVDLALAGTAQVIKITRHTRRRPHPGAPAAST 321 Query: 279 MTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSG 335 Y ++S A +R+HW +EN++HW D +ED R GN + Sbjct: 322 RENAYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPINLAC 381 Query: 336 IRHIAINILTND 347 +R+ AI Sbjct: 382 LRNTAITRHRAH 393 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 233 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 87/397 (21%), Positives = 161/397 (40%), Gaps = 35/397 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDF-LKQ 60 E++ L + ++ +PD R + H+L IL L+ AV +G +S E+I + L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD---VIAIDGK 111 G + + P DT+ RV+S + + + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGK 167 TLR + R H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGRA--PHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQ-KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMS 226 ++T DA+ + A+ I + G ++F VK N L+ + ++ ++ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPIG----HSAE 271 Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT-----V 281 ++HGR E R + + + + + ++ V + T Sbjct: 272 GRAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIPSTVT 331 Query: 282 RYYISSADLTA---EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 + ++S L A A R HW +ENK+HW DV ED ++R G + + +R+ Sbjct: 332 VHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRN 391 Query: 339 IAINILTN--DKVFKAGLRRKMRKAAMDRNYLASVLA 373 + I ++ +RR D L ++L Sbjct: 392 LIIGLIRLAGHNRIAPTIRRIRH----DNALLLAILT 424 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 221 bits (562), Expect = 5e-56, Method: Composition-based stats. Identities = 81/383 (21%), Positives = 150/383 (39%), Gaps = 17/383 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD+R V ++L+ +L L + I+G ++ + ++ + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS----SNDKDVIAIDGKTLRHSY 117 F +P TI R+V P + + W +A DGK ++ + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRRGAIH--VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 + + V+ A +G + +EI ++ L+N + ++TTD + Sbjct: 144 SRPPQGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLVTTDCLH 202 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 + +A I+ +GG +LF++KGNQ + A P E N + EK+HGR E Sbjct: 203 AHEPLARAIRAKGGHWLFSIKGNQPTVR-AKLAGLPWDEFGN----QHVTREKAHGRIEE 257 Query: 236 RLHIV-CDVPDELIDFTFEWKGLK-KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 R L+ F + +K + E + +S+ + Sbjct: 258 RALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTDQASPA 317 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A R HW VE +H D M+ED IR NAA ++ R I+ L Sbjct: 318 QLARWARGHWTVEA-IHHVRDRTMDEDRHTIRTKNAALNWAIARDTTISALRLAGYKN-- 374 Query: 354 LRRKMRKAAMDRNYLASVLAGSG 376 +R+ R D + ++A + Sbjct: 375 IRQARRATIRDPGLVLQIIALTS 397 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 219 bits (557), Expect = 2e-55, Method: Composition-based stats. Identities = 85/328 (25%), Positives = 136/328 (41%), Gaps = 22/328 (6%) Query: 28 GILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFH 87 +L + + A +G + + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ECFINWMRDCHSSNDKDV---IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 ++ +S+D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDKKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRL 202 +KSNEI + LL +L ++T DAM Q A+ I YL VK NQ ++ Sbjct: 120 VAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 A P E+ D + HGR E R + + + K++ Sbjct: 180 L-ARITALPWAEVPAAATD----DSRGHGRVETRTLQIITAARGIG-----FPYAKQIIR 229 Query: 263 AVSFRSIIAEQKKEPEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNE 319 R I A + + V Y I S + T +R H +EN LHW DV +E Sbjct: 230 ITRERLITATD--QRSVEVVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTFDE 287 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTND 347 D + GN A++ + +R+ AIN+ + Sbjct: 288 DRQRAHTGNGAQVLATLRNTAINLHRLN 315 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. Identities = 91/369 (24%), Positives = 156/369 (42%), Gaps = 25/369 (6%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDF-LKQYG 62 L+ ++ +PD R V H L +L + AV++GA S + ++ L + G Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 63 DFE------NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--NDKDVIAIDGKTLR 114 F + P T R+++ + + W+ C + + V ++DGKTLR Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAM 174 S +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 147 GS---GPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIAE-KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGRE 233 Q++ A + + Y+F VK NQ RL + + P ++ S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKT-LPWTKIPI----QDETSTRGHGRY 258 Query: 234 EIRLHIVCDVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 +IR L +DF ++ L + ++ + + +S+A Sbjct: 259 DIRRLQAVTCTGPLALDFPHA---VQALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGP 315 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VF 350 + A +R HW +E LH D ED ++R GNA + +R+ AIN+L Sbjct: 316 AELADWLRGHWAIET-LHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGITTI 374 Query: 351 KAGLRRKMR 359 A LR R Sbjct: 375 AAALRHNSR 383 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 218 bits (554), Expect = 3e-55, Method: Composition-based stats. Identities = 88/249 (35%), Positives = 126/249 (50%), Gaps = 14/249 (5%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS------YDKSR 121 IP HDT R S I P F F NW++ V+AIDGK +R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 + I + +Y+ A+K N+ + L K + + K+ + HGR E R Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFA 296 V F + GLK + S R+I+A E VRYY++S D T E+ A Sbjct: 183 TVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPEEIA 240 Query: 297 TAIRNHWHV 305 +AIR HW + Sbjct: 241 SAIRQHWSI 249 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 217 bits (553), Expect = 4e-55, Method: Composition-based stats. Identities = 102/196 (52%), Positives = 133/196 (67%), Gaps = 13/196 (6%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M L L +H + + D RQA KV +KL +L L + AVISGAE WE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TD+KSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVK 196 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVK 183 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 216 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 76/219 (34%), Positives = 118/219 (53%), Gaps = 7/219 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+SW +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + + ++SA+S + + +GQ+K D KS+EITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 DI + I +Y+ A+K N+ + + ++ + + Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRD 221 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 207 bits (526), Expect = 7e-52, Method: Composition-based stats. Identities = 83/418 (19%), Positives = 145/418 (34%), Gaps = 62/418 (14%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVIS-GAESWEDIEDFGETHLDF------ 57 L++ ++I D R H L+ IL + A ++ G + IE + + Sbjct: 29 DLIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLH 88 Query: 58 -LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD------------ 104 + + P TI RV++ + + C ++ + Sbjct: 89 IWRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRR 148 Query: 105 ---------------------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 A+DGK L+ + R +H+IS + + + V Q Sbjct: 149 TEREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQR 206 Query: 144 KTDKKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAE-KIQKQGGDYLFAVKGNQ 199 + KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK NQ Sbjct: 207 QIPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQ 266 Query: 200 GRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + + ++ + + HGR E R+ P + IDF + + + Sbjct: 267 PTLHATAITALTGTDTDFAAV-THRETHRGHGRTEYRILR--TAPADGIDFPYAAQVFRV 323 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSA---DLTAEKFATAIRNHWH-VENKLHWRLDV 315 L R V Y I+ A +R HW +EN +H DV Sbjct: 324 L------RHRGGLDGIRHSKEVCYGITDLTARQAGPAHLAAYVRGHWKAIENGVHHVRDV 377 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 ED C+ R + R++A L + R+ D + + Sbjct: 378 TFAEDACQARTATLPRALAAFRNLATGTLRRAGHVN--IAHARREHGYDHQRVLDLFN 433 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. Identities = 91/237 (38%), Positives = 119/237 (50%), Gaps = 9/237 (3%) Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 + T+ KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK NQ +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 A EK EL H +Y HGR + R H V VP EW +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPG-FAAKGEWPWIKA 119 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + AV I VRYY+ S L+ ++F +R HW +E+ +HW LDV E Sbjct: 120 IGTAVR---ITTHADGTQSDEVRYYMLSRFLSGKRFGEVVRGHWGIES-MHWVLDVTFGE 175 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 D + R+ A S +R AI +L K +R KM + MD ++L VL G Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHP-EKDSIRGKMIRCLMDTSFLNEVLTLQG 231 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 201 bits (511), Expect = 4e-50, Method: Composition-based stats. Identities = 88/390 (22%), Positives = 146/390 (37%), Gaps = 61/390 (15%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVI-SGAESWEDIEDFGE-THLDFLK 59 +++ L+ + D R A V +++S +L L + A+ +G +S ++ + L Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 60 QY------GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS------------- 100 + IP T+ V+ + P + + +R S+ Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGG 149 Query: 101 -------------------NDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 + + IA+DGK LR + R + V+SA + + Sbjct: 150 IEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLA 207 Query: 142 QIKTDKKSNEITAIPELLNMLDI---KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + K+NEI LL+ LD KG ++T DA+ Q+D A + ++G YL +K N Sbjct: 208 SREIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNN 267 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q R P KE+ D + HGR E RL V V L + Sbjct: 268 Q-RGQARQLHALPWKEIPVIHRD----DARGHGRHEQRLVQVVTVNGLL------FPHAA 316 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT---AIRNHWHVENKLHWRLDV 315 ++ R + +K Y I+ A R HW VEN +HW DV Sbjct: 317 QVLRIQRRRRLYGAKKW--SSETVYAITDLPAEEASAAEIASWARGHWTVENTVHWCRDV 374 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILT 345 NED ++R N + + +R + L Sbjct: 375 TFNEDKSQVRTHNTPSVLAAVRDLIRGALK 404 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 200 bits (508), Expect = 7e-50, Method: Composition-based stats. Identities = 78/385 (20%), Positives = 139/385 (36%), Gaps = 46/385 (11%) Query: 8 EHISIIPDYRQAWKVEHKLSGILLLTIFAV-ISGAESWEDIEDFGETHLDFLKQYGDFE- 65 E ++ IPD+R A + + L + + + AV +G + + ++ + Sbjct: 26 ERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLRLPW 85 Query: 66 -----NGIPVHDTIARVVSCISPAKFHECFIN-------------------WMRDCHSSN 101 + +P TI R ++ + ++ + Sbjct: 86 NPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAGDQAV 145 Query: 102 DKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNM 161 A+DGKT R + K +H++ + ++GQ + D KSNE T LL Sbjct: 146 PVRAYAVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRALLAP 203 Query: 162 LDIKGKIITTDAMGC-QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH 220 L++ G ++ DA+ + ++ + ++ YL K NQ +L AF P E+ + Sbjct: 204 LELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLR-AFLAALPWTEIPTAD- 261 Query: 221 DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT 280 ++ HGREE R V V +DF + ++ R ++ + Sbjct: 262 ---LTRDRGHGREETRTLKVATVT--HLDFPHAAQAIR-------IRRWRRQKGQPASHE 309 Query: 281 VRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIR 337 Y I+ A A R WH+E K H+ DV ED R G + + R Sbjct: 310 TIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLALFR 369 Query: 338 HIAINILTNDKVFKAGLRRKMRKAA 362 + L R+ K A Sbjct: 370 ATVADTLRRAGHRSVPACRRAHKTA 394 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 194 bits (492), Expect = 6e-48, Method: Composition-based stats. Identities = 77/178 (43%), Positives = 108/178 (60%), Gaps = 3/178 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+ +PD R+ + H+L +LL I VISGAESW + + + LD+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 +GI HDT RV S + ++F CF+ W+ S + +AIDGK LR S+D + R Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGA--RS 123 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+G IT DAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPAR 181 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. Identities = 79/225 (35%), Positives = 107/225 (47%), Gaps = 9/225 (4%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPEHDSYAMSE--KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIA 271 E + +E K HGR E R+ V + L W GL++L + R I Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWLASTGQHWAGLQRLVMLERTRQI-- 118 Query: 272 EQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 ++ YYISS + A + A IR HW +EN+LHW LDV ED IR AA Sbjct: 119 --GQKVTTERCYYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 LFSGIRHIAINILT---NDKVFKAGLRRKMRKAAMDRNYLASVLA 373 + +R I +N+ N + K L+ AA D +L Sbjct: 177 NMASLRKITLNLARLAQNRQPKKVSLKNIRNLAAWDTAMRDDILG 221 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. Identities = 83/199 (41%), Positives = 111/199 (55%), Gaps = 8/199 (4%) Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 S +LV+GQ K + KSNEITAIP L+ ML+I+ IIT DAMGCQK+I I+K+ GDY+ Sbjct: 28 SLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYI 87 Query: 193 FAVKGNQGRLNKAFEEKF---PLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD-ELI 248 +K NQ L + +E F +E + EH Y E H R E R I V + Sbjct: 88 ITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSSLPCL 147 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W LK + + S R + + VR+YISS + ++K ATAIR+HW +EN Sbjct: 148 HNQDLWTELKTVVMVKSERRLWN----KTTTEVRFYISSVEKNSQKIATAIRSHWEIENS 203 Query: 309 LHWRLDVVMNEDDCKIRRG 327 LHW LDV +ED +IR Sbjct: 204 LHWTLDVTFSEDKSRIRTR 222 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 184 bits (468), Expect = 4e-45, Method: Composition-based stats. Identities = 66/218 (30%), Positives = 101/218 (46%), Gaps = 3/218 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + L E +S IPD R + H L +L L A++ G S + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRDG 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 121 --QVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 +A + G DY+ K NQ L + E ++ Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFEDAAR 216 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. Identities = 81/194 (41%), Positives = 120/194 (61%), Gaps = 2/194 (1%) Query: 94 MRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 ELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + D+ + EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 121 RRAPIDRDTCQI-EKQKGRVEARTYHVLSASDLIRDFST-WSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKEPEMTVRYYISS 287 + + + + + S Sbjct: 179 RARVGVPLLHKVQS 192 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 65/229 (28%), Positives = 106/229 (46%), Gaps = 5/229 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD R ++ L G+L L + AV+ G + E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H + + +A+DGK L S D + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRDG--QV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD-IKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHG 231 +Q +GGD + K NQG L E F + G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRG 229 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 176 bits (447), Expect = 1e-42, Method: Composition-based stats. Identities = 73/270 (27%), Positives = 118/270 (43%), Gaps = 12/270 (4%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + L+ + + D R H L +L L + A + GA++ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND----KDVIAIDGKTLRHSYD 118 +G P HDT +RV + P + F +M + K V+AIDGK+LR YD Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R ++S + I ++ +EI A +L L +KG +T DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 +A+ + Y +K N G L +A E F + + E+ HGREE R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGFA----AVTDLAVFETRERGHGREEQRRA 234 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRS 268 V V D L+ GLK + + R+ Sbjct: 235 SVLPV-DRLVKRPS-LPGLKAIGRIEAVRT 262 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 176 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 58/228 (25%), Positives = 104/228 (45%), Gaps = 14/228 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +++ LM+ +S D R+ + H ++ + A++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWM----RDCHSSNDKDVIAIDG 110 F P T+ R + I + W C D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIT 170 K +R + K++ IH ++AF +V+ Q D+K+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKIQK-QGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 DA+ Q + A I + + DY+F VK NQ + + E P + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIE-SLPWEAFPP 446 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 60/266 (22%), Positives = 113/266 (42%), Gaps = 22/266 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 + L+E ++ +PD R+ V ++ + +L + + A++SGA S+ I ++ + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND-------------KDVIAI 108 +P TI RV+ + A W++ + D + V+A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGK 167 DGK +R + +H++ +V+ Q+ D+K+NEI +L+ + D+ Sbjct: 167 DGKAMRAT---RHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSE 227 +IT DAM Q A+ + +G L VK NQ ++ + P K++ + + Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKT-LPWKDVPVG----HTTTG 278 Query: 228 KSHGREEIRLHIVCDVPDELIDFTFE 253 + HGR E R VP L Sbjct: 279 RGHGRIETRTLKAVTVPAGLGFPHAA 304 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. Identities = 71/275 (25%), Positives = 109/275 (39%), Gaps = 13/275 (4%) Query: 56 DFLKQYGDFE-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLR 114 L + D + + ++ + F S +K + DGK LR Sbjct: 6 SALCAFLDIPETTVVSRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELR 65 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDK-KSNEITAIPELLNMLDIKGKIITTDA 173 S + ++RG V+ I Q D K +EI + LL+ D+ + IT DA Sbjct: 66 GSIESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDA 124 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGRE 233 + E I K GG +L +K NQ L + + P D + +HGR Sbjct: 125 LHLCPSTTEMITKAGGVFLIGLKENQPTLLA------HMTDCALPPIDQKTTFDFNHGRV 178 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + + DV + D ++ K+L R I ++ + V YYIS+ E Sbjct: 179 EQRKYWLYDVSKQGFDPRWDNTAFKRLVKVQRTR--INQKNAKISREVSYYISNETA-KE 235 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 A+RNHW VE H DV +NED K ++ Sbjct: 236 GIFDAVRNHWSVEVNNH-IRDVTLNEDQLKSKKRQ 269 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 55/187 (29%), Positives = 93/187 (49%), Gaps = 4/187 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFL-K 59 + L+ + +PD R+A + L +L+ T+ A++SGA S+ I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH---SSNDKDVIAIDGKTLRHS 116 +G PV +T+ V+ + + F + +K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIAEK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 164 bits (414), Expect = 6e-39, Method: Composition-based stats. Identities = 65/189 (34%), Positives = 95/189 (50%), Gaps = 8/189 (4%) Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDV 243 I + GDYL VKGNQ +L +A E F + + + D A+ E+ HGR ++ V Sbjct: 2 IIAKKGDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLSA 60 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 I +W + S R + +KE ++ YYI+S LTAE+ A ++R W Sbjct: 61 KG--IINPGDWPNCVTIGRIDSMRVVD---EKESDLERCYYITSRALTAEQLAASVRARW 115 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKA 361 VEN+ HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + A Sbjct: 116 GVENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKGA 175 Query: 362 AMDRNYLAS 370 A D Sbjct: 176 ARDDGVREP 184 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 163 bits (412), Expect = 9e-39, Method: Composition-based stats. Identities = 56/194 (28%), Positives = 86/194 (44%), Gaps = 7/194 (3%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH-DSYAMSEKSHGREEIRLHIV 240 EKI ++ GDY+ +K N + E F + PE +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + +YISS D+ + A +R Sbjct: 61 LKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLTA 174 Query: 361 AAMDRNYLASVLAG 374 A + +L G Sbjct: 175 AGWSDEFRDELLLG 188 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. Identities = 56/164 (34%), Positives = 88/164 (53%), Gaps = 3/164 (1%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVI 106 + + L+ + NG P DT RV+ I P + C + ++ S + I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKG 166 AIDGK L+ S K+ G+ H++SA+ L + Q +K NE+ AIPE+L+ LD+ G Sbjct: 61 AIDGKRLKGSKKKT---GSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +I+ DAMG Q +IAE+I + DY+ ++KGNQ L + + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. Identities = 57/227 (25%), Positives = 103/227 (45%), Gaps = 15/227 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG-ETHLDFLKQ 60 +++ L + +PD R +H L IL + + AV++ A+S+ + ++ LK+ Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLR 114 N P T+ RV+ + W+ + +A+DGK L+ Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAWLLGIAGF---EAVAVDGKVLK 335 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAM 174 + + + +H++SAF I Q + +K+NEI + LL +DI+ K++T DA+ Sbjct: 336 GAVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADAL 393 Query: 175 GCQKDIAEKIQK-QGGDYLF-AVKGNQGRLNKAFEEKFPLKELNNPE 219 Q+ A + + + DYLF AVKGNQ +L + P + Sbjct: 394 HTQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLI-CLPWGDFPPQR 439 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats. Identities = 62/189 (32%), Positives = 86/189 (45%), Gaps = 10/189 (5%) Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPEHDS---YAMSEKSHGREEIRLHIVCDVPDELI 248 + AVK NQ L E + S + +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFE 60 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W GL+ + + S R I RYY+SS A + A A+R HW +E+ Sbjct: 61 PDL--WPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAHWGIES- 113 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA +Y Sbjct: 114 MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAGASDDYR 173 Query: 369 ASVLAGSGL 377 A +L L Sbjct: 174 AQLLGLKTL 182 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 161 bits (407), Expect = 5e-38, Method: Composition-based stats. Identities = 59/223 (26%), Positives = 100/223 (44%), Gaps = 19/223 (8%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLD-FLKQYGDFENG-- 67 + + D R+A + H +LL+ + V++G S+E I + + L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 68 ----IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 P TI R++S P + ++ + IAIDGKT+R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYIVA---HSSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTDK-KSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAM 225 +I+++GGDY+F VK N+ L +P D Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDDLLRT 440 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 151 bits (381), Expect = 5e-35, Method: Composition-based stats. Identities = 66/142 (46%), Positives = 92/142 (64%), Gaps = 4/142 (2%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D + R IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHDGA--RSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDS--Y 223 G IT DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + E + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AMSEKSHGREEIRLHIVCDVPD 245 + ++K+HGR E R + + Sbjct: 119 SQTDKNHGRIETRRCVATNDVA 140 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 55/142 (38%), Positives = 78/142 (54%), Gaps = 3/142 (2%) Query: 101 NDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 VIAI+GK+LR + + A+H +SA++ + L +GQ+ +KSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH 220 L ++G ++T DA+GCQ +AE+I GGDY+ AVK NQ L A + F Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 221 DS---YAMSEKSHGREEIRLHI 239 + +K HGR E R Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 150 bits (378), Expect = 1e-34, Method: Composition-based stats. Identities = 49/180 (27%), Positives = 85/180 (47%), Gaps = 4/180 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD+R A + L +LLL I +S + +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 -ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR- 122 P T RV+ I F NW+ ++D + +DGK+++ + + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 123 -RGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + I+V+S FS + I Q +K+ +EI + LL LD++G + T D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 65/326 (19%), Positives = 116/326 (35%), Gaps = 43/326 (13%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 + G P ++T+ +++C+ WM + A DGK L S Sbjct: 13 RWRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVLGGS 71 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K A+H + + + + Q + + A+ LL + G++++ DA Sbjct: 72 --KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFL 128 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP------------------------- 211 + + I ++ G+YL VKG+Q ++ P Sbjct: 129 NAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPR 188 Query: 212 --------LKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD--ELIDFTFEWKGLKKLC 261 +EL E+S GR EIR V D D + + W+ + ++ Sbjct: 189 RKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVTQIG 248 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 + E +SS T +F +IRNHW +EN++H D M ED Sbjct: 249 GLRRWCRRRHADLWTVEEVTV--VSSRQRTPAQFLASIRNHWTIENQVHRPRDGSMQEDR 306 Query: 322 CKIRRGNAAELFSGIRHIAINILTND 347 R + + R++ IN++ Sbjct: 307 LHGR--AIGVILAVCRNVVINLIRRH 330 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 147 bits (372), Expect = 5e-34, Method: Composition-based stats. Identities = 47/180 (26%), Positives = 81/180 (45%), Gaps = 3/180 (1%) Query: 20 WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVV 78 H L +L L AV+ + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 I P + W+ + + + +A+DGK LR S D H ++A++ + Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGDV--PGPHRVAAYAPHAAA 119 Query: 139 VIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 V+GQI+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ +G Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQ 179 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 147 bits (371), Expect = 7e-34, Method: Composition-based stats. Identities = 58/170 (34%), Positives = 88/170 (51%), Gaps = 11/170 (6%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 K + I + G DY+ AVKGNQ RL++ + L +E+ R Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQIK----LTTEQRLPVSLDITTERRSDRITT 56 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V D+L +++W+GL++L F + +P + YYISS + A +F Sbjct: 57 RS---VSVFDDLSGISYDWEGLQRLVKVERF----GTRAGKPYHQIVYYISSLTINAAQF 109 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 A IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL Sbjct: 110 AQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILR 159 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 146 bits (369), Expect = 1e-33, Method: Composition-based stats. Identities = 58/245 (23%), Positives = 96/245 (39%), Gaps = 17/245 (6%) Query: 28 GILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFH 87 +L + + A + + + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ECFINWMRDCHSSNDKDV---IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 ++ +S+D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDKKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRL 202 +KSNEI + LL +L ++T DAM Q A+ I YL VK NQ ++ Sbjct: 120 VAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 A P E+ D + HGR + R + + + K++ Sbjct: 180 L-ARITALPWAEVPAAATD----DSRGHGRVKTRTLQIITAARGIG-----FPYAKQIIR 229 Query: 263 AVSFR 267 R Sbjct: 230 ITRER 234 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 146 bits (369), Expect = 1e-33, Method: Composition-based stats. Identities = 51/196 (26%), Positives = 87/196 (44%), Gaps = 9/196 (4%) Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHI 239 ++E+ ++ DY+ A+KGN + + ++ F + + +K HGR E R++ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSPVTST--RSVHTTFDKGHGRIERRIYT 58 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 D + EWK L + S +K + +RY+I+S ++FA + Sbjct: 59 -LDTNIGWFEDKKEWKHLAGFGMVDSM----VTRKGKECREIRYFITSVT-DVKQFAKGV 112 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 +HW +EN LHW LDV+ +D+C + NAAE + IR I N + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKR- 171 Query: 360 KAAMDRNYLASVLAGS 375 D + A +L Sbjct: 172 ACIYDDEFRAQILFSC 187 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. Identities = 58/180 (32%), Positives = 87/180 (48%), Gaps = 5/180 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK-QY 61 + L + + IPD+R+A L +LL +I A++SGA S+ I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSR 121 G P + +I + + F VIA+DGKTLR S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARLAE--GAAVIALDGKTLRGSLDRFE 118 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD--KKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A V+SAF+T +V+GQI + K +EI A L+ L + G++ T DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 60/194 (30%), Positives = 84/194 (43%), Gaps = 11/194 (5%) Query: 186 KQGGDYLF--AVKGNQGRLNKAFEEKFPLKELNNPEHDS---YAMSEKSHGREEIRLHIV 240 +G + +G L A + F + + +K HGR E R Sbjct: 91 DRGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTA 150 Query: 241 CDVPDEL--IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D L + WK + + S R I + E RY ISS +E+ A Sbjct: 151 AGDLDWLATLGLKERWKKITSVAGIDSSRVI----GSKTETDRRYVISSLPADSERILHA 206 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R HW +EN LHW LDV ED C IR NAA FS +R A+N+ D GL +K Sbjct: 207 VRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFRADHSRAMGLPKKR 266 Query: 359 RKAAMDRNYLASVL 372 + AA + +YLA++L Sbjct: 267 KAAAWNPDYLANIL 280 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 56/208 (26%), Positives = 86/208 (41%), Gaps = 15/208 (7%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +L +++ GK IT DA+ QK +AE I + YLF VK NQ L + F Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEH-- 59 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 E D HGR + R +E ++F + + +S + Sbjct: 60 --RKEPDYCLQDPPGHGRIDTRSIWTTTELNEYLEFPHVGQAF-----CIHKKSYDPKTN 112 Query: 275 KEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 K E T Y ++S + R HW +EN H+ LD +ED +IR GN Sbjct: 113 KVCENTF-YGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRKMR 359 + +R AI +L + V + +K+R Sbjct: 172 NTNRLRGFAIGLLKSKGVK--DIAQKVR 197 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 140 bits (353), Expect = 8e-32, Method: Composition-based stats. Identities = 74/318 (23%), Positives = 122/318 (38%), Gaps = 45/318 (14%) Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS-------SNDKDVIAIDGKTLR 114 G P T+ R+++ SPA E ++D + V++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 HSYDKSRRRGAIHVISAFSTMHS------------------LVIGQIKTDKKSNEITAIP 156 D + +GA SA+ S +GQ K E TA Sbjct: 153 SRTDGEKVKGAQQ--SAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFR 210 Query: 157 ELL----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL 212 LL L + +I+T DA C ++ AE + G Y+F +K NQ L+ + Sbjct: 211 RLLPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHDIARDY-GQ 269 Query: 213 KELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 +L P +E+ G +R DV + L +C + R Sbjct: 270 YDLGTPLA---RTAERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDRR---- 322 Query: 273 QKKEPEMTVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + + RY+++S LT ++ +R HW +EN HW +DV++ ED+ + + Sbjct: 323 -GEIVAVEQRYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASR 381 Query: 330 A--ELFSGIRHIAINILT 345 A E S +R I N ++ Sbjct: 382 ASIETVSWLRLIGYNAVS 399 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 56/167 (33%), Positives = 86/167 (51%), Gaps = 13/167 (7%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K + SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPEHDSYAMSEKSHGREEIRLHIVCDVP 244 DY+ +K NQG L ++ E+ F +H +Y E HG EIR P Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIRNFGFQLDP 120 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 D + W LK + + I + + + RY+ISS D Sbjct: 121 DSV------WSNLKSVGMVE----PIGQVDDKTTVETRYFISSLDSN 157 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 63/201 (31%), Positives = 97/201 (48%), Gaps = 13/201 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDTAKNSP 61 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 LN ++ ++K HG E H + + +W GL++ +S R Sbjct: 62 LNA-----WSWTQKGHGHE---SHCRLKIWEATESMKMQWAGLERF---ISIRRQGFRHH 110 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 K+ + T Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 111 KKFDSTT-YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINILTNDKVFKAGLR 355 +R+IA N L V L+ Sbjct: 170 ILRNIAFN-LRLGTVSNPSLK 189 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. Identities = 60/155 (38%), Positives = 89/155 (57%), Gaps = 3/155 (1%) Query: 102 DKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNM 161 D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT++KSNE TAIP+L + Sbjct: 6 PGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKLFTL 65 Query: 162 LDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNP 218 L ++ +T DA+G Q+DIA++I + DYL VK NQ L++ + + K Sbjct: 66 LALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGFTED 125 Query: 219 EHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFE 253 DS HGR + V L + Sbjct: 126 FTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALADK 160 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. Identities = 47/190 (24%), Positives = 77/190 (40%), Gaps = 6/190 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+ H+ IPD R V +LL+ + ++S ES D+E F H L + Sbjct: 12 DLISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGI 71 Query: 65 ENGIPVHDT-IARVVSCISPAKFHECFINW--MRDCHSSNDKDVIAIDGKTLRHSYDK-- 119 E P D+ + A +W + + D D + DGKTLR S + Sbjct: 72 ELKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTS 131 Query: 120 SRRRGAIHVISAFSTMHSLVIGQ-IKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 I ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 132 GGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 Query: 179 DIAEKIQKQG 188 Q +G Sbjct: 192 AFFGSSQSRG 201 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. Identities = 58/146 (39%), Positives = 78/146 (53%), Gaps = 7/146 (4%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH---DSYAMSEKSH 230 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F N E D KSH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL 290 GR E R V L D + W+GL+ + + S R++ K++ + RYYISS Sbjct: 61 GRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTL----KEKTTIEHRYYISSTMA 116 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVV 316 TA + R HW +EN LHWRLD+ Sbjct: 117 TAAYLLNSSREHWGIENSLHWRLDIA 142 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 134 bits (336), Expect = 6e-30, Method: Composition-based stats. Identities = 52/171 (30%), Positives = 81/171 (47%), Gaps = 9/171 (5%) Query: 205 AFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F++ + L E +SY EK HGR+E+R V +W +K + V Sbjct: 2 QFQDYWALPEDKQ---ESYITEEKGHGRKEVREVYVLPAAFS-EALRQKWCLVKSIVAVV 57 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 RS+ K + YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I Sbjct: 58 RDRSV----KGKGSYETSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRI 113 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 G++A + R N+ + + RKM +AA +++Y VL S Sbjct: 114 YAGDSALNMACCRRFVQNLFRKSE-GNLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 132 bits (333), Expect = 1e-29, Method: Composition-based stats. Identities = 44/187 (23%), Positives = 83/187 (44%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L + +S +PD R A + L G+L L + A +S +S +E F + L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 P H I ++ + P K + + +V+ +DGK LR S + Sbjct: 60 RKAPGHTAITLLLHRLDPEKLQAALGQVFPEA---DLGEVLVVDGKHLRGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q + + + E A ELL+ L +++GK++ DA ++A Sbjct: 115 VKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAA 172 Query: 183 KIQKQGG 189 +++K+GG Sbjct: 173 RVRKKGG 179 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 131 bits (330), Expect = 3e-29, Method: Composition-based stats. Identities = 49/167 (29%), Positives = 80/167 (47%), Gaps = 9/167 (5%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+A K + HKL +++L I +S S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H +++ IDGK R Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML 162 + K+ R I +SA S + + ++KSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 130 bits (327), Expect = 8e-29, Method: Composition-based stats. Identities = 54/157 (34%), Positives = 79/157 (50%), Gaps = 4/157 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 G H++SA++T H + +G + T++KSNEITAI LL L K ++T DAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPEHDSYAMSEKSHGREEIRLHIV 240 I GGD++ AV+ NQ +L A E H ++ HGR + R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEP 277 VP + EW +K + AV + + + Sbjct: 122 AQVPPD-FAAKGEWPWIKAIGTAVRITTHPDGTQTDE 157 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats. Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 4/119 (3%) Query: 248 IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVEN 307 +D W LK + + S I + + + RY+ISS D E+ A ++R+HW +EN Sbjct: 9 LDPDSVWSNLKSVGMVES----IGQVDDKTTVETRYFISSLDSNGEQLANSVRSHWAIEN 64 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 65 SLHWVLDVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 84/99 (84%), Positives = 90/99 (90%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 IAINILTN++VFKA RRKMRKA MD+NYLASVLAG+G Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 66/359 (18%), Positives = 113/359 (31%), Gaps = 72/359 (20%) Query: 26 LSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 L+ +L L V++G +++ + ++ L GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FHECFINWMRDCH--SSNDKDVIAIDGKTLRH--SYDKSRRRGAIHVISAFSTMHSLVIG 141 E W+ +A DGKTL+ S+ ++ V+ A + G Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITAG 167 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 + +EI A+ L LD+ ++TT ++G Sbjct: 168 HQRVVG-GDEIAALEALAGRLDLTDVLVTT-------------AEKG------------- 200 Query: 202 LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 HGR E+R V + + G K++ Sbjct: 201 ----------------------------HGRVEVRSLKALTVTTPKLVGFW---GTKQVI 229 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT-------AIRNHWHVENKLHWRLD 314 P ++ + L AE+ R HW VE +H D Sbjct: 230 ELRRRTRRKKTVTAAPTVSEEVFYLVTSLPAEQAHPRDLAARARARGHWTVEA-IHHVRD 288 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 V++ED R NA ++ R AI+ L + + +R A + +A Sbjct: 289 RVLDEDRHTARTANAPLAWAIARDTAISALRL--TGHRSIAKALRTTARQPERVLQTIA 345 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 4/109 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW L Sbjct: 7 WEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHWCL 62 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 D+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 63 DIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. Identities = 46/187 (24%), Positives = 85/187 (45%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L E +S IPD R A ++ L G+L L + A +S +S +E F + L G Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 P H + ++ + P K E + ++ +V+ +DGK L+ S + Sbjct: 60 RKPPGHTILTLLLHRLDPEKLQEALLQVFP---GADLGEVLVVDGKHLKGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q K + + + A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGRED--QALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 127 bits (319), Expect = 6e-28, Method: Composition-based stats. Identities = 79/369 (21%), Positives = 128/369 (34%), Gaps = 41/369 (11%) Query: 10 ISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIP 69 + +PD R L+ IL + +++GA S + E+ + ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCISP-----AKFHECFIN-WMRDCHSSNDK--DVIAIDGK-----TLRHS 116 DT AR C P A H W R + D V+A+DGK TL H Sbjct: 81 --DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHP 138 Query: 117 Y------DKSRRRGAIHVISA--FSTMHSLVIGQIKTDKKSNEITAIPELL-NMLDIKGK 167 D G ++ S I + ++NE +L +++ G Sbjct: 139 LIQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGA 198 Query: 168 ---IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYA 224 ++T DA + + G DY+FA+K + + K E E+ D Sbjct: 199 LFQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARRED--V 256 Query: 225 MSEKSHGREEIRLHIVCDVPDELIDFTFE---WKGLKKLCVAVSFRSIIAEQKKEPEMTV 281 + + EI++ V E W + S + E Sbjct: 257 LDNATTATREIQILAVDPSHGYGAGKGPEESVWSHARTFLRVTS---TVRRSGVVIERDS 313 Query: 282 RYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCK--IRRGNAAELFSGI 336 R ++SS LT +++ +R HW VEN H LD ED+ N + Sbjct: 314 RLFVSSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDERPWIAADANGMLAVLLL 373 Query: 337 RHIAINILT 345 R IA +L Sbjct: 374 RRIAYTLLA 382 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 124 bits (312), Expect = 4e-27, Method: Composition-based stats. Identities = 46/202 (22%), Positives = 77/202 (38%), Gaps = 50/202 (24%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPEHDSYAMSEKSH 230 MGCQK+IA+ I KQ DY+ A+KG+ L +A+ K + D + + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL 290 GR E R V ++ ++W GLK + S Sbjct: 61 GRIETRRCQQVLVNKSWLNNKYQWVGLKSIIKVTS------------------------D 96 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 EK T + +IR+G F+ +R IA+ + ++ Sbjct: 97 VHEKTTT-----------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQTK 133 Query: 351 KAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 134 RASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 124 bits (311), Expect = 6e-27, Method: Composition-based stats. Identities = 46/187 (24%), Positives = 77/187 (41%), Gaps = 17/187 (9%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK--------FPLKELNNPEHDSYAM 225 M Q D+ +Q++GGDY+ K NQG L E FP + D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 E S G + + L ++ W G++++ R + + + V Y I Sbjct: 61 CEVSKGHGWVERRTMTS-TIWLNEYLTRWPGVQQVFRLTRTRQV----GGKTTVEVVYGI 115 Query: 286 SSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS A R HW +E++ H D + ED C++RRG A + + +R++A+ Sbjct: 116 SSLSSVAAAPDALLRYTRTHWGIESR-HHIRDATLGEDRCRVRRGAAPRVLAVLRNVAVY 174 Query: 343 ILTNDKV 349 +L Sbjct: 175 LLRRLGT 181 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 122 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 64/118 (54%), Gaps = 4/118 (3%) Query: 261 CVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R+I+A E VRYY++S D T EK A+AIR HW + N LHW+LDV E Sbjct: 1 VRIKSERTIVAI--GEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFRE 58 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 D K + NAA FS +A+ IL N+K K + K KA D NYL+ +L + Sbjct: 59 DYSK-KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLLQDNNF 115 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 122 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 46/211 (21%), Positives = 91/211 (43%), Gaps = 14/211 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK-- 59 + + +++ IPD R+ K +H+ +LL+ + AV SG + + + + FL Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 60 ---QYGDFENGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSNDKDV-----IAID 109 + E +P T+ R+ + + ++W R+ + K+ +A+D Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVD 124 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKI 168 GK LR + R A+ +SA L +G Q D ++ + + L + + Sbjct: 125 GKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVD-WV 183 Query: 169 ITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 +T DA C +++A + +Q G A KG + Sbjct: 184 LTGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 33/128 (25%), Positives = 61/128 (47%), Gaps = 3/128 (2%) Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 + + + GLK + + + + R+ ISS DL + A+R+HW Sbjct: 20 KKWLAKAYRRSGLKSIIKV--HTQVHDKSTGKDTAETRWNISSLDLHVVQALNAVRSHWQ 77 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 VE+ +HW LD+ D+ +I R +F+ +R IA+ + D + RK + A +D Sbjct: 78 VES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIAMTLFKQDTTKLVSMARKKKMAGLD 136 Query: 365 RNYLASVL 372 +Y +++L Sbjct: 137 DDYRSNLL 144 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 46/176 (26%), Positives = 76/176 (43%), Gaps = 15/176 (8%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRL 237 K E + G D L +KGN +L A + SY + R E R Sbjct: 6 KKTVETVLATGNDLLVQLKGNHPKLLAAVRTLC---QSRAHAEQSYTVDLGRRNRIEQRT 62 Query: 238 HIVCDVPD------ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 + +P F +G +++ V + +++ P YY+++ + Sbjct: 63 VRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPRQESP----AYYLATCTAS 118 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 119 AATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRN--PGVFALLRHFALNLLRHN 172 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 40/92 (43%), Positives = 59/92 (64%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 +E S IPD R +H I+ L +F+V++GA+S+ +IEDF E H+D+LK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 NGIP HDT +RV S I+PA F + F+ W++ Sbjct: 65 NGIPSHDTFSRVFSAINPASFQDSFLIWLKAI 96 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 114 bits (284), Expect = 7e-24, Method: Composition-based stats. Identities = 49/205 (23%), Positives = 85/205 (41%), Gaps = 18/205 (8%) Query: 100 SNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL 159 + + IA+DGK L+ S + R H++SA + + + +++ K+NE T LL Sbjct: 128 AGPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKPLL 185 Query: 160 NMLDIKGKIITTDAMG-CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 LD+ ++T DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 186 APLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIPV- 243 Query: 219 EHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 +A SE HGR E C +PDEL + L A+ K Sbjct: 244 ---QHAASEVGHGRRESSSIKTCAIPDELGGIAYPHARL-----AIRVHRRCQPTGKRES 295 Query: 279 MTVRYYISSADLTAEKFATAIRNHW 303 Y ++S D A R W Sbjct: 296 RESVYAVTSLDAH-----QATRPIW 315 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 43/96 (44%), Positives = 62/96 (64%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 +A+N + +K A + RK + A M L ++ Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVLDLIVNA 96 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 74/88 (84%), Positives = 77/88 (87%) Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 EQKKEPEMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAA Sbjct: 18 TEQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAA 77 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKM 358 ELFSGIR IAINILT DK+ KAG R KM Sbjct: 78 ELFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 107 bits (268), Expect = 5e-22, Method: Composition-based stats. Identities = 44/112 (39%), Positives = 65/112 (58%) Query: 263 AVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 A+ + +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E Sbjct: 3 AIGMTINLVKQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQS 62 Query: 323 KIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 +IR+G+A FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 63 RIRKGHADINFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLG 114 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 53/128 (41%), Positives = 70/128 (54%), Gaps = 1/128 (0%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREE 234 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + + Y E+S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 71 SRAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEE 129 Query: 295 FATAIRNH 302 TA R H Sbjct: 130 LLTASRLH 137 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 103 bits (257), Expect = 9e-21, Method: Composition-based stats. Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 3/107 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTS--DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 38/125 (30%), Positives = 63/125 (50%), Gaps = 11/125 (8%) Query: 224 AMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY 283 SEK HGR E R + +WKGLK+ R++ K + + V Y Sbjct: 2 TTSEKGHGRIEKRTLETTPIVT----VGQKWKGLKQGLRITRERAV----KGKKTVEVVY 53 Query: 284 YISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 I+S A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ Sbjct: 54 GITSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVV 113 Query: 341 INILT 345 +++L Sbjct: 114 VHLLA 118 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 43/172 (25%), Positives = 64/172 (37%), Gaps = 10/172 (5%) Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSH-GREEIRLHIVCDV 243 G L +K NQ L+ A E +P D + E R E R V + Sbjct: 2 IATGNHLLVQLKRNQPLLHDAMVEYT----RGHPFVDEHHTHEIGRRNRIEKRAVHVWHL 57 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQ--KKEPEMTVRYYISSADLTAEKFATAIRN 301 L + + L + YY+ L A +F+ AIRN Sbjct: 58 HPSLGSAPWY-DHFRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRN 116 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 HW VEN+ H+ D ED +IRR F+ +R A+N++ ++V Sbjct: 117 HWRVENRAHYVRDTRFQEDASRIRRN--PCTFALLRSFALNLMRFNRVENIS 166 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 99.0 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 36/142 (25%), Positives = 65/142 (45%), Gaps = 6/142 (4%) Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R + +P + + G+K + + S + RYY++S + Sbjct: 2 RRRYFAYRLPKTINTGSLV--GIKSIIATETISS--KTNETAISAEWRYYVTSHETEKSD 57 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKA 352 +RNHW +EN+LHW LDV +N+D K R A FS I+ + ++++ K Sbjct: 58 LHLYVRNHWSIENELHWHLDVHLNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKR 117 Query: 353 GLRRKMRKAAMDRNYLASVLAG 374 +R ++++ D YL S+L+ Sbjct: 118 SVRSRLKQVGWDTEYLVSLLSA 139 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 97.5 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 4/120 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++H I D R +H L I+LL I AV+SG+E WE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSNDKDVIAIDG--KTLRHSYDKSR 121 GIP HDTIARV+ + + + + D + + G + H + Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREG 126 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 3/79 (3%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPEHDSYAMSEKSHGREE 234 K+IA+ I KQ DY+ A+KG+ L +A+ K + D + + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 IRLHIVCDVPDELIDFTFE 253 R V ++ + Sbjct: 147 TRRCQQVLVNKSWLNNKYR 165 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 96.7 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 34/122 (27%), Positives = 57/122 (46%), Gaps = 11/122 (9%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISS 287 K HGR E R L ++ W G++++ R + + V Y ISS Sbjct: 3 KGHGRVERRSITTTT---WLNEYLTRWPGVQQVFRLERQRR----ADGKTTVEVVYGISS 55 Query: 288 ADLTAE---KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 A R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ +L Sbjct: 56 LSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYLL 114 Query: 345 TN 346 Sbjct: 115 RR 116 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 96.3 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 42/85 (49%) Query: 7 MEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFEN 66 ++H + D R L I+ + I AV++GA+ + IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSCISPAKFHECFI 91 GIP HDT RV+ + P + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 95.9 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 2/107 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++H + D R +H L I+LL I AV+SG+E WEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSNDKDVIAIDG 110 GIP HDTIARV+ + + + + D + + G Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQG 113 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 7/132 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 ++ HGR R + +P+EL + G+K R + + + YYI+ Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHNHAL--SGIKSCIAVE--RIVQEGKGEPKTSHFSYYIT 89 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 + + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 90 NHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLVKA 148 Query: 347 DK--VFKAGLRR 356 K ++ Sbjct: 149 KDWAGKKKSVKS 160 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 38/126 (30%), Positives = 60/126 (47%), Gaps = 7/126 (5%) Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV 281 + S +S GREE R V + + EW+ ++ + ++ + Sbjct: 3 EHTHSIQSRGREEHRCIQVY---EPVGIALQEWEAIRSVLCVQR----WGTRQGKAYHNT 55 Query: 282 RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAI 341 YYISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I I Sbjct: 56 AYYISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVI 115 Query: 342 NILTND 347 NIL + Sbjct: 116 NILRLN 121 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 33/83 (39%), Positives = 50/83 (60%), Gaps = 1/83 (1%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE- 182 A+H++SAF + +V+ Q+ +KSNEI A ELL LDI G +T DAM Q++ A Sbjct: 7 KAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARF 66 Query: 183 KIQKQGGDYLFAVKGNQGRLNKA 205 ++ + D++ VK NQ L +A Sbjct: 67 AVEDKRADFVMTVKDNQPELREA 89 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 95.2 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 28/131 (21%), Positives = 55/131 (41%), Gaps = 6/131 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + L ++ +PD R + L IL + + AV++GA ++ I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRD------CHSSNDKDVIAIDGKTLRHSY 117 F + +P T+ R++ I + W+R VIA+DGK +R + Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKSRRRGAIHV 128 ++ A+ + Sbjct: 149 LRAAGPSALGL 159 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 94.8 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 32/78 (41%), Positives = 50/78 (64%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++++E + + D R A + +H L IL+L + AV+SGA+ W+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARVVSC 80 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRVSLR 84 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 94.8 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 27/77 (35%), Positives = 49/77 (63%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + +++H S + D RQ+W+V + L I LL + A +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV 77 + +E G+P HDT+ + Sbjct: 77 FLPYERGLPAHDTLKGL 93 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 94.0 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 42/173 (24%), Positives = 72/173 (41%), Gaps = 16/173 (9%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ L E + + P +++ K R+E R V V D L ++ Sbjct: 1 MKANQSNLF---ETACAIAANDAPADTAFSR-NKGRSRQEDRTVEVFPVGDALAGTEWQ- 55 Query: 255 KGLKKLCVAVSFRSII--AEQKKEPEMTVRYYISSA-DLTAEKFATAIRNHWHVENKLHW 311 +K + + A + V +Y+SSA + A +A AIR HW +EN+ H+ Sbjct: 56 PFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNHY 115 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 DV +ED +IR + + R A+NI+ + + +A + Sbjct: 116 VRDVSCDEDKSRIRDN--PGIMARARSFALNIMRKNGIANVA------QALWN 160 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 51/120 (42%), Gaps = 5/120 (4%) Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV-RYYISSADL 290 R E + V L+ ++ L+++ + K E + +SS Sbjct: 1 RIETQTIRVSS----LLKGYSDFPHLEQVFRIDRVTRFKKKGKTRKETALGVTSLSSGQA 56 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 + + +R HW +EN+LHW D V ED C R GN A + + +R++ I++L Sbjct: 57 SPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGAHVMATLRNMTISLLRVAGSK 116 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 90.9 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 60/151 (39%), Gaps = 9/151 (5%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG-----ETHLD 56 +++ L ++ + +PD +A H+L +L L A + G + ++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 F + + +P I + + P W N ++ +A+DGK ++ Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAW--QAAQLNSEEALAMDGKIMKGG 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDK 147 D + + H++S + Q K+ + Sbjct: 125 VDHTGAQT--HIVSLIGHESKHCVAQKKSAR 153 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 90.5 bits (223), Expect = 1e-16, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 60/148 (40%), Gaps = 9/148 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG-----ETHLD 56 +++ L ++ + D R+ H++S +L + A + G + ++ I + + Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 F + + + IP I V+ P + + D + +A DGKT++++ Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNED--QGLEDTCLAFDGKTMKNA 331 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIK 144 D++ R+ H+ S Q K Sbjct: 332 IDENARQT--HIASVVGHESKTTHTQKK 357 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 89.0 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 43/75 (57%) Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K Sbjct: 15 VRLPRPTRHQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKR 74 Query: 359 RKAAMDRNYLASVLA 373 A M+ ++ +L Sbjct: 75 LLACMEDDFREELLG 89 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 88.2 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 39/81 (48%), Positives = 53/81 (65%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + + E++S D R A+ +H I+ L + AVISGA SW +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCI 81 Y FE GIPV DTIARV+ I Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 88.2 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 37/154 (24%), Positives = 55/154 (35%), Gaps = 14/154 (9%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ + P + + S HGR E R C + DEL F Sbjct: 2 IKRNQPTTYRQL-AALPWPDSAV----QHTASSAGHGRRESRSIKTCGIADELGGIAFPH 56 Query: 255 KGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHW 311 L A+ + Y ++S D T + A A+R HW VE H Sbjct: 57 GRL-----ALRVHRRRKQTGGCESRETVYAVTSLDAHETTPAELAAAVRGHWTVEALRH- 110 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 DV E+ + G A + R++A+ +L Sbjct: 111 VRDVTYAEEASTLHTGTAPRAMATFRNLAVGLLK 144 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 88.2 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 31/117 (26%), Positives = 49/117 (41%), Gaps = 6/117 (5%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 L+ S I D R+ + L+ +LL T+ A+++GA S+ ++ F THLD L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 F-ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND-----KDVIAIDGKTLR 114 P + T+ ++ I + F + IAIDGKT Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWC 119 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 88.2 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 36/130 (27%), Positives = 57/130 (43%), Gaps = 2/130 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +L ++ IPD+R+A + L+ +LL +I AV+SGA S+ I+ F + H + L Sbjct: 2 QLKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQL 61 Query: 65 -ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV-IAIDGKTLRHSYDKSRR 122 PVH +I + + F + IA+DGKTLR + + R Sbjct: 62 HWKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSR 121 Query: 123 RGAIHVISAF 132 SA Sbjct: 122 TARPLRYSAH 131 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 5/128 (3%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH 220 M +KG ++T DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 + E SHGR R V + E + W ++ L V R A + Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLTPE-TKHSGSWPDIQALLVTEKIRQ--AHYSETVT 117 Query: 279 MTVRYYIS 286 RYY+S Sbjct: 118 SDFRYYLS 125 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 85.5 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 53/129 (41%), Gaps = 6/129 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG-----ETHLD 56 +++ L ++ + +PD R+A H+L + LT A + G + ++ + ++ Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 F + + +P I + + P W + ++ + +A+DGK ++ Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAW-QAAQLNSSDEALAMDGKIMKGG 177 Query: 117 YDKSRRRGA 125 D + + Sbjct: 178 VDHTGAQTQ 186 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 84.8 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 38/133 (28%), Positives = 50/133 (37%), Gaps = 8/133 (6%) Query: 224 AMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK---KEPEMT 280 HGR+E R V DV L W GL V+ + + K Sbjct: 6 TTDRGRHGRQEHRWVEVFDVSGRLGPT---WDGLIAAVARVTRLTWHKDTKSGLWHKTQE 62 Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 Y +L A TAIR HW VE + H+ DV ED +IR F+ +R A Sbjct: 63 TALYACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIRTK--PGHFARLRSFA 120 Query: 341 INILTNDKVFKAG 353 +NIL + Sbjct: 121 LNILRANGTNNIS 133 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 38/86 (44%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 + ++ + + + D R +H+ I+++ + V+ G + I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSCISPAKF 86 + + NG+P D I + + P F Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 68/212 (32%), Gaps = 34/212 (16%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFA-VISGAESWEDIEDFGETHLDFLKQYG 62 + + E + + D R + + + + + +G + + + Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 63 DFE------NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD------------ 104 +P TI + + + ++ D ++ Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAA 141 Query: 105 -------------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNE 151 +A+DGKT RH+ K +H++ S ++ Q++ + K+NE Sbjct: 142 PVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAKTNE 199 Query: 152 ITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 LL LD+ ++T DA+ + + Sbjct: 200 TAVFRRLLRPLDLTNVLVTADALHTVRANLDT 231 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 82.5 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 9/118 (7%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRH 115 + P T+ RV+ I NW+ S +A+DGKTL Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLGLSPA--ALAVDGKTLAG 130 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 82.1 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 51/167 (30%), Positives = 74/167 (44%), Gaps = 30/167 (17%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLL----TIFAVISGAESWEDIEDFGETHLDF 57 +LKKL+E S IPD R+A V+H+L+ +LL +F + S E+ D+ L Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSR--PAFLQA 136 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC--------HSSNDKDVIAID 109 L+ +P DT+ARV+ I P K E FI +R H N IAID Sbjct: 137 LQGLFPELETLPHGDTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIAID 196 Query: 110 G--KTLR-------------HSYDKSRRRGAIHVISA-FSTMHSLVI 140 G K +R + D + + I+V+ A F + L I Sbjct: 197 GTQKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTI 243 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 25/69 (36%), Positives = 39/69 (56%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M L+ H + I D RQ+ KV + L +L +T+ VI+GAE W +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIP 69 G G+P Sbjct: 72 KGILTEGVP 80 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 51/108 (47%), Gaps = 4/108 (3%) Query: 29 ILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVVSCISPAKFH 87 +L L + AV++G + E I FG L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTM 135 W+ D H + D IA+DGK L S D + H+++A++ Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGSRDGAV--PGTHLLAAYAPQ 107 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 80.9 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 46/106 (43%), Gaps = 1/106 (0%) Query: 261 CVAVSFR-SIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R E + + +Y+SS + +A + IR HW VEN++H+ DV E Sbjct: 12 GRTRSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGE 71 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 D +IR +++S R A+N+ + + ++ + Sbjct: 72 DRSRIRTLPLVQVWSVARSFALNLYRSLLMANRAQAQRRCMFGLST 117 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 79.8 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 48/113 (42%), Gaps = 6/113 (5%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +L ++S IPD+R+A + L+ +LL +I A++SGA S+ I+ F +TH + L Sbjct: 2 QLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQL 61 Query: 65 -ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN-----DKDVIAIDGK 111 P H +I + + F + VI + K Sbjct: 62 HRKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 79.8 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 30/129 (23%), Positives = 47/129 (36%), Gaps = 13/129 (10%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + L E ++ + D R+ H +LL+ AV++GA S+ I ++ + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRH 115 P TI RV+ P + H D +AIDGK+ R Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SYDKSRRRG 124 S S R Sbjct: 115 SRLGSTRPP 123 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 35/88 (39%), Gaps = 3/88 (3%) Query: 266 FRSIIAEQKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 R + + + Y I+S + + R HW +EN LH+ D ED Sbjct: 33 HRIFTKVKTGKKTEEIVYGITSLTQQKASPKTILKFSRGHWSIENGLHYVRDTAFREDHS 92 Query: 323 KIRRGNAAELFSGIRHIAINILTNDKVF 350 +IR NA + ++++ + + V Sbjct: 93 QIRTQNAPRAMASLKNLVVGLFHFLNVP 120 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 5/109 (4%) Query: 266 FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIR 325 R ++ + E Y ++S A++ R HW VEN+LH + D V+ ED + R Sbjct: 14 RRRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKRDTVLGEDASRSR 73 Query: 326 RGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 +G A ++ +R + +N+L + + + R +RK + D L ++ G Sbjct: 74 KGAAGLMY--LRDVILNLL---HLKRWPVLRSVRKFSADPKVLLRLIRG 117 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 78.6 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 36/129 (27%), Positives = 52/129 (40%), Gaps = 13/129 (10%) Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRD----CHSSNDKDVIAIDGKTLRHSYDKS 120 PV+ ++ ++ I P F C + IAIDGKTLR S+D Sbjct: 8 LRRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAF 67 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL---------NMLDIKGKIITT 171 A +V+SAF+ H +++ D+KSNEI A L+ I + Sbjct: 68 SDTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVML 127 Query: 172 DAMGCQKDI 180 DAM I Sbjct: 128 DAMTFAPAI 136 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 76.7 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 30/96 (31%), Positives = 40/96 (41%), Gaps = 4/96 (4%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPEHDSYAMSEK 228 D +GCQK IA+ I +Q DYL AVK NQ L++A F D K Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKINK 67 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 GR E R V + I + W L+ + + Sbjct: 68 GPGRLEQRRCWVGYEIPDTI-NSQNWAKLETIVMVE 102 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 74.4 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 49/109 (44%), Gaps = 5/109 (4%) Query: 268 SIIAEQKKEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + + V + I+S A +R HW +EN+LH+ DV + ED C++ Sbjct: 8 TRERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRV 67 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 R G+A ++ + +R+ +++ K + + MD ++ Sbjct: 68 RMGHAPQVLAALRNAVVHLWREVKAVSCPEAIERLQ--MDPAMAKGLIG 114 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 31/153 (20%), Positives = 59/153 (38%), Gaps = 12/153 (7%) Query: 197 GNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKG 256 G+Q L + ++ K + + HGR+ + + W G Sbjct: 8 GDQKTLYRQIADQLLGKRHIPLMATDHEI---GHGRD---ILWTLRAKEAPQHIKANWHG 61 Query: 257 LKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 + ++ + ++P +I+S T + +R W VE+ HW D Sbjct: 62 TSWIAEVIAT----GTRDRKPFKATHRFITSLRTTPDALLRLVRERWSVESW-HWIRDTQ 116 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++EDD + R GN A + + +R A+N+L Sbjct: 117 LHEDDHRYR-GNGAGVMAALRTAAMNLLRLTGF 148 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 19/64 (29%), Positives = 28/64 (43%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + +PD R H L+ IL + I A++ GAES D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFEN 66 Sbjct: 60 PLPY 63 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 72.1 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 49/134 (36%), Gaps = 11/134 (8%) Query: 46 DIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRD---CHSSND 102 + + G + P I R++ I P W+ + Sbjct: 225 ALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAPAPGS 284 Query: 103 KDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE----- 157 + IA+DGKTLR S + HV++A +V+ D K+NEIT Sbjct: 285 RRAIAVDGKTLRGSRTRDSAAR--HVLAAADQHTGIVLASTDVDTKTNEITRFTASGSHA 342 Query: 158 -LLNMLDIKGKIIT 170 LL+ I+ +++ Sbjct: 343 DLLSSRCIRSGVVS 356 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 71.7 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 41/92 (44%), Gaps = 5/92 (5%) Query: 274 KKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + ++S + A + HW +EN+LHW DV +ED + R GNA Sbjct: 69 GGPATAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNAP 128 Query: 331 ELFSGIRHIAINILTN--DKVFKAGLRRKMRK 360 ++ + +R++AI IL K LR R Sbjct: 129 QVMTSLRNLAITILRLTGAKNIAKALRHHARH 160 >UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK19_ACIF5 Length = 104 Score = 71.3 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 37/93 (39%), Gaps = 3/93 (3%) Query: 273 QKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + + ++S T E R HW +EN+ H D +ED +IR N Sbjct: 2 KDGTLREDCAFGLTSLTKDRTTPENLLGIARGHWEIENRNHHVRDTTYHEDLSQIRTENG 61 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + + +R +A++IL V + A+ Sbjct: 62 PHMMATLRGLAMSILRLIGVKNIAQAGRDFAAS 94 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 68.2 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 28/81 (34%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPV 70 +PD R V H+ S IL + A +GA S+ I ++ +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSCISPAKFHECFI 91 T R ++ + + Sbjct: 109 ESTSRRFLAGVDATALDQVLG 129 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 67.8 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 50/230 (21%), Positives = 84/230 (36%), Gaps = 37/230 (16%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + L E I+ + D R V+ +S I + +F + S+ +E + K+ Sbjct: 16 VYHLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKAL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSNDKDVIAIDGKTLR 114 + +P DTI RV+S +E N + + + V+AIDG L Sbjct: 72 PKKTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELF 131 Query: 115 HSYDKSRRRGAIH--------------VISAFSTMHSLVIGQIKTDKKSN-------EIT 153 S K V S + L++GQ + K + EIT Sbjct: 132 ESTKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEIT 191 Query: 154 AIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 A L+ L + II DA+ C+ +++ G D + VK + Sbjct: 192 AGKRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKDER 241 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 67.0 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 38/59 (64%), Positives = 39/59 (66%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK 59 MELKKLMEHISIIPDYRQAWKVEHKL IL + FGETHLDFLK Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRGFGETHLDFLK 59 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 67.0 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 31/60 (51%), Positives = 34/60 (56%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 11 LLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6C2C4_9GAMM Length = 77 Score = 66.7 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 31/57 (54%) Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 ED+C++ A F+ +R IAI++L D+ K LR + RK A D +Y+ + Sbjct: 21 SFREDECRVHDPMAGGNFALLRKIAISLLVRDRSNKTSLRGRCRKVAWDNDYMRQLF 77 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 28/109 (25%), Positives = 42/109 (38%), Gaps = 11/109 (10%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 + HGR E R L+ W GLK R++ K + V + I+ Sbjct: 2 DPGHGRIETRTVRATP----LLTCHDRWTGLKHGFRITRTRTV----KGVTTVEVVHGIT 53 Query: 287 SAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 S A +R+HW +EN+ H DV + ED+ + R A Sbjct: 54 SRPVERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAGRA 102 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 27/57 (47%), Positives = 42/57 (73%) Query: 97 CHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 S + ++ DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTD+KSNE Sbjct: 17 YQKSLKEKSLSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 46/106 (43%), Gaps = 3/106 (2%) Query: 26 LSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 ++ +L + AV++GA ++ D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FHECFINWMRD---CHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 + +W+ + VIA+DGK +R + R A+ + Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWM 106 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 56/144 (38%), Gaps = 9/144 (6%) Query: 212 LKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIA 271 ++E P + G E + + G ++ R ++ Sbjct: 1 MEERRLPGETEAVWNLVRDGEVWTYRVWASPYLPEEM---RAFPGCGQVVRME--REVVR 55 Query: 272 EQKKEPEMTVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 + E TV Y ++S A + + + W VEN+ W D +++ED C++R G Sbjct: 56 KGTGEVRRTVSYALTSLGPEVADARRLGELLLSRWEVENRSFWVRDFLLHEDACQVR-GV 114 Query: 329 AAELFSGIRHIAINILTNDKVFKA 352 A++ + +R +++L V + Sbjct: 115 GAQVLAALRAFLVSLLHRQGVREK 138 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 1/63 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 ++ L + D R+ +H+L IL++ + AVI+ AES +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFE 65 Sbjct: 61 PLP 63 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 61.7 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 18/98 (18%), Positives = 35/98 (35%), Gaps = 1/98 (1%) Query: 8 EHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ-YGDFEN 66 + S + D R+A + L +L + +++SG+ S ++ F E L L + +G Sbjct: 10 DVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTSWR 69 Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD 104 P I + + + F S Sbjct: 70 KAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 24/100 (24%), Positives = 40/100 (40%), Gaps = 2/100 (2%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W GL + + R VR+ + S+ +E A AIR H + W L Sbjct: 7 WPGLTTVLATETLR--GGNGTDSVPAQVRHSLGSSTAPSEVLAQAIRRHGALATGEPWVL 64 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +V E+ ++R AA + +R +A++ D A Sbjct: 65 EVSFGEERSRVRERCAARHLALLRRVALDRRRADASLTAS 104 >UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RIX6_9ACTO Length = 90 Score = 61.3 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 16/58 (27%), Positives = 24/58 (41%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 R WH+EN+LHW DV E + R G + + +R+ AI + Sbjct: 11 AQPADLQQWARLEWHIENRLHWVRDVTFGEGTHRARTGTGPAVAAVLRNTAIGFHRGN 68 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 60.5 bits (145), Expect = 9e-08, Method: Composition-based stats. Identities = 59/385 (15%), Positives = 109/385 (28%), Gaps = 58/385 (15%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + + D R+ +++ I + F ES E ++ + +Q Sbjct: 38 VYGFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLV 93 Query: 63 DFENGIPVHDTIARVVSCIS---PAKFHECFINWMRDCHSSN-----DKDVIAIDGKTLR 114 +P HDT+ + + + H C I ++ V AIDG L Sbjct: 94 PKNIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELF 153 Query: 115 HSYDKSRRRGAI--HVISAFSTMHSLVIGQI------------------KTDKKSNEITA 154 H+ H H++V+ Q DK E T Sbjct: 154 HTKAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTV 213 Query: 155 IPELLNML-DIKGK---IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 L+ + + GK + T DA+ + + G + +K + R+ K F Sbjct: 214 AQRLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACF 273 Query: 211 PLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSII 270 + ++ + V + +W ++ V Sbjct: 274 ANRLPDSTWEERDGKGNT------------VYVQAWDEEGLAQWPQVRVPMRIVKIIRHT 321 Query: 271 AEQKKEPEMTV----------RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNED 320 + E V SS + A W +EN L D Sbjct: 322 NKTVIEANKEVFVTDVVERWIATTCSSEKADTQTIAQIAAARWDIENIGFRNLKTFNALD 381 Query: 321 DCKIRRGNAAELFSGIRHIAINILT 345 C + A + G + +A N+ Sbjct: 382 HCFVHDSVAIKAMIGFQVLAFNLKR 406 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 39/93 (41%), Gaps = 4/93 (4%) Query: 84 AKFHECFINWMRDCHSSNDK-DVIAIDGKTLRHSYD--KSRRRGAIHVISAFSTMHSLVI 140 F + WM + D D + DGKTLR S D I +S +S + I Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQ-IKTDKKSNEITAIPELLNMLDIKGKIITTD 172 Q +S+E ++ LL+ +++ ++ D Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQAD 94 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 52/144 (36%), Gaps = 10/144 (6%) Query: 193 FAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAM------SEKSHGREEIRLHIVCDVPDE 246 +G+Q L +A + L+ H A+ + + G + R VP Sbjct: 52 LVTEGDQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTAGSRQTRALKAVTVPAG 111 Query: 247 L-IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT---AEKFATAIRNH 302 L + L + ++ + E K+ Y I + + AT IR H Sbjct: 112 LGFPHAAQAIQLTRTSRPINKNTKKTEGKRRQRRETVYAICTLPAHDALPAELATWIRGH 171 Query: 303 WHVENKLHWRLDVVMNEDDCKIRR 326 W +E +L W DV + ED + R Sbjct: 172 WSIEVRLRWVRDVTLGEDLHQART 195 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 9/38 (23%), Positives = 22/38 (57%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVIS 39 + L+E ++ +PD R+ V H + +L + + A+++ Sbjct: 57 DQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLT 94 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 25/58 (43%), Positives = 36/58 (62%), Gaps = 4/58 (6%) Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + +FR +I + + RYYISS +LTAE+ A + HW +E+ +HW LDV MNE Sbjct: 1 MVENFRFVI---GNKLVLEYRYYISSKELTAEQAANTVSEHWGIES-MHWVLDVSMNE 54 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 29/62 (46%) Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 +TA T +R +W +EN++H+ D ED GN + R++AI ++ + Sbjct: 88 SVTAAYLHTHVRGNWGIENEVHYTRDAAWREDANPTYTGNTNHALASFRNLAIGVIGLNG 147 Query: 349 VF 350 Sbjct: 148 TR 149 Score = 43.6 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 28/84 (33%), Gaps = 4/84 (4%) Query: 48 EDFGETHLDFLKQYGDFENG---IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD 104 D + L L D P T+ + I F W+ + + Sbjct: 12 ADLPQPLLARLGAPLDHFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCA-QIARGRV 70 Query: 105 VIAIDGKTLRHSYDKSRRRGAIHV 128 +AIDGK LR ++ A ++ Sbjct: 71 ALAIDGKVLRGAWSGDESVTAAYL 94 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 13/46 (28%), Positives = 18/46 (39%), Gaps = 1/46 (2%) Query: 8 EHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGET 53 E IPD R V H+L +L L AV+ G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAA 114 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 50/336 (14%), Positives = 89/336 (26%), Gaps = 54/336 (16%) Query: 8 EHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG 67 E IPD R L +L+ + A S F LD ++ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLA---FQRRTLDHNLRHVFGLTG 81 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------NDKDVIAIDG-------KT 112 P + V+ + P F + ++ + V+A+DG K Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 113 ------LRHSYDKSRRRGAIHVISAFSTMH-SLVIG------QIKTDKKSN--EITAIPE 157 R + + + +A S V+ Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 L ++ DA +QK +L VK A Sbjct: 202 WLGRFREEHPDLAVLVVEDARSSNAPHVRDLQKARCHFLLGVK-------AADHAHLFAH 254 Query: 214 ELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + ++ + E + R +R + L + + + + Sbjct: 255 VCARQDQHAFEVVEDADPRTGLRRSYLWIADLPLNESNDD-------VRVNFVHLVELDP 307 Query: 274 KKEPEMTVRYY-ISSADLTAEKFATAIRNHWHVENK 308 P ++ + A A R W +EN+ Sbjct: 308 DGTPREWTWVADMAVTGANVRQLARAGRARWRIENE 343 >UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3536 Length = 130 Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 7/71 (9%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT---AEKFATAIRNHWHVENK 308 +WKGLK+ R++ + V + I+S A + +R+HW +EN+ Sbjct: 9 QDWKGLKQGFQITRERTV----NGVTTVEVVHGITSLSADRANAGALLSLLRDHWRIENQ 64 Query: 309 LHWRLDVVMNE 319 LH+ DV + E Sbjct: 65 LHYVPDVTLGE 75 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 54.3 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 56/354 (15%), Positives = 112/354 (31%), Gaps = 57/354 (16%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHK----LSGILLLTIFAVISGAESWEDIEDFGETHLDF 57 EL L+ + IPD R K HK L LL+ +F S E+ ++ L Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHEC-------------FINW-MRDCHSSNDK 103 L++ +P DT+ R++ I A + F + + CH Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 104 DVIAIDGKTLR---------HSYDKSRRRGAIHVISA-FSTMHSLV-----------IGQ 142 + G TL + + ++V+ A + LV +G Sbjct: 193 GSQKLAGDTLWAEELLQRHVGKDETRHTQYFVYVLEASLVFHNGLVIPLLSEFLEHALGD 252 Query: 143 IKTDKKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + K+ E+ L + L ++ D + + ++ + ++ +K Sbjct: 253 SEAQKQDCELRGFARLSDRLKRLFPRLPILLLLDGLYANGPVMQRCLRAHWQFMIVLKD- 311 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 + +E + ++ GR + V D+ L Sbjct: 312 -------KDLPTVWEEFRALQPRQLPTLQQDWGRRQQHFSWVNDIEYAYGSNGRCRLKLH 364 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT----AEKFATAIRNHWHVENK 308 + ++ + E + E ++SS L+ E+ R+ W +E Sbjct: 365 VVVCEERWQGVDQEARIVTETARHAWLSSQPLSRENVHERCNLGARHRWGIEAG 418 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 54.0 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 14/42 (33%), Positives = 27/42 (64%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDI 47 L++ SI+PD R + L ++++T+ AV+ GA++W D+ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDV 43 >UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQC4_9GAMM Length = 96 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 5/84 (5%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +T ++ R HW + LH+ D NED +IR G+ + + AI +L + Sbjct: 1 MTPQQVLAINRGHWSI-ASLHYISDWNYNEDRGQIRTGHGPANVTRLCRFAIGVLKHFPK 59 Query: 350 FKAGLRRKMRKAAMDR----NYLA 369 + MR+ A +YL Sbjct: 60 PGQYIPEMMRQLARRPRQVLDYLR 83 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 30/56 (53%) Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 L+M D+ + DA+G Q IAE+I + G DY+ A+K NQ +A F E Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAE 72 >UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C435B Length = 133 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 43/137 (31%), Gaps = 18/137 (13%) Query: 192 LFAVKGNQGRLNKAFEEKFPLKE-----------LNNPEHDSYAMSEKSHGREEIRLHIV 240 + K NQ L E ++ L P+ + G R+ Sbjct: 1 MLTAKDNQPGLVADIEAGLGFEDAARGLAAATSPLTGPDARATGAPGHVGGPGHGRIETR 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAEKFAT 297 L+ W GLK R++ K + V + I+S A Sbjct: 61 TVRATPLLTCHDRWTGLKHGSRITRARTV----KGVTTVEVLHGITSLTVERADARALLG 116 Query: 298 AIRNHWHVENKLHWRLD 314 +R+HW +EN+ H D Sbjct: 117 LVRSHWRIENQRHDVRD 133 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 50.9 bits (120), Expect = 7e-05, Method: Composition-based stats. Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 1/56 (1%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDF 57 EL++L ++ + D R HKL ++L+ + AVI+GA+ IE + L Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE-WLAGRLQL 73 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 50.5 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 35/108 (32%), Gaps = 7/108 (6%) Query: 41 AESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS 100 +S +E F + L G ++ + P K E + Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLL--PLLLHRLDPKKLQEALHQVFPEA--- 55 Query: 101 NDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKK 148 + V+ +DGK LR S + + ++ + + Q + + K Sbjct: 56 DLGGVLVVDGKHLRGS--GKGKSPQVRLVEVLALHLKTTLAQARVEGK 101 >UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillaceae RepID=C5D2E6_GEOSW Length = 437 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 58/351 (16%), Positives = 115/351 (32%), Gaps = 74/351 (21%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDI-EDFGETH-LDFLKQ 60 K L++ + + D R + + IL + + G +S + E F + ++ ++ Sbjct: 28 FKDLVDQLKKVKDKRHQSYITYGPETILYTILLKSVFGIKSMRSMTELFNKDECIENIRV 87 Query: 61 YGDFE--NGIPVHDTIARVVSCISPAKFH--------ECFINWMRDCHSSNDKDV-IAID 109 + N +P +DTI ++ + P + + F + +K I D Sbjct: 88 VLGLKELNELPHYDTINDFLAKLEPKELETIRIYLIKKLFEKRCLESFRILNKYWPIVFD 147 Query: 110 GK------------TLRHSY-DKSRRRGAI----HVISA--FSTMHSLVIGQIKTDKKSN 150 G LR Y DK + HV+ A L I + +S Sbjct: 148 GTGIHTFKEKHCEHCLRREYKDKETGETKVVYMHHVLEAKLVVGDMVLSIATEFIENESE 207 Query: 151 -------EITAIPELLNMLD-----IKGKIITTDAMGCQKDIAEKIQKQGGD-YLFAVKG 197 E+ A L++ L + +I D++ + + E I + Y+F K Sbjct: 208 NVPKQDCELKAFMRLVDKLKKTFKRLPICLI-ADSLYACEPVFE-ICDKHNWKYIFRFKE 265 Query: 198 NQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGL 257 ++ + E N + + V D+ Sbjct: 266 DRIKTVSQEFRAIQSLETNGKSSEYF---------------WVNDIAYND---------- 300 Query: 258 KKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + + + + E+K+E + I+ + AE A R W +EN+ Sbjct: 301 RLVNLVEKVKVTENEKKQEFLFITNFRIT--ERNAEILVQAGRRRWKIENE 349 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 25/48 (52%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWM 94 + F + + ++ D + G P DT+ RV + I P KF E F +W+ Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWI 48 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 13/45 (28%), Positives = 25/45 (55%) Query: 134 TMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 T + + Q++ + +NEIT LL+ D++ +T DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36C2 Length = 109 Score = 47.8 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 26/69 (37%), Gaps = 5/69 (7%) Query: 268 SIIAEQKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + V Y I+S A R HW +EN LH+ DV + ED C + Sbjct: 5 ERRRKANGKATVEVVYGITSLSRLAADAAALLGYSRRHWGIENGLHYTRDVTLGEDRCPV 64 Query: 325 --RRGNAAE 331 R Sbjct: 65 GARSRPTPR 73 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 11/60 (18%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +L + I D RQ K H L +L++TI +I + LD+L+QY Sbjct: 34 RLADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 36/48 (75%), Positives = 39/48 (81%) Query: 78 VSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 +SCI KFHECFIN MR+CHSS+D DVIAIDGK L HS DKSRRR A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 1/55 (1%) Query: 8 EHISIIPDYRQAW-KVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 EH +PD R+ + HK IL++ I A+I GA+SW + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 40/108 (37%), Gaps = 6/108 (5%) Query: 66 NGIPVHDTIARVVSCISPAKFHEC-FINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 G P + + + P + + V+ +DG T+R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIITT 171 +H+ + +++ Q+ D+K+NE + L + D+ G +IT Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLITA 134 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 33/84 (39%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L +S IPD R+ + L +L L + AV+ GA S I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIARVVSCISPAKFHE 88 + P T+ + + + E Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 23/63 (36%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L + +PD V H+L+ +L+ I AV S+ I ++ G Sbjct: 14 GLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGGH 73 Query: 65 ENG 67 G Sbjct: 74 RPG 76 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 27/50 (54%), Gaps = 13/50 (26%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF-------------SGIRHIAINILT 345 +HWRLDV MNEDDC+IRRGN F +R I INIL Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILK 50 >UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW00_RHOSR Length = 98 Score = 45.1 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 2/48 (4%) Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 GKT R + D S H+++A +V+ Q+ + NEI + + Sbjct: 18 GKTWRGAKDGSG--HLTHLLAAVDHDAGVVLRQVAVGARINEIPLLLD 63 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 45.1 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 16/30 (53%), Positives = 24/30 (80%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 +++A +T + + IGQ+K D KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 77/256 (30%), Gaps = 29/256 (11%) Query: 5 KLMEHISI-IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 KL E ++ I D R +V H L+ IL IFA+ G E D++ F G Sbjct: 45 KLAEKLAAAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDLDRL-RNDPAFKLACGR 103 Query: 64 FENG---IPVHDTIARVVSCIS---PAKFHECFIN-WMRDCHSSNDKDVIAID------- 109 + + T +R+ + + ++ W+ + + ID Sbjct: 104 LPDSGQDLCSQPTCSRLENLPDLRTVIRLGRVLVDLWLSSYPAPPKSVTLDIDDTLDVVH 163 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML------- 162 G ++ I + + I K+ I L L Sbjct: 164 GHQQLSLFNGHHDERCFLPIHIYDAATGRPVAMILRPGKTPSGKEIRGHLRRLARCIRAR 223 Query: 163 -DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD 221 ++ D+ + ++ ++ DY+F + GN K + + Sbjct: 224 WPDTRILVRGDSHYGRVEVMAWCEENAIDYVFGLAGN-----KVLKRLVDASADDIRTRR 278 Query: 222 SYAMSEKSHGREEIRL 237 + G E R Sbjct: 279 ALEQKPVLRGYVETRY 294 >UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TLQ7_SHEHH Length = 74 Score = 44.3 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 17/44 (38%), Positives = 23/44 (52%) Query: 7 MEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDF 50 EH+SII R + EH I+ L A+ S E W DI++F Sbjct: 4 FEHLSIIKAPRSSINHEHDPVDIMFLVNSAIASDCEGWLDIDEF 47 >UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RR82_RHORT Length = 84 Score = 43.9 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 1/46 (2%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 A E++ LD+ G++ T DA+ CQK E ++ G L K NQ Sbjct: 36 ATQEMIAPLDLTGRLFTLDALHCQKTF-EIARQAGNHLLVQAKINQ 80 >UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P672_AZOSE Length = 47 Score = 43.9 bits (102), Expect = 0.010, Method: Composition-based stats. Identities = 15/31 (48%), Positives = 18/31 (58%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 HW VEN LHW L+V NED ++R A Sbjct: 1 HWGVENWLHWCLNVQFNEDRSRVRSAYAVNN 31 >UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia coli O157:H7 RepID=Q8X3B6_ECO57 Length = 50 Score = 43.9 bits (102), Expect = 0.010, Method: Composition-based stats. Identities = 25/38 (65%), Positives = 28/38 (73%) Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGLS 378 + I ND VFKAGL KMRKA MDRN+LAS +A GLS Sbjct: 13 LLISDNDNVFKAGLSCKMRKAVMDRNFLASGIAACGLS 50 >UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus pneumoniae RepID=B2ISL1_STRPS Length = 535 Score = 43.6 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 78/232 (33%), Gaps = 33/232 (14%) Query: 18 QAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG--IPVHDTIA 75 Q + S IL+ +F +++G + D+ L + G + T++ Sbjct: 142 QRRYCRYSDSDILVQFLFQLLTGYGT-----DYACKELSADAYFPKLLEGGQLASQPTLS 196 Query: 76 RVVSCISPA----------KFHECFINW--MRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 R +S + E F+ + + D GK +Y+ R Sbjct: 197 RFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTYGKQEGVAYNAHYRA 256 Query: 124 GAIHVISAFSTMHSLVI-GQIKTDKK--SNE----ITAIPELLNMLDIKGKIITTDAMGC 176 H + AF Q++ + S E IT + E N L + D+ Sbjct: 257 HGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQL-----LFRMDSGFA 311 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQ--GRLNKAFEEKFPLKELNNPEHDSYAMS 226 + + I+K G YL +K N RL ++L H +Y+ + Sbjct: 312 TPKLYDLIEKTGQYYLIKLKKNTVLSRLGDLSLPCPQDEDLTILPHSAYSET 363 >UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteobacteria RepID=A6X872_OCHA4 Length = 330 Score = 43.2 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 70/194 (36%), Gaps = 15/194 (7%) Query: 52 ETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRD------CHSSNDKDV 105 E + L + + E +P H T +R + + + C D S+ K Sbjct: 85 EGLMASLLRLLNVELPVPDHTTFSRRCANLVVSSLTRCTRRDGTDEPLHVIVDSTGMKIY 144 Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK 165 A +H +R+ +H+ A + VI + TD+ +++++ +P+LL+M+D Sbjct: 145 EAGQWLEEKHGAKSARKWLKLHL--AIDADSNQVIAETLTDQNTSDLSQVPDLLDMIDRP 202 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAM 225 D + ++ + R+ E + + + D ++ Sbjct: 203 IACFMADGAYDSDQTYQALRSHSPGVSIII---PPRIRDLQEASYGPPD----QRDWHSR 255 Query: 226 SEKSHGREEIRLHI 239 + GR E + Sbjct: 256 TNAQRGRMEWQNLT 269 >UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromonas gingivalis ATCC 33277 RepID=B2RI66_PORG3 Length = 87 Score = 43.2 bits (100), Expect = 0.018, Method: Composition-based stats. Identities = 14/47 (29%), Positives = 24/47 (51%) Query: 17 RQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 R K + L + L+ + +SG SW +IED+ E + + LK + Sbjct: 23 RIESKEVYPLDFLFLIVFLSTLSGDTSWYEIEDYAEEYEEVLKSRYE 69 >UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b RepID=A6FLE0_9RHOB Length = 136 Score = 42.8 bits (99), Expect = 0.023, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 2/72 (2%) Query: 13 IPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIP--V 70 +PD R+ KV H L I+ I + +G E D D + L D E G Sbjct: 41 LPDPREPGKVRHSLEDIIRFRIMMIAAGYEDGNDAGDLRDDPAFKLALERDPETGAALCS 100 Query: 71 HDTIARVVSCIS 82 TI+R+ + Sbjct: 101 QPTISRMENMAD 112 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 41.2 bits (95), Expect = 0.064, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 67/205 (32%), Gaps = 19/205 (9%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPV 70 I+PD R +V+H L +L I+A+ +G E D + G H L+ + + Sbjct: 49 KIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLNDHD--GLRHDYALQTAVNRLQPLAG 106 Query: 71 HDTIARVVSCISPAKFHECFI-NWMRDCHSSNDKDV-IAIDGKTLRHSYDKSRRRGAIHV 128 T+ R+ + W + I +D + H Sbjct: 107 KSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGDQEGRFFHG 166 Query: 129 ---------ISAFSTMHSLV--IGQIKTDKKSNE---ITAIPELLNMLDIKGKII-TTDA 173 + F H LV + D + + + + + + +I+ D Sbjct: 167 YYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPETRIVFRGDG 226 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGN 198 C+ + + ++ DY+ + N Sbjct: 227 GFCRHRMLDWCDRKQVDYVVGLARN 251 >UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUW9_9BACT Length = 61 Score = 40.9 bits (94), Expect = 0.079, Method: Composition-based stats. Identities = 8/39 (20%), Positives = 16/39 (41%) Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 D ED +IR NA + ++++ + + V Sbjct: 1 MRDTSFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVP 39 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 430 e-119 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 402 e-110 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 392 e-107 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 390 e-107 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 388 e-106 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 388 e-106 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 385 e-105 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 373 e-102 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 368 e-100 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 364 3e-99 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 359 7e-98 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 355 2e-96 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 351 2e-95 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 349 1e-94 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 348 2e-94 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 342 9e-93 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 342 1e-92 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 342 2e-92 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 341 3e-92 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 338 2e-91 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 338 2e-91 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 335 2e-90 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 334 2e-90 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 330 6e-89 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 327 6e-88 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 324 3e-87 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 324 4e-87 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 323 6e-87 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 321 2e-86 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 321 3e-86 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 319 1e-85 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 317 3e-85 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 317 4e-85 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 317 5e-85 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 317 6e-85 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 315 2e-84 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 313 6e-84 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 308 2e-82 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 301 2e-80 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 297 5e-79 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 295 1e-78 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 295 1e-78 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 294 3e-78 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 294 4e-78 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 288 2e-76 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 280 4e-74 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 273 7e-72 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 269 1e-70 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 265 2e-69 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 264 3e-69 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 264 4e-69 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 262 1e-68 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 260 6e-68 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 260 8e-68 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 259 1e-67 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 256 9e-67 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 240 5e-62 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 240 7e-62 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 238 2e-61 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 236 1e-60 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 233 6e-60 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 232 1e-59 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 228 3e-58 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 227 4e-58 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 226 1e-57 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 226 1e-57 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 226 1e-57 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 223 6e-57 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 218 4e-55 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 216 1e-54 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 211 3e-53 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 210 6e-53 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 203 1e-50 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 202 1e-50 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 199 1e-49 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 192 1e-47 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 192 2e-47 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 177 5e-43 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 172 2e-41 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 172 2e-41 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 172 2e-41 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 172 2e-41 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 169 2e-40 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 166 1e-39 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 165 2e-39 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 165 3e-39 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 164 3e-39 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 164 6e-39 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 158 3e-37 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 157 8e-37 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 156 1e-36 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 153 1e-35 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 150 8e-35 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 149 1e-34 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 149 2e-34 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 148 3e-34 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 148 4e-34 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 146 1e-33 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 144 4e-33 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 144 4e-33 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 140 9e-32 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 139 2e-31 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 138 3e-31 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 138 3e-31 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 137 6e-31 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 136 1e-30 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 135 2e-30 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 135 2e-30 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 135 3e-30 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 134 5e-30 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 133 1e-29 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 133 1e-29 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 130 8e-29 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 126 1e-27 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 125 3e-27 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 124 6e-27 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 123 9e-27 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 123 2e-26 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 121 5e-26 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 118 3e-25 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 118 4e-25 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 117 7e-25 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 115 3e-24 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 115 3e-24 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 113 1e-23 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 112 1e-23 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 112 2e-23 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 112 2e-23 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 108 4e-22 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 107 5e-22 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 107 6e-22 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 106 2e-21 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 105 2e-21 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 105 3e-21 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 104 6e-21 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 104 7e-21 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 101 3e-20 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 101 3e-20 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 101 4e-20 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 101 4e-20 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 101 5e-20 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 100 9e-20 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 100 1e-19 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 100 1e-19 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 99 2e-19 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 99 2e-19 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 99 2e-19 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 98 4e-19 UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillace... 97 1e-18 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 97 1e-18 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 96 2e-18 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 95 5e-18 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 94 6e-18 UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ... 92 4e-17 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 91 7e-17 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 91 8e-17 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 90 1e-16 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 89 2e-16 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 89 3e-16 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 88 7e-16 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 88 7e-16 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 87 1e-15 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 86 2e-15 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 85 4e-15 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 84 5e-15 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 84 7e-15 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 83 2e-14 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 83 2e-14 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 83 2e-14 UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferro... 82 3e-14 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 82 4e-14 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 81 6e-14 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 81 6e-14 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 81 7e-14 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 81 7e-14 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 79 2e-13 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 79 3e-13 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 77 1e-12 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 76 2e-12 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 74 7e-12 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 74 9e-12 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 74 9e-12 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 73 2e-11 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 73 2e-11 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 72 4e-11 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 69 2e-10 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 69 2e-10 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 69 2e-10 UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 68 4e-10 UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitroso... 66 2e-09 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 66 2e-09 UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitroco... 65 3e-09 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 63 1e-08 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 63 2e-08 UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=... 63 2e-08 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 63 3e-08 UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=... 61 6e-08 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 61 7e-08 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 61 8e-08 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 56 2e-06 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 56 3e-06 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 55 4e-06 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 55 5e-06 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 55 5e-06 UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata ob... 54 6e-06 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 54 9e-06 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 51 8e-05 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 51 1e-04 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 48 5e-04 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 47 0.001 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 47 0.001 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 47 0.002 Sequences not found previously or not previously below threshold: UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanoba... 68 4e-10 UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltapr... 61 5e-08 UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkalip... 59 3e-07 UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM 50 1e-04 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 49 2e-04 UniRef50_C7GHC1 Transposase, IS4 family (Fragment) n=6 Tax=Roseb... 48 6e-04 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 48 8e-04 UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteoba... 47 0.001 UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=... 46 0.002 UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aro... 46 0.002 UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus... 46 0.002 UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus p... 46 0.003 UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synecho... 46 0.003 UniRef50_B0JNZ6 Transposase n=20 Tax=Cyanobacteria RepID=B0JNZ6_... 45 0.004 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 45 0.005 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 44 0.006 UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone... 44 0.007 UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia ... 44 0.007 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 44 0.012 UniRef50_A5FU21 Transposase, IS4 family protein n=11 Tax=Alphapr... 44 0.013 UniRef50_A7JYJ5 Putative uncharacterized protein n=1 Tax=Vibrio ... 42 0.025 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 42 0.026 UniRef50_A7BZU0 Putative uncharacterized protein n=1 Tax=Beggiat... 42 0.027 UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodoco... 42 0.028 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 42 0.034 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 42 0.038 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 42 0.039 UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitroco... 42 0.046 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 370/378 (97%), Positives = 373/378 (98%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 MELKKLMEHISIIPDYRQ WKVEHKLS ILLLTI AVISGAE WEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS+DKDVIAIDGKTLRHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 RRRGAIHVISAFSTMHSLVIGQIKTD+KSNEITAIPELLNMLDIKGKIITTDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNPEHDSYA+SEKSHGREEIRLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 Query: 361 AAMDRNYLASVLAGSGLS 378 AAMDRNYLASVLAGSGLS Sbjct: 361 AAMDRNYLASVLAGSGLS 378 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 402 bits (1032), Expect = e-110, Method: Composition-based stats. Identities = 156/379 (41%), Positives = 222/379 (58%), Gaps = 10/379 (2%) Query: 2 ELKKLMEHISIIPDYRQAW-KVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 +K E+ + D R+ H IL++ + A+ISGA ++ +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 + NGIP HDT V++ +SP +F CF+ W + IAID KTLR S DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + +H++SA++T +LVIGQIKT++ SNEITAIPELLN LD+KG +++ DAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN---NPEHDSYAMSEKSHGREEIRL 237 AEKI ++ DY+ A+KGNQ +L+++ E F L N E D E S+GREEIR Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + +++I EWK +K + + S R KKE E +RYYISSA L+AE Sbjct: 245 AYATNEIEKIIAN-DEWKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLK 298 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +ENKLHW LDV ED+ +IR+ N AE + +R IA+N++ +K K G K Sbjct: 299 VVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATK 358 Query: 358 MRKAAMDRNYLASVLAGSG 376 A D YL +L G Sbjct: 359 RLMAGWDEKYLLKLLNGLA 377 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 168/373 (45%), Positives = 233/373 (62%), Gaps = 8/373 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+E SII D RQ K++H+L IL L + AVI GAE W+DIE+ G L++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 + GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH++SA++ + +V+GQ KTD KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPEHDSYAMSEKSHGREEIRLHIVC 241 + GDY+ AVK NQ +L++ + F HD + S K HGR E+R + + Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D+ L W L+ + + S R I + RY+I+S A+ FA A+R Sbjct: 246 DMLSTLG-NPERWASLQSIGMVESERYI----DGKTTAETRYFITSIAPDAKIFANAVRK 300 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K KA Sbjct: 301 HWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRYKA 360 Query: 362 AMDRNYLASVLAG 374 + +Y VL G Sbjct: 361 TLQPDYAQKVLNG 373 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 390 bits (1002), Expect = e-107, Method: Composition-based stats. Identities = 182/377 (48%), Positives = 249/377 (66%), Gaps = 4/377 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M L+ +SII D RQ KV H L +L L I AVISG E WE+I+DFG LD+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 Y F GIP DTI+R+ I P +F +CF WM+ C + DVIAIDGKTLR S++K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A+KI +GGDYL VKGNQ RL A + F ++ L PE ++Y EK HGRE+ R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D E+ D FEW GLK L AVSFR E+ + + V++YISSA L A+ A R Sbjct: 241 ADAN-EIGDLVFEWPGLKTLGYAVSFR---TEKDMQTTVAVKFYISSAKLDAKSLLEASR 296 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VEN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++ Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQ 356 Query: 361 AAMDRNYLASVLAGSGL 377 A +Y V++G L Sbjct: 357 ANRSDSYRELVVSGLSL 373 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 388 bits (996), Expect = e-106, Method: Composition-based stats. Identities = 148/385 (38%), Positives = 225/385 (58%), Gaps = 20/385 (5%) Query: 8 EHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG 67 ++ + D R +HKL I+ +TI AVI GA+SW DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIH 127 IP HDT RV S ++P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPE---------------LLNMLDIKGKIITTD 172 +ISA++T + LV+GQ D+KSNEITAIP+ LL +L + G I+T D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE---HDSYAMSEKS 229 A+GCQK+I ++I +Q DY+ +K NQG L + E F ++N E Y + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR+E+R + + E ID ++W L + R + + + RY+ISS + Sbjct: 251 HGRQEVRYYQMLSNVAEEIDPDWQWLNLNSIGYVEYLR--VENGTDKTSLERRYFISSLN 308 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + FA+++R HW +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K Sbjct: 309 NNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKT 368 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAG 374 K G++ K +KA D NYL VL Sbjct: 369 LKVGVKAKRKKAGWDENYLLKVLRN 393 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 388 bits (995), Expect = e-106, Method: Composition-based stats. Identities = 144/375 (38%), Positives = 220/375 (58%), Gaps = 7/375 (1%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++EH S + D R A ++E+ L I+++T+ AV+ GA++W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSR 121 NG+P HDT V + + P + +CF+NW + + + ++IAIDGKTLR + Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + IH++SA+++ + LV+GQ D+KSNEITAIPELL +L+++G +++ DAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPEHDSYAMSEKSHGREEIRLH 238 E I + GDY+ A+KGNQG L + F + EHDSY EK HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D L+ W LK + S R + + RYY+ S + A++FA A Sbjct: 245 WTMGQTDYLLG-AERWAQLKSIGCVESCR---RQPGHPGTLQRRYYLLSIESDAQRFADA 300 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R+HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 Query: 359 RKAAMDRNYLASVLA 373 KA D NYL +L+ Sbjct: 361 LKAGWDDNYLVKILS 375 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 136/373 (36%), Positives = 211/373 (56%), Gaps = 8/373 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+ ++ I D R +H L +L + I AVI+G++ WED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---EHDSYAMSEKSHGREEIRLHIVC 241 +Q DY+ +K N L ++ F + N EHD Y K H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 DVPDE-LIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V + +W GL+ + V R + + +++Y++S A+ AIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWN----KTTHDIQFYLTSLPPNAQFLCHAIR 325 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM++ Sbjct: 326 THWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMKQ 385 Query: 361 AAMDRNYLASVLA 373 AAM+ NY+ +VL Sbjct: 386 AAMNNNYMMTVLN 398 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 158/372 (42%), Positives = 226/372 (60%), Gaps = 7/372 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M ++ +H S I D+RQ+ KV + L +L ++ AVI+ + W +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A I +GGDYL AVK NQG L KA + F D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGL-SDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 DFT W+ LK + + SFR++ + K + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDFTH-WEALKSIVMVESFRAV---KGKTASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIE-SMHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 368 bits (945), Expect = e-100, Method: Composition-based stats. Identities = 155/371 (41%), Positives = 214/371 (57%), Gaps = 10/371 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 + H S I D RQ KV + L ILLLT+ AV+SGA W I +G L FLK++ F Sbjct: 25 FLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPFA 84 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + A Sbjct: 85 DGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKAA 143 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 144 IHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKII 203 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 + DY+ A+KGNQG L K E + + ++ + EKSHGR E R VC Sbjct: 204 SKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVCT 263 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 D L W GLK + + A + + RYYISS AE A AIR+H Sbjct: 264 DIDWL-KADHNWPGLKSIVMVQY----HAILQDKTRAETRYYISSMTSDAEHHAKAIRDH 318 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K A+ Sbjct: 319 WGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVKG-KHSLRSKRHIAS 377 Query: 363 MDRNYLASVLA 373 D ++LA ++ Sbjct: 378 WDDDFLAEIIN 388 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 364 bits (934), Expect = 3e-99, Method: Composition-based stats. Identities = 137/384 (35%), Positives = 197/384 (51%), Gaps = 15/384 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+SW +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + ++SA+S ++ + +GQ+K D KSNEITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 DI + I ++ +Y+ A+K N+ + L K + + ++ + HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVCDV-PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AE 293 R V F + GLK + S R+I+A E VRYY++S D T E Sbjct: 243 RTCTVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A+AIR HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K Sbjct: 301 EIASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGS 359 Query: 354 LRRKMRKAAMDRNYLASVLAGSGL 377 + K KA D YL+ +L + Sbjct: 360 MNLKRLKAGWDEKYLSQLLQNNNF 383 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 359 bits (922), Expect = 7e-98, Method: Composition-based stats. Identities = 131/380 (34%), Positives = 216/380 (56%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + L+EH I D R + +H+L +L++ + ++ G E++ D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQ--GQ 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++SA++ +SLV+GQI+ K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 125 SPRVIVSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD----------SYAMSEKSHGRE 233 I + +Y+ A+KGNQG+ ++ + + + +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + L D +W GL+ + V S R + + P + RYY+SS ++ E Sbjct: 245 ETRRYWQSGDVSWLAD-RQQWAGLRSVGVVESVRQVGQQA---PTVERRYYLSSLNVDVE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 KFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K G Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 LRRKMRKAAMDRNYLASVLA 373 ++ K A+ D +YL +L+ Sbjct: 361 IKGKQLNASWDHDYLLRLLS 380 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 355 bits (910), Expect = 2e-96, Method: Composition-based stats. Identities = 125/380 (32%), Positives = 197/380 (51%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 L E I D+R H L+ IL++ A++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMR--------DCHSSNDKDVIAIDGKTLRH 115 NGIP HDT +V S + P +F E F W + + S K VIAIDGK LR Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 + DK + ++ A+++ SL +GQ+K KSNEI A+PELL ML +KG I+T DAMG Sbjct: 134 AVDK--GQAPAVIVGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL-KELNNPEHDSYAMSEKSHGREE 234 CQ+++A KI +Q GDY+ A+K NQ L++ ++L E + + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 +R V + + + +W GL+ + R++ + + RY+ISS A Sbjct: 252 VRRCWVSEEVECWLQGAEKWAGLRSVAAVECERTVA----GQTTVQRRYFISSLKADAAL 307 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN-DKVFKAG 353 A ++R HW +EN LHW LDV ED+ + RRG +AE + +R + ++ + K Sbjct: 308 IAASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKS 367 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + ++ +A + +YL ++L Sbjct: 368 VNQRRFEAGLSTDYLQTLLG 387 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 351 bits (901), Expect = 2e-95, Method: Composition-based stats. Identities = 128/372 (34%), Positives = 198/372 (53%), Gaps = 7/372 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 L ++E + D+R A + H+LS +L + + AV+SGA+ +E+I +G + +L+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD-VIAIDGKTLRHSYDKSR 121 + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + K+ Sbjct: 66 RLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKAA 125 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 +H++SAF+ +V+GQ T +KSNEITAIPELL +LDI+G I+T DAMG Q IA Sbjct: 126 AA-PLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 I+++G Y+ VK N +L + ++ + HGR E+R Sbjct: 185 RAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTAF 244 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D D L WK + V R++ + YYISS AE+ A AIR+ Sbjct: 245 DATDRLHK-AEAWKDVASFAVVERVRTV----GERTSTERVYYISSLPADAERIAVAIRS 299 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K ++ K A Sbjct: 300 HWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSIKTKRLLA 359 Query: 362 AMDRNYLASVLA 373 A + A++L Sbjct: 360 ATSDEFRAALLG 371 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 349 bits (894), Expect = 1e-94, Method: Composition-based stats. Identities = 136/369 (36%), Positives = 196/369 (53%), Gaps = 9/369 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L ++ I D R H+L I+ + +FAV++GA+SW IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPL---KELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 KQ DY+ A+KGNQ L K ++ F E+ + E +H R E R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VPDELIDFT-FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 V +W GL+ L V S R + + RY++SS A FA IR Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNKD----TTETRYFLSSLSTDAATFAHYIRA 309 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L K +A Sbjct: 310 HWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSS-KGSLVMKRYRA 368 Query: 362 AMDRNYLAS 370 +D ++ Sbjct: 369 GLDDQFMMQ 377 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 348 bits (893), Expect = 2e-94, Method: Composition-based stats. Identities = 141/376 (37%), Positives = 201/376 (53%), Gaps = 12/376 (3%) Query: 5 KLMEHISIIPDYRQAW-KVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 LM + D R+ H +L++ I AV+S ++ EDI +G D+L+Q+ Sbjct: 8 SLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLV 67 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 NG+ +T R+ + P +F F W+ + + +DGKT+R S S Sbjct: 68 LLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGGE 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 AIH++SAF+T +V+GQ K KSNEITAIPELL L I G ++T DAMGCQK+IA + Sbjct: 125 SAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQ 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDV 243 I QGGDYL AVKGNQ L A E +F + + + + D + SHGR ++ V Sbjct: 185 ITDQGGDYLLAVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVAQIASVLPA 243 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 E I +W KK+ S R + E ++ RYYISS +LTAE+ A A+R HW Sbjct: 244 --EGIVDLADWPECKKIARVDSLRKV---GNHESKLERRYYISSRELTAEQLAAAVRAHW 298 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAGLRRKMRKA 361 +EN+LHW LDV ED IR+GNA + S ++ I +N++ D K LR K + A Sbjct: 299 GIENRLHWVLDVSFGEDASTIRKGNAPQNLSLLKKIVLNLIRLDTADKTKTSLRLKRKCA 358 Query: 362 AMDRNYLASVLAGSGL 377 A + +L + L Sbjct: 359 AWTDDVRMRILGFTSL 374 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 342 bits (878), Expect = 9e-93, Method: Composition-based stats. Identities = 142/375 (37%), Positives = 200/375 (53%), Gaps = 16/375 (4%) Query: 7 MEHISIIPDYRQAW-KVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 M + I D R+ H IL++ I AV+S ++ EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV-----IAIDGKTLRHSYDKS 120 NGIP +T R++ + P +F F W+ + D IAIDGKT+R S S Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GS 118 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 AIH++SAF+T LV+GQ K KSNEITAIPELL L IKG ++T DAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A++I + GDYL VKGNQ +L +A E F + + D + E+ HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASV 237 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 I +W + S R + K+ ++ RYYISS L+AE+ A A+R Sbjct: 238 LSAKG--IVDPADWPKCVTIGRIDSMRVV---GDKQSDLERRYYISSRALSAEQLAAAVR 292 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKM 358 HW VEN+LHW LDV +ED + + NA + S +R IA+ I+ DK K+ LR K Sbjct: 293 AHWGVENRLHWILDVSFSEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKR 352 Query: 359 RKAAMDRNYLASVLA 373 + AA D +L Sbjct: 353 KGAAWDDGVRERMLG 367 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 342 bits (876), Expect = 1e-92, Method: Composition-based stats. Identities = 130/378 (34%), Positives = 193/378 (51%), Gaps = 12/378 (3%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M K L++++ IPD R K H LS ++ + I A++ G ++W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK-DVIAIDGKTLRHSYDK 119 + GIP HDT R+ + + PA F W+ D + +A+DGK LR + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A+H+++ +ST + +GQ K KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK----ELNNPEHDSYAMSEKSHGREEI 235 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V + DE + +WK K + + R + + VR+YISS L A Sbjct: 240 RRCWVL-MVDESMPVCQQWKA-KTIIAVQAERI----ENGKGYDFVRFYISSRALDATSA 293 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K + Sbjct: 294 LKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKSRNLSMA 353 Query: 356 RKMRKAAMDRNYLASVLA 373 K R ++ YL + Sbjct: 354 NKRRLCCLNEQYLFECMG 371 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 342 bits (876), Expect = 2e-92, Method: Composition-based stats. Identities = 129/373 (34%), Positives = 205/373 (54%), Gaps = 10/373 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 + H + D R +H L ++ LT+ A++SGAE W+DI+ FG++ LD+L+++ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 + G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGD-RKT 120 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A+H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE---HDSYAMSEKSHGREEIRLHIVC 241 +GGDY+ VK NQG+L F + P+ +S ++ HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 + L + W +K + R + K + YYISS ++ + A AIR+ Sbjct: 241 PITPWLTQ-SQGWTNIKPVIEVTRKRYL----KDKETSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN HW LD+ EDD +IRRG+A E + R A+N+ K ++ K+++A Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLARLSP-IKDSMKGKLKQA 354 Query: 362 AMDRNYLASVLAG 374 A +L Sbjct: 355 AWSDEVREKLLFA 367 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 341 bits (873), Expect = 3e-92, Method: Composition-based stats. Identities = 124/380 (32%), Positives = 215/380 (56%), Gaps = 14/380 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++ ++E+ + + D R+ +H L +L++ + AVI+GA+ I + E H+++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----NDKDVIAIDGKTLRHS 116 + +G+P HDTI R+++ + P F +CF W+ + + +++IAIDGKTLR S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ + G + + SA++ + +GQ+ KSNEI PEL+ +D++ I+T DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPEHDSYAMSEKSHGRE 233 Q+D+AEKI GDY+ A+K NQ RL++ + + + + + + K HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 + R + +PDE + +W+GLK + VA+ +++ RYYISS A+ Sbjct: 249 DKRFYYQVKLPDE-VPAGEDWRGLKTIGVAIRI----SQENGRETCDTRYYISSLKPDAK 303 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +FA A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K K Sbjct: 304 QFAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKS-KES 362 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + + R A + N+LA +L Sbjct: 363 VVMRRRMAGWNVNFLAEILG 382 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 338 bits (866), Expect = 2e-91, Method: Composition-based stats. Identities = 129/351 (36%), Positives = 186/351 (52%), Gaps = 5/351 (1%) Query: 22 VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+ +LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L+ + F +L HGR E R V D L + W GL + Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTC-IGHGRIEERTCQVADASAWLTEQHSGWAGLASIA 239 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 ++ R ++ E R YISS + A R+HW VEN LHW+LDV ED+ Sbjct: 240 AVIATR--TDKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFREDE 297 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 298 CRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 338 bits (866), Expect = 2e-91, Method: Composition-based stats. Identities = 131/379 (34%), Positives = 183/379 (48%), Gaps = 16/379 (4%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M ++ +IIPD R ++ + I+ + + AVI GA++W +IE FG+TH + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLR-----H 115 IP HDT++R S + F ECF W+ D V+AIDGK + Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKS 116 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 S K+ R ++++SA+S + + +GQ K ++KSNE AIPEL+ LD++ IIT DA+G Sbjct: 117 SNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIG 176 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPEHDSYAMSEKSHGREE 234 CQK I + I + DY+ K N L E Y K HGR E Sbjct: 177 CQKSITKLIIENKADYILCAKDNHEALRNIIEFNLSEESRYYLCHAKRYFEENKGHGRSE 236 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R VC L F W G+K L + S R + KE M RYYISS + Sbjct: 237 YREC-VCISAKNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDPII 292 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 +IR HW VEN LHW LD+ EDD + + GNAA FS I +A+ +L K G+ Sbjct: 293 ILKSIRPHWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQ-SDIKLGM 350 Query: 355 RRKMRKAAMDRNYLASVLA 373 K + D V+ Sbjct: 351 AGKRKACGWDEKIRDKVIG 369 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 335 bits (859), Expect = 2e-90, Method: Composition-based stats. Identities = 134/375 (35%), Positives = 201/375 (53%), Gaps = 12/375 (3%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + + E++S D R A+ +H I+ L + AVISGA SW +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGKTLRHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNPE 116 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +IT DAM QK I Sbjct: 117 -TQSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH-DSYAMSEKSHGREEIRLHI 239 AEKI ++ GDY+ +K N + E F + PE ++Y R + R + Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYR 235 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V D L EWKG+K + RS + +YISS D+ + A + Sbjct: 236 KLKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDIQILAKCV 290 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 291 RGHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLT 349 Query: 360 KAAMDRNYLASVLAG 374 A + +L G Sbjct: 350 AAGWSDEFRDELLLG 364 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 334 bits (857), Expect = 2e-90, Method: Composition-based stats. Identities = 123/374 (32%), Positives = 191/374 (51%), Gaps = 9/374 (2%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + L I D R + + L ILL+T+ A+I G ++W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 G+P T ARV S I P +F C WM D+I +DGK+L S + + + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 A H+++A+ + +G+++ KSNEI AIP LLN L+++G II+ DAMG QK IA Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEK---SHGREEIRLHIV 240 I+ + DY+ A+K N R + E F + + + Y E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD-LTAEKFATAI 299 + + W+ L+ + S R + E E RYYI+S + + AI Sbjct: 254 LPM-MYFHKYKKYWRDLQAIVRVQSKRH----KGNEIETATRYYITSLPFAEHRRMSQAI 308 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN+LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K Sbjct: 309 RQHWAIENQLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRI 368 Query: 360 KAAMDRNYLASVLA 373 +AA+ YL V+ Sbjct: 369 QAALSTRYLRKVVG 382 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 330 bits (845), Expect = 6e-89, Method: Composition-based stats. Identities = 126/375 (33%), Positives = 194/375 (51%), Gaps = 13/375 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG--D 63 L+E S +PD R+ + L+ IL++ + A++ GA++W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE---HDSYAMSEKSHGREEIRLH-I 239 I +GGDY+ VK NQ L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I + + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLA 373 A + +Y S++A Sbjct: 357 LHADRNESYRESLIA 371 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 327 bits (837), Expect = 6e-88, Method: Composition-based stats. Identities = 112/369 (30%), Positives = 190/369 (51%), Gaps = 6/369 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L+EH++++ + R +H L ++ L I A++SGAE W DIE +G++ +D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 ++ + VK NQ +L +A + +F E E HGR+E R Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + T +W ++ + RS + + YY+SS + IR HW + Sbjct: 248 PP-ELTEKWPTIRSIIAVERHRS----ANGKGTVDTSYYVSSLSPKHKLLGHYIRQHWRI 302 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN H+ LDVV NED +I +A E + R +NI+ R K+++A + Sbjct: 303 ENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGWND 362 Query: 366 NYLASVLAG 374 +Y A + G Sbjct: 363 DYRAQLFFG 371 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 324 bits (830), Expect = 3e-87, Method: Composition-based stats. Identities = 116/370 (31%), Positives = 187/370 (50%), Gaps = 7/370 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L +H+S++ D R H L +L L + AV SG + W +I+ FGE L++L+++ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 NGIP TIAR++ + P C +W+ D +++ K +IAIDGKTLR + Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLGC--N 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVP 244 + GDY+ VK NQ L +A + ++ + ++ + +A SEK HGR E R+ Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQRITFQIPSK 239 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 +W +K L R I + + +Y+SS D+ E ATA+R HW Sbjct: 240 LSP-KLQEKWPSVKTLIAVERHRKI----GNKTSIETSFYLSSHDIDPEYIATAVRGHWR 294 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 295 IENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLLS 354 Query: 365 RNYLASVLAG 374 Y ++ Sbjct: 355 DEYRELMIFA 364 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 129/377 (34%), Positives = 195/377 (51%), Gaps = 13/377 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 L+ L+EH S I D R ++ H L ILLL + ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATF-PGRADFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD----IKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 IA I+ QG DYL AVK NQ L E F + + + HD +K HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEERHV 246 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK--KEPEMTVRYYISSADLTAEKFA 296 V D L + G +L + + RY+ISSA LTAE A Sbjct: 247 SVIREVDWLSGTRR-FPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHAA 305 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ K L+ Sbjct: 306 DAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQK-SLKT 364 Query: 357 KMRKAAMDRNYLASVLA 373 + + A +YLAS+L Sbjct: 365 RRKMAGWSDDYLASLLN 381 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 323 bits (828), Expect = 6e-87, Method: Composition-based stats. Identities = 140/388 (36%), Positives = 200/388 (51%), Gaps = 26/388 (6%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 L + + + +I D R +H+ S I+L+ I AVI GA++W IEDFG++ F Sbjct: 14 LHEFADSLILI-DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKL 72 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSY----- 117 NGIP HDT R S + P KF E + W++ IAIDGKT+R +Y Sbjct: 73 SNFNGIPSHDTFNRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQD 131 Query: 118 ----------DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGK 167 D + + +HVISAF+T + +GQ+ T +K NEI IPELL+ML IK Sbjct: 132 KRHRKQGVLPDSNTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDC 191 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL--KELNNPEHDSYAM 225 IIT DA+GCQ+ IAEK+ K GDY+F VK NQ +L + + D Y Sbjct: 192 IITIDALGCQRTIAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYET 251 Query: 226 SEKSHGREEIRLHIVCDVPDELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYY 284 E+ HGR E R+ C+ P L D +WK ++ + R+ K + R + Sbjct: 252 HEEGHGRNESRICYCCNDPGFLGADIRKKWKNIQSFGYIENTRN----TNKGTTVEKRCF 307 Query: 285 ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 ISS + A+K R HW +EN LHW+LDV +ED+ + RR +A FS + IA+ L Sbjct: 308 ISSLEPDAQKILKNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATL 366 Query: 345 TNDKVFKAGLRRKMRKAAMDRNYLASVL 372 N+K + + RK A D +L ++ Sbjct: 367 RNNK-REIPINRKRLIAGWDNEFLWELI 393 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. Identities = 124/376 (32%), Positives = 186/376 (49%), Gaps = 11/376 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 ++ L + I D RQA KV H++ +L++ + + ES+ D+ DF ++ L +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D + IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGD----LEGRHIAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI +KSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPEHDSYAMSEKSHGREEIRLHI 239 +I G DY+ A+K N R ++ + F +L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ + R + P V Y++ S E+ A + Sbjct: 237 ITEELDWYHK-SWKWAGLQSVAQV--RRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHP-AKVSLRRKRK 352 Query: 360 KAAMDRNYLASVLAGS 375 A MD + +L Sbjct: 353 LATMDPAFRLQMLGLL 368 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 321 bits (822), Expect = 3e-86, Method: Composition-based stats. Identities = 126/382 (32%), Positives = 197/382 (51%), Gaps = 15/382 (3%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + +E ++ I D+R + ++L ILL++ AVI +++ ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV------IAIDGKTLRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGKT+ S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S + A HV++AF++ LV+GQIKTD+KSNEITAIPELL + +K ++T DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD------SYAMSEKSHG 231 K+IA KI ++GGDY+ AVKGNQ +L + + + EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R E R + + + +W+G+ + + R + + K + S + Sbjct: 241 RIEKRECYLSNDLSWF-EGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEAQ 299 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFK 351 A+ R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L K Sbjct: 300 AKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTCK 359 Query: 352 AGLRRKMRKAAMDRNYLASVLA 373 G+R K + + VL Sbjct: 360 CGMRSKRKLCGLGIPTALQVLG 381 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 319 bits (816), Expect = 1e-85, Method: Composition-based stats. Identities = 117/381 (30%), Positives = 184/381 (48%), Gaps = 19/381 (4%) Query: 6 LMEHISIIPDYRQA-WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L + +PD R H L+ IL + AVI+GAE WEDI ++G + F +++ + Sbjct: 5 LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLEL 64 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------NDKDVIAIDGKTLRHS 116 +NG+P HDT RV + + P F + F W + + + +A+DGK+ R S Sbjct: 65 KNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRS 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 + +H++ + +L++GQ + +EIT ++L LD+ G ++T DA GC Sbjct: 125 AKPTFSGC-LHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGC 183 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE-HDSYAMSEKSHGREEI 235 Q + E I+ +GG+Y+ VKGNQ L A F D + +HGR E Sbjct: 184 QTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEE 243 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V PD L W G+ + + R + + E T YY+SS + A + Sbjct: 244 RNVTVVHDPDGL---PAGWAGVGSVALVCRDRQVKGKAN---ESTAHYYLSSLRVGAAEL 297 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A IR HWH+E+ +HW LDV ED+ + R G+A IR +A+++L K + Sbjct: 298 AGYIRGHWHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAG-KKGSIH 355 Query: 356 RKMRKAAMDRNYLASVLAGSG 376 + +A D Y+A VL G Sbjct: 356 TRRLRAGWDDQYMAQVLQGLS 376 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 317 bits (813), Expect = 3e-85, Method: Composition-based stats. Identities = 127/379 (33%), Positives = 198/379 (52%), Gaps = 17/379 (4%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L IL++ +FA ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + + K +I IDGKT+R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H++SA+S +GQ +KSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD---SYAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 + + L WKGLK + + E++ + + RY+ISS E + Sbjct: 239 EYYQTEKIKWLSQ-KKAWKGLKSIIMERK----TLEKEGKRLIEYRYFISSLKEEIETVS 293 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF--KAGL 354 A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V K + Sbjct: 294 RAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 355 RRKMRKAAMDR-NYLASVL 372 R+K + +L VL Sbjct: 353 RKKRYVIGLRPIKHLEEVL 371 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 317 bits (812), Expect = 4e-85, Method: Composition-based stats. Identities = 128/381 (33%), Positives = 200/381 (52%), Gaps = 21/381 (5%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + ++ + + D R+ WK++H LS I+LL FA +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN---------DKDVIAIDGKTL 113 ENGIP HDT+ RV + + P E W S+ K ++AIDGKT+ Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDA 173 R + S ++ A+H+++A++T + GQ+ T++KSNEITAIPELL+M+ +KG +++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGRE 233 MG QK IA+KI K+ DY AVK NQ L + F + + + D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFEMSQEAD---DHYHTVEKAHGQI 240 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + V L E+ ++ + A I ++ + RY+I S ++A+ Sbjct: 241 ETRAYEVIHDVSWLRKTHPEFGHIQSIGRAR----IHLDKNGQESEESRYFILSCQVSAK 296 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKA 352 + +R HW +E +HW LDVV ED K A + + + +L K Sbjct: 297 ELCDYVRGHWQIE-SMHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLKQLDFGKKM 355 Query: 353 GLRRKMRKAAMD-RNYLASVL 372 +RRK ++ YL +L Sbjct: 356 SMRRKKYALSLSFDKYLKQLL 376 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 317 bits (811), Expect = 5e-85, Method: Composition-based stats. Identities = 116/371 (31%), Positives = 190/371 (51%), Gaps = 7/371 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 +++H+ I D R EH + I L + AVISGA+SW +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ + S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKGAKA-SASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 K+GGD + VKGNQ +L +A + +F NNP+ + + + K HGR E R+ C + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 P-AEIKMKWSQLKTLIAVERHRKV----GNKTSIDTHFYVSSAVLTSEAFGRAIRAHWQT 294 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 295 ENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHP-AKTSQTQKFNRACWSD 353 Query: 366 NYLASVLAGSG 376 ++ ++ G+G Sbjct: 354 DFREEIIFGTG 364 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 317 bits (811), Expect = 6e-85, Method: Composition-based stats. Identities = 125/412 (30%), Positives = 188/412 (45%), Gaps = 44/412 (10%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V +SW DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK------------------- 103 P HDT+ R I + C+ W + + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 104 -DVIAIDGKTLRHSYDKSR--------------RRGAIHVISAFSTMHSLVIGQIKTDKK 148 IAIDGKT+ + + + +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP+LL+ +DI +G ++T DA+G QK I EKI ++ DYL VK N +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPEHDSYAMSE---KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 ++ E+D +E + HG R I C P L +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIK 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + IA E + +ISS E R HW VEN LHW+LDV NEDD + Sbjct: 317 TEKINIAT--GEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAELFSGIRHIAINILT--NDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 + N+A+ FS + +A+ IL D+ K + RK +KA YLA+++ Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLINN 425 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 315 bits (806), Expect = 2e-84, Method: Composition-based stats. Identities = 110/369 (29%), Positives = 177/369 (47%), Gaps = 12/369 (3%) Query: 10 ISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIP 69 +PD R H L +L + + A I GAES D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 HDT +RV + P F CF ++ D + V+AIDGKTLR S+D++ R A+HV+ Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189 SAF++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GG Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELID 249 D+LF +K N+ L E F + + ++ HGR E+R H V L Sbjct: 188 DWLFPLKDNRPALRAEVERYF--ADPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWLAS 245 Query: 250 FTF-----EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 GLK L + + T Y+SSA L + A A+R HW Sbjct: 246 DRRFPDEAVLPGLKILGLVER---TVTSPDGRTTATRTLYLSSAALEPKTLARAVRAHWS 302 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +E +HW LD +ED + R+ + E + +R +A+N++ + +R + ++A Sbjct: 303 IEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANNQD-SIRLRRKRAGWS 361 Query: 365 RNYLASVLA 373 +Y ++L Sbjct: 362 DDYARTILG 370 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 313 bits (802), Expect = 6e-84, Method: Composition-based stats. Identities = 104/375 (27%), Positives = 173/375 (46%), Gaps = 17/375 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 + + E +PD R A H L+ IL + + A + GA S D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS----SNDKDVIAIDGKTLRHSY 117 +NG+P HDT +RV + P F + F +M+ K VIA+DGK LR Y Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + R +++A++ + + ++ NE +L+ +L +KG ++T DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRL 237 + +AE I+ +GGDY+ AVK NQ L + + + HGR+E R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQGKPST--ITVDAGHGRKEKRR 239 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 +V VP D ++ GLK + S R + RY++ S + Sbjct: 240 AVVAAVPQMAQD--HDFAGLKAVARITSKRGTDKTVE-------RYFLMSQAYPPKDVLR 290 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN LHW LDVV++ED + R+ NA + +R +A+N+ LR K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 MRKAAMDRNYLASVL 372 +++A + +L ++ Sbjct: 351 LKRAGWNDTFLFELI 365 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 308 bits (789), Expect = 2e-82, Method: Composition-based stats. Identities = 107/377 (28%), Positives = 175/377 (46%), Gaps = 13/377 (3%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 + + + +PD R A V H L +L++ +V+ G+ S ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN-DKDVIAIDGKTLRHSYDK 119 + ++ IP HDT + V I P F + D D D+IAIDGK LR + D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 ++SA+++ L + + D + E++A E L ++D++GK++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHI 239 I GGD+ A+KGNQ L F ++P + HGR+E R + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSDP---TAVTENTGHGRKETRKAV 244 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V + + E+ GLK + R E + RY+ S T E A+ Sbjct: 245 VVSA--KALAEYHEFPGLKGFGRIEATR----ETGGKVTSETRYFALSWVPTPEVLLAAV 298 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN LHW+LDV ED + R+ N + +R A+++L D K L K++ Sbjct: 299 RDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIK 357 Query: 360 KAAMDRNYLASVLAGSG 376 +A D +L S+L+ Sbjct: 358 RAGWDTTFLRSILSDLA 374 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 301 bits (771), Expect = 2e-80, Method: Composition-based stats. Identities = 102/371 (27%), Positives = 167/371 (45%), Gaps = 13/371 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++ +PD R A H L +L++ +V+ GA S ++ FG + + + Sbjct: 37 ILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN-DKDVIAIDGKTLRHSYDKSRRRG 124 + +P HDT + V I P F + D + D DVIA+DGK LR + D Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++SA++ L + + D + E+ A E L ++ +KGK++T DA+ C + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVP 244 GGD+ A+K NQ L F + +P + HGR E R V Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEPDAHPSA---LSEDIGHGRTETRKATVVSS- 270 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 + + E+ GLK + R + + RY+ S T E +R HW Sbjct: 271 -KALAEHHEFPGLKAFGRVEATR----KTAEGTTSETRYFALSWVPTPEVLLATVRAHWA 325 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW+LDV ED + R+ N+ + +R A++++ D K L K+++A D Sbjct: 326 IENSLHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWD 384 Query: 365 RNYLASVLAGS 375 ++L +VL G Sbjct: 385 DDFLRNVLNGL 395 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 297 bits (759), Expect = 5e-79, Method: Composition-based stats. Identities = 117/385 (30%), Positives = 176/385 (45%), Gaps = 43/385 (11%) Query: 30 LLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V +SW DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWMRDCHSSNDK--------------------DVIAIDGKTLRHSYDKSR-------- 121 + W + + IAIDGKT+ + + + Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDI-KGKIITTDAM 174 +H++SAF + SL +GQ + K NEI AIP+LL+ +DI +G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSE---KSHG 231 G QK I EKI ++ DYL VK N +L + E ++ E+D +E + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA E + +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIAT--GEIQNEKHCFISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT--NDKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAG 374 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLINN 382 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 295 bits (756), Expect = 1e-78, Method: Composition-based stats. Identities = 109/307 (35%), Positives = 163/307 (53%), Gaps = 7/307 (2%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +L + E +PD R + H LS +L + + AV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD-VIAIDGKTLRHSYDKS 120 + G+P HDT RV++ I PA F F+ W+ + D V+AIDGKT R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 G +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Q I Sbjct: 125 -TSGPLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIV 240 A I+ +G DY+ VK N L + + K HGR E+R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D +L + +W GL+ + R++ + + YYISS A + A A+R Sbjct: 244 YDAVSQLYK-SEQWAGLQSFALVERERTV----DGKTSVERHYYISSLPADAARIAQAVR 298 Query: 301 NHWHVEN 307 +HW VE+ Sbjct: 299 SHWAVES 305 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 295 bits (756), Expect = 1e-78, Method: Composition-based stats. Identities = 104/369 (28%), Positives = 186/369 (50%), Gaps = 7/369 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L++H+ II D R ++H L ++ LT+ A++SGA W+ IE FG LD+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + +KK +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPD 245 + D++ +KGNQ L A + ++P + HGR+E R + + Sbjct: 182 SKKSDFVIQIKGNQPALLAAVKAA-FAACYDSPALAISEQTNTGHGRKECRRVMQIEGNL 240 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + + +W ++ L S R++ + + R+Y+SS + + A IR HW + Sbjct: 241 PP-ELSEKWPHIRTLVEVASERTV----GNKTACSSRWYVSSLPVDTAQLADIIRAHWAI 295 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN+LHW LDVV ED+ + + A+ + A++++ + K L K + AA D Sbjct: 296 ENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQSAAWDP 355 Query: 366 NYLASVLAG 374 + + +L G Sbjct: 356 AFRSELLFG 364 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 294 bits (753), Expect = 3e-78, Method: Composition-based stats. Identities = 116/387 (29%), Positives = 185/387 (47%), Gaps = 20/387 (5%) Query: 3 LKKLMEHISIIPD------YRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLD 56 + +++ I I D RQ+WK+ + LS IL L ++G E+ +++EDF E + Sbjct: 1 MTTMIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEP 60 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND-KDVIAIDGKTLRH 115 Y D G P HDT+ RV+S ++ + E + + + S + +I++DGKT+R Sbjct: 61 LFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG 120 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 ++ + + +H+++A+ H L +GQ+ ++KSNEI AIP+LL +DI+ I+T DAMG Sbjct: 121 --NRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMG 178 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL---KELNNPEHDSYAMSEKSHGR 232 Q I + I K DY AVKGNQ L F E Y EKS G+ Sbjct: 179 TQTAIVDTIIKGKADYCLAVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQ 238 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 E+R + V L +W L+ + + + ++ + RY+I S Sbjct: 239 IEVREYWVSSDIKWLCQNHPKWHKLRGIGMTRNTI----DKDGQLSQENRYFIFSFKPDV 294 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVF 350 FA +R HW +E +HW LDVV +ED + AA + IR + + L Sbjct: 295 LTFANCVRGHWQIE-SMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK 353 Query: 351 KAGLRRKMRKAAMD-RNYLASVLAGSG 376 RRK R ++ +YL + G Sbjct: 354 DLSYRRKQRYISVHLEDYLVQLFGERG 380 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 294 bits (752), Expect = 4e-78, Method: Composition-based stats. Identities = 116/343 (33%), Positives = 172/343 (50%), Gaps = 4/343 (1%) Query: 35 FAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWM 94 ++ AESWEDIE +G + +L+ + NGIP HDT RV + F CF + Sbjct: 1 MRRVACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCV 60 Query: 95 RDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITA 154 + ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI A Sbjct: 61 QFRAGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRA 120 Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 IPELL L + G I+T DAMGCQ IAE+I+ +G D L +K N G +A F Sbjct: 121 IPELLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTC 180 Query: 215 LNNPEHDSYAMSE-KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 L + + HGR R V L + W L ++ + R I Sbjct: 181 LGSGAAGRPVFDAFEGHGRLVRRRVFVDAAATALAPLSG-WPDLSRVLAVETLRGIPGT- 238 Query: 274 KKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELF 333 +RY+++S IR HW VEN LHW L+V EDD ++R AA F Sbjct: 239 -GTVVADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNF 297 Query: 334 SGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 + +R IA+N++ D+ +A LR + +KAA D +Y+ ++A Sbjct: 298 ALVRKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIANQA 340 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 103/372 (27%), Positives = 169/372 (45%), Gaps = 14/372 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++ +PD R +H L IL + + AV+ GA ++E F + LD L+Q+ E Sbjct: 3 FLDVFGEVPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN----DKDVIAIDGKTLRHSYDKSR 121 G P HDT +RV++ + P +E F+ +M K +A+DGK+LR +Y K R Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 V++ F + + Q ++ E+ A L +L +KG +T DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 + ++ GG Y+ A+KGNQ +L + E +HGR E+R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKAA-AGKATKFHQTEEDAHGRHEVRRAFVI 239 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 L + S+R++ + + VR Y S + A + +R Sbjct: 240 PFAQTPGKNALV--DLCAIGRVESWRTV----EGKTTHKVRCYALSRKMPAHELLATVRR 293 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW+LDV++ ED + R+ N A + +R + +N+L D K L K KA Sbjct: 294 HWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRADP-EKIPLSHKRLKA 352 Query: 362 AMDRNYLASVLA 373 L S+ Sbjct: 353 RWADQDLLSLFT 364 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 280 bits (717), Expect = 4e-74, Method: Composition-based stats. Identities = 117/368 (31%), Positives = 175/368 (47%), Gaps = 15/368 (4%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L E + ++P R K + L +LL+ + +SG SW +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYD 118 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-VK 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K HV+SAFS + Q+ D+K+NEI AI +LL++LD+ G +++ DA+G Q Sbjct: 123 KLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQT 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 I E+I +GGDY+ VK NQ + E F + D +E SHGR E R + Sbjct: 183 AIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRRY 240 Query: 239 I--VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 + + E + KGL+ + V R ++ + V YYISS Sbjct: 241 ESILNPLEIEANEVLTRRKGLRSIHKVVRKR--RDKKSDKTSEEVAYYISSLT-DVSSLK 297 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLR 355 AIR HW +ENKLH LDV D R N A++ I+ I + I+ K K+ + Sbjct: 298 QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKTNMKSSIP 357 Query: 356 RKMRKAAM 363 R +K A Sbjct: 358 RIQKKPAR 365 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 273 bits (698), Expect = 7e-72, Method: Composition-based stats. Identities = 103/370 (27%), Positives = 179/370 (48%), Gaps = 17/370 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF- 64 + I+++ D R ++++ L ILL++++A ISG + WE IED+ H + L+ Sbjct: 5 IWNAIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKL 64 Query: 65 ------ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYD 118 + +P HDT V I P +F E + ++ + + IAIDGKT R Sbjct: 65 SGKELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG-IK 123 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 ++ +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G Sbjct: 124 QTANSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYV 183 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 ++ E I +GG+++ VKGNQ +L + E++F N D + HGR E R Sbjct: 184 EVIEMILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSAD--TQEDIGHGRVEKRTV 241 Query: 239 IVCDVP---DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 D++ +WKG+K L V R + + K + YYI++ + ++ Sbjct: 242 YCITEIKTDDDIDGCMQKWKGVKTLVKIV--REVYKKADKSTRIETVYYITNL-IDPKEI 298 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA-GL 354 AIR HW +EN LH LDV++NED + N E F + +A+ I+ + + Sbjct: 299 NRAIRAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRGISM 358 Query: 355 RRKMRKAAMD 364 R + Sbjct: 359 NRTRKLCGYS 368 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 269 bits (687), Expect = 1e-70, Method: Composition-based stats. Identities = 105/372 (28%), Positives = 168/372 (45%), Gaps = 14/372 (3%) Query: 7 MEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFEN 66 + + I D R H L+ +L L + A + GA++ +I +F E LK+ + Sbjct: 5 LSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCH-----SSNDKDVIAIDGKTLRHSYDKSR 121 G P HDT +R+ I P + ++ + V+A+DGK LR Y+K R Sbjct: 64 GCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGR 123 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 ++S + L + + + S+E+ A LL +D+KG I+T DA+ C+ D A Sbjct: 124 AFMPPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCRPDTA 182 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVC 241 + + + Y A+K N+GRL E F + + E HGR E R V Sbjct: 183 KALIGRKAHYALALKANRGRLFACAEAGFVAADAAGDLA-FHETRETGHGRLETRRASVL 241 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 + + + GLK + + R +VRY S L K A +R Sbjct: 242 PL--KAFKQAPAFPGLKAIGRIQATRQ---GADGRAVTSVRYIALSKVLAPHKLAEVVRA 296 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV +EDD + R+ NA + + IR +A +IL + K + KMR+ Sbjct: 297 HWTIENQLHWSLDVVFHEDDARSRKDNAPQNLAVIRRLARDILAAHPLDK-PIASKMRRV 355 Query: 362 AMDRNYLASVLA 373 +R++ Sbjct: 356 NWNRDFFHEFFT 367 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 87/365 (23%), Positives = 165/365 (45%), Gaps = 18/365 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD R ++L ++ + + AV +GA S+ I D+ + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD-VIAIDGKTLRHSYDKS 120 +P TI +V + + ++ +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQ-GGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHI 239 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSD---PVERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAEKFA 296 + V L + +++ + R ++ V Y I S + A Sbjct: 276 ILTVARGL-----RFPYAQQVIQIIRRRRVLGAGAW--STEVVYAICSLPCEQAPPKLLA 328 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 + IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 329 SWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHRRAGHSNIARAC 388 Query: 357 KMRKA 361 + A Sbjct: 389 RRLAA 393 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 264 bits (675), Expect = 3e-69, Method: Composition-based stats. Identities = 85/348 (24%), Positives = 157/348 (45%), Gaps = 16/348 (4%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFL-KQY 61 + L+E + + D+R+ H L +L++ I + G + ++ +F + + L +++ Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWM-RDCHSSNDKDVIAIDGKTLRHSYDK- 119 +P + TI RV+ + + F W + +D + + +DGK+L+++ Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -SRRRGAIHVISAFSTMHSLVIGQIKTDKK-SNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + ++ I +S FS LV+ + + K +EI ++ ++ K+ T DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRL 237 K I K DY+ VKGNQ L K ++ ++ + + SHGR+ R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDL----SNSSKPESCFLEQDNSHGRKISRK 236 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 V V ++ L+++ + + YYISS +A+ FA Sbjct: 237 IEVFKVRKNE---RQGFENLRRVIKVERK----GSRGDKTYEETAYYISSLTESAQVFAK 289 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 IR HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 290 IIRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFR 337 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 264 bits (674), Expect = 4e-69, Method: Composition-based stats. Identities = 101/386 (26%), Positives = 169/386 (43%), Gaps = 30/386 (7%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 E+ L+E ++ +PD R V H L+ +L LT AV++GA S + ++ + L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFE-------NGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSNDKDVIAIDGKTL 113 P TI RV++ I W + +A+DGK+L Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIITTD 172 R + RR +H+++A + LV+ Q+ +K+NEIT LL+ L D+ G ++T+D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGR 232 A+ Q D A ++ + Y+ VK N +L+ + P +++ HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKS-LPWQQIPL----QDRTRTTGHGR 270 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---D 289 EIR VC V + L + G ++ V R + + + Y ++S Sbjct: 271 CEIRRLKVCTVNNLL------FPGARQAVQIVRRR--VNRTTGKVSLKTIYAVTSLAAEQ 322 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + A IR HW VE LH DV ED ++R GNA + + R++AI L V Sbjct: 323 APPARVAQLIRGHWTVEA-LHHVRDVTFAEDASQLRSGNAPQAMATYRNLAIGALRLAGV 381 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAGS 375 + +R+ A D+ + L + Sbjct: 382 RN--IAAGLRRTARDQTRTLTHLGLT 405 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 108/360 (30%), Positives = 163/360 (45%), Gaps = 41/360 (11%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 L+ + + I D RQ KV H+ I++ + V + +SW ++ DF +DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINW--------------------MRDCHSSND 102 P HDT+ R + P + W + + Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 KDVIAIDGKTLRHSYDKSRRR--------------GAIHVISAFSTMHSLVIGQIKTDKK 148 IAIDGKT++ + ++ RRR +H++SAFS L +GQ + DKK Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAF- 206 NEI AIP LL+ LDI +G ++T DAMG QKDI +I K+ YL VK NQ L + Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F L N + + E HG +R VC L +W+ L+ + Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIR 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + R + E E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 317 TER--VDEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 260 bits (664), Expect = 6e-68, Method: Composition-based stats. Identities = 107/382 (28%), Positives = 176/382 (46%), Gaps = 25/382 (6%) Query: 3 LKKLMEHISIIPDYRQ--AWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 +K L E + +PDYR+ ++KL ILLL I + + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN---DKDVIAIDGKTLRHSY 117 G +G+P T+ R+ I E + H D++ IDGK +R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 ++ R I +SA+S + + ++KSNEIT++P+LL+ +D+ G I+T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEK-SHGREEIR 236 K I +KI+++GGD+L +K NQ L E+ L E D Y+ HGR E R Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAE----PVDVYSEGPFLEHGRIETR 251 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 + + D LI +W G L V + + + R+Y+SS +A + Sbjct: 252 VCRIFRGND-LITDREKWNG--NLTVVEIRTATERKSDGQKSSERRFYVSSFHGSARRLG 308 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL-R 355 T R HW +E+ +HW LD + +D + +A I+ + + IL + + Sbjct: 309 TIARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAIL--------SIWK 359 Query: 356 RKMRKAAMDRNYLASVLAGSGL 377 K +K + A ++ L Sbjct: 360 GKRKKPSEKAKGTAELIGELSL 381 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 260 bits (663), Expect = 8e-68, Method: Composition-based stats. Identities = 109/349 (31%), Positives = 167/349 (47%), Gaps = 16/349 (4%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+A K + HKLS I++L I +S S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H ++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + ++KSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 QKDI +KI+++ GD++ +K NQ L E+K +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKEL---SPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ Sbjct: 327 GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVF 374 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 93/253 (36%), Positives = 144/253 (56%), Gaps = 7/253 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++H+++A+ + +L++GQ+K D KSNEITAIP+LL ML ++G I+T DAMGCQK IA++ Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN-PEHDSYAMSEKSHGREEIRLHIVCD 242 I + DY+ AVK NQ L + + F ++N H + + HGR E R + Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYSTI- 119 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V D+L+ W L + + S R + RY+I S + A++F A+R H Sbjct: 120 VGDDLLAGITGWDNLNAIGMVESKREV----GNTISNEKRYFIMSINGHAQRFGDAVREH 175 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK + A Sbjct: 176 WGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRKMAG 234 Query: 363 MDRNYLASVLAGS 375 D ++L VL G+ Sbjct: 235 WDNSFLIKVLTGN 247 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 256 bits (654), Expect = 9e-67, Method: Composition-based stats. Identities = 106/286 (37%), Positives = 153/286 (53%), Gaps = 9/286 (3%) Query: 9 HISIIPDYRQA-WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG 67 +IPD R+A H LS IL + + AV+SG + WE + +FG T +L+Q+ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCH-SSNDKDVIAIDGKTLRHSYDKSRRRGAI 126 IP HDT RV S I P F F +W + D +A+DGKT+R S+ S R A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-AL 135 Query: 127 HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK 186 H++ A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 QGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDE 246 GGDY+ A+KGNQ L+ + +P+ EK HGR E R V D D Sbjct: 196 AGGDYVLALKGNQSALHDDVRLFMETQADRHPQG-QAEAVEKDHGRIETRRIWVNDEIDW 254 Query: 247 LIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 L +W GLK L + S R + + R +I+S Sbjct: 255 LTQ-KPDWPGLKTLVMVESRREL----NGQVSCERRCFITSHTADP 295 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 240 bits (613), Expect = 5e-62, Method: Composition-based stats. Identities = 90/363 (24%), Positives = 146/363 (40%), Gaps = 48/363 (13%) Query: 29 ILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L+ + G + + +THL+ L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 CFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKK 148 F+ W+ + S + +A+DGK L + +K++ +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRN-THLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEE 208 +NEIT IPELL +LDI G I+T DA+G Q I E+I +QGG + VK NQ + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPLKELNNPE-----------------HDSYAMSEKSHGREEIRLHIVCDVPDELIDFT 251 E + + ++ EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQ 239 Query: 252 FEWKGLKKLCVAVSFRSIIAEQ----------------------------KKEPEMTVRY 283 EW ++ + R + ++ Sbjct: 240 KEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTA 299 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 IS LTAE+ + R HW +EN+LH LD ED ++ S IR A NI Sbjct: 300 LISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSR--NNLSLIRKYAYNI 357 Query: 344 LTN 346 L Sbjct: 358 LRL 360 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. Identities = 79/376 (21%), Positives = 130/376 (34%), Gaps = 31/376 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +++ L+E + +PD R+ V L +L L + AV GA + +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH--------------SSNDKDVIA 107 P T RV+ P E W + VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKTLRHSYDKSRRRGAI--HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML--- 162 DGKT+R + ++ V+ V+ + +EI A+ ++ L Sbjct: 151 ADGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVA-CEPVNDGDEIGAVRTVMGRLADR 209 Query: 163 --DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH 220 + G ++ TDA Q + E++ GG +L VK NQ R+ P ++ Sbjct: 210 WGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRILAKVR-ALPWAQVRA--- 265 Query: 221 DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT 280 K+HGR E R V P + + K+ R Sbjct: 266 -QDTCRGKAHGRAETRTVRVVQAPTHVDLALAGTAQVIKITRHTRRR-PHPGAPAASTRE 323 Query: 281 VRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIR 337 Y ++S A +R+HW +EN++HW D +ED R GN + +R Sbjct: 324 NAYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPINLACLR 383 Query: 338 HIAINILTNDKVFKAG 353 + AI Sbjct: 384 NTAITRHRAHGASNIA 399 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 89/247 (36%), Positives = 141/247 (57%), Gaps = 3/247 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L++H + D R +HKL I+++ + A+I GA+S+ +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K +++ EITAIP L+ +L++ G ++T DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 +G DY A+KGNQ L + +E F E EH + EK R E+ + Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAYRTE 248 Query: 243 VPDELID 249 Sbjct: 249 QERLWSH 255 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 236 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 89/386 (23%), Positives = 155/386 (40%), Gaps = 29/386 (7%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWE----DIEDFGETHLDF 57 ++ L+ + I D R+A + LS +L + A ++GA + DFG+ L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQ---YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC--HSSNDKDVIAIDGKT 112 L + P I + + A F W+ + V+A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKII-T 170 LR ++ + +R + +SA LV GQ++ +NEIT + LL L DI G ++ T Sbjct: 141 LRGAWSEGNKRVTL--LSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 TDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSH 230 DA+ Q + A + + G DY VKGNQ L + F + + + E+ H Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLY---RKTFEQTLPLLQKPPQHEVEERGH 255 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD- 289 GR + + ++ R + + ++S Sbjct: 256 GRIKKWQAWTTEAKGIGFPEVATAAVIR--------RDEFDLKGIRVSREYAHILTSVAG 307 Query: 290 --LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 TA IR HW +EN++H+ D ED + GN+ + R++AI I+ + Sbjct: 308 NRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGIIRRN 367 Query: 348 KVFKAGLRRKMRKAAMDRNYLASVLA 373 + K ++ + A DR+ + +LA Sbjct: 368 GIRK--IKETLEYIAGDRDRVLPLLA 391 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 233 bits (595), Expect = 6e-60, Method: Composition-based stats. Identities = 89/207 (42%), Positives = 135/207 (65%), Gaps = 1/207 (0%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+SW ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + ++ Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF 210 I K+ DY+ AVK NQ +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 83/398 (20%), Positives = 158/398 (39%), Gaps = 31/398 (7%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 E++ L + ++ +PD R + H+L IL L+ AV +G +S E+I + + Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD---VIAIDGK 111 P DT+ RV+S + + + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGK 167 TLR + R A H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGR--APHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQ-KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMS 226 ++T DA+ + A+ I + G ++F VK N L+ + ++ ++ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPIG----HSAE 271 Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT-----V 281 ++HGR E R + + + + + ++ V + T Sbjct: 272 GRAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIPSTVT 331 Query: 282 RYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 + ++S L A R HW +ENK+HW DV ED ++R G + + +R+ Sbjct: 332 VHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRN 391 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 + I ++ + + + D L ++L Sbjct: 392 LIIGLIRLAGHNRIAPTIRRIRH--DNALLLAILTLDN 427 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 228 bits (580), Expect = 3e-58, Method: Composition-based stats. Identities = 89/240 (37%), Positives = 136/240 (56%), Gaps = 8/240 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L IL++ +FA ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + + K +I IDGKT+R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H++SA+S +GQ +KSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD---SYAMSEKSHGREEIR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 80/383 (20%), Positives = 148/383 (38%), Gaps = 17/383 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD+R V ++L+ +L L + I+G ++ + ++ + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS----SNDKDVIAIDGKTLRHSY 117 F +P TI R+V P + + W +A DGK ++ + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DK--SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 + V+ A +G + +EI ++ L+N + ++TTD + Sbjct: 144 SRPPQGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLVTTDCLH 202 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 + +A I+ +GG +LF++KGNQ + P E N + EK+HGR E Sbjct: 203 AHEPLARAIRAKGGHWLFSIKGNQPTVRAKL-AGLPWDEFGN----QHVTREKAHGRIEE 257 Query: 236 RLHIV-CDVPDELIDFTFEWKGLK-KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 R L+ F + +K + E + +S+ + Sbjct: 258 RALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTDQASPA 317 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A R HW VE +H D M+ED IR NAA ++ R I+ L Sbjct: 318 QLARWARGHWTVEA-IHHVRDRTMDEDRHTIRTKNAALNWAIARDTTISALRLAGYKN-- 374 Query: 354 LRRKMRKAAMDRNYLASVLAGSG 376 +R+ R D + ++A + Sbjct: 375 IRQARRATIRDPGLVLQIIALTS 397 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 84/324 (25%), Positives = 135/324 (41%), Gaps = 27/324 (8%) Query: 50 FGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQR------------A 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ + NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKS 229 T DA+ C+ D A I GGDY A+K NQ L + E + +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 H R E R + V D ++ GL+ + + VRY++ S Sbjct: 206 HDRCERRRACIVAVND------IDFPGLQAIGSVEA---TSRHADGRLTSHVRYFLLSTI 256 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 257 MSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHPD 316 Query: 350 FKAGLRRKMRKAAMDRNYLASVLA 373 KA +RRK++ A D +L S++A Sbjct: 317 -KASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 226 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 84/338 (24%), Positives = 136/338 (40%), Gaps = 22/338 (6%) Query: 28 GILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFH 87 +L + + A +G + + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ECFINWMRDCHSSNDKDV---IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 ++ +S+D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGAL--RAKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDKKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRL 202 +KSNEI + LL +L ++T DAM Q A+ I YL VK NQ ++ Sbjct: 120 VAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 P E+ D + HGR E R + + + K++ Sbjct: 180 LARI-TALPWAEVPAAATD----DSRGHGRVETRTLQIITAARGIG-----FPYAKQIIR 229 Query: 263 AVSFRSIIAEQKKEPEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNE 319 R I A + + V Y I S + T +R H +EN LHW DV +E Sbjct: 230 ITRERLITATD--QRSVEVVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTFDE 287 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 D + GN A++ + +R+ AIN+ + + Sbjct: 288 DRQRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACR 325 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 226 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 84/380 (22%), Positives = 154/380 (40%), Gaps = 23/380 (6%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDF------ 57 L+ ++ +PD R V H L +L + AV++GA S + ++ Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 58 -LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--NDKDVIAIDGKTLR 114 + + P T R+++ + + W+ C + + V ++DGKTLR Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAM 174 S +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 147 GS---GPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIA-EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGRE 233 Q++ A + + Y+F VK NQ RL + + P ++ S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKT-LPWTKIPI----QDETSTRGHGRY 258 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 +IR L ++ L + ++ + + +S+A Sbjct: 259 DIRRLQAVTCTGPLALDFPHA--VQALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGPA 316 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A +R HW +E LH D ED ++R GNA + +R+ AIN+L + Sbjct: 317 ELADWLRGHWAIET-LHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGI--TT 373 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + +R + + +L Sbjct: 374 IAAALRHNSRNPYRPLQLLG 393 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 82/273 (30%), Positives = 135/273 (49%), Gaps = 9/273 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 ++ L+E + + D R K+EH+L IL++ + AV++ AE++EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH------SSNDKDVIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + + IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPE--HDSYAMSEKSHGREE 234 ++++A+ I +G YL +K NQ +++ F + + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR 267 R C W GL + + + R Sbjct: 242 RRRVFACPDAGCFT-TLRGWPGLTTVLASETIR 273 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 218 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 88/249 (35%), Positives = 126/249 (50%), Gaps = 14/249 (5%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS------YDKSR 121 IP HDT R S I P F F NW++ V+AIDGK +R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 + I + +Y+ A+K N+ + L K + + K+ + HGR E R Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCDV-PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFA 296 V F + GLK + S R+I+A E VRYY++S D T E+ A Sbjct: 183 TVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPEEIA 240 Query: 297 TAIRNHWHV 305 +AIR HW + Sbjct: 241 SAIRQHWSI 249 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 78/418 (18%), Positives = 140/418 (33%), Gaps = 62/418 (14%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVIS-GAESWEDIEDFGETHLDF------ 57 L++ ++I D R H L+ IL + A ++ G + IE + + Sbjct: 29 DLIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLH 88 Query: 58 -LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD------------ 104 + + P TI RV++ + + C ++ + Sbjct: 89 IWRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRR 148 Query: 105 ---------------------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 A+DGK L+ + R +H+IS + + + V Q Sbjct: 149 TEREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQR 206 Query: 144 KTDKKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAEK-IQKQGGDYLFAVKGNQ 199 + KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK NQ Sbjct: 207 QIPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQ 266 Query: 200 GRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + + + + + HGR E R+ ++ + Sbjct: 267 PTLHATAITALTGTDTDFAAVT-HRETHRGHGRTEYRILRTAPA------DGIDFPYAAQ 319 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADL---TAEKFATAIRNHW-HVENKLHWRLDV 315 + + R V Y I+ A +R HW +EN +H DV Sbjct: 320 VFRVLRHR--GGLDGIRHSKEVCYGITDLTARQAGPAHLAAYVRGHWKAIENGVHHVRDV 377 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 ED C+ R + R++A L + R+ D + + Sbjct: 378 TFAEDACQARTATLPRALAAFRNLATGTLRRAGHVN--IAHARREHGYDHQRVLDLFN 433 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats. Identities = 73/388 (18%), Positives = 135/388 (34%), Gaps = 46/388 (11%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAV-ISGAESWEDIEDFGETHLDFLKQYGD 63 + E ++ IPD+R A + + L + + + AV +G + + ++ + Sbjct: 23 GIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLR 82 Query: 64 FEN------GIPVHDTIARVVSCISPAKFHECFIN-------------------WMRDCH 98 +P TI R ++ + ++ Sbjct: 83 LPWNPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAGD 142 Query: 99 SSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPEL 158 + A+DGKT R + + +H++ + ++GQ + D KSNE T L Sbjct: 143 QAVPVRAYAVDGKTSRGAKRADGSQ--VHLLGVAAHGAGALLGQREIDAKSNETTEFRAL 200 Query: 159 LNMLDIKGKIITTDAMGC-QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 L L++ G ++ DA+ + ++ + ++ YL K NQ +L P E+ Sbjct: 201 LAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLRAFL-AALPWTEIPT 259 Query: 218 PEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEP 277 + ++ HGREE R V V ++ + +R ++ + Sbjct: 260 ADL----TRDRGHGREETRTLKVATVT------HLDFPHAAQAIRIRRWR---RQKGQPA 306 Query: 278 EMTVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 Y I+ A A R WH+E K H+ DV ED R G + + Sbjct: 307 SHETIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLA 366 Query: 335 GIRHIAINILTNDKVFKAGLRRKMRKAA 362 R + L R+ K A Sbjct: 367 LFRATVADTLRRAGHRSVPACRRAHKTA 394 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 210 bits (535), Expect = 6e-53, Method: Composition-based stats. Identities = 77/237 (32%), Positives = 120/237 (50%), Gaps = 7/237 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+SW +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + + ++SA+S + + +GQ+K D KS+EITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 DI + I +Y+ A+K N+ + + ++ + + + R Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRDEIINRVIRHVSEKCRTWK 239 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 203 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 85/404 (21%), Positives = 145/404 (35%), Gaps = 61/404 (15%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVI-SGAESWEDIEDFGETHLDFLKQ 60 +++ L+ + D R A V +++S +L L + A+ +G +S ++ Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 61 YGDFEN-------GIPVHDTIARVVSCISPAKFHECFINWMRDCHSS------------- 100 IP T+ V+ + P + + +R S+ Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGG 149 Query: 101 -------------------NDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 + + IA+DGK LR + R + V+SA + + Sbjct: 150 IEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLA 207 Query: 142 QIKTDKKSNEITAIP---ELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + K+NEI + L+ D+KG ++T DA+ Q+D A + ++G YL +K N Sbjct: 208 SREIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNN 267 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q + P KE+ D + HGR E RL V V L + Sbjct: 268 QRGQARQLH-ALPWKEIPVIHRD----DARGHGRHEQRLVQVVTVNGLL------FPHAA 316 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT---AIRNHWHVENKLHWRLDV 315 ++ R + +K Y I+ A R HW VEN +HW DV Sbjct: 317 QVLRIQRRRRLYGAKKW--SSETVYAITDLPAEEASAAEIASWARGHWTVENTVHWCRDV 374 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 NED ++R N + + +R + L R+ Sbjct: 375 TFNEDKSQVRTHNTPSVLAAVRDLIRGALKLAGYVNTAAGRRAH 418 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. Identities = 77/225 (34%), Positives = 105/225 (46%), Gaps = 9/225 (4%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPEHDS--YAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIA 271 E + +K HGR E R+ V + L W GL++L + R I Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWLASTGQHWAGLQRLVMLERTRQI-- 118 Query: 272 EQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 ++ YYISS + A + A IR HW +EN+LHW LDV ED IR AA Sbjct: 119 --GQKVTTERCYYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 LFSGIRHIAINILTND---KVFKAGLRRKMRKAAMDRNYLASVLA 373 + +R I +N+ + K L+ AA D +L Sbjct: 177 NMASLRKITLNLARLAQNRQPKKVSLKNIRNLAAWDTAMRDDILG 221 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 89/237 (37%), Positives = 116/237 (48%), Gaps = 9/237 (3%) Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 + T+ KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK NQ +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFEEKFPLK---ELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 A EL H +Y HGR + R H V VP EW +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVP-PGFAAKGEWPWIKA 119 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + AV I VRYY+ S L+ ++F +R HW +E +HW LDV E Sbjct: 120 IGTAVR---ITTHADGTQSDEVRYYMLSRFLSGKRFGEVVRGHWGIE-SMHWVLDVTFGE 175 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 D + R+ A S +R AI +L K +R KM + MD ++L VL G Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHP-EKDSIRGKMIRCLMDTSFLNEVLTLQG 231 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. Identities = 85/224 (37%), Positives = 115/224 (51%), Gaps = 11/224 (4%) Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMH---SLVIGQIKTDKKSNEITAIPELLNMLDIKGK 167 K + S + S +LV+GQ K + KSNEITAIP L+ ML+I+ Sbjct: 3 KGFQRSVKTEEKHKPSQKKSQVLKDSLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESS 62 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPEHDSYA 224 IIT DAMGCQK+I I+K+ GDY+ +K NQ L + +E F +E + EH Y Sbjct: 63 IITIDAMGCQKEITSLIRKKKGDYIITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQ 122 Query: 225 MSEKSHGREEIRLHIVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY 283 E H R E R I V + W LK + + S R + + VR+ Sbjct: 123 EIETGHHRIEKREVIAVSVSSLPCLHNQDLWTELKTVVMVKSERRLWN----KTTTEVRF 178 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRG 327 YISS + ++K ATAIR+HW +EN LHW LDV +ED +IR Sbjct: 179 YISSVEKNSQKIATAIRSHWEIENSLHWTLDVTFSEDKSRIRTR 222 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. Identities = 102/197 (51%), Positives = 133/197 (67%), Gaps = 13/197 (6%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M L L +H + + D RQA KV +KL +L L + AVISGAE WE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TD+KSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVKG 197 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVKK 184 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. Identities = 77/179 (43%), Positives = 107/179 (59%), Gaps = 3/179 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+ +PD R+ + H+L +LL I VISGAESW + + + LD+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 +GI HDT RV S + ++F CF+ W+ S + +AIDGK LR S+D R Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHD--GARS 123 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+G IT DAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPARH 182 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 65/229 (28%), Positives = 106/229 (46%), Gaps = 5/229 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD R ++ L G+L L + AV+ G + E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H + + +A+DGK L S D + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRH-PDGWEHLALDGKRLCGSRD--GQV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD-IKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHG 231 +Q +GGD + K NQG L E F + G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRG 229 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 56/194 (28%), Positives = 86/194 (44%), Gaps = 7/194 (3%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH-DSYAMSEKSHGREEIRLHIV 240 EKI ++ GDY+ +K N + E F + PE +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + +YISS D+ + A +R Sbjct: 61 LKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLTA 174 Query: 361 AAMDRNYLASVLAG 374 A + +L G Sbjct: 175 AGWSDEFRDELLLG 188 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 66/218 (30%), Positives = 100/218 (45%), Gaps = 3/218 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + L E +S IPD R + H L +L L A++ G S + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRDG 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 121 QVPGQ--HLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 +A + G DY+ K NQ L + E ++ Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFEDAAR 216 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. Identities = 71/273 (26%), Positives = 108/273 (39%), Gaps = 13/273 (4%) Query: 58 LKQYGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 L + D + + ++ + F S +K + DGK LR S Sbjct: 8 LCAFLDIPETTVVSRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGS 67 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDK-KSNEITAIPELLNMLDIKGKIITTDAMG 175 + ++RG V+ I Q D K +EI + LL+ D+ + IT DA+ Sbjct: 68 IESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALH 126 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 E I K GG +L +K NQ L + P D + +HGR E Sbjct: 127 LCPSTTEMITKAGGVFLIGLKENQPTLLAHM------TDCALPPIDQKTTFDFNHGRVEQ 180 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + + DV + D ++ K+L R I ++ + V YYIS+ E Sbjct: 181 RKYWLYDVSKQGFDPRWDNTAFKRLVKVQRTR--INQKNAKISREVSYYISNETA-KEGI 237 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 A+RNHW VE H DV +NED K ++ Sbjct: 238 FDAVRNHWSVEVNNH-IRDVTLNEDQLKSKKRQ 269 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 169 bits (427), Expect = 2e-40, Method: Composition-based stats. Identities = 75/284 (26%), Positives = 120/284 (42%), Gaps = 15/284 (5%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + L+ + + D R H L +L L + A + GA++ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND----KDVIAIDGKTLRHSYD 118 +G P HDT +RV + P + F +M + K V+AIDGK+LR YD Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R ++S + I ++ +EI A +L L +KG +T DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLH 238 +A+ + Y +K N G L +A E F + + E+ HGREE R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGFA----AVTDLAVFETRERGHGREEQRRA 234 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVR 282 V V + GLK + + R +PE VR Sbjct: 235 SVLPVDR--LVKRPSLPGLKAIGRIEAVR---TGANGKPEQAVR 273 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. Identities = 56/228 (24%), Positives = 102/228 (44%), Gaps = 14/228 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +++ LM+ +S D R+ + H ++ + A++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSS----NDKDVIAIDG 110 F P T+ R + I + W + D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIT 170 K +R + + IH ++AF +V+ Q D+K+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGASKAKGGQ-KIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKIQK-QGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 DA+ Q + A I + + DY+F VK NQ + + E P + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIES-LPWEAFPP 446 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 81/194 (41%), Positives = 119/194 (61%), Gaps = 2/194 (1%) Query: 94 MRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 ELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + D+ EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 121 RRAPIDRDTC-QIEKQKGRVEARTYHVLSASDLIRDFST-WSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKEPEMTVRYYISS 287 + + + + + S Sbjct: 179 RARVGVPLLHKVQS 192 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. Identities = 64/189 (33%), Positives = 94/189 (49%), Gaps = 8/189 (4%) Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCD 242 I + GDYL VKGNQ +L +A E F + + + D A+ E+ HGR ++ V Sbjct: 1 MIIAKKGDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLS 59 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 I +W + S R + + E ++ YYI+S LTAE+ A ++R Sbjct: 60 AKG--IINPGDWPNCVTIGRIDSMRVVDEK---ESDLERCYYITSRALTAEQLAASVRAR 114 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRK 360 W VEN+ HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + Sbjct: 115 WGVENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKG 174 Query: 361 AAMDRNYLA 369 AA D Sbjct: 175 AARDDGVRE 183 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 60/264 (22%), Positives = 112/264 (42%), Gaps = 22/264 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 + L+E ++ +PD R+ V ++ + +L + + A++SGA S+ I ++ + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND-------------KDVIAI 108 +P TI RV+ + A W++ + D + V+A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGK 167 DGK +R +H++ +V+ Q+ D+K+NEI +L+ + D+ Sbjct: 167 DGKAMR---ATRHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSE 227 +IT DAM Q A+ + +G L VK NQ ++ + P K++ + + Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKT-LPWKDVPVG----HTTTG 278 Query: 228 KSHGREEIRLHIVCDVPDELIDFT 251 + HGR E R VP L Sbjct: 279 RGHGRIETRTLKAVTVPAGLGFPH 302 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 164 bits (414), Expect = 6e-39, Method: Composition-based stats. Identities = 62/189 (32%), Positives = 86/189 (45%), Gaps = 10/189 (5%) Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPEHDS---YAMSEKSHGREEIRLHIVCDVPDELI 248 + AVK NQ L E + S + +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFE 60 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W GL+ + + S R I RYY+SS A + A A+R HW +E+ Sbjct: 61 PDL--WPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAHWGIES- 113 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA +Y Sbjct: 114 MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAGASDDYR 173 Query: 369 ASVLAGSGL 377 A +L L Sbjct: 174 AQLLGLKTL 182 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 55/164 (33%), Positives = 86/164 (52%), Gaps = 3/164 (1%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVI 106 + + L+ + NG P DT RV+ I P + C + ++ S + I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKG 166 AIDGK L+ S K+ H++SA+ L + Q +K NE+ AIPE+L+ LD+ G Sbjct: 61 AIDGKRLKGSKKKTGS---THILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +I+ DAMG Q +IAE+I + DY+ ++KGNQ L + + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 157 bits (396), Expect = 8e-37, Method: Composition-based stats. Identities = 57/229 (24%), Positives = 96/229 (41%), Gaps = 19/229 (8%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGI-- 68 + + D R+A + H +LL+ + V++G S+E I + + + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 69 -----PVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRR 123 P TI R++S P + ++ + IAIDGKT+R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYIVAH---SSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTDK-KSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHG 231 +I+++GGDY+F VK N+ L +P D Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDDLLRTWPDRDP 446 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 55/187 (29%), Positives = 93/187 (49%), Gaps = 4/187 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFL-K 59 + L+ + +PD R+A + L +L+ T+ A++SGA S+ I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND---KDVIAIDGKTLRHS 116 +G PV +T+ V+ + + F + + K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIAEK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 153 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 51/196 (26%), Positives = 85/196 (43%), Gaps = 9/196 (4%) Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHI 239 ++E+ ++ DY+ A+KGN + + ++ F + + +K HGR E R Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSPVTST--RSVHTTFDKGHGRIE-RRIY 57 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 D + EWK L + S +K + +RY+I+S ++FA + Sbjct: 58 TLDTNIGWFEDKKEWKHLAGFGMVDSMV----TRKGKECREIRYFITSVT-DVKQFAKGV 112 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 +HW +EN LHW LDV+ +D+C + NAAE + IR I N + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKR- 171 Query: 360 KAAMDRNYLASVLAGS 375 D + A +L Sbjct: 172 ACIYDDEFRAQILFSC 187 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 150 bits (378), Expect = 8e-35, Method: Composition-based stats. Identities = 57/173 (32%), Positives = 87/173 (50%), Gaps = 11/173 (6%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEI 235 K + I + G DY+ AVKGNQ RL++ + +E+ R Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQIKLTTE----QRLPVSLDITTERRSDRITT 56 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V D L +++W+GL++L F + +P + YYISS + A +F Sbjct: 57 RSVSVFDD---LSGISYDWEGLQRLVKVERF----GTRAGKPYHQIVYYISSLTINAAQF 109 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 A IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL + Sbjct: 110 AQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILRYNG 162 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 63/326 (19%), Positives = 115/326 (35%), Gaps = 43/326 (13%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 + G P ++T+ +++C+ WM + A DGK L S Sbjct: 13 RWRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVLGGS 71 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K A+H + + + + Q + + A+ LL + G++++ DA Sbjct: 72 --KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFL 128 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP------------------------- 211 + + I ++ G+YL VKG+Q ++ P Sbjct: 129 NAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPR 188 Query: 212 --------LKELNNPEHDSYAMSEKSHGREEIRLHIVCDV--PDELIDFTFEWKGLKKLC 261 +EL E+S GR EIR V D + + W+ + ++ Sbjct: 189 RKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVTQIG 248 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 + + +SS T +F +IRNHW +EN++H D M ED Sbjct: 249 GLRRWCRRRHADLW--TVEEVTVVSSRQRTPAQFLASIRNHWTIENQVHRPRDGSMQEDR 306 Query: 322 CKIRRGNAAELFSGIRHIAINILTND 347 R + + R++ IN++ Sbjct: 307 LHGRA--IGVILAVCRNVVINLIRRH 330 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 149 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 55/227 (24%), Positives = 104/227 (45%), Gaps = 15/227 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG----ETHLDF 57 +++ L + +PD R +H L IL + + AV++ A+S+ + ++ + L Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 58 LKQYGD---FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLR 114 ++ + P T+ RV+ + W+ + +A+DGK L+ Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAWL---LGIAGFEAVAVDGKVLK 335 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAM 174 + + + +H++SAF I Q + +K+NEI + LL +DI+ K++T DA+ Sbjct: 336 GAVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADAL 393 Query: 175 GCQKDIAEKIQK-QGGDYLF-AVKGNQGRLNKAFEEKFPLKELNNPE 219 Q+ A + + + DYLF AVKGNQ +L + P + Sbjct: 394 HTQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLI-CLPWGDFPPQR 439 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 50/206 (24%), Positives = 80/206 (38%), Gaps = 13/206 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +L +++ GK IT DA+ QK +AE I + YLF VK NQ L + F Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEH-- 59 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 E D HGR + R +E + E+ + + + + Sbjct: 60 --RKEPDYCLQDPPGHGRIDTRSIWTTTELNEYL----EFPHVGQAFCI--HKKSYDPKT 111 Query: 275 KEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 + Y ++S + R HW +EN H+ LD +ED +IR GN Sbjct: 112 NKVCENTFYGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRK 357 + +R AI +L + V + + Sbjct: 172 NTNRLRGFAIGLLKSKGVKDIAQKVR 197 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 148 bits (373), Expect = 4e-34, Method: Composition-based stats. Identities = 60/194 (30%), Positives = 84/194 (43%), Gaps = 11/194 (5%) Query: 186 KQGGDYLF--AVKGNQGRLNKAFEEKFPLKELNNPEHDS---YAMSEKSHGREEIRLHIV 240 +G + +G L A + F + + +K HGR E R Sbjct: 91 DRGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTA 150 Query: 241 CDVPDEL--IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D L + WK + + S R I + E RY ISS +E+ A Sbjct: 151 AGDLDWLATLGLKERWKKITSVAGIDSSRVI----GSKTETDRRYVISSLPADSERILHA 206 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R HW +EN LHW LDV ED C IR NAA FS +R A+N+ D GL +K Sbjct: 207 VRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFRADHSRAMGLPKKR 266 Query: 359 RKAAMDRNYLASVL 372 + AA + +YLA++L Sbjct: 267 KAAAWNPDYLANIL 280 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 55/142 (38%), Positives = 78/142 (54%), Gaps = 3/142 (2%) Query: 101 NDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 VIAI+GK+LR + + A+H +SA++ + L +GQ+ +KSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH 220 L ++G ++T DA+GCQ +AE+I GGDY+ AVK NQ L A + F Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 221 DS---YAMSEKSHGREEIRLHI 239 + +K HGR E R Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. Identities = 57/245 (23%), Positives = 95/245 (38%), Gaps = 17/245 (6%) Query: 28 GILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFH 87 +L + + A + + + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ECFINWMRDCHSSNDKDV---IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 ++ +S+D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGAL--RAKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDKKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRL 202 +KSNEI + LL +L ++T DAM Q A+ I YL VK NQ ++ Sbjct: 120 VAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 P E+ D + HGR + R + + + K++ Sbjct: 180 LARI-TALPWAEVPAAATD----DSRGHGRVKTRTLQIITAARGIG-----FPYAKQIIR 229 Query: 263 AVSFR 267 R Sbjct: 230 ITRER 234 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 144 bits (363), Expect = 4e-33, Method: Composition-based stats. Identities = 60/398 (15%), Positives = 110/398 (27%), Gaps = 58/398 (14%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + + D R+ +++ I + F ES E ++ + +Q Sbjct: 38 VYGFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLV 93 Query: 63 DFENGIPVHDTIARVVSCIS---PAKFHECFINWMRDCHSSN-----DKDVIAIDGKTLR 114 +P HDT+ + + + H C I ++ V AIDG L Sbjct: 94 PKNIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELF 153 Query: 115 HSYDKSRRRGAI--HVISAFSTMHSLVIGQIK------------------TDKKSNEITA 154 H+ H H++V+ Q DK E T Sbjct: 154 HTKAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTV 213 Query: 155 IPELLNML-DIKGK---IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 L+ + + GK + T DA+ + + G + +K + R+ K F Sbjct: 214 AQRLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACF 273 Query: 211 PLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSII 270 + ++ + V + +W ++ V Sbjct: 274 ANRLPDSTWEERDGKGNT------------VYVQAWDEEGLAQWPQVRVPMRIVKIIRHT 321 Query: 271 AEQKKEPEMTV----------RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNED 320 + E V SS + A W +EN L D Sbjct: 322 NKTVIEANKEVFVTDVVERWIATTCSSEKADTQTIAQIAAARWDIENIGFRNLKTFNALD 381 Query: 321 DCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 C + A + G + +A N+ R Sbjct: 382 HCFVHDSVAIKAMIGFQVLAFNLKRLFFFHHLPASRHR 419 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 140 bits (352), Expect = 9e-32, Method: Composition-based stats. Identities = 58/145 (40%), Positives = 78/145 (53%), Gaps = 7/145 (4%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH---DSYAMSEKSH 230 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F N E D KSH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL 290 GR E R V L D + W+GL+ + + S R++ K++ + RYYISS Sbjct: 61 GRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTL----KEKTTIEHRYYISSTMA 116 Query: 291 TAEKFATAIRNHWHVENKLHWRLDV 315 TA + R HW +EN LHWRLD+ Sbjct: 117 TAAYLLNSSREHWGIENSLHWRLDI 141 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. Identities = 51/226 (22%), Positives = 90/226 (39%), Gaps = 11/226 (4%) Query: 20 WKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGI-PVHDTIARVV 78 H L +L L AV+ + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 I P + W+ + + + +A+DGK LR S D H ++A++ + Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGDVPG--PHRVAAYAPHAAA 119 Query: 139 VIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 V+GQI+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ +G Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQ 179 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAMSEKS--------HGREEIR 236 R + ++ + +S R R Sbjct: 180 PTRPGGRHRGRVGVRGRRPRARGGHVPLSRSRRPSNWGARPRRWTR 225 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 49/180 (27%), Positives = 85/180 (47%), Gaps = 4/180 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD+R A + L +LLL I +S + +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 EN-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRR- 122 P T RV+ I F NW+ ++D + +DGK+++ + + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 123 -RGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + I+V+S FS + I Q +K+ +EI + LL LD++G + T D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 66/141 (46%), Positives = 91/141 (64%), Gaps = 4/141 (2%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D R IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHD--GARSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD--SY 223 G IT DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + E + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AMSEKSHGREEIRLHIVCDVP 244 + ++K+HGR E R + + Sbjct: 119 SQTDKNHGRIETRRCVATNDV 139 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. Identities = 49/155 (31%), Positives = 75/155 (48%), Gaps = 6/155 (3%) Query: 221 DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT 280 +SY EK HGR+E+R V +W +K + V RS+ K + Sbjct: 15 ESYITEEKGHGRKEVREVYVLPAAFS-EALRQKWCLVKSIVAVVRDRSV----KGKGSYE 69 Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I G++A + R Sbjct: 70 TSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRIYAGDSALNMACCRRFV 129 Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 N+ + + RKM +AA +++Y VL S Sbjct: 130 QNLFRKSEG-NLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 72/315 (22%), Positives = 121/315 (38%), Gaps = 41/315 (13%) Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRD-------CHSSNDKDVIAIDGKTLR 114 G P T+ R+++ SPA E ++D + V++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 HSYDKSRRRGAIH----------------VISAFSTMHSLVIGQIKTDKKSNEITAIPEL 158 D + +GA + S+ +GQ K E TA L Sbjct: 153 SRTDGEKVKGAQQSAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFRRL 212 Query: 159 L----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 L L + +I+T DA C ++ AE + G Y+F +K NQ L+ + + Sbjct: 213 LPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHDIARDY-GQYD 271 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 L P +E+ G +R DV + L +C + R + Sbjct: 272 LGTPLAR---TAERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDR-----RG 323 Query: 275 KEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA- 330 + + RY+++S LT ++ +R HW +EN HW +DV++ ED+ + + A Sbjct: 324 EIVAVEQRYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASRAS 383 Query: 331 -ELFSGIRHIAINIL 344 E S +R I N + Sbjct: 384 IETVSWLRLIGYNAV 398 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 58/180 (32%), Positives = 87/180 (48%), Gaps = 5/180 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK-QY 61 + L + + IPD+R+A L +LL +I A++SGA S+ I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSR 121 G P + +I + + F VIA+DGKTLR S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARLAE--GAAVIALDGKTLRGSLDRFE 118 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD--KKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A V+SAF+T +V+GQI + K +EI A L+ L + G++ T DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 59/160 (36%), Positives = 89/160 (55%), Gaps = 3/160 (1%) Query: 97 CHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIP 156 + D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT++KSNE TAIP Sbjct: 1 MAARIPGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIP 60 Query: 157 ELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 +L +L ++ +T DA+G Q+DIA++I + DYL VK NQ L++ + + E Sbjct: 61 KLFTLLALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAK 120 Query: 217 NPEHDSY---AMSEKSHGREEIRLHIVCDVPDELIDFTFE 253 D HGR + V L + Sbjct: 121 GFTEDFTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALADK 160 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 135 bits (339), Expect = 3e-30, Method: Composition-based stats. Identities = 45/190 (23%), Positives = 72/190 (37%), Gaps = 6/190 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L+ H+ IPD R V +LL+ + ++S ES D+E F H L + Sbjct: 12 DLISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGI 71 Query: 65 E-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK--DVIAIDGKTLRHSYDK-- 119 E P + A +W D + DGKTLR S + Sbjct: 72 ELKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTS 131 Query: 120 SRRRGAIHVISAFSTMHSLVIGQ-IKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 I ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 132 GGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 Query: 179 DIAEKIQKQG 188 Q +G Sbjct: 192 AFFGSSQSRG 201 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 134 bits (337), Expect = 5e-30, Method: Composition-based stats. Identities = 42/113 (37%), Positives = 67/113 (59%), Gaps = 4/113 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W LK + + S + + + RY+ISS D E+ A ++R+HW +EN LHW L Sbjct: 15 WSNLKSVGMVESI----GQVDDKTTVETRYFISSLDSNGEQLANSVRSHWAIENSLHWVL 70 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 DV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 71 DVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 55/167 (32%), Positives = 84/167 (50%), Gaps = 13/167 (7%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K + SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPEHDSYAMSEKSHGREEIRLHIVCDVP 244 DY+ +K NQG L ++ E+ F +H +Y E HG EIR P Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIRNFGFQLDP 120 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 D W LK + + + + + RY+ISS D Sbjct: 121 DS------VWSNLKSVGMVEPI----GQVDDKTTVETRYFISSLDSN 157 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 59/201 (29%), Positives = 92/201 (45%), Gaps = 13/201 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDTAKNSP 61 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 LN ++ ++K HG E H + + +W GL++ ++ Sbjct: 62 LNA-----WSWTQKGHGHE---SHCRLKIWEATESMKMQWAGLERFISIRRQGFRHHKKF 113 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 114 DSTT----YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINILTNDKVFKAGLR 355 +R+IA N L V L+ Sbjct: 170 ILRNIAFN-LRLGTVSNPSLK 189 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 130 bits (327), Expect = 8e-29, Method: Composition-based stats. Identities = 42/111 (37%), Positives = 61/111 (54%), Gaps = 4/111 (3%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHW 311 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW Sbjct: 5 ENWEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHW 60 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 LD+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 61 CLDIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 77/187 (41%), Gaps = 17/187 (9%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK--------FPLKELNNPEHDSYAM 225 M Q D+ +Q++GGDY+ K NQG L E FP + D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 E S G + + L ++ W G++++ R + + + V Y I Sbjct: 61 CEVSKGHGWVERRTMTS-TIWLNEYLTRWPGVQQVFRLTRTRQV----GGKTTVEVVYGI 115 Query: 286 SSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS + R HW +E++ H D + ED C++RRG A + + +R++A+ Sbjct: 116 SSLSSVAAAPDALLRYTRTHWGIESR-HHIRDATLGEDRCRVRRGAAPRVLAVLRNVAVY 174 Query: 343 ILTNDKV 349 +L Sbjct: 175 LLRRLGT 181 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 125 bits (313), Expect = 3e-27, Method: Composition-based stats. Identities = 68/367 (18%), Positives = 118/367 (32%), Gaps = 39/367 (10%) Query: 10 ISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIP 69 + +PD R L+ IL + +++GA S + E+ + ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCISPAKFHECFIN-----WMRDCHSS--NDKDVIAIDGK-----TLRHSY 117 T + + W R + V+A+DGK TL H Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 118 DKS--------RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGK- 167 ++ + S I + ++NE +L L + G Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 168 --IITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAM 225 ++T DA + + G DY+FA+K + + K E E+ D + Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARRED--VL 257 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFE---WKGLKKLCVAVSFRSIIAEQKKEPEMTVR 282 + EI++ V E W + S + E R Sbjct: 258 DNATTATREIQILAVDPSHGYGAGKGPEESVWSHARTFLRVTS---TVRRSGVVIERDSR 314 Query: 283 YYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCK--IRRGNAAELFSGIR 337 ++SS +++ +R HW VEN H LD ED+ N +R Sbjct: 315 LFVSSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDERPWIAADANGMLAVLLLR 374 Query: 338 HIAINIL 344 IA +L Sbjct: 375 RIAYTLL 381 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 124 bits (310), Expect = 6e-27, Method: Composition-based stats. Identities = 49/167 (29%), Positives = 80/167 (47%), Gaps = 9/167 (5%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+A K + HKL +++L I +S S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H +++ IDGK R Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML 162 + K+ R I +SA S + + ++KSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 123 bits (309), Expect = 9e-27, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 64/118 (54%), Gaps = 4/118 (3%) Query: 261 CVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R+I+A E VRYY++S D T EK A+AIR HW + N LHW+LDV E Sbjct: 1 VRIKSERTIVAI--GEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFRE 58 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 D K + NAA FS +A+ IL N+K K + K KA D NYL+ +L + Sbjct: 59 DYSK-KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLLQDNNF 115 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 123 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 55/161 (34%), Positives = 76/161 (47%), Gaps = 7/161 (4%) Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 H++SA++T H + +G + T++KSNEITAI LL L K ++T DAMGCQKDIA I Sbjct: 3 PRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARNI 62 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPEHDSYAMSEKSHGREEIRLHIVC 241 GGD++ AV+ NQ +L A E H ++ HGR + R + Sbjct: 63 VAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWGA 122 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVR 282 VP EW +K + AV VR Sbjct: 123 QVP-PDFAAKGEWPWIKAIGTAVRI---TTHPDGTQTDEVR 159 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 121 bits (302), Expect = 5e-26, Method: Composition-based stats. Identities = 44/187 (23%), Positives = 83/187 (44%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L + +S +PD R A + L G+L L + A +S +S +E F + L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 P H I ++ + P K ++ +V+ +DGK LR S + Sbjct: 60 RKAPGHTAITLLLHRLDPEKLQAALGQVFP---EADLGEVLVVDGKHLRGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q + + + E A ELL+ L +++GK++ DA ++A Sbjct: 115 VKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAA 172 Query: 183 KIQKQGG 189 +++K+GG Sbjct: 173 RVRKKGG 179 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 84/99 (84%), Positives = 90/99 (90%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 IAINILTN++VFKA RRKMRKA MD+NYLASVLAG+G Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 118 bits (295), Expect = 4e-25, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 74/202 (36%), Gaps = 50/202 (24%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPEHDSYAMSEKSH 230 MGCQK+IA+ I KQ DY+ A+KG+ L +A+ K + D + + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL 290 GR E R V ++ ++W GLK + S Sbjct: 61 GRIETRRCQQVLVNKSWLNNKYQWVGLKSIIKVTSDVHEKTTT----------------- 103 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 + +IR+G F+ +R IA+ + ++ Sbjct: 104 ------------------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQTK 133 Query: 351 KAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 134 RASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 117 bits (292), Expect = 7e-25, Method: Composition-based stats. Identities = 34/136 (25%), Positives = 65/136 (47%), Gaps = 3/136 (2%) Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 +H+ + + + + GLK + + + + R+ ISS DL + Sbjct: 12 IHLRTLIDKKWLAKAYRRSGLKSIIKV--HTQVHDKSTGKDTAETRWNISSLDLHVVQAL 69 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R+HW VE+ +HW LD+ D+ +I R +F+ +R IA+ + D + R Sbjct: 70 NAVRSHWQVES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIAMTLFKQDTTKLVSMAR 128 Query: 357 KMRKAAMDRNYLASVL 372 K + A +D +Y +++L Sbjct: 129 KKKMAGLDDDYRSNLL 144 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 46/187 (24%), Positives = 85/187 (45%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L E +S IPD R A ++ L G+L L + A +S +S +E F + L G Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 P H + ++ + P K E + ++ +V+ +DGK L+ S + Sbjct: 60 RKPPGHTILTLLLHRLDPEKLQEALLQVFP---GADLGEVLVVDGKHLKGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q K + + + A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGRED--QALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 115 bits (287), Expect = 3e-24, Method: Composition-based stats. Identities = 46/184 (25%), Positives = 76/184 (41%), Gaps = 15/184 (8%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREE 234 K E + G D L +KGN +L A + SY + R E Sbjct: 3 STFKKTVETVLATGNDLLVQLKGNHPKLLAAVRTLCQSRAHA---EQSYTVDLGRRNRIE 59 Query: 235 IRLHIVCDVPD------ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA 288 R + +P F +G +++ V + +++ P YY+++ Sbjct: 60 QRTVRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPRQESP----AYYLATC 115 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 +A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 116 TASAATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRN--PGVFALLRHFALNLLRHNG 173 Query: 349 VFKA 352 Sbjct: 174 QANI 177 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 49/206 (23%), Positives = 85/206 (41%), Gaps = 18/206 (8%) Query: 100 SNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL 159 + + IA+DGK L+ S + R H++SA + + + +++ K+NE T LL Sbjct: 128 AGPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKPLL 185 Query: 160 NMLDIKGKIITTDAMG-CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 LD+ ++T DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 186 APLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIPV- 243 Query: 219 EHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 +A SE HGR E C +PDEL + L A+ K Sbjct: 244 ---QHAASEVGHGRRESSSIKTCAIPDELGGIAYPHARL-----AIRVHRRCQPTGKRES 295 Query: 279 MTVRYYISSADLTAEKFATAIRNHWH 304 Y ++S D A R W Sbjct: 296 RESVYAVTSLDAH-----QATRPIWP 316 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 43/96 (44%), Positives = 62/96 (64%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 +A+N + +K A + RK + A M L ++ Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVLDLIVNA 96 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 112 bits (280), Expect = 2e-23, Method: Composition-based stats. Identities = 59/361 (16%), Positives = 96/361 (26%), Gaps = 70/361 (19%) Query: 26 LSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 L+ +L L V++G +++ + ++ L GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FHECFINW--MRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 E W +A D K R Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFD-----GKTLKGTRSFTE----------------- 145 Query: 144 KTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLN 203 AM E + + G+Q + Sbjct: 146 ---------------------------AGAMS-----QEAVLEAVWHDTGITAGHQRVVG 173 Query: 204 KAFEEKFPLKELNNPEHDSYAMS-EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 D + EK HGR E+R V + + G K++ Sbjct: 174 GDEIAALEALAGRLDLTDVLVTTAEKGHGRVEVRSLKALTVTTPKLVGFW---GTKQVIE 230 Query: 263 AVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA-------TAIRNHWHVENKLHWRLDV 315 P ++ + L AE+ R HW VE +H D Sbjct: 231 LRRRTRRKKTVTAAPTVSEEVFYLVTSLPAEQAHPRDLAARARARGHWTVEA-IHHVRDR 289 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 V++ED R NA ++ R AI+ L + + +R A + +A Sbjct: 290 VLDEDRHTARTANAPLAWAIARDTAISALRL--TGHRSIAKALRTTARQPERVLQTIALI 347 Query: 376 G 376 Sbjct: 348 S 348 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 38/153 (24%), Positives = 72/153 (47%), Gaps = 13/153 (8%) Query: 224 AMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY 283 SEK HGR E R + +WKGLK+ R++ K + + V Y Sbjct: 2 TTSEKGHGRIEKRTLETTPIVT----VGQKWKGLKQGLRITRERAV----KGKKTVEVVY 53 Query: 284 YISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 I+S A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ Sbjct: 54 GITSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVV 113 Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 +++L + + ++ + + +++ Sbjct: 114 VHLLASVEAKSRPEAIELLQ--LHPENARNLIG 144 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 108 bits (269), Expect = 4e-22, Method: Composition-based stats. Identities = 43/104 (41%), Positives = 62/104 (59%) Query: 272 EQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E +IR+G+A Sbjct: 12 KQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQSRIRKGHADI 71 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 72 NFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLGK 115 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 107 bits (268), Expect = 5e-22, Method: Composition-based stats. Identities = 46/211 (21%), Positives = 91/211 (43%), Gaps = 14/211 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK-- 59 + + +++ IPD R+ K +H+ +LL+ + AV SG + + + + FL Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 60 ---QYGDFENGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSNDKDV-----IAID 109 + E +P T+ R+ + + ++W R+ + K+ +A+D Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVD 124 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKI 168 GK LR + R A+ +SA L +G Q D ++ + + L + + Sbjct: 125 GKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVD-WV 183 Query: 169 ITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 +T DA C +++A + +Q G A KG + Sbjct: 184 LTGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 107 bits (267), Expect = 6e-22, Method: Composition-based stats. Identities = 43/173 (24%), Positives = 64/173 (36%), Gaps = 10/173 (5%) Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSH-GREEIRLHIVCDV 243 G L +K NQ L+ A E +P D + E R E R V + Sbjct: 2 IATGNHLLVQLKRNQPLLHDAMVEYT----RGHPFVDEHHTHEIGRRNRIEKRAVHVWHL 57 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQ--KKEPEMTVRYYISSADLTAEKFATAIRN 301 L + + L + YY+ L A +F+ AIRN Sbjct: 58 HPSLGSAPWY-DHFRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRN 116 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 HW VEN+ H+ D ED +IRR F+ +R A+N++ ++V Sbjct: 117 HWRVENRAHYVRDTRFQEDASRIRRN--PCTFALLRSFALNLMRFNRVENISQ 167 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 106 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 59/150 (39%), Gaps = 9/150 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG-----ETHLD 56 +++ L ++ + +PD +A H+L +L L A + G + ++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 F + + +P I + + P W N ++ +A+DGK ++ Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAWQAA--QLNSEEALAMDGKIMKGG 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTD 146 D + + H++S + Q K+ Sbjct: 125 VDHTGAQ--THIVSLIGHESKHCVAQKKSA 152 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 38/141 (26%), Positives = 65/141 (46%), Gaps = 6/141 (4%) Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + +P + T G+K + + S E RYY++S + Sbjct: 3 RRYFAYRLPKTI--NTGSLVGIKSIIATETISSKTNET--AISAEWRYYVTSHETEKSDL 58 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAG 353 +RNHW +EN+LHW LDV +N+D K R A FS I+ + ++++ K Sbjct: 59 HLYVRNHWSIENELHWHLDVHLNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKRS 118 Query: 354 LRRKMRKAAMDRNYLASVLAG 374 +R ++++ D YL S+L+ Sbjct: 119 VRSRLKQVGWDTEYLVSLLSA 139 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 44/152 (28%), Positives = 71/152 (46%), Gaps = 9/152 (5%) Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV 281 + S +S GREE R V + + EW+ ++ + + ++ + Sbjct: 3 EHTHSIQSRGREEHRCIQVYE---PVGIALQEWEAIRSVLCVQRW----GTRQGKAYHNT 55 Query: 282 RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAI 341 YYISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I I Sbjct: 56 AYYISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVI 115 Query: 342 NILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 NIL + L+ M K A + + S+L Sbjct: 116 NILRLNGYQ--SLKTAMTKLANRVDIIFSLLT 145 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 11/124 (8%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISS 287 K HGR E R L ++ W G++++ R + + V Y ISS Sbjct: 3 KGHGRVERRSITTTT---WLNEYLTRWPGVQQVFRLERQR----RADGKTTVEVVYGISS 55 Query: 288 AD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 + R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ +L Sbjct: 56 LSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYLL 114 Query: 345 TNDK 348 Sbjct: 115 RRLG 118 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 104 bits (258), Expect = 7e-21, Method: Composition-based stats. Identities = 59/386 (15%), Positives = 120/386 (31%), Gaps = 57/386 (14%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHK----LSGILLLTIFAVISGAESWEDIEDFGETHLDF 57 EL L+ + IPD R K HK L LL+ +F S E+ ++ L Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHEC-------------FINW-MRDCHSSNDK 103 L++ +P DT+ R++ I A + F + + CH Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 104 DVIAIDGKTLR---------HSYDKSRRRGAIHVISA-FSTMHSLV-----------IGQ 142 + G TL + + ++V+ A + LV +G Sbjct: 193 GSQKLAGDTLWAEELLQRHVGKDETRHTQYFVYVLEASLVFHNGLVIPLLSEFLEHALGD 252 Query: 143 IKTDKKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + K+ E+ L + L ++ D + + ++ + ++ +K Sbjct: 253 SEAQKQDCELRGFARLSDRLKRLFPRLPILLLLDGLYANGPVMQRCLRAHWQFMIVLKDK 312 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 + +E + ++ GR + V D+ L Sbjct: 313 --------DLPTVWEEFRALQPRQLPTLQQDWGRRQQHFSWVNDIEYAYGSNGRCRLKLH 364 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT----AEKFATAIRNHWHVENKLHWRLD 314 + ++ + E + E ++SS L+ E+ R+ W +E Sbjct: 365 VVVCEERWQGVDQEARIVTETARHAWLSSQPLSRENVHERCNLGARHRWGIEAGFLVEKH 424 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIA 340 + + NA + + +A Sbjct: 425 QGYHYEHAFALDWNAMRGYHLLMRLA 450 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 68/212 (32%), Gaps = 34/212 (16%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFA-VISGAESWEDIEDFGETHLDFLKQYG 62 + + E + + D R + + + + + +G + + + Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 63 DFEN------GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD------------ 104 +P TI + + + ++ D ++ Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAA 141 Query: 105 -------------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNE 151 +A+DGKT RH+ K +H++ S ++ Q++ + K+NE Sbjct: 142 PVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAKTNE 199 Query: 152 ITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 LL LD+ ++T DA+ + + Sbjct: 200 TAVFRRLLRPLDLTNVLVTADALHTVRANLDT 231 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 41/164 (25%), Positives = 67/164 (40%), Gaps = 10/164 (6%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ L + N+ D+ K R+E R V V D L ++ Sbjct: 1 MKANQSNLFETA----CAIAANDAPADTAFSRNKGRSRQEDRTVEVFPVGDALAGTEWQ- 55 Query: 255 KGLKKLCVAVSFRSII--AEQKKEPEMTVRYYISSA-DLTAEKFATAIRNHWHVENKLHW 311 +K + + A + V +Y+SSA + A +A AIR HW +EN+ H+ Sbjct: 56 PFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNHY 115 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 DV +ED +IR + + R A+NI+ + + Sbjct: 116 VRDVSCDEDKSRIRDN--PGIMARARSFALNIMRKNGIANVAQA 157 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 60/148 (40%), Gaps = 9/148 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG-----ETHLD 56 +++ L ++ + D R+ H++S +L + A + G + ++ I + + Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 F + + + IP I V+ P + + D + +A DGKT++++ Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNED--QGLEDTCLAFDGKTMKNA 331 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIK 144 D++ R+ H+ S Q K Sbjct: 332 IDENARQ--THIASVVGHESKTTHTQKK 357 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. Identities = 74/88 (84%), Positives = 77/88 (87%) Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 EQKKEPEMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAA Sbjct: 18 TEQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAA 77 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKM 358 ELFSGIR IAINILT DK+ KAG R KM Sbjct: 78 ELFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 41/95 (43%), Positives = 61/95 (64%), Gaps = 1/95 (1%) Query: 3 LKKLM-EHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 ++ L E S IPD R +H I+ L +F+V++GA+S+ +IEDF E H+D+LK Y Sbjct: 1 MEGLFVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTY 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRD 96 + NGIP HDT +RV S I+PA F + F+ W++ Sbjct: 61 FNLPNGIPSHDTFSRVFSAINPASFQDSFLIWLKA 95 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 54/365 (14%), Positives = 96/365 (26%), Gaps = 58/365 (15%) Query: 8 EHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG 67 E IPD R L +L+ + A S F LD ++ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLA---FQRRTLDHNLRHVFGLTG 81 Query: 68 IPVHDTIARVVSCISPAKFHECFIN--------WMRDCHSSNDKDVIAIDGKT------- 112 P + V+ + P F + + D + + V+A+DG Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 113 ------LRHSYDKSRRRGAIHVISAFSTMH-SLVIG------QIKTDKKSN--EITAIPE 157 R + + + +A S V+ Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 L ++ DA +QK +L VK A Sbjct: 202 WLGRFREEHPDLAVLVVEDARSSNAPHVRDLQKARCHFLLGVK-------AADHAHLFAH 254 Query: 214 ELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + ++ + E + R +R + L + + + + Sbjct: 255 VCARQDQHAFEVVEDADPRTGLRRSYLWIADLPLNESNDD-------VRVNFVHLVELDP 307 Query: 274 KKEPEMTVRYY-ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR-GNAAE 331 P ++ + A A R W +EN+ L N+ G+ Sbjct: 308 DGTPREWTWVADMAVTGANVRQLARAGRARWRIENETFNTLK---NQGYHFAHNFGHGDN 364 Query: 332 LFSGI 336 S + Sbjct: 365 NLSVV 369 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 54/129 (41%), Gaps = 6/129 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG-----ETHLD 56 +++ L ++ + +PD R+A H+L + LT A + G + ++ + ++ Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 F + + +P I + + P W +S+D + +A+DGK ++ Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGG 177 Query: 117 YDKSRRRGA 125 D + + Sbjct: 178 VDHTGAQTQ 186 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 28/122 (22%), Positives = 52/122 (42%), Gaps = 9/122 (7%) Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA--- 288 R E + V L+ ++ L+++ R ++K + ++S Sbjct: 1 RIETQTIRVSS----LLKGYSDFPHLEQVFRID--RVTRFKKKGKTRKETALGVTSLSSG 54 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 + + +R HW +EN+LHW D V ED C R GN A + + +R++ I++L Sbjct: 55 QASPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGAHVMATLRNMTISLLRVAG 114 Query: 349 VF 350 Sbjct: 115 SK 116 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 53/128 (41%), Positives = 70/128 (54%), Gaps = 1/128 (0%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREE 234 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + + Y E+S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 71 SRAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEE 129 Query: 295 FATAIRNH 302 TA R H Sbjct: 130 LLTASRLH 137 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 28/129 (21%), Positives = 54/129 (41%), Gaps = 6/129 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 + L ++ +PD R + L IL + + AV++GA ++ I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDC------HSSNDKDVIAIDGKTLRHSY 117 F + +P T+ R++ I + W+R VIA+DGK +R + Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKSRRRGAI 126 ++ A+ Sbjct: 149 LRAAGPSAL 157 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 3/107 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTS--DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 98.3 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 33/132 (25%), Positives = 59/132 (44%), Gaps = 7/132 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 ++ HGR R + +P+EL + G+K + + + YYI+ Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHNHALS--GIKSCIAVERI--VQEGKGEPKTSHFSYYIT 89 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 + + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 90 NHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLVKA 148 Query: 347 DK--VFKAGLRR 356 K ++ Sbjct: 149 KDWAGKKKSVKS 160 >UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillaceae RepID=C5D2E6_GEOSW Length = 437 Score = 96.8 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 67/416 (16%), Positives = 130/416 (31%), Gaps = 83/416 (19%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDI-EDFGETH-LDFLKQ 60 K L++ + + D R + + IL + + G +S + E F + ++ ++ Sbjct: 28 FKDLVDQLKKVKDKRHQSYITYGPETILYTILLKSVFGIKSMRSMTELFNKDECIENIRV 87 Query: 61 YGDFE--NGIPVHDTIARVVSCISPAKFH--------ECFINWMRDCHSSNDKDV-IAID 109 + N +P +DTI ++ + P + + F + +K I D Sbjct: 88 VLGLKELNELPHYDTINDFLAKLEPKELETIRIYLIKKLFEKRCLESFRILNKYWPIVFD 147 Query: 110 GK------------TLRHSY-DKSRRRGAI----HVISA--FSTMHSLVIGQIKTDKKS- 149 G LR Y DK + HV+ A L I + +S Sbjct: 148 GTGIHTFKEKHCEHCLRREYKDKETGETKVVYMHHVLEAKLVVGDMVLSIATEFIENESE 207 Query: 150 ------NEITAIPELLNMLD-----IKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 E+ A L++ L + +I D++ + + E K Y+F K + Sbjct: 208 NVPKQDCELKAFMRLVDKLKKTFKRLPICLI-ADSLYACEPVFEICDKHNWKYIFRFKED 266 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 + + E N + + V D+ + Sbjct: 267 RIKTVSQEFRAIQSLETNGKSSEYF---------------WVNDIAYND----------R 301 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMN 318 + + + E+K+E + I+ + AE A R W +EN+ Sbjct: 302 LVNLVEKVKVTENEKKQEFLFITNFRIT--ERNAEILVQAGRRRWKIENEGFNNQKNGWY 359 Query: 319 EDDC-KIRRGNAAELFSGIRHIA----------INILTNDKVFKAGLRRKMRKAAM 363 E + NA + + IA +L K + K+ +A Sbjct: 360 EIEHVNCHNYNALKNHYLLVQIADILVQLYKYGSKLLKQLKKSAKEISSKLLEAIR 415 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 96.8 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 40/186 (21%), Positives = 61/186 (32%), Gaps = 18/186 (9%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K NQ + P + + S HGR E R C + DEL F Sbjct: 2 IKRNQPTTYRQL-AALPWPDSAV----QHTASSAGHGRRESRSIKTCGIADELGGIAFPH 56 Query: 255 KGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT---AEKFATAIRNHWHVENKLHW 311 L A+ + Y ++S D + A A+R HW VE H Sbjct: 57 GRL-----ALRVHRRRKQTGGCESRETVYAVTSLDAHETTPAELAAAVRGHWTVEALRH- 110 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD-RNYLAS 370 DV E+ + G A + R++A+ +L K +A D Sbjct: 111 VRDVTYAEEASTLHTGTAPRAMATFRNLAVGLLKTLGAINIA---KTTRAIRDQPERALP 167 Query: 371 VLAGSG 376 +L + Sbjct: 168 LLGITN 173 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 96.0 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 50/230 (21%), Positives = 84/230 (36%), Gaps = 37/230 (16%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + L E I+ + D R V+ +S I + +F + S+ +E + K+ Sbjct: 16 VYHLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKAL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSNDKDVIAIDGKTLR 114 + +P DTI RV+S +E N + + + V+AIDG L Sbjct: 72 PKKTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELF 131 Query: 115 HSYDKSRRRGAIH--------------VISAFSTMHSLVIGQIKTDKKSN-------EIT 153 S K V S + L++GQ + K + EIT Sbjct: 132 ESTKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEIT 191 Query: 154 AIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 A L+ L + II DA+ C+ +++ G D + VK + Sbjct: 192 AGKRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKDER 241 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 94.9 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 4/120 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++H I D R +H L I+LL I AV+SG+E WE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSNDKDVIAIDG--KTLRHSYDKSR 121 GIP HDTIARV+ + + + + D + + G + H + Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREG 126 Score = 57.5 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 3/79 (3%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPEHDSYAMSEKSHGREE 234 K+IA+ I KQ DY+ A+KG+ L +A+ K + D + + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 IRLHIVCDVPDELIDFTFE 253 R V ++ + Sbjct: 147 TRRCQQVLVNKSWLNNKYR 165 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 94.5 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 33/130 (25%), Positives = 45/130 (34%), Gaps = 2/130 (1%) Query: 224 AMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY 283 HGR+E R V DV L +++ Sbjct: 6 TTDRGRHGRQEHRWVEVFDVSGRLGPTWDGLIAAVARVTRLTWHKDTKSGLWHKTQETAL 65 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 Y +L A TAIR HW VE + H+ DV ED +IR F+ +R A+NI Sbjct: 66 YACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIRTK--PGHFARLRSFALNI 123 Query: 344 LTNDKVFKAG 353 L + Sbjct: 124 LRANGTNNIS 133 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 91.8 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 40/106 (37%), Gaps = 5/106 (4%) Query: 250 FTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVE 306 + +K++ R + + + Y I+S + + R HW +E Sbjct: 19 ISACRSWVKQVFCI--HRIFTKVKTGKKTEEIVYGITSLTQQKASPKTILKFSRGHWSIE 76 Query: 307 NKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA 352 N LH+ D ED +IR NA + ++++ + + V Sbjct: 77 NGLHYVRDTAFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPNI 122 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 91.0 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 31/117 (26%), Positives = 49/117 (41%), Gaps = 6/117 (5%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGD 63 L+ S I D R+ + L+ +LL T+ A+++GA S+ ++ F THLD L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 FE-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSND-----KDVIAIDGKTLR 114 P + T+ ++ I + F + IAIDGKT Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWC 119 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 90.6 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 33/84 (39%), Positives = 50/84 (59%), Gaps = 1/84 (1%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE- 182 A+H++SAF + +V+ Q+ +KSNEI A ELL LDI G +T DAM Q++ A Sbjct: 7 KAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARF 66 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAF 206 ++ + D++ VK NQ L +A Sbjct: 67 AVEDKRADFVMTVKDNQPELREAL 90 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 89.9 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 43/75 (57%) Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K Sbjct: 15 VRLPRPTRHQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKR 74 Query: 359 RKAAMDRNYLASVLA 373 A M+ ++ +L Sbjct: 75 LLACMEDDFREELLG 89 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 39/108 (36%), Positives = 58/108 (53%), Gaps = 2/108 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 ++H + D R +H L I+LL I AV+SG+E WEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSNDKDVIAIDGK 111 GIP HDTIARV+ + + + + D + + G+ Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGE 114 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 89.1 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 49/109 (44%), Gaps = 5/109 (4%) Query: 268 SIIAEQKKEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + + V + I+S A +R HW +EN+LH+ DV + ED C++ Sbjct: 8 TRERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRV 67 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 R G+A ++ + +R+ +++ K + + MD ++ Sbjct: 68 RMGHAPQVLAALRNAVVHLWREVKAVSCPEAIERLQ--MDPAMAKGLIG 114 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 87.6 bits (215), Expect = 7e-16, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 46/106 (43%), Gaps = 1/106 (0%) Query: 261 CVAVSFR-SIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R E + + +Y+SS + +A + IR HW VEN++H+ DV E Sbjct: 12 GRTRSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGE 71 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 D +IR +++S R A+N+ + + ++ + Sbjct: 72 DRSRIRTLPLVQVWSVARSFALNLYRSLLMANRAQAQRRCMFGLST 117 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 87.6 bits (215), Expect = 7e-16, Method: Composition-based stats. Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 9/118 (7%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRH 115 + P T+ RV+ I NW+ S +A+DGKTL Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLGLSPA--ALAVDGKTLAG 130 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 2/131 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L ++ IPD+R+A + L+ +LL +I AV+SGA S+ I+ F + H + L Sbjct: 3 LKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQLH 62 Query: 66 N-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDV-IAIDGKTLRHSYDKSRRR 123 PVH +I + + F + IA+DGKTLR + + R Sbjct: 63 WKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSRT 122 Query: 124 GAIHVISAFST 134 SA Sbjct: 123 ARPLRYSAHWP 133 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 86.4 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 42/85 (49%) Query: 7 MEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFEN 66 ++H + D R L I+ + I AV++GA+ + IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSCISPAKFHECFI 91 GIP HDT RV+ + P + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 36/128 (28%), Positives = 59/128 (46%), Gaps = 5/128 (3%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEH 220 M +KG ++T DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 + E SHGR R V + + W ++ L V R + Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLT-PETKHSGSWPDIQALLVTEKIRQAH--YSETVT 117 Query: 279 MTVRYYIS 286 RYY+S Sbjct: 118 SDFRYYLS 125 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 84.5 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 59/153 (38%), Gaps = 12/153 (7%) Query: 197 GNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKG 256 G+Q L + ++ K + E HGR+ + + W G Sbjct: 8 GDQKTLYRQIADQLLGKRHIPLMATDH---EIGHGRD---ILWTLRAKEAPQHIKANWHG 61 Query: 257 LKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 + ++ + ++P +I+S T + +R W VE+ HW D Sbjct: 62 TSWIAEVIA----TGTRDRKPFKATHRFITSLRTTPDALLRLVRERWSVESW-HWIRDTQ 116 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++EDD + R GN A + + +R A+N+L Sbjct: 117 LHEDDHRYR-GNGAGVMAALRTAAMNLLRLTGF 148 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 84.5 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 45/106 (42%), Gaps = 5/106 (4%) Query: 274 KKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + ++S + A + HW +EN+LHW DV +ED + R GNA Sbjct: 69 GGPATAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNAP 128 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 ++ + +R++AI IL + + +R A + +G Sbjct: 129 QVMTSLRNLAITILRLTGAKN--IAKALRHHARHPERPLETIKKAG 172 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 82.9 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 38/86 (44%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 + ++ + + + D R +H+ I+++ + V+ G + I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSCISPAKF 86 + + NG+P D I + + P F Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 82.9 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 32/79 (40%), Positives = 50/79 (63%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 +++++E + + D R A + +H L IL+L + AV+SGA+ W+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARVVSCI 81 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRVSLRW 85 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 82.6 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK19_ACIF5 Length = 104 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 36/93 (38%), Gaps = 3/93 (3%) Query: 273 QKKEPEMTVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + + ++S E R HW +EN+ H D +ED +IR N Sbjct: 2 KDGTLREDCAFGLTSLTKDRTTPENLLGIARGHWEIENRNHHVRDTTYHEDLSQIRTENG 61 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + + +R +A++IL V + A+ Sbjct: 62 PHMMATLRGLAMSILRLIGVKNIAQAGRDFAAS 94 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 81.8 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 55/143 (38%), Gaps = 9/143 (6%) Query: 213 KELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 +E P + G E + + G ++ R ++ + Sbjct: 2 EERRLPGETEAVWNLVRDGEVWTYRVWASPYLPEEM---RAFPGCGQVVRME--REVVRK 56 Query: 273 QKKEPEMTVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 E TV Y ++S A + + + W VEN+ W D +++ED C++R G Sbjct: 57 GTGEVRRTVSYALTSLGPEVADARRLGELLLSRWEVENRSFWVRDFLLHEDACQVR-GVG 115 Query: 330 AELFSGIRHIAINILTNDKVFKA 352 A++ + +R +++L V + Sbjct: 116 AQVLAALRAFLVSLLHRQGVREK 138 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 81.4 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 27/77 (35%), Positives = 49/77 (63%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + +++H S + D RQ+W+V + L I LL + A +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV 77 + +E G+P HDT+ + Sbjct: 77 FLPYERGLPAHDTLKGL 93 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 81.0 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 49/138 (35%), Gaps = 11/138 (7%) Query: 45 EDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS---SN 101 + + G + P I R++ I P W+ Sbjct: 224 SALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAPAPG 283 Query: 102 DKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE---- 157 + IA+DGKTLR S + HV++A +V+ D K+NEIT Sbjct: 284 SRRAIAVDGKTLRGSRTRDSAAR--HVLAAADQHTGIVLASTDVDTKTNEITRFTASGSH 341 Query: 158 --LLNMLDIKGKIITTDA 173 LL+ I+ +++ A Sbjct: 342 ADLLSSRCIRSGVVSPAA 359 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 81.0 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 37/128 (28%), Positives = 53/128 (41%), Gaps = 13/128 (10%) Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWM----RDCHSSNDKDVIAIDGKTLRHSYDKSR 121 PV+ ++ ++ I P F R C + IAIDGKTLR S+D Sbjct: 9 RRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFS 68 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL---------NMLDIKGKIITTD 172 A +V+SAF+ H +++ D+KSNEI A L+ I + D Sbjct: 69 DTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVMLD 128 Query: 173 AMGCQKDI 180 AM I Sbjct: 129 AMTFAPAI 136 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 81.0 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 30/129 (23%), Positives = 47/129 (36%), Gaps = 13/129 (10%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + L E ++ + D R+ H +LL+ AV++GA S+ I ++ + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRH 115 P TI RV+ P + H D +AIDGK+ R Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SYDKSRRRG 124 S S R Sbjct: 115 SRLGSTRPP 123 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 29/96 (30%), Positives = 38/96 (39%), Gaps = 4/96 (4%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPEHDSYAMSEK 228 D +GCQK IA+ I +Q DYL AVK NQ L++A F D K Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKINK 67 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 GR E R V + W L+ + + Sbjct: 68 GPGRLEQRRCWV-GYEIPDTINSQNWAKLETIVMVE 102 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 78.7 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 55/122 (45%), Gaps = 7/122 (5%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+G + R ++ + E Y ++S A++ R HW VEN+LH + Sbjct: 4 WRGSRMALRM--RRRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKR 61 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 D V+ ED + R+G A +R + +N+L + + R +RK + D L ++ Sbjct: 62 DTVLGEDASRSRKGAAG--LMYLRDVILNLLHL---KRWPVLRSVRKFSADPKVLLRLIR 116 Query: 374 GS 375 G Sbjct: 117 GL 118 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 54/198 (27%), Positives = 80/198 (40%), Gaps = 37/198 (18%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFG--ETHLDFLK 59 +LKKL+E S IPD R+A V+H+L+ +LL + + + S + L L+ Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSRPAFLQALQ 138 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC--------HSSNDKDVIAIDG- 110 +P DT+ARV+ I P K E FI +R H N IAIDG Sbjct: 139 GLFPELETLPHGDTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIAIDGT 198 Query: 111 -KTLR-------------HSYDKSRRRGAIHVISA-FSTMHSLV-------IGQIKTDKK 148 K +R + D + + I+V+ A F + L + + D K Sbjct: 199 QKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTIPIMSEFLSYSEDDSK 258 Query: 149 S----NEITAIPELLNML 162 EI A L + L Sbjct: 259 EVKQDCEIKAFKRLSHRL 276 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 76.4 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 48/112 (42%), Gaps = 6/112 (5%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFE 65 L ++S IPD+R+A + L+ +LL +I A++SGA S+ I+ F +TH + L Sbjct: 3 LKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQLH 62 Query: 66 N-GIPVHDTIARVVSCISPAKFHECFINWM-----RDCHSSNDKDVIAIDGK 111 P H +I + + F D + VI + K Sbjct: 63 RKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 74.1 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 50/108 (46%), Gaps = 4/108 (3%) Query: 29 ILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVVSCISPAKFH 87 +L L + AV++G + E I FG L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTM 135 W+ D H + D IA+DGK L S D H+++A++ Sbjct: 63 RIIGAWLGDRH-PDGWDHIALDGKRLCGSRD--GAVPGTHLLAAYAPQ 107 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 74.1 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 28/109 (25%), Positives = 42/109 (38%), Gaps = 11/109 (10%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 + HGR E R L+ W GLK R++ K + V + I+ Sbjct: 2 DPGHGRIETRTVRATP----LLTCHDRWTGLKHGFRITRTRTV----KGVTTVEVVHGIT 53 Query: 287 SAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 S A +R+HW +EN+ H DV + ED+ + R A Sbjct: 54 SRPVERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAGRA 102 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 74.1 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 46/109 (42%), Gaps = 3/109 (2%) Query: 26 LSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 ++ +L + AV++GA ++ D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FHECFINWMRDC---HSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISA 131 + +W+ + VIA+DGK +R + R A+ + Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWMPKT 109 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 28/82 (34%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPV 70 +PD R V H+ S IL + A +GA S+ I ++ +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSCISPAKFHECFIN 92 T R ++ + + Sbjct: 109 ESTSRRFLAGVDATALDQVLGM 130 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 39/81 (48%), Positives = 53/81 (65%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M + + E++S D R A+ +H I+ L + AVISGA SW +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCI 81 Y FE GIPV DTIARV+ I Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 71.8 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 35/100 (35%), Gaps = 1/100 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ-YGDF 64 + S + D R+A + L +L + +++SG+ S ++ F E L L + +G Sbjct: 8 FGDVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTS 67 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKD 104 P I + + + F S Sbjct: 68 WRKAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 26/71 (36%), Positives = 40/71 (56%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQ 60 M L+ H + I D RQ+ KV + L +L +T+ VI+GAE W +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPVH 71 G G+PV Sbjct: 72 KGILTEGVPVR 82 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 50/145 (34%), Gaps = 10/145 (6%) Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPEHDSY------AMSEKSHGREEIRLHIVCDVPD 245 +G+Q L +A + L+ H + + G + R VP Sbjct: 51 RLVTEGDQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTAGSRQTRALKAVTVPA 110 Query: 246 ELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT---AEKFATAIRN 301 L + L + ++ + E K+ Y I + + AT IR Sbjct: 111 GLGFPHAAQAIQLTRTSRPINKNTKKTEGKRRQRRETVYAICTLPAHDALPAELATWIRG 170 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRR 326 HW +E +L W DV + ED + R Sbjct: 171 HWSIEVRLRWVRDVTLGEDLHQART 195 Score = 43.3 bits (100), Expect = 0.016, Method: Composition-based stats. Identities = 9/37 (24%), Positives = 21/37 (56%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVI 38 + L+E ++ +PD R+ V H + +L + + A++ Sbjct: 57 DQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAML 93 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 25/116 (21%), Positives = 43/116 (37%), Gaps = 2/116 (1%) Query: 248 IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVEN 307 + W GL + + R VR+ + S+ +E A AIR H + Sbjct: 1 MATLRTWPGLTTVLATETLR--GGNGTDSVPAQVRHSLGSSTAPSEVLAQAIRRHGALAT 58 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 W L+V E+ ++R AA + +R +A++ D A + R Sbjct: 59 GEPWVLEVSFGEERSRVRERCAARHLALLRRVALDRRRADASLTASRPAQDRGLGR 114 >UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RIX6_9ACTO Length = 90 Score = 68.3 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 16/63 (25%), Positives = 24/63 (38%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 R WH+EN+LHW DV E + R G + + +R+ AI + Sbjct: 11 AQPADLQQWARLEWHIENRLHWVRDVTFGEGTHRARTGTGPAVAAVLRNTAIGFHRGNGE 70 Query: 350 FKA 352 Sbjct: 71 TNI 73 >UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanobacteria RepID=B2IT45_NOSP7 Length = 435 Score = 68.3 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 51/391 (13%), Positives = 114/391 (29%), Gaps = 68/391 (17%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIED-FGETHLDFLKQY 61 ++ + +PD R +++S L + + S+ + + Q Sbjct: 11 VQYFQSILKDLPDKRTGKNKRYQMSDAALSAFSIFFTQSPSFLAHQRSMAHSKGHNNAQS 70 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECF---------INWMRDCHSSNDKDVIAIDGKT 112 + IP + I ++ I P F + S + +IA+DG Sbjct: 71 LFGVHQIPSDNHIRDLLDEIEPTVVFPVFTKIFKALENGKHLSKFRSFKNNLLIALDGTE 130 Query: 113 LRHSYD-----------KSRRRGAIHVISA---FSTMHS--------LVIGQIKTDKKSN 150 S + K+ H + +S V+ Q K+ Sbjct: 131 YFCSNEIHCEHCSSRTFKNGTTQYFHTVVTPVIVCPSNSQVIPLIPEFVVPQDGYQKQDC 190 Query: 151 EITAIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQKQGGDYLF-AVKGNQGRLNKA 205 E A + + G I D + C + + E + ++ +++ + L + Sbjct: 191 ENAAAKRWIQKYAKQYASLGITILGDDLYCHQPLCELLLQEKLNFILVCRSKSHKTLYEW 250 Query: 206 FE----EKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 E + F +K ++ Y R + + W L Sbjct: 251 LEGMPLDTFSVKHWKGKVYEIYT----------YRYVNQIPLRNSEDALLVNWCEL---- 296 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLH-------WRLD 314 + + + I+ D+ E + R+ W +EN+ + + L+ Sbjct: 297 ---AITRSDGTIIYKNTFATNHRIT--DINVEAIVSDGRSRWKIENENNNTLKTKGYNLE 351 Query: 315 VVMNEDDCKIRRGNAAEL-FSGIRHIAINIL 344 + A + + H ++I+ Sbjct: 352 HNFGHGKTHLSSLLATFNILAFLFHTLLDII 382 >UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6C2C4_9GAMM Length = 77 Score = 66.4 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 31/57 (54%) Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 ED+C++ A F+ +R IAI++L D+ K LR + RK A D +Y+ + Sbjct: 21 SFREDECRVHDPMAGGNFALLRKIAISLLVRDRSNKTSLRGRCRKVAWDNDYMRQLF 77 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 65.6 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 30/74 (40%), Gaps = 1/74 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + +PD R H L+ IL + I A++ GAES D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFENGIPVHDTIAR 76 I + Sbjct: 60 PLPYASRCWRDIRK 73 >UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQC4_9GAMM Length = 96 Score = 65.2 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 37/89 (41%), Gaps = 1/89 (1%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +T ++ R HW + + LH+ D NED +IR G+ + + AI +L + Sbjct: 1 MTPQQVLAINRGHWSIAS-LHYISDWNYNEDRGQIRTGHGPANVTRLCRFAIGVLKHFPK 59 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAGSGLS 378 + MR+ A + L + S Sbjct: 60 PGQYIPEMMRQLARRPRQVLDYLRLTAHS 88 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 63.3 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 29/64 (45%) Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 +TA T +R +W +EN++H+ D ED GN + R++AI ++ + Sbjct: 88 SVTAAYLHTHVRGNWGIENEVHYTRDAAWREDANPTYTGNTNHALASFRNLAIGVIGLNG 147 Query: 349 VFKA 352 Sbjct: 148 TRNI 151 Score = 44.8 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 14/86 (16%), Positives = 27/86 (31%), Gaps = 1/86 (1%) Query: 53 THLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKT 112 L + P T+ + I F W+ + + +AIDGK Sbjct: 20 ARLGAPLDHFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCAQI-ARGRVALAIDGKV 78 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSL 138 LR ++ A ++ + + Sbjct: 79 LRGAWSGDESVTAAYLHTHVRGNWGI 104 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 40/109 (36%), Gaps = 6/109 (5%) Query: 66 NGIPVHDTIARVVSCISPAKFHEC-FINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRG 124 G P + + + P + + V+ +DG T+R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIITTD 172 +H+ + +++ Q+ D+K+NE + L + D+ G +IT Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLITAF 135 >UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C435B Length = 133 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 28/141 (19%), Positives = 40/141 (28%), Gaps = 26/141 (18%) Query: 192 LFAVKGNQGRLNKAFEEK--FPLKELNNPEHDSYAMSEKSHG-------------REEIR 236 + K NQ L E F S + R E R Sbjct: 1 MLTAKDNQPGLVADIEAGLGFEDAARGLAAATSPLTGPDARATGAPGHVGGPGHGRIETR 60 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAE 293 L+ W GLK R++ K + V + I+S A Sbjct: 61 TVRATP----LLTCHDRWTGLKHGSRITRARTV----KGVTTVEVLHGITSLTVERADAR 112 Query: 294 KFATAIRNHWHVENKLHWRLD 314 +R+HW +EN+ H D Sbjct: 113 ALLGLVRSHWRIENQRHDVRD 133 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 38/59 (64%), Positives = 39/59 (66%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK 59 MELKKLMEHISIIPDYRQAWKVEHKL IL + FGETHLDFLK Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRGFGETHLDFLK 59 >UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltaproteobacteria RepID=A5GAF0_GEOUR Length = 439 Score = 61.4 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 49/378 (12%), Positives = 100/378 (26%), Gaps = 53/378 (14%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 +L L + IPD R K+ L+ +L+ S ++ Q Sbjct: 13 QLGVLRCCLEHIPDQRDGAKI--SLADVLMSGYAMFDLKDPSLLAFDE-RRCRDAANLQR 69 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIA---------IDGKT 112 + + V+ + PA F + +A +DG Sbjct: 70 IYGIGKVACDTQLRTVIDPVDPAGLRPGFKTIVATLQRGKALQQLAYYEGYYLLSLDGTG 129 Query: 113 LRHSYDKSRRRG--------------AIHVISAFSTMHSLV--IG------QIKTDKKSN 150 S + S + + +V + Q K Sbjct: 130 SFGSENLSSASCLVKNKSNGKKLYYQQVLGAALVHPDSRVVIPLAPEMIIPQDGATKNDC 189 Query: 151 EITAIPELL----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVK-GNQGRLNKA 205 E A L I+ D + +Q+ ++ K G+ L + Sbjct: 190 ERNASKRFLPNFREDFPRLPVIVVEDGLSSNGPHIRDLQQHNMRFILGAKPGDHPLLFEN 249 Query: 206 FEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS 265 + ++A + + + + D P LK + Sbjct: 250 LTDAIKK-----KTATTFAQIDPKNPQIMHSYCFLNDTPLN-----QANPDLKVNFLV-- 297 Query: 266 FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLD--VVMNEDDCK 323 + A+ K + + + A R+ W +EN+ L E + Sbjct: 298 YEEHNAKTGKTQRFSWVTDLPITEENAYILMRGGRSRWKIENETFNTLKNQGYNLEHNYG 357 Query: 324 IRRGNAAELFSGIRHIAI 341 + + + +E F + +A Sbjct: 358 LGKEHLSENFVMLMMLAF 375 >UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3536 Length = 130 Score = 61.4 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 7/71 (9%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE---KFATAIRNHWHVENK 308 +WKGLK+ R++ + V + I+S + +R+HW +EN+ Sbjct: 9 QDWKGLKQGFQITRERTV----NGVTTVEVVHGITSLSADRANAGALLSLLRDHWRIENQ 64 Query: 309 LHWRLDVVMNE 319 LH+ DV + E Sbjct: 65 LHYVPDVTLGE 75 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 61.0 bits (146), Expect = 7e-08, Method: Composition-based stats. Identities = 23/96 (23%), Positives = 42/96 (43%), Gaps = 4/96 (4%) Query: 84 AKFHECFINWM-RDCHSSNDKDVIAIDGKTLRHSYD--KSRRRGAIHVISAFSTMHSLVI 140 F + WM + ++ D + DGKTLR S D I +S +S + I Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQ-IKTDKKSNEITAIPELLNMLDIKGKIITTDAMG 175 Q +S+E ++ LL+ +++ ++ D +G Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQADEVG 97 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 61.0 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 27/57 (47%), Positives = 42/57 (73%) Query: 97 CHSSNDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 S + ++ DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTD+KSNE Sbjct: 17 YQKSLKEKSLSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MIZ4_ALKOO Length = 218 Score = 59.1 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 61/189 (32%), Gaps = 33/189 (17%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 + + E I+ + D R V+ +S I + +F + S+ + + E K+ Sbjct: 16 VYDIGEKINTLKDKRVKSPVK--VSTISFVVLFGFMLQIRSFNRLNHWIE--KGKFKKVV 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSNDKDVIAIDGKTLR 114 + +P D++ R ++ N + + + V AIDG L Sbjct: 72 PKKTKMPCIDSVRRFLADFDLHGLKNMHSHIVKTSIKNKVFRSGTVDGLKVAAIDGVELF 131 Query: 115 HSYDKSRRRGAIH--------------VISAFSTMHSLVIGQIKTDKK-------SNEIT 153 S K + S + L++GQ + K E+T Sbjct: 132 ESTKKCCNNCLTRVHKDEITHYFHRSVICSTVGSDPHLILGQEMLEPKRDGSNKDEGEVT 191 Query: 154 AIPELLNML 162 L+ L Sbjct: 192 GGKRLIKKL 200 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 25/58 (43%), Positives = 36/58 (62%), Gaps = 4/58 (6%) Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + +FR +I + + RYYISS +LTAE+ A + HW +E+ +HW LDV MNE Sbjct: 1 MVENFRFVI---GNKLVLEYRYYISSKELTAEQAANTVSEHWGIES-MHWVLDVSMNE 54 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 55.6 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 33/84 (39%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L +S IPD R+ + L +L L + AV+ GA S I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIARVVSCISPAKFHE 88 + P T+ + + + E Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 19/63 (30%), Gaps = 1/63 (1%) Query: 8 EHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENG 67 E IPD R V H+L +L L AV+ G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAAEGPGDPTGEGCRWP 128 Query: 68 IPV 70 P Sbjct: 129 RPG 131 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 54.8 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 33/64 (51%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYG 62 ++ L + D R+ +H+L IL++ + AVI+ AES +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFEN 66 Sbjct: 61 PLPC 64 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 54.8 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 31/59 (52%), Positives = 34/59 (57%) Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 12 LKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36C2 Length = 109 Score = 54.4 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 25/60 (41%), Gaps = 3/60 (5%) Query: 268 SIIAEQKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + V Y I+S A R HW +EN LH+ DV + ED C + Sbjct: 5 ERRRKANGKATVEVVYGITSLSRLAADAAALLGYSRRHWGIENGLHYTRDVTLGEDRCPV 64 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 54.1 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 30/56 (53%) Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 L+M D+ + DA+G Q IAE+I + G DY+ A+K NQ +A F E Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAE 72 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 51.0 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 16/66 (24%), Positives = 38/66 (57%), Gaps = 4/66 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDI----EDFGETHLDFLKQY 61 L++ SI+PD R + L ++++T+ AV+ GA++W D+ + +G++ + +++ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDVPVGSKKYGDSCMQVVREK 61 Query: 62 GDFENG 67 +G Sbjct: 62 CCLTSG 67 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 50.6 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 13/45 (28%), Positives = 25/45 (55%) Query: 134 TMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 T + + Q++ + +NEIT LL+ D++ +T DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM Length = 437 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 50/358 (13%), Positives = 92/358 (25%), Gaps = 65/358 (18%) Query: 1 MELKKLME----HISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLD 56 + + L+ + IP K LS L+ + S ++ + Sbjct: 9 LSMPGLLSEIKNYFEKIPSPVVKQKDSISLSDCLMSGLAIFSLKYPSLLQFDN--DKRTP 66 Query: 57 FLKQYGDFENGI---PVHDTIARVVSCISPAKFHECFINWMRDCHSS---------NDKD 104 ++ I P + + + ++ + +R ND Sbjct: 67 VVEHNLKSLYKIGIIPSDTYMRERLDELPTSELRGAYTTLIRQAQRGKVLEKFTYYNDYY 126 Query: 105 VIAIDGKTLRHSYDKSRRRGAI-----------HVISAFS-----TMHSLVIG------Q 142 ++++DG S+D + H + + H L + Q Sbjct: 127 LVSMDGTGYFSSHDIHCDQCCEKHHRNGKITYHHQMLGIALVHPNHHHVLPLAPEPIIKQ 186 Query: 143 IKTDKKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVK-G 197 +K E A LL L IIT D + + ++ Y+ K Sbjct: 187 DGVEKNDCERNAGKRLLTQLRKEYPKMKMIITEDGLASNGPHIKLLKSLNMSYILGAKPK 246 Query: 198 NQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGL 257 + L + K + D G + V VP F Sbjct: 247 DHTYLFDRIKNSSQTKFYQTQDDD---------GTIHKYRY-VNQVPLNESHFDLN---- 292 Query: 258 KKLCVAVSFRSIIAEQKKEPEMTVRYY-ISSADLTAEKFATAIRNHWHVENKLHWRLD 314 K + I ++ T E R W +EN+ L Sbjct: 293 -----VNFLIYQEISPKGKVTNFSWVTDILLSEQTLEIVMKGGRARWRIENETFNTLK 345 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 43/278 (15%), Positives = 89/278 (32%), Gaps = 25/278 (8%) Query: 51 GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMR---DCHSSNDKDVIA 107 G + L + + + + I P F + R +S D ++A Sbjct: 3 GNSLSKELYDWLGYSSETATASAFVQQRDKIRPEALKLLFHEFTRLTVSENSLQDYRLLA 62 Query: 108 IDGKTLR------------HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKS-NEITA 154 +DG LR + + S+ +H+ + + M + + KK NE A Sbjct: 63 VDGSDLRLPSNSKDGFSSIRNSEDSKNYNLVHLDAMYDLMGKVYVDASVQSKKGMNEHKA 122 Query: 155 IPELLNMLDIKGKIITT-DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 + +++ +I G +I D + Q++ Y+ K + G + L Sbjct: 123 LVSMVDQSEINGNVIAIMDRGYESFNNIAHFQEKSWYYIIRAKESYG-----IISRLSLP 177 Query: 214 ELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK---KLCVAVSFRSII 270 + + + + +E + L I + +K + FR++ Sbjct: 178 DYPEYDEEIMLTLTRRQTKETLPLLKAYPHRYRWIQPHTTFDFIKPKDSKFYDLHFRAVR 237 Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 TV +++ D EK W +E Sbjct: 238 FAIADGVYETVYTNLNAEDFPPEKLKQLYNLRWGIETS 275 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 48.3 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 23/63 (36%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 L + +PD V H+L+ +L+ I AV S+ I ++ G Sbjct: 14 GLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGGH 73 Query: 65 ENG 67 G Sbjct: 74 RPG 76 >UniRef50_C7GHC1 Transposase, IS4 family (Fragment) n=6 Tax=Roseburia intestinalis L1-82 RepID=C7GHC1_9FIRM Length = 232 Score = 47.9 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 61/201 (30%), Gaps = 27/201 (13%) Query: 137 SLVIGQIK------TDKKSNEITAIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQK 186 +++GQ + K E+T L+ L + +I DA+ +++ Sbjct: 8 HVILGQEMLKPRDGSGKDEGELTGGKRLIERLKKRHGHFADVIVADALYLNAPFINTLKE 67 Query: 187 QGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDE 246 G + + +K + + + E F E + + K + E+ + Sbjct: 68 NGLEGVIRLKDERRMIFQDAERLFKQDE--GKKASFW----KGKKKIEV---------WD 112 Query: 247 LIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVE 306 L F E G V + E KE E + + + W +E Sbjct: 113 LSGFKME--GCPYKLRVVRYHEQWEENGKETERFMWLVTTLEAADYRVLWEMMHRRWDIE 170 Query: 307 NKLHWRLDVVMNEDDCKIRRG 327 +L + C R Sbjct: 171 ENGFHQLKTYYHAKHCYCRDA 191 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 47.5 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 63/163 (38%), Gaps = 16/163 (9%) Query: 51 GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS---NDKDVIA 107 G+ D L +Y +F+N P + + + + I P F F + + + N +IA Sbjct: 53 GKALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFLFQEFTKSFTDNVTYNGLRLIA 112 Query: 108 IDGKTLRHSYDK------------SRRRGAIHVISAFS-TMHSLVIGQIKTDKKSNEITA 154 DG L +++ + +H+ + + I+ + +NE A Sbjct: 113 CDGSDLCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDAIIQPSRLANERRA 172 Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 + E+++ + I D +I ++ +G YL VK Sbjct: 173 MCEMIDRYNDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKD 215 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 1/55 (1%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLD 56 EL++L ++ + D R HKL ++L+ + AVI+GA+ IE + L Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE-WLAGRLQ 72 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 35/108 (32%), Gaps = 7/108 (6%) Query: 41 AESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS 100 +S +E F + L G ++ + P K E + Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLL--PLLLHRLDPKKLQEALHQVFP---EA 55 Query: 101 NDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKK 148 + V+ +DGK LR S + + ++ + + Q + + K Sbjct: 56 DLGGVLVVDGKHLRGS--GKGKSPQVRLVEVLALHLKTTLAQARVEGK 101 >UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteobacteria RepID=A6X872_OCHA4 Length = 330 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 71/226 (31%), Gaps = 22/226 (9%) Query: 17 RQAWK--VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 R+ + I I + + E + L + + E +P H T Sbjct: 53 RKTRGGQCRYSDLAIETTLICGKV-----FNQPLRQTEGLMASLLRLLNVELPVPDHTTF 107 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSNDKDVIAI------DGKTLRHSYDKSRRRGAIHV 128 +R + + + C D D + +H +R+ +H+ Sbjct: 108 SRRCANLVVSSLTRCTRRDGTDEPLHVIVDSTGMKIYEAGQWLEEKHGAKSARKWLKLHL 167 Query: 129 ISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 A + VI + TD+ +++++ +P+LL+M+D D + ++ Sbjct: 168 --AIDADSNQVIAETLTDQNTSDLSQVPDLLDMIDRPIACFMADGAYDSDQTYQALRSHS 225 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREE 234 + R + + D ++ + GR E Sbjct: 226 PGVSIII---PPR----IRDLQEASYGPPDQRDWHSRTNAQRGRME 264 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 46.7 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 1/55 (1%) Query: 8 EHISIIPDYRQAW-KVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 EH +PD R+ + HK IL++ I A+I GA+SW + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7B Length = 481 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 51/325 (15%), Positives = 103/325 (31%), Gaps = 66/325 (20%) Query: 13 IPDY------RQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFEN 66 +PD + ++ + + I A IE G L+++ ++ Sbjct: 46 VPDPVLADLYERHRGRGYE-DVVTFAQLVTWIFDAL----IEHQGSGRQAHLRRHRQPDD 100 Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCH--------------SSNDKDVIAIDGKT 112 G H+ + I P E F+ + D S + +V+ +DGK+ Sbjct: 101 G--CHEAFYGKLRRI-PRGLSEAFLRDVTDRFTALFPEVVAHRLPTSFDRLEVLILDGKS 157 Query: 113 LR----HSYDKSRRRGAIH---VISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDI 164 L+ D G + ++ A+ LV+ D ++NE IP+L+ + Sbjct: 158 LKKVAKRLVDTRGTPGKLLGGKLLVAYRPRDGLVLDMAADLDGETNEAKLIPDLMPRVHA 217 Query: 165 KG---KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD 221 +G K++ D + C + K G ++ + + P Sbjct: 218 RGGPAKLVVGDRLFCASKHFAEFTKDNGHFVV----------RYARTLSFEPDPKRPAVT 267 Query: 222 SYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV 281 + S+++ V + + L++ R +A E + Sbjct: 268 TADPSQRA----------VVEEWGWAGKPKDK---LRRYVR----RITVARPVGEAITIL 310 Query: 282 RYYISSADLTAEKFATAIRNHWHVE 306 + SA A R W +E Sbjct: 311 TDLLDSAPYPATDLLDLYRIRWTIE 335 >UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P672_AZOSE Length = 47 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 15/31 (48%), Positives = 18/31 (58%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 HW VEN LHW L+V NED ++R A Sbjct: 1 HWGVENWLHWCLNVQFNEDRSRVRSAYAVNN 31 >UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z8_THET2 Length = 77 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN+ W DV++ E+ C++R G A++ + +R +++L V + R++ KAA+ Sbjct: 1 MENRSFWVRDVLLYEEACQVR-GVGAQVLAALRAFLVSLLHRRGVREKVTRQRTLKAAL 58 >UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus pneumoniae RepID=B2ISL1_STRPS Length = 535 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 79/249 (31%), Gaps = 26/249 (10%) Query: 18 QAWKVEHKLSGILLLTIFAVISGAESWEDIEDF-GETHLDFLKQYGDFENGIPVHDTIAR 76 Q + S IL+ +F +++G + ++ + + L + G T++R Sbjct: 142 QRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGGQL----ASQPTLSR 197 Query: 77 VVSCISPAKFHEC------FINWMRDCHSSN------DKDVIAIDGKTLRHSYDKSRRRG 124 +S H + + H N D GK +Y+ R Sbjct: 198 FLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTYGKQEGVAYNAHYRAH 257 Query: 125 AIHVISAFSTMHSLVI-GQIKTDKK--SNEI-TAIPELLNMLDIKGKIITTDAMGCQKDI 180 H + AF Q++ + S E + I +L + D+ + Sbjct: 258 GYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERF--NQLLFRMDSGFATPKL 315 Query: 181 AEKIQKQGGDYLFAVKGNQ--GRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGR-EEIRL 237 + I+K G YL +K N RL ++L H +Y+ + G R Sbjct: 316 YDLIEKTGQYYLIKLKKNTVLSRLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRR 375 Query: 238 HIVCDVPDE 246 E Sbjct: 376 VCQFSERKE 384 >UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3YV03_9SYNE Length = 113 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 2/77 (2%) Query: 270 IAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 I + +P +++S T + +R+ W +EN H+ + ++E + N Sbjct: 6 IGTRGCKPFKATHLFLTSLSSTPKTLLQLVRDRWSIENW-HFFRNTQLHESAH-GYQDNG 63 Query: 330 AELFSGIRHIAINILTN 346 A + + N+L Sbjct: 64 ACAMTTQKTGTQNLLRL 80 >UniRef50_B0JNZ6 Transposase n=20 Tax=Cyanobacteria RepID=B0JNZ6_MICAN Length = 382 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 90/311 (28%), Gaps = 59/311 (18%) Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSNDKDVI------------AID 109 IP + I ++ P F ++ + VI A+D Sbjct: 13 LFGIKKIPGDNQIRNLL---DPIPAATIFGSFQQVYQWLKKPGVIKKFFYLDEEILIALD 69 Query: 110 GKTLRHSYDKSRR----RGAIHVISAFSTMHSLVI------------------GQIKTDK 147 G S S R + + + I Q K Sbjct: 70 GTEYFSSKKISCPHCNCRNPRNGTTTYFHGCVTPIVVSPEQKQVINLEPEFIKKQDGQQK 129 Query: 148 KSNEITAIPELLNMLDIK--GKIITT--DAMGCQKDIAEKIQKQGGDYLF-AVKGNQGRL 202 + E A+ L+ K G +T D + ++ I E KQG +++F ++ + L Sbjct: 130 QDCENAAVKRWLDKNHQKKYGYPVTLLGDDLYSRQPICELALKQGYNFIFVCLETSHKTL 189 Query: 203 NKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 + E E+ E + R V VP ++ + E + + Sbjct: 190 YEWREFLEKSGEVKTVEKKQW----DGRKNLIYRYRYVSRVPLREVESSLEVNWCEVTVI 245 Query: 263 AVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKL------H-WRLDV 315 + II + + + EK A R+ W VEN+ H + L+ Sbjct: 246 NEKTQKIIYQNNWITNHQI------TENNVEKIVKAGRSRWKVENEGNNVLKNHGYNLEH 299 Query: 316 VMNEDDCKIRR 326 + Sbjct: 300 NFGHGQSHLCE 310 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 44.8 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 24/47 (51%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINW 93 + F + + ++ D + G P DT+ RV + I P KF E F +W Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHW 47 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 44.4 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 11/61 (18%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDF 64 +L + I D RQ K H L +L++TI +I + LD+L+QY Sbjct: 34 RLADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 Query: 65 E 65 Sbjct: 83 T 83 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 44.4 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 27/50 (54%), Gaps = 13/50 (26%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF-------------SGIRHIAINILT 345 +HWRLDV MNEDDC+IRRGN F +R I INIL Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILK 50 >UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUW9_9BACT Length = 61 Score = 44.4 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 23/61 (37%), Gaps = 2/61 (3%) Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASV 371 D ED +IR NA + ++++ + + V + + +R A + Sbjct: 1 MRDTSFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPN--IAKTLRNFAARPFLALQM 58 Query: 372 L 372 L Sbjct: 59 L 59 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 43.7 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 69/207 (33%), Gaps = 19/207 (9%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPV 70 I+PD R +V+H L +L I+A+ +G E D + G H L+ + + Sbjct: 49 KIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLNDHD--GLRHDYALQTAVNRLQPLAG 106 Query: 71 HDTIARVVSCISPAKFHEC----FINWMRDCHSSNDKDVIAID-------GKTLRHSYDK 119 T+ R+ + + +++ + + V+ D G + Sbjct: 107 KSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGDQEGRFFHG 166 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEIT-AIPELLNMLDIKGK-----IITTDA 173 + F H LV ++ + AI LL + + D Sbjct: 167 YYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPETRIVFRGDG 226 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQG 200 C+ + + ++ DY+ + N Sbjct: 227 GFCRHRMLDWCDRKQVDYVVGLARNTR 253 >UniRef50_A5FU21 Transposase, IS4 family protein n=11 Tax=Alphaproteobacteria RepID=A5FU21_ACICJ Length = 448 Score = 43.7 bits (101), Expect = 0.013, Method: Composition-based stats. Identities = 55/386 (14%), Positives = 113/386 (29%), Gaps = 58/386 (15%) Query: 11 SIIPDYRQAWKVEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLK-QYGDFENGIP 69 + I D R +V+H L I+ + + +G E D + + L + + Sbjct: 55 ACIDDPRTPERVQHGLDEIIRFRMLMIAAGYEDGNDADRLRNDPMFKLAMERLPEAGDLC 114 Query: 70 VHDTIARVVSCISPAKFHE----CFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 TI+R + P ++ + ++ V+ ID ++D + Sbjct: 115 SQATISRTENLPGPRALLRMGLAMVEHYCASFRTIPNRVVLDID-----DTFDAAHGAQQ 169 Query: 126 IHVISAFSTM-----------HSLVIGQI---KTDKKSNEI-TAIPELLNML----DIKG 166 + + +A ++ + K ++I + L++ + Sbjct: 170 LCLFNAHHDEYGFQPIVVFDGDGRMLAAVLRPACRPKGSQIVKWLRRLIDAIRSHWPRTA 229 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK-----FPLKELNNPEHD 221 ++ D+ C ++ + + DY+F V L K ++ + Sbjct: 230 IMLRGDSHYCTPEVLRFCRARRLDYIFGV-APTTTLRKHVIALEASTTARAQQAPGEKIR 288 Query: 222 SYAMSEKS---HGREEIRLHIVCDVPDELIDFTFEWKGLKK---------LCVAVSFRSI 269 + R E R+ + +D F LK + A Sbjct: 289 RFKEFNDGAASWDRVE-RIIARVEAGPMGVDTRFIVTSLKAGSPRTLYQEIYCARGQAEN 347 Query: 270 IAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + K R S A + I +W L W L +M R Sbjct: 348 HIKAWKTHLAADRTSCSRASANQMRLFLHIGAYW-----LMWSLRSLM-----PRRSRWR 397 Query: 330 AELFSGIRHIAINILTNDKVFKAGLR 355 F +R I + + K +R Sbjct: 398 GIQFDTLRLRLIKLAVRLETLKRSIR 423 >UniRef50_A7JYJ5 Putative uncharacterized protein n=1 Tax=Vibrio sp. Ex25 RepID=A7JYJ5_VIBSE Length = 47 Score = 42.5 bits (98), Expect = 0.025, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 1/48 (2%) Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + + EDD + R AE S IR +N++ + K L + ++A Sbjct: 1 MNLKEDDLRNRVAGGAENVSVIRRFTLNLVRL-QSKKYSLGAEAKQAG 47 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 42.5 bits (98), Expect = 0.026, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 21/221 (9%) Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGN-QGR 201 TD ++E + +P + +I D + ++I + GG ++ VK N Sbjct: 168 RTTDGTTHERSQLP---TGEWVADALILLDLGFYDFWLFDRIDQNGGWFVSRVKDNANFE 224 Query: 202 LNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 + + E + + ++R+ T ++ + Sbjct: 225 IVEELRTWRGNSIPLEGESLQAVLDDLQRQEIDVRI-------------TLSFERKRGSG 271 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMN 318 + + + + E Y+++ D +A A R W VE L L Sbjct: 272 ASATRTFRLVGLRNEETEEYHLYLTNLGNDDYSAPDIAQLYRARWEVE-LLFKELKSRFG 330 Query: 319 EDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 D+ E + I++ + L + R Sbjct: 331 LDEINTTDAYIIEALIIMAAISLMMSRVIVDELRSLEARQR 371 >UniRef50_A7BZU0 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU0_9GAMM Length = 201 Score = 42.5 bits (98), Expect = 0.027, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 9/128 (7%) Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY-YISSADLT---- 291 + V + + + + ++ + ++ + R+ +ISS L Sbjct: 38 FYWVNGIHYAYGNNKGIL--ISVVVCEEKWQEVDPNTGEKLDKNSRHVWISSQFLNKHDV 95 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHI--AINILTNDKV 349 E+ R+ W +EN ++ + A + + + + A+N L + Sbjct: 96 HERCNLGARSRWGIENSINTEKRRGYCYEHPFSYDFTAMQNYHYLMRMAHALNALALNTK 155 Query: 350 FKAGLRRK 357 A RK Sbjct: 156 LGAKFVRK 163 >UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW00_RHOSR Length = 98 Score = 42.5 bits (98), Expect = 0.028, Method: Composition-based stats. Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 5/52 (9%) Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNM 161 GKT R + D S H+++A +V+ Q+ + NEI P LL+ Sbjct: 18 GKTWRGAKDGSG--HLTHLLAAVDHDAGVVLRQVAVGARINEI---PLLLDP 64 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 42.1 bits (97), Expect = 0.034, Method: Composition-based stats. Identities = 36/48 (75%), Positives = 39/48 (81%) Query: 78 VSCISPAKFHECFINWMRDCHSSNDKDVIAIDGKTLRHSYDKSRRRGA 125 +SCI KFHECFIN MR+CHSS+D DVIAIDGK L HS DKSRRR A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 42.1 bits (97), Expect = 0.038, Method: Composition-based stats. Identities = 16/30 (53%), Positives = 24/30 (80%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 +++A +T + + IGQ+K D KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 41.7 bits (96), Expect = 0.039, Method: Composition-based stats. Identities = 42/276 (15%), Positives = 86/276 (31%), Gaps = 33/276 (11%) Query: 56 DFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSN---DKDVIAIDG-- 110 D L ++ DF P + S I P F F + + ++AIDG Sbjct: 21 DELLKFNDFSITTPSASAFVQARSKIKPEAFRTLFDGFNKKTFKKKLYHGYRLLAIDGSE 80 Query: 111 ------------KTLRHSYDKSRRRGAIHVISAFS----TMHSLVIGQIKTDKKSNEITA 154 LRH H+ +++ T ++I + + K +E A Sbjct: 81 LPIDNTIFDDETTVLRHGTLAKTFSAY-HLNASYDLMERTYDDIII---QGEAKRDEHGA 136 Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +L++ D + I D + E + G YL V+ + + Sbjct: 137 FCQLVDRYDGQKAIFIADRGYESYNGFEHVVHSGHKYLIRVRDIES------QSSITKSL 190 Query: 215 LNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELID--FTFEWKGLKKLCVAVSFRSIIAE 272 P+ + + ++ ++ C + + F++ + + R + + Sbjct: 191 GPFPDGEFDVDVSRMLTLKQTKMIKACPDVYKFVPKNMRFDFMNKQNPWYEFNCRVVRLK 250 Query: 273 QKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + TV +S + + E W E Sbjct: 251 ITENTYETVITNLSRNEFSMEDICEIYNMRWGEETS 286 >UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVT6_9GAMM Length = 120 Score = 41.7 bits (96), Expect = 0.046, Method: Composition-based stats. Identities = 11/94 (11%), Positives = 27/94 (28%), Gaps = 4/94 (4%) Query: 3 LKKLMEHISIIPDYRQAWK-VEHKLSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQY 61 L+ + + D R + L+ L + + S +D + + Sbjct: 14 LRTVRACFEALDDPRSRPNSTRYTLADALSSALAMFLLKYPSLLQFDDSARAADEVTRHN 73 Query: 62 GDFENG---IPVHDTIARVVSCISPAKFHECFIN 92 G +P + ++ + P+ F Sbjct: 74 LGTLYGVEQVPCDTQMRAILDPLKPSTLRGAFRA 107 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.122 0.311 Lambda K H 0.267 0.0374 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,904,895,694 Number of Sequences: 3077464 Number of extensions: 67384141 Number of successful extensions: 213114 Number of sequences better than 1.0e-01: 249 Number of HSP's better than 0.1 without gapping: 560 Number of HSP's successfully gapped in prelim test: 106 Number of HSP's that attempted gapping in prelim test: 211202 Number of HSP's gapped (non-prelim): 708 length of query: 378 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 248 effective length of database: 640,326,036 effective search space: 158800856928 effective search space used: 158800856928 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 93 (40.6 bits)