BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (374 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 424 e-117 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 353 8e-96 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 310 7e-83 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 291 2e-77 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 273 1e-71 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 269 1e-70 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 266 6e-70 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 257 6e-67 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 244 3e-63 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 241 4e-62 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 240 7e-62 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 238 3e-61 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 233 6e-60 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 231 3e-59 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 225 2e-57 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 224 4e-57 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 224 6e-57 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 221 3e-56 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 220 8e-56 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 216 1e-54 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 215 2e-54 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 214 3e-54 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 214 5e-54 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 210 8e-53 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 208 3e-52 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 207 3e-52 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 205 2e-51 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 204 5e-51 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 201 3e-50 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 199 1e-49 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 194 4e-48 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 193 8e-48 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 193 9e-48 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 192 1e-47 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 187 5e-46 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 187 7e-46 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 186 9e-46 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 185 2e-45 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 184 5e-45 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 179 1e-43 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 178 3e-43 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 176 1e-42 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 175 3e-42 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 174 5e-42 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 174 6e-42 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 166 9e-40 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 160 6e-38 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 159 1e-37 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 158 3e-37 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 154 6e-36 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 153 1e-35 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 153 1e-35 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 152 2e-35 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 148 3e-34 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 148 3e-34 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 145 3e-33 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 144 5e-33 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 144 5e-33 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 143 9e-33 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 143 1e-32 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 140 6e-32 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 139 1e-31 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 136 1e-30 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 132 3e-29 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 131 4e-29 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 130 1e-28 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 128 4e-28 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 125 3e-27 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 124 4e-27 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 122 3e-26 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 120 1e-25 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 117 1e-24 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 109 1e-22 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 103 1e-20 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 102 2e-20 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 102 3e-20 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 100 9e-20 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 99 2e-19 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 99 2e-19 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 98 5e-19 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 98 5e-19 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 97 1e-18 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 95 3e-18 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 93 2e-17 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 92 2e-17 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 92 3e-17 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 91 6e-17 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 91 9e-17 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 88 6e-16 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 86 2e-15 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 86 2e-15 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 85 5e-15 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 84 1e-14 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 82 3e-14 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 82 3e-14 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 81 7e-14 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 81 7e-14 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 80 1e-13 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 79 3e-13 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 79 3e-13 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 78 4e-13 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 78 5e-13 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 78 6e-13 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 77 1e-12 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 76 2e-12 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 74 1e-11 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 73 1e-11 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 73 2e-11 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 73 2e-11 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 72 4e-11 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 71 5e-11 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 70 1e-10 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 70 1e-10 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 70 1e-10 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 69 3e-10 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 68 6e-10 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 66 2e-09 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 66 2e-09 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 66 2e-09 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 66 2e-09 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 63 1e-08 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 61 6e-08 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 60 1e-07 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 60 2e-07 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 59 3e-07 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 59 3e-07 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 59 3e-07 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 59 3e-07 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 59 3e-07 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 59 3e-07 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 58 5e-07 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 58 5e-07 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 58 6e-07 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 57 1e-06 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 57 2e-06 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 56 2e-06 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 56 2e-06 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 55 3e-06 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 55 3e-06 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 55 4e-06 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 55 5e-06 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 54 8e-06 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 54 1e-05 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 52 2e-05 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 52 5e-05 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 51 6e-05 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 50 9e-05 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 50 2e-04 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 49 2e-04 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 49 4e-04 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 49 4e-04 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 48 5e-04 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 47 9e-04 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 47 0.002 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 46 0.002 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 46 0.002 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 45 0.003 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 45 0.003 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 45 0.003 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 45 0.004 UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ... 45 0.004 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 44 0.007 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 44 0.007 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 44 0.010 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 42 0.038 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 42 0.054 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 41 0.054 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 41 0.062 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 424 bits (1089), Expect = e-117, Method: Compositional matrix adjust. Identities = 207/358 (57%), Positives = 270/358 (75%), Gaps = 2/358 (0%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M ++ L+++IS+ PD RQ KV+HKLS IL LT+CAVI+GA+ W++IEDFG L++LK+ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 YGDF+NGIPV DTIARVVS I F + FI WM++CH D ++IAIDGKT+R S+DK Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 +R+GAIH++SAFS + +V+GQ+KT+ KSNEITAIPELLN+L +K +IT DAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A KI+ + DYL AVKG QG+L+ AFEEKFP+ +N + DS++ E SHGR+E RLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQ-KKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 +V DF FEWKGLKKLCVA+SFR E K +++RYYISS D+ A++FA AI Sbjct: 241 CDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 R HW +E+ LHW LDV MNED +IRRGNAAE+ SGI+ +A+N+L + K K +K Sbjct: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 353 bits (905), Expect = 8e-96, Method: Compositional matrix adjust. Identities = 172/357 (48%), Positives = 248/357 (69%), Gaps = 4/357 (1%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 MS +L++ +S+ D RQ KV H L +LFL + AVI+G + W+EI+DFG+++L+WL+K Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 Y F GIP DDTI+R+ ID F+K F WM+ C E++ G++IAIDGKT+RGSF+K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 + IHMVSAF+ N VVLGQVKT AKSNEITAIP+LL+LL ++ L+TIDAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A KI DK DYLL VKGNQ +L A + F + + ++++T+E HGR+++R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 ++ + D FEW GLK L A+SFR +K+ ++ V++++YISS +DAK A R Sbjct: 241 ADANEIG--DLVFEWPGLKTLGYAVSFRTEKDMQTT--VAVKFYISSAKLDAKSLLEASR 296 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 AHW +E++LHW LD+ MNED+ RIR+ N+ E ++ ++ +LNLL++ K G ++K Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRK 353 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 310 bits (793), Expect = 7e-83, Method: Compositional matrix adjust. Identities = 170/350 (48%), Positives = 227/350 (64%), Gaps = 8/350 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL++ S+ D RQ+ K+ H+L IL L V AVI GA+ WQ+IE+ GH RL WL++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 GIPVDDTIAR++S+++ ++ FI+WM E TDG+IIA+DGK+IR S+DK KRK Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 AIHMVSA++ ENGVVLGQ KT+ KSNEI AIP LL+LL +K ++TIDAMGCQ+ IA KI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRLHIVS 241 K+ DY+LAVK NQ +LH + F + F + D F HGR E R + +S Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 ++ L+ W L+ + + S R +AE RY+I+S DAK FA+A+R Sbjct: 246 DM--LSTLGNPERWASLQSIGMVESERYIDGKTTAE---TRYFITSIAPDAKIFANAVRK 300 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK 351 HW IE+ LHWVLDV ED SR+RR NA+E + +A+N LR+ K K Sbjct: 301 HWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCK 350 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 291 bits (746), Expect = 2e-77, Method: Compositional matrix adjust. Identities = 166/368 (45%), Positives = 228/368 (61%), Gaps = 17/368 (4%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDF--GHERLEWL 58 M I+S + S D RQ KV + L +LF ++CAVIA ++ W EI ++ GH W Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHH--SWF 58 Query: 59 KKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFD 118 KK F +GIP DDTIAR+VS ID +F F+ WM+ H++T+GE+IAIDGKT+RGS++ Sbjct: 59 KKQKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYN 118 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 + R IHM+SA+++ N +VLGQ+K E KSNEITAIP LL +L L+ L+TIDAMGCQ Sbjct: 119 RDDRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQT 178 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLH 238 IA+ I DK DYLLAVK NQG L A + F + + D + ++ SHGR E R Sbjct: 179 AIATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGLSDDHVNIEK-SHGRIENRTC 237 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHA 298 V + L+ DF W+ LK + + SFR K ++ + RYYISSK + A++ A Sbjct: 238 YVLSSAALD-GDFT-HWEALKSIVMVESFRAVKGKTAS--LEYRYYISSKVLSAEQALSA 293 Query: 299 IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 R HW IE S+HWVLDV MNED +I + N AE ++ ++ M+LN+L+ +E K Sbjct: 294 TREHWGIE-SMHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQ-------KEPTKL 345 Query: 359 GCVKHRER 366 V R+R Sbjct: 346 SIVGKRKR 353 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 273 bits (697), Expect = 1e-71, Method: Compositional matrix adjust. Identities = 154/355 (43%), Positives = 214/355 (60%), Gaps = 12/355 (3%) Query: 2 SIQSLLDYISVTPDIRQQG-KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 +++S +Y D R++ +H IL + VCA+I+GA+ + EIE FGH + EW + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 + NGIP DT V++ + FE F+ W GE IAID KT+RGS DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 K +H+VSA++ E +V+GQ+KTE SNEITAIPELLN L LK L++IDAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV---NVFSNYKGDSFSTQEISHGRKETRL 237 A KI +K ADY+LA+KGNQ KLH + E F + N Y+ D T E S+GR+E R Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVSN-VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFA 296 +N + ++ D EWK +K + + S R KKE + IRYYISS + A++ Sbjct: 245 AYATNEIEKIIAND---EWKNIKTVAMIESQRIKKEKE----FDIRYYISSAKLSAEDCL 297 Query: 297 HAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK 351 +R HW IE+ LHW LDV ED SRIR+ N AE ++ ++++ALNL++ K K Sbjct: 298 KVVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAK 352 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 269 bits (688), Expect = 1e-70, Method: Compositional matrix adjust. Identities = 150/346 (43%), Positives = 212/346 (61%), Gaps = 9/346 (2%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L + S D RQ+ KV + L IL LT+CAV++GA++W I +G ++L +LK++ F Sbjct: 25 FLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPFA 84 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 +G P D + + + +D+ AF+ FI+W+ ++ G ++AIDGKT R S DK K A Sbjct: 85 DGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKAA 143 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 IHM+SA+S+E + L Q + + KSNEITAIPELL LL LK ++TIDAMGCQ++IA+KI Sbjct: 144 IHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKII 203 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFS---TQEISHGRKETRLHIVSN 242 K+ADY+LA+KGNQG L E +Y + + T E SHGR ETR V+ Sbjct: 204 SKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRR--VTV 261 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 T +++ + W GLK + V + + +DK+ RYYISS DA+ A AIR H Sbjct: 262 CTDIDWLKADHNWPGLKSI-VMVQYHAILQDKTR--AETRYYISSMTSDAEHHAKAIRDH 318 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCK 348 W IE+ LHWV+D+ +D RIR GNA + IK +A N+LR K Sbjct: 319 WGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVK 364 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 266 bits (681), Expect = 6e-70, Method: Compositional matrix adjust. Identities = 146/364 (40%), Positives = 221/364 (60%), Gaps = 24/364 (6%) Query: 8 DYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG 67 DY D R + KHKL I+ +T+CAVI GAD W +IE FG + +WLKK+ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIH 127 IP DT RV S ++ +++F++W+Q T GEI+AIDGKT+R S+D+ K K A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 MVSAFSNENGVVLGQVKTEAKSNEITAIPE---------------LLNLLYLKKNLITID 172 M+SA++ NG+VLGQ + KSNEITAIP+ LL +L L ++T+D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG---DSFSTQEIS 229 A+GCQK+I +I ++ ADY++ +K NQG L+ E F + SN++G + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGRKETRLH-IVSNVTRLNFCDFEFEWKGLKKLCVALSFR-QKKEDKSAEGVSIRYYISS 287 HGR+E R + ++SNV D +++W L + R + DK++ + RY+ISS Sbjct: 251 HGRQEVRYYQMLSNVAE--EIDPDWQWLNLNSIGYVEYLRVENGTDKTS--LERRYFISS 306 Query: 288 KDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDC 347 + + K FA ++R HW IE+ HW+LDV+ NED SRIR+ NA ++ ++ +ALNLL+ Sbjct: 307 LNNNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQE 366 Query: 348 KDIK 351 K +K Sbjct: 367 KTLK 370 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 257 bits (656), Expect = 6e-67, Method: Compositional matrix adjust. Identities = 136/345 (39%), Positives = 210/345 (60%), Gaps = 9/345 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 S++++ S D R ++++ L I+ +T+CAV+ GAD W E+ ++G + +WLK++ Sbjct: 8 SIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQWIAL 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NG+P DT V + + ++ F+ W Q ++++ GE+IAIDGKT+RG+ G++ Sbjct: 68 PNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGEQCS 127 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 IHMVSA+++ N +VLGQ + KSNEITAIPELL +L L+ L++IDAMGCQ IA I Sbjct: 128 LIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIAETI 187 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG---DSFSTQEISHGRKETRLHIVS 241 + + DY+LA+KGNQG L++ + F + ++G DS+ T E HGR E R + Sbjct: 188 IEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTYWTM 247 Query: 242 NVTRLNFCDFEFEWKGLKKL-CVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 T ++ W LK + CV RQ + + RYY+ S + DA+ FA A+R Sbjct: 248 GQT--DYLLGAERWAQLKSIGCVESCRRQPGHPGT---LQRRYYLLSIESDAQRFADAVR 302 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 +HW IE+ LHW+LDV ED R +G +A+ +S I+ +A NLL+ Sbjct: 303 SHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQ 347 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 244 bits (624), Expect = 3e-63, Method: Compositional matrix adjust. Identities = 139/370 (37%), Positives = 216/370 (58%), Gaps = 30/370 (8%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q+L+++ D R +G+ H+L +L + +C ++ G + + ++EDFG + +W K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 +GIP DT RV + + AF F+ W Q EI+A+DGK +R + ++G+ Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQGQSP 126 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 I VSA++ N +VLGQ++ K+NEITA+P+LL +L L ++T+DAMGCQK+IA + Sbjct: 127 RVI--VSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IKDKKADYLLAVKGNQGKLHHAF-----------EEKFPVN---VFSNYKGDSFSTQEIS 229 I + A+Y+LA+KGNQG+ H +++ PV V YK T E Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYK----ETTEKD 240 Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD 289 HGR ETR + S +++ +W GL+ + V S RQ + A V RYY+SS + Sbjct: 241 HGRLETRRYWQSG--DVSWLADRQQWAGLRSVGVVESVRQVGQ--QAPTVERRYYLSSLN 296 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 +D ++FA A+R HW +E+SLHWVLDV+ ED +R R G+AAE ++ ++++ALNLL Sbjct: 297 VDVEKFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLL----- 351 Query: 350 IKGEEEKKEG 359 K E KK G Sbjct: 352 -KRESTKKRG 360 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 241 bits (615), Expect = 4e-62, Method: Compositional matrix adjust. Identities = 136/344 (39%), Positives = 204/344 (59%), Gaps = 9/344 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 S + + D R KH L ++FLTV A+++GA+ W++I+ FG +L+WL+K+ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 G+PVDDTIAR++S+++ A FI W+ E E +IA DGKT+R SFD G RK Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFD-GDRKT 120 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H VSA++ E G+VL Q K++ K NE++ + EL+ LL LK +++T DAM C K +A I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 KDKKADYLLAVKGNQGKLHH---AFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 K DY+L VK NQGKL A+ K + + K +S + HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYV-- 238 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 +L + + +G + + +K+ K E YYISS +++ + A AIR+ Sbjct: 239 ---QLPITPWLTQSQGWTNIKPVIEVTRKRYLKDKETSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 HW IE++ HWVLD+ ED SRIRRG+A E ++ ++ A+NL R Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLAR 339 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 240 bits (612), Expect = 7e-62, Method: Compositional matrix adjust. Identities = 141/332 (42%), Positives = 196/332 (59%), Gaps = 10/332 (3%) Query: 14 PDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDT 73 P R G + H +L + + AV++ D ++I +G E+ +WL+++ NG+ ++T Sbjct: 19 PRKRSNGTL-HDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLVLLNGVASEET 77 Query: 74 IARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFS 133 R+ +D FE F W+ G + +DGKT+RGS G+ AIHMVSAF+ Sbjct: 78 FLRIFRALDPKQFEAAFRRWVAGVVGTLTGGL-GVDGKTVRGSGSGGE--SAIHMVSAFA 134 Query: 134 NENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLL 193 E GVVLGQ K +KSNEITAIPELL LY+ L+TIDAMGCQK+IA +I D+ DYLL Sbjct: 135 TELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQITDQGGDYLL 194 Query: 194 AVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEF 253 AVKGNQ L A E +F ++ + + D SHGR I S + D Sbjct: 195 AVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVA--QIASVLPAEGIVDLA- 250 Query: 254 EWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVL 313 +W KK+ S R+ +S + RYYISS+++ A++ A A+RAHW IE+ LHWVL Sbjct: 251 DWPECKKIARVDSLRKVGNHESK--LERRYYISSRELTAEQLAAAVRAHWGIENRLHWVL 308 Query: 314 DVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 DV EDAS IR+GNA + +S +KK+ LNL+R Sbjct: 309 DVSFGEDASTIRKGNAPQNLSLLKKIVLNLIR 340 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 238 bits (607), Expect = 3e-61, Method: Compositional matrix adjust. Identities = 144/379 (37%), Positives = 212/379 (55%), Gaps = 19/379 (5%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + S+L+Y + D R+ KH L +L + V AVIAGAD + I + +EWLK Sbjct: 10 VVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSRL 69 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE--ITDG---EIIAIDGKTIRGSF 117 + +G+P DTI R+++ + AF++ F EW+ + TD EIIAIDGKT+R S Sbjct: 70 ELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRSH 129 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 D+GK G + + SA++ GV LGQ+ KSNEI PEL+ + ++K ++T+DA GCQ Sbjct: 130 DRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGCQ 189 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV---NVFSNYKGDSFSTQEISHGRKE 234 +D+A KI K DY+LA+K NQ +LH + N F+ K + + HGR + Sbjct: 190 RDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRLD 249 Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 R + V + +W+GLK + VA+ Q + E RYYISS DAK+ Sbjct: 250 KRFYY--QVKLPDEVPAGEDWRGLKTIGVAIRISQ---ENGRETCDTRYYISSLKPDAKQ 304 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 FA A+R HW IE+SLHW LDV ED SR+R AAE ++ +K++A++L IK + Sbjct: 305 FAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSL------IKQHK 358 Query: 355 EKKEGCVKHRERSSEVHFL 373 K+ ++ R V+FL Sbjct: 359 SKESVVMRRRMAGWNVNFL 377 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 233 bits (595), Expect = 6e-60, Method: Compositional matrix adjust. Identities = 129/356 (36%), Positives = 200/356 (56%), Gaps = 6/356 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL Y+ D R Q KH L +L + + AVIAG+ W+++E++G + EWL ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +GIP DDT RV ID + +K +W+Q GEII IDGKT+RGS+D+ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A++ V+A++++ +VLGQVK E SNEITAIP LL LL + ++ITIDAMG Q I +I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG---DSFSTQEISHGRKETRLHIVS 241 +KADY++ +K N L ++ F + + G D + + H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 V + + +W GL+ + V R + + I++Y++S +A+ HAIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWNKTTHD---IQFYLTSLPPNAQFLCHAIRT 326 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 HW IE++LHW LDV +ED RIR + + + ++++ALN+L K K +K Sbjct: 327 HWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQK 382 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 231 bits (590), Expect = 3e-59, Method: Compositional matrix adjust. Identities = 139/356 (39%), Positives = 199/356 (55%), Gaps = 18/356 (5%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+ +SLLDY+ PD R Q K H LS ++F+ +CA++ G D W EI F ER W ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEII---AIDGKTIRGSF 117 + GIP DT R+ + + + + +F W+ + + D +++ A+DGK +R + Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDI--MGDDKLVGQLAVDGKALRAT- 117 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 KG+ A+HMV+ +S E G+ +GQ K KSNEITAIPELL LL LK L++IDAMG Q Sbjct: 118 AKGRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQ 177 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN---YKGDSFSTQ-EISHGRK 233 IA I K DYLLAVK NQ L+ +E+F N G F+ Q + HGRK Sbjct: 178 VKIADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRK 237 Query: 234 ETRLHIVSNVTR-LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 E R V V + C +WK K +A+ + + K + V R+YISS+ +DA Sbjct: 238 EHRRCWVLMVDESMPVCQ---QWKA--KTIIAVQAERIENGKGYDFV--RFYISSRALDA 290 Query: 293 KEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCK 348 A RAHW +E+ LHW LD+ ED + R G A E ++ I++ LN+L+ K Sbjct: 291 TSALKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNK 346 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 225 bits (573), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 137/327 (41%), Positives = 190/327 (58%), Gaps = 13/327 (3%) Query: 24 HKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDS 83 H IL + + AV++ D ++I + + WL+++ NGIP ++T R++ +D Sbjct: 19 HDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLKNGIPSEETFLRILRALDP 78 Query: 84 LAFEKMFIEWMQEC-HEITD----GEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGV 138 FE MF W+ ++D IAIDGKT+RGS G+ AIHMVSAF+ E G+ Sbjct: 79 KQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGSGSGGE--SAIHMVSAFATELGL 136 Query: 139 VLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGN 198 VLGQ K AKSNEITAIPELL L +K L+TIDAMGCQK IA +I KK DYLL VKGN Sbjct: 137 VLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSIAKQIVAKKGDYLLMVKGN 196 Query: 199 QGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGL 258 Q KL A E F ++ D S E HGR T I S ++ D +W Sbjct: 197 QPKLLEAIETAF-IDQHGVESVDRSSRVERGHGR--TVGQIASVLSAKGIVD-PADWPKC 252 Query: 259 KKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMN 318 + S R + +S + RYYISS+ + A++ A A+RAHW +E+ LHW+LDV + Sbjct: 253 VTIGRIDSMRVVGDKQS--DLERRYYISSRALSAEQLAAAVRAHWGVENRLHWILDVSFS 310 Query: 319 EDASRIRRGNAAEIISGIKKMALNLLR 345 EDAS + + NA + +S ++K+AL ++R Sbjct: 311 EDASTVAKDNAPQNLSLLRKIALTIIR 337 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 224 bits (571), Expect = 4e-57, Method: Compositional matrix adjust. Identities = 125/336 (37%), Positives = 194/336 (57%), Gaps = 9/336 (2%) Query: 15 DIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTI 74 D R G+ + L IL +T+CA+I G D W+ I DFG +R WL ++ + G+P T Sbjct: 25 DPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVNMRCGVPSTLTF 84 Query: 75 ARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSN 134 ARV S I+ F+ WM + ++ ++I +DGK++ GS +GK + A H+V+A+ Sbjct: 85 ARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQKATHIVNAYLP 144 Query: 135 ENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLA 194 + V LG+V+ KSNEI AIP LLN L ++ +I+IDAMG QK IA+ I+ K+ADY+LA Sbjct: 145 KEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANLIRLKQADYVLA 204 Query: 195 VKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEI---SHGRKETRLHIVSNVTRLNFCDF 251 +K N + + E F + +Y+G + T+E HGR E R + V + + F + Sbjct: 205 LKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV--LPMMYFHKY 262 Query: 252 EFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM-DAKEFAHAIRAHWLIEHSLH 310 + W+ L+ + S R K + + RYYI+S + + + AIR HW IE+ LH Sbjct: 263 KKYWRDLQAIVRVQSKRHKGNEIET---ATRYYITSLPFAEHRRMSQAIRQHWAIENQLH 319 Query: 311 WVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRD 346 W LD+ + EDAS I RG A + ++ ++KM L +L + Sbjct: 320 WKLDIGLGEDASLITRGYADQNLATLRKMVLKMLEN 355 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 224 bits (570), Expect = 6e-57, Method: Compositional matrix adjust. Identities = 130/332 (39%), Positives = 184/332 (55%), Gaps = 6/332 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L Y D R + H+L I+ + + AV+AGAD W IE +G + WL+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 NGIP DT ARV + +D A E F W++ ++IAIDGKT +GS+D+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 + +VSA+++E+ +VLGQ + KSNEITAIP LL L L +++IDAMG + IA++I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQ---EISHGRKETRLHIVSN 242 ++ADY+LA+KGNQ L ++ F + G ++ E +H R E+R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 V ++ +W GL+ L V S R + E RY++SS DA FAH IRAH Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNKDTTE---TRYFLSSLSTDAATFAHYIRAH 310 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIIS 334 W IE+ LHW LDV NED SRIR+ +A S Sbjct: 311 WGIENQLHWCLDVVFNEDKSRIRKDHAPRNFS 342 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 221 bits (563), Expect = 3e-56, Method: Compositional matrix adjust. Identities = 134/352 (38%), Positives = 204/352 (57%), Gaps = 16/352 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +Q LL+++ D RQQ KV+H L IL + + A +A AD+W E+ F + ++L+KY Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE----IIAIDGKTIRGSFD 118 + NG P DT+ RV+ + ++++ +W QE +GE II IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKW-QERLNRNEGELLKKIICIDGKTMRSNKR 119 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 G++ G H+VSA+S E+G LGQ KSNEITAIPELL + +K ++TIDAMG Q Sbjct: 120 NGEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQT 177 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS---NYKGDSFSTQEISHGRKET 235 IA KI++K+ADY+L++K NQG L+ E F F +G TQE +HG+ ET Sbjct: 178 AIAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIET 237 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 R + ++ + + WKGLK + + R+ E + + RY+ISS + + Sbjct: 238 REYY--QTEKIKWLSQKKAWKGLKSIIME---RKTLEKEGKRLIEYRYFISSLKEEIETV 292 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDC 347 + A+R HW IE S+HW LDV EDA+ AA+ ++ I+K +L++L+ Sbjct: 293 SRAVRGHWSIE-SMHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTA 343 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 220 bits (560), Expect = 8e-56, Method: Compositional matrix adjust. Identities = 138/361 (38%), Positives = 199/361 (55%), Gaps = 22/361 (6%) Query: 1 MSIQSLLDYISV---TPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEW 57 M I L D + V D R + +H+LS +L + VCAV++GAD+++EI +G ++ W Sbjct: 1 MDIGKLADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPW 60 Query: 58 LKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC-HEITDGEIIAIDGKTIRGS 116 L+ + D G+ DT RV + +D FE+ F W+ + ++IAIDGK+ R + Sbjct: 61 LRGFLRLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRT 120 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 K +H+VSAF+ GVVLGQ T KSNEITAIPELL +L ++ ++TIDAMG Sbjct: 121 TSKAA-AAPLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGT 179 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHA-----FEEKFPVNVFSNYKGDSFSTQEISHG 231 Q IA I+++ A Y+L VK N KL + + + P+ S ++ T HG Sbjct: 180 QTKIARAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHE-----TTSTGHG 234 Query: 232 RKETRLHIVSNVT-RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 R E R + T RL+ + WK + V R E S E V YYISS Sbjct: 235 RIEVRRCTAFDATDRLHKAE---AWKDVASFAVVERVRTVGERTSTERV---YYISSLPA 288 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 DA+ A AIR+HW +E+ LHW LDV+ +D +R R G+ A ++ ++ MALNL+R K I Sbjct: 289 DAERIAVAIRSHWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSI 348 Query: 351 K 351 K Sbjct: 349 K 349 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 216 bits (549), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 123/341 (36%), Positives = 186/341 (54%), Gaps = 11/341 (3%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL D++S+ D R +H L +LFL + AV +G D W EI+ FG +LEWL+K+ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NGIP TIAR++ + + W+ + + IIAIDGKT+RG+ G Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLG--CN 119 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 +H V AF NG+ L Q K EI + L+ +L + K LIT+DA+ Q+ I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL--HIVSN 242 +K DY++ VK NQ L A + ++ V + + F+ E HGR E R+ I S Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQRITFQIPSK 239 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 ++ + +W +K L +A+ +K +K++ + +Y+SS D+D + A A+R H Sbjct: 240 LS----PKLQEKWPSVKTL-IAVERHRKIGNKTS--IETSFYLSSHDIDPEYIATAVRGH 292 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNL 343 W IE+SLHWVLDV EDA R+ AE ++ +++MALNL Sbjct: 293 WRIENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNL 333 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 215 bits (547), Expect = 2e-54, Method: Compositional matrix adjust. Identities = 130/327 (39%), Positives = 188/327 (57%), Gaps = 10/327 (3%) Query: 22 VKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNI 81 V + L+ +L T+ +I A ++ EIE G E+L+WL+++ F++G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 DSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLG 141 D E F W+ E + G +AIDGKT+RGS GA+H+VSA+++E G+V+G Sbjct: 62 DPKYLETAFSAWV-ESLRVHVGGGVAIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGK 201 Q AKSNEITAIPELL+ L L ++TIDAMG QK IA+K+ DK ADY+LA+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LHHAFEEKF--PVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLK 259 LH + F P + + D I HGR E R V++ + + W GL Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTC---IGHGRIEERTCQVADASAW-LTEQHSGWAGLA 236 Query: 260 KLCVALSFRQKKEDKSAEGVS-IRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMN 318 + ++ R K KS E S R YISS D K +A R+HW +E++LHW LDV Sbjct: 237 SIAAVIATRTDK--KSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFR 294 Query: 319 EDASRIRRGNAAEIISGIKKMALNLLR 345 ED R R+ +A ++ I+ A N+L+ Sbjct: 295 EDECRTRKDHAPLSLAIIRHAAFNMLK 321 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 214 bits (546), Expect = 3e-54, Method: Compositional matrix adjust. Identities = 134/363 (36%), Positives = 197/363 (54%), Gaps = 23/363 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY--G 62 +LL+ S PD R+ ++ L+ IL + VCA++ GAD W E+ D+ +R EWL + Sbjct: 2 TLLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRW 61 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 + G P DT + +D+ FE F +W++E + DG ++AIDGKT+RGS KG Sbjct: 62 PLEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSN 120 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 + +HMV+A++ ++G+ L Q T K +E+ + LL++L LK ++T+DA+GCQ ++A Sbjct: 121 E-LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAE 179 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRLHI 239 KI + DY+L VK NQ L A E F F + F E HGR ETR + Sbjct: 180 KIVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYT 239 Query: 240 -VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE-FAH 297 +++VT ++ WK L + + S RQ + S V RY I S + E FA Sbjct: 240 WINDVTWMDR-PMRAAWKKLGGVGMIESIRQIGDKVS---VDQRYAIGSCGVQTVEMFAK 295 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 A R+HW IE+ LHW LDV ED R R GN+A +S ++K L LR K+ Sbjct: 296 ASRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLR----------KE 345 Query: 358 EGC 360 EGC Sbjct: 346 EGC 348 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 214 bits (544), Expect = 5e-54, Method: Compositional matrix adjust. Identities = 128/340 (37%), Positives = 182/340 (53%), Gaps = 15/340 (4%) Query: 15 DIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTI 74 D R Q +H L+ IL + CA++ G + +E FG+ + WL+ + NGIP DT Sbjct: 25 DWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLALPNGIPSHDTF 84 Query: 75 ARVVSNIDSLAFEKMFIEWMQECHEITDGE--------IIAIDGKTIRGSFDKGKRKGAI 126 +V S +D F + F W Q E +IAIDGK +RG+ DKG+ I Sbjct: 85 RKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRGAVDKGQAPAVI 144 Query: 127 HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKD 186 V A+++E + LGQVK KSNEI A+PELL +L LK ++TIDAMGCQ+++A KI Sbjct: 145 --VGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMGCQREVARKIIQ 202 Query: 187 KKADYLLAVKGNQGKLHHAFEEKFPVNV-FSNYKGDSFSTQEISHGRKETRLHIVSNVTR 245 +K DY+LA+K NQ LH + +G+ + HGR E R VS Sbjct: 203 QKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHEVRRCWVSEEVE 262 Query: 246 LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 E +W GL+ + R + V RY+ISS DA A ++RAHW I Sbjct: 263 CWLQGAE-KWAGLRSVAAVECERTVAGQTT---VQRRYFISSLKADAALIAASVRAHWGI 318 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 E+SLHWVLDV ED SR RRG +AE ++ ++++ +++ Sbjct: 319 ENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIK 358 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 210 bits (534), Expect = 8e-53, Method: Compositional matrix adjust. Identities = 133/373 (35%), Positives = 203/373 (54%), Gaps = 22/373 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q+ L+ ++ D R + ++L IL ++ AVI D + E+ F + ++L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECH------EITDGEIIAIDGKTIRGSF 117 F +G P DT +V+S +D + F WM E + + G +AIDGKTI S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 + A H+++AF++ +VLGQ+KT+ KSNEITAIPELL L +K ++TIDAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIASKIKDKKADYLLAVKGNQGK--------LHHAFEEKFPVNVFSNYKGDSFSTQEIS 229 K+IA+KI +K DY+LAVKGNQ K LH +++ + KG T E Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTREL--KAKGQYAVTLEKD 238 Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISS-K 288 HGR E R +SN L++ + +W+G+ + + R Y+I S K Sbjct: 239 HGRIEKRECYLSN--DLSWFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLK 296 Query: 289 DMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCK 348 + AK+ R HW IE++LHW+LD+ ED R R NAAE+++ ++K+AL +L+ C Sbjct: 297 EAQAKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCD 356 Query: 349 DIK-GEEEKKEGC 360 K G K++ C Sbjct: 357 TCKCGMRSKRKLC 369 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 208 bits (529), Expect = 3e-52, Method: Compositional matrix adjust. Identities = 122/324 (37%), Positives = 183/324 (56%), Gaps = 7/324 (2%) Query: 23 KHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNID 82 KH I+FL V AVI+GA+ W EI+ FG L+WL+KY F+ GIPVDDTIARV+ I+ Sbjct: 19 KHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPFECGIPVDDTIARVIKRIE 78 Query: 83 SLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQ 142 AF ++F+ ++ E E+IAIDGKT+R SF+ + + A+H V+ +S G++L Q Sbjct: 79 PQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNP-ETQSALHSVTVWSQSRGLILSQ 137 Query: 143 VKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKL 202 K+ K NE A+ E+++ LK +IT+DAM QK IA KI +KK DY++ +K N + Sbjct: 138 KKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKIAEKIIEKKGDYVMPLKKNHRQF 197 Query: 203 HHAFEEKFPVNVFSNYKGDSFST-QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKL 261 E F + S + T +E++ R + ++ EWKG+K + Sbjct: 198 QSEVEAYF--HKISRDCPEMLETYEEVNAERSRIDERYYRKLKVSDWLSKAEEWKGIKSV 255 Query: 262 CVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDA 321 L +K+ D E +YISS D+D + A +R HW +E+ HWVLDV ED Sbjct: 256 ---LEVCRKRSDNGKESQEKVFYISSLDVDIQILAKCVRGHWEVENKAHWVLDVVYKEDE 312 Query: 322 SRIRRGNAAEIISGIKKMALNLLR 345 + AE ++ ++++ALNL R Sbjct: 313 CAVTDEWGAENLAILRRLALNLAR 336 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 207 bits (528), Expect = 3e-52, Method: Compositional matrix adjust. Identities = 134/345 (38%), Positives = 189/345 (54%), Gaps = 26/345 (7%) Query: 23 KHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLK-KYGDFDNGIPVDDTIARVVSNI 81 KH+ S I+ + + AVI GAD W IEDFG + + K +F NGIP DT R S + Sbjct: 33 KHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNF-NGIPSHDTFNRFFSAL 91 Query: 82 DSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF----DKGKRKGAI----------- 126 D L FE+ + +W+Q + G I AIDGKTIRG++ DK RK + Sbjct: 92 DPLKFEESYRQWVQSILKCYSGHI-AIDGKTIRGAYESEQDKRHRKQGVLPDSNTGKYKL 150 Query: 127 HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKD 186 H++SAF+ E GV LGQ+ T+ K NEI IPELL++L +K +ITIDA+GCQ+ IA K+ Sbjct: 151 HVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDALGCQRTIAEKVIK 210 Query: 187 KKADYLLAVKGNQGKLHH---AFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNV 243 + DY+ VK NQ KL + E V+ + + D + T E HGR E+R+ N Sbjct: 211 GEGDYIFIVKDNQPKLKEIVLSVTESI-VSKGTTVRFDKYETHEEGHGRNESRICYCCND 269 Query: 244 TRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW 303 D +WK ++ + R + + E R +ISS + DA++ R HW Sbjct: 270 PGFLGADIRKKWKNIQSFGYIENTRNTNKGTTVEK---RCFISSLEPDAQKILKNSREHW 326 Query: 304 LIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCK 348 IE++LHW LDV +ED +R RR +A S + K+AL LR+ K Sbjct: 327 EIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLRNNK 370 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 205 bits (522), Expect = 2e-51, Method: Compositional matrix adjust. Identities = 106/196 (54%), Positives = 138/196 (70%), Gaps = 13/196 (6%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 MS+ L D+ + D RQ KV +KL +LFL + AVI+GA+ W+EIEDFGH RL+WLKK Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 YGDF +GIPV DTIAR+V ID F + FI+WMQ ++TD +++A+DGKT Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKT-------- 112 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 +HM+SAF+ +NGVVLGQ +T+ KSNEITA+PELL LL L+ ++T+DAM CQK I Sbjct: 113 -----LHMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 ASKIKDKKADYLLAVK 196 I KKADY +AVK Sbjct: 168 VKTIVKKKADYCIAVK 183 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 204 bits (518), Expect = 5e-51, Method: Compositional matrix adjust. Identities = 114/344 (33%), Positives = 194/344 (56%), Gaps = 12/344 (3%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L++++++ + R + KH L ++FL + A+++GA+ W +IE +G +++WL+++ F Sbjct: 8 TLIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPF 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NGIP T+AR++ I + + W+ E IIA DGK +RGSF +G K Sbjct: 68 ANGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKD 126 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+ +V+A+ ENG+VL Q T K EI + ++L++L LK ++T+DA+ CQ++ KI Sbjct: 127 ALQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKI 186 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 +KKA ++ VK NQ KL+ A + +F + + +E HGR+E R ++ Sbjct: 187 SEKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEER-YVFQLKA 245 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEG---VSIRYYISSKDMDAKEFAHAIRA 301 +L + +W ++ + + +SA G V YY+SS K H IR Sbjct: 246 KLP-PELTEKWPTIRSIIAV------ERHRSANGKGTVDTSYYVSSLSPKHKLLGHYIRQ 298 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 HW IE+S H++LDV NEDASRI +A E ++ ++ LN+++ Sbjct: 299 HWRIENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVK 342 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 201 bits (512), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 102/209 (48%), Positives = 138/209 (66%), Gaps = 1/209 (0%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 + Q L PD R+ K + L +IL + + +VI GAD W E+E++ + + E+L+ + Sbjct: 3 TTQKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSF 62 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGK 121 D NGIP DT RV SNIDS FEK FI+W+ ++ EIIAIDGKTIRG+ G Sbjct: 63 LDLPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGG 121 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 +K +HMVSA++N+N +VLGQVK KSNEITAIP+LL +L ++ ++TIDAMGCQ IA Sbjct: 122 KKSPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIA 181 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKF 210 I K ADY+LAVK NQ +L E++F Sbjct: 182 KAIVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 199 bits (507), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 143/370 (38%), Positives = 197/370 (53%), Gaps = 23/370 (6%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+IQ+ + ++ PD R ++ + I+F+ + AVI GAD W EIE FG + K Sbjct: 1 MTIQA---FSAIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQE-CHEITDGEIIAIDGKTIRGSFDK 119 IP DT++R S +D FE+ F W+ + C I ++AIDGK I + DK Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG--VVAIDGKAICDNPDK 115 Query: 120 GKR-----KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAM 174 + ++MVSA+S NG+ LGQ K E KSNE AIPEL+ L L+ +ITIDA+ Sbjct: 116 SSNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAI 175 Query: 175 GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHG 231 GCQK I I + KADY+L K N L + E F ++ S Y + + HG Sbjct: 176 GCQKSITKLIIENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKRYFEENKGHG 233 Query: 232 RKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD 291 R E R + + L + F W G+K L + S R K DK A + RYYISS + D Sbjct: 234 RSEYRECVCISAKNLQY--FLKGWTGIKTLAMINSIR-KMGDKEAV-METRYYISSLEPD 289 Query: 292 AKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK 351 +IR HW +E++LHWVLD+ ED R + GNAA S I K+AL LL+ DIK Sbjct: 290 PIIILKSIRPHWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQ-SDIK 347 Query: 352 -GEEEKKEGC 360 G K++ C Sbjct: 348 LGMAGKRKAC 357 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 194 bits (494), Expect = 4e-48, Method: Compositional matrix adjust. Identities = 130/357 (36%), Positives = 196/357 (54%), Gaps = 29/357 (8%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + ++++++ D R+ K+KH LS I+ L A ++GA+ W EIE FG LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-------------GEIIAID 109 +NGIP DT+ RV + +D ++ +E Q +I + ++AID Sbjct: 66 QLENGIPSHDTLQRVFATLDP----QVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAID 121 Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLI 169 GKTIRG + ++ A+H+V+A++ + G+ GQV T KSNEITAIPELL+++ +K ++ Sbjct: 122 GKTIRG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMV 179 Query: 170 TIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEIS 229 +IDAMG QK IA KI KKADY LAVK NQ L E+ P S D + T E + Sbjct: 180 SIDAMGTQKAIADKIIKKKADYCLAVKENQKTL---LEDIVPFFEMSQEADDHYHTVEKA 236 Query: 230 HGRKETRLH-IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSK 288 HG+ ETR + ++ +V+ L EF ++ + A K +S E RY+I S Sbjct: 237 HGQIETRAYEVIHDVSWLRKTHPEF--GHIQSIGRARIHLDKNGQESEES---RYFILSC 291 Query: 289 DMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 + AKE +R HW IE S+HW+LDV EDA++ A ++ + K L +L+ Sbjct: 292 QVSAKELCDYVRGHWQIE-SMHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLK 347 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 193 bits (491), Expect = 8e-48, Method: Compositional matrix adjust. Identities = 132/363 (36%), Positives = 190/363 (52%), Gaps = 25/363 (6%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 ++D D R K HK+ I+++++ AVI GA W EIE+FG+ ++ + K Sbjct: 5 IIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPDL 64 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWM-QECHEITDGEIIAIDGKTIRGS------FD 118 IP DT R S I FE +F W+ Q C E+ ++AIDGK +RG Sbjct: 65 EFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG--VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 GK + MVSA+S NG+ LGQVK + KSNEITAIP L+N L L ++TIDAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG--------DSFSTQEISH 230 DI I ++ A+Y++A+K N+ K + ++ + +Y+ ++ H Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQ-----IIDDYQDRDEIINRVTRHVSENTGH 237 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD- 289 GR E R V + + F+ + GLK + V + + +RYY++S D Sbjct: 238 GRVEKRTCTVVSYGSIMEKMFKKKLVGLKSI-VGIKSERTIVATGEYTQEVRYYVTSLDN 296 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 +E A AIR HW IE++LHW LDV ED S+ + NAA S KMAL +L+ K Sbjct: 297 TKPEEIASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKT 355 Query: 350 IKG 352 KG Sbjct: 356 TKG 358 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 193 bits (490), Expect = 9e-48, Method: Compositional matrix adjust. Identities = 117/300 (39%), Positives = 163/300 (54%), Gaps = 10/300 (3%) Query: 9 YISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGI 68 ++SV PD R + + +H LS +L + VCAV+ GA+++ ++ +G L WL+K+ G+ Sbjct: 13 FVSV-PDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKFLKLKAGV 71 Query: 69 PVDDTIARVVSNIDSLAFEKMFIEWMQE-CHEITDGEIIAIDGKTIRGSFDKGKRKGAIH 127 P DT RV++ ID AFE F+ W+ + ++AIDGKT R S K G +H Sbjct: 72 PSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGK-DTSGPLH 130 Query: 128 MVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDK 187 MVSAF+ G+VLGQ T+ KSNEITAIPELL +L L+ ++TIDAMG Q IA I+ + Sbjct: 131 MVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAIARTIRSR 190 Query: 188 KADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKET-RLHIVSNVTRL 246 ADY+L VK N L + F Q HGR E R V++L Sbjct: 191 GADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWAYDAVSQL 250 Query: 247 NFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIE 306 + +W GL+ AL R++ D V YYISS DA A A+R+HW +E Sbjct: 251 YKSE---QWAGLQSF--ALVERERTVDGKTS-VERHYYISSLPADAARIAQAVRSHWAVE 304 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 121/363 (33%), Positives = 192/363 (52%), Gaps = 14/363 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++ LL++ S D R + ++ H L IL L VC +A D+++ I +G L +L+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 + +G+P + + +++ ID F F W++ + +AIDGKT R S D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFP-GRADFVAIDGKTSRRSHDRRAG 130 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY----LKKNLITIDAMGCQK 178 IH+VSAF+ + +VL Q K+NE+ AIP LL+ L L L++IDA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLH 238 IA+ I+ + ADYLLAVK NQ L E F V +++ D + HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEERH- 245 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLC-VALSFRQKKEDKSAEGV--SIRYYISSKDMDAKEF 295 VS + +++ + G +L VA R A+ RY+ISS + A+ Sbjct: 246 -VSVIREVDWLSGTRRFPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHA 304 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 A A+R HW IE+ LHWVLDV +D SR+R G+ A+ ++ ++ ALNL+R D K + Sbjct: 305 ADAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQKSLKT 364 Query: 356 KKE 358 +++ Sbjct: 365 RRK 367 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 187 bits (475), Expect = 5e-46, Method: Compositional matrix adjust. Identities = 123/343 (35%), Positives = 189/343 (55%), Gaps = 11/343 (3%) Query: 7 LDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDN 66 +D +V D RQ K+++ LS ILFL +AG + +E+EDF Y D Sbjct: 11 IDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSE 70 Query: 67 GIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI-TDGEIIAIDGKTIRGSFDKGKRKGA 125 G P DT+ RV+S ++S +++ +++ Q + ++I++DGKTIRG ++GK + Sbjct: 71 GCPSHDTLERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG--NRGKNQKP 128 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 +H+V+A+ + + LGQV E KSNEI AIP+LL + ++K+++TIDAMG Q I I Sbjct: 129 VHIVTAYDGGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTII 188 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFP-VNVFSNYKGDS--FSTQEISHGRKETRLHIVSN 242 KADY LAVKGNQ L+ F VN+ + ++ + T E S G+ E R + VS+ Sbjct: 189 KGKADYCLAVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSS 248 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 + C +W L+ + + + K S E RY+I S D FA+ +R H Sbjct: 249 DIKW-LCQNHPKWHKLRGIGMTRNTIDKDGQLSQEN---RYFIFSFKPDVLTFANCVRGH 304 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 W IE S+HW+LDV +ED + AA ++ I+KM L L+ Sbjct: 305 WQIE-SMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLK 346 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 187 bits (474), Expect = 7e-46, Method: Compositional matrix adjust. Identities = 110/341 (32%), Positives = 178/341 (52%), Gaps = 8/341 (2%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 +L ++ D R +H + I FL + AVI+GA W +FG LEWL+KY F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 NGIP +I R+ + + + W+ E T IAIDGK ++G+ A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKGA-KASASSAA 119 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 +HMV+A+ +G+V +K +E+ + ELL L LK L+T DA+ CQ I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTR 245 + D +L VKGNQ KL+ A + +F + +N + F+ HGR E R ++ Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKR---ITFQCP 236 Query: 246 LNF-CDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 LN + + +W LK L +A+ +K +K++ + +Y+SS + ++ F AIRAHW Sbjct: 237 LNLPAEIKMKWSQLKTL-IAVERHRKVGNKTS--IDTHFYVSSAVLTSEAFGRAIRAHWQ 293 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 E++ HW+LD ED ++ + A I++ +++ ALNL++ Sbjct: 294 TENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVK 334 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 186 bits (473), Expect = 9e-46, Method: Compositional matrix adjust. Identities = 107/335 (31%), Positives = 177/335 (52%), Gaps = 14/335 (4%) Query: 15 DIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTI 74 D RQ GKV+H++ +L + C+ + + + ++ DF +L WL+ + +G P D Sbjct: 13 DPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFLPLKHGAPSHDVF 72 Query: 75 ARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSN 134 V+ I A ++ W C ++ +G IAIDGK +RG+ + + +H++ A+ + Sbjct: 73 RNVLMAIQPQALLEVLTGW---CGDL-EGRHIAIDGKALRGTHNAETGRHLVHLLRAWVD 128 Query: 135 ENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLA 194 + + GQ+ KSNEI AIP LL L LK +TIDAMG Q IA +I ADY+LA Sbjct: 129 DYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAEQITGAGADYVLA 188 Query: 195 VKGNQGKLHHAFEEKFP----VNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCD 250 +K N + H + F +++ ++ S T E+SHGR E R + ++ L++ Sbjct: 189 LKANHPRAHETVRKHFTEAERLDLSPSHHRKSV-TLELSHGRCERREYTITE--ELDWYH 245 Query: 251 FEFEWKGLKKLCVALSFRQKKEDKSAEGV-SIRYYISSKDMDAKEFAHAIRAHWLIEHSL 309 ++W GL+ VA RQ + + + Y++ S D + A +R HW +E+ Sbjct: 246 KSWKWAGLQS--VAQVRRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLVRGHWSVENRC 303 Query: 310 HWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 HWVLDV NED ++R NAA ++ +++M + L Sbjct: 304 HWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTL 338 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 185 bits (469), Expect = 2e-45, Method: Compositional matrix adjust. Identities = 119/343 (34%), Positives = 179/343 (52%), Gaps = 20/343 (5%) Query: 14 PDIRQQGKVK-HKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDD 72 PD R + K H L+ IL + CAVIAGA+ W++I ++G + + +++ + NG+P D Sbjct: 13 PDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLELKNGVPSHD 72 Query: 73 TIARVVSNIDSLAFEKMFIEWMQECHEIT-------DGEI-IAIDGKTIRGSFDKGKRKG 124 T RV + +D AF F W E E T DG +A+DGK+ R S K G Sbjct: 73 TFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRSA-KPTFSG 131 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 +H+V + + ++LGQ +EIT ++L L L ++T+DA GCQ + I Sbjct: 132 CLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGCQTETLEVI 191 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG-DSFSTQEISHGRKETRLHIVSNV 243 + + +Y++ VKGNQ L A F + + G D ++ +HGR E R NV Sbjct: 192 RARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEER-----NV 246 Query: 244 TRLNFCD-FEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 T ++ D W G+ VAL R ++ A + YY+SS + A E A IR H Sbjct: 247 TVVHDPDGLPAGWAGVGS--VALVCRDRQVKGKANESTAHYYLSSLRVGAAELAGYIRGH 304 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 W IE S+HWVLDV ED SR R G+A + I+++A++LL+ Sbjct: 305 WHIE-SMHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLK 346 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 184 bits (467), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 109/309 (35%), Positives = 160/309 (51%), Gaps = 6/309 (1%) Query: 38 IAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC 97 +A A+ W++IE +G + WL+ + NGIP DT RV +D+ AFE+ F +Q Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPE 157 E++A+DGK++R S G +H+VS +++ G+ LGQ + KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVF-S 216 LL L L ++T+DAMGCQ IA +I+ K AD LL +K N G + A F S Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 217 NYKGDSFSTQEISHGR-KETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKS 275 G HGR R+ + + T L W L ++ + R + Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFVDAAATALAPLS---GWPDLSRVLAVETLRGIPGTGT 240 Query: 276 AEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISG 335 IRY+++S D IR HW +E++LHWVL+V ED SR+R AA + Sbjct: 241 VVA-DIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNFAL 299 Query: 336 IKKMALNLL 344 ++K+ALNL+ Sbjct: 300 VRKIALNLI 308 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 179 bits (455), Expect = 1e-43, Method: Compositional matrix adjust. Identities = 110/342 (32%), Positives = 176/342 (51%), Gaps = 13/342 (3%) Query: 14 PDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDT 73 PD R +H L +L + + A I GA+ + F +R +++ + G+P DT Sbjct: 13 PDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLPSHDT 72 Query: 74 IARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFS 133 +RV +D +AF + F +++ E G ++AIDGKT+R SFD+ + A+H+VSAF+ Sbjct: 73 FSRVFRLLDPVAFSRCFQQFLDHLGEDGAG-VLAIDGKTLRRSFDRAAGRSALHVVSAFA 131 Query: 134 NENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLL 193 + +++GQ A NEI A LL L LK L+T DA+ Q+ A I ++ D+L Sbjct: 132 SGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGGDWLF 191 Query: 194 AVKGNQGKLHHAFEEKF--PVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDF 251 +K N+ L E F P V + T + HGR E R H VS+ D Sbjct: 192 PLKDNRPALRAEVERYFADPATVLAV----PHVTTDADHGRIEVRRHWVSHDVAWLASDR 247 Query: 252 EFE----WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEH 307 F GLK L + + ++ ++ Y+SS ++ K A A+RAHW IE Sbjct: 248 RFPDEAVLPGLKILGLVERTVTSPDGRTTATRTL--YLSSAALEPKTLARAVRAHWSIEA 305 Query: 308 SLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 ++HWVLD +ED +R R+ + E ++ ++K+ALN++R + Sbjct: 306 AVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANN 347 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 178 bits (452), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 117/369 (31%), Positives = 196/369 (53%), Gaps = 23/369 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL ++ + D R +KH L ++FLT+ A+++GA W+ IE FG +L+WL+ Y F Sbjct: 2 TLLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 ++GIP IA ++ ++DS + W+ + T IIA+DGKT+R ++ + Sbjct: 62 EHGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ- 120 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H+VSAF NG+ L E K +E ++++ L L ++T+DA+ CQK KI Sbjct: 121 ALHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKI 180 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQE---ISHGRKETR--LHI 239 KK+D+++ +KGNQ L A + + Y + + E HGRKE R + I Sbjct: 181 ISKKSDFVIQIKGNQPALLAAVKAA----FAACYDSPALAISEQTNTGHGRKECRRVMQI 236 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 N+ + +W ++ L V ++ + +K+A S R+Y+SS +D + A I Sbjct: 237 EGNLPP----ELSEKWPHIRTL-VEVASERTVGNKTA--CSSRWYVSSLPVDTAQLADII 289 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 RAHW IE+ LHWVLDV ED + + A+ ++ + AL++ IK + KK+ Sbjct: 290 RAHWAIENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSV------IKQHQGKKDS 343 Query: 360 CVKHRERSS 368 R+ ++ Sbjct: 344 LAAKRQSAA 352 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 176 bits (447), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 101/241 (41%), Positives = 146/241 (60%), Gaps = 10/241 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +Q LL+++ D RQQ KV+H L IL + + A +A AD+W E+ F + ++L+KY Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE----IIAIDGKTIRGSFD 118 + NG P DT+ RV+ + ++++ +W QE +GE II IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKW-QERLNRNEGELLKKIICIDGKTMRSNKR 119 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 G++ G H+VSA+S E+G LGQ KSNEITAIPELL + +K ++TIDAMG Q Sbjct: 120 NGEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQT 177 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS---NYKGDSFSTQEISHGRKET 235 IA KI++K+ADY+L++K NQG L+ E F F +G TQE +HG+ ET Sbjct: 178 AIAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIET 237 Query: 236 R 236 R Sbjct: 238 R 238 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 175 bits (443), Expect = 3e-42, Method: Compositional matrix adjust. Identities = 88/206 (42%), Positives = 131/206 (63%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L+D+ D R + HKL I+ + +CA+I GAD + +E +G+ + EWLK++ + Sbjct: 8 TLIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLEL 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +NGIP DT ARV + ID FE+ F +W+ E+ G+++ IDGKT++ S +K + K Sbjct: 68 ENGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKK 127 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 AIH+V+A+++E +VL Q K ++ EITAIP L+ +L L L+TIDAMG Q DIA + Sbjct: 128 AIHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELL 187 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKF 210 K ADY LA+KGNQ L +E F Sbjct: 188 HSKGADYCLALKGNQRGLFQEVKEVF 213 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 174 bits (441), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 104/288 (36%), Positives = 163/288 (56%), Gaps = 13/288 (4%) Query: 9 YISVTPDIRQQGKVK-HKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG 67 + + PD R+ K H LS IL + +CAV++G D+W+ + +FG + WL+++ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEII---AIDGKTIRGSFDKGKRKG 124 IP DT RV S ID AFE F +W H G+++ A+DGKT+R S +G Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWA--AHARIGGDVLDQLALDGKTVRRSH-RGSAGR 133 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H++ A+S E +++ Q + + KSNEITAIP++L+L L+ I+IDA+GCQK +A +I Sbjct: 134 ALHLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQI 193 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 + DY+LA+KGNQ LH + +G + + ++ HGR ETR V++ Sbjct: 194 TEAGGDYVLALKGNQSALHDDVRLFMETQADRHPQGQAEAVEK-DHGRIETRRIWVND-- 250 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 +++ + +W GLK L + S R+ S E R +I+S D Sbjct: 251 EIDWLTQKPDWPGLKTLVMVESRRELNGQVSCER---RCFITSHTADP 295 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 174 bits (440), Expect = 6e-42, Method: Compositional matrix adjust. Identities = 119/355 (33%), Positives = 195/355 (54%), Gaps = 20/355 (5%) Query: 10 ISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG-- 67 I+V D R QG++ + L IL +++ A I+G D+W++IED+ + E L+ +G Sbjct: 9 IAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGKE 68 Query: 68 -----IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 +P DT V ID F +++ +++ +E G+ IAIDGKT RG + Sbjct: 69 LKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG-IKQTAN 127 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 ++VSA+ ++ V+ + +E K +E+++I +L+ LL+L+ N +TIDA G ++ Sbjct: 128 SHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVIE 187 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFS--TQE-ISHGRKETR-LH 238 I K +++L VKGNQ KL E++ F Y+G++ S TQE I HGR E R ++ Sbjct: 188 MILSKGGNFVLPVKGNQKKLLEFIEKE-----FREYRGNTVSADTQEDIGHGRVEKRTVY 242 Query: 239 IVSNVTRLNFCDFEFE-WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 ++ + + D + WKG+K L + KK DKS + YYI++ +D KE Sbjct: 243 CITEIKTDDDIDGCMQKWKGVKTLVKIVREVYKKADKSTR-IETVYYITNL-IDPKEINR 300 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKG 352 AIRAHW IE++LH LDV +NED S+ N E + +AL ++++ +G Sbjct: 301 AIRAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRG 355 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 166 bits (421), Expect = 9e-40, Method: Compositional matrix adjust. Identities = 109/340 (32%), Positives = 178/340 (52%), Gaps = 17/340 (5%) Query: 14 PDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDT 73 PD R G H L+ ILF+ + A + GA ++ F + + NG+P DT Sbjct: 16 PDPRA-GNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDVLVLKNGLPSHDT 74 Query: 74 IARVVSNIDSLAFEKMFIEWMQ---ECHEITDGE-IIAIDGKTIRGSFDKGKRKGAIHMV 129 +RV +D AFEK F +M+ + +I + +IA+DGK +R ++ G+ MV Sbjct: 75 FSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGYESGRSHMPPVMV 134 Query: 130 SAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKA 189 +A++ + + L V+ +NE +L+ LL LK ++T DA+ C + +A IK + Sbjct: 135 TAWAAQTRMALANVQAP-NNNEAAGALQLIELLQLKGCVVTADALHCHRGMAEAIKARGG 193 Query: 190 DYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFC 249 DY+LAVK NQ L + K + + S T + HGRKE R +V+ V ++ Sbjct: 194 DYVLAVKDNQPALMR--DAKAAIRAATRQGKPSTITVDAGHGRKEKRRAVVAAVPQMAQ- 250 Query: 250 DFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSL 309 + ++ GLK VA ++ DK+ E RY++ S+ K+ +R HW IE+SL Sbjct: 251 --DHDFAGLK--AVARITSKRGTDKTVE----RYFLMSQAYPPKDVLRIVRTHWTIENSL 302 Query: 310 HWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 HW LDV ++ED +R R+ NA ++ ++++ALN+ R D Sbjct: 303 HWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPD 342 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 160 bits (405), Expect = 6e-38, Method: Compositional matrix adjust. Identities = 129/356 (36%), Positives = 183/356 (51%), Gaps = 22/356 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL + + + P R + K + L +L + + ++G W EIED+ E E LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 DNG------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRG--- 115 G +P DT+ R +S +D AFE + W++ T G+ I IDGKT+RG Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRGVKK 123 Query: 116 -SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAM 174 SFD H+VSAFS ++ L Q+ + K+NEI AI +LL+LL L +++IDA+ Sbjct: 124 LSFDTQS-----HVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAI 178 Query: 175 GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKE 234 G Q I +I DK DY+L VK NQ E F +F + T E+SHGR E Sbjct: 179 GTQTAIVEQIIDKGGDYVLCVKANQSLSLQEIEAYF-CPLFQKHILLDEQT-ELSHGRIE 236 Query: 235 TRLH-IVSNVTRLNFCDFEFEWKGLKKLC-VALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 TR + + N + + KGL+ + V R KK DK++E V+ YYISS D Sbjct: 237 TRRYESILNPLEIEANEVLTRRKGLRSIHKVVRKRRDKKSDKTSEEVA--YYISSLT-DV 293 Query: 293 KEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCK 348 AIR HW IE+ LH LDV DAS R N A+I+ I+K+ L ++ K Sbjct: 294 SSLKQAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLK 349 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 159 bits (402), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 109/350 (31%), Positives = 173/350 (49%), Gaps = 25/350 (7%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 LD PD R +H L ILF+ + AV+ GA E+E F RL+ L+++ + Sbjct: 3 FLDVFGEVPDPRDL-TAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWM-----QECHEITDGEIIAIDGKTIRGSFDKG 120 G P DT +RV++ +D +A + F+ +M Q + G++ A+DGK++R ++ KG Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQV-AVDGKSLRRAYAKG 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 + +V+ F + + L Q + + E+ A L LL LK +T DA+ C + + Sbjct: 121 RSHMPPLVVTVFGCDTFMSLAQTVAQ-EGGEVQAAIAALELLSLKGLTVTADALHCHRRM 179 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSF-STQEISHGRKETRLHI 239 ++D Y++A+KGNQ KL A E ++ + K F T+E +HGR E R Sbjct: 180 TKTVRDGGGHYVIAIKGNQSKL--AAEANTALDKAAAGKATKFHQTEEDAHGRHEVRRAF 237 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED-KSAEGVS---IRYYISSKDMDAKEF 295 V F K V L + E ++ EG + +R Y S+ M A E Sbjct: 238 V----------IPFAQTPGKNALVDLCAIGRVESWRTVEGKTTHKVRCYALSRKMPAHEL 287 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 +R HW IE+ LHW LDV + ED R R+ N A + ++++ LN+LR Sbjct: 288 LATVRRHWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLR 337 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 158 bits (400), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 92/222 (41%), Positives = 139/222 (62%), Gaps = 6/222 (2%) Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 ++H+V+A+ +++ ++LGQVK + KSNEITAIP+LL +L+L+ ++TIDAMGCQK IA +I Sbjct: 2 SLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQI 61 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKF-PVNVFSNYKGDSFSTQEISHGRKETRLHIVSNV 243 KKADY+LAVK NQ +L+ + F V ++ + T + HGR ETR + S + Sbjct: 62 GSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREY--STI 119 Query: 244 TRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW 303 + W L + + S R+ S E RY+I S + A+ F A+R HW Sbjct: 120 VGDDLLAGITGWDNLNAIGMVESKREVGNTISNEK---RYFIMSINGHAQRFGDAVREHW 176 Query: 304 LIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 IE+++HWVLDV ED SRIR+ N+ E +S ++K+ALN ++ Sbjct: 177 GIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVK 218 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 154 bits (389), Expect = 6e-36, Method: Compositional matrix adjust. Identities = 85/195 (43%), Positives = 121/195 (62%), Gaps = 4/195 (2%) Query: 94 MQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEIT 153 M+ H++T GE++AIDGKT+RGS+D+ R+ IHMVSA+++ N +VLGQ+KT KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN 213 AIP L+ +L L+ ++TIDAM CQ IA I K DYLLAVKGNQGKL A + F + Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 VFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED 273 + D+ ++ GR E R + V + + L DF W GL + + ++R K Sbjct: 121 RRAPIDRDTCQIEK-QKGRVEARTYHVLSASDL-IRDFS-TWSGLTSIVMVENYRAAKGR 177 Query: 274 KSAE-GVSIRYYISS 287 + A GV + + + S Sbjct: 178 QRARVGVPLLHKVQS 192 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 85/241 (35%), Positives = 132/241 (54%), Gaps = 14/241 (5%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 I+ L++ + D R GK++H+L IL + VCAV+A A+ +++I +G + WL + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEW------MQECHEITDGEIIAIDGKTIRGS 116 D GIP DT RV ID AFE+ F+ W Q E + E IA+DGK +R S Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 FD+ + +H+VSA++ G+VL Q + K E A+P +L L+L L+++DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYK-----GDSFSTQEISHG 231 ++++A I + A YLL +K NQ K+H F N F++ D+F +HG Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDD---THG 238 Query: 232 R 232 R Sbjct: 239 R 239 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 114/355 (32%), Positives = 178/355 (50%), Gaps = 30/355 (8%) Query: 3 IQSLLDYISVTPDIRQ--QGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L +++ PD R+ +G K+KL IL L + + +I FG L+ + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE---ITDGEIIAIDGKTIRGSF 117 G +G+P + T+ R+ +ID A + E+ H+ G+I+ IDGK +RG+ Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 + R I VSA+S E GV L E KSNEIT++P+LL+ + + ++T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF----PVNVFSNYKGDSFSTQEISHGRK 233 K I KI++K D+L+ +K NQ L + E+ PV+V+S + HGR Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDVYSE-------GPFLEHGRI 248 Query: 234 ETRLHIVSNVTRLN--FCDFEFEWKGLKKLCVALSFRQKKED--KSAEGVSIRYYISSKD 289 ETR V + R N D E +W G + + ++K D KS+E R+Y+SS Sbjct: 249 ETR---VCRIFRGNDLITDRE-KWNGNLTVVEIRTATERKSDGQKSSER---RFYVSSFH 301 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 A+ R HW IE S+HW LD + +D R +A + I++M L +L Sbjct: 302 GSARRLGTIARMHWAIE-SMHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAIL 355 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 152 bits (385), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 106/348 (30%), Positives = 173/348 (49%), Gaps = 16/348 (4%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 +++ L PD R V+H L +L + +V+ G+ E+ FG + + + Sbjct: 10 IAMHIFLSAFDEVPDPRAS-NVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT-DGEIIAIDGKTIRGSFDK 119 + + IP DT + V ID A + F + + + ++ DG+IIAIDGK +RG+ D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 G+ MVSA+++ + L V + + E++A E L L+ L+ ++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFS-TQEISHGRKETRLH 238 + I D+ LA+KGNQ L F +K D + T+ HGRKETR Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFS----KGHKSDPTAVTENTGHGRKETRKA 243 Query: 239 IVSNVTRLNFCDFEF-EWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 +V + L E+ E+ GLK + R+ ++E RY+ S + Sbjct: 244 VVVSAKALA----EYHEFPGLKGFGRIEATRETGGKVTSE---TRYFALSWVPTPEVLLA 296 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 A+R HW IE++LHW LDV EDA+R R+ N I+ +++ AL++LR Sbjct: 297 AVRDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLR 344 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 148 bits (374), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 102/335 (30%), Positives = 163/335 (48%), Gaps = 16/335 (4%) Query: 14 PDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDT 73 PD R + +H L +L + +V+ GA E+ FG + + + + +P DT Sbjct: 45 PDPRAE-NTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLKHAVPSHDT 103 Query: 74 IARVVSNIDSLAFEKMFIEWMQECHEIT-DGEIIAIDGKTIRGSFDKGKRKGAIHMVSAF 132 + V ID A + F + + + DG++IA+DGK +RG+ D G+ MVSA+ Sbjct: 104 FSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGRTRMMVSAY 163 Query: 133 SNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYL 192 + + L V + + E+ A E L L+ LK ++T DA+ C + + I D+ Sbjct: 164 AARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAINAGGGDWC 222 Query: 193 LAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFE 252 LA+K NQ L F ++ S +++I HGR ETR V + L Sbjct: 223 LALKANQDSLLSDARASFGAEPDAH---PSALSEDIGHGRTETRKATVVSSKALAE---H 276 Query: 253 FEWKGLKKLCVALSFRQKKEDKSAEGVS--IRYYISSKDMDAKEFAHAIRAHWLIEHSLH 310 E+ GLK R + K+AEG + RY+ S + +RAHW IE+SLH Sbjct: 277 HEFPGLKAFG-----RVEATRKTAEGTTSETRYFALSWVPTPEVLLATVRAHWAIENSLH 331 Query: 311 WVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 W LDV EDA+R R+ N+ I+ +++ AL+++R Sbjct: 332 WQLDVSFREDAARNRKDNSPGNIAILRRRALDVMR 366 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 148 bits (374), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 119/395 (30%), Positives = 192/395 (48%), Gaps = 49/395 (12%) Query: 13 TPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDD 72 T D R++ KV + I+ +T+ V W +I DF + ++L+++ P D Sbjct: 27 TIDPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHD 86 Query: 73 TIARVVSNIDSLAFEKMFIEW----------MQEC----------HEITDGEIIAIDGKT 112 T+ R I + E + EW +++C +++ IAIDGKT Sbjct: 87 TLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKT 146 Query: 113 IRGSFDKGK--------------RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPEL 158 I G+ + K +H+VSAF ++ + LGQ + K NEI AIP+L Sbjct: 147 ICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKL 206 Query: 159 LNLLYLKK-NLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN 217 L+ + +++ +++TIDA+G QK I KI +K+ADYLL VK N KL E + S Sbjct: 207 LDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISG 266 Query: 218 YKGDSFSTQEIS---HGRKETRLHI-VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED 273 + D E + HG TR I S +RL FC +WK L+ + + +K Sbjct: 267 RENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFC--YRDWKNLRTYGIIKT--EKINI 322 Query: 274 KSAEGVSIRY-YISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEI 332 + E + ++ +ISS + + R HW +E+ LHW LDV NED R + N+A+ Sbjct: 323 ATGEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQN 381 Query: 333 ISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERS 367 S + KMAL +L++ +D E+KK + R+++ Sbjct: 382 FSTLTKMALTILKNYQD----EDKKTSVNRKRKKA 412 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 145 bits (365), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 107/359 (29%), Positives = 172/359 (47%), Gaps = 36/359 (10%) Query: 3 IQSLLDYISVTPDIR--QQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L ++ S PD R ++G ++HKLS I+ L + ++ EI +FG L+ +K Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-----EIIAIDGKTIRG 115 NGIP + T+ R+ ID A + + H+ G EII IDGK RG Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 + K R I VSA S + L E KSNEI A+P L++ + + ++T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFP-VNVFSNYKGDSFSTQEISHGRKE 234 QKDI KI++K D+++ +K NQ L + E+K ++ +Y G+ E+ HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKELSPVYSYCGEP----ELGHGRIE 268 Query: 235 TRLHIVSNVTRL---------NFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 TR + V + T L N E+E + +KK + + R ++ Sbjct: 269 TRSYRVFDGTDLIANKEKWNGNLTIIEYECETVKKSTGNCTTEK------------RLHV 316 Query: 286 SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 SS + +R HW IE S+HW LD + +D + + AA + I+++ ++ Sbjct: 317 SSLPANTPRLGTPVRNHWSIE-SMHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVF 374 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 144 bits (363), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 84/210 (40%), Positives = 120/210 (57%), Gaps = 9/210 (4%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 ++D D R K HK+ I+++++ AVI GA W EIE+FG+ ++ + K Sbjct: 5 IIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPSL 64 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWM-QECHEITDGEIIAIDGKTIRGS------FD 118 IP DT R S I FE +F W+ Q C E+ ++AIDGK +RG Sbjct: 65 EFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG--VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 +GK + MVSA+S NG+ LGQVK + KS+EITAIP L+N L L ++TIDAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEE 208 DI I A+Y++A+K N+ K + ++ Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQ 212 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 144 bits (363), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 76/186 (40%), Positives = 111/186 (59%), Gaps = 6/186 (3%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SLL PD R++ + H+L +L +C VI+GA+ W + + +L+WL+ Y + Sbjct: 7 SLLTAFDDLPDPRRR-ECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +GI DT RV S +D+ FE F+ W+ +G+ +AIDGK +RGS D + Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGA--RS 123 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 IH+VSA+S+ + LGQV+T KSNEITAIPELL L ++ + ITIDAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPA--- 180 Query: 185 KDKKAD 190 + ++AD Sbjct: 181 RHRRAD 186 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 143 bits (361), Expect = 9e-33, Method: Compositional matrix adjust. Identities = 108/364 (29%), Positives = 185/364 (50%), Gaps = 46/364 (12%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERL-EWLKKY 61 I LL+ ++ PD R V+H L+A+L LT CAV+AGA + ++ E E L++ Sbjct: 39 IPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLERL 98 Query: 62 GDFDNGI------PVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI--IAIDGKTI 113 G + + P + TI RV++ ID+ A ++ W+ C + G + +A+DGK++ Sbjct: 99 GIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWL-ACRQQDAGGLRALAVDGKSL 157 Query: 114 RGSF-DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY-LKKNLITI 171 RG+ KG+R +H+++A + G+VL Q+ K+NEIT LL+ L L ++T Sbjct: 158 RGAARAKGRR---VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTS 214 Query: 172 DAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFE----EKFPVNVFSNYKGDSFSTQE 227 DA+ Q D A+ ++ + Y++ VK N KL + ++ P+ + G Sbjct: 215 DALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKSLPWQQIPLQDRTRTTG------- 267 Query: 228 ISHGRKETRLHIVSNVTRLNFCDF-EFEWKGLKKLCVALSFRQKKEDKSAEGVSIR--YY 284 HGR E R RL C + G ++ A+ +++ +++ VS++ Y Sbjct: 268 --HGRCEIR--------RLKVCTVNNLLFPGARQ---AVQIVRRRVNRTTGKVSLKTIYA 314 Query: 285 ISS---KDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMAL 341 ++S + A IR HW +E +LH V DV EDAS++R GNA + ++ + +A+ Sbjct: 315 VTSLAAEQAPPARVAQLIRGHWTVE-ALHHVRDVTFAEDASQLRSGNAPQAMATYRNLAI 373 Query: 342 NLLR 345 LR Sbjct: 374 GALR 377 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 109/360 (30%), Positives = 165/360 (45%), Gaps = 64/360 (17%) Query: 15 DIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTI 74 D RQ+ KV H+ I+ + V A W E+ DF ER+++++K+ P DT+ Sbjct: 29 DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFFPDIQKAPSHDTL 88 Query: 75 ARVVSNIDSLAFEKMFIEWMQECHE----------ITDG----------EIIAIDGKTIR 114 R + A E+ + W E + +G IAIDGKTI+ Sbjct: 89 RRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKPFRQIAIDGKTIK 148 Query: 115 GSFDKGKRKGA--------------IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLN 160 + ++ +R+ +H+VSAFS ++ + LGQ + + K NEI AIP LL+ Sbjct: 149 KAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKKENEIVAIPRLLD 208 Query: 161 LLYLKK-NLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHA-------FEE-KFP 211 L + + +++TIDAMG QKDI S+I K+A YLL VK NQ L FE P Sbjct: 209 DLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIAGNMRDFERIPLP 268 Query: 212 VNVFSNYKGDSFSTQEISHG-------RKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVA 264 V+ +K E HG R + LH + + + +W+ L+ + Sbjct: 269 NEVYKVHK-----EGENGHGFVFLRECRVCSSLHSLGKIYK--------DWENLRSYGLI 315 Query: 265 LSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRI 324 + R E V Y+ISS + D ++ R HW IE+ LHW LD+ ED R+ Sbjct: 316 RTERV-DEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 140 bits (354), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 83/193 (43%), Positives = 115/193 (59%), Gaps = 6/193 (3%) Query: 136 NGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAV 195 +VLGQ K KSNEITAIP L+ +L ++ ++ITIDAMGCQK+I S I+ KK DY++ + Sbjct: 31 QNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYIITL 90 Query: 196 KGNQGKLHHAFEEKFPVNVFSNYKGDSFS-TQEIS--HGRKETRLHIVSNVTRLNFCDFE 252 K NQ L +E F + +K S QEI H R E R I +V+ L + Sbjct: 91 KANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSSLPCLHNQ 150 Query: 253 FEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWV 312 W LK + + S R+ + E +R+YISS + ++++ A AIR+HW IE+SLHW Sbjct: 151 DLWTELKTVVMVKSERRLWNKTTTE---VRFYISSVEKNSQKIATAIRSHWEIENSLHWT 207 Query: 313 LDVKMNEDASRIR 325 LDV +ED SRIR Sbjct: 208 LDVTFSEDKSRIR 220 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 139 bits (351), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 113/378 (29%), Positives = 183/378 (48%), Gaps = 49/378 (12%) Query: 30 LFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKM 89 + +T+ V W +I DF + ++L+++ P DT+ R I + E Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FIEW----------MQEC----------HEITDGEIIAIDGKTIRGSFDKGK-------- 121 + EW +++C +++ IAIDGKTI G+ + K Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKK-NLITIDAM 174 +H+VSAF ++ + LGQ + K NEI AIP+LL+ + +++ +++TIDA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEIS---HG 231 G QK I KI +K+ADYLL VK N KL E + S + D E + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 RKETRLHI-VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRY-YISSKD 289 TR I S +RL FC +WK L+ + + +K + E + ++ +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFC--YRDWKNLRTYGIIKT--EKINIATGEIQNEKHCFISSLV 296 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 + + R HW +E+ LHW LDV NED R + N+A+ S + KMAL +L++ +D Sbjct: 297 NNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQD 355 Query: 350 IKGEEEKKEGCVKHRERS 367 E+KK + R+++ Sbjct: 356 ----EDKKTSVNRKRKKA 369 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 97/312 (31%), Positives = 157/312 (50%), Gaps = 16/312 (5%) Query: 40 GADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE 99 GA EI +F R LK+ +G P DT +R+ ID + ++ + Sbjct: 37 GAKNCVEIAEFVEGREAELKEIVTLRHGCPSHDTFSRIFRLIDPDELARALGAFLAALRQ 96 Query: 100 -----ITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITA 154 ++A+DGK +R ++KG+ MVS + E + + + E S+E+ A Sbjct: 97 GLGLGPRPRGVVAVDGKALRRGYEKGRAFMPPVMVSVWDAETRLSVATKRAEG-SDEVAA 155 Query: 155 IPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV 214 LL + LK ++T DA+ C+ D A + +KA Y LA+K N+G+L E F V Sbjct: 156 TLALLKSIDLKGCIVTADALHCRPDTAKALIGRKAHYALALKANRGRLFACAEAGF---V 212 Query: 215 FSNYKGD-SF-STQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKE 272 ++ GD +F T+E HGR ETR ++V L + GLK + + RQ + Sbjct: 213 AADAAGDLAFHETRETGHGRLETRR---ASVLPLKAFKQAPAFPGLKAIGRIQATRQGAD 269 Query: 273 DKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEI 332 ++ S+RY SK + + A +RAHW IE+ LHW LDV +ED +R R+ NA + Sbjct: 270 GRAV--TSVRYIALSKVLAPHKLAEVVRAHWTIENQLHWSLDVVFHEDDARSRKDNAPQN 327 Query: 333 ISGIKKMALNLL 344 ++ I+++A ++L Sbjct: 328 LAVIRRLARDIL 339 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 132 bits (331), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 79/206 (38%), Positives = 112/206 (54%), Gaps = 8/206 (3%) Query: 143 VKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKL 202 + TE KSNEITAIP LL L KK ++TIDAMGCQKDIA I D+++AVK NQ KL Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 HHAFE---EKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLK 259 A EK + ++ T HGR++ R H V+ V E+ W +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPGFAAKGEWPW--IK 118 Query: 260 KLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNE 319 + A+ + ++ +RYY+ S+ + K F +R HW IE S+HWVLDV E Sbjct: 119 AIGTAVRITTHADGTQSD--EVRYYMLSRFLSGKRFGEVVRGHWGIE-SMHWVLDVTFGE 175 Query: 320 DASRIRRGNAAEIISGIKKMALNLLR 345 D +R R+ A +S +++ A+ LL+ Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLK 201 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 70/133 (52%), Positives = 89/133 (66%), Gaps = 3/133 (2%) Query: 103 GEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL 162 G+IIA+DGKT+RGS+D+ K AIHMVSA+S N +VLGQ+KTE KSNE TAIP+L LL Sbjct: 7 GDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKLFTLL 66 Query: 163 YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF---PVNVFSNYK 219 L+ +TIDA+G Q+DIA +I DK ADYLL VK NQ LH + + F+ Sbjct: 67 ALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGFTEDF 126 Query: 220 GDSFSTQEISHGR 232 DS + + HGR Sbjct: 127 TDSVTEEGDKHGR 139 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 130 bits (326), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 106/347 (30%), Positives = 173/347 (49%), Gaps = 18/347 (5%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFG-HERLEWLKKYGD 63 SL++ + D R+ +H L +L + + + G ++E+ +F + R +++ Sbjct: 3 SLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEFNI 62 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEW-MQECHEITDGEIIAIDGKTIRGSF--DKG 120 +P TI RV+ ++ KMF EW ++E + D + +DGK+++ + Sbjct: 63 IPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNPNN 122 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTE-AKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 +++ I VS FS E+G+VL + E K +EI ++ L+ + T DA+ CQK Sbjct: 123 EQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQKK 182 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDS-FSTQEISHGRKETRLH 238 S I K DY++ VKGNQ L+ ++ + ++ K +S F Q+ SHGRK +R Sbjct: 183 TISLIAKTKNDYVITVKGNQKNLYKRIQD-----LSNSSKPESCFLEQDNSHGRKISRKI 237 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHA 298 V V + FE L+++ + + + + DK+ E + YYISS A+ FA Sbjct: 238 EVFKVRKNERQGFE----NLRRV-IKVERKGSRGDKTYEETA--YYISSLTESAQVFAKI 290 Query: 299 IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 IR HW IE+ LHWV DV ED S I AA S + + LNL R Sbjct: 291 IRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFR 337 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 92/254 (36%), Positives = 130/254 (51%), Gaps = 24/254 (9%) Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWM-QECHEITDGEIIAIDGKTIRGS------FDKG 120 IP DT R S I FE +F W+ Q C E+ ++AIDGK +RG G Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG--VVAIDGKLMRGPSQCDGEHTTG 61 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 K + MVSA+S NG+ LGQVK + KSNEITAIP L+N L L ++TIDAMGCQKDI Sbjct: 62 KEGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDI 121 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG--------DSFSTQEISHGR 232 I + A+Y++A+K N+ K + ++ + +Y+ ++ HGR Sbjct: 122 TQTIIEHDANYIIAIKENKKKNYQLAKQ-----IIDDYQDKDEIINRVTRHVSENTGHGR 176 Query: 233 KETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD-MD 291 ETR V + + F+ + GLK + V + + +RYY++S D Sbjct: 177 IETRTCTVVSYGSIMEKMFKKKLVGLKSI-VGIKSERTIVATGEYTQEVRYYVTSLDNTK 235 Query: 292 AKEFAHAIRAHWLI 305 +E A AIR HW I Sbjct: 236 PEEIASAIRQHWSI 249 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 98/352 (27%), Positives = 175/352 (49%), Gaps = 26/352 (7%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + LL + PD R + +++L +++ + +CAV AGA + I D+ + + Sbjct: 43 MPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQRC 102 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQ-ECHEITDGEIIAIDGKTIRGSFDKGK 121 +P + TI +V +D A +++ + T +A+DGKTIRG+ + Sbjct: 103 GIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RIG 160 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 ++ A H+V+A ++ + VVLGQ +T KSNEI + LL + + ++T+DAM QK A Sbjct: 161 KQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKATA 220 Query: 182 SKIKDK-KADYLLAVKGNQGKLHHAFE----EKFPVNVFSNYKGDSFSTQEISHGRKETR 236 ++++ +A+Y++ VK NQ L E+ PV V+S+ E HGR+E R Sbjct: 221 RCLREQCRAEYVMIVKANQPGLLARVRDQPWEQVPV-VWSD-------PVERGHGREEHR 272 Query: 237 LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISS---KDMDAK 293 + + V R F + + + + R++ A + Y I S + K Sbjct: 273 SYKILTVAR----GLRFPY---AQQVIQIIRRRRVLGAGAWSTEVVYAICSLPCEQAPPK 325 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 A IR HW IE+ +H+V DV +ED S +R G+ ++++ ++ + + L R Sbjct: 326 LLASWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHR 377 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 124 bits (312), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 104/372 (27%), Positives = 169/372 (45%), Gaps = 41/372 (11%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDF-GHERLEWLKKY 61 +Q L D ++ PD R ++H+L IL L+ AV AG +EI + H + L Sbjct: 39 VQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTAL 98 Query: 62 GDFDNGI------PVDDTIARVVSNIDSLAFEK---MFIEWMQECHEITDGEIIAIDGKT 112 G + + P DT+ RV+S +DS A + MF ++A+DGKT Sbjct: 99 GARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGKT 158 Query: 113 IRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY----LKKNL 168 +RG+ G A H+++ + GVVL + + AK+NE+TA LL L+ L + Sbjct: 159 LRGA--AGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGVV 216 Query: 169 ITIDAMGCQKDIASKI-KDKKADYLLAVKGNQGKL----HHAFE-EKFPVNVFSNYKGDS 222 +T DA+ + A I + A ++ VK N L H A + K P+ Sbjct: 217 VTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPIG--------- 267 Query: 223 FSTQEISHGRKETR---LHIVSNVTRLNFCDFEFEW---KGLKKLCVALSFRQKKEDKSA 276 S + +HGR E R L S R + + +++ + R + Sbjct: 268 HSAEGRAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIP 327 Query: 277 EGVSIRYYISSKDMDA---KEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEII 333 V++ + ++S +DA + A R HW IE+ +HWV DV EDASR+R G I+ Sbjct: 328 STVTV-HVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIM 386 Query: 334 SGIKKMALNLLR 345 + ++ + + L+R Sbjct: 387 TTLRNLIIGLIR 398 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 122 bits (305), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 79/194 (40%), Positives = 104/194 (53%), Gaps = 6/194 (3%) Query: 154 AIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN 213 AIPELL L L+ +TIDA+G Q IA I + ADY+LAVK NQ +L + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 VFSNYKGDSF--STQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKK 271 +G + + + HGR ETR+ VS W GL++L + RQ Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAW-LASTGQHWAGLQRLVMLERTRQIG 119 Query: 272 EDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 + + E YYISSK + A + A IRAHW IE+ LHWVLDV EDAS IR AA Sbjct: 120 QKVTTERC---YYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 IISGIKKMALNLLR 345 ++ ++K+ LNL R Sbjct: 177 NMASLRKITLNLAR 190 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 120 bits (300), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 104/361 (28%), Positives = 167/361 (46%), Gaps = 41/361 (11%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDF----GHERLEWLKK 60 SL+ ++ PD R V H L A+L V AV+ GA + ++ + L L Sbjct: 28 SLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELGV 87 Query: 61 YGDFDNGI---PVDDTIARVVSNIDSLAFEKMFIEWMQECHE--ITDGEIIAIDGKTIRG 115 + D G+ P + T R+++ +D+ A + W+ C T + ++DGKT+RG Sbjct: 88 FRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLRG 147 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 S G++ +H+++ G VLGQV + K+NE+T LL L L ++T DA+ Sbjct: 148 SGPAGEQ---VHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADALH 204 Query: 176 CQKDIASKIKD-KKADYLLAVKGNQGKLHHAFE----EKFPVNVFSNYKGDSFSTQEISH 230 Q++ A + D KKA Y+ VK NQ +L+ + K P+ ++ +G H Sbjct: 205 TQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKTLPWTKIPIQDETSTRG---------H 255 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEG--VSIRYY---- 284 GR + R T DF V +++ A G ++ Y Sbjct: 256 GRYDIRRLQAVTCTGPLALDFPH--------AVQALRIRRRRLNLATGRWSTVTVYAITN 307 Query: 285 ISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 +S+ E A +R HW IE +LH + D EDASR+R GNA ++ ++ A+NLL Sbjct: 308 LSAAQAGPAELADWLRGHWAIE-TLHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLL 366 Query: 345 R 345 R Sbjct: 367 R 367 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 117 bits (292), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 66/139 (47%), Positives = 86/139 (61%), Gaps = 4/139 (2%) Query: 106 IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLK 165 +AIDGK +RGS D + IH+VSA+S+ + LGQV+T KSNEITAIPELL L ++ Sbjct: 1 MAIDGKCLRGSHDGARSP--IHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGD--SF 223 + ITIDAMGCQ DIA +I + ADY+L VKGNQ L A + F + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 STQEISHGRKETRLHIVSN 242 S + +HGR ETR + +N Sbjct: 119 SQTDKNHGRIETRRCVATN 137 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 98/361 (27%), Positives = 164/361 (45%), Gaps = 35/361 (9%) Query: 3 IQSLLDYISVTPDIRQ-QGKVKHKLSAILFLTVCAVIAGADEWQEI----EDFGHERLEW 57 I LL + D R+ +GK+ + LS +L + A +AGA +EI DFG + L Sbjct: 22 ISGLLAMLGGITDPRKARGKI-YSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKKYGDFDNG---IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE--IIAIDGKT 112 L D G P + I + +D A + F W+ GE ++A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 IRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELL-NLLYLKKNLI-T 170 +RG++ +G ++ + ++SA + G+V GQV+ +NEIT + LL NL + ++ T Sbjct: 141 LRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 IDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHH-AFEEKFPVNVFSNYKGDSFSTQEIS 229 +DA+ Q + A + + DY L VKGNQ L+ FE+ P+ K +E Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYRKTFEQTLPLL----QKPPQHEVEERG 254 Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYY----- 284 HGR + + E + G ++ A R+ + D VS Y Sbjct: 255 HGR----------IKKWQAWTTEAKGIGFPEVATAAVIRRDEFDLKGIRVSREYAHILTS 304 Query: 285 ISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 ++ A IR HW IE+ +H+ D EDA++ GN+ ++ + +A+ ++ Sbjct: 305 VAGNRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGII 364 Query: 345 R 345 R Sbjct: 365 R 365 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 91/338 (26%), Positives = 147/338 (43%), Gaps = 52/338 (15%) Query: 55 LEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIR 114 LE L+K+ GI TI R++ ID F+EW+ E + + + A+DGK + Sbjct: 27 LEELRKHMKLKYGIASPSTITRMLCGIDEELALYAFMEWVGEIVDSRNTHL-AVDGKALC 85 Query: 115 GSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAM 174 G+ +K K + +++ G++L Q+ ++K+NEIT IPELL LL + +++TIDA+ Sbjct: 86 GATEKTKGETTPMLLNVVETVRGLMLAQLPVDSKTNEITVIPELLKLLDISGSIVTIDAV 145 Query: 175 GCQKDIASKIKDKKADYLLAVKGNQGKLH---HAFEEKFPVNVFSNYKGDSFST------ 225 G Q I +I ++ + L VK NQ + + H F +K KG+ + Sbjct: 146 GTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHTFMDKLEAADVQRKKGEVLDSGMREYL 205 Query: 226 ---QEI-----SHGRKETR-LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFR-------- 268 +EI + R E R I + + N + EW ++ + R Sbjct: 206 EKYEEIIRIEKNRDRNEYRTCQICKDAS--NLTKSQKEWPHVQSIGRIKQVRIPSEKDSH 263 Query: 269 ---------------------QKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEH 307 E+ + + V IS + A+E R HW IE+ Sbjct: 264 GNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTALISDLILTAEELGSIKRMHWSIEN 323 Query: 308 SLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 LH VLD ED S ++ +S I+K A N+LR Sbjct: 324 RLHHVLDDTFREDRSPAKKSRNN--LSLIRKYAYNILR 359 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 68/225 (30%), Positives = 115/225 (51%), Gaps = 17/225 (7%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 IQ L+D +S T D R++ ++H +++ VCA+++GA + + ++ LKK Sbjct: 222 IQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKRL 281 Query: 63 DFDNG-------IPVDDTIARVVSNIDSLAFEKMFIEW----MQECHEITDGEIIAIDGK 111 F P + T+ R + +ID L +++ W + +C D +++IDGK Sbjct: 282 GFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDGK 341 Query: 112 TIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITI 171 +RG+ K K IH ++AF G+V+ Q + K+NEI + LL + ++ ++T Sbjct: 342 AVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVTA 400 Query: 172 DAMGCQKDIASKI-KDKKADYLLAVKGNQGKLHHAFE----EKFP 211 DA+ Q + A I +DKKADY+ VK NQ + E E FP Sbjct: 401 DALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIESLPWEAFP 445 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 102 bits (253), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 60/158 (37%), Positives = 91/158 (57%), Gaps = 3/158 (1%) Query: 53 ERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKT 112 ER L+ + NG P DT RV+ I+ + + +E +G+ IAIDGK Sbjct: 7 ERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHIAIDGKR 66 Query: 113 IRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITID 172 ++GS K+ G+ H++SA+ +E G+ L Q K NE+ AIPE+L+ L L +I+ID Sbjct: 67 LKGS---KKKTGSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSGAVISID 123 Query: 173 AMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF 210 AMG Q +IA +I +ADY+L++KGNQ L+ + F Sbjct: 124 AMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 100 bits (249), Expect = 9e-20, Method: Compositional matrix adjust. Identities = 60/140 (42%), Positives = 80/140 (57%), Gaps = 9/140 (6%) Query: 103 GEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL 162 G +IAI+GK++RG+ A+H VSA++ G+ LGQ+ + KSNEITAI ELL L Sbjct: 3 GLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLPTL 62 Query: 163 YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDS 222 L+ ++TIDA+GCQ +A +I DY+LAVK NQ L HA + F GD Sbjct: 63 ALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGT---LGAPGDP 119 Query: 223 F------STQEISHGRKETR 236 T + HGR ETR Sbjct: 120 VRQTCVHETLDKGHGRIETR 139 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 84/266 (31%), Positives = 122/266 (45%), Gaps = 35/266 (13%) Query: 106 IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLK 165 IA+DGK +RG+ + A H+VS F++ +VLGQ+ KSNEI + LL LL Sbjct: 83 IALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLAVAEKSNEIPCVCALLTLLPGS 140 Query: 166 -KNLITIDAMGCQKDIASKI-KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSF 223 + L+T+DAM Q A I K+ YL+ VK NQ K+ V + DS Sbjct: 141 LRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR 200 Query: 224 STQEISHGRKETR-LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIR 282 HGR ETR L I++ + F K + ++ + V + Sbjct: 201 G-----HGRVETRTLQIITAARGIGF--------PYAKQIIRITRERLITATDQRSVEVV 247 Query: 283 YYISSKDMDAKEFAHA--------IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIIS 334 Y I S F HA +R H IE+SLHW+ DV +ED R GN A++++ Sbjct: 248 YAICSL-----PFEHARPTAIMTWMRQHCRIENSLHWIRDVTFDEDRQRAHTGNGAQVLA 302 Query: 335 GIKKMALNLLRDCKDIKGEEEKKEGC 360 ++ A+NL R + G + E C Sbjct: 303 TLRNTAINLHR----LNGADNIAEAC 324 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 72/210 (34%), Positives = 105/210 (50%), Gaps = 17/210 (8%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M +SL + +S PD R + H L A+L L A++ G Q I FG + L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFDNG-IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI-------IAIDGKT 112 F G P T++R + D E W+ DG + IA+DGKT Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWL-------DGRVGPVARTHIALDGKT 113 Query: 113 IRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITID 172 +RGS D G+ G H+V+A++ VL QV+ +AK+NE A LL +L + +++T D Sbjct: 114 LRGSRD-GQVPGQ-HLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGD 171 Query: 173 AMGCQKDIASKIKDKKADYLLAVKGNQGKL 202 AM CQ+D+A+ + ADY+L K NQ L Sbjct: 172 AMFCQRDVAAAVIAGGADYVLVAKDNQPGL 201 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 97.8 bits (242), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 56/160 (35%), Positives = 86/160 (53%), Gaps = 19/160 (11%) Query: 190 DYLLAVKGNQGKLHHAF----EEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTR 245 DY++AVKGNQ +LH E++ PV++ +I+ R+ R+ S Sbjct: 15 DYVIAVKGNQKRLHEQIKLTTEQRLPVSL------------DITTERRSDRITTRSVSVF 62 Query: 246 LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 + ++W+GL++L F + + I YYISS ++A +FA IR HW I Sbjct: 63 DDLSGISYDWEGLQRLVKVERFGTRAGKPYHQ---IVYYISSLTINAAQFAQGIRGHWGI 119 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 E+ LHWV DV ++ED SR+R+GNA S I+ + L +LR Sbjct: 120 ENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILR 159 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 97.8 bits (242), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 72/234 (30%), Positives = 118/234 (50%), Gaps = 15/234 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL++ ++ PD R + ++ L +L L + AV+ G + I FG R + L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 DNG-IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-EIIAIDGKTIRGSFDKGKR 122 NG +P +TIA ++ +D + + W+++ H DG E +A+DGK + GS D G+ Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP--DGWEHLALDGKRLCGSRD-GQV 120 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY-LKKNLITIDAMGCQKDIA 181 G H+++A++ + V+ Q+ EA +NE A LL +L L ++T DA+ Q D+ Sbjct: 121 PG-THLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVC 179 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFS---TQEISHGR 232 + ++ K D +L K NQG L E F+ G FS T + GR Sbjct: 180 AAVQHKGGDSILYAKSNQGTLRADLEA-----AFATAAGGDFSPRVTGRVGSGR 228 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 84/304 (27%), Positives = 133/304 (43%), Gaps = 34/304 (11%) Query: 50 FGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAID 109 FG + +WLK GI T + V ++ +AFE + +Q Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLI 169 K S + + +V ++ G+V+GQ +T NE+ + L LL L+ ++ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQ-QTAPGRNEVQGALDALALLSLEGAIV 149 Query: 170 TIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF----PVNVFSNYKGDSFST 225 T DA+ C+ D A I DY LA+K NQ L + P+ V + + D Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQTAAEND---- 205 Query: 226 QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 H R E R + V ++F GL+ + + + + + V RY++ Sbjct: 206 ----HDRCERRRACIVAVNDIDF-------PGLQAIGSVEATSRHADGRLTSHV--RYFL 252 Query: 286 SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 S M A R HW IE+ LHWVLDV+ EDA+R R+ + I+ ++K+ALNL+R Sbjct: 253 LSTIMSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIR 312 Query: 346 DCKD 349 D Sbjct: 313 AHPD 316 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 62/147 (42%), Positives = 81/147 (55%), Gaps = 9/147 (6%) Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 G H+VSA++ E+GV LG V TE KSNEITAI LL L KK ++TIDAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IKDKKADYLLAVKGNQGKLHHAFE---EKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 I D++LAV+ NQ KL A EK + + T HGR++ R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 SNVTRLNFCDF--EFEWKGLKKLCVAL 265 + V DF + EW +K + A+ Sbjct: 122 AQVP----PDFAAKGEWPWIKAIGTAV 144 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 62/171 (36%), Positives = 88/171 (51%), Gaps = 19/171 (11%) Query: 128 MVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDK 187 MVS ++ N +VLGQVK SNEITAIPELL +L L ++ I A+ C KDI I + Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 KADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 ADY++ +K NQG L+ + E+ F + F + ++ +E HG E R Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIR-------- 112 Query: 245 RLNF---CDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 NF D + W LK + + Q +DK+ V RY+ISS D + Sbjct: 113 --NFGFQLDPDSVWSNLKSVGMVEPIGQ-VDDKTT--VETRYFISSLDSNG 158 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 92.4 bits (228), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 95/369 (25%), Positives = 148/369 (40%), Gaps = 52/369 (14%) Query: 10 ISVTPDIRQQGKVKHKLSAILFLTVCAV-------IAGADEWQEIEDFGHE---RLEWLK 59 ++ PD R + + L + + +CAV +A EW + RL W Sbjct: 28 LAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLRLPWNP 87 Query: 60 KYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIE--WMQECHEITDG-------------- 103 G +P + TI R ++ +D A ++ ++TD Sbjct: 88 WDGHL---LPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAGDQA 144 Query: 104 ---EIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLN 160 A+DGKT RG+ K +H++ ++ G +LGQ + +AKSNE T LL Sbjct: 145 VPVRAYAVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRALLA 202 Query: 161 LLYLKKNLITIDAM-GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYK 219 L L ++ DA+ + ++ + K A YL K NQ KL AF P Sbjct: 203 PLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKL-RAFLAALPWTEIPTAD 261 Query: 220 GDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGV 279 T++ HGR+ETR V+ VT L+F + + + RQK + S E + Sbjct: 262 ----LTRDRGHGREETRTLKVATVTHLDFP------HAAQAIRIRRWRRQKGQPASHETI 311 Query: 280 SIRYYISSKDMDAKE---FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGI 336 Y I+ D A R W IE H+V DV ED+S R G +++ Sbjct: 312 ---YAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLALF 368 Query: 337 KKMALNLLR 345 + + LR Sbjct: 369 RATVADTLR 377 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 92.0 bits (227), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 62/146 (42%), Positives = 78/146 (53%), Gaps = 9/146 (6%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHA----FEEKFPVNVFSNYKGDSFSTQEIS 229 MGCQK+IA I +++ADY+ AVK NQ LH A FEE N F +Y D T S Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEAN-FESYNIDFAETYNKS 59 Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD 289 HGR E+R V L D W+GL+ + + S R KE + E RYYISS Sbjct: 60 HGRIESRRCWVG-YDALPLTDDSQNWEGLQTIVMVESERTLKEKTTIEH---RYYISSTM 115 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDV 315 A ++ R HW IE+SLHW LD+ Sbjct: 116 ATAAYLLNSSREHWGIENSLHWRLDI 141 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 90.9 bits (224), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 59/183 (32%), Positives = 98/183 (53%), Gaps = 6/183 (3%) Query: 1 MSIQSLLDYISVTPDIRQ-QGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLK 59 + +LL + PD R+ QGK ++ L +L TV A+++GA ++ I F R E L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGK-RYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLT 69 Query: 60 KYGDFD-NGIPVDDTIARVVSNIDSLAFEKMF---IEWMQECHEITDGEIIAIDGKTIRG 115 + D PV +T+ V+ ++D+ E F + + E+ + ++A+DGKT+RG Sbjct: 70 HHFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRG 129 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 SFD + A ++AF + + +VL + + KSNEI A +++ L L + T DAM Sbjct: 130 SFDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMH 189 Query: 176 CQK 178 CQK Sbjct: 190 CQK 192 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 90.5 bits (223), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 49/127 (38%), Positives = 73/127 (57%), Gaps = 1/127 (0%) Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKET 235 + ++ KI +K DYLLAVKGNQG L AF++ F ++ +N + ++T+E S GR E+ Sbjct: 12 VRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHES 71 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 R VS+ + D EW GLK + +S +KE + +RYYISSK ++A+E Sbjct: 72 RAAFVSHDLSV-LGDISDEWPGLKSMAFVVSMNSEKEVAEEADIYVRYYISSKQLNAEEL 130 Query: 296 AHAIRAH 302 A R H Sbjct: 131 LTASRLH 137 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 87.8 bits (216), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 108/421 (25%), Positives = 180/421 (42%), Gaps = 96/421 (22%) Query: 8 DYISVTPDIRQQGKVKHKLSAILFLTVCAV-------IAGADEW------QEIEDFGHER 54 ++ SVT D R V++++S++L L VCA+ I A EW +E+ FG Sbjct: 37 EFESVT-DPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELAAFG--- 92 Query: 55 LEWLKKYGDFDNGIPVDDTIARVVSNID-----SLAFEKMFIEWMQECHE----ITDGEI 105 L + G + IP + T+ V+ +D + ++ + H + DG I Sbjct: 93 LPYHPLRGRYR--IPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGGI 150 Query: 106 -----------------------IAIDGKTIRGS-FDKGKRKGAIHMVSAFSNENGVVLG 141 IA+DGK +R + G R + ++SA + +G+ L Sbjct: 151 EREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR---VFVLSAVRHGDGITLA 207 Query: 142 QVKTEAKSNEITAIPEL------LNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAV 195 + AK+NEI PE L+ LK ++T DA+ Q+D A+ + ++ A YLL + Sbjct: 208 SREIGAKTNEI---PEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTI 264 Query: 196 KGNQG----KLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDF 251 K NQ +LH ++ PV + +G HGR E RL V V L F Sbjct: 265 KNNQRGQARQLHALPWKEIPVIHRDDARG---------HGRHEQRLVQVVTVNGLLF--- 312 Query: 252 EFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHA-----IRAHWLIE 306 L ++++ A+ S + D+ A+E + A R HW +E Sbjct: 313 -------PHAAQVLRIQRRRRLYGAKKWSSETVYAITDLPAEEASAAEIASWARGHWTVE 365 Query: 307 HSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRER 366 +++HW DV NED S++R N +++ ++ +L+R + G G H ER Sbjct: 366 NTVHWCRDVTFNEDKSQVRTHNTPSVLAAVR----DLIRGALKLAGYVNTAAGRRAHTER 421 Query: 367 S 367 + Sbjct: 422 T 422 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 44/88 (50%), Positives = 61/88 (69%), Gaps = 1/88 (1%) Query: 265 LSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRI 324 LSF ++ K E ++ RYY S D+ A++FA A R HW +E+ LHW LDV MN+D +I Sbjct: 13 LSFNNTEQKKEPE-MTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKI 71 Query: 325 RRGNAAEIISGIKKMALNLLRDCKDIKG 352 RRGNAAE+ SGI+K+A+N+L K +K Sbjct: 72 RRGNAAELFSGIRKIAINILTKDKILKA 99 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 40/79 (50%), Positives = 57/79 (72%) Query: 279 VSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKK 338 +++RYYISS D A++F AIR HW +E++L+W LDV MNED +IRRGNAAE SGI+ Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 MALNLLRDCKDIKGEEEKK 357 +A+N+L + + K +K Sbjct: 61 IAINILTNNQVFKARSRRK 79 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 84.7 bits (208), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 56/160 (35%), Positives = 87/160 (54%), Gaps = 8/160 (5%) Query: 187 KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL-HIVSNVTR 245 KK DYLL VKGNQ KL A E F ++ D + E HGR ++ ++S Sbjct: 5 KKGDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLSAKGI 63 Query: 246 LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 +N D W + S R E +S + YYI+S+ + A++ A ++RA W + Sbjct: 64 INPGD----WPNCVTIGRIDSMRVVDEKES--DLERCYYITSRALTAEQLAASVRARWGV 117 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 E+ HW+LDV +EDAS + + NA + +S ++K+ALN++R Sbjct: 118 ENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIR 157 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 87/344 (25%), Positives = 146/344 (42%), Gaps = 31/344 (9%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGAD------EWQEIEDFGHERLE 56 + L+ + PD R V+++L+ +L L V IAG D EW G Sbjct: 25 VAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGV---- 80 Query: 57 WLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE--ITDGEIIAI--DGKT 112 L G F +P + TI R+V ++ W G ++A+ DGK Sbjct: 81 -LAGLG-FPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKV 138 Query: 113 IRGSFDKGKRKGAIH---MVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLI 169 ++G+ + +G++ +V A ++ G LG + A +EI ++ L+N + L+ Sbjct: 139 MKGARSRPP-QGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLV 196 Query: 170 TIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEIS 229 T D + + +A I+ K +L ++KGNQ + A P + F G+ T+E + Sbjct: 197 TTDCLHAHEPLARAIRAKGGHWLFSIKGNQPTV-RAKLAGLPWDEF----GNQHVTREKA 251 Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD 289 HGR E R + + F + + KL ++ +A Y ++S Sbjct: 252 HGRIEERALKALTPSAPSLVGFRGT-RQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLS 310 Query: 290 MD---AKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAA 330 D + A R HW +E ++H V D M+ED IR NAA Sbjct: 311 TDQASPAQLARWARGHWTVE-AIHHVRDRTMDEDRHTIRTKNAA 353 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 73/265 (27%), Positives = 122/265 (46%), Gaps = 35/265 (13%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL+ ++ PD R++ V+++ +A+L + VCA+++GA + I ++ + + Sbjct: 50 ALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAGLGL 109 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE-------------IIAIDGK 111 +P TI RV+ +D A E W+Q + D ++A+DGK Sbjct: 110 TGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAVDGK 169 Query: 112 TIRGSFDKGKRKG--AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY-LKKNL 168 +R + R G +H++ + GVVL QV + K+NEI +L+ + L L Sbjct: 170 AMRAT-----RHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDVL 224 Query: 169 ITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFE----EKFPVNVFSNYKGDSFS 224 IT+DAM Q A + + A L+ VK NQ +H + + PV + +G Sbjct: 225 ITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKTLPWKDVPVGHTTTGRG---- 280 Query: 225 TQEISHGRKETR-LHIVSNVTRLNF 248 HGR ETR L V+ L F Sbjct: 281 -----HGRIETRTLKAVTVPAGLGF 300 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 65/211 (30%), Positives = 95/211 (45%), Gaps = 9/211 (4%) Query: 40 GADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE 99 GA E+ +F R E L++ +G P DT +RV +D E+ F +M Sbjct: 37 GAKTCVEMAEFSEARQEELREIVALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRG 96 Query: 100 I----TDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAI 155 ++AIDGK++R +DKG+ MVS + E + ++ +EI A Sbjct: 97 ALGLPAPKGVVAIDGKSLRRGYDKGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKAT 155 Query: 156 PELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVF 215 +L L LK +T DA+ C +A + KA Y L +K N G L A E F Sbjct: 156 LSVLKALTLKGCTVTADALHCHPAMAQALLAAKAQYALGLKANHGPLFRAAEAGF--AAV 213 Query: 216 SNYKGDSFSTQEISHGRKETRLHIVSNVTRL 246 ++ F T+E HGR+E R V V RL Sbjct: 214 TDLA--VFETRERGHGREEQRRASVLPVDRL 242 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 80.9 bits (198), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 56/190 (29%), Positives = 93/190 (48%), Gaps = 14/190 (7%) Query: 155 IPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV 214 + +L +KK + T+DA+ CQK I K+ Y++ VK NQ L A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIED-----T 56 Query: 215 FSNYKGDSFSTQEISHGRKE-TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED 273 N +++S + HG + RL I + + +W GL++ +S R++ Sbjct: 57 AKNSPLNAWSWTQKGHGHESHCRLKIWEATESM-----KMQWAGLERF---ISIRRQGFR 108 Query: 274 KSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEII 333 + S Y+I+S+ + + A IR H IE++LHW DV +NED IR + A I+ Sbjct: 109 HHKKFDSTTYHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAIL 168 Query: 334 SGIKKMALNL 343 ++ +A NL Sbjct: 169 GILRNIAFNL 178 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 80.9 bits (198), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 60/202 (29%), Positives = 102/202 (50%), Gaps = 14/202 (6%) Query: 14 PDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLE-WLKKYGDFDNG----- 67 PD R + +H L AIL + V AV+ A + + ++ + LK+ N Sbjct: 231 PDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKRIRARFNPRTQRY 290 Query: 68 -IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAI 126 P + T+ RV+ + A + W+ I E +A+DGK ++G+ + + + Sbjct: 291 VAPSEPTLRRVLQGANVTALDAAIGAWLLG---IAGFEAVAVDGKVLKGAVREDGSQ--V 345 Query: 127 HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS-KIK 185 H++SAF + G + Q + K+NEI + LL + ++ ++T DA+ Q+ A ++ Sbjct: 346 HLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADALHTQRKTARFLVE 405 Query: 186 DKKADYLL-AVKGNQGKLHHAF 206 DKKADYL AVKGNQ KL ++ Sbjct: 406 DKKADYLFTAVKGNQRKLRNSL 427 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 89/371 (23%), Positives = 150/371 (40%), Gaps = 45/371 (12%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++ LL+ + PD R++ V+ L +L L + AV GA + EI + + L Sbjct: 32 VEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAAF 91 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQE----------CHEITDGE----IIAI 108 P T RV+ D A ++ W Q DG+ +I+ Sbjct: 92 GLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVISA 151 Query: 109 DGKTIRGSFDK-GKRKGAI-HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY--- 163 DGKT+RG+ + G K A +V + +G V+ + +EI A+ ++ L Sbjct: 152 DGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVA-CEPVNDGDEIGAVRTVMGRLADRW 210 Query: 164 --LKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFP---VNVFSNY 218 L ++ DA Q + ++ +LL VK NQ ++ A P V Sbjct: 211 GSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRIL-AKVRALPWAQVRAQDTC 269 Query: 219 KGDSFSTQEISHGRKETR-LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAE 277 +G + HGR ETR + +V T ++ G ++ +++ A Sbjct: 270 RGKA-------HGRAETRTVRVVQAPTHVDLA-----LAGTAQVIKITRHTRRRPHPGAP 317 Query: 278 GVSIR---YYISSKDM---DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 S R Y ++S D A +R+HWLIE+ +HWV D +ED R GN Sbjct: 318 AASTRENAYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPI 377 Query: 332 IISGIKKMALN 342 ++ ++ A+ Sbjct: 378 NLACLRNTAIT 388 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 53/168 (31%), Positives = 82/168 (48%), Gaps = 9/168 (5%) Query: 3 IQSLLDYISVTPDIR--QQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L ++ S PD R ++G ++HKL ++ L + ++ EI +FG L+ +K Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-----EIIAIDGKTIRG 115 NGIP + T+ R+ ID A + + H+ G EI+ IDGK RG Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY 163 + K R I VSA S + L E KSNEI A+P L++ +Y Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKIY 200 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 55/193 (28%), Positives = 98/193 (50%), Gaps = 8/193 (4%) Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHI 239 ++ + +K DY+LA+KGN + ++ F V S +T + HGR E R++ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSPVTSTRS--VHTTFDKGHGRIERRIYT 58 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 + T + + + + EWK L + S +K + E IRY+I+S D K+FA + Sbjct: 59 LD--TNIGWFEDKKEWKHLAGFGMVDSMVTRKGKECRE---IRYFITSV-TDVKQFAKGV 112 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 +HW+IE++LHW LDV +D + NAAE ++ I+++ N ++ + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKRA 172 Query: 360 CVKHRERSSEVHF 372 C+ E +++ F Sbjct: 173 CIYDDEFRAQILF 185 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 78.2 bits (191), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 90/406 (22%), Positives = 159/406 (39%), Gaps = 54/406 (13%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIA-GADEWQEIEDFGHER----LEWLKK 60 L+D ++ D R +H L++IL + CA +A G D IE + L L Sbjct: 30 LIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLHI 89 Query: 61 YGDFDNGI---PVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI------------ 105 + D G+ P + TI RV++ +D + ++ + E Sbjct: 90 WRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRRT 149 Query: 106 ---------------------IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVK 144 A+DGK ++G+ + G +H++S ++ + V Q + Sbjct: 150 EREARRAAHRSPTPAPGLLPAYAVDGKRLKGA--RHPDGGRVHLISLAAHLDATVHAQRQ 207 Query: 145 TEAKSNEITAIPELLNLLY---LKKNLITIDAMGCQKDIAS-KIKDKKADYLLAVKGNQG 200 AKS+EI A+ LL L +IT DA+ Q+ A I++ A Y++ VK NQ Sbjct: 208 IPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQP 267 Query: 201 KLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKK 260 LH +++ + HGR E R+ + ++F ++ L+ Sbjct: 268 TLHATAITAL-TGTDTDFAAVTHRETHRGHGRTEYRILRTAPADGIDFPYAAQVFRVLRH 326 Query: 261 LCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW-LIEHSLHWVLDVKMNE 319 R KE G++ ++++ A +R HW IE+ +H V DV E Sbjct: 327 RGGLDGIRHSKE--VCYGIT---DLTARQAGPAHLAAYVRGHWKAIENGVHHVRDVTFAE 381 Query: 320 DASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 DA + R ++ + +A LR + ++E H+ Sbjct: 382 DACQARTATLPRALAAFRNLATGTLRRAGHVNIAHARREHGYDHQR 427 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 57/160 (35%), Positives = 83/160 (51%), Gaps = 16/160 (10%) Query: 192 LLAVKGNQG----KLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLN 247 +LAVK NQ ++ +A + + D + HGR ETR + L+ Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDK-GHGRIETRRCLA-----LD 54 Query: 248 FC-DFEFE-WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 F FE + W GL+ + + S R+ + + RYY+SS DA AHA+RAHW I Sbjct: 55 FPGPFEPDLWPGLQSIPMVESTREIGDTVT---TGRRYYVSSLPADAVRIAHAVRAHWGI 111 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 E S+HWVLDV NED R R NAA+ + ++++A L+R Sbjct: 112 E-SMHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIR 150 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 77.8 bits (190), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 58/180 (32%), Positives = 89/180 (49%), Gaps = 11/180 (6%) Query: 4 QSLLDYISVTPDIRQ-QGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLK-KY 61 QSLL PD R+ QG++ L +L ++ A+++GA ++ I F H L + Sbjct: 6 QSLL----AIPDHRRAQGRL-FDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGK 121 G P +I + +D A F E +IA+DGKT+RGS D+ + Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARLAE--GAAVIALDGKTLRGSLDRFE 118 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTE--AKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 + A ++SAF+ E +VLGQ+ E K +EI A L+ L L L T+DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 55/196 (28%), Positives = 102/196 (52%), Gaps = 19/196 (9%) Query: 15 DIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLE-WLKKYG-DFDNG----- 67 D R+ ++H ++L + + V+AG ++ I + + + L++ G + G Sbjct: 233 DPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGKERFL 292 Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIH 127 P + TI R++S D + +++ +++ H + G IAIDGKTIR S ++ Sbjct: 293 PPSEPTIRRILSKADPVELDRILSQYIV-AH--SSGRAIAIDGKTIRSS--------SVG 341 Query: 128 MVSAFSNENGVVLGQVKTEA-KSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKD 186 +++A +++G V+ Q + K +EI A LL L L ++T DA+ Q +AS+I++ Sbjct: 342 LMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALASRIRE 401 Query: 187 KKADYLLAVKGNQGKL 202 K DY+ VK N+ L Sbjct: 402 KGGDYVFTVKDNRKTL 417 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 37/77 (48%), Positives = 51/77 (66%) Query: 282 RYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMAL 341 RYYISS + A+EFA +RAHW IE+ LHWVLDV + ED I RG+AA+ ++ + +AL Sbjct: 4 RYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRHVAL 63 Query: 342 NLLRDCKDIKGEEEKKE 358 N +R K I +K+ Sbjct: 64 NQIRREKTIDASVNRKQ 80 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 36/90 (40%), Positives = 61/90 (67%), Gaps = 3/90 (3%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD 314 W LK + + S Q +DK+ V RY+ISS D + ++ A+++R+HW IE+SLHWVLD Sbjct: 15 WSNLKSVGMVESIGQV-DDKTT--VETRYFISSLDSNGEQLANSVRSHWAIENSLHWVLD 71 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLL 344 V + +D +IR+ NA + + ++++A++LL Sbjct: 72 VALKQDDCQIRKDNAPQNFAVMRQIAVDLL 101 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 34/59 (57%), Positives = 42/59 (71%) Query: 23 KHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNI 81 KH I+FL V AVI+GA+ W EI+ FG L+WL+KY F+ GIPVDDTIARV+ I Sbjct: 19 KHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPFECGIPVDDTIARVIKRI 77 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 52/175 (29%), Positives = 84/175 (48%), Gaps = 4/175 (2%) Query: 19 QGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG-IPVDDTIARV 77 QG++ H L A+L L AV+ Q I FG + L F G P T+++ Sbjct: 2 QGRI-HPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQT 60 Query: 78 VSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENG 137 + ID E W+ +A+DGK +RGS D G G H V+A++ Sbjct: 61 LRRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRD-GDVPGP-HRVAAYAPHAA 118 Query: 138 VVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYL 192 VLGQ++ +A++NE A LL ++ + +++T A C +D+A+ + D Y+ Sbjct: 119 AVLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYV 173 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 32/79 (40%), Positives = 52/79 (65%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 + L + +T D R + KH L I+ L + AV++G++ W++IE+FGH +L+WL++Y F Sbjct: 6 TFLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPF 65 Query: 65 DNGIPVDDTIARVVSNIDS 83 GIP DTIARV+ + + Sbjct: 66 KAGIPRHDTIARVICRLKA 84 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 71.6 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 33/76 (43%), Positives = 47/76 (61%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 +I+ +++ + D R G+ H L IL L +CAV++GA W +IED+GH R WL++Y Sbjct: 6 TIEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRY 65 Query: 62 GDFDNGIPVDDTIARV 77 NGIP DTI RV Sbjct: 66 LKLRNGIPGHDTIRRV 81 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 34/90 (37%), Positives = 51/90 (56%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 ++ S PD R KH I+ L + +V+AGA + EIEDF ++WLK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQ 95 NGIP DT +RV S I+ +F+ F+ W++ Sbjct: 65 NGIPSHDTFSRVFSAINPASFQDSFLIWLK 94 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 62/188 (32%), Positives = 90/188 (47%), Gaps = 17/188 (9%) Query: 169 ITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEI 228 IT DA+ QK +A I + A YL VK NQ L+ F+ K N F + K + Q+ Sbjct: 16 ITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLY--FDIK---NYFEHRKEPDYCLQDP 70 Query: 229 S-HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISS 287 HGR +TR + T LN EF G + C+ K +K E Y ++S Sbjct: 71 PGHGRIDTRS--IWTTTELNEY-LEFPHVG-QAFCIHKKSYDPKTNKVCENTF--YGVTS 124 Query: 288 KDMDAKEFAHAI---RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 + + A + R HW IE+S H++LD +ED +RIR GN + ++ A+ LL Sbjct: 125 HHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPANTNRLRGFAIGLL 184 Query: 345 RD--CKDI 350 + KDI Sbjct: 185 KSKGVKDI 192 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 49/164 (29%), Positives = 80/164 (48%), Gaps = 6/164 (3%) Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKF-PVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 KI +KK DY++ +K N + E F ++ ++F R + R + Sbjct: 2 KIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRKL 61 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 V+ ++ EWKG+K + L +K+ D E +YISS D+D + A +R Sbjct: 62 KVS--DWLSKAEEWKGIKSV---LEVCRKRSDNGKESQEKVFYISSLDVDVQILAKCVRG 116 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 HW +E+ HWVLDV ED + AE ++ ++++ALNL R Sbjct: 117 HWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLAR 160 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 47/180 (26%), Positives = 80/180 (44%), Gaps = 11/180 (6%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEK--------FPVNVFSNYKGDSFST 225 M Q D+ + ++++ DY+L K NQG L E FP + + D+ + Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 E+S G +++ LN ++ W G++++ RQ + E V + Sbjct: 61 CEVSKGHGWVERRTMTSTIWLN--EYLTRWPGVQQVFRLTRTRQVGGKTTVEVVYGISSL 118 Query: 286 SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 SS R HW IE S H + D + ED R+RRG A +++ ++ +A+ LLR Sbjct: 119 SSVAAAPDALLRYTRTHWGIE-SRHHIRDATLGEDRCRVRRGAAPRVLAVLRNVAVYLLR 177 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 39/125 (31%), Positives = 65/125 (52%), Gaps = 11/125 (8%) Query: 223 FSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIR 282 +T E HGR E R + +WKGLK+ L +++ K + V + Sbjct: 1 MTTSEKGHGRIEKR-----TLETTPIVTVGQKWKGLKQ---GLRITRERAVKGKKTVEVV 52 Query: 283 YYISSKDM---DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKM 339 Y I+S M +A +R HW IE+ LH+V DV + EDA R+R+G A ++++ ++ + Sbjct: 53 YGITSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNV 112 Query: 340 ALNLL 344 ++LL Sbjct: 113 VVHLL 117 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 67.8 bits (164), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 44/120 (36%), Positives = 63/120 (52%), Gaps = 9/120 (7%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD 314 W+ L+ + + S R +K + + E RYYISS A R HW IE SLHW LD Sbjct: 7 WEELQTIVMVESERAEKGETTIEH---RYYISSTLGTAAYLLDYKREHWGIETSLHWCLD 63 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVHFLY 374 + ED SRI +GN AE + ++ +ALNLL K E+ K G R ++ + F++ Sbjct: 64 IAFREDESRISKGNGAENFAILRHIALNLL------KKEDTAKIGIKNKRLKAGGMEFIF 117 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 50/154 (32%), Positives = 65/154 (42%), Gaps = 7/154 (4%) Query: 196 KGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEI---SHGRKETR-LHIVSNVTRLNFCDF 251 +G L HA + F Y E HGR ETR ++ L Sbjct: 103 QGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTAAGDLDWLATLGL 162 Query: 252 EFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHW 311 + WK K+ + S RY ISS D++ HA+R HW IE+ LHW Sbjct: 163 KERWK---KITSVAGIDSSRVIGSKTETDRRYVISSLPADSERILHAVRMHWGIENGLHW 219 Query: 312 VLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 LDV EDA IR NAA S +++ A+NL R Sbjct: 220 CLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFR 253 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 52/146 (35%), Positives = 73/146 (50%), Gaps = 10/146 (6%) Query: 106 IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLK 165 IA+DGK +RG+ + A H+VS F++ +VLGQ+ KSNEI + LL LL Sbjct: 83 IALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLAVAEKSNEIPCVRALLTLLPDN 140 Query: 166 -KNLITIDAMGCQKDIASKI-KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSF 223 + L+T+DAM Q A I K+ YL+ VK NQ K+ V + DS Sbjct: 141 LRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR 200 Query: 224 STQEISHGRKETR-LHIVSNVTRLNF 248 HGR +TR L I++ + F Sbjct: 201 -----GHGRVKTRTLQIITAARGIGF 221 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 30/79 (37%), Positives = 48/79 (60%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 + L + D R + KH L I+ L + AV++G++ W+ IE+FGH +L+WL ++ F Sbjct: 6 TFLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPF 65 Query: 65 DNGIPVDDTIARVVSNIDS 83 GIP DTIARV+ + + Sbjct: 66 KAGIPRHDTIARVICRLKA 84 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 29/53 (54%), Positives = 36/53 (67%) Query: 277 EGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNA 329 E +RYYI SK + + FA A+R HW IE+SLHW LDV E SRIR+G+A Sbjct: 17 EASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQSRIRKGHA 69 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 63.2 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 36/83 (43%), Positives = 52/83 (62%), Gaps = 1/83 (1%) Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS-K 183 A+H++SAF + GVVL Q+ KSNEI A ELL L + +T DAM Q++ A Sbjct: 8 AVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARFA 67 Query: 184 IKDKKADYLLAVKGNQGKLHHAF 206 ++DK+AD+++ VK NQ +L A Sbjct: 68 VEDKRADFVMTVKDNQPELREAL 90 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 61.2 bits (147), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 65/216 (30%), Positives = 103/216 (47%), Gaps = 18/216 (8%) Query: 109 DGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEA-KSNEITAIPELLNLLYLKKN 167 DGK +RGS + GK++G +V + +G + Q + K +EI + LL+ L Sbjct: 60 DGKELRGSIESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQ 118 Query: 168 LITIDAMGCQKDIASKIKDKKADYLLAVKGNQGK-LHHAFEEKFPVNVFSNYKGDSFSTQ 226 IT+DA+ I +L+ +K NQ L H + P D +T Sbjct: 119 KITLDALHLCPSTTEMITKAGGVFLIGLKENQPTLLAHMTDCALP-------PIDQKTTF 171 Query: 227 EISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFR-QKKEDKSAEGVSIRYYI 285 + +HGR E R + + +V++ F D ++ K+L R +K K + VS YYI Sbjct: 172 DFNHGRVEQRKYWLYDVSKQGF-DPRWDNTAFKRLVKVQRTRINQKNAKISREVS--YYI 228 Query: 286 SSKDMDAKE-FAHAIRAHWLIEHSLHWVLDVKMNED 320 S++ AKE A+R HW +E + H + DV +NED Sbjct: 229 SNE--TAKEGIFDAVRNHWSVEVNNH-IRDVTLNED 261 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 5/95 (5%) Query: 251 FEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLH 310 F+ +G +++ V + ++ E + S YY+++ A A IR HW IE+ LH Sbjct: 81 FQTVIEGQRQIEVFNPYHRRFEPRQE---SPAYYLATCTASAATLAQVIRGHWAIENRLH 137 Query: 311 WVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 VLDV + ED+SRIRR + + ++ ALNLLR Sbjct: 138 HVLDVSLGEDSSRIRRNPG--VFALLRHFALNLLR 170 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 42/137 (30%), Positives = 67/137 (48%), Gaps = 20/137 (14%) Query: 106 IAIDGKTIRGS--FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY 163 IA+DGK ++ S +R H++SA ++ V L +V+ AK+NE T LL L Sbjct: 134 IAVDGKALKASARLTSPRR----HLLSAVTHGRVVTLARVEVGAKTNETTHFKPLLAPLD 189 Query: 164 LKKNLITIDAM-GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFE----EKFPVNVFSNY 218 L ++T DA+ + +I+ ++ KKA Y+ +K NQ HH PV Sbjct: 190 LADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLATLPWRDIPVQ----- 244 Query: 219 KGDSFSTQEISHGRKET 235 + E+ HGR+E+ Sbjct: 245 ----HAASEVGHGRRES 257 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 30/99 (30%), Positives = 60/99 (60%), Gaps = 2/99 (2%) Query: 247 NFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIE 306 ++ + +++W GLK + + ++ ++ E R+YISS D++A++ ++R HW +E Sbjct: 6 SWLNNKYQWVGLKSI-IKVTSDVHEKTTGKETTETRWYISSLDLNAEQALSSVRNHWQVE 64 Query: 307 HSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 S+HWVL++ ED SR R+G + ++K+A+ L + Sbjct: 65 -SMHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFK 102 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 41/130 (31%), Positives = 59/130 (45%), Gaps = 11/130 (8%) Query: 219 KGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEG 278 K +S+ T+E HGRKE R V F +K C+ S D+S +G Sbjct: 13 KQESYITEEKGHGRKEVREVYVLPAA--------FSEALRQKWCLVKSIVAVVRDRSVKG 64 Query: 279 ---VSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISG 335 YYI + + + + A R HW IE+ HW LDV ED RI G++A ++ Sbjct: 65 KGSYETSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRIYAGDSALNMAC 124 Query: 336 IKKMALNLLR 345 ++ NL R Sbjct: 125 CRRFVQNLFR 134 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 2/73 (2%) Query: 281 IRYYISSKDMDAKE-FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKM 339 +RYY++S D E A AIR HW I ++LHW LDV ED S+ + NAA S KM Sbjct: 19 VRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFREDYSK-KVKNAAGNFSVATKM 77 Query: 340 ALNLLRDCKDIKG 352 AL +L++ K KG Sbjct: 78 ALTILKNEKTTKG 90 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 42/122 (34%), Positives = 69/122 (56%), Gaps = 17/122 (13%) Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEG---VSIRYYIS 286 HGR E R ++ T LN ++ W G++++ FR +++ + A+G V + Y IS Sbjct: 5 HGRVERR--SITTTTWLN--EYLTRWPGVQQV-----FRLERQ-RRADGKTTVEVVYGIS 54 Query: 287 SKDMDAKEFAHAI---RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNL 343 S A + R+HW IE SLH+V DV ++ED R+RRG A +++ ++ +A+ L Sbjct: 55 SLSPVAAPPDTVLGYTRSHWGIE-SLHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYL 113 Query: 344 LR 345 LR Sbjct: 114 LR 115 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 58.9 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 28/59 (47%), Positives = 36/59 (61%) Query: 58 LKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS 116 LK+YG F+ GI DTI +VS I + F+K FI+WM C E+ A DGKT+R S Sbjct: 12 LKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 26/70 (37%), Positives = 39/70 (55%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M +++ + + D RQ KV + L +LF+T+C VIAGA+ W EI D+ W K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFDNGIPV 70 G G+PV Sbjct: 72 KGILTEGVPV 81 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 58.2 bits (139), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 30/86 (34%), Positives = 48/86 (55%) Query: 260 KLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNE 319 K +A K +++A RYY++S + + + +R HW IE+ LHW LDV +N+ Sbjct: 23 KSIIATETISSKTNETAISAEWRYYVTSHETEKSDLHLYVRNHWSIENELHWHLDVHLND 82 Query: 320 DASRIRRGNAAEIISGIKKMALNLLR 345 DA + R A S IK+M L+L++ Sbjct: 83 DADKKRDDTTAINFSSIKRMLLSLVK 108 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 58.2 bits (139), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 44/146 (30%), Positives = 67/146 (45%), Gaps = 11/146 (7%) Query: 161 LLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHA----FEEKFPVN-VF 215 + LK +L+T+DAMGCQ+ IA ++++ AD +L++KGNQGK A F+++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 216 SNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKS 275 D F E SHGR R V +T W ++ L V RQ ++ Sbjct: 61 LKPDHDEF---EDSHGRTVRRRGWVLPLT--PETKHSGSWPDIQALLVTEKIRQAHYSET 115 Query: 276 AEGVSIRYYISSKDMDAKEFAHAIRA 301 RYY+S + H A Sbjct: 116 VTS-DFRYYLSRCQEARPDIGHTTHA 140 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 57.8 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 47/74 (63%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+ +S+LD+ S D RQ +V + L I L +CA ++G +++ EI +G RLE+L++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFDNGIPVDDTI 74 + ++ G+P DT+ Sbjct: 77 FLPYERGLPAHDTL 90 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 23/70 (32%), Positives = 43/70 (61%) Query: 285 ISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 +S++ DA + +R HW IE+ LH+V DV + ED R+R G+A ++++ ++ ++L Sbjct: 28 LSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRVRMGHAPQVLAALRNAVVHLW 87 Query: 345 RDCKDIKGEE 354 R+ K + E Sbjct: 88 REVKAVSCPE 97 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 47/183 (25%), Positives = 86/183 (46%), Gaps = 6/183 (3%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL+ ++ PD R ++ L +L L + ++ ++ +EDF E L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 DNG-IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS---FDKG 120 P D T RV+ ID +F W+ + + + +DGK+I+ + +D+ Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 121 KRKGAIHMVSAFSNENGVVLG-QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 + I++VS FS + GV + Q + +EI + LL L L+ + T+D++ CQK Sbjct: 124 YQD-FINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKK 182 Query: 180 IAS 182 + S Sbjct: 183 LYS 185 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 77/338 (22%), Positives = 132/338 (39%), Gaps = 60/338 (17%) Query: 51 GHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDG 110 G + W + G P +T+ +++ +D+ ++ WM+ + G I A DG Sbjct: 8 GRGAVRW-RPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DG 65 Query: 111 KTIRGSFDKGKRKGA--IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNL 168 K + GS KR GA +H V ++ G+ L Q + + A+ LL L + Sbjct: 66 KVLGGS----KRAGAPALHGVELVTHTTGMALAQ-REAVGGDAAAALLALLTEAPLDGRM 120 Query: 169 ITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS------------ 216 +++DA + I + +YL VKG+Q + + P FS Sbjct: 121 VSMDAGFLNAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVA 180 Query: 217 ---------------------NYKGDSFSTQEISHGRKETR-LHIVSNVTRLNFCDFEFE 254 + T E S GR E R L +V + Sbjct: 181 LDQIAPPRRKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYG 240 Query: 255 WK------GLKKLCVALSFRQKKEDK-SAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEH 307 W+ GL++ C R++ D + E V++ +SS+ +F +IR HW IE+ Sbjct: 241 WRQVTQIGGLRRWC-----RRRHADLWTVEEVTV---VSSRQRTPAQFLASIRNHWTIEN 292 Query: 308 SLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 +H D M ED R+ I++ + + +NL+R Sbjct: 293 QVHRPRDGSMQED--RLHGRAIGVILAVCRNVVINLIR 328 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 23/50 (46%), Positives = 39/50 (78%) Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 A+++R+HW IE+SLHWVLDV + +D RIR+ NA + + ++++A++LL Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLL 50 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 25/71 (35%), Positives = 41/71 (57%) Query: 275 SAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIIS 334 +AE V + + + A +AHW IE+ LHWV DV +ED R R GNA ++++ Sbjct: 73 TAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNAPQVMT 132 Query: 335 GIKKMALNLLR 345 ++ +A+ +LR Sbjct: 133 SLRNLAITILR 143 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 26/46 (56%), Positives = 36/46 (78%) Query: 106 IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNE 151 ++ DGKT+R S D+ K AIH+VSA+++ N +VLGQVKT+ KSNE Sbjct: 26 LSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNE 71 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 32/119 (26%), Positives = 60/119 (50%), Gaps = 6/119 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 +SL ++ PD R ++ L +IL + VCAV+AGA + I D+ ++ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWM--QECHEITDGE----IIAIDGKTIRGS 116 F + +P T+ R++ ID+ ++ W+ + + G +IA+DGK +RG+ Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGA 147 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 3/76 (3%) Query: 270 KKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNA 329 K ++A GV+ +SS +E +R HW IE+ LHW+ D ED R GN Sbjct: 39 KTRKETALGVT---SLSSGQASPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNG 95 Query: 330 AEIISGIKKMALNLLR 345 A +++ ++ M ++LLR Sbjct: 96 AHVMATLRNMTISLLR 111 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 54.3 bits (129), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 35/97 (36%), Positives = 54/97 (55%), Gaps = 4/97 (4%) Query: 260 KLCVALSFRQKKEDKSAEGVS-IRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMN 318 K C+A+ R +E K S YYI++ + A +R HW IE S HW+LDV N Sbjct: 63 KSCIAVE-RIVQEGKGEPKTSHFSYYITNHPASDPKLADYVRQHWEIE-SYHWLLDVYFN 120 Query: 319 EDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 +D + N+AE + IK++ LNL++ KD G+++ Sbjct: 121 DDRDKKYEENSAENFAQIKRLPLNLVK-AKDWAGKKK 156 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 2/63 (3%) Query: 283 YYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALN 342 YY+ + A F+ AIR HW +E+ H+V D + EDASRIRR + ++ ALN Sbjct: 98 YYLCDLVLPAARFSEAIRNHWRVENRAHYVRDTRFQEDASRIRRNPCT--FALLRSFALN 155 Query: 343 LLR 345 L+R Sbjct: 156 LMR 158 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 27/76 (35%), Positives = 39/76 (51%) Query: 271 KEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAA 330 +E + V +Y+SS + A E IR HW +E+ +H+ DV ED SRIR Sbjct: 23 RELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGEDRSRIRTLPLV 82 Query: 331 EIISGIKKMALNLLRD 346 ++ S + ALNL R Sbjct: 83 QVWSVARSFALNLYRS 98 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 38/98 (38%), Positives = 54/98 (55%), Gaps = 18/98 (18%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGN----QGKL----HHAFEEKFPVNVFSNYKGDSFST 225 MGCQK+IA I +KADY+LA+KG+ QG+L H E F + F D +T Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNF-----DEHTT 55 Query: 226 QEISHGRKETRL--HIVSNVTRLNFCDFEFEWKGLKKL 261 + HGR ETR ++ N + LN +++W GLK + Sbjct: 56 IDSGHGRIETRRCQQVLVNKSWLN---NKYQWVGLKSI 90 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 31/95 (32%), Positives = 50/95 (52%), Gaps = 4/95 (4%) Query: 85 AFEKMFIEWMQECHEITDG-EIIAIDGKTIRGSFDK--GKRKGAIHMVSAFSNENGVVLG 141 AFE + ++WM + + DG + + DGKT+RGS D+ G I VS +S GV + Sbjct: 3 AFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAIA 62 Query: 142 QVKTEA-KSNEITAIPELLNLLYLKKNLITIDAMG 175 Q +S+E ++ LL+ + L L+ D +G Sbjct: 63 QTTYATDESSETASLLWLLSGIELTDMLVQADEVG 97 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 50.4 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 23/38 (60%), Positives = 30/38 (78%), Gaps = 1/38 (2%) Query: 282 RYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNE 319 RYYISSK++ A++ A+ + HW IE S+HWVLDV MNE Sbjct: 18 RYYISSKELTAEQAANTVSEHWGIE-SMHWVLDVSMNE 54 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 33/98 (33%), Positives = 54/98 (55%), Gaps = 9/98 (9%) Query: 254 EWKGLKKLCVALSFRQKKEDKSAEGV-----SIRYYISSK-DMDAKEFAHAIRAHWLIEH 307 EW+ K + ++ R+ +A G+ + +Y+SS + A +A AIR HW IE+ Sbjct: 53 EWQPFIKTIIRVT-RRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIEN 111 Query: 308 SLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 H+V DV +ED SRIR + I++ + ALN++R Sbjct: 112 RNHYVRDVSCDEDKSRIR--DNPGIMARARSFALNIMR 147 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 3/75 (4%) Query: 273 DKSA--EGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAA 330 DKS + R+ ISS D+ + +A+R+HW +E S+HW+LD+ D SRI R Sbjct: 44 DKSTGKDTAETRWNISSLDLHVVQALNAVRSHWQVE-SIHWMLDMTFRVDESRICRKQGP 102 Query: 331 EIISGIKKMALNLLR 345 + + ++K+A+ L + Sbjct: 103 HVFNVMRKIAMTLFK 117 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 48.5 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 21/39 (53%), Positives = 27/39 (69%) Query: 307 HSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 H LHW LDV+ N+D SR+RRG AA ++ + LNLLR Sbjct: 23 HQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLR 61 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 36/127 (28%), Positives = 55/127 (43%), Gaps = 15/127 (11%) Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIR---YYIS 286 HGR+E+R + C E G+ L+ R + K G R Y ++ Sbjct: 32 HGRRESR--------SIKTCGIADELGGIAFPHGRLALRVHRRRKQTGGCESRETVYAVT 83 Query: 287 SKD---MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNL 343 S D E A A+R HW +E +L V DV E+AS + G A ++ + +A+ L Sbjct: 84 SLDAHETTPAELAAAVRGHWTVE-ALRHVRDVTYAEEASTLHTGTAPRAMATFRNLAVGL 142 Query: 344 LRDCKDI 350 L+ I Sbjct: 143 LKTLGAI 149 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 48.1 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 31/69 (44%), Positives = 36/69 (52%), Gaps = 5/69 (7%) Query: 172 DAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHA----FEEKFPVNVFSNYKGDSFSTQE 227 D +GCQK IA I +++ADYLLAVK NQ LH A FEE F+ Y D Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKAR-FAGYNIDYDEKIN 66 Query: 228 ISHGRKETR 236 GR E R Sbjct: 67 KGPGRLEQR 75 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 47.4 bits (111), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 21/65 (32%), Positives = 35/65 (53%) Query: 26 LSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLA 85 L +I+ + + AV+ GAD + IE +G + WL+ + D GIP DT RV+ ++ Sbjct: 47 LVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPKGIPSHDTFGRVLRILEPKQ 106 Query: 86 FEKMF 90 + F Sbjct: 107 LQSGF 111 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust. Identities = 37/130 (28%), Positives = 56/130 (43%), Gaps = 29/130 (22%) Query: 226 QEISHGR--------KETRLHIVSNVTRLNFCDFEFEWKGLKKLC--VALSFRQKKEDKS 275 EI HGR KE HI +N W G + +A R +K K+ Sbjct: 34 HEIGHGRDILWTLRAKEAPQHIKAN------------WHGTSWIAEVIATGTRDRKPFKA 81 Query: 276 AEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISG 335 +I+S +R W +E S HW+ D +++ED R RGN A +++ Sbjct: 82 TH-----RFITSLRTTPDALLRLVRERWSVE-SWHWIRDTQLHEDDHRY-RGNGAGVMAA 134 Query: 336 IKKMALNLLR 345 ++ A+NLLR Sbjct: 135 LRTAAMNLLR 144 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 22/74 (29%), Positives = 36/74 (48%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++++S+ Y D R KH+ I+ + VC V+ G D I + R EWL+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFDNGIPVDDTI 74 + + NG+P D I Sbjct: 66 FLELPNGLPSRDCI 79 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 4/95 (4%) Query: 69 PVDDTIARVVSNIDSLAFEKMFIEWMQ----ECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 PV ++ ++ ID A F + C IAIDGKT+R SFD Sbjct: 12 PVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFSDTK 71 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELL 159 A +++SAF+ ++ ++L + KSNEI A L+ Sbjct: 72 AAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALI 106 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. Identities = 27/69 (39%), Positives = 39/69 (56%), Gaps = 2/69 (2%) Query: 106 IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLK 165 +A+DGKT R + K +H+V S+ +G +L QV+ EAK+NE LL L L Sbjct: 156 VALDGKTSRHAKRADGSK--VHLVGVASHGDGRLLAQVEVEAKTNETAVFRRLLRPLDLT 213 Query: 166 KNLITIDAM 174 L+T DA+ Sbjct: 214 NVLVTADAL 222 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 45.4 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 22/48 (45%), Positives = 31/48 (64%) Query: 78 VSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 +S I S+ F + FI M+ECH D ++IAIDGK + S DK +R+ A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 7/94 (7%) Query: 254 EWKGLKK-LCVA-LSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHW 311 EW+ ++ LCV RQ K + YYISS + +R HW IE+ LHW Sbjct: 31 EWEAIRSVLCVQRWGTRQGKAYHNTA-----YYISSAATSPHHWQSLVREHWGIENRLHW 85 Query: 312 VLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 DV ED R+ A S ++ + +N+LR Sbjct: 86 PKDVVFGEDDYRLEDEQALLNWSVLRTIVINILR 119 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 45.1 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 35/111 (31%), Positives = 60/111 (54%), Gaps = 12/111 (10%) Query: 28 AILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG-IPVDDTIARVVSNIDSLAF 86 +L L + AV+AG + I FG R + L F NG +P +TIA ++ +D+ Sbjct: 2 GLLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHL 61 Query: 87 EKMFIEWMQECHEITDG-EIIAIDGKTIRGSFDKGKRKGAI---HMVSAFS 133 +++ W+ + H DG + IA+DGK + GS R GA+ H+++A++ Sbjct: 62 DRIIGAWLGDRH--PDGWDHIALDGKRLCGS-----RDGAVPGTHLLAAYA 105 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 28/87 (32%), Positives = 40/87 (45%), Gaps = 5/87 (5%) Query: 261 LCVALSFRQKKEDKSAEGVSIRYYISS---KDMDAKEFAHAIRAHWLIEHSLHWVLDVKM 317 C+ F + K K E I Y I+S + K R HW IE+ LH+V D Sbjct: 30 FCIHRIFTKVKTGKKTE--EIVYGITSLTQQKASPKTILKFSRGHWSIENGLHYVRDTAF 87 Query: 318 NEDASRIRRGNAAEIISGIKKMALNLL 344 ED S+IR NA ++ +K + + L Sbjct: 88 REDHSQIRTQNAPRAMASLKNLVVGLF 114 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 50/216 (23%), Positives = 97/216 (44%), Gaps = 24/216 (11%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAG------ADEW-QEIEDFGHER 54 SI S L Y++ PD R+ K +H+ +L + + AV +G +W Q+ F + Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 55 LEWLKKYGDFDNGIPVDDTIARVVSNI--DSLAFEKMFIEWMQECHEITDGE-----IIA 107 + + G + +P T+ R+ ++ D +K + W +E + E +A Sbjct: 65 VHIRTRRG--ERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVA 122 Query: 108 IDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQ----VKTEAKSNEITAIPELLNLLY 163 +DGK +RG+ + + A+ +SA G+ LG A + + E L + + Sbjct: 123 VDGKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVDW 182 Query: 164 LKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQ 199 ++T DA C +++A+ + ++K A KG + Sbjct: 183 ----VLTGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 38/150 (25%), Positives = 68/150 (45%), Gaps = 7/150 (4%) Query: 26 LSAILFLTVCAVIAGADEWQEIEDFGHE-RLEWLKKYGDFDNGIPVDDTIARVVSNIDSL 84 L+++L L V+AG + + ++ + E L +G GIP + T R+V D + Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFG-LTRGIPSERTTRRLVEGCDPV 106 Query: 85 AFEKMFIEWMQECHEITDGEI--IAIDGKTIRG--SFDKGKRKGAIHMVSAFSNENGVVL 140 A ++ W+ + D +A DGKT++G SF + ++ A ++ G+ Sbjct: 107 ALDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITA 166 Query: 141 GQVKTEAKSNEITAIPELLNLLYLKKNLIT 170 G + +EI A+ L L L L+T Sbjct: 167 GHQRV-VGGDEIAALEALAGRLDLTDVLVT 195 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 43.9 bits (102), Expect = 0.010, Method: Composition-based stats. Identities = 33/104 (31%), Positives = 45/104 (43%), Gaps = 11/104 (10%) Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD 289 HGR ETR T L C W GLK + + K V + + I+S+ Sbjct: 5 HGRIETR---TVRATPLLTC--HDRWTGLKH---GFRITRTRTVKGVTTVEVVHGITSRP 56 Query: 290 M---DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAA 330 + DA+ +R+HW IE+ H V DV + ED R R A Sbjct: 57 VERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAG 100 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 42.0 bits (97), Expect = 0.038, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 1/91 (1%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD 314 W GL + + R S +R+ + S ++ A AIR H + WVL+ Sbjct: 7 WPGLTTVLATETLRGGNGTDSVPA-QVRHSLGSSTAPSEVLAQAIRRHGALATGEPWVLE 65 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 V E+ SR+R AA ++ ++++AL+ R Sbjct: 66 VSFGEERSRVRERCAARHLALLRRVALDRRR 96 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 41.6 bits (96), Expect = 0.054, Method: Composition-based stats. Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 7/92 (7%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGV--SIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWV 312 W+G + +AL R++ K++ + Y ++S AK R HW +E+ LH Sbjct: 4 WRGSR---MALRMRRRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHK 60 Query: 313 LDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 D + EDASR R+G A + ++ + LNLL Sbjct: 61 RDTVLGEDASRSRKGAAG--LMYLRDVILNLL 90 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 41.2 bits (95), Expect = 0.054, Method: Compositional matrix adjust. Identities = 43/179 (24%), Positives = 76/179 (42%), Gaps = 6/179 (3%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L+ ++ PD R + V+ +L + V +++ + +++E F L + + Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 66 NGIPVDDTIAR-VVSNIDSLAFEKMFIEWM--QECHEITDGEIIAIDGKTIRGSFDKGKR 122 P D+ R +D A +W Q D + + DGKT+RGS + Sbjct: 73 LKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTSG 132 Query: 123 KGA--IHMVSAFSNENGVVLGQV-KTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 GA I V+ +S GV + Q + +E + +LL L L+ LI DA+ Q+ Sbjct: 133 GGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 41.2 bits (95), Expect = 0.062, Method: Composition-based stats. Identities = 17/30 (56%), Positives = 25/30 (83%) Query: 128 MVSAFSNENGVVLGQVKTEAKSNEITAIPE 157 MV+A + NG+ +GQ+K ++KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 432 e-120 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 418 e-115 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 411 e-113 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 408 e-112 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 400 e-110 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 397 e-109 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 397 e-109 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 388 e-106 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 379 e-104 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 377 e-103 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 370 e-101 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 364 4e-99 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 364 5e-99 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 363 6e-99 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 362 1e-98 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 360 4e-98 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 353 7e-96 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 353 7e-96 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 352 2e-95 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 350 4e-95 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 348 1e-94 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 347 6e-94 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 345 2e-93 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 344 3e-93 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 344 3e-93 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 339 1e-91 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 338 2e-91 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 335 1e-90 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 333 5e-90 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 329 1e-88 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 328 2e-88 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 328 3e-88 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 325 1e-87 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 325 2e-87 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 323 5e-87 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 323 7e-87 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 317 6e-85 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 315 2e-84 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 313 8e-84 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 310 4e-83 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 304 3e-81 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 303 1e-80 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 299 1e-79 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 297 4e-79 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 295 1e-78 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 290 6e-77 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 285 2e-75 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 282 2e-74 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 279 1e-73 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 275 2e-72 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 274 4e-72 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 273 5e-72 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 269 1e-70 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 269 1e-70 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 267 5e-70 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 266 7e-70 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 266 8e-70 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 261 3e-68 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 257 6e-67 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 252 1e-65 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 252 2e-65 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 247 4e-64 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 247 5e-64 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 244 5e-63 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 238 3e-61 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 237 5e-61 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 233 8e-60 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 229 9e-59 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 228 2e-58 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 228 3e-58 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 227 6e-58 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 222 1e-56 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 218 3e-55 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 213 1e-53 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 209 9e-53 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 206 1e-51 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 190 7e-47 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 189 1e-46 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 186 1e-45 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 185 3e-45 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 182 2e-44 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 182 3e-44 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 177 8e-43 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 175 3e-42 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 174 5e-42 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 170 8e-41 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 169 1e-40 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 164 4e-39 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 164 5e-39 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 164 7e-39 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 164 7e-39 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 157 5e-37 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 154 4e-36 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 151 4e-35 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 151 4e-35 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 148 3e-34 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 148 4e-34 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 147 6e-34 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 145 2e-33 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 145 2e-33 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 144 4e-33 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 144 5e-33 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 143 1e-32 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 143 1e-32 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 142 2e-32 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 142 2e-32 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 141 4e-32 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 141 4e-32 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 135 2e-30 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 129 2e-28 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 128 3e-28 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 125 2e-27 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 123 1e-26 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 118 4e-25 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 115 2e-24 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 114 5e-24 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 114 7e-24 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 109 2e-22 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 106 2e-21 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 105 2e-21 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 105 3e-21 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 104 5e-21 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 103 9e-21 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 103 1e-20 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 101 4e-20 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 101 5e-20 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 100 7e-20 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 100 7e-20 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 100 1e-19 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 100 1e-19 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 100 1e-19 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 99 3e-19 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 99 3e-19 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 99 3e-19 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 98 4e-19 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 98 5e-19 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 94 8e-18 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 94 8e-18 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 93 2e-17 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 92 3e-17 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 92 3e-17 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 92 3e-17 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 92 4e-17 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 91 6e-17 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 90 1e-16 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 87 8e-16 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 85 4e-15 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 82 3e-14 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 82 5e-14 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 81 8e-14 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 70 2e-10 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 67 2e-09 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 65 3e-09 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 65 4e-09 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 62 4e-08 Sequences not found previously or not previously below threshold: UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 129 1e-28 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 123 1e-26 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 120 1e-25 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 113 9e-24 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 107 5e-22 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 106 2e-21 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 96 2e-18 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 93 1e-17 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 92 2e-17 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 92 3e-17 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 92 3e-17 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 91 6e-17 UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ... 90 1e-16 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 87 1e-15 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 87 1e-15 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 85 3e-15 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 82 3e-14 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 81 9e-14 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 80 1e-13 UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferro... 80 2e-13 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 76 2e-12 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 75 3e-12 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 75 4e-12 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 75 5e-12 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 74 8e-12 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 74 1e-11 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 73 2e-11 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 70 1e-10 UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 69 3e-10 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 68 7e-10 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 67 2e-09 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 66 2e-09 UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=... 65 5e-09 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 64 8e-09 UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=... 64 9e-09 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 64 9e-09 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 64 1e-08 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 62 3e-08 UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitroco... 60 1e-07 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 59 2e-07 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 58 5e-07 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 58 5e-07 UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata ob... 58 6e-07 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 56 2e-06 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 56 2e-06 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 56 3e-06 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 53 2e-05 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 52 3e-05 UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=... 52 3e-05 UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aro... 51 5e-05 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 51 9e-05 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 50 1e-04 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 50 2e-04 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 50 2e-04 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 49 3e-04 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 49 3e-04 UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromon... 47 8e-04 UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus p... 47 8e-04 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 47 0.001 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 47 0.001 UniRef50_UPI00016C3A84 hypothetical protein GobsU_12175 n=1 Tax=... 45 0.003 UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone... 45 0.005 UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodoco... 45 0.006 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 44 0.007 UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillace... 44 0.008 UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia ... 44 0.009 UniRef50_D1RJD3 Putative uncharacterized protein n=1 Tax=Legione... 44 0.009 UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitroso... 44 0.010 UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewane... 44 0.013 UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synecho... 43 0.020 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 41 0.055 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 432 bits (1111), Expect = e-120, Method: Composition-based stats. Identities = 207/358 (57%), Positives = 270/358 (75%), Gaps = 2/358 (0%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M ++ L+++IS+ PD RQ KV+HKLS IL LT+CAVI+GA+ W++IEDFG L++LK+ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 YGDF+NGIPV DTIARVVS I F + FI WM++CH D ++IAIDGKT+R S+DK Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 +R+GAIH++SAFS + +V+GQ+KT+ KSNEITAIPELLN+L +K +IT DAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A KI+ + DYL AVKG QG+L+ AFEEKFP+ +N + DS++ E SHGR+E RLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQ-KKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 +V DF FEWKGLKKLCVA+SFR E K +++RYYISS D+ A++FA AI Sbjct: 241 CDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 R HW +E+ LHW LDV MNED +IRRGNAAE+ SGI+ +A+N+L + K K +K Sbjct: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 172/357 (48%), Positives = 248/357 (69%), Gaps = 4/357 (1%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 MS +L++ +S+ D RQ KV H L +LFL + AVI+G + W+EI+DFG+++L+WL+K Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 Y F GIP DDTI+R+ ID F+K F WM+ C E++ G++IAIDGKT+RGSF+K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 + IHMVSAF+ N VVLGQVKT AKSNEITAIP+LL+LL ++ L+TIDAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A KI DK DYLL VKGNQ +L A + F + + ++++T+E HGR+++R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 ++ + D FEW GLK L A+SFR +K+ ++ V++++YISS +DAK A R Sbjct: 241 ADANEIG--DLVFEWPGLKTLGYAVSFRTEKDMQT--TVAVKFYISSAKLDAKSLLEASR 296 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 AHW +E++LHW LD+ MNED+ RIR+ N+ E ++ ++ +LNLL++ K G ++K Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRK 353 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 411 bits (1056), Expect = e-113, Method: Composition-based stats. Identities = 154/369 (41%), Positives = 214/369 (57%), Gaps = 10/369 (2%) Query: 2 SIQSLLDYISVTPDIRQQG-KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 +++S +Y D R++ +H IL + VCA+I+GA+ + EIE FGH + EW + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 + NGIP DT V++ + FE F+ W GE IAID KT+RGS DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 K +H+VSA++ E +V+GQ+KTE SNEITAIPELLN L LK L++IDAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVF---SNYKGDSFSTQEISHGRKETRL 237 A KI +K ADY+LA+KGNQ KLH + E F + Y+ D T E S+GR+E R Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 +N + EWK +K + + S R KKE + IRYYISS + A++ Sbjct: 245 AYATN--EIEKIIANDEWKNIKTVAMIESQRIKKEKE----FDIRYYISSAKLSAEDCLK 298 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 +R HW IE+ LHW LDV ED SRIR+ N AE ++ ++++ALNL++ K K + K Sbjct: 299 VVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATK 358 Query: 358 EGCVKHRER 366 E+ Sbjct: 359 RLMAGWDEK 367 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 171/361 (47%), Positives = 231/361 (63%), Gaps = 8/361 (2%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ SL++ S+ D RQ+ K+ H+L IL L V AVI GA+ WQ+IE+ GH RL WL++ Sbjct: 2 IARTSLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQE 61 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 G F GIPVDDTIAR++S+++ ++ FI+WM E TDG+IIA+DGK+IR S+DK Sbjct: 62 RGFFKKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKK 121 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 KRK AIHMVSA++ ENGVVLGQ KT+ KSNEI AIP LL+LL +K ++TIDAMGCQ+ I Sbjct: 122 KRKSAIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKI 181 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRL 237 A KI K+ DY+LAVK NQ +LH + F + F + D F HGR E R Sbjct: 182 AEKIVTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRR 241 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 + +S++ L+ W L+ + + S R +AE RY+I+S DAK FA+ Sbjct: 242 YWISDM--LSTLGNPERWASLQSIGMVESERYIDGKTTAET---RYFITSIAPDAKIFAN 296 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 A+R HW IE+ LHWVLDV ED SR+RR NA+E + +A+N LR+ K K + K Sbjct: 297 AVRKHWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAK 356 Query: 358 E 358 Sbjct: 357 R 357 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 400 bits (1027), Expect = e-110, Method: Composition-based stats. Identities = 143/387 (36%), Positives = 224/387 (57%), Gaps = 25/387 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 ++ DY D R + KHKL I+ +T+CAVI GAD W +IE FG + +WLKK+ + Sbjct: 8 TIEDYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLEL 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NGIP DT RV S ++ +++F++W+Q T GEI+AIDGKT+R S+D+ K K Sbjct: 68 PNGIPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKP 127 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPE---------------LLNLLYLKKNLI 169 A+ M+SA++ NG+VLGQ + KSNEITAIP+ LL +L L ++ Sbjct: 128 ALQMISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIV 187 Query: 170 TIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG---DSFSTQ 226 T+DA+GCQK+I +I ++ ADY++ +K NQG L+ E F + SN++G + + Sbjct: 188 TLDAIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVK 247 Query: 227 EISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYIS 286 + HGR+E R + + + D +++W L + R + + RY+IS Sbjct: 248 DEGHGRQEVRYYQMLSNVA-EEIDPDWQWLNLNSIGYVEYLR-VENGTDKTSLERRYFIS 305 Query: 287 SKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRD 346 S + + K FA ++R HW IE+ HW+LDV+ NED SRIR+ NA ++ ++ +ALNLL+ Sbjct: 306 SLNNNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQ 365 Query: 347 CKDIKGEEEKKEGCVKHRERSSEVHFL 373 K +K + K ++ + ++L Sbjct: 366 EKTLKVGVKAK-----RKKAGWDENYL 387 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 135/360 (37%), Positives = 211/360 (58%), Gaps = 7/360 (1%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 S++++ S D R ++++ L I+ +T+CAV+ GAD W E+ ++G + +WLK++ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGK 121 NG+P DT V + + ++ F+ W Q ++++ GE+IAIDGKT+RG+ G+ Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 + IHMVSA+++ N +VLGQ + KSNEITAIPELL +L L+ L++IDAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRLH 238 I + + DY+LA+KGNQG L++ + F F + DS+ T E HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHA 298 + + ++ W LK + S R++ + RYY+ S + DA+ FA A Sbjct: 245 W--TMGQTDYLLGAERWAQLKSIGCVESCRRQPGHPG--TLQRRYYLLSIESDAQRFADA 300 Query: 299 IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 +R+HW IE+ LHW+LDV ED R +G +A+ +S I+ +A NLL+ K + K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 129/356 (36%), Positives = 199/356 (55%), Gaps = 6/356 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL Y+ D R Q KH L +L + + AVIAG+ W+++E++G + EWL ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +GIP DDT RV ID + +K +W+Q GEII IDGKT+RGS+D+ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A++ V+A++++ +VLGQVK E SNEITAIP LL LL + ++ITIDAMG Q I +I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG---DSFSTQEISHGRKETRLHIVS 241 +KADY++ +K N L ++ F + + G D + + H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 V + + +W GL+ + V R + I++Y++S +A+ HAIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWNKTTH---DIQFYLTSLPPNAQFLCHAIRT 326 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 HW IE++LHW LDV +ED RIR + + + ++++ALN+L K K +K Sbjct: 327 HWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQK 382 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 160/360 (44%), Positives = 218/360 (60%), Gaps = 6/360 (1%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M I+S + S D RQ KV + L +LF ++CAVIA ++ W EI ++ W KK Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 F +GIP DDTIAR+VS ID +F F+ WM+ H++T+GE+IAIDGKT+RGS+++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 R IHM+SA+++ N +VLGQ+K E KSNEITAIP LL +L L+ L+TIDAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A+ I DK DYLLAVK NQG L A + F + + D E SHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGLS-DDHVNIEKSHGRIENRTCYV 239 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 + L+ W+ LK + + SFR K + + RYYISSK + A++ A R Sbjct: 240 LSSAALD--GDFTHWEALKSIVMVESFRAVKGKTA--SLEYRYYISSKVLSAEQALSATR 295 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 HW IE S+HWVLDV MNED +I + N AE ++ ++ M+LN+L+ K++ C Sbjct: 296 EHWGIE-SMHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPTKLSIVGKRKRC 354 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. Identities = 131/365 (35%), Positives = 212/365 (58%), Gaps = 16/365 (4%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q+L+++ D R +G+ H+L +L + +C ++ G + + ++EDFG + +W K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 +GIP DT RV + + AF F+ W Q EI+A+DGK +R + ++G + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQG--Q 124 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 +VSA++ N +VLGQ++ K+NEITA+P+LL +L L ++T+DAMGCQK+IA + Sbjct: 125 SPRVIVSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGD----------SFSTQEISHGRK 233 I + A+Y+LA+KGNQG+ H + V + + T E HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAK 293 ETR + S +++ +W GL+ + V S RQ + A V RYY+SS ++D + Sbjct: 245 ETRRYWQS--GDVSWLADRQQWAGLRSVGVVESVRQVGQQ--APTVERRYYLSSLNVDVE 300 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGE 353 +FA A+R HW +E+SLHWVLDV+ ED +R R G+AAE ++ ++++ALNLL+ K Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 EEKKE 358 + K+ Sbjct: 361 IKGKQ 365 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 151/368 (41%), Positives = 214/368 (58%), Gaps = 9/368 (2%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L + S D RQ+ KV + L IL LT+CAV++GA++W I +G ++L +LK++ F Sbjct: 25 FLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPFA 84 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 +G P D + + + +D+ AF+ FI+W+ ++ G ++AIDGKT R S DK K A Sbjct: 85 DGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKAA 143 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 IHM+SA+S+E + L Q + + KSNEITAIPELL LL LK ++TIDAMGCQ++IA+KI Sbjct: 144 IHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKII 203 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG---DSFSTQEISHGRKETRLHIVSN 242 K+ADY+LA+KGNQG L E +Y T E SHGR ETR V+ Sbjct: 204 SKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRR--VTV 261 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 T +++ + W GLK + + ++ AE RYYISS DA+ A AIR H Sbjct: 262 CTDIDWLKADHNWPGLKSIVMVQYHAILQDKTRAET---RYYISSMTSDAEHHAKAIRDH 318 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 W IE+ LHWV+D+ +D RIR GNA + IK +A N+LR K K+ Sbjct: 319 WGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVKGKHSLRSKRHIASW 378 Query: 363 HRERSSEV 370 + +E+ Sbjct: 379 DDDFLAEI 386 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 129/332 (38%), Positives = 182/332 (54%), Gaps = 6/332 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L Y D R + H+L I+ + + AV+AGAD W IE +G + WL+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 NGIP DT ARV + +D A E F W++ ++IAIDGKT +GS+D+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 + +VSA+++E+ +VLGQ + KSNEITAIP LL L L +++IDAMG + IA++I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFP---VNVFSNYKGDSFSTQEISHGRKETRLHIVSN 242 ++ADY+LA+KGNQ L ++ F + + E +H R E+R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 V ++ +W GL+ L V S R + E RY++SS DA FAH IRAH Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNKDTTET---RYFLSSLSTDAATFAHYIRAH 310 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIIS 334 W IE+ LHW LDV NED SRIR+ +A S Sbjct: 311 WGIENQLHWCLDVVFNEDKSRIRKDHAPRNFS 342 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 364 bits (933), Expect = 4e-99, Method: Composition-based stats. Identities = 132/372 (35%), Positives = 190/372 (51%), Gaps = 13/372 (3%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 ++D D R K HK+ I+++++ AVI GA W EIE+FG+ ++ + K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS------FD 118 IP DT R S I FE +F W+++ + G ++AIDGK +RG Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 GK + MVSA+S NG+ LGQVK + KSNEITAIP L+N L L ++TIDAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKET 235 DI I ++ A+Y++A+K N+ K + ++ + + ++ HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD-AKE 294 R V + + F+ + GLK + S R +RYY++S D +E Sbjct: 243 RTCTVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIV-ATGEYTQEVRYYVTSLDNTKPEE 301 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 A AIR HW IE++LHW LDV ED S+ + NAA S KMAL +L+ K KG Sbjct: 302 IASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGSM 360 Query: 355 EKKEGCVKHRER 366 K E+ Sbjct: 361 NLKRLKAGWDEK 372 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 364 bits (933), Expect = 5e-99, Method: Composition-based stats. Identities = 129/374 (34%), Positives = 190/374 (50%), Gaps = 20/374 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 +L + D R Q +H L+ IL + CA++ G + +E FG+ + WL+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE--------IIAIDGKTIRG 115 NGIP DT +V S +D F + F W Q E +IAIDGK +RG Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 + DKG + +V A+++E + LGQVK KSNEI A+PELL +L LK ++TIDAMG Sbjct: 134 AVDKG--QAPAVIVGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVF-SNYKGDSFSTQEISHGRKE 234 CQ+++A KI +K DY+LA+K NQ LH + +G+ + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 R VS + +W GL+ + R + V RY+ISS DA Sbjct: 252 VRRCWVSEEVEC-WLQGAEKWAGLRSVAAVECERTVAGQTT---VQRRYFISSLKADAAL 307 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 A ++RAHW IE+SLHWVLDV ED SR RRG +AE ++ ++++ +++ Sbjct: 308 IAASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENP----- 362 Query: 355 EKKEGCVKHRERSS 368 K+ + R + Sbjct: 363 NSKKSVNQRRFEAG 376 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 363 bits (932), Expect = 6e-99, Method: Composition-based stats. Identities = 139/380 (36%), Positives = 211/380 (55%), Gaps = 19/380 (5%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 + S+L+Y + D R+ KH L +L + V AVIAGAD + I + +EWLK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT-----DGEIIAIDGKTIRGS 116 + +G+P DTI R+++ + AF++ F EW+ + EIIAIDGKT+R S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 D+GK G + + SA++ GV LGQ+ KSNEI PEL+ + ++K ++T+DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV---NVFSNYKGDSFSTQEISHGRK 233 Q+D+A KI K DY+LA+K NQ +LH + N F+ K + + HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAK 293 + R + + + +W+GLK + VA+ Q+ ++ RYYISS DAK Sbjct: 249 DKRFYYQVKLP--DEVPAGEDWRGLKTIGVAIRISQENGRETC---DTRYYISSLKPDAK 303 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGE 353 +FA A+R HW IE+SLHW LDV ED SR+R AAE ++ +K++A++L++ K Sbjct: 304 QFAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHK----- 358 Query: 354 EEKKEGCVKHRERSSEVHFL 373 K+ ++ R V+FL Sbjct: 359 -SKESVVMRRRMAGWNVNFL 377 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 362 bits (929), Expect = 1e-98, Method: Composition-based stats. Identities = 142/365 (38%), Positives = 207/365 (56%), Gaps = 14/365 (3%) Query: 5 SLLDYISVTPDIRQQ-GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 SL+ D R++ H +L + + AV++ D ++I +G E+ +WL+++ Sbjct: 8 SLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLV 67 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 NG+ ++T R+ +D FE F W+ G + +DGKT+RGS G+ Sbjct: 68 LLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGSGSGGE-- 124 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 AIHMVSAF+ E GVVLGQ K +KSNEITAIPELL LY+ L+TIDAMGCQK+IA + Sbjct: 125 SAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQ 184 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNV 243 I D+ DYLLAVKGNQ L A E +F ++ + + D SHGR ++ V Sbjct: 185 ITDQGGDYLLAVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVAQIASVLPA 243 Query: 244 TRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW 303 + +W KK+ S R+ +S + RYYISS+++ A++ A A+RAHW Sbjct: 244 EGIVDL---ADWPECKKIARVDSLRKVGNHES--KLERRYYISSRELTAEQLAAAVRAHW 298 Query: 304 LIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKH 363 IE+ LHWVLDV EDAS IR+GNA + +S +KK+ LNL+R + ++ K Sbjct: 299 GIENRLHWVLDVSFGEDASTIRKGNAPQNLSLLKKIVLNLIR----LDTADKTKTSLRLK 354 Query: 364 RERSS 368 R+ ++ Sbjct: 355 RKCAA 359 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 360 bits (925), Expect = 4e-98, Method: Composition-based stats. Identities = 130/361 (36%), Positives = 190/361 (52%), Gaps = 7/361 (1%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + +++ D R + +H+LS +L + VCAV++GAD+++EI +G ++ WL+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC-HEITDGEIIAIDGKTIRGSFDKGK 121 D G+ DT RV + +D FE+ F W+ + ++IAIDGK+ R + K Sbjct: 66 RLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKAA 125 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 +H+VSAF+ GVVLGQ T KSNEITAIPELL +L ++ ++TIDAMG Q IA Sbjct: 126 AA-PLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKIA 184 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 I+++ A Y+L VK N KL + + T HGR E R Sbjct: 185 RAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTAF 244 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 + T + WK + V R E S E V YYISS DA+ A AIR+ Sbjct: 245 DAT--DRLHKAEAWKDVASFAVVERVRTVGERTSTERV---YYISSLPADAERIAVAIRS 299 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 HW +E+ LHW LDV+ +D +R R G+ A ++ ++ MALNL+R K IK + K Sbjct: 300 HWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSIKTKRLLA 359 Query: 362 K 362 Sbjct: 360 A 360 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 353 bits (905), Expect = 7e-96, Method: Composition-based stats. Identities = 136/366 (37%), Positives = 192/366 (52%), Gaps = 13/366 (3%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+ +SLLDY+ PD R Q K H LS ++F+ +CA++ G D W EI F ER W ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-GEIIAIDGKTIRGSFDK 119 + GIP DT R+ + + + + +F W+ + +A+DGK +R + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 G+ A+HMV+ +S E G+ +GQ K KSNEITAIPELL LL LK L++IDAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYK----GDSFSTQEISHGRKET 235 IA I K DYLLAVK NQ L+ +E+F N + HGRKE Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 R V V +WK K +A+ + + K + V R+YISS+ +DA Sbjct: 240 RRCWVLMVDESM--PVCQQWK--AKTIIAVQAERIENGKGYDFV--RFYISSRALDATSA 293 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK-GEE 354 A RAHW +E+ LHW LD+ ED + R G A E ++ I++ LN+L+ K Sbjct: 294 LKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKSRNLSMA 353 Query: 355 EKKEGC 360 K+ C Sbjct: 354 NKRRLC 359 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 353 bits (905), Expect = 7e-96, Method: Composition-based stats. Identities = 129/368 (35%), Positives = 192/368 (52%), Gaps = 11/368 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWL--KKYG 62 +LL+ S PD R+ ++ L+ IL + VCA++ GAD W E+ D+ +R EWL + Sbjct: 2 TLLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRW 61 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 + G P DT + +D+ FE F +W++E + DG ++AIDGKT+RGS KG Sbjct: 62 PLEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSN 120 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 + +HMV+A++ ++G+ L Q T K +E+ + LL++L LK ++T+DA+GCQ ++A Sbjct: 121 E-LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAE 179 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKETRLHI 239 KI + DY+L VK NQ L A E F + + + F E HGR ETR + Sbjct: 180 KIVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYT 239 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM-DAKEFAHA 298 N WK L + + S RQ + S V RY I S + + FA A Sbjct: 240 WINDVTWMDRPMRAAWKKLGGVGMIESIRQIGDKVS---VDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 R+HW IE+ LHW LDV ED R R GN+A +S ++K L LR + K ++ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 GCVKHRER 366 E Sbjct: 357 LHADRNES 364 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 352 bits (902), Expect = 2e-95, Method: Composition-based stats. Identities = 131/371 (35%), Positives = 198/371 (53%), Gaps = 18/371 (4%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q+ L+ ++ D R + ++L IL ++ AVI D + E+ F + ++L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECH------EITDGEIIAIDGKTIRGSF 117 F +G P DT +V+S +D + F WM E + + G +AIDGKTI S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRSG 122 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 + A H+++AF++ +VLGQ+KT+ KSNEITAIPELL L +K ++TIDAMG Q Sbjct: 123 S--AEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN------YKGDSFSTQEISHG 231 K+IA+KI +K DY+LAVKGNQ KL + KG T E HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 RKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSK-DM 290 R E R +SN L++ + +W+G+ + + R Y+I S + Sbjct: 241 RIEKRECYLSN--DLSWFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEA 298 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 AK+ R HW IE++LHW+LD+ ED R R NAAE+++ ++K+AL +L+ C Sbjct: 299 QAKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTC 358 Query: 351 K-GEEEKKEGC 360 K G K++ C Sbjct: 359 KCGMRSKRKLC 369 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 350 bits (898), Expect = 4e-95, Method: Composition-based stats. Identities = 138/368 (37%), Positives = 200/368 (54%), Gaps = 18/368 (4%) Query: 7 LDYISVTPDIRQQ-GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 + + D R+ H IL + + AV++ D ++I + + WL+++ Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE-----IIAIDGKTIRGSFDKG 120 NGIP ++T R++ +D FE MF W+ + IAIDGKT+RGS G Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGSGSGG 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 + AIHMVSAF+ E G+VLGQ K AKSNEITAIPELL L +K L+TIDAMGCQK I Sbjct: 121 E--SAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A +I KK DYLL VKGNQ KL A E F ++ D S E HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASV 237 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 + + +W + S R + +S + RYYISS+ + A++ A A+R Sbjct: 238 LSAKGIVDP---ADWPKCVTIGRIDSMRVVGDKQS--DLERRYYISSRALSAEQLAAAVR 292 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 AHW +E+ LHW+LDV +EDAS + + NA + +S ++K+AL ++R K + +K Sbjct: 293 AHWGVENRLHWILDVSFSEDASTVAKDNAPQNLSLLRKIALTIIRADKT----DTRKSSL 348 Query: 361 VKHRERSS 368 R+ ++ Sbjct: 349 RLKRKGAA 356 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 348 bits (894), Expect = 1e-94, Method: Composition-based stats. Identities = 138/371 (37%), Positives = 209/371 (56%), Gaps = 9/371 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 S + + D R KH L ++FLTV A+++GA+ W++I+ FG +L+WL+K+ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 G+PVDDTIAR++S+++ A FI W+ E E +IA DGKT+R SFD G RK Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFD-GDRKT 120 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H VSA++ E G+VL Q K++ K NE++ + EL+ LL LK +++T DAM C K +A I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFP---VNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 K DY+L VK NQGKL F + + K +S + HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYV-- 238 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 +L + + +G + + +K+ K E YYISS +++ + A AIR+ Sbjct: 239 ---QLPITPWLTQSQGWTNIKPVIEVTRKRYLKDKETSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 HW IE++ HWVLD+ ED SRIRRG+A E ++ ++ A+NL R + K + Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLARLSPIKDSMKGKLKQAA 355 Query: 362 KHRERSSEVHF 372 E ++ F Sbjct: 356 WSDEVREKLLF 366 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 347 bits (889), Expect = 6e-94, Method: Composition-based stats. Identities = 129/359 (35%), Positives = 198/359 (55%), Gaps = 9/359 (2%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q L D R G+ + L IL +T+CA+I G D W+ I DFG +R WL ++ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 G+P T ARV S I+ F+ WM + ++ ++I +DGK++ GS +GK + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 A H+V+A+ + V LG+V+ KSNEI AIP LLN L ++ +I+IDAMG QK IA+ Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEI---SHGRKETRLHIV 240 I+ K+ADY+LA+K N + + E F + +Y+G + T+E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD-MDAKEFAHAI 299 + + F ++ W+ L+ + S R K + + RYYI+S + + + AI Sbjct: 254 --LPMMYFHKYKKYWRDLQAIVRVQSKRHKGNE---IETATRYYITSLPFAEHRRMSQAI 308 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 R HW IE+ LHW LD+ + EDAS I RG A + ++ ++KM L +L + K K Sbjct: 309 RQHWAIENQLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKR 367 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 345 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 130/365 (35%), Positives = 207/365 (56%), Gaps = 14/365 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +Q LL+++ D RQQ KV+H L IL + + A +A AD+W E+ F + ++L+KY Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD---GEIIAIDGKTIRGSFDK 119 + NG P DT+ RV+ + ++++ +W + + +II IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 G++ G H+VSA+S E+G LGQ KSNEITAIPELL + +K ++TIDAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS---NYKGDSFSTQEISHGRKETR 236 IA KI++K+ADY+L++K NQG L+ E F F +G TQE +HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFA 296 + ++ + + WKGLK + + R+ E + + RY+ISS + + + Sbjct: 239 EYY--QTEKIKWLSQKKAWKGLKSIIM---ERKTLEKEGKRLIEYRYFISSLKEEIETVS 293 Query: 297 HAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEK 356 A+R HW IE S+HW LDV EDA+ AA+ ++ I+K +L++L+ + + + Sbjct: 294 RAVRGHWSIE-SMHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 357 KEGCV 361 ++ Sbjct: 353 RKKRY 357 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats. Identities = 126/347 (36%), Positives = 186/347 (53%), Gaps = 11/347 (3%) Query: 22 VKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNI 81 V + L+ +L T+ +I A ++ EIE G E+L+WL+++ F++G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 DSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLG 141 D E F W++ G + AIDGKT+RGS GA+H+VSA+++E G+V+G Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGK 201 Q AKSNEITAIPELL+ L L ++TIDAMG QK IA+K+ DK ADY+LA+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKL 261 LH + F T I HGR E R V++ + + W GL + Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTC-IGHGRIEERTCQVADASAW-LTEQHSGWAGLASI 238 Query: 262 CVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDA 321 ++ R K R YISS D K +A R+HW +E++LHW LDV ED Sbjct: 239 AAVIATRTDK-KSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFREDE 297 Query: 322 SRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSS 368 R R+ +A ++ I+ A N+L+ K + R +++ Sbjct: 298 CRTRKDHAPLSLAIIRHAAFNMLKREPS-------KMSIKRKRLKAA 337 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 344 bits (882), Expect = 3e-93, Method: Composition-based stats. Identities = 136/378 (35%), Positives = 195/378 (51%), Gaps = 19/378 (5%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+IQ+ ++ PD R ++ + I+F+ + AVI GAD W EIE FG + K Sbjct: 1 MTIQAF---SAIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 IP DT++R S +D FE+ F W+ + G ++AIDGK I + DK Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKS 116 Query: 121 KR-----KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 + ++MVSA+S NG+ LGQ K E KSNE AIPEL+ L L+ +ITIDA+G Sbjct: 117 SNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIG 176 Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGR 232 CQK I I + KADY+L K N L + E F ++ S Y + + HGR Sbjct: 177 CQKSITKLIIENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKRYFEENKGHGR 234 Query: 233 KETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 E R + + L + F W G+K L + S R+ + ++ + RYYISS + D Sbjct: 235 SEYRECVCISAKNLQY--FLKGWTGIKTLAMINSIRKMGDKEAV--METRYYISSLEPDP 290 Query: 293 KEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKG 352 +IR HW +E++LHWVLD+ ED R + GNAA S I K+AL LL+ G Sbjct: 291 IIILKSIRPHWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQSDIKLG 349 Query: 353 EEEKKEGCVKHRERSSEV 370 K++ C + +V Sbjct: 350 MAGKRKACGWDEKIRDKV 367 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. Identities = 117/363 (32%), Positives = 188/363 (51%), Gaps = 14/363 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++ LL++ S D R + ++ H L IL L VC +A D+++ I +G L +L+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 + +G+P + + +++ ID F F W++ + +AIDGKT R S D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFP-GRADFVAIDGKTSRRSHDRRAG 130 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY----LKKNLITIDAMGCQK 178 IH+VSAF+ + +VL Q K+NE+ AIP LL+ L L L++IDA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLH 238 IA+ I+ + ADYLLAVK NQ L E F V +++ D + HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEER-- 244 Query: 239 IVSNVTRLNFCDFEFEWKG---LKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 VS + +++ + G L + + RY+ISS + A+ Sbjct: 245 HVSVIREVDWLSGTRRFPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHA 304 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 A A+R HW IE+ LHWVLDV +D SR+R G+ A+ ++ ++ ALNL+R D K + Sbjct: 305 ADAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQKSLKT 364 Query: 356 KKE 358 +++ Sbjct: 365 RRK 367 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 123/356 (34%), Positives = 181/356 (50%), Gaps = 7/356 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL D++S+ D R +H L +LFL + AV +G D W EI+ FG +LEWL+K+ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NGIP TIAR++ + + W+ + + IIAIDGKT+RG+ G Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLG--CN 119 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 +H V AF NG+ L Q K EI + L+ +L + K LIT+DA+ Q+ I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 +K DY++ VK NQ L A + ++ V + + F+ E HGR E R I + Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIP 237 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 + +W +K L R+ S + +Y+SS D+D + A A+R HW Sbjct: 238 SKLSPKLQEKWPSVKTLIAVERHRKIGNKTS---IETSFYLSSHDIDPEYIATAVRGHWR 294 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 IE+SLHWVLDV EDA R+ AE ++ +++MALNL + K + K Sbjct: 295 IENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHR 350 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 335 bits (860), Expect = 1e-90, Method: Composition-based stats. Identities = 114/369 (30%), Positives = 197/369 (53%), Gaps = 7/369 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L++++++ + R + KH L ++FL + A+++GA+ W +IE +G +++WL+++ F Sbjct: 8 TLIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPF 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NGIP T+AR++ I + + W+ E IIA DGK +RGSF +G K Sbjct: 68 ANGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKD 126 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+ +V+A+ ENG+VL Q T K EI + ++L++L LK ++T+DA+ CQ++ KI Sbjct: 127 ALQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKI 186 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 +KKA ++ VK NQ KL+ A + +F + + +E HGR+E R V + Sbjct: 187 SEKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEER--YVFQLK 244 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 + +W ++ + R + V YY+SS K H IR HW Sbjct: 245 AKLPPELTEKWPTIRSIIAVERHRSANGKGT---VDTSYYVSSLSPKHKLLGHYIRQHWR 301 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK-H 363 IE+S H++LDV NEDASRI +A E ++ ++ LN+++ + K + Sbjct: 302 IENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGWN 361 Query: 364 RERSSEVHF 372 + +++ F Sbjct: 362 DDYRAQLFF 370 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 333 bits (855), Expect = 5e-90, Method: Composition-based stats. Identities = 127/369 (34%), Positives = 194/369 (52%), Gaps = 8/369 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 S+ + D R KH I+FL V AVI+GA+ W EI+ FG L+WL+KY F Sbjct: 2 SVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPF 60 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 + GIPVDDTIARV+ I+ AF ++F+ ++ E E+IAIDGKT+R SF+ + + Sbjct: 61 ECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNP-ETQS 119 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H V+ +S G++L Q K+ K NE A+ E+++ LK +IT+DAM QK IA KI Sbjct: 120 ALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKIAEKI 179 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKF-PVNVFSNYKGDSFSTQEISHGRKETRLHIVSNV 243 +KK DY++ +K N + E F ++ +++ R + R + V Sbjct: 180 IEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYRKLKV 239 Query: 244 TRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW 303 + ++ EWKG+K + R +S E V +YISS D+D + A +R HW Sbjct: 240 S--DWLSKAEEWKGIKSVLEVCRKRSDNGKESQEKV---FYISSLDVDIQILAKCVRGHW 294 Query: 304 LIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKH 363 +E+ HWVLDV ED + AE ++ ++++ALNL R + + K Sbjct: 295 EVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHPKKQSMKGKLTAAGWS 354 Query: 364 RERSSEVHF 372 E E+ Sbjct: 355 DEFRDELLL 363 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 135/390 (34%), Positives = 196/390 (50%), Gaps = 31/390 (7%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + D + + D R KH+ S I+ + + AVI GAD W IEDFG + + Sbjct: 14 LHEFADSLILI-DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKL 72 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF----- 117 NGIP DT R S +D L FE+ + +W+Q + G IAIDGKTIRG++ Sbjct: 73 SNFNGIPSHDTFNRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQD 131 Query: 118 ----------DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKN 167 D K +H++SAF+ E GV LGQ+ T+ K NEI IPELL++L +K Sbjct: 132 KRHRKQGVLPDSNTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDC 191 Query: 168 LITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFP--VNVFSNYKGDSFST 225 +ITIDA+GCQ+ IA K+ + DY+ VK NQ KL V+ + + D + T Sbjct: 192 IITIDALGCQRTIAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYET 251 Query: 226 QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 E HGR E+R+ N D +WK ++ + R + + V R +I Sbjct: 252 HEEGHGRNESRICYCCNDPGFLGADIRKKWKNIQSFGYIENTRNTNKGTT---VEKRCFI 308 Query: 286 SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 SS + DA++ R HW IE++LHW LDV +ED +R RR +A S + K+AL LR Sbjct: 309 SSLEPDAQKILKNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLR 367 Query: 346 DCKDIKGEEEKKEGCVKHRE-RSSEVHFLY 374 + K ++ + R + FL+ Sbjct: 368 NNK-------REIPINRKRLIAGWDNEFLW 390 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 328 bits (841), Expect = 2e-88, Method: Composition-based stats. Identities = 115/381 (30%), Positives = 182/381 (47%), Gaps = 19/381 (4%) Query: 1 MSIQSLLDYISVTPDIRQQ-GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLK 59 M++ L + PD R + H L+ IL + CAVIAGA+ W++I ++G + + + Sbjct: 1 MALP-LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFR 59 Query: 60 KYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT--------DGEIIAIDGK 111 ++ + NG+P DT RV + +D AF F W E E T +A+DGK Sbjct: 60 RFLELKNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGK 119 Query: 112 TIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITI 171 + R S K G +H+V + + ++LGQ +EIT ++L L L ++T+ Sbjct: 120 SARRS-AKPTFSGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTL 178 Query: 172 DAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG-DSFSTQEISH 230 DA GCQ + I+ + +Y++ VKGNQ L A F + + G D ++ +H Sbjct: 179 DAAGCQTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAH 238 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 GR E R V + W G+ + + RQ K A + YY+SS + Sbjct: 239 GRHEERNVTVVHDPD----GLPAGWAGVGSVALVCRDRQVKGK--ANESTAHYYLSSLRV 292 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 A E A IR HW IE S+HWVLDV ED SR R G+A + I+++A++LL+ Sbjct: 293 GAAELAGYIRGHWHIE-SMHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAGKK 351 Query: 351 KGEEEKKEGCVKHRERSSEVH 371 ++ + ++V Sbjct: 352 GSIHTRRLRAGWDDQYMAQVL 372 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 328 bits (840), Expect = 3e-88, Method: Composition-based stats. Identities = 127/366 (34%), Positives = 190/366 (51%), Gaps = 21/366 (5%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + ++++++ D R+ K+KH LS I+ L A ++GA+ W EIE FG LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT---------DGEIIAIDGKTI 113 +NGIP DT+ RV + +D ++ W E ++AIDGKTI Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDA 173 RG + ++ A+H+V+A++ + G+ GQV T KSNEITAIPELL+++ +K +++IDA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRK 233 MG QK IA KI KKADY LAVK NQ L F S D + T E +HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFE---MSQEADDHYHTVEKAHGQI 240 Query: 234 ETRLH-IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 ETR + ++ +V+ L EF + R + E RY+I S + A Sbjct: 241 ETRAYEVIHDVSWLRKTHPEFG-----HIQSIGRARIHLDKNGQESEESRYFILSCQVSA 295 Query: 293 KEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKG 352 KE +R HW IE S+HW+LDV EDA++ A ++ + K L +L+ K Sbjct: 296 KELCDYVRGHWQIE-SMHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLKQLDFGKK 354 Query: 353 EEEKKE 358 +++ Sbjct: 355 MSMRRK 360 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 109/374 (29%), Positives = 177/374 (47%), Gaps = 18/374 (4%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 +++ L PD R V+H L +L + +V+ G+ E+ FG + + + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT-DGEIIAIDGKTIRGSFDK 119 + + IP DT + V ID A + F + + + ++ DG+IIAIDGK +RG+ D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 G+ MVSA+++ + L V + + E++A E L L+ L+ ++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHI 239 + I D+ LA+KGNQ L F S+ + T+ HGRKETR + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSD---PTAVTENTGHGRKETRKAV 244 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 V + L E+ GLK + R+ ++E RY+ S + A+ Sbjct: 245 VVSAKALAEYH---EFPGLKGFGRIEATRETGGKVTSET---RYFALSWVPTPEVLLAAV 298 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 R HW IE++LHW LDV EDA+R R+ N I+ +++ AL++LR KG K Sbjct: 299 RDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKI- 356 Query: 360 CVKHRERSSEVHFL 373 + + FL Sbjct: 357 ----KRAGWDTTFL 366 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 325 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 107/377 (28%), Positives = 174/377 (46%), Gaps = 22/377 (5%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 + + PD R G H L+ ILF+ + A + GA ++ F + + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI----TDGEIIAIDGKTIRGSF 117 NG+P DT +RV +D AFEK F +M+ + +IA+DGK +R + Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 + G+ MV+A++ + + L V+ NE +L+ LL LK ++T DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL 237 + +A IK + DY+LAVK NQ L + S T + HGRKE R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQGK--PSTITVDAGHGRKEKRR 239 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 +V+ V ++ + ++ GLK + S R + RY++ S+ K+ Sbjct: 240 AVVAAVPQMAQ---DHDFAGLKAVARITSKR------GTDKTVERYFLMSQAYPPKDVLR 290 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 +R HW IE+SLHW LDV ++ED +R R+ NA ++ ++++ALN+ R D K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 EGCVKHRERSSEVHFLY 374 + FL+ Sbjct: 351 L-----KRAGWNDTFLF 362 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 323 bits (829), Expect = 5e-87, Method: Composition-based stats. Identities = 106/374 (28%), Positives = 178/374 (47%), Gaps = 10/374 (2%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +++L + D RQ GKV+H++ +L + C+ + + + ++ DF +L WL+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 +G P D V+ I A ++ W +G IAIDGK +RG+ + Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGW----CGDLEGRHIAIDGKALRGTHNAETG 116 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 + +H++ A+ ++ + GQ+ KSNEI AIP LL L LK +TIDAMG Q IA Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFS---TQEISHGRKETRLHI 239 +I ADY+LA+K N + H + F + T E+SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 ++ L++ ++W GL+ + Q+ D + Y++ S D + A + Sbjct: 237 IT--EELDWYHKSWKWAGLQSVAQVRRQVQRSHD-GPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 R HW +E+ HWVLDV NED ++R NAA ++ +++M + L K++ Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHPAKVSLRRKRKL 353 Query: 360 CVKHRERSSEVHFL 373 ++ L Sbjct: 354 ATMDPAFRLQMLGL 367 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 323 bits (827), Expect = 7e-87, Method: Composition-based stats. Identities = 108/353 (30%), Positives = 176/353 (49%), Gaps = 9/353 (2%) Query: 10 ISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIP 69 PD R +H L +L + + A I GA+ + F +R +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMV 129 DT +RV +D +AF + F +++ E G ++AIDGKT+R SFD+ + A+H+V Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFLDHLGEDGAG-VLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKA 189 SAF++ +++GQ A NEI A LL L LK L+T DA+ Q+ A I ++ Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFC 249 D+L +K N+ L E F + T + HGR E R H VS+ Sbjct: 188 DWLFPLKDNRPALRAEVERYF--ADPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWLAS 245 Query: 250 DFEFE----WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 D F GLK L + + ++ + Y+SS ++ K A A+RAHW I Sbjct: 246 DRRFPDEAVLPGLKILGLVERTVTSPDGRT--TATRTLYLSSAALEPKTLARAVRAHWSI 303 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 E ++HWVLD +ED +R R+ + E ++ ++K+ALN++R + +++ Sbjct: 304 EAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANNQDSIRLRRK 356 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 317 bits (811), Expect = 6e-85, Method: Composition-based stats. Identities = 111/300 (37%), Positives = 155/300 (51%), Gaps = 7/300 (2%) Query: 8 DYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG 67 + PD R + + +H LS +L + VCAV+ GA+++ ++ +G L WL+K+ G Sbjct: 11 EVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKFLKLKAG 70 Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQE-CHEITDGEIIAIDGKTIRGSFDKGKRKGAI 126 +P DT RV++ ID AFE F+ W+ + ++AIDGKT R S K G + Sbjct: 71 VPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGK-DTSGPL 129 Query: 127 HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKD 186 HMVSAF+ G+VLGQ T+ KSNEITAIPELL +L L+ ++TIDAMG Q IA I+ Sbjct: 130 HMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAIARTIRS 189 Query: 187 KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRL 246 + ADY+L VK N L + F Q HGR E R + + Sbjct: 190 RGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWAYDA--V 247 Query: 247 NFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIE 306 + +W GL+ + R S V YYISS DA A A+R+HW +E Sbjct: 248 SQLYKSEQWAGLQSFALVERERTVDGKTS---VERHYYISSLPADAARIAQAVRSHWAVE 304 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 111/363 (30%), Positives = 176/363 (48%), Gaps = 9/363 (2%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 +L ++ D R +H + I FL + AVI+GA W +FG LEWL+KY F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 NGIP +I R+ + + + W+ E T IAIDGK ++G+ A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKGAKAS-ASSAA 119 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 +HMV+A+ +G+V +K +E+ + ELL L LK L+T DA+ CQ I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTR 245 + D +L VKGNQ KL+ A + +F + +N + F+ HGR E R+ + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLN- 238 Query: 246 LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 + + +W LK L R+ S + +Y+SS + ++ F AIRAHW Sbjct: 239 -LPAEIKMKWSQLKTLIAVERHRKVGNKTS---IDTHFYVSSAVLTSEAFGRAIRAHWQT 294 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK--EGCVKH 363 E++ HW+LD ED ++ + A I++ +++ ALNL++ K + +K C Sbjct: 295 ENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHPA-KTSQTQKFNRACWSD 353 Query: 364 RER 366 R Sbjct: 354 DFR 356 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 313 bits (801), Expect = 8e-84, Method: Composition-based stats. Identities = 106/369 (28%), Positives = 170/369 (46%), Gaps = 18/369 (4%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 +L PD R + +H L +L + +V+ GA E+ FG + + + Sbjct: 37 ILSAFEDVPDPRAE-NTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT-DGEIIAIDGKTIRGSFDKGKRKG 124 + +P DT + V ID A + F + + + DG++IA+DGK +RG+ D G+ Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 MVSA++ + L V + + E+ A E L L+ LK ++T DA+ C + + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 D+ LA+K NQ L F + S +++I HGR ETR V + Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEPDA---HPSALSEDIGHGRTETRKATVVSSK 271 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 L E+ GLK + R+ E ++E RY+ S + +RAHW Sbjct: 272 ALAE---HHEFPGLKAFGRVEATRKTAEGTTSET---RYFALSWVPTPEVLLATVRAHWA 325 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHR 364 IE+SLHW LDV EDA+R R+ N+ I+ +++ AL+++R KG K + Sbjct: 326 IENSLHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKL-----K 379 Query: 365 ERSSEVHFL 373 + FL Sbjct: 380 RAGWDDDFL 388 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 310 bits (795), Expect = 4e-83, Method: Composition-based stats. Identities = 124/370 (33%), Positives = 191/370 (51%), Gaps = 16/370 (4%) Query: 7 LDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDN 66 +D +V D RQ K+++ LS ILFL +AG + +E+EDF Y D Sbjct: 11 IDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSE 70 Query: 67 GIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-GEIIAIDGKTIRGSFDKGKRKGA 125 G P DT+ RV+S ++S +++ +++ Q + ++I++DGKTIRG ++GK + Sbjct: 71 GCPSHDTLERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG--NRGKNQKP 128 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 +H+V+A+ + + LGQV E KSNEI AIP+LL + ++K+++TIDAMG Q I I Sbjct: 129 VHIVTAYDGGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTII 188 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFP---VNVFSNYKGDSFSTQEISHGRKETRLHIVSN 242 KADY LAVKGNQ L+ F + + T E S G+ E R + VS+ Sbjct: 189 KGKADYCLAVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSS 248 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 + C +W L+ + + R + RY+I S D FA+ +R H Sbjct: 249 DIKW-LCQNHPKWHKLRGIGM---TRNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRGH 304 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 W IE S+HW+LDV +ED + AA ++ I+KM L L+ KK+ + Sbjct: 305 WQIE-SMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVM-----VFPKKDLSYR 358 Query: 363 HRERSSEVHF 372 ++R VH Sbjct: 359 RKQRYISVHL 368 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 304 bits (779), Expect = 3e-81, Method: Composition-based stats. Identities = 110/364 (30%), Positives = 184/364 (50%), Gaps = 13/364 (3%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL ++ + D R +KH L ++FLT+ A+++GA W+ IE FG +L+WL+ Y F Sbjct: 2 TLLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 ++GIP IA ++ ++DS + W+ + T IIA+DGKT+R ++ Sbjct: 62 EHGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAW-ADDIHQ 120 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H+VSAF NG+ L E K +E ++++ L L ++T+DA+ CQK KI Sbjct: 121 ALHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKI 180 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 KK+D+++ +KGNQ A + + + HGRKE R V + Sbjct: 181 ISKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRR--VMQIE 237 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 + +W ++ L S R + S R+Y+SS +D + A IRAHW Sbjct: 238 GNLPPELSEKWPHIRTLVEVASERTVGNKTAC---SSRWYVSSLPVDTAQLADIIRAHWA 294 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHR 364 IE+ LHWVLDV ED + + A+ ++ + AL++++ + KK+ R Sbjct: 295 IENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQ------HQGKKDSLAAKR 348 Query: 365 ERSS 368 + ++ Sbjct: 349 QSAA 352 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 303 bits (775), Expect = 1e-80, Method: Composition-based stats. Identities = 118/405 (29%), Positives = 185/405 (45%), Gaps = 48/405 (11%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + S+ + I D R++ KV + I+ +T+ V W +I DF + ++L+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEW--------------------MQECHEITD 102 P DT+ R I + E + EW + E +++ Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 103 GEIIAIDGKTIRGSFDKGK--------------RKGAIHMVSAFSNENGVVLGQVKTEAK 148 IAIDGKTI G+ + K +H+VSAF ++ + LGQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNLLYLK-KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFE 207 NEI AIP+LL+ + ++ +++TIDA+G QK I KI +K+ADYLL VK N KL E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPVNVFSNYKGDSFSTQE---ISHGRKETRLHI-VSNVTRLNFCDFEFEWKGLKKLCV 263 + S + D E HG TR I S +RL FC +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCY--RDWKNLRTYGI 314 Query: 264 ALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASR 323 + +ISS + + R HW +E+ LHW LDV NED R Sbjct: 315 IK-TEKINIATGEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR 373 Query: 324 IRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSS 368 + N+A+ S + KMAL +L++ +D E+KK + R+++ Sbjct: 374 -KMMNSAQNFSTLTKMALTILKNYQD----EDKKTSVNRKRKKAG 413 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 100/347 (28%), Positives = 161/347 (46%), Gaps = 13/347 (3%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 LD PD R +H L ILF+ + AV+ GA E+E F RL+ L+++ + Sbjct: 3 FLDVFGEVPDPRDL-TAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI----TDGEIIAIDGKTIRGSFDKGK 121 G P DT +RV++ +D +A + F+ +M E +A+DGK++R ++ KG+ Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 +V+ F + + L Q + E+ A L LL LK +T DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 ++D Y++A+KGNQ KL + T+E +HGR E R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALD-KAAAGKATKFHQTEEDAHGRHEVRRAFV- 238 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 + L + S+R + + + +R Y S+ M A E +R Sbjct: 239 --IPFAQTPGKNALVDLCAIGRVESWRTVEGKTTHK---VRCYALSRKMPAHELLATVRR 293 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCK 348 HW IE+ LHW LDV + ED R R+ N A + ++++ LN+LR Sbjct: 294 HWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRADP 340 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 297 bits (761), Expect = 4e-79, Method: Composition-based stats. Identities = 108/322 (33%), Positives = 159/322 (49%), Gaps = 4/322 (1%) Query: 38 IAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC 97 +A A+ W++IE +G + WL+ + NGIP DT RV +D+ AFE+ F +Q Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPE 157 E++A+DGK++R S G +H+VS +++ G+ LGQ + KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVF-S 216 LL L L ++T+DAMGCQ IA +I+ K AD LL +K N G + A F S Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 217 NYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSA 276 G HGR R V W L ++ + R + Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFVDAAAT--ALAPLSGWPDLSRVLAVETLRGIPGTGTV 241 Query: 277 EGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGI 336 IRY+++S D IR HW +E++LHWVL+V ED SR+R AA + + Sbjct: 242 -VADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNFALV 300 Query: 337 KKMALNLLRDCKDIKGEEEKKE 358 +K+ALNL+ + + + Sbjct: 301 RKIALNLIAQDRSTQASLRGRR 322 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 295 bits (756), Expect = 1e-78, Method: Composition-based stats. Identities = 115/363 (31%), Positives = 193/363 (53%), Gaps = 16/363 (4%) Query: 9 YISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG- 67 I+V D R QG++ + L IL +++ A I+G D+W++IED+ + E L+ +G Sbjct: 8 AIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGK 67 Query: 68 ------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGK 121 +P DT V ID F +++ +++ +E G+ IAIDGKT RG + Sbjct: 68 ELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG-IKQTA 126 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 ++VSA+ ++ V+ + +E K +E+++I +L+ LL+L+ N +TIDA G ++ Sbjct: 127 NSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVI 186 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL-HIV 240 I K +++L VKGNQ KL E++F + D + ++I HGR E R + + Sbjct: 187 EMILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSAD--TQEDIGHGRVEKRTVYCI 244 Query: 241 SNVTRLNFCDFE-FEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 + + + D +WKG+K L + KK DKS + YYI++ +D KE AI Sbjct: 245 TEIKTDDDIDGCMQKWKGVKTLVKIVREVYKKADKSTR-IETVYYITNL-IDPKEINRAI 302 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKG--EEEKK 357 RAHW IE++LH LDV +NED S+ N E + +AL ++++ +G + Sbjct: 303 RAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRGISMNRTR 362 Query: 358 EGC 360 + C Sbjct: 363 KLC 365 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 290 bits (742), Expect = 6e-77, Method: Composition-based stats. Identities = 126/375 (33%), Positives = 185/375 (49%), Gaps = 19/375 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 SL + + + P R + K + L +L + + ++G W EIED+ E E LK + Sbjct: 3 HSLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYE 62 Query: 64 FDNG------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 G +P DT+ R +S +D AFE + W++ T G+ I IDGKT+RG Sbjct: 63 MLTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-V 121 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 K H+VSAFS ++ L Q+ + K+NEI AI +LL+LL L +++IDA+G Q Sbjct: 122 KKLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQ 181 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL 237 I +I DK DY+L VK NQ E F + D E+SHGR ETR Sbjct: 182 TAIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRR 239 Query: 238 H-IVSNVTRLNFCDFEFEWKGLKKLC-VALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 + + N + + KGL+ + V R KK DK++E + YYISS D Sbjct: 240 YESILNPLEIEANEVLTRRKGLRSIHKVVRKRRDKKSDKTSE--EVAYYISSLT-DVSSL 296 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 AIR HW IE+ LH LDV DAS R N A+I+ I+K+ L ++ K Sbjct: 297 KQAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT-----N 351 Query: 356 KKEGCVKHRERSSEV 370 K + +++ + + Sbjct: 352 MKSSIPRIQKKPARM 366 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 285 bits (729), Expect = 2e-75, Method: Composition-based stats. Identities = 111/378 (29%), Positives = 172/378 (45%), Gaps = 47/378 (12%) Query: 30 LFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKM 89 + +T+ V W +I DF + ++L+++ P DT+ R I + E Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FIEW--------------------MQECHEITDGEIIAIDGKTIRGSFDKGK-------- 121 + EW + E +++ IAIDGKTI G+ + K Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLK-KNLITIDAM 174 +H+VSAF ++ + LGQ + K NEI AIP+LL+ + ++ +++TIDA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQE---ISHG 231 G QK I KI +K+ADYLL VK N KL E + S + D E HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 RKETRLHI-VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 TR I S +RL FC +WK L+ + + +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCY--RDWKNLRTYGIIK-TEKINIATGEIQNEKHCFISSLVN 297 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 + + R HW +E+ LHW LDV NED R + N+A+ S + KMAL +L++ +D Sbjct: 298 NPELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQD- 355 Query: 351 KGEEEKKEGCVKHRERSS 368 E+KK + R+++ Sbjct: 356 ---EDKKTSVNRKRKKAG 370 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 282 bits (721), Expect = 2e-74, Method: Composition-based stats. Identities = 104/365 (28%), Positives = 168/365 (46%), Gaps = 13/365 (3%) Query: 7 LDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDN 66 L + D R +H L+ +LFL + A + GA EI +F R LK+ + Sbjct: 5 LSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT-----DGEIIAIDGKTIRGSFDKGK 121 G P DT +R+ ID + ++ + ++A+DGK +R ++KG+ Sbjct: 64 GCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGR 123 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 MVS + E + + + S+E+ A LL + LK ++T DA+ C+ D A Sbjct: 124 AFMPPVMVSVWDAETRLSVATKR-AEGSDEVAATLALLKSIDLKGCIVTADALHCRPDTA 182 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 + +KA Y LA+K N+G+L E F + T+E HGR ETR V Sbjct: 183 KALIGRKAHYALALKANRGRLFACAEAGFVAADAAG-DLAFHETRETGHGRLETRRASVL 241 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 L + GLK + + RQ + ++ S+RY SK + + A +RA Sbjct: 242 P---LKAFKQAPAFPGLKAIGRIQATRQGADGRAV--TSVRYIALSKVLAPHKLAEVVRA 296 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 HW IE+ LHW LDV +ED +R R+ NA + ++ I+++A ++L K K Sbjct: 297 HWTIENQLHWSLDVVFHEDDARSRKDNAPQNLAVIRRLARDILAAHPLDKPIASKMRRVN 356 Query: 362 KHRER 366 +R+ Sbjct: 357 WNRDF 361 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 279 bits (714), Expect = 1e-73, Method: Composition-based stats. Identities = 94/365 (25%), Positives = 172/365 (47%), Gaps = 22/365 (6%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + LL + PD R + +++L +++ + +CAV AGA + I D+ + + Sbjct: 43 MPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQRC 102 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI-IAIDGKTIRGSFDKGK 121 +P + TI +V +D A +++ + + +A+DGKTIRG+ + Sbjct: 103 GIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RIG 160 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 ++ A H+V+A ++ + VVLGQ +T KSNEI + LL + + ++T+DAM QK A Sbjct: 161 KQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKATA 220 Query: 182 SKIKDK-KADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 ++++ +A+Y++ VK NQ L ++ V + E HGR+E R + + Sbjct: 221 RCLREQCRAEYVMIVKANQPGLLARVRDQPWEQVPVVWSDP----VERGHGREEHRSYKI 276 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSK---DMDAKEFAH 297 V R + + + + R++ A + Y I S K A Sbjct: 277 LTVARGLRFPYAQQ-------VIQIIRRRRVLGAGAWSTEVVYAICSLPCEQAPPKLLAS 329 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 IR HW IE+ +H+V DV +ED S +R G+ ++++ ++ + + L R G Sbjct: 330 WIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHRRA----GHSNIA 385 Query: 358 EGCVK 362 C + Sbjct: 386 RACRR 390 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 275 bits (702), Expect = 2e-72, Method: Composition-based stats. Identities = 100/372 (26%), Positives = 165/372 (44%), Gaps = 22/372 (5%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 I LL+ ++ PD R V+H L+A+L LT CAV+AGA + ++ E E L + Sbjct: 39 IPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLERL 98 Query: 63 DFD-------NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-EIIAIDGKTIR 114 P + TI RV++ ID+ A ++ W+ + G +A+DGK++R Sbjct: 99 GIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSLR 158 Query: 115 GSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKNLITIDA 173 G+ R+ +H+++A + G+VL Q+ K+NEIT LL+ L L ++T DA Sbjct: 159 GAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSDA 216 Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRK 233 + Q D A+ ++ + Y++ VK N KL + + + T+ HGR Sbjct: 217 LHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKSLPWQQIPLQDR-----TRTTGHGRC 271 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAK 293 E R V V L F V + S + + ++++ Sbjct: 272 EIRRLKVCTVNNLLFPGARQ-----AVQIVRRRVNRTTGKVSLKTIYAVTSLAAEQAPPA 326 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGE 353 A IR HW +E +LH V DV EDAS++R GNA + ++ + +A+ LR Sbjct: 327 RVAQLIRGHWTVE-ALHHVRDVTFAEDASQLRSGNAPQAMATYRNLAIGALRLAGVRNIA 385 Query: 354 EEKKEGCVKHRE 365 + Sbjct: 386 AGLRRTARDQTR 397 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 274 bits (700), Expect = 4e-72, Method: Composition-based stats. Identities = 103/286 (36%), Positives = 158/286 (55%), Gaps = 9/286 (3%) Query: 9 YISVTPDIRQQG-KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG 67 + + PD R+ H LS IL + +CAV++G D+W+ + +FG + WL+++ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-EIIAIDGKTIRGSFDKGKRKGAI 126 IP DT RV S ID AFE F +W D + +A+DGKT+R S +G A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSH-RGSAGRAL 135 Query: 127 HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKD 186 H++ A+S E +++ Q + + KSNEITAIP++L+L L+ I+IDA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRL 246 DY+LA+KGNQ LH + +G + E HGR ETR V++ + Sbjct: 196 AGGDYVLALKGNQSALHDDVRLFMETQADRHPQGQA-EAVEKDHGRIETRRIWVND--EI 252 Query: 247 NFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 ++ + +W GLK L + S R+ S E R +I+S D Sbjct: 253 DWLTQKPDWPGLKTLVMVESRRELNGQVSCE---RRCFITSHTADP 295 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 273 bits (699), Expect = 5e-72, Method: Composition-based stats. Identities = 102/360 (28%), Positives = 165/360 (45%), Gaps = 16/360 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHE-RLEWLKKY 61 + SL++ + D R+ +H L +L + + + G ++E+ +F R +++ Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEW-MQECHEITDGEIIAIDGKTIRGSFDK- 119 +P TI RV+ ++ KMF EW ++E + D + +DGK+++ + Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -GKRKGAIHMVSAFSNENGVVLGQVKTEAK-SNEITAIPELLNLLYLKKNLITIDAMGCQ 177 +++ I VS FS E+G+VL + E K +EI ++ L+ + T DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL 237 K S I K DY++ VKGNQ L+ ++ + + F Q+ SHGRK +R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDLSNSSKPES----CFLEQDNSHGRKISRK 236 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 V V + +G + L + +K YYISS A+ FA Sbjct: 237 IEVFKVRKNER-------QGFENLRRVIKVERKGSRGDKTYEETAYYISSLTESAQVFAK 289 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 IR HW IE+ LHWV DV ED S I AA S + + LNL R + E ++ Sbjct: 290 IIRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFRGLGFLSITEGQR 349 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 269 bits (687), Expect = 1e-70, Method: Composition-based stats. Identities = 105/359 (29%), Positives = 173/359 (48%), Gaps = 17/359 (4%) Query: 3 IQSLLDYISVTPDIRQ--QGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L +++ PD R+ +G K+KL IL L + + +I FG L+ + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT---DGEIIAIDGKTIRGSF 117 G +G+P + T+ R+ +ID A + E+ H+ G+I+ IDGK +RG+ Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 + R I VSA+S E GV L E KSNEIT++P+LL+ + + ++T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL 237 K I KI++K D+L+ +K NQ L + E+ + + + HGR ETR+ Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDV---YSEGPFLEHGRIETRV 252 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 + + +W G + + ++K D + R+Y+SS A+ Sbjct: 253 CRIF--RGNDLITDREKWNGNLTVVEIRTATERKSD-GQKSSERRFYVSSFHGSARRLGT 309 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEK 356 R HW IE S+HW LD + +D R +A + I++M L +L KG+ +K Sbjct: 310 IARMHWAIE-SMHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAIL---SIWKGKRKK 364 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 269 bits (687), Expect = 1e-70, Method: Composition-based stats. Identities = 105/350 (30%), Positives = 169/350 (48%), Gaps = 18/350 (5%) Query: 3 IQSLLDYISVTPDIRQQ--GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L ++ S PD R+ G ++HKLS I+ L + ++ EI +FG L+ +K Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-----EIIAIDGKTIRG 115 NGIP + T+ R+ ID A + + H+ G EII IDGK RG Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 + K R I VSA S + L E KSNEI A+P L++ + + ++T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF-PVNVFSNYKGDSFSTQEISHGRKE 234 QKDI KI++K D+++ +K NQ L + E+K ++ +Y G+ E+ HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKELSPVYSYCGEP----ELGHGRIE 268 Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 TR + V + T L + +W G + + K+ R ++SS + Sbjct: 269 TRSYRVFDGTDLIA--NKEKWNGNLTI-IEYECETVKKSTGNCTTEKRLHVSSLPANTPR 325 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 +R HW IE S+HW LD + +D + + AA + I+++ ++ Sbjct: 326 LGTPVRNHWSIE-SMHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVF 374 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 267 bits (682), Expect = 5e-70, Method: Composition-based stats. Identities = 94/246 (38%), Positives = 141/246 (57%), Gaps = 3/246 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L+D+ D R + HKL I+ + +CA+I GAD + +E +G+ + EWLK++ + Sbjct: 8 TLIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLEL 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +NGIP DT ARV + ID FE+ F +W+ E+ G+++ IDGKT++ S +K + K Sbjct: 68 ENGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKK 127 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 AIH+V+A+++E +VL Q K ++ EITAIP L+ +L L L+TIDAMG Q DIA + Sbjct: 128 AIHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELL 187 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKETRLHIVS 241 K ADY LA+KGNQ L +E F + + + T E R E + Sbjct: 188 HSKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAYRT 247 Query: 242 NVTRLN 247 RL Sbjct: 248 EQERLW 253 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 266 bits (681), Expect = 7e-70, Method: Composition-based stats. Identities = 105/360 (29%), Positives = 161/360 (44%), Gaps = 41/360 (11%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++S+ + + D RQ+ KV H+ I+ + V A W E+ DF ER+++++K+ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEW--------------------MQECHEITD 102 P DT+ R + A E+ + W + E E Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 GEIIAIDGKTIRGSFDKGKRK--------------GAIHMVSAFSNENGVVLGQVKTEAK 148 IAIDGKTI+ + ++ +R+ +H+VSAFS ++ + LGQ + + K Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNLLYL-KKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAF- 206 NEI AIP LL+ L + + +++TIDAMG QKDI S+I K+A YLL VK NQ L Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVA 264 F N E HG R V + + +W+ L+ + Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLH-SLGKIYKDWENLRSYGLI 315 Query: 265 LSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRI 324 + E V Y+ISS + D ++ R HW IE+ LHW LD+ ED R+ Sbjct: 316 -RTERVDEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 266 bits (681), Expect = 8e-70, Method: Composition-based stats. Identities = 102/209 (48%), Positives = 138/209 (66%), Gaps = 1/209 (0%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 + Q L PD R+ K + L +IL + + +VI GAD W E+E++ + + E+L+ + Sbjct: 3 TTQKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSF 62 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGK 121 D NGIP DT RV SNIDS FEK FI+W+ ++ EIIAIDGKTIRG+ G Sbjct: 63 LDLPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGG 121 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 +K +HMVSA++N+N +VLGQVK KSNEITAIP+LL +L ++ ++TIDAMGCQ IA Sbjct: 122 KKSPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIA 181 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKF 210 I K ADY+LAVK NQ +L E++F Sbjct: 182 KAIVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 95/370 (25%), Positives = 159/370 (42%), Gaps = 31/370 (8%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEI----EDFGHERLEW 57 I LL + D R+ + LS +L + A +AGA +EI DFG + L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKKYGDFDNG---IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE--IIAIDGKT 112 L D G P + I + +D A + F W+ GE ++A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 IRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKNLI-T 170 +RG++ +G ++ + ++SA + G+V GQV+ +NEIT + LL L + ++ T Sbjct: 141 LRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 IDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISH 230 +DA+ Q + A + + DY L VKGNQ L+ + F + K +E H Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLY---RKTFEQTLPLLQKPPQHEVEERGH 255 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYY-----I 285 GR + E + G ++ A R+ + D VS Y + Sbjct: 256 GRI----------KKWQAWTTEAKGIGFPEVATAAVIRRDEFDLKGIRVSREYAHILTSV 305 Query: 286 SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 + A IR HW IE+ +H+ D EDA++ GN+ ++ + +A+ ++R Sbjct: 306 AGNRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGIIR 365 Query: 346 DCKDIKGEEE 355 K +E Sbjct: 366 RNGIRKIKET 375 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 257 bits (656), Expect = 6e-67, Method: Composition-based stats. Identities = 97/241 (40%), Positives = 144/241 (59%), Gaps = 8/241 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +Q LL+++ D RQQ KV+H L IL + + A +A AD+W E+ F + ++L+KY Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD---GEIIAIDGKTIRGSFDK 119 + NG P DT+ RV+ + ++++ +W + + +II IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 G++ G H+VSA+S E+G LGQ KSNEITAIPELL + +K ++TIDAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS---NYKGDSFSTQEISHGRKETR 236 IA KI++K+ADY+L++K NQG L+ E F F +G TQE +HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 L 237 Sbjct: 239 E 239 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 81/379 (21%), Positives = 141/379 (37%), Gaps = 37/379 (9%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++ LL+ + PD R++ V+ L +L L + AV GA + EI + + L Sbjct: 32 VEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAAF 91 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG--------------EIIAI 108 P T RV+ D A ++ W Q +I+ Sbjct: 92 GLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVISA 151 Query: 109 DGKTIRGSFDKGKRKGAI--HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL---- 162 DGKT+RG+ + +V + +G V+ + +EI A+ ++ L Sbjct: 152 DGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVA-CEPVNDGDEIGAVRTVMGRLADRW 210 Query: 163 -YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGD 221 L ++ DA Q + ++ +LL VK NQ ++ V + Sbjct: 211 GSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRILAKVRALPWAQVRAQD--- 267 Query: 222 SFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSI 281 + + +HGR ETR V + G ++ +++ A S Sbjct: 268 --TCRGKAHGRAETRTVRVVQAP----THVDLALAGTAQVIKITRHTRRRPHPGAPAAST 321 Query: 282 R---YYISSKD---MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISG 335 R Y ++S D A +R+HWLIE+ +HWV D +ED R GN ++ Sbjct: 322 RENAYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPINLAC 381 Query: 336 IKKMALNLLRDCKDIKGEE 354 ++ A+ R + Sbjct: 382 LRNTAITRHRAHGASNIAK 400 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 252 bits (644), Expect = 2e-65, Method: Composition-based stats. Identities = 88/277 (31%), Positives = 140/277 (50%), Gaps = 10/277 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 I+ L++ + D R GK++H+L IL + VCAV+A A+ +++I +G + WL + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECH------EITDGEIIAIDGKTIRGS 116 D GIP DT RV ID AFE+ F+ W + E + E IA+DGK +R S Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 FD+ + +H+VSA++ G+VL Q + K E A+P +L L+L L+++DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG--DSFSTQEISHGRKE 234 ++++A I + A YLL +K NQ K+H F N F++ + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKK 271 R V W GL + + + R + Sbjct: 242 RRR--VFACPDAGCFTTLRGWPGLTTVLASETIRADR 276 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 247 bits (631), Expect = 4e-64, Method: Composition-based stats. Identities = 93/236 (39%), Positives = 143/236 (60%), Gaps = 6/236 (2%) Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 ++H+V+A+ +++ ++LGQVK + KSNEITAIP+LL +L+L+ ++TIDAMGCQK IA + Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKF-PVNVFSNYKGDSFSTQEISHGRKETRLHIVSN 242 I KKADY+LAVK NQ +L+ + F V ++ + T + HGR ETR + S Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREY--ST 118 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 + + W L + + S R+ S E RY+I S + A+ F A+R H Sbjct: 119 IVGDDLLAGITGWDNLNAIGMVESKREVGNTISNEK---RYFIMSINGHAQRFGDAVREH 175 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 W IE+++HWVLDV ED SRIR+ N+ E +S ++K+ALN ++ + K++ Sbjct: 176 WGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQESTKTSMKRKRK 231 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 247 bits (630), Expect = 5e-64, Method: Composition-based stats. Identities = 86/363 (23%), Positives = 145/363 (39%), Gaps = 50/363 (13%) Query: 29 ILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEK 88 +L + G + + LE L+K+ GI TI R++ ID Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 MFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAK 148 F+EW+ E + + +A+DGK + G+ +K K + +++ G++L Q+ ++K Sbjct: 61 AFMEWVGEIVD-SRNTHLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEE 208 +NEIT IPELL LL + +++TIDA+G Q I +I ++ + L VK NQ + + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPVNVFSNYKG-----------------DSFSTQEISHGRKETRLHIVSNVTRLNFCDF 251 ++ + + E + R E R + N Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICK-DASNLTKS 238 Query: 252 EFEWKGLKKLCVALSFR-----------------------------QKKEDKSAEGVSIR 282 + EW ++ + R E+ + + V Sbjct: 239 QKEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCT 298 Query: 283 YYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALN 342 IS + A+E R HW IE+ LH VLD ED S ++ +S I+K A N Sbjct: 299 ALISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSR--NNLSLIRKYAYN 356 Query: 343 LLR 345 +LR Sbjct: 357 ILR 359 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 244 bits (622), Expect = 5e-63, Method: Composition-based stats. Identities = 85/406 (20%), Positives = 153/406 (37%), Gaps = 54/406 (13%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIA-GADEWQEIEDFGHERLEWLKKYGDF 64 L+D ++ D R +H L++IL + CA +A G D IE + + + Sbjct: 30 LIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLHI 89 Query: 65 D-------NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI------------ 105 + P + TI RV++ +D + ++ + E Sbjct: 90 WRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRRT 149 Query: 106 ---------------------IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVK 144 A+DGK ++G+ + G +H++S ++ + V Q + Sbjct: 150 EREARRAAHRSPTPAPGLLPAYAVDGKRLKGA--RHPDGGRVHLISLAAHLDATVHAQRQ 207 Query: 145 TEAKSNEITAIPELLNLL---YLKKNLITIDAMGCQKDIAS-KIKDKKADYLLAVKGNQG 200 AKS+EI A+ LL L +IT DA+ Q+ A I++ A Y++ VK NQ Sbjct: 208 IPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQP 267 Query: 201 KLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKK 260 LH +++ + HGR E R+ + ++F ++ L+ Sbjct: 268 TLHATAITAL-TGTDTDFAAVTHRETHRGHGRTEYRILRTAPADGIDFPYAAQVFRVLRH 326 Query: 261 LCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW-LIEHSLHWVLDVKMNE 319 R K E ++++ A +R HW IE+ +H V DV E Sbjct: 327 RGGLDGIRHSK-----EVCYGITDLTARQAGPAHLAAYVRGHWKAIENGVHHVRDVTFAE 381 Query: 320 DASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 DA + R ++ + +A LR + ++E H+ Sbjct: 382 DACQARTATLPRALAAFRNLATGTLRRAGHVNIAHARREHGYDHQR 427 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 238 bits (606), Expect = 3e-61, Method: Composition-based stats. Identities = 95/379 (25%), Positives = 157/379 (41%), Gaps = 29/379 (7%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEW-LKKY 61 +Q L D ++ PD R ++H+L IL L+ AV AG +EI + L Sbjct: 39 VQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTAL 98 Query: 62 GDFDNGI------PVDDTIARVVSNIDSLAFEKM---FIEWMQECHEITDGEIIAIDGKT 112 G + + P DT+ RV+S +DS A + F ++A+DGKT Sbjct: 99 GARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGKT 158 Query: 113 IRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL----YLKKNL 168 +RG+ G A H+++ + GVVL + + AK+NE+TA LL L L + Sbjct: 159 LRGA--AGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGVV 216 Query: 169 ITIDAMGCQKDIASKIK-DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQE 227 +T DA+ + A I + A ++ VK N L + S + Sbjct: 217 VTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIP----IGHSAEG 272 Query: 228 ISHGRKETRL---HIVSNVTRLNFCDFEFEWKGLKKLCVALSF-----RQKKEDKSAEGV 279 +HGR E R S R + + + + ++ R + S V Sbjct: 273 RAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIPSTVTV 332 Query: 280 SIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKM 339 + ++ + + A R HW IE+ +HWV DV EDASR+R G I++ ++ + Sbjct: 333 HVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNL 392 Query: 340 ALNLLRDCKDIKGEEEKKE 358 + L+R + + Sbjct: 393 IIGLIRLAGHNRIAPTIRR 411 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 237 bits (604), Expect = 5e-61, Method: Composition-based stats. Identities = 83/238 (34%), Positives = 123/238 (51%), Gaps = 7/238 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 ++D D R K HK+ I+++++ AVI GA W EIE+FG+ ++ + K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS------FD 118 IP DT R S I FE +F W+++ + G ++AIDGK +RG Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 +GK + MVSA+S NG+ LGQVK + KS+EITAIP L+N L L ++TIDAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETR 236 DI I A+Y++A+K N+ K + ++ + + R Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRDEIINRVIRHVSEKCRTWKD 240 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 233 bits (594), Expect = 8e-60, Method: Composition-based stats. Identities = 96/352 (27%), Positives = 152/352 (43%), Gaps = 21/352 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEW------ 57 SL+ ++ PD R V H L A+L V AV+ GA + ++ + + Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 58 -LKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE--ITDGEIIAIDGKTIR 114 + + P + T R+++ +D+ A + W+ C T + ++DGKT+R Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 GSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAM 174 GS G++ +H+++ G VLGQV + K+NE+T LL L L ++T DA+ Sbjct: 147 GSGPAGEQ---VHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIASKIKD-KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRK 233 Q++ A + D KKA Y+ VK NQ +L+ + + T HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKTLPWTKIP-----IQDETSTRGHGRY 258 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAK 293 + R T DF S V +S+ Sbjct: 259 DIRRLQAVTCTGPLALDFPH--AVQALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGPA 316 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 E A +R HW IE +LH + D EDASR+R GNA ++ ++ A+NLLR Sbjct: 317 ELADWLRGHWAIE-TLHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLLR 367 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 229 bits (585), Expect = 9e-59, Method: Composition-based stats. Identities = 92/343 (26%), Positives = 142/343 (41%), Gaps = 28/343 (8%) Query: 28 AILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFE 87 A+L + V A AG + + + + P + T V+S +D Sbjct: 2 ALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 KMFIEWMQECHEITDGEI---IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVK 144 + +D IA+DGK +RG+ + A H+VS F++ +VLGQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGALR--AKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TEAKSNEITAIPELLNLLYLK-KNLITIDAMGCQKDIASKIKDK-KADYLLAVKGNQGKL 202 KSNEI + LL LL + L+T+DAM Q A I K+ YL+ VK NQ K+ Sbjct: 120 VAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 HHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL-HIVSNVTRLNFCDFEFEWKGLKKL 261 V + DS HGR ETR I++ + F K Sbjct: 180 LARITALPWAEVPAAATDDS-----RGHGRVETRTLQIITAARGIGFP--------YAKQ 226 Query: 262 CVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE---FAHAIRAHWLIEHSLHWVLDVKMN 318 + ++ + V + Y I S + +R H IE+SLHW+ DV + Sbjct: 227 IIRITRERLITATDQRSVEVVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTFD 286 Query: 319 EDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 ED R GN A++++ ++ A+NL R + G + E C Sbjct: 287 EDRQRAHTGNGAQVLATLRNTAINLHR----LNGADNIAEACR 325 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. Identities = 79/368 (21%), Positives = 144/368 (39%), Gaps = 15/368 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + L+ + PD R V+++L+ +L L V IAG D + ++ + Sbjct: 25 VAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAGL 84 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE----IIAIDGKTIRGSFD 118 F +P + TI R+V ++ W +A DGK ++G+ Sbjct: 85 GFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGARS 144 Query: 119 KGKRKGAIH--MVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 + + +V A ++ G LG + A +EI ++ L+N + L+T D + Sbjct: 145 RPPQGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLVTTDCLHA 203 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETR 236 + +A I+ K +L ++KGNQ + P + F G+ T+E +HGR E R Sbjct: 204 HEPLARAIRAKGGHWLFSIKGNQPTVRAKL-AGLPWDEF----GNQHVTREKAHGRIEER 258 Query: 237 LHIVSNVTRLNFCDFEFEWKGLK--KLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 + + F + +K + S E + +S+ + Sbjct: 259 ALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTDQASPAQ 318 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 A R HW +E ++H V D M+ED IR NAA + + ++ LR + Sbjct: 319 LARWARGHWTVE-AIHHVRDRTMDEDRHTIRTKNAALNWAIARDTTISALRLAGYKNIRQ 377 Query: 355 EKKEGCVK 362 ++ Sbjct: 378 ARRATIRD 385 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. Identities = 90/248 (36%), Positives = 127/248 (51%), Gaps = 12/248 (4%) Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS------FDKGK 121 IP DT R S I FE +F W+++ + G ++AIDGK +RG GK Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 + MVSA+S NG+ LGQVK + KSNEITAIP L+N L L ++TIDAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKETRLH 238 I + A+Y++A+K N+ K + ++ + + ++ HGR ETR Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD-AKEFAH 297 V + + F+ + GLK + S R +RYY++S D +E A Sbjct: 183 TVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIV-ATGEYTQEVRYYVTSLDNTKPEEIAS 241 Query: 298 AIRAHWLI 305 AIR HW I Sbjct: 242 AIRQHWSI 249 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 227 bits (578), Expect = 6e-58, Method: Composition-based stats. Identities = 106/197 (53%), Positives = 138/197 (70%), Gaps = 13/197 (6%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 MS+ L D+ + D RQ KV +KL +LFL + AVI+GA+ W+EIEDFGH RL+WLKK Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 YGDF +GIPV DTIAR+V ID F + FI+WMQ ++TD +++A+DGKT+ Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 HM+SAF+ +NGVVLGQ +T+ KSNEITA+PELL LL L+ ++T+DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 ASKIKDKKADYLLAVKG 197 I KKADY +AVK Sbjct: 168 VKTIVKKKADYCIAVKK 184 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 222 bits (566), Expect = 1e-56, Method: Composition-based stats. Identities = 81/308 (26%), Positives = 132/308 (42%), Gaps = 27/308 (8%) Query: 50 FGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAID 109 FG + +WLK GI T + V ++ +AFE + +Q Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLI 169 K S + + +V ++ G+V+GQ + NE+ + L LL L+ ++ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 TIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEIS 229 T DA+ C+ D A I DY LA+K NQ L + + + E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD 289 H R E R + V ++ + GL+ + + + + + +RY++ S Sbjct: 206 HDRCERRRACIVAVNDID-------FPGLQAIGSVEATSRHADGRL--TSHVRYFLLSTI 256 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 M A R HW IE+ LHWVLDV+ EDA+R R+ + I+ ++K+ALNL+R D Sbjct: 257 MSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHPD 316 Query: 350 IKGEEEKK 357 K +K Sbjct: 317 -KASIRRK 323 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 218 bits (554), Expect = 3e-55, Method: Composition-based stats. Identities = 92/385 (23%), Positives = 150/385 (38%), Gaps = 46/385 (11%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAV-------IAGADEWQEIEDFGHE---R 54 + + ++ PD R + + L + + +CAV +A EW + R Sbjct: 23 GIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLR 82 Query: 55 LEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIE--WMQECHEITDG--------- 103 L W G +P + TI R ++ +D A ++ ++TD Sbjct: 83 LPWNPWDGHL---LPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPP 139 Query: 104 --------EIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAI 155 A+DGKT RG+ K +H++ ++ G +LGQ + +AKSNE T Sbjct: 140 AGDQAVPVRAYAVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEF 197 Query: 156 PELLNLLYLKKNLITIDAMGC-QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV 214 LL L L ++ DA+ + ++ + K A YL K NQ KL AF P Sbjct: 198 RALLAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLR-AFLAALPWTE 256 Query: 215 FSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDK 274 T++ HGR+ETR V+ VT L+F + + + RQK + Sbjct: 257 IPTADL----TRDRGHGREETRTLKVATVTHLDFPHAA------QAIRIRRWRRQKGQPA 306 Query: 275 SAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIIS 334 S E + ++ A R W IE H+V DV ED+S R G +++ Sbjct: 307 SHETIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLA 366 Query: 335 GIKKMALNLLRDCKDIKGEEEKKEG 359 + + LR ++ Sbjct: 367 LFRATVADTLRRAGHRSVPACRRAH 391 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 213 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 92/413 (22%), Positives = 161/413 (38%), Gaps = 69/413 (16%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVI-AGADEWQEIEDFGHER-LEWLKK 60 ++ L+ D R V++++S++L L VCA+ AG D ++ E L Sbjct: 31 VRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELAA 90 Query: 61 Y------GDFDNGIPVDDTIARVVSNIDS-----LAFEKMFIEWMQECHEITD------- 102 + IP + T+ V+ +D ++ + H Sbjct: 91 FGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGGI 150 Query: 103 --------------------GEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQ 142 IA+DGK +R + K + ++SA + +G+ L Sbjct: 151 EREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSA--KRPDGSRVFVLSAVRHGDGITLAS 208 Query: 143 VKTEAKSNEITAIPELLNLLYL---KKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQ 199 + AK+NEI LL+ L K ++T DA+ Q+D A+ + ++ A YLL +K NQ Sbjct: 209 REIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNNQ 268 Query: 200 GKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLK 259 + ++ D+ HGR E RL V V L F Sbjct: 269 RGQARQLHALPWKEIPVIHRDDA-----RGHGRHEQRLVQVVTVNGLLFP---------- 313 Query: 260 KLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH-----AIRAHWLIEHSLHWVLD 314 L ++++ A+ S + D+ A+E + R HW +E+++HW D Sbjct: 314 HAAQVLRIQRRRRLYGAKKWSSETVYAITDLPAEEASAAEIASWARGHWTVENTVHWCRD 373 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERS 367 V NED S++R N +++ ++ +L+R + G G H ER+ Sbjct: 374 VTFNEDKSQVRTHNTPSVLAAVR----DLIRGALKLAGYVNTAAGRRAHTERT 422 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 209 bits (533), Expect = 9e-53, Method: Composition-based stats. Identities = 74/179 (41%), Positives = 105/179 (58%), Gaps = 3/179 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SLL PD R + + H+L +L +C VI+GA+ W + + +L+WL+ Y + Sbjct: 7 SLLTAFDDLPDPR-RRECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +GI DT RV S +D+ FE F+ W+ +G+ +AIDGK +RGS D + Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHD--GARS 123 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 IH+VSA+S+ + LGQV+T KSNEITAIPELL L ++ + ITIDAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPARH 182 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. Identities = 80/195 (41%), Positives = 113/195 (57%), Gaps = 6/195 (3%) Query: 136 NGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAV 195 +VLGQ K KSNEITAIP L+ +L ++ ++ITIDAMGCQK+I S I+ KK DY++ + Sbjct: 31 QNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYIITL 90 Query: 196 KGNQGKLHHAFEEKF---PVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFE 252 K NQ L +E F F + + + E H R E R I +V+ L + Sbjct: 91 KANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSSLPCLHNQ 150 Query: 253 FEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWV 312 W LK + + S R+ + E +R+YISS + ++++ A AIR+HW IE+SLHW Sbjct: 151 DLWTELKTVVMVKSERRLWNKTTTE---VRFYISSVEKNSQKIATAIRSHWEIENSLHWT 207 Query: 313 LDVKMNEDASRIRRG 327 LDV +ED SRIR Sbjct: 208 LDVTFSEDKSRIRTR 222 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 190 bits (483), Expect = 7e-47, Method: Composition-based stats. Identities = 84/234 (35%), Positives = 120/234 (51%), Gaps = 8/234 (3%) Query: 143 VKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKL 202 + TE KSNEITAIP LL L KK ++TIDAMGCQKDIA I D+++AVK NQ KL Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 HHAFE---EKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLK 259 A EK + ++ T HGR++ R H V+ V E+ W +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPGFAAKGEWPW--IK 118 Query: 260 KLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNE 319 + A+ + ++ +RYY+ S+ + K F +R HW IE S+HWVLDV E Sbjct: 119 AIGTAVRITTHADGTQSD--EVRYYMLSRFLSGKRFGEVVRGHWGIE-SMHWVLDVTFGE 175 Query: 320 DASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVHFL 373 D +R R+ A +S +++ A+ LL+ + K C+ +EV L Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHPEKDSIRGKMIRCLMDTSFLNEVLTL 229 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 63/229 (27%), Positives = 109/229 (47%), Gaps = 5/229 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL++ ++ PD R + ++ L +L L + AV+ G + I FG R + L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 DNG-IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 NG +P +TIA ++ +D + + W+++ H E +A+DGK + GS D + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRD--GQV 120 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY-LKKNLITIDAMGCQKDIAS 182 H+++A++ + V+ Q+ EA +NE A LL +L L ++T DA+ Q D+ + Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHG 231 ++ K D +L K NQG L E F ++ G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRG 229 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 186 bits (473), Expect = 1e-45, Method: Composition-based stats. Identities = 82/217 (37%), Positives = 112/217 (51%), Gaps = 9/217 (4%) Query: 154 AIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN 213 AIPELL L L+ +TIDA+G Q IA I + ADY+LAVK NQ +L + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 VFSNYKGDS--FSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKK 271 +G + + HGR ETR+ VS + W GL++L + RQ Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWLASTGQH-WAGLQRLVMLERTRQIG 119 Query: 272 EDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 + + E YYISSK + A + A IRAHW IE+ LHWVLDV EDAS IR AA Sbjct: 120 QKVTTERC---YYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 IISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSS 368 ++ ++K+ LNL R ++ + KK R ++ Sbjct: 177 NMASLRKITLNLARLAQN---RQPKKVSLKNIRNLAA 210 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. Identities = 80/281 (28%), Positives = 119/281 (42%), Gaps = 13/281 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + L+ + D R +H L +LFL + A + GA E+ +F R E L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE----ITDGEIIAIDGKTIRGSFD 118 +G P DT +RV +D E+ F +M ++AIDGK++R +D Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 KG+ MVS + E + ++ +EI A +L L LK +T DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLH 238 +A + KA Y L +K N G L A E F + F T+E HGR+E R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGF----AAVTDLAVFETRERGHGREEQRRA 234 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGV 279 V V RL GLK + + R K + V Sbjct: 235 SVLPVDRLVK---RPSLPGLKAIGRIEAVRTGANGKPEQAV 272 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 64/231 (27%), Positives = 112/231 (48%), Gaps = 13/231 (5%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 IQ L+D +S T D R++ ++H +++ VCA+++GA + + ++ LKK Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFDNGI-------PVDDTIARVVSNIDSLAFEKMFIEW----MQECHEITDGEIIAIDG 110 F P + T+ R + +ID L +++ W + +C D +++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLIT 170 K +RG+ K K IH ++AF G+V+ Q + K+NEI + LL + ++ ++T Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 IDAMGCQKDIASKIKD-KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG 220 DA+ Q + A I + KKADY+ VK NQ + E + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIESLPWEAFPPSSDI 450 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 182 bits (461), Expect = 3e-44, Method: Composition-based stats. Identities = 69/218 (31%), Positives = 103/218 (47%), Gaps = 3/218 (1%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M +SL + +S PD R + H L A+L L A++ G Q I FG + L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFDNG-IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDK 119 F G P T++R + D E W+ IA+DGKT+RGS D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRD- 119 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 + H+V+A++ VL QV+ +AK+NE A LL +L + +++T DAM CQ+D Sbjct: 120 -GQVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN 217 +A+ + ADY+L K NQ L + E + Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFEDAAR 216 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 177 bits (448), Expect = 8e-43, Method: Composition-based stats. Identities = 68/267 (25%), Positives = 116/267 (43%), Gaps = 22/267 (8%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL+ ++ PD R++ V+++ +A+L + VCA+++GA + I ++ + + Sbjct: 50 ALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAGLGL 109 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-------------GEIIAIDGK 111 +P TI RV+ +D A E W+Q + D ++A+DGK Sbjct: 110 TGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAVDGK 169 Query: 112 TIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKNLIT 170 +R + +H++ + GVVL QV + K+NEI +L+ + L LIT Sbjct: 170 AMRAT---RHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDVLIT 226 Query: 171 IDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISH 230 +DAM Q A + + A L+ VK NQ +H + +V +T H Sbjct: 227 VDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKTLPWKDVPVG-----HTTTGRGH 281 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKG 257 GR ETR V + G Sbjct: 282 GRIETRTLKAVTVPAGLGFPHAAQAIG 308 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 175 bits (443), Expect = 3e-42, Method: Composition-based stats. Identities = 81/184 (44%), Positives = 112/184 (60%), Gaps = 3/184 (1%) Query: 94 MQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEIT 153 M+ H++T GE++AIDGKT+RGS+D+ R+ IHMVSA+++ N +VLGQ+KT KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN 213 AIP L+ +L L+ ++TIDAM CQ IA I K DYLLAVKGNQGKL A + F + Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 VFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED 273 + D+ E GR E R + V + + L W GL + + ++R K Sbjct: 121 RRAPIDRDTCQI-EKQKGRVEARTYHVLSASDLIR--DFSTWSGLTSIVMVENYRAAKGR 177 Query: 274 KSAE 277 + A Sbjct: 178 QRAR 181 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 174 bits (441), Expect = 5e-42, Method: Composition-based stats. Identities = 60/165 (36%), Positives = 92/165 (55%), Gaps = 3/165 (1%) Query: 47 IEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEII 106 + + ER L+ + NG P DT RV+ I+ + + +E +G+ I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKK 166 AIDGK ++GS K G+ H++SA+ +E G+ L Q K NE+ AIPE+L+ L L Sbjct: 61 AIDGKRLKGSKKKT---GSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 NLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFP 211 +I+IDAMG Q +IA +I +ADY+L++KGNQ L+ + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCFT 162 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 170 bits (430), Expect = 8e-41, Method: Composition-based stats. Identities = 68/331 (20%), Positives = 124/331 (37%), Gaps = 42/331 (12%) Query: 51 GHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDG 110 G + W + G P +T+ +++ +D+ ++ WM+ + G I A DG Sbjct: 8 GRGAVRW-RPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DG 65 Query: 111 KTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLIT 170 K + GS K A+H V ++ G+ L Q + + A+ LL L +++ Sbjct: 66 KVLGGS--KRAGAPALHGVELVTHTTGMALAQ-REAVGGDAAAALLALLTEAPLDGRMVS 122 Query: 171 IDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS-------------- 216 +DA + I + +YL VKG+Q + + P FS Sbjct: 123 MDAGFLNAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALD 182 Query: 217 -------------------NYKGDSFSTQEISHGRKETRLHIVSNVTRL-NFCDFEFEWK 256 + T E S GR E R V + + + W+ Sbjct: 183 QIAPPRRKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWR 242 Query: 257 GLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVK 316 + ++ + +++ V +SS+ +F +IR HW IE+ +H D Sbjct: 243 QVTQIGGLRRWCRRR-HADLWTVEEVTVVSSRQRTPAQFLASIRNHWTIENQVHRPRDGS 301 Query: 317 MNEDASRIRRGNAAEIISGIKKMALNLLRDC 347 M ED R+ I++ + + +NL+R Sbjct: 302 MQED--RLHGRAIGVILAVCRNVVINLIRRH 330 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 56/187 (29%), Positives = 96/187 (51%), Gaps = 4/187 (2%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWL-K 59 + +LL + PD R+ ++ L +L TV A+++GA ++ I F R E L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 60 KYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC---HEITDGEIIAIDGKTIRGS 116 +G PV +T+ V+ ++D+ E F + E+ + ++A+DGKT+RGS Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 FD + A ++AF + + +VL + + KSNEI A +++ L L + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIASK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. Identities = 59/213 (27%), Positives = 102/213 (47%), Gaps = 14/213 (6%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHER-LEWLKKY 61 ++ L PD R + +H L AIL + V AV+ A + + ++ LK+ Sbjct: 220 MEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKRI 279 Query: 62 GDFDNG------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRG 115 N P + T+ RV+ + A + W+ E +A+DGK ++G Sbjct: 280 RARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAWLLGIA---GFEAVAVDGKVLKG 336 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 + + + +H++SAF + G + Q + K+NEI + LL + ++ ++T DA+ Sbjct: 337 AVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADALH 394 Query: 176 CQKDIASKIKD-KKADYLL-AVKGNQGKLHHAF 206 Q+ A + + KKADYL AVKGNQ KL ++ Sbjct: 395 TQRKTARFLVEDKKADYLFTAVKGNQRKLRNSL 427 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 164 bits (415), Expect = 5e-39, Method: Composition-based stats. Identities = 55/193 (28%), Positives = 97/193 (50%), Gaps = 8/193 (4%) Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHI 239 ++ + +K DY+LA+KGN + ++ F V S +T + HGR E R++ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSPVTSTRSV--HTTFDKGHGRIERRIYT 58 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 + T + + + + EWK L + S +K E IRY+I+S D K+FA + Sbjct: 59 L--DTNIGWFEDKKEWKHLAGFGMVDSMVTRKGK---ECREIRYFITSVT-DVKQFAKGV 112 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 +HW+IE++LHW LDV +D + NAAE ++ I+++ N ++ + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKRA 172 Query: 360 CVKHRERSSEVHF 372 C+ E +++ F Sbjct: 173 CIYDDEFRAQILF 185 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 164 bits (414), Expect = 7e-39, Method: Composition-based stats. Identities = 59/267 (22%), Positives = 112/267 (41%), Gaps = 17/267 (6%) Query: 68 IPVDDTIAR-----VVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 IP ++R ++ +D F+ + + + + + DGK +RGS + GK+ Sbjct: 14 IPETTVVSRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGSIESGKK 73 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEA-KSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 +G +V + +G + Q + K +EI + LL+ L IT+DA+ Sbjct: 74 RGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALHLCPSTT 132 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 I +L+ +K NQ L + + D +T + +HGR E R + + Sbjct: 133 EMITKAGGVFLIGLKENQPTLLAHMTDC------ALPPIDQKTTFDFNHGRVEQRKYWLY 186 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 +V++ F+ W + R + K+A+ Y S + + A+R Sbjct: 187 DVSK---QGFDPRWDNTAFKRLVKVQRTRINQKNAKISREVSYYISNETAKEGIFDAVRN 243 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGN 328 HW +E + H + DV +NED + ++ Sbjct: 244 HWSVEVNNH-IRDVTLNEDQLKSKKRQ 269 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 164 bits (414), Expect = 7e-39, Method: Composition-based stats. Identities = 53/204 (25%), Positives = 100/204 (49%), Gaps = 19/204 (9%) Query: 11 SVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHER-LEWLKKYGDFDNG-- 67 + D R+ ++H ++L + + V+AG ++ I + + L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 68 ----IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 P + TI R++S D + +++ +++ + G IAIDGKTIR S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYIVAH---SSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHMVSAFSNENGVVLGQVKTEA-KSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 ++ +++A +++G V+ Q + K +EI A LL L L ++T DA+ Q +AS Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIKDKKADYLLAVKGNQGKLHHAF 206 +I++K DY+ VK N+ L Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEI 421 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 157 bits (398), Expect = 5e-37, Method: Composition-based stats. Identities = 58/142 (40%), Positives = 78/142 (54%), Gaps = 3/142 (2%) Query: 101 TDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLN 160 G +IAI+GK++RG+ A+H VSA++ G+ LGQ+ + KSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 LLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG 220 L L+ ++TIDA+GCQ +A +I DY+LAVK NQ L HA + F Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 221 DS---FSTQEISHGRKETRLHI 239 T + HGR ETR Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 46/182 (25%), Positives = 84/182 (46%), Gaps = 4/182 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL+ ++ PD R ++ L +L L + ++ ++ +EDF E L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 DN-GIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR- 122 P D T RV+ ID +F W+ + + + +DGK+I+ + + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 123 -KGAIHMVSAFSNENGVVLG-QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 + I++VS FS + GV + Q + +EI + LL L L+ + T+D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 Query: 181 AS 182 S Sbjct: 184 YS 185 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 151 bits (382), Expect = 4e-35, Method: Composition-based stats. Identities = 58/186 (31%), Positives = 94/186 (50%), Gaps = 10/186 (5%) Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSN 242 I KK DYLL VKGNQ KL A E F ++ D + E HGR ++ V + Sbjct: 1 MIIAKKGDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLS 59 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 + +W + S R E +S + YYI+S+ + A++ A ++RA Sbjct: 60 AKGIINPG---DWPNCVTIGRIDSMRVVDEKES--DLERCYYITSRALTAEQLAASVRAR 114 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 W +E+ HW+LDV +EDAS + + NA + +S ++K+ALN++R K + +K Sbjct: 115 WGVENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKT----DTRKSSLRL 170 Query: 363 HRERSS 368 R+ ++ Sbjct: 171 KRKGAA 176 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 66/142 (46%), Positives = 86/142 (60%), Gaps = 4/142 (2%) Query: 106 IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLK 165 +AIDGK +RGS D + IH+VSA+S+ + LGQV+T KSNEITAIPELL L ++ Sbjct: 1 MAIDGKCLRGSHD--GARSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDS--F 223 + ITIDAMGCQ DIA +I + ADY+L VKGNQ L A + F + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 STQEISHGRKETRLHIVSNVTR 245 S + +HGR ETR + +N Sbjct: 119 SQTDKNHGRIETRRCVATNDVA 140 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 55/170 (32%), Positives = 85/170 (50%), Gaps = 11/170 (6%) Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKET 235 K I + DY++AVKGNQ +LH + + + +I+ R+ Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQIKLTTEQRLPVSL--------DITTERRSD 52 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 R+ S + ++W+GL++L F + + V YYISS ++A +F Sbjct: 53 RITTRSVSVFDDLSGISYDWEGLQRLVKVERFGTRAGKPYHQIV---YYISSLTINAAQF 109 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 A IR HW IE+ LHWV DV ++ED SR+R+GNA S I+ + L +LR Sbjct: 110 AQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILR 159 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 148 bits (373), Expect = 4e-34, Method: Composition-based stats. Identities = 51/207 (24%), Positives = 83/207 (40%), Gaps = 13/207 (6%) Query: 155 IPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV 214 +L + + IT DA+ QK +A I + A YL VK NQ L+ + F Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEH-- 59 Query: 215 FSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDK 274 + D HGR +TR + + +F + C+ K +K Sbjct: 60 --RKEPDYCLQDPPGHGRIDTRSIW-TTTELNEYLEFPHVG---QAFCIHKKSYDPKTNK 113 Query: 275 SAEGVSIRYYISSKD---MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 E Y ++S D R HW IE+S H++LD +ED +RIR GN Sbjct: 114 VCENT--FYGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 IISGIKKMALNLLRDCKDIKGEEEKKE 358 + ++ A+ LL+ ++ ++ Sbjct: 172 NTNRLRGFAIGLLKSKGVKDIAQKVRD 198 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 147 bits (371), Expect = 6e-34, Method: Composition-based stats. Identities = 56/190 (29%), Positives = 92/190 (48%), Gaps = 14/190 (7%) Query: 155 IPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV 214 + +L +KK + T+DA+ CQK I K+ Y++ VK NQ L A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIED-----T 56 Query: 215 FSNYKGDSFSTQEISHGRKE-TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED 273 N +++S + HG + RL I + + +W GL++ S R++ Sbjct: 57 AKNSPLNAWSWTQKGHGHESHCRLKIWEATESM-----KMQWAGLERFI---SIRRQGFR 108 Query: 274 KSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEII 333 + S Y+I+S+ + + A IR H IE++LHW DV +NED IR + A I+ Sbjct: 109 HHKKFDSTTYHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAIL 168 Query: 334 SGIKKMALNL 343 ++ +A NL Sbjct: 169 GILRNIAFNL 178 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 58/167 (34%), Positives = 84/167 (50%), Gaps = 13/167 (7%) Query: 128 MVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDK 187 MVS ++ N +VLGQVK SNEITAIPELL +L L ++ I A+ C KDI I + Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 KADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 ADY++ +K NQG L+ + E+ F + F + ++ +E HG E R Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIR-------N 113 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD 291 D + W LK + + Q + + V RY+ISS D + Sbjct: 114 FGFQLDPDSVWSNLKSVGMVEPIGQVDDKTT---VETRYFISSLDSN 157 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 52/192 (27%), Positives = 85/192 (44%), Gaps = 6/192 (3%) Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKF-PVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 KI +KK DY++ +K N + E F ++ ++F R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 V+ ++ EWKG+K + R +S E V +YISS D+D + A +R Sbjct: 61 LKVS--DWLSKAEEWKGIKSVLEVCRKRSDNGKESQEKV---FYISSLDVDVQILAKCVR 115 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 HW +E+ HWVLDV ED + AE ++ ++++ALNL R + + K Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHPKKQSMKGKLTAA 175 Query: 361 VKHRERSSEVHF 372 E E+ Sbjct: 176 GWSDEFRDELLL 187 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. Identities = 54/170 (31%), Positives = 75/170 (44%), Gaps = 10/170 (5%) Query: 192 LLAVKGNQGKLHHAFEEKFPVNVFSN---YKGDSFSTQEISHGRKETRLHIVSNVTRLNF 248 +LAVK NQ L + HGR ETR + + Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFE 60 Query: 249 CDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHS 308 D W GL+ + + S R+ + + RYY+SS DA AHA+RAHW IE S Sbjct: 61 PDL---WPGLQSIPMVESTREIGDTVT---TGRRYYVSSLPADAVRIAHAVRAHWGIE-S 113 Query: 309 LHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 +HWVLDV NED R R NAA+ + ++++A L+R K + Sbjct: 114 MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRR 163 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 5/180 (2%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWL-KKY 61 + +L + PD R+ L +L ++ A+++GA ++ I F H L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGK 121 G P +I + +D A F E +IA+DGKT+RGS D+ + Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARLAE--GAAVIALDGKTLRGSLDRFE 118 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTE--AKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 + A ++SAF+ E +VLGQ+ E K +EI A L+ L L L T+DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 62/232 (26%), Positives = 95/232 (40%), Gaps = 13/232 (5%) Query: 28 AILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFE 87 A+L + V A A + + + + P + T V+S +D Sbjct: 2 ALLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 KMFIEWMQECHEITDGEI---IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVK 144 + +D IA+DGK +RG+ + A H+VS F++ +VLGQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGALR--AKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TEAKSNEITAIPELLNLLYLK-KNLITIDAMGCQKDIASKIKDK-KADYLLAVKGNQGKL 202 KSNEI + LL LL + L+T+DAM Q A I K+ YL+ VK NQ K+ Sbjct: 120 VAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 HHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL-HIVSNVTRLNFCDFEF 253 V + DS HGR +TR I++ + F + Sbjct: 180 LARITALPWAEVPAAATDDS-----RGHGRVKTRTLQIITAARGIGFPYAKQ 226 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 71/148 (47%), Positives = 92/148 (62%), Gaps = 3/148 (2%) Query: 97 CHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIP 156 G+IIA+DGKT+RGS+D+ K AIHMVSA+S N +VLGQ+KTE KSNE TAIP Sbjct: 1 MAARIPGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIP 60 Query: 157 ELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF---PVN 213 +L LL L+ +TIDA+G Q+DIA +I DK ADYLL VK NQ LH + + Sbjct: 61 KLFTLLALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAK 120 Query: 214 VFSNYKGDSFSTQEISHGRKETRLHIVS 241 F+ DS + + HGR + V+ Sbjct: 121 GFTEDFTDSVTEEGDKHGRIDKLHCRVT 148 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 49/181 (27%), Positives = 80/181 (44%), Gaps = 3/181 (1%) Query: 21 KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG-IPVDDTIARVVS 79 H L A+L L AV+ Q I FG + L F G P T+++ + Sbjct: 3 GRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTLR 62 Query: 80 NIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVV 139 ID E W+ +A+DGK +RGS D H V+A++ V Sbjct: 63 RIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRD--GDVPGPHRVAAYAPHAAAV 120 Query: 140 LGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQ 199 LGQ++ +A++NE A LL ++ + +++T A C +D+A+ + D Y+ +G Sbjct: 121 LGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQP 180 Query: 200 G 200 Sbjct: 181 T 181 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 142 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 47/184 (25%), Positives = 80/184 (43%), Gaps = 11/184 (5%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEK--------FPVNVFSNYKGDSFST 225 M Q D+ + ++++ DY+L K NQG L E FP + + D+ + Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 E+S G +++ LN ++ W G++++ RQ + E V + Sbjct: 61 CEVSKGHGWVERRTMTSTIWLN--EYLTRWPGVQQVFRLTRTRQVGGKTTVEVVYGISSL 118 Query: 286 SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 SS R HW IE S H + D + ED R+RRG A +++ ++ +A+ LLR Sbjct: 119 SSVAAAPDALLRYTRTHWGIE-SRHHIRDATLGEDRCRVRRGAAPRVLAVLRNVAVYLLR 177 Query: 346 DCKD 349 Sbjct: 178 RLGT 181 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 141 bits (356), Expect = 4e-32, Method: Composition-based stats. Identities = 52/167 (31%), Positives = 80/167 (47%), Gaps = 9/167 (5%) Query: 3 IQSLLDYISVTPDIRQQ--GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L ++ S PD R+ G ++HKL ++ L + ++ EI +FG L+ +K Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-----EIIAIDGKTIRG 115 NGIP + T+ R+ ID A + + H+ G EI+ IDGK RG Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL 162 + K R I VSA S + L E KSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 58/146 (39%), Positives = 77/146 (52%), Gaps = 7/146 (4%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISH 230 MGCQK+IA I +++ADY+ AVK NQ LH A ++ F F +Y D T SH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 GR E+R V L D W+GL+ + + S R KE + + RYYISS Sbjct: 61 GRIESRRCWVG-YDALPLTDDSQNWEGLQTIVMVESERTLKEKTT---IEHRYYISSTMA 116 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVK 316 A ++ R HW IE+SLHW LD+ Sbjct: 117 TAAYLLNSSREHWGIENSLHWRLDIA 142 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 135 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 55/190 (28%), Positives = 78/190 (41%), Gaps = 15/190 (7%) Query: 185 KDKKADYLLAV--KGNQGKLHHAFEEKFPVNVFSNYKGDS---FSTQEISHGRKETRLHI 239 D+ + L +G L HA + F Y T + HGR ETR Sbjct: 90 ADRGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCT 149 Query: 240 VS-NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHA 298 + ++ L + WK + + S R RY ISS D++ HA Sbjct: 150 AAGDLDWLATLGLKERWKKITSVAGIDSSRVIGSKT---ETDRRYVISSLPADSERILHA 206 Query: 299 IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 +R HW IE+ LHW LDV EDA IR NAA S +++ A+NL R + Sbjct: 207 VRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFRAD------HSRAM 260 Query: 359 GCVKHRERSS 368 G K R+ ++ Sbjct: 261 GLPKKRKAAA 270 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 67/331 (20%), Positives = 115/331 (34%), Gaps = 70/331 (21%) Query: 26 LSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLA 85 L+++L L V+AG + + ++ + L GIP + T R+V D +A Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FEKMFIEWMQECHEITDG--EIIAIDGKTIRG--SFDKGKRKGAIHMVSAFSNENGVVLG 141 ++ W+ + D +A DGKT++G SF + ++ A ++ G+ G Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITAG 167 Query: 142 QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGK 201 + +EI A+ L L D + Sbjct: 168 HQRV-VGGDEIAALEA------LAGRLDLTDVL--------------------------- 193 Query: 202 LHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKL 261 +T E HGR E R VT F W + + Sbjct: 194 ---------------------VTTAEKGHGRVEVRSLKALTVTTPKLVGF---WGTKQVI 229 Query: 262 CVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFA-------HAIRAHWLIEHSLHWVLD 314 + R+KK +A VS + + A++ R HW +E ++H V D Sbjct: 230 ELRRRTRRKKTVTAAPTVSEEVFYLVTSLPAEQAHPRDLAARARARGHWTVE-AIHHVRD 288 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 ++ED R NA + + A++ LR Sbjct: 289 RVLDEDRHTARTANAPLAWAIARDTAISALR 319 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. Identities = 41/120 (34%), Positives = 62/120 (51%), Gaps = 9/120 (7%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD 314 W+ L+ + + S R +K + + + RYYISS A R HW IE SLHW LD Sbjct: 7 WEELQTIVMVESERAEKGETT---IEHRYYISSTLGTAAYLLDYKREHWGIETSLHWCLD 63 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVHFLY 374 + ED SRI +GN AE + ++ +ALNLL+ + K G R ++ + F++ Sbjct: 64 IAFREDESRISKGNGAENFAILRHIALNLLKKE------DTAKIGIKNKRLKAGGMEFIF 117 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. Identities = 61/159 (38%), Positives = 82/159 (51%), Gaps = 5/159 (3%) Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 G H+VSA++ E+GV LG V TE KSNEITAI LL L KK ++TIDAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IKDKKADYLLAVKGNQGKLHHAFE---EKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 I D++LAV+ NQ KL A EK + + T HGR++ R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGV 279 + V E+ W +K + A+ + + V Sbjct: 122 AQVPPDFAAKGEWPW--IKAIGTAVRITTHPDGTQTDEV 158 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 37/110 (33%), Positives = 66/110 (60%), Gaps = 3/110 (2%) Query: 249 CDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHS 308 D + W LK + + S Q + + V RY+ISS D + ++ A+++R+HW IE+S Sbjct: 9 LDPDSVWSNLKSVGMVESIGQVDDKTT---VETRYFISSLDSNGEQLANSVRSHWAIENS 65 Query: 309 LHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 LHWVLDV + +D +IR+ NA + + ++++A++LL +K + K+ Sbjct: 66 LHWVLDVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQ 115 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 123 bits (309), Expect = 1e-26, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 74/180 (41%), Gaps = 6/180 (3%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L+ ++ PD R + V+ +L + V +++ + +++E F L + + Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 66 -NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI--IAIDGKTIRGSFDKGKR 122 P D +D A +W G++ + DGKT+RGS + Sbjct: 73 LKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTSG 132 Query: 123 KGA--IHMVSAFSNENGVVLGQ-VKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 GA I V+ +S GV + Q + +E + +LL L L+ LI DA+ Q+ Sbjct: 133 GGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQA 192 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 123 bits (308), Expect = 1e-26, Method: Composition-based stats. Identities = 34/94 (36%), Positives = 53/94 (56%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 ++ S PD R KH I+ L + +V+AGA + EIEDF ++WLK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE 99 NGIP DT +RV S I+ +F+ F+ W++ ++ Sbjct: 65 NGIPSHDTFSRVFSAINPASFQDSFLIWLKAIND 98 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 63/330 (19%), Positives = 114/330 (34%), Gaps = 42/330 (12%) Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC-------HEITDGEIIAIDGKTIR 114 G P D T+ R+++ E+ +++ +++ +++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 GSFDKGKRKGAIHMVSAFSNENG------------------VVLGQVKTEAKSNEITAIP 156 D K KGA SA+ E +GQ +K E TA Sbjct: 153 SRTDGEKVKGAQQ--SAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFR 210 Query: 157 ELLNLL--YLKKN--LITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV 212 LL + L ++T DA C ++ A + Y+ +K NQ LH + Sbjct: 211 RLLPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHDIARDYGQY 270 Query: 213 NVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKE 272 ++ + T E G R +V E + + ++ E Sbjct: 271 DLGTPLAR----TAERYRGHTIVRELYARDVAGNPAAAIEAAQQ--LWYVCQTTTDRRGE 324 Query: 273 DKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAA-- 330 + E I + + + +R HW IE+ HW +DV + ED + + A Sbjct: 325 IVAVEQRYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASRASI 384 Query: 331 EIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 E +S ++ + N + +K+G Sbjct: 385 ETVSWLRLIGYN---AVSAWRTLAPRKDGR 411 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 118 bits (295), Expect = 4e-25, Method: Composition-based stats. Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 15/184 (8%) Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISH-GRKETR 236 K + D L+ +KGN KL A S + T ++ R E R Sbjct: 6 KKTVETVLATGNDLLVQLKGNHPKLLAAVRTL----CQSRAHAEQSYTVDLGRRNRIEQR 61 Query: 237 LHIVSNVTRLNFCD-----FEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD 291 + + + D F+ +G +++ V + ++ E + S YY+++ Sbjct: 62 TVRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPRQE---SPAYYLATCTAS 118 Query: 292 AKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK 351 A A IR HW IE+ LH VLDV + ED+SRIRR + + ++ ALNLLR Sbjct: 119 AATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRN--PGVFALLRHFALNLLRHNGQAN 176 Query: 352 GEEE 355 Sbjct: 177 IRSA 180 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 115 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 40/140 (28%), Positives = 59/140 (42%), Gaps = 5/140 (3%) Query: 207 EEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALS 266 + K +S+ T+E HGRKE R V +W +K + + Sbjct: 1 MQFQDYWALPEDKQESYITEEKGHGRKEVREVYVLPAAFSEAL--RQKWCLVKSIVAVVR 58 Query: 267 FRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRR 326 R K S E YYI + + + + A R HW IE+ HW LDV ED RI Sbjct: 59 DRSVKGKGSYETS---YYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRIYA 115 Query: 327 GNAAEIISGIKKMALNLLRD 346 G++A ++ ++ NL R Sbjct: 116 GDSALNMACCRRFVQNLFRK 135 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 114 bits (285), Expect = 5e-24, Method: Composition-based stats. Identities = 39/134 (29%), Positives = 63/134 (47%), Gaps = 5/134 (3%) Query: 224 STQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRY 283 +T E HGR E R + +WKGLK+ R K K+ E V Sbjct: 2 TTSEKGHGRIEKRT-----LETTPIVTVGQKWKGLKQGLRITRERAVKGKKTVEVVYGIT 56 Query: 284 YISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNL 343 +S +A +R HW IE+ LH+V DV + EDA R+R+G A ++++ ++ + ++L Sbjct: 57 SLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVVVHL 116 Query: 344 LRDCKDIKGEEEKK 357 L + E + Sbjct: 117 LASVEAKSRPEAIE 130 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 114 bits (284), Expect = 7e-24, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 62/131 (47%), Gaps = 6/131 (4%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 +SL ++ PD R ++ L +IL + VCAV+AGA + I D+ ++ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE------IIAIDGKTIRGSF 117 F + +P T+ R++ ID+ ++ W++ +IA+DGK +RG+ Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKGKRKGAIHM 128 + A+ + Sbjct: 149 LRAAGPSALGL 159 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 113 bits (283), Expect = 9e-24, Method: Composition-based stats. Identities = 42/188 (22%), Positives = 76/188 (40%), Gaps = 13/188 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L +S PD R ++ L +L L + A ++ D + +E F L G Sbjct: 2 TLRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG-- 58 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 P I ++ +D + Q E GE++ +DGK +RGS + Sbjct: 59 LRKAPGHTAITLLLHRLDPEKLQAALG---QVFPEADLGEVLVVDGKHLRGSGK--GKSP 113 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL---YLKKNLITIDAMGCQKDIA 181 + +V + L Q + E + E A ELL+ L L+ ++ DA ++A Sbjct: 114 QVKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVA 171 Query: 182 SKIKDKKA 189 ++++ K Sbjct: 172 ARVRKKGG 179 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 37/80 (46%), Positives = 52/80 (65%) Query: 279 VSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKK 338 + RYYISS + A+EFA +RAHW IE+ LHWVLDV + ED I RG+AA+ ++ + Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 MALNLLRDCKDIKGEEEKKE 358 +ALN +R K I +K+ Sbjct: 61 VALNQIRREKTIDASVNRKQ 80 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 107 bits (268), Expect = 5e-22, Method: Composition-based stats. Identities = 41/188 (21%), Positives = 77/188 (40%), Gaps = 13/188 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L + +S PD R ++ L +L L + A ++ D + +E F L G Sbjct: 2 TLREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG-- 58 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 P + ++ +D ++ +Q GE++ +DGK ++GS + Sbjct: 59 LRKPPGHTILTLLLHRLDPEKLQEAL---LQVFPGADLGEVLVVDGKHLKGSGK--GKSP 113 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL---YLKKNLITIDAMGCQKDIA 181 + +V + L Q K E + + A+ ELL+ L LK ++ DA ++A Sbjct: 114 QVRLVEVLALHLLTTLAQAKAEGRED--QALLELLDRLGAEGLKGKVVVGDAGYLYPELA 171 Query: 182 SKIKDKKA 189 K+ K Sbjct: 172 GKVVQKGG 179 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 40/79 (50%), Positives = 57/79 (72%) Query: 279 VSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKK 338 +++RYYISS D A++F AIR HW +E++L+W LDV MNED +IRRGNAAE SGI+ Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 MALNLLRDCKDIKGEEEKK 357 +A+N+L + + K +K Sbjct: 61 IAINILTNNQVFKARSRRK 79 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 58/364 (15%), Positives = 121/364 (33%), Gaps = 33/364 (9%) Query: 10 ISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIP 69 + PD+R + + L+ IL + ++AGA E E+ ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VDDTIARVVSNIDSLAFEKMFIEWMQEC-------HEITDGEIIAIDGK-----TIRGSF 117 D T + + ++ ++A+DGK T+ Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 118 DKGKRK------GAIHMVS--AFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKN- 167 + + G V+ S + V A++NE +L L Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 168 --LITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFST 225 ++T DA + + + DY+ A+K + + E + + + D Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARREDVLDN 259 Query: 226 QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 + + S+ E W + S ++ R ++ Sbjct: 260 ATTATREIQILAVDPSHGYGAGKGPEESVWSHARTFLRVTSTVRRSG--VVIERDSRLFV 317 Query: 286 SSK---DMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISG--IKKMA 340 SS+ + ++ +RAHW +E++ H LD ED +A +++ ++++A Sbjct: 318 SSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDERPWIAADANGMLAVLLLRRIA 377 Query: 341 LNLL 344 LL Sbjct: 378 YTLL 381 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 60/144 (41%), Gaps = 5/144 (3%) Query: 228 ISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISS 287 HGR E R + ++ W G++++ R+ + E V +S Sbjct: 3 KGHGRVERRSITTTT----WLNEYLTRWPGVQQVFRLERQRRADGKTTVEVVYGISSLSP 58 Query: 288 KDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDC 347 R+HW IE SLH+V DV ++ED R+RRG A +++ ++ +A+ LLR Sbjct: 59 VAAPPDTVLGYTRSHWGIE-SLHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYLLRRL 117 Query: 348 KDIKGEEEKKEGCVKHRERSSEVH 371 + + + ++ Sbjct: 118 GAGTIAAAVRTVVARPELALAALN 141 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 42/86 (48%), Positives = 57/86 (66%) Query: 272 EDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 E K ++ RYY S D+ A++FA A R HW +E+ LHW LDV MN+D +IRRGNAAE Sbjct: 19 EQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAAE 78 Query: 332 IISGIKKMALNLLRDCKDIKGEEEKK 357 + SGI+K+A+N+L K +K K Sbjct: 79 LFSGIRKIAINILTKDKILKAGARCK 104 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 32/81 (39%), Positives = 52/81 (64%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 + L + +T D R + KH L I+ L + AV++G++ W++IE+FGH +L+WL++Y F Sbjct: 6 TFLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPF 65 Query: 65 DNGIPVDDTIARVVSNIDSLA 85 GIP DTIARV+ + + Sbjct: 66 KAGIPRHDTIARVICRLKADE 86 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 103 bits (257), Expect = 9e-21, Method: Composition-based stats. Identities = 48/198 (24%), Positives = 82/198 (41%), Gaps = 19/198 (9%) Query: 101 TDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLN 160 IA+DGK ++ S + H++SA ++ V L +V+ AK+NE T LL Sbjct: 129 GPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKPLLA 186 Query: 161 LLYLKKNLITIDAMG-CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYK 219 L L ++T DA+ + +I+ ++ KKA Y+ +K NQ HH ++ Sbjct: 187 PLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLATLPWRDIPVQ-- 244 Query: 220 GDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKE---DKSA 276 + E+ HGR+E+ + + C E G+ L+ R + Sbjct: 245 ---HAASEVGHGRRES--------SSIKTCAIPDELGGIAYPHARLAIRVHRRCQPTGKR 293 Query: 277 EGVSIRYYISSKDMDAKE 294 E Y ++S D Sbjct: 294 ESRESVYAVTSLDAHQAT 311 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 33/76 (43%), Positives = 47/76 (61%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 +I+ +++ + D R G+ H L IL L +CAV++GA W +IED+GH R WL++Y Sbjct: 6 TIEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRY 65 Query: 62 GDFDNGIPVDDTIARV 77 NGIP DTI RV Sbjct: 66 LKLRNGIPGHDTIRRV 81 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 31/120 (25%), Positives = 52/120 (43%), Gaps = 7/120 (5%) Query: 232 RKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQ--KKEDKSAEGVSIRYYISSKD 289 R ET+ VS++ ++ L+++ + KK E +SS Sbjct: 1 RIETQTIRVSSL-----LKGYSDFPHLEQVFRIDRVTRFKKKGKTRKETALGVTSLSSGQ 55 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 +E +R HW IE+ LHW+ D ED R GN A +++ ++ M ++LLR Sbjct: 56 ASPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGAHVMATLRNMTISLLRVAGS 115 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 36/112 (32%), Positives = 54/112 (48%), Gaps = 7/112 (6%) Query: 263 VALSFRQKKEDKSAEGVSIRYYISSKDMD-AKEFAHAIRAHWLIEHSLHWVLDVKMNEDA 321 V + + +RYY++S D ++ A AIR HW I ++LHW LDV ED Sbjct: 1 VRIKSERTIVAIGEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFREDY 60 Query: 322 SRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVHFL 373 S+ + NAA S KMAL +L++ K KG K + + ++L Sbjct: 61 SK-KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRL-----KAGWDENYL 106 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 30/81 (37%), Positives = 48/81 (59%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 + L + D R + KH L I+ L + AV++G++ W+ IE+FGH +L+WL ++ F Sbjct: 6 TFLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPF 65 Query: 65 DNGIPVDDTIARVVSNIDSLA 85 GIP DTIARV+ + + Sbjct: 66 KAGIPRHDTIARVICRLKADE 86 Score = 51.2 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 37/83 (44%), Gaps = 3/83 (3%) Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF---PVNVFSNYKGDSFSTQEISHGRKE 234 K+IA I +KADY+LA+KG+ L E + F+ D +T + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 TRLHIVSNVTRLNFCDFEFEWKG 257 TR V + + + G Sbjct: 147 TRRCQQVLVNKSWLNNKYRKRPG 169 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 39/179 (21%), Positives = 66/179 (36%), Gaps = 21/179 (11%) Query: 195 VKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFE 254 +K NQ + + + HGR+E+R + C E Sbjct: 2 IKRNQPTTYRQLAALPWPDSAVQ-----HTASSAGHGRRESR--------SIKTCGIADE 48 Query: 255 WKGLKKLCVA---LSFRQKKEDKSAEGVSIRYYISSKDM---DAKEFAHAIRAHWLIEHS 308 G+ R++K+ E Y ++S D E A A+R HW +E + Sbjct: 49 LGGIAFPHGRLALRVHRRRKQTGGCESRETVYAVTSLDAHETTPAELAAAVRGHWTVE-A 107 Query: 309 LHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERS 367 L V DV E+AS + G A ++ + +A+ LL+ I + + ER+ Sbjct: 108 LRHVRDVTYAEEASTLHTGTAPRAMATFRNLAVGLLKTLGAINIAKTTR-AIRDQPERA 165 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 6/133 (4%) Query: 227 EISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYIS 286 + HGR R + L G+K Q + + YYI+ Sbjct: 34 DEGHGRLVRRRYFAFP---LPEELHNHALSGIKSCIAVERIVQ-EGKGEPKTSHFSYYIT 89 Query: 287 SKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRD 346 + + A +R HW IE S HW+LDV N+D + N+AE + IK++ LNL++ Sbjct: 90 NHPASDPKLADYVRQHWEIE-SYHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLVK- 147 Query: 347 CKDIKGEEEKKEG 359 KD G+++ + Sbjct: 148 AKDWAGKKKSVKS 160 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 39/85 (45%) Query: 7 LDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDN 66 L + D R L +I+ + + AV+ GAD + IE +G + WL+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVDDTIARVVSNIDSLAFEKMFI 91 GIP DT RV+ ++ + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 47/74 (63%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+ +S+LD+ S D RQ +V + L I L +CA ++G +++ EI +G RLE+L++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFDNGIPVDDTI 74 + ++ G+P DT+ Sbjct: 77 FLPYERGLPAHDTL 90 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 99.0 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 31/102 (30%), Positives = 58/102 (56%), Gaps = 2/102 (1%) Query: 248 FCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEH 307 + + +++W GLK + S ++ E R+YISS D++A++ ++R HW +E Sbjct: 7 WLNNKYQWVGLKSIIKVTS-DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNHWQVE- 64 Query: 308 SLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 S+HWVL++ ED SR R+G + ++K+A+ L + + Sbjct: 65 SMHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 99.0 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 45/163 (27%), Positives = 67/163 (41%), Gaps = 8/163 (4%) Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISH-GRKETRLHIVSNV 243 L+ +K NQ LH A E + F D T EI R E R V ++ Sbjct: 2 IATGNHLLVQLKRNQPLLHDAMVEYTRGHPF----VDEHHTHEIGRRNRIEKRAVHVWHL 57 Query: 244 -TRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 L + ++ L ++ + YY+ + A F+ AIR H Sbjct: 58 HPSLGSAPWYDHFRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRNH 117 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 W +E+ H+V D + EDASRIRR + ++ ALNL+R Sbjct: 118 WRVENRAHYVRDTRFQEDASRIRRN--PCTFALLRSFALNLMR 158 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 98.6 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 36/84 (42%), Positives = 52/84 (61%), Gaps = 1/84 (1%) Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS- 182 A+H++SAF + GVVL Q+ KSNEI A ELL L + +T DAM Q++ A Sbjct: 7 KAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARF 66 Query: 183 KIKDKKADYLLAVKGNQGKLHHAF 206 ++DK+AD+++ VK NQ +L A Sbjct: 67 AVEDKRADFVMTVKDNQPELREAL 90 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 98.2 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 49/127 (38%), Positives = 73/127 (57%), Gaps = 1/127 (0%) Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKET 235 + ++ KI +K DYLLAVKGNQG L AF++ F ++ +N + ++T+E S GR E+ Sbjct: 12 VRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHES 71 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 R VS+ + D EW GLK + +S +KE + +RYYISSK ++A+E Sbjct: 72 RAAFVSHDLSV-LGDISDEWPGLKSMAFVVSMNSEKEVAEEADIYVRYYISSKQLNAEEL 130 Query: 296 AHAIRAH 302 A R H Sbjct: 131 LTASRLH 137 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 97.8 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 40/86 (46%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++++S+ Y D R KH+ I+ + VC V+ G D I + R EWL+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAF 86 + + NG+P D I + + AF Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 96.3 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 46/209 (22%), Positives = 93/209 (44%), Gaps = 14/209 (6%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 SI S L Y++ PD R+ K +H+ +L + + AV +G + + ++ +L Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 62 GDFDNG-----IPVDDTIARVVSNI--DSLAFEKMFIEWMQECHEITDGEI-----IAID 109 +P T+ R+ ++ D +K + W +E + E +A+D Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVD 124 Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLG-QVKTEAKSNEITAIPELLNLLYLKKNL 168 GK +RG+ + + A+ +SA G+ LG Q + ++ + + L + + Sbjct: 125 GKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGV-DWV 183 Query: 169 ITIDAMGCQKDIASKIKDKKADYLLAVKG 197 +T DA C +++A+ + ++K A KG Sbjct: 184 LTGDAALCTQELAAVVVEQKGGICSASKG 212 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 94.0 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 41/162 (25%), Positives = 64/162 (39%), Gaps = 14/162 (8%) Query: 195 VKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFE 254 +K NQ L ++ D+ ++ R+E R V V E Sbjct: 1 MKANQSNLFETACAI----AANDAPADTAFSRNKGRSRQEDRTVEVFPVGDALA---GTE 53 Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIR-----YYISSKDMDAKEFAHAIRAHWLIEHSL 309 W+ K + ++ R + R Y S+ + A +A AIR HW IE+ Sbjct: 54 WQPFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRN 113 Query: 310 HWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK 351 H+V DV +ED SRIR I++ + ALN++R Sbjct: 114 HYVRDVSCDEDKSRIRDN--PGIMARARSFALNIMRKNGIAN 153 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 94.0 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 48/94 (51%) Query: 265 LSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRI 324 R + + E +S++ DA + +R HW IE+ LH+V DV + ED R+ Sbjct: 8 TRERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRV 67 Query: 325 RRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 R G+A ++++ ++ ++L R+ K + E + Sbjct: 68 RMGHAPQVLAALRNAVVHLWREVKAVSCPEAIER 101 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 30/149 (20%), Positives = 64/149 (42%), Gaps = 9/149 (6%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++SL Y + PD + +H+L +L L A + G ++ + ++ + ++ Sbjct: 8 MRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRRF 67 Query: 63 D--FDNG---IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 NG +P I + + A ++ W + ++ E +A+DGK ++G Sbjct: 68 GCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAW--QAAQLNSEEALAMDGKIMKGGV 125 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTE 146 D + H+VS +E+ + Q K+ Sbjct: 126 DHTGAQT--HIVSLIGHESKHCVAQKKSA 152 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 92.8 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 27/95 (28%), Positives = 46/95 (48%), Gaps = 4/95 (4%) Query: 271 KEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAA 330 +AE V + + + A +AHW IE+ LHWV DV +ED R R GNA Sbjct: 69 GGPATAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNAP 128 Query: 331 EIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 ++++ ++ +A+ +LR + G + + H Sbjct: 129 QVMTSLRNLAITILR----LTGAKNIAKALRHHAR 159 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 92.4 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 30/117 (25%), Positives = 50/117 (42%), Gaps = 6/117 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 +LL S D R+ + L+ +L TV A++AGA +++++ F L+ L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 F-DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-----GEIIAIDGKTIR 114 P T+ ++ ID+ E+ F + + IAIDGKT Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWC 119 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 92.1 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 43/143 (30%), Positives = 63/143 (44%), Gaps = 5/143 (3%) Query: 161 LLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN--Y 218 + LK +L+T+DAMGCQ+ IA ++++ AD +L++KGNQGK A F + Y Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 219 KGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEG 278 E SHGR R V +T W ++ L V RQ ++ Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLT--PETKHSGSWPDIQALLVTEKIRQAHYSETV-T 117 Query: 279 VSIRYYISSKDMDAKEFAHAIRA 301 RYY+S + H A Sbjct: 118 SDFRYYLSRCQEARPDIGHTTHA 140 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 92.1 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 50/124 (40%), Gaps = 7/124 (5%) Query: 222 SFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSI 281 + S GR+E R V + EW+ ++ + + ++ Sbjct: 3 EHTHSIQSRGREEHRCIQVYEPVGIAL----QEWEAIRSVLCVQRWGTRQGKAYHNTA-- 56 Query: 282 RYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMAL 341 YYISS + +R HW IE+ LHW DV ED R+ A S ++ + + Sbjct: 57 -YYISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVI 115 Query: 342 NLLR 345 N+LR Sbjct: 116 NILR 119 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 92.1 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 37/77 (48%), Positives = 47/77 (61%), Gaps = 1/77 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 S+ + D R KH I+FL V AVI+GA+ W EI+ FG L+WL+KY F Sbjct: 2 SVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPF 60 Query: 65 DNGIPVDDTIARVVSNI 81 + GIPVDDTIARV+ I Sbjct: 61 ECGIPVDDTIARVIKRI 77 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 92.1 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 35/94 (37%), Positives = 53/94 (56%) Query: 273 DKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEI 332 E +RYYI SK + + FA A+R HW IE+SLHW LDV E SRIR+G+A Sbjct: 13 QNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQSRIRKGHADIN 72 Query: 333 ISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRER 366 S +++ +L+LL++ K + + K ++ Sbjct: 73 FSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDK 106 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 92.1 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 31/147 (21%), Positives = 66/147 (44%), Gaps = 9/147 (6%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++SL DY D R+ +H++S +L + A + G ++ I + ++ + ++ Sbjct: 215 MESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQRF 274 Query: 63 --DFDNG---IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 +NG IP I V+ D + + + ++ + + +A DGKT++ + Sbjct: 275 RCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNED--QGLEDTCLAFDGKTMKNAI 332 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVK 144 D+ R+ H+ S +E+ Q K Sbjct: 333 DENARQT--HIASVVGHESKTTHTQKK 357 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 91.7 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 38/140 (27%), Positives = 63/140 (45%), Gaps = 9/140 (6%) Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 R + RL G+K + +A K +++A RYY++S + + + Sbjct: 2 RRRYFA---YRLPKTINTGSLVGIKSI-IATETISSKTNETAISAEWRYYVTSHETEKSD 57 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 +R HW IE+ LHW LDV +N+DA + R A S IK+M L+L++ K Sbjct: 58 LHLYVRNHWSIENELHWHLDVHLNDDADKKRDDTTAINFSSIKRMLLSLVKT----KLPP 113 Query: 355 EKKEGCVKH-RERSSEVHFL 373 KK ++ + +L Sbjct: 114 GKKRSVRSRLKQVGWDTEYL 133 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 91.3 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 48/113 (42%), Gaps = 3/113 (2%) Query: 246 LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 + F G + +R+ + + + +Y+SS + A E IR HW + Sbjct: 1 MKAFPPLFSGNGRTRSIRLERYRELRGIVTVKT---HWYLSSIEASASELGRRIRGHWGV 57 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 E+ +H+ DV ED SRIR ++ S + ALNL R + ++ Sbjct: 58 ENQVHYPKDVTFGEDRSRIRTLPLVQVWSVARSFALNLYRSLLMANRAQAQRR 110 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 90.9 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 69/211 (32%), Gaps = 34/211 (16%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCA-VIAGADEWQEIEDFGHERLEWLK 59 +S + + + + D R + + + +C+ AG + + + Sbjct: 19 VSREGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAER 78 Query: 60 KYGDFDN------GIPVDDTIARVVSNIDSLAF------------------EKMFIEWMQ 95 +P TI + +D + Sbjct: 79 ARLRLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEP 138 Query: 96 ECHEITDG-------EIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAK 148 + G +A+DGKT R + K +H+V S+ +G +L QV+ EAK Sbjct: 139 SAAPVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAK 196 Query: 149 SNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 +NE LL L L L+T DA+ + Sbjct: 197 TNETAVFRRLLRPLDLTNVLVTADALHTVRA 227 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 90.1 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 32/130 (24%), Positives = 59/130 (45%), Gaps = 4/130 (3%) Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 + + + + GLK + + K + R+ ISS D+ + +A+ Sbjct: 14 LRTLIDKKWLAKAYRRSGLKSIIKVHTQVHDK-STGKDTAETRWNISSLDLHVVQALNAV 72 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDC--KDIKGEEEKK 357 R+HW +E S+HW+LD+ D SRI R + + ++K+A+ L + K + +KK Sbjct: 73 RSHWQVE-SIHWMLDMTFRVDESRICRKQGPHVFNVMRKIAMTLFKQDTTKLVSMARKKK 131 Query: 358 EGCVKHRERS 367 + RS Sbjct: 132 MAGLDDDYRS 141 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 89.7 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 36/78 (46%) Query: 267 FRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRR 326 + K K+ E V ++ + K R HW IE+ LH+V D ED S+IR Sbjct: 37 TKVKTGKKTEEIVYGITSLTQQKASPKTILKFSRGHWSIENGLHYVRDTAFREDHSQIRT 96 Query: 327 GNAAEIISGIKKMALNLL 344 NA ++ +K + + L Sbjct: 97 QNAPRAMASLKNLVVGLF 114 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 87.4 bits (215), Expect = 8e-16, Method: Composition-based stats. Identities = 26/71 (36%), Positives = 39/71 (54%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M +++ + + D RQ KV + L +LF+T+C VIAGA+ W EI D+ W K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFDNGIPVD 71 G G+PV Sbjct: 72 KGILTEGVPVR 82 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 32/126 (25%), Positives = 51/126 (40%), Gaps = 2/126 (1%) Query: 224 STQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRY 283 +T HGR+E R V +V+ ++ + ++ + K + Sbjct: 6 TTDRGRHGRQEHRWVEVFDVSGRLGPTWDGLIAAVARVTRLTWHKDTKSGLWHKTQETAL 65 Query: 284 YISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNL 343 Y ++ A AIR HW +E H+V DV ED SRIR + ++ ALN+ Sbjct: 66 YACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIRTK--PGHFARLRSFALNI 123 Query: 344 LRDCKD 349 LR Sbjct: 124 LRANGT 129 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 25/126 (19%), Positives = 53/126 (42%), Gaps = 6/126 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +++L Y + PD R+ +H+L + LT A + G ++ + ++ + ++ Sbjct: 60 MRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQRF 119 Query: 63 D--FDNG---IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 NG +P I + + A ++ W +D E +A+DGK ++G Sbjct: 120 GCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGGV 178 Query: 118 DKGKRK 123 D + Sbjct: 179 DHTGAQ 184 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 85.1 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 56/108 (51%), Gaps = 4/108 (3%) Query: 29 ILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG-IPVDDTIARVVSNIDSLAFE 87 +L L + AV+AG + I FG R + L F NG +P +TIA ++ +D+ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 KMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNE 135 ++ W+ + H + IA+DGK + GS D H+++A++ + Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGSRD--GAVPGTHLLAAYAPQ 107 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 85.1 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 24/61 (39%), Positives = 42/61 (68%) Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 A+++R+HW IE+SLHWVLDV + +D RIR+ NA + + ++++A++LL +K Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 E 355 + Sbjct: 61 K 61 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 82.0 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 32/97 (32%), Positives = 41/97 (42%), Gaps = 5/97 (5%) Query: 172 DAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV---NVFSNYKGDSFSTQEI 228 D +GCQK IA I +++ADYLLAVK NQ LH A F F+ Y D Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKINK 67 Query: 229 SHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVAL 265 GR E R V + W L+ + + Sbjct: 68 GPGRLEQRRCWVG--YEIPDTINSQNWAKLETIVMVE 102 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 82.0 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 27/118 (22%), Positives = 51/118 (43%), Gaps = 9/118 (7%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 L ++++ PD R +H L +IL + + A+ +GA+ + + ++ + L + Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 DNG-------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRG 115 P T+ RV+ I LA E+ W+ +A+DGKT+ G Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLGLSPAA--LAVDGKTLAG 130 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 81.7 bits (200), Expect = 5e-14, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 75/198 (37%), Gaps = 56/198 (28%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF---PVNVFSNYKGDSFSTQEISH 230 MGCQK+IA I +KADY+LA+KG+ L E + F+ D +T + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 GR ETR V + ++ + +++W GLK + S E + E Sbjct: 61 GRIETRRCQQVLVNK-SWLNNKYQWVGLKSIIKVTS--DVHEKTTTE------------- 104 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 SRIR+G + ++K+A+ L + + Sbjct: 105 -------------------------------SRIRKGRGPLAFNVMRKIAMTLFKQEQT- 132 Query: 351 KGEEEKKEGCVKHRERSS 368 K+ V ++ + Sbjct: 133 -----KRASIVAKKKMAG 145 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 80.9 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 31/149 (20%), Positives = 59/149 (39%), Gaps = 12/149 (8%) Query: 197 GNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWK 256 G+Q L+ ++ + + + EI HGR + + W Sbjct: 8 GDQKTLYRQIADQL---LGKRHIPLMATDHEIGHGR---DILWTLRAKEAPQ-HIKANWH 60 Query: 257 GLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVK 316 G + ++ + + +I+S +R W +E S HW+ D + Sbjct: 61 GTSWIAEVIATGTRDRK---PFKATHRFITSLRTTPDALLRLVRERWSVE-SWHWIRDTQ 116 Query: 317 MNEDASRIRRGNAAEIISGIKKMALNLLR 345 ++ED R R GN A +++ ++ A+NLLR Sbjct: 117 LHEDDHRYR-GNGAGVMAALRTAAMNLLR 144 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 80.5 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 49/129 (37%), Gaps = 13/129 (10%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + L + ++ D R++ +H A+L + AV+ GA + I ++ + + + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFDNGI-------PVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRG 115 P TI RV+ + + H+ + +AIDGK+ RG Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SFDKGKRKG 124 S R Sbjct: 115 SRLGSTRPP 123 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 33/129 (25%), Positives = 53/129 (41%), Gaps = 2/129 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF- 64 L Y+ PD R+ + L+ +L ++ AV++GA +++I+ F E L Sbjct: 3 LKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQLH 62 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC-HEITDGEIIAIDGKTIRGSFDKGKRK 123 PV +I + +D+ A E F E IA+DGKT+R + R Sbjct: 63 WKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSRT 122 Query: 124 GAIHMVSAF 132 SA Sbjct: 123 ARPLRYSAH 131 >UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK19_ACIF5 Length = 104 Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 37/83 (44%) Query: 283 YYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALN 342 ++ + R HW IE+ H V D +ED S+IR N +++ ++ +A++ Sbjct: 15 TSLTKDRTTPENLLGIARGHWEIENRNHHVRDTTYHEDLSQIRTENGPHMMATLRGLAMS 74 Query: 343 LLRDCKDIKGEEEKKEGCVKHRE 365 +LR + ++ R+ Sbjct: 75 ILRLIGVKNIAQAGRDFAASARK 97 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 34/129 (26%), Positives = 54/129 (41%), Gaps = 13/129 (10%) Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQE----CHEITDGEIIAIDGKTIRGSFDKG 120 PV ++ ++ ID A F + C IAIDGKT+R SFD Sbjct: 8 LRRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAF 67 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELL---------NLLYLKKNLITI 171 A +++SAF+ ++ ++L + KSNEI A L+ + + + + Sbjct: 68 SDTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVML 127 Query: 172 DAMGCQKDI 180 DAM I Sbjct: 128 DAMTFAPAI 136 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 75.1 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 31/105 (29%), Positives = 39/105 (37%), Gaps = 5/105 (4%) Query: 227 EISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYIS 286 + HGR ETR V W GLK R K + E V Sbjct: 2 DPGHGRIETRT-----VRATPLLTCHDRWTGLKHGFRITRTRTVKGVTTVEVVHGITSRP 56 Query: 287 SKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 + DA+ +R+HW IE+ H V DV + ED R R A Sbjct: 57 VERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAGR 101 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 1/64 (1%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + PD R +H L++ILF+ + A++ GA+ ++ DFG + +WLK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFDN 66 Sbjct: 60 PLPY 63 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 74.7 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 30/134 (22%), Positives = 54/134 (40%), Gaps = 11/134 (8%) Query: 46 EIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQE---CHEITD 102 + + R G + P + I R++ ID W+ Sbjct: 225 ALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAPAPGS 284 Query: 103 GEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAI------P 156 IA+DGKT+RGS + + A H+++A G+VL + K+NEIT Sbjct: 285 RRAIAVDGKTLRGS--RTRDSAARHVLAAADQHTGIVLASTDVDTKTNEITRFTASGSHA 342 Query: 157 ELLNLLYLKKNLIT 170 +LL+ ++ +++ Sbjct: 343 DLLSSRCIRSGVVS 356 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 74.0 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 26/106 (24%), Positives = 51/106 (48%), Gaps = 3/106 (2%) Query: 26 LSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLA 85 ++++L VCAV+AGA + D+ + F + +PV T+ R++ +D+ Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FEKMFIEWMQECHEITD---GEIIAIDGKTIRGSFDKGKRKGAIHM 128 ++ +W+ + +IA+DGK +RG+ R A+ M Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWM 106 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 36/81 (44%) Query: 11 SVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPV 70 PD R + V+H+ S IL + A AGA + I ++ H+ +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 DDTIARVVSNIDSLAFEKMFI 91 + T R ++ +D+ A +++ Sbjct: 109 ESTSRRFLAGVDATALDQVLG 129 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 39/93 (41%), Gaps = 1/93 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF- 64 L Y+S PD R+ + L+ +L ++ A+++GA +++I+ F E L Sbjct: 3 LKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQLH 62 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC 97 P +I + +D+ A E F Sbjct: 63 RKRAPAHTSIRYALQGLDAKAVELAFPRHASGL 95 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 3/90 (3%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD 314 W+G ++ + + R +++ Y ++S AK R HW +E+ LH D Sbjct: 4 WRG-SRMALRMRRRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKRD 62 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLL 344 + EDASR R+G A + ++ + LNLL Sbjct: 63 TVLGEDASRSRKGAAG--LMYLRDVILNLL 90 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 4/96 (4%) Query: 84 LAFEKMFIEWMQECHEITDG-EIIAIDGKTIRGSFDK--GKRKGAIHMVSAFSNENGVVL 140 AFE + ++WM + + DG + + DGKT+RGS D+ G I VS +S GV + Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQVKTE-AKSNEITAIPELLNLLYLKKNLITIDAMG 175 Q +S+E ++ LL+ + L L+ D +G Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQADEVG 97 >UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RIX6_9ACTO Length = 90 Score = 68.9 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 26/67 (38%) Query: 285 ISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 + + + R W IE+ LHWV DV E R R G + + ++ A+ Sbjct: 6 LPAAYAQPADLQQWARLEWHIENRLHWVRDVTFGEGTHRARTGTGPAVAAVLRNTAIGFH 65 Query: 345 RDCKDIK 351 R + Sbjct: 66 RGNGETN 72 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 67.8 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 1/63 (1%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 I+ L D+R+ H+L AIL + VCAVIA A+ ++I +G + WL+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFD 65 Sbjct: 61 PLP 63 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 26/59 (44%), Positives = 39/59 (66%) Query: 95 QECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEIT 153 + + + ++ DGKT+R S D+ K AIH+VSA+++ N +VLGQVKT+ KSNE Sbjct: 15 KVYQKSLKEKSLSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 36/158 (22%), Positives = 60/158 (37%), Gaps = 10/158 (6%) Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEI------SHGR 232 ++A+++ D+ + L +G+Q L A + + + + I + G Sbjct: 38 ELAAQVPDRISQPRLVTEGDQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTAGS 97 Query: 233 KETR-LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD 291 ++TR L V+ L F + + +K E K + Y I + Sbjct: 98 RQTRALKAVTVPAGLGFPHAAQAIQLTRTSRPINKNTKKTEGKRRQRRETVYAICTLPAH 157 Query: 292 ---AKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRR 326 E A IR HW IE L WV DV + ED + R Sbjct: 158 DALPAELATWIRGHWSIEVRLRWVRDVTLGEDLHQART 195 Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats. Identities = 12/35 (34%), Positives = 25/35 (71%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIA 39 +LL+ ++ PD+R++ V+H +A+L + VCA++ Sbjct: 60 ALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLT 94 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 65.9 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 48/226 (21%), Positives = 88/226 (38%), Gaps = 37/226 (16%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 L + I+ D R + VK +S I F+ + + + +E + + KK Sbjct: 18 HLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKALPK 73 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKM--------FIEWMQECHEITDGEIIAIDG------ 110 +P DTI RV+SN D ++ + I +++AIDG Sbjct: 74 KTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELFES 133 Query: 111 --KTIRGSFDKGKRKGAIH------MVSAFSNENGVVLGQVKTEAKSN-------EITAI 155 K + ++ G H + S +++ ++LGQ E K + EITA Sbjct: 134 TKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEITAG 193 Query: 156 PELLNLLYLK----KNLITIDAMGCQKDIASKIKDKKADYLLAVKG 197 L+ L+ + ++I DA+ C+ ++ D ++ VK Sbjct: 194 KRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKD 239 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 65.5 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 28/59 (47%), Positives = 36/59 (61%) Query: 58 LKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS 116 LK+YG F+ GI DTI +VS I + F+K FI+WM C E+ A DGKT+R S Sbjct: 12 LKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 65.1 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 2/62 (3%) Query: 307 HSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE--GCVKHR 364 H LHW LDV+ N+D SR+RRG AA ++ + LNLLR K + K C++ Sbjct: 23 HQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKRLLACMEDD 82 Query: 365 ER 366 R Sbjct: 83 FR 84 >UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3536 Length = 130 Score = 64.7 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 21/67 (31%), Positives = 31/67 (46%) Query: 253 FEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWV 312 +WKGLK+ R + E V +S+ +A +R HW IE+ LH+V Sbjct: 9 QDWKGLKQGFQITRERTVNGVTTVEVVHGITSLSADRANAGALLSLLRDHWRIENQLHYV 68 Query: 313 LDVKMNE 319 DV + E Sbjct: 69 PDVTLGE 75 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 63.9 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 48/113 (42%), Gaps = 1/113 (0%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD 314 W GL + + R + +R+ + S ++ A AIR H + WVL+ Sbjct: 7 WPGLTTVLATETLRG-GNGTDSVPAQVRHSLGSSTAPSEVLAQAIRRHGALATGEPWVLE 65 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERS 367 V E+ SR+R AA ++ ++++AL+ R + ++ + R Sbjct: 66 VSFGEERSRVRERCAARHLALLRRVALDRRRADASLTASRPAQDRGLGRRRHG 118 >UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C435B Length = 133 Score = 63.9 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 31/138 (22%), Positives = 42/138 (30%), Gaps = 20/138 (14%) Query: 192 LLAVKGNQGKLHHAFEEKFP---------------VNVFSNYKGDSFSTQEISHGRKETR 236 +L K NQ L E + G HGR ETR Sbjct: 1 MLTAKDNQPGLVADIEAGLGFEDAARGLAAATSPLTGPDARATGAPGHVGGPGHGRIETR 60 Query: 237 LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFA 296 V W GLK R K + E + ++ + DA+ Sbjct: 61 T-----VRATPLLTCHDRWTGLKHGSRITRARTVKGVTTVEVLHGITSLTVERADARALL 115 Query: 297 HAIRAHWLIEHSLHWVLD 314 +R+HW IE+ H V D Sbjct: 116 GLVRSHWRIENQRHDVRD 133 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 63.9 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 28/122 (22%), Positives = 48/122 (39%), Gaps = 18/122 (14%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIED------FGHERLE 56 ++ LL+ S PD R+ VKH+L+ +L + + + +E F Sbjct: 80 LKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSRPAFLQALQG 139 Query: 57 WLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI--------IAI 108 + +G DT+ARV+ I+ E+ FI ++ + IAI Sbjct: 140 LFPELETLPHG----DTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIAI 195 Query: 109 DG 110 DG Sbjct: 196 DG 197 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 13/55 (23%), Positives = 27/55 (49%) Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 + A +R +W IE+ +H+ D EDA+ GN ++ + +A+ ++ Sbjct: 89 VTAAYLHTHVRGNWGIENEVHYTRDAAWREDANPTYTGNTNHALASFRNLAIGVI 143 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 33/69 (47%), Gaps = 3/69 (4%) Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI-IAIDGKTIRGSFDK 119 + + P T+ + ID A + F W+ C +I G + +AIDGK +RG++ Sbjct: 28 HFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWL--CAQIARGRVALAIDGKVLRGAWSG 85 Query: 120 GKRKGAIHM 128 + A ++ Sbjct: 86 DESVTAAYL 94 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 53/385 (13%), Positives = 115/385 (29%), Gaps = 62/385 (16%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 + D R+Q ++K AI + + ++++ + ++ ++ Sbjct: 40 GFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLVPK 95 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE--------IIAIDGKTIRGS 116 + +P DT+ + + D + +Q E + + AIDG + + Sbjct: 96 NIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELFHT 155 Query: 117 FD-----------KGKRKGAIHMVSAFSNENG----------VVLGQVKTEAKSNEITAI 155 + K H V + + G + Q + E T Sbjct: 156 KAYRCPECLTREHRDKTTDYYHAV-VVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTVA 214 Query: 156 PELLNLLYLK----KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFP 211 L+ + ++ T+DA+ + + D A ++ +K + ++ F Sbjct: 215 QRLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACF- 273 Query: 212 VNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKK 271 +N DS + G + +W ++ + + Sbjct: 274 ----ANRLPDSTWEERDGKGNTVYVQAW--------DEEGLAQWPQVRVPMRIVKIIRHT 321 Query: 272 EDKSAEGVSIRYYI-----------SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNED 320 E + SS+ D + A A W IE+ L D Sbjct: 322 NKTVIEANKEVFVTDVVERWIATTCSSEKADTQTIAQIAAARWDIENIGFRNLKTFNALD 381 Query: 321 ASRIRRGNAAEIISGIKKMALNLLR 345 + A + + G + +A NL R Sbjct: 382 HCFVHDSVAIKAMIGFQVLAFNLKR 406 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 61.6 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 26/57 (45%), Positives = 37/57 (64%), Gaps = 3/57 (5%) Query: 263 VALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNE 319 + +FR +K + RYYISSK++ A++ A+ + HW IE S+HWVLDV MNE Sbjct: 1 MVENFRFVIGNKLV--LEYRYYISSKELTAEQAANTVSEHWGIE-SMHWVLDVSMNE 54 >UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQC4_9GAMM Length = 96 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 1/56 (1%) Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 M ++ R HW I SLH++ D NED +IR G+ ++ + + A+ +L+ Sbjct: 1 MTPQQVLAINRGHWSI-ASLHYISDWNYNEDRGQIRTGHGPANVTRLCRFAIGVLK 55 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 48/119 (40%), Gaps = 9/119 (7%) Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSK-- 288 G T S + + G ++ +K ++ Y ++S Sbjct: 20 GEVWTYRVWASP----YLPEEMRAFPGCGQVVRMEREVVRKGTGEVRR-TVSYALTSLGP 74 Query: 289 -DMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRD 346 DA+ + + W +E+ WV D ++EDA ++R G A++++ ++ ++LL Sbjct: 75 EVADARRLGELLLSRWEVENRSFWVRDFLLHEDACQVR-GVGAQVLAALRAFLVSLLHR 132 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 52/338 (15%), Positives = 96/338 (28%), Gaps = 60/338 (17%) Query: 8 DYISVTPDIRQQGKVKHKLSAILF--LTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 + PD R L +L L + A+ A + F L+ ++ Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKA-----PSLLAFQRRTLDHNLRHVFGL 79 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFI--------EWMQECHEITDGEIIAIDG------- 110 G P D + V+ ++D +F + + + + ++A+DG Sbjct: 80 TGRPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQ 139 Query: 111 -----KTIRGSFDKGKRKGAIHMVSA--FSNENGVVLG------QVKTEAKSN--EITAI 155 + G M+ A + VL Q N E A Sbjct: 140 KVHCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAA 199 Query: 156 PELLNLL-----YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKG-NQGKLHHAFEEK 209 L L L+ DA ++ + +LL VK + L Sbjct: 200 RRWLGRFREEHPDLA-VLVVEDARSSNAPHVRDLQKARCHFLLGVKAADHAHLF------ 252 Query: 210 FPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQ 269 +V + +F E + R R + + L + Sbjct: 253 --AHVCARQDQHAFEVVEDADPRTGLRRSYLWIADLPLNESNDDVRVNFVHLV------E 304 Query: 270 KKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEH 307 D + + ++ + ++ A A RA W IE+ Sbjct: 305 LDPDGTPREWTWVADMAVTGANVRQLARAGRARWRIEN 342 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 20/100 (20%), Positives = 40/100 (40%), Gaps = 1/100 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWL-KKYGDF 64 D S D+R+ ++ L +L V ++++G+ ++++ F E+L L + +G Sbjct: 8 FGDVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTS 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE 104 P I + +D E+ F E G Sbjct: 68 WRKAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36C2 Length = 109 Score = 57.8 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 2/71 (2%) Query: 263 VALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDAS 322 R+ + E V +S DA R HW IE+ LH+ DV + ED Sbjct: 3 RLERRRKANGKATVEVVYGITSLSRLAADAAALLGYSRRHWGIENGLHYTRDVTLGEDRC 62 Query: 323 RI--RRGNAAE 331 + R Sbjct: 63 PVGARSRPTPR 73 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 55.9 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 13/42 (30%), Positives = 24/42 (57%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEI 47 L+ S+ PD R ++ L ++ +T+ AV+ GAD W ++ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDV 43 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 55.9 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 11/46 (23%), Positives = 18/46 (39%), Gaps = 1/46 (2%) Query: 8 DYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHE 53 + PD R V+H+L +L L AV+ G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAA 114 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 21/61 (34%), Positives = 32/61 (52%) Query: 159 LNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY 218 L++ L ++ + IDA+G Q IA +I + ADY+LA+K NQ A F + Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAESVDL 76 Query: 219 K 219 K Sbjct: 77 K 77 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 19/64 (29%), Positives = 26/64 (40%), Gaps = 8/64 (12%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++ L Y D R +HKL ++ + +CAVIAGAD IE WL Sbjct: 20 LRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE--------WLAGRL 71 Query: 63 DFDN 66 Sbjct: 72 QLPT 75 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 11/59 (18%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 L D D RQ K +H L +L +T+ EI + +E+L+WL++Y Sbjct: 35 LADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 >UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7B Length = 481 Score = 52.0 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 44/287 (15%), Positives = 93/287 (32%), Gaps = 59/287 (20%) Query: 47 IEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEII 106 IE G R L+++ D+G + + I + F+ + + E++ Sbjct: 81 IEHQGSGRQAHLRRHRQPDDG--CHEAFYGKLRRI-PRGLSEAFLRDVTDRFTALFPEVV 137 Query: 107 A--------------IDGKTIR----GSFDKGKRKGAIH---MVSAFSNENGVVLG-QVK 144 A +DGK+++ D G + ++ A+ +G+VL Sbjct: 138 AHRLPTSFDRLEVLILDGKSLKKVAKRLVDTRGTPGKLLGGKLLVAYRPRDGLVLDMAAD 197 Query: 145 TEAKSNEITAIPELLNLLYLKK---NLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGK 201 + ++NE IP+L+ ++ + L+ D + C ++ ++ Sbjct: 198 LDGETNEAKLIPDLMPRVHARGGPAKLVVGDRLFCASKHFAEFTKDNGHFV--------- 248 Query: 202 LHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKL 261 ++ + K + +T + S E+ W G K Sbjct: 249 ----VRYARTLSFEPDPKRPAVTTADPSQRAVVE----------------EWGWAGKPKD 288 Query: 262 CVALSFRQ-KKEDKSAEGVSIRYYIS-SKDMDAKEFAHAIRAHWLIE 306 + R+ E ++I + S A + R W IE Sbjct: 289 KLRRYVRRITVARPVGEAITILTDLLDSAPYPATDLLDLYRIRWTIE 335 >UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P672_AZOSE Length = 47 Score = 51.2 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 15/31 (48%), Positives = 20/31 (64%) Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEI 332 HW +E+ LHW L+V+ NED SR+R A Sbjct: 1 HWGVENWLHWCLNVQFNEDRSRVRSAYAVNN 31 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 50.8 bits (120), Expect = 9e-05, Method: Composition-based stats. Identities = 13/45 (28%), Positives = 24/45 (53%) Query: 134 NENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 G+ + Q++ +NEIT LL+ L++ +T DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 50.1 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 32/55 (58%), Gaps = 1/55 (1%) Query: 8 DYISVTPDIRQQG-KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 ++ PD R++ ++HK IL + +CA+I GAD W + +FG + +W + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 67/176 (38%), Gaps = 21/176 (11%) Query: 43 EWQEIEDF-----GHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC 97 ++E+ F G + L +Y +FDN P + + + + I AFE +F E+ + Sbjct: 40 SFEEVMKFMLTMEGKALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFLFQEFTKSF 99 Query: 98 HEI---TDGEIIAIDGKTIRGSFDK------------GKRKGAIHMVSAFS-NENGVVLG 141 + +IA DG + + + K +H+ + + Sbjct: 100 TDNVTYNGLRLIACDGSDLCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDA 159 Query: 142 QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKG 197 ++ +NE A+ E+++ + D +I + ++ K YL+ VK Sbjct: 160 IIQPSRLANERRAMCEMIDRYNDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKD 215 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 23/48 (47%) Query: 47 IEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWM 94 + F + + +K D G P DT+ RV + I+ F +MF W+ Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWI 48 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 36/84 (42%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL +S PD R+ ++ L ++L L + AV+ GA I F + L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 DNGIPVDDTIARVVSNIDSLAFEK 88 + P T+ + +N+ + Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 48.5 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 21/108 (19%), Positives = 40/108 (37%), Gaps = 6/108 (5%) Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIE-WMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 G P + + ++D + + ++ +DG T+R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKNLITI 171 +H+ ++ GV+L QV + K+NE + L + L LIT Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLITA 134 >UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromonas gingivalis ATCC 33277 RepID=B2RI66_PORG3 Length = 87 Score = 47.4 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 22/47 (46%) Query: 17 RQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 R + K + L + + + ++G W EIED+ E E LK + Sbjct: 23 RIESKEVYPLDFLFLIVFLSTLSGDTSWYEIEDYAEEYEEVLKSRYE 69 >UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus pneumoniae RepID=B2ISL1_STRPS Length = 535 Score = 47.4 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 69/201 (34%), Gaps = 29/201 (14%) Query: 18 QQGKVKHKLSAILFLTVCAVIAG-ADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIAR 76 Q+ ++ S IL + ++ G ++ E L + G T++R Sbjct: 142 QRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGGQL----ASQPTLSR 197 Query: 77 VVSNIDSL----------AFEKMFIEW--MQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +S D + F+++ + + D GK +++ R Sbjct: 198 FLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTYGKQEGVAYNAHYRAH 257 Query: 125 AIHMVSAFSNENGVVL-GQVKTEAK--SNE----ITAIPELLNLLYLKKNLITIDAMGCQ 177 H + AF + G Q++ + S E IT + E N L L +D+ Sbjct: 258 GYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQL-----LFRMDSGFAT 312 Query: 178 KDIASKIKDKKADYLLAVKGN 198 + I+ YL+ +K N Sbjct: 313 PKLYDLIEKTGQYYLIKLKKN 333 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats. Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 12/65 (18%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTV------CAVIAGADEWQEIEDFGHER 54 M ++ L+++IS+ PD RQ KV+HKL IL + C ++ FG Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 LEWLK 59 L++LK Sbjct: 55 LDFLK 59 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 17/30 (56%), Positives = 25/30 (83%) Query: 128 MVSAFSNENGVVLGQVKTEAKSNEITAIPE 157 MV+A + NG+ +GQ+K ++KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 >UniRef50_UPI00016C3A84 hypothetical protein GobsU_12175 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A84 Length = 100 Score = 45.5 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 3/68 (4%) Query: 268 RQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRG 327 R ++ A + +SS + A E A IR HW IE +++ VLDV D R R Sbjct: 3 RDRQVKGKANESTAHDDLSSLRVGAAELAGYIRRHWHIE-AMNGVLDVAFRVD--REHRP 59 Query: 328 NAAEIISG 335 ++++ Sbjct: 60 TRRQVLAL 67 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 17/29 (58%), Positives = 18/29 (62%) Query: 309 LHWVLDVKMNEDASRIRRGNAAEIISGIK 337 +HW LDV MNED RIRRGN IK Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIK 29 >UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW00_RHOSR Length = 98 Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 21/52 (40%), Positives = 29/52 (55%), Gaps = 5/52 (9%) Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNL 161 GKT RG+ D H+++A ++ GVVL QV A+ NEI P LL+ Sbjct: 18 GKTWRGAKD--GSGHLTHLLAAVDHDAGVVLRQVAVGARINEI---PLLLDP 64 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 44.3 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 13/66 (19%), Positives = 24/66 (36%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 + L + PD + V H+L+ +L +CAV + I ++ + Sbjct: 11 TAAGLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGAR 70 Query: 62 GDFDNG 67 G G Sbjct: 71 GGHRPG 76 >UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillaceae RepID=C5D2E6_GEOSW Length = 437 Score = 44.3 bits (103), Expect = 0.008, Method: Composition-based stats. Identities = 53/350 (15%), Positives = 116/350 (33%), Gaps = 74/350 (21%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEI-EDFGHER-LEWLKK 60 + L+D + D R Q + + IL+ + + G + + E F + +E ++ Sbjct: 28 FKDLVDQLKKVKDKRHQSYITYGPETILYTILLKSVFGIKSMRSMTELFNKDECIENIRV 87 Query: 61 YGDFD--NGIPVDDTIARVVSNIDSLAFEKMFI---------EWMQECHEITDGEIIAID 109 N +P DTI ++ ++ E + I ++ + I D Sbjct: 88 VLGLKELNELPHYDTINDFLAKLEPKELETIRIYLIKKLFEKRCLESFRILNKYWPIVFD 147 Query: 110 GK------------TIRGSF-DKGKRKGAI----HMVSA--FSNENGVVLGQVKTEAKSN 150 G +R + DK + + H++ A + + + E +S Sbjct: 148 GTGIHTFKEKHCEHCLRREYKDKETGETKVVYMHHVLEAKLVVGDMVLSIATEFIENESE 207 Query: 151 -------EITAIPELLNLLY-----LKKNLITIDAMGCQKDIASKIKDKKAD-YLLAVKG 197 E+ A L++ L L LI D++ + + I DK Y+ Sbjct: 208 NVPKQDCELKAFMRLVDKLKKTFKRLPICLI-ADSLYACEPVFE-ICDKHNWKYIF---- 261 Query: 198 NQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKG 257 +F + + + Q + K + V+++ + Sbjct: 262 -----------RFKEDRIKTVSQEFRAIQSLETNGKSSEYFWVNDIAYNDRL-------- 302 Query: 258 LKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEH 307 + + + + +K E + I + ++ +A+ A R W IE+ Sbjct: 303 ---VNLVEKVKVTENEKKQEFLFITNFRITER-NAEILVQAGRRRWKIEN 348 >UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUW9_9BACT Length = 61 Score = 43.9 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 10/33 (30%), Positives = 17/33 (51%) Query: 312 VLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 + D ED S+IR NA ++ +K + + L Sbjct: 1 MRDTSFREDHSQIRTQNAPRAMASLKNLVVGLF 33 >UniRef50_D1RJD3 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RJD3_LEGLO Length = 61 Score = 43.9 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 1/55 (1%) Query: 22 VKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIAR 76 ++ L I+FL + I DFG ++EWLK++ + NG+PVDDT+ R Sbjct: 2 KRYLLIKIMFLLLVLQFMDVKAGT-IRDFGLLKIEWLKQFLTYKNGMPVDDTMTR 55 >UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6C2C4_9GAMM Length = 77 Score = 43.9 bits (102), Expect = 0.010, Method: Composition-based stats. Identities = 9/42 (21%), Positives = 18/42 (42%) Query: 316 KMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 ED R+ A + ++K+A++LL + K + Sbjct: 21 SFREDECRVHDPMAGGNFALLRKIAISLLVRDRSNKTSLRGR 62 >UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TLQ7_SHEHH Length = 74 Score = 43.5 bits (101), Expect = 0.013, Method: Composition-based stats. Identities = 10/62 (16%), Positives = 24/62 (38%) Query: 7 LDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDN 66 +++S+ R +H I+FL A+ + + W +I++F + Sbjct: 4 FEHLSIIKAPRSSINHEHDPVDIMFLVNSAIASDCEGWLDIDEFDRIDDRKNAERMALIR 63 Query: 67 GI 68 + Sbjct: 64 RM 65 >UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3YV03_9SYNE Length = 113 Score = 42.8 bits (99), Expect = 0.020, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 29/65 (44%), Gaps = 2/65 (3%) Query: 281 IRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMA 340 +++S K +R W IE+ H+ + +++E A + N A ++ K Sbjct: 17 THLFLTSLSSTPKTLLQLVRDRWSIEN-WHFFRNTQLHESAH-GYQDNGACAMTTQKTGT 74 Query: 341 LNLLR 345 NLLR Sbjct: 75 QNLLR 79 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 41.2 bits (95), Expect = 0.055, Method: Composition-based stats. Identities = 33/249 (13%), Positives = 79/249 (31%), Gaps = 28/249 (11%) Query: 11 SVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG--- 67 + D R +V+H L+ IL + A+ G ++ +++ + G + Sbjct: 52 AAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDLDR-LRNDPAFKLACGRLPDSGQD 110 Query: 68 IPVDDTIARVVSNID---SLAFEKMFIE-WMQECHEITDGEIIAID-------GKTIRGS 116 + T +R+ + D + ++ ++ W+ + ID G Sbjct: 111 LCSQPTCSRLENLPDLRTVIRLGRVLVDLWLSSYPAPPKSVTLDIDDTLDVVHGHQQLSL 170 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAK---SNEITAIPELLNLL-----YLKKNL 168 F+ + + + G + + K EI L + L Sbjct: 171 FNGHHDERCFLPIHIYDAATGRPVAMILRPGKTPSGKEIRGHLRRLARCIRARWPDTRIL 230 Query: 169 ITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEI 228 + D+ + ++ + ++ DY+ + GN + + + + + Sbjct: 231 VRGDSHYGRVEVMAWCEENAIDYVFGLAGN-----KVLKRLVDASADDIRTRRALEQKPV 285 Query: 229 SHGRKETRL 237 G ETR Sbjct: 286 LRGYVETRY 294 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 382 e-104 UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 379 e-104 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 373 e-102 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 373 e-102 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 371 e-101 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 369 e-101 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 366 e-100 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 365 1e-99 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 353 7e-96 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 346 1e-93 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 338 2e-91 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 338 2e-91 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 338 2e-91 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 334 3e-90 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 334 5e-90 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 333 5e-90 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 332 1e-89 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 330 6e-89 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 326 9e-88 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 326 1e-87 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 321 3e-86 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 319 1e-85 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 319 2e-85 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 317 3e-85 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 316 1e-84 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 314 2e-84 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 314 5e-84 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 314 5e-84 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 312 1e-83 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 312 2e-83 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 309 2e-82 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 306 1e-81 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 302 2e-80 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 299 1e-79 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 298 2e-79 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 296 1e-78 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 290 5e-77 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 289 8e-77 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 286 7e-76 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 284 3e-75 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 284 5e-75 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 284 5e-75 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 279 9e-74 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 277 5e-73 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 274 5e-72 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 272 1e-71 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 265 2e-69 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 264 4e-69 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 259 9e-68 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 259 1e-67 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 257 5e-67 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 256 9e-67 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 250 8e-65 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 249 1e-64 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 248 3e-64 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 247 5e-64 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 240 5e-62 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 240 7e-62 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 240 7e-62 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 235 2e-60 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 235 2e-60 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 233 1e-59 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 232 1e-59 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 226 1e-57 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 225 2e-57 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 224 5e-57 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 223 7e-57 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 221 3e-56 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 220 4e-56 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 217 5e-55 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 212 2e-53 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 211 3e-53 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 204 4e-51 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 200 7e-50 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 198 3e-49 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 184 4e-45 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 184 5e-45 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 181 2e-44 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 178 2e-43 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 171 3e-41 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 170 7e-41 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 170 8e-41 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 169 1e-40 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 163 1e-38 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 161 4e-38 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 159 2e-37 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 159 2e-37 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 156 1e-36 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 153 1e-35 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 151 4e-35 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 151 4e-35 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 151 5e-35 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 150 7e-35 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 148 2e-34 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 147 7e-34 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 144 6e-33 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 143 1e-32 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 141 4e-32 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 140 6e-32 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 138 2e-31 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 138 3e-31 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 138 4e-31 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 137 6e-31 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 136 1e-30 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 136 2e-30 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 135 2e-30 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 133 1e-29 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 132 2e-29 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 132 2e-29 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 131 4e-29 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 129 1e-28 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 129 2e-28 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 129 2e-28 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 123 1e-26 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 122 2e-26 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 122 2e-26 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 122 2e-26 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 122 2e-26 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 118 4e-25 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 117 8e-25 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 111 5e-23 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 110 8e-23 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 110 1e-22 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 108 4e-22 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 108 5e-22 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 104 5e-21 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 103 1e-20 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 103 1e-20 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 101 4e-20 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 101 5e-20 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 101 6e-20 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 100 8e-20 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 100 8e-20 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 100 2e-19 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 100 2e-19 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 99 2e-19 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 97 8e-19 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 96 1e-18 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 96 2e-18 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 96 2e-18 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 96 3e-18 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 96 3e-18 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 95 4e-18 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 94 8e-18 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 94 8e-18 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 94 9e-18 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 93 1e-17 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 93 2e-17 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 92 3e-17 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 91 4e-17 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 90 2e-16 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 90 2e-16 UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ... 89 2e-16 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 89 2e-16 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 89 2e-16 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 89 3e-16 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 88 4e-16 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 88 6e-16 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 87 1e-15 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 87 1e-15 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 87 1e-15 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 86 2e-15 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 85 3e-15 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 85 4e-15 UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus p... 85 6e-15 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 84 8e-15 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 83 2e-14 UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferro... 83 2e-14 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 82 4e-14 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 81 5e-14 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 81 7e-14 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 80 9e-14 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 80 1e-13 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 80 2e-13 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 78 7e-13 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 77 1e-12 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 76 3e-12 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 76 3e-12 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 75 4e-12 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 75 4e-12 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 73 2e-11 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 73 2e-11 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 73 2e-11 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 73 2e-11 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 72 3e-11 UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 71 8e-11 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 71 8e-11 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 70 2e-10 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 68 5e-10 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 68 6e-10 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 66 2e-09 UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=... 65 4e-09 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 65 7e-09 UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=... 64 1e-08 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 62 4e-08 UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitroco... 61 5e-08 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 61 5e-08 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 60 1e-07 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 59 4e-07 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 55 3e-06 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 55 4e-06 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 54 7e-06 UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata ob... 54 8e-06 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 54 1e-05 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 53 2e-05 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 50 1e-04 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 50 1e-04 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 48 5e-04 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 48 7e-04 UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aro... 47 9e-04 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 47 0.001 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 46 0.002 Sequences not found previously or not previously below threshold: UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltapr... 61 7e-08 UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanoba... 60 1e-07 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 58 5e-07 UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkalip... 56 2e-06 UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillace... 53 2e-05 UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM 52 3e-05 UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostri... 49 3e-04 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 49 3e-04 UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitroso... 48 8e-04 UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia ... 47 0.001 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 46 0.003 UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitroco... 46 0.003 UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia R... 45 0.004 UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteoba... 45 0.005 UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=... 45 0.005 UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID... 44 0.008 UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001... 44 0.009 UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus... 43 0.014 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 43 0.017 UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synecho... 43 0.024 UniRef50_A1TX01 Transposase, IS4 family protein n=5 Tax=Marinoba... 43 0.026 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 42 0.036 UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodoco... 42 0.039 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 42 0.039 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 42 0.040 UniRef50_A3CU17 AAA ATPase n=1 Tax=Methanoculleus marisnigri JR1... 41 0.057 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 382 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 154/369 (41%), Positives = 214/369 (57%), Gaps = 10/369 (2%) Query: 2 SIQSLLDYISVTPDIRQQG-KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 +++S +Y D R++ +H IL + VCA+I+GA+ + EIE FGH + EW + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 + NGIP DT V++ + FE F+ W GE IAID KT+RGS DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 K +H+VSA++ E +V+GQ+KTE SNEITAIPELLN L LK L++IDAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS---NYKGDSFSTQEISHGRKETRL 237 A KI +K ADY+LA+KGNQ KLH + E F + + Y+ D T E S+GR+E R Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 +N + EWK +K + + S R KKE IRYYISS + A++ Sbjct: 245 AYATN--EIEKIIANDEWKNIKTVAMIESQRIKKEK----EFDIRYYISSAKLSAEDCLK 298 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 +R HW IE+ LHW LDV ED SRIR+ N AE ++ ++++ALNL++ K K + K Sbjct: 299 VVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATK 358 Query: 358 EGCVKHRER 366 E+ Sbjct: 359 RLMAGWDEK 367 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. Identities = 209/373 (56%), Positives = 273/373 (73%), Gaps = 3/373 (0%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M ++ L+++IS+ PD RQ KV+HKLS IL LT+CAVI+GA+ W++IEDFG L++LK+ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 YGDF+NGIPV DTIARVVS I F + FI WM++CH D ++IAIDGKT+R S+DK Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 +R+GAIH++SAFS + +V+GQ+KT+ KSNEITAIPELLN+L +K +IT DAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A KI+ + DYL AVKG QG+L+ AFEEKFP+ +N + DS++ E SHGR+E RLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQ-KKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 +V DF FEWKGLKKLCVA+SFR E K +++RYYISS D+ A++FA AI Sbjct: 241 CDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK-E 358 R HW +E+ LHW LDV MNED +IRRGNAAE+ SGI+ +A+N+L + K K +K Sbjct: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 Query: 359 GCVKHRERSSEVH 371 R + V Sbjct: 360 KAAMDRNYLASVL 372 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 143/387 (36%), Positives = 224/387 (57%), Gaps = 25/387 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 ++ DY D R + KHKL I+ +T+CAVI GAD W +IE FG + +WLKK+ + Sbjct: 8 TIEDYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLEL 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NGIP DT RV S ++ +++F++W+Q T GEI+AIDGKT+R S+D+ K K Sbjct: 68 PNGIPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKP 127 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPE---------------LLNLLYLKKNLI 169 A+ M+SA++ NG+VLGQ + KSNEITAIP+ LL +L L ++ Sbjct: 128 ALQMISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIV 187 Query: 170 TIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG---DSFSTQ 226 T+DA+GCQK+I +I ++ ADY++ +K NQG L+ E F + SN++G + + Sbjct: 188 TLDAIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVK 247 Query: 227 EISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYIS 286 + HGR+E R + + + D +++W L + R + + RY+IS Sbjct: 248 DEGHGRQEVRYYQMLSNVA-EEIDPDWQWLNLNSIGYVEYLR-VENGTDKTSLERRYFIS 305 Query: 287 SKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRD 346 S + + K FA ++R HW IE+ HW+LDV+ NED SRIR+ NA ++ ++ +ALNLL+ Sbjct: 306 SLNNNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQ 365 Query: 347 CKDIKGEEEKKEGCVKHRERSSEVHFL 373 K +K + K ++ + ++L Sbjct: 366 EKTLKVGVKAKR-----KKAGWDENYL 387 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 128/357 (35%), Positives = 198/357 (55%), Gaps = 6/357 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL Y+ D R Q KH L +L + + AVIAG+ W+++E++G + EWL ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +GIP DDT RV ID + +K +W+Q GEII IDGKT+RGS+D+ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A++ V+A++++ +VLGQVK E SNEITAIP LL LL + ++ITIDAMG Q I +I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKETRLHIVS 241 +KADY++ +K N L ++ F + + + D + + H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 V + + +W GL+ + V R I++Y++S +A+ HAIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWNKT---THDIQFYLTSLPPNAQFLCHAIRT 326 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 HW IE++LHW LDV +ED RIR + + + ++++ALN+L K K +K Sbjct: 327 HWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKM 383 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 171/361 (47%), Positives = 231/361 (63%), Gaps = 8/361 (2%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ SL++ S+ D RQ+ K+ H+L IL L V AVI GA+ WQ+IE+ GH RL WL++ Sbjct: 2 IARTSLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQE 61 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 G F GIPVDDTIAR++S+++ ++ FI+WM E TDG+IIA+DGK+IR S+DK Sbjct: 62 RGFFKKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKK 121 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 KRK AIHMVSA++ ENGVVLGQ KT+ KSNEI AIP LL+LL +K ++TIDAMGCQ+ I Sbjct: 122 KRKSAIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKI 181 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRL 237 A KI K+ DY+LAVK NQ +LH + F + F + D F HGR E R Sbjct: 182 AEKIVTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRR 241 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 + +S++ L+ W L+ + + S R +AE RY+I+S DAK FA+ Sbjct: 242 YWISDM--LSTLGNPERWASLQSIGMVESERYIDGKTTAET---RYFITSIAPDAKIFAN 296 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 A+R HW IE+ LHWVLDV ED SR+RR NA+E + +A+N LR+ K K + K Sbjct: 297 AVRKHWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAK 356 Query: 358 E 358 Sbjct: 357 R 357 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 369 bits (947), Expect = e-101, Method: Composition-based stats. Identities = 136/375 (36%), Positives = 216/375 (57%), Gaps = 12/375 (3%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 S++++ S D R ++++ L I+ +T+CAV+ GAD W E+ ++G + +WLK++ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGK 121 NG+P DT V + + ++ F+ W Q ++++ GE+IAIDGKT+RG+ G+ Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 + IHMVSA+++ N +VLGQ + KSNEITAIPELL +L L+ L++IDAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRLH 238 I + + DY+LA+KGNQG L++ + F F + DS+ T E HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHA 298 + + ++ W LK + S R++ + RYY+ S + DA+ FA A Sbjct: 245 W--TMGQTDYLLGAERWAQLKSIGCVESCRRQPGHPG--TLQRRYYLLSIESDAQRFADA 300 Query: 299 IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 +R+HW IE+ LHW+LDV ED R +G +A+ +S I+ +A NLL+ K + K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 Query: 359 GCVKHRERSSEVHFL 373 + + ++L Sbjct: 361 L-----KAGWDDNYL 370 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 366 bits (940), Expect = e-100, Method: Composition-based stats. Identities = 161/371 (43%), Positives = 222/371 (59%), Gaps = 6/371 (1%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M I+S + S D RQ KV + L +LF ++CAVIA ++ W EI ++ W KK Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 F +GIP DDTIAR+VS ID +F F+ WM+ H++T+GE+IAIDGKT+RGS+++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 R IHM+SA+++ N +VLGQ+K E KSNEITAIP LL +L L+ L+TIDAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A+ I DK DYLLAVK NQG L A + F + + D E SHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGLSDD-HVNIEKSHGRIENRTCYV 239 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 + L+ W+ LK + + SFR K + + RYYISSK + A++ A R Sbjct: 240 LSSAALD--GDFTHWEALKSIVMVESFRAVKGKTA--SLEYRYYISSKVLSAEQALSATR 295 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 HW IE S+HWVLDV MNED +I + N AE ++ ++ M+LN+L+ K++ C Sbjct: 296 EHWGIE-SMHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPTKLSIVGKRKRC 354 Query: 361 VKHRERSSEVH 371 + + +V Sbjct: 355 LMNPAFLEKVL 365 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 365 bits (937), Expect = 1e-99, Method: Composition-based stats. Identities = 172/370 (46%), Positives = 250/370 (67%), Gaps = 4/370 (1%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 MS +L++ +S+ D RQ KV H L +LFL + AVI+G + W+EI+DFG+++L+WL+K Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 Y F GIP DDTI+R+ ID F+K F WM+ C E++ G++IAIDGKT+RGSF+K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 + IHMVSAF+ N VVLGQVKT AKSNEITAIP+LL+LL ++ L+TIDAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A KI DK DYLL VKGNQ +L A + F + + ++++T+E HGR+++R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 ++ + FEW GLK L A+SFR +K+ ++ V++++YISS +DAK A R Sbjct: 241 ADANEIGDL--VFEWPGLKTLGYAVSFRTEKDMQT--TVAVKFYISSAKLDAKSLLEASR 296 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 AHW +E++LHW LD+ MNED+ RIR+ N+ E ++ ++ +LNLL++ K G ++K Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQ 356 Query: 361 VKHRERSSEV 370 + E+ Sbjct: 357 ANRSDSYREL 366 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 353 bits (905), Expect = 7e-96, Method: Composition-based stats. Identities = 150/368 (40%), Positives = 212/368 (57%), Gaps = 9/368 (2%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L + S D RQ+ KV + L IL LT+CAV++GA++W I +G ++L +LK++ F Sbjct: 25 FLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPFA 84 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 +G P D + + + +D+ AF+ FI+W+ ++ G ++AIDGKT R S DK K A Sbjct: 85 DGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKAA 143 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 IHM+SA+S+E + L Q + + KSNEITAIPELL LL LK ++TIDAMGCQ++IA+KI Sbjct: 144 IHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKII 203 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYK---GDSFSTQEISHGRKETRLHIVSN 242 K+ADY+LA+KGNQG L E +Y T E SHGR ETR V Sbjct: 204 SKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVCT 263 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 +++ + W GLK + + ++ AE RYYISS DA+ A AIR H Sbjct: 264 --DIDWLKADHNWPGLKSIVMVQYHAILQDKTRAET---RYYISSMTSDAEHHAKAIRDH 318 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 W IE+ LHWV+D+ +D RIR GNA + IK +A N+LR K K+ Sbjct: 319 WGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVKGKHSLRSKRHIASW 378 Query: 363 HRERSSEV 370 + +E+ Sbjct: 379 DDDFLAEI 386 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 346 bits (886), Expect = 1e-93, Method: Composition-based stats. Identities = 133/367 (36%), Positives = 193/367 (52%), Gaps = 6/367 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L Y D R + H+L I+ + + AV+AGAD W IE +G + WL+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 NGIP DT ARV + +D A E F W++ ++IAIDGKT +GS+D+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 + +VSA+++E+ +VLGQ + KSNEITAIP LL L L +++IDAMG + IA++I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPV---NVFSNYKGDSFSTQEISHGRKETRLHIVSN 242 ++ADY+LA+KGNQ L ++ F + + E +H R E+R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 V ++ +W GL+ L V S R + RY++SS DA FAH IRAH Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNK---DTTETRYFLSSLSTDAATFAHYIRAH 310 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 W IE+ LHW LDV NED SRIR+ +A S ++++ LNLL K+ Sbjct: 311 WGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSSKGSLVMKRYRAGL 370 Query: 363 HRERSSE 369 + + Sbjct: 371 DDQFMMQ 377 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 129/374 (34%), Positives = 189/374 (50%), Gaps = 20/374 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 +L + D R Q +H L+ IL + CA++ G + +E FG+ + WL+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE--------IIAIDGKTIRG 115 NGIP DT +V S +D F + F W Q E +IAIDGK +RG Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 + DKG + +V A+++E + LGQVK KSNEI A+PELL +L LK ++TIDAMG Sbjct: 134 AVDKG--QAPAVIVGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVF-SNYKGDSFSTQEISHGRKE 234 CQ+++A KI +K DY+LA+K NQ LH + +G+ + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 R VS + +W GL+ + R V RY+ISS DA Sbjct: 252 VRRCWVSEEVEC-WLQGAEKWAGLRSVAAVECERTVAGQT---TVQRRYFISSLKADAAL 307 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 A ++RAHW IE+SLHWVLDV ED SR RRG +AE ++ ++++ +++ Sbjct: 308 IAASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENP----- 362 Query: 355 EKKEGCVKHRERSS 368 K+ + R + Sbjct: 363 NSKKSVNQRRFEAG 376 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 338 bits (866), Expect = 2e-91, Method: Composition-based stats. Identities = 131/371 (35%), Positives = 212/371 (57%), Gaps = 16/371 (4%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q+L+++ D R +G+ H+L +L + +C ++ G + + ++EDFG + +W K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 +GIP DT RV + + AF F+ W Q EI+A+DGK +R + ++G + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQG--Q 124 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 +VSA++ N +VLGQ++ K+NEITA+P+LL +L L ++T+DAMGCQK+IA + Sbjct: 125 SPRVIVSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGD----------SFSTQEISHGRK 233 I + A+Y+LA+KGNQG+ H + V + + T E HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAK 293 ETR + S +++ +W GL+ + V S RQ + A V RYY+SS ++D + Sbjct: 245 ETRRYWQS--GDVSWLADRQQWAGLRSVGVVESVRQV--GQQAPTVERRYYLSSLNVDVE 300 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGE 353 +FA A+R HW +E+SLHWVLDV+ ED +R R G+AAE ++ ++++ALNLL+ K Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 EEKKEGCVKHR 364 + K+ Sbjct: 361 IKGKQLNASWD 371 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 338 bits (866), Expect = 2e-91, Method: Composition-based stats. Identities = 132/372 (35%), Positives = 190/372 (51%), Gaps = 13/372 (3%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 ++D D R K HK+ I+++++ AVI GA W EIE+FG+ ++ + K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS------FD 118 IP DT R S I FE +F W+++ + G ++AIDGK +RG Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 GK + MVSA+S NG+ LGQVK + KSNEITAIP L+N L L ++TIDAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKET 235 DI I ++ A+Y++A+K N+ K + ++ + + ++ HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD-AKE 294 R V + + F+ + GLK + S R +RYY++S D +E Sbjct: 243 RTCTVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIV-ATGEYTQEVRYYVTSLDNTKPEE 301 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 A AIR HW IE++LHW LDV ED S+ + NAA S KMAL +L+ K KG Sbjct: 302 IASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGSM 360 Query: 355 EKKEGCVKHRER 366 K E+ Sbjct: 361 NLKRLKAGWDEK 372 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 334 bits (856), Expect = 3e-90, Method: Composition-based stats. Identities = 129/371 (34%), Positives = 189/371 (50%), Gaps = 8/371 (2%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + +++ D R + +H+LS +L + VCAV++GAD+++EI +G ++ WL+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE-IIAIDGKTIRGSFDKGK 121 D G+ DT RV + +D FE+ F W+ + +IAIDGK+ R + K Sbjct: 66 RLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKAA 125 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 +H+VSAF+ GVVLGQ T KSNEITAIPELL +L ++ ++TIDAMG Q IA Sbjct: 126 AA-PLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKIA 184 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 I+++ A Y+L VK N KL + + T HGR E R Sbjct: 185 RAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTAF 244 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 + T WK + V R E S YYISS DA+ A AIR+ Sbjct: 245 DATD--RLHKAEAWKDVASFAVVERVRTVGERTST---ERVYYISSLPADAERIAVAIRS 299 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 HW +E+ LHW LDV+ +D +R R G+ A ++ ++ MALNL+R K IK + K Sbjct: 300 HWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSIKTKRLLA 359 Query: 362 K-HRERSSEVH 371 E + + Sbjct: 360 ATSDEFRAALL 370 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 334 bits (855), Expect = 5e-90, Method: Composition-based stats. Identities = 141/371 (38%), Positives = 204/371 (54%), Gaps = 13/371 (3%) Query: 5 SLLDYISVTPDIRQQG-KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 SL+ D R++ H +L + + AV++ D ++I +G E+ +WL+++ Sbjct: 8 SLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLV 67 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 NG+ ++T R+ +D FE F W+ G + +DGKT+RGS + Sbjct: 68 LLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGSGS--GGE 124 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 AIHMVSAF+ E GVVLGQ K +KSNEITAIPELL LY+ L+TIDAMGCQK+IA + Sbjct: 125 SAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQ 184 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNV 243 I D+ DYLLAVKGNQ L A E +F ++ + + D SHGR ++ V Sbjct: 185 ITDQGGDYLLAVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVAQIASVLPA 243 Query: 244 TRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW 303 + +W KK+ S R+ +S + RYYISS+++ A++ A A+RAHW Sbjct: 244 EGIVDL---ADWPECKKIARVDSLRKVGNHESK--LERRYYISSRELTAEQLAAAVRAHW 298 Query: 304 LIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR---DCKDIKGEEEKKEGC 360 IE+ LHWVLDV EDAS IR+GNA + +S +KK+ LNL+R K K++ Sbjct: 299 GIENRLHWVLDVSFGEDASTIRKGNAPQNLSLLKKIVLNLIRLDTADKTKTSLRLKRKCA 358 Query: 361 VKHRERSSEVH 371 + + Sbjct: 359 AWTDDVRMRIL 369 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 333 bits (854), Expect = 5e-90, Method: Composition-based stats. Identities = 136/380 (35%), Positives = 207/380 (54%), Gaps = 13/380 (3%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 + S+L+Y + D R+ KH L +L + V AVIAGAD + I + +EWLK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE-----ITDGEIIAIDGKTIRGS 116 + +G+P DTI R+++ + AF++ F EW+ + EIIAIDGKT+R S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 D+GK G + + SA++ GV LGQ+ KSNEI PEL+ + ++K ++T+DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN---VFSNYKGDSFSTQEISHGRK 233 Q+D+A KI K DY+LA+K NQ +LH + F+ K + + HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAK 293 + R + + +W+GLK + VA+ Q+ E RYYISS DAK Sbjct: 249 DKRFYYQVKLPD--EVPAGEDWRGLKTIGVAIRISQENGR---ETCDTRYYISSLKPDAK 303 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGE 353 +FA A+R HW IE+SLHW LDV ED SR+R AAE ++ +K++A++L++ K + Sbjct: 304 QFAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKSKESV 363 Query: 354 EEKKEGCVKHRERSSEVHFL 373 ++ + +E+ L Sbjct: 364 VMRRRMAGWNVNFLAEILGL 383 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 332 bits (850), Expect = 1e-89, Method: Composition-based stats. Identities = 136/380 (35%), Positives = 194/380 (51%), Gaps = 13/380 (3%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+ +SLLDY+ PD R Q K H LS ++F+ +CA++ G D W EI F ER W ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-GEIIAIDGKTIRGSFDK 119 + GIP DT R+ + + + + +F W+ + +A+DGK +R + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 G+ A+HMV+ +S E G+ +GQ K KSNEITAIPELL LL LK L++IDAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYK----GDSFSTQEISHGRKET 235 IA I K DYLLAVK NQ L+ +E+F N + HGRKE Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 R V V +WK K + + R + +R+YISS+ +DA Sbjct: 240 RRCWVLMVDESM--PVCQQWKA-KTIIAVQAERIENGKGY---DFVRFYISSRALDATSA 293 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK-GEE 354 A RAHW +E+ LHW LD+ ED + R G A E ++ I++ LN+L+ K Sbjct: 294 LKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKSRNLSMA 353 Query: 355 EKKEGCVKHRERSSEVHFLY 374 K+ C + + E L+ Sbjct: 354 NKRRLCCLNEQYLFECMGLF 373 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 330 bits (845), Expect = 6e-89, Method: Composition-based stats. Identities = 139/371 (37%), Positives = 206/371 (55%), Gaps = 9/371 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 S + + D R KH L ++FLTV A+++GA+ W++I+ FG +L+WL+K+ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 G+PVDDTIAR++S+++ A FI W+ E E +IA DGKT+R SFD G RK Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFD-GDRKT 120 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H VSA++ E G+VL Q K++ K NE++ + EL+ LL LK +++T DAM C K +A I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRLHIVS 241 K DY+L VK NQGKL F + K +S + HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 +T + W +K + R K+ E YYISS +++ + A AIR+ Sbjct: 241 PIT--PWLTQSQGWTNIKPVIEVTRKRYLKDK---ETSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 HW IE++ HWVLD+ ED SRIRRG+A E ++ ++ A+NL R + K + Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLARLSPIKDSMKGKLKQAA 355 Query: 362 KHRERSSEVHF 372 E ++ F Sbjct: 356 WSDEVREKLLF 366 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 326 bits (835), Expect = 9e-88, Method: Composition-based stats. Identities = 137/376 (36%), Positives = 196/376 (52%), Gaps = 17/376 (4%) Query: 7 LDYISVTPDIRQQG-KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 + + D R+ H IL + + AV++ D ++I + + WL+++ Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE-----IIAIDGKTIRGSFDKG 120 NGIP ++T R++ +D FE MF W+ + IAIDGKT+RGS Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGSGS-- 118 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 + AIHMVSAF+ E G+VLGQ K AKSNEITAIPELL L +K L+TIDAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 A +I KK DYLL VKGNQ KL A E F D S E HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAFIDQHGV-ESVDRSSRVERGHGRTVGQIASV 237 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 + + +W + S R + +S + RYYISS+ + A++ A A+R Sbjct: 238 LSAKGIVDP---ADWPKCVTIGRIDSMRVVGDKQS--DLERRYYISSRALSAEQLAAAVR 292 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD---IKGEEEKK 357 AHW +E+ LHW+LDV +EDAS + + NA + +S ++K+AL ++R K K+ Sbjct: 293 AHWGVENRLHWILDVSFSEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKR 352 Query: 358 EGCVKHRERSSEVHFL 373 +G + + Sbjct: 353 KGAAWDDGVRERMLGI 368 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 326 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 128/367 (34%), Positives = 190/367 (51%), Gaps = 11/367 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWL--KKYG 62 +LL+ S PD R+ ++ L+ IL + VCA++ GAD W E+ D+ +R EWL + Sbjct: 2 TLLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRW 61 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 + G P DT + +D+ FE F +W++E + DG ++AIDGKT+RGS KG Sbjct: 62 PLEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSN 120 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 +HMV+A++ ++G+ L Q T K +E+ + LL++L LK ++T+DA+GCQ ++A Sbjct: 121 -ELLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAE 179 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKETRLHI 239 KI + DY+L VK NQ L A E F + + + F E HGR ETR + Sbjct: 180 KIVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYT 239 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM-DAKEFAHA 298 N WK L + + S RQ + S RY I S + + FA A Sbjct: 240 WINDVTWMDRPMRAAWKKLGGVGMIESIRQIGDKVSV---DQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 R+HW IE+ LHW LDV ED R R GN+A +S ++K L LR + K ++ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 GCVKHRE 365 E Sbjct: 357 LHADRNE 363 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 321 bits (822), Expect = 3e-86, Method: Composition-based stats. Identities = 125/369 (33%), Positives = 189/369 (51%), Gaps = 8/369 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 S+ + D R KH I+FL V AVI+GA+ W EI+ FG L+WL+KY F Sbjct: 2 SVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPF 60 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 + GIPVDDTIARV+ I+ AF ++F+ ++ E E+IAIDGKT+R SF+ + + Sbjct: 61 ECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNP-ETQS 119 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H V+ +S G++L Q K+ K NE A+ E+++ LK +IT+DAM QK IA KI Sbjct: 120 ALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKIAEKI 179 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPV-NVFSNYKGDSFSTQEISHGRKETRLHIVSNV 243 +KK DY++ +K N + E F + +++ R + R + V Sbjct: 180 IEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYRKLKV 239 Query: 244 TRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW 303 + + EWKG+K + R E +YISS D+D + A +R HW Sbjct: 240 SD--WLSKAEEWKGIKSVLEVCRKRSDNGK---ESQEKVFYISSLDVDIQILAKCVRGHW 294 Query: 304 LIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKH 363 +E+ HWVLDV ED + AE ++ ++++ALNL R + + K Sbjct: 295 EVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHPKKQSMKGKLTAAGWS 354 Query: 364 RERSSEVHF 372 E E+ Sbjct: 355 DEFRDELLL 363 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 319 bits (816), Expect = 1e-85, Method: Composition-based stats. Identities = 133/379 (35%), Positives = 189/379 (49%), Gaps = 15/379 (3%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+IQ+ ++ PD R ++ + I+F+ + AVI GAD W EIE FG + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIR-----G 115 IP DT++R S +D FE+ F W+ + G ++AIDGK I Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKS 116 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 S K + ++MVSA+S NG+ LGQ K E KSNE AIPEL+ L L+ +ITIDA+G Sbjct: 117 SNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIG 176 Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF-PVNVFSNYKGDSFSTQEISHGRKE 234 CQK I I + KADY+L K N L + E + + + + HGR E Sbjct: 177 CQKSITKLIIENKADYILCAKDNHEALRNIIEFNLSEESRYYLCHAKRYFEENKGHGRSE 236 Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 R + + L F W G+K L + S R+ + + RYYISS + D Sbjct: 237 YRECVCISAKNLQ--YFLKGWTGIKTLAMINSIRKMGDK--EAVMETRYYISSLEPDPII 292 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 +IR HW +E++LHWVLD+ ED R + GNAA S I K+AL LL+ G Sbjct: 293 ILKSIRPHWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQSDIKLGMA 351 Query: 355 EKKEGCVKHRERSSEVHFL 373 K++ C + +V + Sbjct: 352 GKRKACGWDEKIRDKVIGI 370 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 319 bits (816), Expect = 2e-85, Method: Composition-based stats. Identities = 124/346 (35%), Positives = 185/346 (53%), Gaps = 4/346 (1%) Query: 22 VKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNI 81 V + L+ +L T+ +I A ++ EIE G E+L+WL+++ F++G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 DSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLG 141 D E F W++ G + AIDGKT+RGS GA+H+VSA+++E G+V+G Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGK 201 Q AKSNEITAIPELL+ L L ++TIDAMG QK IA+K+ DK ADY+LA+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKL 261 LH + F T I HGR E R V++ + + W GL + Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTC-IGHGRIEERTCQVADASAW-LTEQHSGWAGLASI 238 Query: 262 CVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDA 321 ++ R + R YISS D K +A R+HW +E++LHW LDV ED Sbjct: 239 AAVIATRT-DKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFREDE 297 Query: 322 SRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERS 367 R R+ +A ++ I+ A N+L+ + K+ ++ Sbjct: 298 CRTRKDHAPLSLAIIRHAAFNMLKREPSKMSIKRKRLKAAMNQAFR 343 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 317 bits (813), Expect = 3e-85, Method: Composition-based stats. Identities = 129/365 (35%), Positives = 205/365 (56%), Gaps = 14/365 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +Q LL+++ D RQQ KV+H L IL + + A +A AD+W E+ F + ++L+KY Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD---GEIIAIDGKTIRGSFDK 119 + NG P DT+ RV+ + ++++ +W + + +II IDGKT+R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 + H+VSA+S E+G LGQ KSNEITAIPELL + +K ++TIDAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS---NYKGDSFSTQEISHGRKETR 236 IA KI++K+ADY+L++K NQG L+ E F F +G TQE +HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFA 296 + ++ + + WKGLK + + R+ E + + RY+ISS + + + Sbjct: 239 EYY--QTEKIKWLSQKKAWKGLKSIIM---ERKTLEKEGKRLIEYRYFISSLKEEIETVS 293 Query: 297 HAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEK 356 A+R HW IE S+HW LDV EDA+ AA+ ++ I+K +L++L+ + + + Sbjct: 294 RAVRGHWSIE-SMHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 357 KEGCV 361 ++ Sbjct: 353 RKKRY 357 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 316 bits (808), Expect = 1e-84, Method: Composition-based stats. Identities = 133/384 (34%), Positives = 202/384 (52%), Gaps = 18/384 (4%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q+ L+ ++ D R + ++L IL ++ AVI D + E+ F + ++L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECH------EITDGEIIAIDGKTIRGSF 117 F +G P DT +V+S +D + F WM E + + G +AIDGKTI S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRSG 122 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 + A H+++AF++ +VLGQ+KT+ KSNEITAIPELL L +K ++TIDAMG Q Sbjct: 123 S--AEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN------YKGDSFSTQEISHG 231 K+IA+KI +K DY+LAVKGNQ KL + KG T E HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 RKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSK-DM 290 R E R +SN L++ + +W+G+ + + R Y+I S + Sbjct: 241 RIEKRECYLSN--DLSWFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEA 298 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 AK+ R HW IE++LHW+LD+ ED R R NAAE+++ ++K+AL +L+ C Sbjct: 299 QAKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTC 358 Query: 351 K-GEEEKKEGCVKHRERSSEVHFL 373 K G K++ C + +V L Sbjct: 359 KCGMRSKRKLCGLGIPTALQVLGL 382 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 314 bits (805), Expect = 2e-84, Method: Composition-based stats. Identities = 134/387 (34%), Positives = 191/387 (49%), Gaps = 23/387 (5%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + D + + D R KH+ S I+ + + AVI GAD W IEDFG + + Sbjct: 14 LHEFADSLILI-DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKL 72 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF----- 117 NGIP DT R S +D L FE+ + +W+Q + G IAIDGKTIRG++ Sbjct: 73 SNFNGIPSHDTFNRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQD 131 Query: 118 ----------DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKN 167 D K +H++SAF+ E GV LGQ+ T+ K NEI IPELL++L +K Sbjct: 132 KRHRKQGVLPDSNTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDC 191 Query: 168 LITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV--NVFSNYKGDSFST 225 +ITIDA+GCQ+ IA K+ + DY+ VK NQ KL + + + D + T Sbjct: 192 IITIDALGCQRTIAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYET 251 Query: 226 QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 E HGR E+R+ N D +WK ++ + R + V R +I Sbjct: 252 HEEGHGRNESRICYCCNDPGFLGADIRKKWKNIQSFGYIENTRNTNKGT---TVEKRCFI 308 Query: 286 SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 SS + DA++ R HW IE++LHW LDV +ED +R RR +A S + K+AL LR Sbjct: 309 SSLEPDAQKILKNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLR 367 Query: 346 DCKDIKGEEEKKEGCVKHRERSSEVHF 372 + K K+ E E+ Sbjct: 368 NNKREIPINRKRLIAGWDNEFLWELIL 394 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 314 bits (803), Expect = 5e-84, Method: Composition-based stats. Identities = 106/374 (28%), Positives = 178/374 (47%), Gaps = 10/374 (2%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +++L + D RQ GKV+H++ +L + C+ + + + ++ DF +L WL+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 +G P D V+ I A ++ W +G IAIDGK +RG+ + Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGW----CGDLEGRHIAIDGKALRGTHNAETG 116 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 + +H++ A+ ++ + GQ+ KSNEI AIP LL L LK +TIDAMG Q IA Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFS---TQEISHGRKETRLHI 239 +I ADY+LA+K N + H + F + T E+SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 ++ L++ ++W GL+ + Q+ D + Y++ S D + A + Sbjct: 237 IT--EELDWYHKSWKWAGLQSVAQVRRQVQRSHD-GPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 R HW +E+ HWVLDV NED ++R NAA ++ +++M + L K++ Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHPAKVSLRRKRKL 353 Query: 360 CVKHRERSSEVHFL 373 ++ L Sbjct: 354 ATMDPAFRLQMLGL 367 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 314 bits (803), Expect = 5e-84, Method: Composition-based stats. Identities = 128/359 (35%), Positives = 197/359 (54%), Gaps = 9/359 (2%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q L D R G+ + L IL +T+CA+I G D W+ I DFG +R WL ++ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 G+P T ARV S I+ F+ WM + ++ ++I +DGK++ GS +GK + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 A H+V+A+ + V LG+V+ KSNEI AIP LLN L ++ +I+IDAMG QK IA+ Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEI---SHGRKETRLHIV 240 I+ K+ADY+LA+K N + + E F + +Y+G + T+E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD-MDAKEFAHAI 299 + + F ++ W+ L+ + S R + + RYYI+S + + + AI Sbjct: 254 LPM--MYFHKYKKYWRDLQAIVRVQSKR---HKGNEIETATRYYITSLPFAEHRRMSQAI 308 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 R HW IE+ LHW LD+ + EDAS I RG A + ++ ++KM L +L + K K Sbjct: 309 RQHWAIENQLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKR 367 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 312 bits (799), Expect = 1e-83, Method: Composition-based stats. Identities = 113/369 (30%), Positives = 198/369 (53%), Gaps = 7/369 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L++++++ + R + KH L ++FL + A+++GA+ W +IE +G +++WL+++ F Sbjct: 8 TLIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPF 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NGIP T+AR++ I + + W+ E IIA DGK +RGSF +G K Sbjct: 68 ANGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKD 126 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+ +V+A+ ENG+VL Q T K EI + ++L++L LK ++T+DA+ CQ++ KI Sbjct: 127 ALQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKI 186 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 +KKA ++ VK NQ KL+ A + +F + + +E HGR+E R V + Sbjct: 187 SEKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEER--YVFQLK 244 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 + +W ++ + R + + YY+SS K H IR HW Sbjct: 245 AKLPPELTEKWPTIRSIIAVERHRSANGKGTVDTS---YYVSSLSPKHKLLGHYIRQHWR 301 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK-EGCVKH 363 IE+S H++LDV NEDASRI +A E ++ ++ LN+++ + K + + Sbjct: 302 IENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGWN 361 Query: 364 RERSSEVHF 372 + +++ F Sbjct: 362 DDYRAQLFF 370 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 312 bits (798), Expect = 2e-83, Method: Composition-based stats. Identities = 123/356 (34%), Positives = 181/356 (50%), Gaps = 7/356 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL D++S+ D R +H L +LFL + AV +G D W EI+ FG +LEWL+K+ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 NGIP TIAR++ + + W+ + + IIAIDGKT+RG+ G Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLGC--N 119 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 +H V AF NG+ L Q K EI + L+ +L + K LIT+DA+ Q+ I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 +K DY++ VK NQ L A + ++ V + + F+ E HGR E R+ + Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQRI--TFQIP 237 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 + +W +K L R+ S E +Y+SS D+D + A A+R HW Sbjct: 238 SKLSPKLQEKWPSVKTLIAVERHRKIGNKTSIETS---FYLSSHDIDPEYIATAVRGHWR 294 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 IE+SLHWVLDV EDA R+ AE ++ +++MALNL + K + K Sbjct: 295 IENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHR 350 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 309 bits (790), Expect = 2e-82, Method: Composition-based stats. Identities = 111/381 (29%), Positives = 178/381 (46%), Gaps = 19/381 (4%) Query: 1 MSIQSLLDYISVTPDIRQQ-GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLK 59 M++ L + PD R + H L+ IL + CAVIAGA+ W++I ++G + + + Sbjct: 1 MALP-LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFR 59 Query: 60 KYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI--------TDGEIIAIDGK 111 ++ + NG+P DT RV + +D AF F W E E +A+DGK Sbjct: 60 RFLELKNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGK 119 Query: 112 TIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITI 171 + R S +H+V + + ++LGQ +EIT ++L L L ++T+ Sbjct: 120 SARRSAKPTFSGC-LHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTL 178 Query: 172 DAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG-DSFSTQEISH 230 DA GCQ + I+ + +Y++ VKGNQ L A F + + G D ++ +H Sbjct: 179 DAAGCQTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAH 238 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 GR E R V + W G+ + + RQ K + YY+SS + Sbjct: 239 GRHEERNVTVVHDPDGLPAG----WAGVGSVALVCRDRQVKGKANEST--AHYYLSSLRV 292 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 A E A IR HW IE S+HWVLDV ED SR R G+A + I+++A++LL+ Sbjct: 293 GAAELAGYIRGHWHIE-SMHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAGKK 351 Query: 351 KGEEEKKEGCVKHRERSSEVH 371 ++ + ++V Sbjct: 352 GSIHTRRLRAGWDDQYMAQVL 372 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 306 bits (783), Expect = 1e-81, Method: Composition-based stats. Identities = 115/376 (30%), Positives = 186/376 (49%), Gaps = 14/376 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++ LL++ S D R + ++ H L IL L VC +A D+++ I +G L +L+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 + +G+P + + +++ ID F F W++ + +AIDGKT R S D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPGRA-DFVAIDGKTSRRSHDRRAG 130 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY----LKKNLITIDAMGCQK 178 IH+VSAF+ + +VL Q K+NE+ AIP LL+ L L L++IDA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLH 238 IA+ I+ + ADYLLAVK NQ L E F D + HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAF----AVGDGADHHHDLDKGHGRVEERHV 246 Query: 239 IVSNVTRLNFCDFEFEWKG---LKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 V + +++ + G L + + RY+ISS + A+ Sbjct: 247 SV--IREVDWLSGTRRFPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHA 304 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 A A+R HW IE+ LHWVLDV +D SR+R G+ A+ ++ ++ ALNL+R D K + Sbjct: 305 ADAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQKSLKT 364 Query: 356 KKEGCVKHRERSSEVH 371 +++ + + + Sbjct: 365 RRKMAGWSDDYLASLL 380 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 302 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 123/365 (33%), Positives = 187/365 (51%), Gaps = 19/365 (5%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + ++++++ D R+ K+KH LS I+ L A ++GA+ W EIE FG LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT---------DGEIIAIDGKTI 113 +NGIP DT+ RV + +D ++ W E ++AIDGKTI Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDA 173 RG + ++ A+H+V+A++ + G+ GQV T KSNEITAIPELL+++ +K +++IDA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRK 233 MG QK IA KI KKADY LAVK NQ L F ++ ++ D + T E +HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFEMSQEAD---DHYHTVEKAHGQI 240 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAK 293 ETR + V + + R + E RY+I S + AK Sbjct: 241 ETRAYEVIHDVSWLRKTH----PEFGHIQSIGRARIHLDKNGQESEESRYFILSCQVSAK 296 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGE 353 E +R HW IE S+HW+LDV EDA++ A ++ + K L +L+ K Sbjct: 297 ELCDYVRGHWQIE-SMHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLKQLDFGKKM 355 Query: 354 EEKKE 358 +++ Sbjct: 356 SMRRK 360 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 107/377 (28%), Positives = 174/377 (46%), Gaps = 22/377 (5%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 + + PD R G H L+ ILF+ + A + GA ++ F + + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI----TDGEIIAIDGKTIRGSF 117 NG+P DT +RV +D AFEK F +M+ + +IA+DGK +R + Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 + G+ MV+A++ + + L V+ NE +L+ LL LK ++T DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL 237 + +A IK + DY+LAVK NQ L + S T + HGRKE R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQGK--PSTITVDAGHGRKEKRR 239 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 +V+ V ++ + ++ GLK + S R + RY++ S+ K+ Sbjct: 240 AVVAAVPQMAQ---DHDFAGLKAVARITSKR------GTDKTVERYFLMSQAYPPKDVLR 290 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 +R HW IE+SLHW LDV ++ED +R R+ NA ++ ++++ALN+ R D K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 EGCVKHRERSSEVHFLY 374 + FL+ Sbjct: 351 L-----KRAGWNDTFLF 362 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 298 bits (762), Expect = 2e-79, Method: Composition-based stats. Identities = 105/362 (29%), Positives = 177/362 (48%), Gaps = 9/362 (2%) Query: 10 ISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIP 69 PD R +H L +L + + A I GA+ + F +R +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMV 129 DT +RV +D +AF + F +++ + ++AIDGKT+R SFD+ + A+H+V Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKA 189 SAF++ +++GQ A NEI A LL L LK L+T DA+ Q+ A I ++ Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNF- 248 D+L +K N+ L E F + T + HGR E R H VS+ Sbjct: 188 DWLFPLKDNRPALRAEVERYF--ADPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWLAS 245 Query: 249 ---CDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 E GLK L + + ++ ++ Y+SS ++ K A A+RAHW I Sbjct: 246 DRRFPDEAVLPGLKILGLVERTVTSPDGRTTATRTL--YLSSAALEPKTLARAVRAHWSI 303 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 E ++HWVLD +ED +R R+ + E ++ ++K+ALN++R + +++ + Sbjct: 304 EAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANNQDSIRLRRKRAGWSDD 363 Query: 366 RS 367 + Sbjct: 364 YA 365 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 296 bits (756), Expect = 1e-78, Method: Composition-based stats. Identities = 110/367 (29%), Positives = 176/367 (47%), Gaps = 6/367 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 +L ++ D R +H + I FL + AVI+GA W +FG LEWL+KY F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGA 125 NGIP +I R+ + + + W+ E T IAIDGK ++G A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKG-AKASASSAA 119 Query: 126 IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 +HMV+A+ +G+V +K +E+ + ELL L LK L+T DA+ CQ I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTR 245 + D +L VKGNQ KL+ A + +F + +N + F+ HGR E R+ + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLN- 238 Query: 246 LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 + + +W LK L R+ S + +Y+SS + ++ F AIRAHW Sbjct: 239 -LPAEIKMKWSQLKTLIAVERHRKVGNKTS---IDTHFYVSSAVLTSEAFGRAIRAHWQT 294 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 E++ HW+LD ED ++ + A I++ +++ ALNL++ + +K + Sbjct: 295 ENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHPAKTSQTQKFNRACWSDD 354 Query: 366 RSSEVHF 372 E+ F Sbjct: 355 FREEIIF 361 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 290 bits (742), Expect = 5e-77, Method: Composition-based stats. Identities = 108/374 (28%), Positives = 174/374 (46%), Gaps = 18/374 (4%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 +++ L PD R V+H L +L + +V+ G+ E+ FG + + + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT-DGEIIAIDGKTIRGSFDK 119 + + IP DT + V ID A + F + + + ++ DG+IIAIDGK +RG+ D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 G+ MVSA+++ + L V + + E++A E L L+ L+ ++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHI 239 + I D+ LA+KGNQ L F S+ + T+ HGRKETR + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSD---PTAVTENTGHGRKETRKAV 244 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 V + L E+ GLK + R+ RY+ S + A+ Sbjct: 245 VVSAKALAEYH---EFPGLKGFGRIEATRETGGK---VTSETRYFALSWVPTPEVLLAAV 298 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 R HW IE++LHW LDV EDA+R R+ N I+ +++ AL++LR KG K Sbjct: 299 RDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIK-- 355 Query: 360 CVKHRERSSEVHFL 373 + + FL Sbjct: 356 ---IKRAGWDTTFL 366 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 289 bits (740), Expect = 8e-77, Method: Composition-based stats. Identities = 110/301 (36%), Positives = 152/301 (50%), Gaps = 7/301 (2%) Query: 8 DYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG 67 + PD R + + +H LS +L + VCAV+ GA+++ ++ +G L WL+K+ G Sbjct: 11 EVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKFLKLKAG 70 Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE-IIAIDGKTIRGSFDKGKRKGAI 126 +P DT RV++ ID AFE F+ W+ + ++AIDGKT R S K G + Sbjct: 71 VPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD-TSGPL 129 Query: 127 HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKD 186 HMVSAF+ G+VLGQ T+ KSNEITAIPELL +L L+ ++TIDAMG Q IA I+ Sbjct: 130 HMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAIARTIRS 189 Query: 187 KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRL 246 + ADY+L VK N L + F Q HGR E R + Sbjct: 190 RGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWAYDAVS- 248 Query: 247 NFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIE 306 +W GL+ + R S YYISS DA A A+R+HW +E Sbjct: 249 -QLYKSEQWAGLQSFALVERERTVDGKTSV---ERHYYISSLPADAARIAQAVRSHWAVE 304 Query: 307 H 307 Sbjct: 305 S 305 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 286 bits (732), Expect = 7e-76, Method: Composition-based stats. Identities = 124/380 (32%), Positives = 193/380 (50%), Gaps = 22/380 (5%) Query: 3 IQSLLDYISVTPD------IRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLE 56 + +++D+I D RQ K+++ LS ILFL +AG + +E+EDF Sbjct: 1 MTTMIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEP 60 Query: 57 WLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-GEIIAIDGKTIRG 115 Y D G P DT+ RV+S ++S +++ +++ Q + ++I++DGKTIRG Sbjct: 61 LFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG 120 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 ++GK + +H+V+A+ + + LGQV E KSNEI AIP+LL + ++K+++TIDAMG Sbjct: 121 --NRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMG 178 Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV---NVFSNYKGDSFSTQEISHGR 232 Q I I KADY LAVKGNQ L+ F + T E S G+ Sbjct: 179 TQTAIVDTIIKGKADYCLAVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQ 238 Query: 233 KETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 E R + VS+ + C +W L+ + + R + RY+I S D Sbjct: 239 IEVREYWVSSDIKW-LCQNHPKWHKLRGIGM---TRNTIDKDGQLSQENRYFIFSFKPDV 294 Query: 293 KEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKG 352 FA+ +R HW IE S+HW+LDV +ED + AA ++ I+KM L L+ Sbjct: 295 LTFANCVRGHWQIE-SMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKV-----M 348 Query: 353 EEEKKEGCVKHRERSSEVHF 372 KK+ + ++R VH Sbjct: 349 VFPKKDLSYRRKQRYISVHL 368 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 284 bits (727), Expect = 3e-75, Method: Composition-based stats. Identities = 102/367 (27%), Positives = 163/367 (44%), Gaps = 12/367 (3%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 +L PD R +H L +L + +V+ GA E+ FG + + + Sbjct: 37 ILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT-DGEIIAIDGKTIRGSFDKGKRKG 124 + +P DT + V ID A + F + + + DG++IA+DGK +RG+ D G+ Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 MVSA++ + L V + + E+ A E L L+ LK ++T DA+ C + + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 D+ LA+K NQ L F + S +++I HGR ETR V + Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEPDA---HPSALSEDIGHGRTETRKATVVSSK 271 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 L E+ GLK + R+ E RY+ S + +RAHW Sbjct: 272 ALAE---HHEFPGLKAFGRVEATRKTAEGT---TSETRYFALSWVPTPEVLLATVRAHWA 325 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHR 364 IE+SLHW LDV EDA+R R+ N+ I+ +++ AL+++R K + Sbjct: 326 IENSLHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRDTSKGSLSIKLKRAGWDD 385 Query: 365 ERSSEVH 371 + V Sbjct: 386 DFLRNVL 392 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 284 bits (725), Expect = 5e-75, Method: Composition-based stats. Identities = 111/411 (27%), Positives = 173/411 (42%), Gaps = 49/411 (11%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + S+ + I D R++ KV + I+ +T+ V W +I DF + ++L+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-------------------- 102 P DT+ R I + E + EW + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 103 GEIIAIDGKTIRGSFDKGK--------------RKGAIHMVSAFSNENGVVLGQVKTEAK 148 IAIDGKTI G+ + K +H+VSAF ++ + LGQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNLLYLK-KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFE 207 NEI AIP+LL+ + ++ +++TIDA+G QK I KI +K+ADYLL VK N KL E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPVNVFSNYKGDSFSTQE---ISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVA 264 + S + D E HG TR I + +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGF-CYRDWKNLRTYGII 315 Query: 265 LSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRI 324 + +ISS + + R HW +E+ LHW LDV NED R Sbjct: 316 -KTEKINIATGEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAEIISGIKKMALNLLR--DCKDIKGEEEKKEGCVKHRERSSEVHFL 373 + N+A+ S + KMAL +L+ +D K +K ++ +L Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKR-----KKAGWSDEYL 419 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 284 bits (725), Expect = 5e-75, Method: Composition-based stats. Identities = 102/373 (27%), Positives = 169/373 (45%), Gaps = 16/373 (4%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 LD PD R +H L ILF+ + AV+ GA E+E F RL+ L+++ + Sbjct: 3 FLDVFGEVPDPRDL-TAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI----TDGEIIAIDGKTIRGSFDKGK 121 G P DT +RV++ +D +A + F+ +M E +A+DGK++R ++ KG+ Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 +V+ F + + L Q + E+ A L LL LK +T DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 ++D Y++A+KGNQ KL + T+E +HGR E R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKA-AAGKATKFHQTEEDAHGRHEVRRAFVI 239 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 + L + S+R + + + +R Y S+ M A E +R Sbjct: 240 P---FAQTPGKNALVDLCAIGRVESWRTVEGKTTHK---VRCYALSRKMPAHELLATVRR 293 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 HW IE+ LHW LDV + ED R R+ N A + ++++ LN+LR + K+ Sbjct: 294 HWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRADPEKIPLSHKRLKAR 353 Query: 362 KHRERSSEVHFLY 374 + ++ L+ Sbjct: 354 WADQ---DLLSLF 363 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 279 bits (714), Expect = 9e-74, Method: Composition-based stats. Identities = 111/369 (30%), Positives = 182/369 (49%), Gaps = 8/369 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL ++ + D R +KH L ++FLT+ A+++GA W+ IE FG +L+WL+ Y F Sbjct: 2 TLLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPF 61 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 ++GIP IA ++ ++DS + W+ + T IIA+DGKT+R ++ Sbjct: 62 EHGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWA-DDIHQ 120 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 A+H+VSAF NG+ L E K +E ++++ L L ++T+DA+ CQK KI Sbjct: 121 ALHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKI 180 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 KK+D+++ +KGNQ A + + + HGRKE R V + Sbjct: 181 ISKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRR--VMQIE 237 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 + +W ++ L S R S R+Y+SS +D + A IRAHW Sbjct: 238 GNLPPELSEKWPHIRTLVEVASERTVGNKT---ACSSRWYVSSLPVDTAQLADIIRAHWA 294 Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDC-KDIKGEEEKKEGCVKH 363 IE+ LHWVLDV ED + + A+ ++ + AL++++ K++ Sbjct: 295 IENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQSAAWD 354 Query: 364 RERSSEVHF 372 SE+ F Sbjct: 355 PAFRSELLF 363 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 277 bits (708), Expect = 5e-73, Method: Composition-based stats. Identities = 120/374 (32%), Positives = 180/374 (48%), Gaps = 17/374 (4%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 SL + + + P R + K + L +L + + ++G W EIED+ E E LK + Sbjct: 3 HSLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYE 62 Query: 64 FDNG------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 G +P DT+ R +S +D AFE + W++ T G+ I IDGKT+RG Sbjct: 63 MLTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-V 121 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 K H+VSAFS ++ L Q+ + K+NEI AI +LL+LL L +++IDA+G Q Sbjct: 122 KKLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQ 181 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL 237 I +I DK DY+L VK NQ E F + D E+SHGR ETR Sbjct: 182 TAIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRR 239 Query: 238 H-IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFA 296 + + N + + KGL+ + + R+ + + YYISS D Sbjct: 240 YESILNPLEIEANEVLTRRKGLRSIHKVVRKRR-DKKSDKTSEEVAYYISSLT-DVSSLK 297 Query: 297 HAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEK 356 AIR HW IE+ LH LDV DAS R N A+I+ I+K+ L ++ K Sbjct: 298 QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT-----NM 352 Query: 357 KEGCVKHRERSSEV 370 K + +++ + + Sbjct: 353 KSSIPRIQKKPARM 366 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 274 bits (699), Expect = 5e-72, Method: Composition-based stats. Identities = 108/335 (32%), Positives = 161/335 (48%), Gaps = 5/335 (1%) Query: 38 IAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC 97 +A A+ W++IE +G + WL+ + NGIP DT RV +D+ AFE+ F +Q Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPE 157 E++A+DGK++R S G +H+VS +++ G+ LGQ + KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV-FS 216 LL L L ++T+DAMGCQ IA +I+ K AD LL +K N G + A F S Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 217 NYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSA 276 G HGR R V W L ++ + R Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFVDAAATALAP--LSGWPDLSRVLAVETLRGIPG-TGT 240 Query: 277 EGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGI 336 IRY+++S D IR HW +E++LHWVL+V ED SR+R AA + + Sbjct: 241 VVADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNFALV 300 Query: 337 KKMALNLLRDC-KDIKGEEEKKEGCVKHRERSSEV 370 +K+ALNL+ +++ + ++ Sbjct: 301 RKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQI 335 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. Identities = 99/373 (26%), Positives = 167/373 (44%), Gaps = 22/373 (5%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 I LL+ ++ PD R V+H L+A+L LT CAV+AGA + ++ E E L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFD-------NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-EIIAIDGKTI 113 P + TI RV++ ID+ A ++ W+ + G +A+DGK++ Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKNLITID 172 RG+ R+ +H+++A + G+VL Q+ K+NEIT LL+ L L ++T D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGR 232 A+ Q D A+ ++ + Y++ VK N KL + + + T+ HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKSLPWQQIPLQDR-----TRTTGHGR 270 Query: 233 KETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 E R V V L F + +++ + S + + ++++ Sbjct: 271 CEIRRLKVCTVNNLLFPGARQAVQIVRR-----RVNRTTGKVSLKTIYAVTSLAAEQAPP 325 Query: 293 KEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKG 352 A IR HW +E LH V DV EDAS++R GNA + ++ + +A+ LR Sbjct: 326 ARVAQLIRGHWTVEA-LHHVRDVTFAEDASQLRSGNAPQAMATYRNLAIGALRLAGVRNI 384 Query: 353 EEEKKEGCVKHRE 365 + Sbjct: 385 AAGLRRTARDQTR 397 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 265 bits (676), Expect = 2e-69, Method: Composition-based stats. Identities = 103/368 (27%), Positives = 167/368 (45%), Gaps = 16/368 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHE-RLEWLKKY 61 + SL++ + D R+ +H L +L + + + G ++E+ +F R +++ Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWM-QECHEITDGEIIAIDGKTIRGSFDK- 119 +P TI RV+ ++ KMF EW +E + D + +DGK+++ + Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -GKRKGAIHMVSAFSNENGVVLGQVKTEAK-SNEITAIPELLNLLYLKKNLITIDAMGCQ 177 +++ I VS FS E+G+VL + E K +EI ++ L+ + T DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL 237 K S I K DY++ VKGNQ L+ ++ + + F Q+ SHGRK +R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDLSNSSKPES----CFLEQDNSHGRKISRK 236 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 V V + E +G + L + +K YYISS A+ FA Sbjct: 237 IEVFKVR-------KNERQGFENLRRVIKVERKGSRGDKTYEETAYYISSLTESAQVFAK 289 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 IR HW IE+ LHWV DV ED S I AA S + + LNL R + E ++ Sbjct: 290 IIRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFRGLGFLSITEGQR 349 Query: 358 EGCVKHRE 365 + + Sbjct: 350 WLAERWEK 357 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 264 bits (674), Expect = 4e-69, Method: Composition-based stats. Identities = 104/384 (27%), Positives = 160/384 (41%), Gaps = 48/384 (12%) Query: 30 LFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKM 89 + +T+ V W +I DF + ++L+++ P DT+ R I + E Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FIEWMQECHEITD--------------------GEIIAIDGKTIRGSFDKGK-------- 121 + EW + IAIDGKTI G+ + K Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLK-KNLITIDAM 174 +H+VSAF ++ + LGQ + K NEI AIP+LL+ + ++ +++TIDA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEI---SHG 231 G QK I KI +K+ADYLL VK N KL E + S + D E HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 RKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD 291 TR I + +WK L+ + + +ISS + Sbjct: 241 FMVTRTCISCSEPSRLGF-CYRDWKNLRTYGII-KTEKINIATGEIQNEKHCFISSLVNN 298 Query: 292 AKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR--DCKD 349 + R HW +E+ LHW LDV NED R + N+A+ S + KMAL +L+ +D Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 IKGEEEKKEGCVKHRERSSEVHFL 373 K +K ++ +L Sbjct: 358 KKTSVNRKR-----KKAGWSDEYL 376 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 259 bits (662), Expect = 9e-68, Method: Composition-based stats. Identities = 92/364 (25%), Positives = 169/364 (46%), Gaps = 18/364 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + LL + PD R + +++L +++ + +CAV AGA + I D+ + + Sbjct: 43 MPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQRC 102 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE-IIAIDGKTIRGSFDKGK 121 +P + TI +V +D A +++ + +A+DGKTIRG+ + Sbjct: 103 GIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RIG 160 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 ++ A H+V+A ++ + VVLGQ +T KSNEI + LL + + ++T+DAM QK A Sbjct: 161 KQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKATA 220 Query: 182 SKIKDK-KADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIV 240 ++++ +A+Y++ VK NQ L ++ V + E HGR+E R + + Sbjct: 221 RCLREQCRAEYVMIVKANQPGLLARVRDQPWEQVPVVWSDP----VERGHGREEHRSYKI 276 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSK---DMDAKEFAH 297 V R + +++ + R+ A + Y I S K A Sbjct: 277 LTVARGLRFPYA------QQVIQIIRRRRVLG-AGAWSTEVVYAICSLPCEQAPPKLLAS 329 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK 357 IR HW IE+ +H+V DV +ED S +R G+ ++++ ++ + + L R + Sbjct: 330 WIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHRRAGHSNIARACR 389 Query: 358 EGCV 361 Sbjct: 390 RLAA 393 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 114/367 (31%), Positives = 191/367 (52%), Gaps = 16/367 (4%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 + + I+V D R QG++ + L IL +++ A I+G D+W++IED+ + E L+ Sbjct: 5 IWNAIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKL 64 Query: 66 NG-------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFD 118 +G +P DT V ID F +++ +++ +E G+ IAIDGKT RG Sbjct: 65 SGKELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG-IK 123 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 + ++VSA+ ++ V+ + +E K +E+++I +L+ LL+L+ N +TIDA G Sbjct: 124 QTANSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYV 183 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLH 238 ++ I K +++L VKGNQ KL E++F + D + ++I HGR E R Sbjct: 184 EVIEMILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSAD--TQEDIGHGRVEKRTV 241 Query: 239 IVSNVTRLNF--CDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFA 296 + + +WKG+K L + KK DKS + YYI++ +D KE Sbjct: 242 YCITEIKTDDDIDGCMQKWKGVKTLVKIVREVYKKADKST-RIETVYYITNL-IDPKEIN 299 Query: 297 HAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKG--EE 354 AIRAHW IE++LH LDV +NED S+ N E + +AL ++++ +G Sbjct: 300 RAIRAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRGISMN 359 Query: 355 EKKEGCV 361 ++ C Sbjct: 360 RTRKLCG 366 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 257 bits (656), Expect = 5e-67, Method: Composition-based stats. Identities = 105/365 (28%), Positives = 166/365 (45%), Gaps = 13/365 (3%) Query: 7 LDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDN 66 L + D R +H L+ +LFL + A + GA EI +F R LK+ + Sbjct: 5 LSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-----GEIIAIDGKTIRGSFDKGK 121 G P DT +R+ ID + ++ + ++A+DGK +R ++KG+ Sbjct: 64 GCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGR 123 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 MVS + E + + + E S+E+ A LL + LK ++T DA+ C+ D A Sbjct: 124 AFMPPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCRPDTA 182 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 + +KA Y LA+K N+G+L E F + T+E HGR ETR V Sbjct: 183 KALIGRKAHYALALKANRGRLFACAEAGFVAADAAG-DLAFHETRETGHGRLETRRASVL 241 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 L + GLK + + RQ S+RY SK + + A +RA Sbjct: 242 P---LKAFKQAPAFPGLKAIGRIQATRQ--GADGRAVTSVRYIALSKVLAPHKLAEVVRA 296 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 HW IE+ LHW LDV +ED +R R+ NA + ++ I+++A ++L K K Sbjct: 297 HWTIENQLHWSLDVVFHEDDARSRKDNAPQNLAVIRRLARDILAAHPLDKPIASKMRRVN 356 Query: 362 KHRER 366 +R+ Sbjct: 357 WNRDF 361 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 256 bits (654), Expect = 9e-67, Method: Composition-based stats. Identities = 101/286 (35%), Positives = 155/286 (54%), Gaps = 9/286 (3%) Query: 9 YISVTPDIRQQ-GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG 67 + + PD R+ H LS IL + +CAV++G D+W+ + +FG + WL+++ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-EIIAIDGKTIRGSFDKGKRKGAI 126 IP DT RV S ID AFE F +W D + +A+DGKT+R S +G A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSH-RGSAGRAL 135 Query: 127 HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKD 186 H++ A+S E +++ Q + + KSNEITAIP++L+L L+ I+IDA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRL 246 DY+LA+KGNQ LH + +G E HGR ETR V++ + Sbjct: 196 AGGDYVLALKGNQSALHDDVRLFMETQADRHPQG-QAEAVEKDHGRIETRRIWVND--EI 252 Query: 247 NFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDA 292 ++ + +W GLK L + S R+ R +I+S D Sbjct: 253 DWLTQKPDWPGLKTLVMVESRRELNGQ---VSCERRCFITSHTADP 295 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 250 bits (637), Expect = 8e-65, Method: Composition-based stats. Identities = 101/349 (28%), Positives = 161/349 (46%), Gaps = 16/349 (4%) Query: 3 IQSLLDYISVTPDIRQQ--GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L ++ S PD R+ G ++HKLS I+ L + ++ EI +FG L+ +K Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-----EIIAIDGKTIRG 115 NGIP + T+ R+ ID A + + H+ G EII IDGK RG Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 + K R I VSA S + L E KSNEI A+P L++ + + ++T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKET 235 QKDI KI++K D+++ +K NQ L + E+K E+ HGR ET Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKELSPV---YSYCGEPELGHGRIET 269 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 R + V + + + +W G + K+ R ++SS + Sbjct: 270 RSYRVFD--GTDLIANKEKWNGNLTIIE-YECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 +R HW IE S+HW LD + +D + + AA + I+++ ++ Sbjct: 327 GTPVRNHWSIE-SMHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVF 374 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 249 bits (635), Expect = 1e-64, Method: Composition-based stats. Identities = 103/359 (28%), Positives = 171/359 (47%), Gaps = 17/359 (4%) Query: 3 IQSLLDYISVTPDIRQ--QGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L +++ PD R+ +G K+KL IL L + + +I FG L+ + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT---DGEIIAIDGKTIRGSF 117 G +G+P + T+ R+ +ID A + E+ H+ G+I+ IDGK +RG+ Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 + R I VSA+S E GV L E KSNEIT++P+LL+ + + ++T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRL 237 K I KI++K D+L+ +K NQ L + E+ + + + HGR ETR+ Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDV---YSEGPFLEHGRIETRV 252 Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 + + +W G + +++ + R+Y+SS A+ Sbjct: 253 CRIF--RGNDLITDREKWNGNLTVVEI-RTATERKSDGQKSSERRFYVSSFHGSARRLGT 309 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEK 356 R HW IE S+HW LD + +D R +A + I++M L +L KG+ +K Sbjct: 310 IARMHWAIE-SMHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAIL---SIWKGKRKK 364 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 248 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 105/360 (29%), Positives = 161/360 (44%), Gaps = 41/360 (11%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++S+ + + D RQ+ KV H+ I+ + V A W E+ DF ER+++++K+ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEW--------------------MQECHEITD 102 P DT+ R + A E+ + W + E E Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 GEIIAIDGKTIRGSFDKGKRK--------------GAIHMVSAFSNENGVVLGQVKTEAK 148 IAIDGKTI+ + ++ +R+ +H+VSAFS ++ + LGQ + + K Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNLLYL-KKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAF- 206 NEI AIP LL+ L + + +++TIDAMG QKDI S+I K+A YLL VK NQ L Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVA 264 F N E HG R V + + +W+ L+ + Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLH-SLGKIYKDWENLRSYGLI 315 Query: 265 LSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRI 324 + E V Y+ISS + D ++ R HW IE+ LHW LD+ ED R+ Sbjct: 316 -RTERVDEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 247 bits (630), Expect = 5e-64, Method: Composition-based stats. Identities = 94/246 (38%), Positives = 141/246 (57%), Gaps = 3/246 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L+D+ D R + HKL I+ + +CA+I GAD + +E +G+ + EWLK++ + Sbjct: 8 TLIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLEL 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +NGIP DT ARV + ID FE+ F +W+ E+ G+++ IDGKT++ S +K + K Sbjct: 68 ENGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKK 127 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 AIH+V+A+++E +VL Q K ++ EITAIP L+ +L L L+TIDAMG Q DIA + Sbjct: 128 AIHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELL 187 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKETRLHIVS 241 K ADY LA+KGNQ L +E F + + + T E R E + Sbjct: 188 HSKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAYRT 247 Query: 242 NVTRLN 247 RL Sbjct: 248 EQERLW 253 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 240 bits (613), Expect = 5e-62, Method: Composition-based stats. Identities = 80/376 (21%), Positives = 138/376 (36%), Gaps = 31/376 (8%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++ LL+ + PD R++ V+ L +L L + AV GA + EI + + L Sbjct: 32 VEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAAF 91 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG--------------EIIAI 108 P T RV+ D A ++ W Q +I+ Sbjct: 92 GLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVISA 151 Query: 109 DGKTIRGSFDKGKRKGAI--HMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL---- 162 DGKT+RG+ + +V + +G V+ + +EI A+ ++ L Sbjct: 152 DGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVA-CEPVNDGDEIGAVRTVMGRLADRW 210 Query: 163 -YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGD 221 L ++ DA Q + ++ +LL VK NQ ++ V + Sbjct: 211 GSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRILAKVRALPWAQVRAQD--- 267 Query: 222 SFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSI 281 + + +HGR ETR V + +K + R +A Sbjct: 268 --TCRGKAHGRAETRTVRVVQAPTHVDLALAGTAQVIK-ITRHTRRRPHPGAPAASTREN 324 Query: 282 RYYISSKD---MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKK 338 Y ++S D A +R+HWLIE+ +HWV D +ED R GN ++ ++ Sbjct: 325 AYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPINLACLRN 384 Query: 339 MALNLLRDCKDIKGEE 354 A+ R + Sbjct: 385 TAITRHRAHGASNIAK 400 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. Identities = 93/375 (24%), Positives = 156/375 (41%), Gaps = 21/375 (5%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEI----EDFGHERLEW 57 I LL + D R+ + LS +L + A +AGA +EI DFG + L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKKYGDFDNG---IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE--IIAIDGKT 112 L D G P + I + +D A + F W+ GE ++A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 IRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKNLI-T 170 +RG++ +G ++ + +SA + G+V GQV+ +NEIT + LL L + ++ T Sbjct: 141 LRGAWSEGNKRVTL--LSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 IDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISH 230 +DA+ Q + A + + DY L VKGNQ L+ + F + K +E H Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLY---RKTFEQTLPLLQKPPQHEVEERGH 255 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 GR + + + F + + F K S E I ++ Sbjct: 256 GRIKKWQAWTTEAKGIGFPEVAT-----AAVIRRDEFDLKGIRVSREYAHILTSVAGNRA 310 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 A IR HW IE+ +H+ D EDA++ GN+ ++ + +A+ ++R Sbjct: 311 TAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGIIRRNGIR 370 Query: 351 KGEEEKKEGCVKHRE 365 K +E + Sbjct: 371 KIKETLEYIAGDRDR 385 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. Identities = 92/249 (36%), Positives = 143/249 (57%), Gaps = 6/249 (2%) Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 ++H+V+A+ +++ ++LGQVK + KSNEITAIP+LL +L+L+ ++TIDAMGCQK IA + Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKF-PVNVFSNYKGDSFSTQEISHGRKETRLHIVSN 242 I KKADY+LAVK NQ +L+ + F V ++ + T + HGR ETR + S Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREY--ST 118 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 + + W L + + S R+ RY+I S + A+ F A+R H Sbjct: 119 IVGDDLLAGITGWDNLNAIGMVESKREVGN---TISNEKRYFIMSINGHAQRFGDAVREH 175 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 W IE+++HWVLDV ED SRIR+ N+ E +S ++K+ALN ++ + K++ Sbjct: 176 WGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQESTKTSMKRKRKMAGW 235 Query: 363 HRERSSEVH 371 +V Sbjct: 236 DNSFLIKVL 244 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 235 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 88/365 (24%), Positives = 144/365 (39%), Gaps = 50/365 (13%) Query: 29 ILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEK 88 +L + G + + LE L+K+ GI TI R++ ID Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 MFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAK 148 F+EW+ E + + +A+DGK + G+ +K K + +++ G++L Q+ ++K Sbjct: 61 AFMEWVGEIVD-SRNTHLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEE 208 +NEIT IPELL LL + +++TIDA+G Q I +I ++ + L VK NQ + + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPVNVFSN-----------------YKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDF 251 ++ K + E + R E R + N Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICK-DASNLTKS 238 Query: 252 EFEWKGLKKLCVALSFRQKKEDKSA-----------------------------EGVSIR 282 + EW ++ + R E S + V Sbjct: 239 QKEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCT 298 Query: 283 YYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALN 342 IS + A+E R HW IE+ LH VLD ED S ++ +S I+K A N Sbjct: 299 ALISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSR--NNLSLIRKYAYN 356 Query: 343 LLRDC 347 +LR Sbjct: 357 ILRLA 361 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 235 bits (598), Expect = 2e-60, Method: Composition-based stats. Identities = 102/207 (49%), Positives = 137/207 (66%), Gaps = 1/207 (0%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 Q L PD R+ K + L +IL + + +VI GAD W E+E++ + + E+L+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 NGIP DT RV SNIDS FEK FI+W+ ++ EIIAIDGKTIRG+ G +K Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 +HMVSA++N+N +VLGQVK KSNEITAIP+LL +L ++ ++TIDAMGCQ IA Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IKDKKADYLLAVKGNQGKLHHAFEEKF 210 I K ADY+LAVK NQ +L E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 233 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 98/380 (25%), Positives = 157/380 (41%), Gaps = 22/380 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEW------ 57 SL+ ++ PD R V H L A+L V AV+ GA + ++ + + Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 58 -LKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI--TDGEIIAIDGKTIR 114 + + P + T R+++ +D+ A + W+ C T + ++DGKT+R Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 GSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAM 174 GS G +H+++ G VLGQV + K+NE+T LL L L ++T DA+ Sbjct: 147 GS---GPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIASKIKD-KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRK 233 Q++ A + D KKA Y+ VK NQ +L+ + + T HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKTLPWTKIPIQD-----ETSTRGHGRY 258 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAK 293 + R T DF + S V +S+ Sbjct: 259 DIRRLQAVTCTGPLALDFPHAVQ--ALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGPA 316 Query: 294 EFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGE 353 E A +R HW IE LH + D EDASR+R GNA ++ ++ A+NLLR Sbjct: 317 ELADWLRGHWAIET-LHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGITTIA 375 Query: 354 EEKKEGCVKHRERSSEVHFL 373 + ++ R ++ L Sbjct: 376 AALR-HNSRNPYRPLQLLGL 394 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 96/240 (40%), Positives = 142/240 (59%), Gaps = 8/240 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +Q LL+++ D RQQ KV+H L IL + + A +A AD+W E+ F + ++L+KY Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD---GEIIAIDGKTIRGSFDK 119 + NG P DT+ RV+ + ++++ +W + + +II IDGKT+R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 + H+VSA+S E+G LGQ KSNEITAIPELL + +K ++TIDAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS---NYKGDSFSTQEISHGRKETR 236 IA KI++K+ADY+L++K NQG L+ E F F +G TQE +HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 226 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 90/344 (26%), Positives = 137/344 (39%), Gaps = 22/344 (6%) Query: 28 AILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFE 87 A+L + V A AG + + + + P + T V+S +D Sbjct: 2 ALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 KMFIEWMQECHEITDGEI---IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVK 144 + +D IA+DGK +RG+ + A H+VS F++ +VLGQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGAL--RAKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TEAKSNEITAIPELLNLLYLK-KNLITIDAMGCQKDIASKIKDK-KADYLLAVKGNQGKL 202 KSNEI + LL LL + L+T+DAM Q A I K+ YL+ VK NQ K+ Sbjct: 120 VAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 HHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLC 262 V + DS HGR ETR + R + K++ Sbjct: 180 LARITALPWAEVPAAATDDS-----RGHGRVETRTLQIITAARGIGFPYA------KQII 228 Query: 263 VALSFRQKKEDKSAEGVSIRYYISSKDMDAKE---FAHAIRAHWLIEHSLHWVLDVKMNE 319 R V + Y I S + +R H IE+SLHW+ DV +E Sbjct: 229 RITRER-LITATDQRSVEVVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTFDE 287 Query: 320 DASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKH 363 D R GN A++++ ++ A+NL R E + + Sbjct: 288 DRQRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACRITALTR 331 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 225 bits (573), Expect = 2e-57, Method: Composition-based stats. Identities = 86/274 (31%), Positives = 136/274 (49%), Gaps = 10/274 (3%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 I+ L++ + D R GK++H+L IL + VCAV+A A+ +++I +G + WL + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD------GEIIAIDGKTIRGS 116 D GIP DT RV ID AFE+ F+ W + E IA+DGK +R S Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 FD+ + +H+VSA++ G+VL Q + K E A+P +L L+L L+++DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG--DSFSTQEISHGRKE 234 ++++A I + A YLL +K NQ K+H F N F++ + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFR 268 R W GL + + + R Sbjct: 242 RRRVFACPDAGCFTT--LRGWPGLTTVLASETIR 273 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 224 bits (570), Expect = 5e-57, Method: Composition-based stats. Identities = 84/406 (20%), Positives = 152/406 (37%), Gaps = 54/406 (13%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIA-GADEWQEIEDFGHERLEWLKKYGDF 64 L+D ++ D R +H L++IL + CA +A G D IE + + + Sbjct: 30 LIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLHI 89 Query: 65 D-------NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI------------ 105 + P + TI RV++ +D + ++ + E Sbjct: 90 WRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRRT 149 Query: 106 ---------------------IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVK 144 A+DGK ++G+ + +H++S ++ + V Q + Sbjct: 150 EREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQRQ 207 Query: 145 TEAKSNEITAIPELLNL---LYLKKNLITIDAMGCQKDIAS-KIKDKKADYLLAVKGNQG 200 AKS+EI A+ LL L +IT DA+ Q+ A I++ A Y++ VK NQ Sbjct: 208 IPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQP 267 Query: 201 KLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKK 260 LH +++ + HGR E R+ + ++F ++ L+ Sbjct: 268 TLHATAITAL-TGTDTDFAAVTHRETHRGHGRTEYRILRTAPADGIDFPYAAQVFRVLRH 326 Query: 261 LCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW-LIEHSLHWVLDVKMNE 319 R S E ++++ A +R HW IE+ +H V DV E Sbjct: 327 RGGLDGIRH-----SKEVCYGITDLTARQAGPAHLAAYVRGHWKAIENGVHHVRDVTFAE 381 Query: 320 DASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 DA + R ++ + +A LR + ++E H+ Sbjct: 382 DACQARTATLPRALAAFRNLATGTLRRAGHVNIAHARREHGYDHQR 427 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 223 bits (569), Expect = 7e-57, Method: Composition-based stats. Identities = 93/385 (24%), Positives = 154/385 (40%), Gaps = 29/385 (7%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLK-- 59 +Q L D ++ PD R ++H+L IL L+ AV AG +EI + + Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 60 -----KYGDFDNGIPVDDTIARVVSNIDSLAFEKM---FIEWMQECHEITDGEIIAIDGK 111 P DT+ RV+S +DS A + F ++A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL----YLKKN 167 T+RG+ G A H+++ + GVVL + + AK+NE+TA LL L L Sbjct: 158 TLRGA--AGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 LITIDAMGCQKDIASKIK-DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQ 226 ++T DA+ + A I + A ++ VK N L + S + Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIP----IGHSAE 271 Query: 227 EISHGRKETRL---HIVSNVTRLNFCDFEFEWKGLKKLCVAL-----SFRQKKEDKSAEG 278 +HGR E R S R + + + + + R + S Sbjct: 272 GRAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIPSTVT 331 Query: 279 VSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKK 338 V + ++ + + A R HW IE+ +HWV DV EDASR+R G I++ ++ Sbjct: 332 VHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRN 391 Query: 339 MALNLLRDCKDIKGEEEKKEGCVKH 363 + + L+R + + + Sbjct: 392 LIIGLIRLAGHNRIAPTIRRIRHDN 416 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 221 bits (563), Expect = 3e-56, Method: Composition-based stats. Identities = 77/368 (20%), Positives = 140/368 (38%), Gaps = 15/368 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + L+ + PD R V+++L+ +L L V IAG D + ++ + Sbjct: 25 VAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAGL 84 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE----IIAIDGKTIRGSFD 118 F +P + TI R+V ++ W +A DGK ++G+ Sbjct: 85 GFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGARS 144 Query: 119 KGKRKGAIH--MVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 + + +V A ++ G LG + A +EI ++ L+N + L+T D + Sbjct: 145 RPPQGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLVTTDCLHA 203 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETR 236 + +A I+ K +L ++KGNQ + + G+ T+E +HGR E R Sbjct: 204 HEPLARAIRAKGGHWLFSIKGNQPTVRAKLAGLPW-----DEFGNQHVTREKAHGRIEER 258 Query: 237 LHIVSNVTRLNFCDFEFEWK--GLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 + + F + L + S E + +S+ + Sbjct: 259 ALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTDQASPAQ 318 Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 A R HW +E +H V D M+ED IR NAA + + ++ LR + Sbjct: 319 LARWARGHWTVEA-IHHVRDRTMDEDRHTIRTKNAALNWAIARDTTISALRLAGYKNIRQ 377 Query: 355 EKKEGCVK 362 ++ Sbjct: 378 ARRATIRD 385 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 220 bits (561), Expect = 4e-56, Method: Composition-based stats. Identities = 83/238 (34%), Positives = 123/238 (51%), Gaps = 7/238 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 ++D D R K HK+ I+++++ AVI GA W EIE+FG+ ++ + K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS------FD 118 IP DT R S I FE +F W+++ + G ++AIDGK +RG Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 +GK + MVSA+S NG+ LGQVK + KS+EITAIP L+N L L ++TIDAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETR 236 DI I A+Y++A+K N+ K + ++ + + R Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRDEIINRVIRHVSEKCRTWKD 240 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 217 bits (553), Expect = 5e-55, Method: Composition-based stats. Identities = 90/248 (36%), Positives = 127/248 (51%), Gaps = 12/248 (4%) Query: 68 IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGS------FDKGK 121 IP DT R S I FE +F W+++ + G ++AIDGK +RG GK Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 + MVSA+S NG+ LGQVK + KSNEITAIP L+N L L ++TIDAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY---KGDSFSTQEISHGRKETRLH 238 I + A+Y++A+K N+ K + ++ + + ++ HGR ETR Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD-AKEFAH 297 V + + F+ + GLK + S R +RYY++S D +E A Sbjct: 183 TVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIV-ATGEYTQEVRYYVTSLDNTKPEEIAS 241 Query: 298 AIRAHWLI 305 AIR HW I Sbjct: 242 AIRQHWSI 249 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 212 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 80/321 (24%), Positives = 132/321 (41%), Gaps = 26/321 (8%) Query: 50 FGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAID 109 FG + +WLK GI T + V ++ +AFE + +Q Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLI 169 K S + + +V ++ G+V+GQ + NE+ + L LL L+ ++ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 TIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEIS 229 T DA+ C+ D A I DY LA+K NQ L + + + E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKD 289 H R E R + V ++ + GL+ + + + + +RY++ S Sbjct: 206 HDRCERRRACIVAVNDID-------FPGLQAIGSVEATSRHAD--GRLTSHVRYFLLSTI 256 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 M A R HW IE+ LHWVLDV+ EDA+R R+ + I+ ++K+ALNL+R D Sbjct: 257 MSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHPD 316 Query: 350 IKGEEEKKEGCVKHRERSSEV 370 K + + + Sbjct: 317 KASIRRKIKNAGWDDQFLISI 337 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. Identities = 88/388 (22%), Positives = 147/388 (37%), Gaps = 46/388 (11%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIA-GADEWQEIEDF---------G 51 S + + ++ PD R + + L + + +CAV A G D + ++ Sbjct: 20 SRAGIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERV 79 Query: 52 HERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT---------- 101 RL W G +P + TI R ++ +D A ++ Sbjct: 80 RLRLPWNPWDGHL---LPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPV 136 Query: 102 ---------DGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEI 152 A+DGKT RG+ K +H++ ++ G +LGQ + +AKSNE Sbjct: 137 RPPAGDQAVPVRAYAVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNET 194 Query: 153 TAIPELLNLLYLKKNLITIDAMGC-QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFP 211 T LL L L ++ DA+ + ++ + K A YL K NQ KL Sbjct: 195 TEFRALLAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLRAFLAALPW 254 Query: 212 VNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKK 271 + + T++ HGR+ETR V+ VT L+F + + + RQK Sbjct: 255 TEIPTADL-----TRDRGHGREETRTLKVATVTHLDFPHAA------QAIRIRRWRRQKG 303 Query: 272 EDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 + S E + ++ A R W IE H+V DV ED+S R G Sbjct: 304 QPASHETIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPA 363 Query: 332 IISGIKKMALNLLRDCKDIKGEEEKKEG 359 +++ + + LR ++ Sbjct: 364 VLALFRATVADTLRRAGHRSVPACRRAH 391 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 83/402 (20%), Positives = 153/402 (38%), Gaps = 59/402 (14%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVI-AGADEWQEIEDFGHERLEWLKKY 61 ++ L+ D R V++++S++L L VCA+ AG D ++ Sbjct: 31 VRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELAA 90 Query: 62 GDFDN-------GIPVDDTIARVVSNIDSLAFEKMFIEWMQ------------------- 95 IP + T+ V+ +D + ++ Sbjct: 91 FGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGGI 150 Query: 96 -------------ECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQ 142 + IA+DGK +R + + + ++SA + +G+ L Sbjct: 151 EREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLAS 208 Query: 143 VKTEAKSNEITAIPELLNLLYL---KKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQ 199 + AK+NEI LL+ L K ++T DA+ Q+D A+ + ++ A YLL +K NQ Sbjct: 209 REIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNNQ 268 Query: 200 GKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLK 259 + ++ D+ HGR E RL V V L F Sbjct: 269 RGQARQLHALPWKEIPVIHRDDA-----RGHGRHEQRLVQVVTVNGLLFPHA-------A 316 Query: 260 KLCVALSFRQKKE--DKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKM 317 ++ R+ S+E V + +++ A E A R HW +E+++HW DV Sbjct: 317 QVLRIQRRRRLYGAKKWSSETVYAITDLPAEEASAAEIASWARGHWTVENTVHWCRDVTF 376 Query: 318 NEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 NED S++R N +++ ++ + L+ + ++ Sbjct: 377 NEDKSQVRTHNTPSVLAAVRDLIRGALKLAGYVNTAAGRRAH 418 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 200 bits (508), Expect = 7e-50, Method: Composition-based stats. Identities = 83/223 (37%), Positives = 118/223 (52%), Gaps = 9/223 (4%) Query: 111 KTIRGSFDKGKRKGAIHMVSAF---SNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKN 167 K + S ++ S S +VLGQ K KSNEITAIP L+ +L ++ + Sbjct: 3 KGFQRSVKTEEKHKPSQKKSQVLKDSLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESS 62 Query: 168 LITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF---PVNVFSNYKGDSFS 224 +ITIDAMGCQK+I S I+ KK DY++ +K NQ L +E F F + + + Sbjct: 63 IITIDAMGCQKEITSLIRKKKGDYIITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQ 122 Query: 225 TQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYY 284 E H R E R I +V+ L + W LK + + S R+ +R+Y Sbjct: 123 EIETGHHRIEKREVIAVSVSSLPCLHNQDLWTELKTVVMVKSERRLWNKT---TTEVRFY 179 Query: 285 ISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRG 327 ISS + ++++ A AIR+HW IE+SLHW LDV +ED SRIR Sbjct: 180 ISSVEKNSQKIATAIRSHWEIENSLHWTLDVTFSEDKSRIRTR 222 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 198 bits (502), Expect = 3e-49, Method: Composition-based stats. Identities = 106/219 (48%), Positives = 141/219 (64%), Gaps = 13/219 (5%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 MS+ L D+ + D RQ KV +KL +LFL + AVI+GA+ W+EIEDFGH RL+WLKK Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKG 120 YGDF +GIPV DTIAR+V ID F + FI+WMQ ++TD +++A+DGKT+ Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 HM+SAF+ +NGVVLGQ +T+ KSNEITA+PELL LL L+ ++T+DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 ASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYK 219 I KKADY +AVK + ++ Sbjct: 168 VKTIVKKKADYCIAVKKIKSPYIRHSRMHLSSVEATSQD 206 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 74/179 (41%), Positives = 105/179 (58%), Gaps = 3/179 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SLL PD R+ + H+L +L +C VI+GA+ W + + +L+WL+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +GI DT RV S +D+ FE F+ W+ +G+ +AIDGK +RGS D + Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHD--GARS 123 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 IH+VSA+S+ + LGQV+T KSNEITAIPELL L ++ + ITIDAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPARH 182 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 184 bits (466), Expect = 5e-45, Method: Composition-based stats. Identities = 82/234 (35%), Positives = 118/234 (50%), Gaps = 8/234 (3%) Query: 143 VKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKL 202 + TE KSNEITAIP LL L KK ++TIDAMGCQKDIA I D+++AVK NQ KL Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 HHAFEEKFPVN---VFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLK 259 A + + ++ T HGR++ R H V+ V E+ W +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPGFAAKGEWPW--IK 118 Query: 260 KLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNE 319 + A+ + + +RYY+ S+ + K F +R HW IE S+HWVLDV E Sbjct: 119 AIGTAVRITTHADGT--QSDEVRYYMLSRFLSGKRFGEVVRGHWGIE-SMHWVLDVTFGE 175 Query: 320 DASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVHFL 373 D +R R+ A +S +++ A+ LL+ + K C+ +EV L Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHPEKDSIRGKMIRCLMDTSFLNEVLTL 229 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 181 bits (460), Expect = 2e-44, Method: Composition-based stats. Identities = 80/226 (35%), Positives = 109/226 (48%), Gaps = 10/226 (4%) Query: 154 AIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN 213 AIPELL L L+ +TIDA+G Q IA I + ADY+LAVK NQ +L + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 VFSNYKGDS--FSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKK 271 +G + + HGR ETR+ VS + W GL++L + RQ Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWLASTGQH-WAGLQRLVMLERTRQIG 119 Query: 272 EDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 + + E YYISSK + A + A IRAHW IE+ LHWVLDV EDAS IR AA Sbjct: 120 QKVTTERC---YYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 IISGIKKMALNLLR----DCKDIKGEEEKKEGCVKHRERSSEVHFL 373 ++ ++K+ LNL R + + ++ L Sbjct: 177 NMASLRKITLNLARLAQNRQPKKVSLKNIRNLAAWDTAMRDDILGL 222 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 62/230 (26%), Positives = 109/230 (47%), Gaps = 5/230 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 SL++ ++ PD R + ++ L +L L + AV+ G + I FG R + L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 DNG-IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 NG +P +TIA ++ +D + + W+++ H E +A+DGK + GS + + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGS--RDGQV 120 Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLY-LKKNLITIDAMGCQKDIAS 182 H+++A++ + V+ Q+ EA +NE A LL +L L ++T DA+ Q D+ + Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGR 232 ++ K D +L K NQG L E F ++ G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRGN 230 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 171 bits (434), Expect = 3e-41, Method: Composition-based stats. Identities = 68/218 (31%), Positives = 103/218 (47%), Gaps = 3/218 (1%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M +SL + +S PD R + H L A+L L A++ G Q I FG + L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFDNG-IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDK 119 F G P T++R + D E W+ IA+DGKT+RGS + Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGS--R 118 Query: 120 GKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 + H+V+A++ VL QV+ +AK+NE A LL +L + +++T DAM CQ+D Sbjct: 119 DGQVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN 217 +A+ + ADY+L K NQ L + E + Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFEDAAR 216 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 170 bits (430), Expect = 7e-41, Method: Composition-based stats. Identities = 63/231 (27%), Positives = 111/231 (48%), Gaps = 13/231 (5%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 IQ L+D +S T D R++ ++H +++ VCA+++GA + + ++ LKK Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFDNG-------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHE----ITDGEIIAIDG 110 F P + T+ R + +ID L +++ W ++ D +++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLIT 170 K +RG+ K K IH ++AF G+V+ Q + K+NEI + LL + ++ ++T Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 IDAMGCQKDIASKIKD-KKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG 220 DA+ Q + A I + KKADY+ VK NQ + E + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIESLPWEAFPPSSDI 450 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 170 bits (430), Expect = 8e-41, Method: Composition-based stats. Identities = 79/280 (28%), Positives = 118/280 (42%), Gaps = 13/280 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + L+ + D R +H L +LFL + A + GA E+ +F R E L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD----GEIIAIDGKTIRGSFD 118 +G P DT +RV +D E+ F +M ++AIDGK++R +D Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 KG+ MVS + E + ++ +EI A +L L LK +T DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLH 238 +A + KA Y L +K N G L A E F + F T+E HGR+E R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGF----AAVTDLAVFETRERGHGREEQRRA 234 Query: 239 IVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEG 278 V V RL GLK + + R K + Sbjct: 235 SVLPVDRLVK---RPSLPGLKAIGRIEAVRTGANGKPEQA 271 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 68/270 (25%), Positives = 116/270 (42%), Gaps = 22/270 (8%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 +LL+ ++ PD R++ V+++ +A+L + VCA+++GA + I ++ + + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-------------GEIIAI 108 +P TI RV+ +D A E W+Q + D ++A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKN 167 DGK +R + +H++ + GVVL QV + K+NEI +L+ + L Sbjct: 167 DGKAMRAT---RHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 LITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQE 227 LIT+DAM Q A + + A L+ VK NQ +H + +V +T Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKTLPWKDVPVG-----HTTTG 278 Query: 228 ISHGRKETRLHIVSNVTRLNFCDFEFEWKG 257 HGR ETR V + G Sbjct: 279 RGHGRIETRTLKAVTVPAGLGFPHAAQAIG 308 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 59/267 (22%), Positives = 112/267 (41%), Gaps = 17/267 (6%) Query: 68 IPVDDTIAR-----VVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKR 122 IP ++R ++ +D F+ + + + + + DGK +RGS + GK+ Sbjct: 14 IPETTVVSRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGSIESGKK 73 Query: 123 KGAIHMVSAFSNENGVVLGQVKTEA-KSNEITAIPELLNLLYLKKNLITIDAMGCQKDIA 181 +G +V + +G + Q + K +EI + LL+ L IT+DA+ Sbjct: 74 RGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALHLCPSTT 132 Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVS 241 I +L+ +K NQ L + + D +T + +HGR E R + + Sbjct: 133 EMITKAGGVFLIGLKENQPTLLAHMTDC------ALPPIDQKTTFDFNHGRVEQRKYWLY 186 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA 301 +V++ F+ W + R + K+A+ Y S + + A+R Sbjct: 187 DVSK---QGFDPRWDNTAFKRLVKVQRTRINQKNAKISREVSYYISNETAKEGIFDAVRN 243 Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGN 328 HW +E + H + DV +NED + ++ Sbjct: 244 HWSVEVNNH-IRDVTLNEDQLKSKKRQ 269 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 161 bits (406), Expect = 4e-38, Method: Composition-based stats. Identities = 59/165 (35%), Positives = 90/165 (54%), Gaps = 3/165 (1%) Query: 47 IEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEII 106 + + ER L+ + NG P DT RV+ I+ + + +E +G+ I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKK 166 AIDGK ++GS K H++SA+ +E G+ L Q K NE+ AIPE+L+ L L Sbjct: 61 AIDGKRLKGSKKKTGST---HILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 NLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFP 211 +I+IDAMG Q +IA +I +ADY+L++KGNQ L+ + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCFT 162 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. Identities = 81/184 (44%), Positives = 112/184 (60%), Gaps = 3/184 (1%) Query: 94 MQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEIT 153 M+ H++T GE++AIDGKT+RGS+D+ R+ IHMVSA+++ N +VLGQ+KT KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN 213 AIP L+ +L L+ ++TIDAM CQ IA I K DYLLAVKGNQGKL A + F + Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 VFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED 273 + D+ E GR E R + V + + L W GL + + ++R K Sbjct: 121 RRAPIDRDTCQ-IEKQKGRVEARTYHVLSASDLIR--DFSTWSGLTSIVMVENYRAAKGR 177 Query: 274 KSAE 277 + A Sbjct: 178 QRAR 181 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. Identities = 56/187 (29%), Positives = 96/187 (51%), Gaps = 4/187 (2%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWL-K 59 + +LL + PD R+ ++ L +L TV A+++GA ++ I F R E L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 60 KYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQE---CHEITDGEIIAIDGKTIRGS 116 +G PV +T+ V+ ++D+ E F + E+ + ++A+DGKT+RGS Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 FDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 FD + A ++AF + + +VL + + KSNEI A +++ L L + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIASK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 156 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 19/223 (8%) Query: 11 SVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGI-- 68 + D R+ ++H ++L + + V+AG ++ I + + + + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 69 -----PVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 P + TI R++S D + +++ +++ + G IAIDGKTIR S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYIVAH---SSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHMVSAFSNENGVVLGQVKTEA-KSNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 ++ +++A +++G V+ Q + K +EI A LL L L ++T DA+ Q +AS Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFST 225 +I++K DY+ VK N+ L + D T Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDDLLRT 440 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 153 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 50/192 (26%), Positives = 80/192 (41%), Gaps = 6/192 (3%) Query: 182 SKIKDKKADYLLAVKGNQGKLHHAFEEKFPV-NVFSNYKGDSFSTQEISHGRKETRLHIV 240 KI +KK DY++ +K N + E F + ++F R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 SNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIR 300 V+ + EWKG+K + R E +YISS D+D + A +R Sbjct: 61 LKVSD--WLSKAEEWKGIKSVLEVCRKRSDNGK---ESQEKVFYISSLDVDVQILAKCVR 115 Query: 301 AHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 HW +E+ HWVLDV ED + AE ++ ++++ALNL R + + K Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHPKKQSMKGKLTAA 175 Query: 361 VKHRERSSEVHF 372 E E+ Sbjct: 176 GWSDEFRDELLL 187 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 66/340 (19%), Positives = 123/340 (36%), Gaps = 41/340 (12%) Query: 58 LKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 + G P +T+ +++ +D+ ++ WM+ + G I A DGK + GS Sbjct: 14 WRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVLGGS- 71 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQ 177 K A+H V ++ G+ L Q + + A+ LL L ++++DA Sbjct: 72 -KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFLN 129 Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS--------------------- 216 + I + +YL VKG+Q + + P FS Sbjct: 130 AAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPRR 189 Query: 217 ------------NYKGDSFSTQEISHGRKETRLHIVSNVTRL-NFCDFEFEWKGLKKLCV 263 + T E S GR E R V + + + W+ + ++ Sbjct: 190 KRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVTQIGG 249 Query: 264 ALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASR 323 + +++ V +SS+ +F +IR HW IE+ +H D M ED R Sbjct: 250 LRRWCRRRH-ADLWTVEEVTVVSSRQRTPAQFLASIRNHWTIENQVHRPRDGSMQED--R 306 Query: 324 IRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKH 363 + I++ + + +NL+R + + Sbjct: 307 LHGRAIGVILAVCRNVVINLIRRHLPGRYIPTARNAITTD 346 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 58/142 (40%), Positives = 78/142 (54%), Gaps = 3/142 (2%) Query: 101 TDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLN 160 G +IAI+GK++RG+ A+H VSA++ G+ LGQ+ + KSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 LLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG 220 L L+ ++TIDA+GCQ +A +I DY+LAVK NQ L HA + F Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 221 DS---FSTQEISHGRKETRLHI 239 T + HGR ETR Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 151 bits (380), Expect = 5e-35, Method: Composition-based stats. Identities = 55/193 (28%), Positives = 97/193 (50%), Gaps = 8/193 (4%) Query: 180 IASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHI 239 ++ + +K DY+LA+KGN + ++ F V S +T + HGR E R++ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSPVTSTRSV--HTTFDKGHGRIERRIYT 58 Query: 240 VSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 + T + + + + EWK L + S +K E IRY+I+S D K+FA + Sbjct: 59 L--DTNIGWFEDKKEWKHLAGFGMVDSMVTRKGK---ECREIRYFITSVT-DVKQFAKGV 112 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 +HW+IE++LHW LDV +D + NAAE ++ I+++ N ++ + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKRA 172 Query: 360 CVKHRERSSEVHF 372 C+ E +++ F Sbjct: 173 CIYDDEFRAQILF 185 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 58/189 (30%), Positives = 92/189 (48%), Gaps = 10/189 (5%) Query: 183 KIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSN 242 I KK DYLL VKGNQ KL A E F D + E HGR ++ V + Sbjct: 1 MIIAKKGDYLLMVKGNQPKLLEAIEIAFIDQHDV-KSVDRSALVERGHGRTVGQIASVLS 59 Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 + +W + S R E +S + YYI+S+ + A++ A ++RA Sbjct: 60 AKGIINPG---DWPNCVTIGRIDSMRVVDEKES--DLERCYYITSRALTAEQLAASVRAR 114 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 W +E+ HW+LDV +EDAS + + NA + +S ++K+ALN++R K + +K Sbjct: 115 WGVENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKT----DTRKSSLRL 170 Query: 363 HRERSSEVH 371 R+ ++ Sbjct: 171 KRKGAARDD 179 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 57/225 (25%), Positives = 104/225 (46%), Gaps = 14/225 (6%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFG----HERLEWL 58 ++ L PD R + +H L AIL + V AV+ A + + ++ +L+ + Sbjct: 220 MEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKRI 279 Query: 59 KKYGDFD---NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRG 115 + + P + T+ RV+ + A + W+ E +A+DGK ++G Sbjct: 280 RARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAWLLGIA---GFEAVAVDGKVLKG 336 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMG 175 + + +H++SAF + G + Q + K+NEI + LL + ++ ++T DA+ Sbjct: 337 AV--REDGSQVHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADALH 394 Query: 176 CQKDIASKIKD-KKADYLL-AVKGNQGKLHHAFEEKFPVNVFSNY 218 Q+ A + + KKADYL AVKGNQ KL ++ + Sbjct: 395 TQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLICLPWGDFPPQR 439 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 147 bits (370), Expect = 7e-34, Method: Composition-based stats. Identities = 48/212 (22%), Positives = 79/212 (37%), Gaps = 11/212 (5%) Query: 155 IPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV 214 +L + + IT DA+ QK +A I + A YL VK NQ L+ + F Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEH-- 59 Query: 215 FSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVAL--SFRQKKE 272 + D HGR +TR + + E+ + + S+ K Sbjct: 60 --RKEPDYCLQDPPGHGRIDTRSIWTTT-----ELNEYLEFPHVGQAFCIHKKSYDPKTN 112 Query: 273 DKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEI 332 D R HW IE+S H++LD +ED +RIR GN Sbjct: 113 KVCENTFYGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPAN 172 Query: 333 ISGIKKMALNLLRDCKDIKGEEEKKEGCVKHR 364 + ++ A+ LL+ ++ ++ + R Sbjct: 173 TNRLRGFAIGLLKSKGVKDIAQKVRDLHQQIR 204 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 144 bits (362), Expect = 6e-33, Method: Composition-based stats. Identities = 63/246 (25%), Positives = 96/246 (39%), Gaps = 18/246 (7%) Query: 28 AILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFE 87 A+L + V A A + + + + P + T V+S +D Sbjct: 2 ALLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 KMFIEWMQECHEITDGEI---IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVK 144 + +D IA+DGK +RG+ + A H+VS F++ +VLGQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGAL--RAKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TEAKSNEITAIPELLNLLYLK-KNLITIDAMGCQKDIASKIKDK-KADYLLAVKGNQGKL 202 KSNEI + LL LL + L+T+DAM Q A I K+ YL+ VK NQ K+ Sbjct: 120 VAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 HHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLC 262 V + DS HGR +TR + R + K++ Sbjct: 180 LARITALPWAEVPAAATDDS-----RGHGRVKTRTLQIITAARGIGFPYA------KQII 228 Query: 263 VALSFR 268 R Sbjct: 229 RITRER 234 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 143 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 57/182 (31%), Positives = 84/182 (46%), Gaps = 11/182 (6%) Query: 176 CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKET 235 K I + DY++AVKGNQ +LH + + + T E R T Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQIKLTTEQRLPVSLDI----TTERRSDRITT 56 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 R V + + ++W+GL++L F + I YYISS ++A +F Sbjct: 57 RSVSVFD----DLSGISYDWEGLQRLVKVERFGTRAGKPYH---QIVYYISSLTINAAQF 109 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 A IR HW IE+ LHWV DV ++ED SR+R+GNA S I+ + L +LR Sbjct: 110 AQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILRYNGYSSITTG 169 Query: 356 KK 357 + Sbjct: 170 IR 171 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 141 bits (354), Expect = 4e-32, Method: Composition-based stats. Identities = 55/186 (29%), Positives = 78/186 (41%), Gaps = 11/186 (5%) Query: 192 LLAVKGNQGKLHHAFEEKFPVNVFSN---YKGDSFSTQEISHGRKETRLHIVSNVTRLNF 248 +LAVK NQ L + HGR ETR + + Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFE 60 Query: 249 CDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHS 308 D W GL+ + + S R+ RYY+SS DA AHA+RAHW IE S Sbjct: 61 PDL---WPGLQSIPMVESTREI---GDTVTTGRRYYVSSLPADAVRIAHAVRAHWGIE-S 113 Query: 309 LHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG-CVKHRERS 367 +HWVLDV NED R R NAA+ + ++++A L+R K + + Sbjct: 114 MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAGASDDYR 173 Query: 368 SEVHFL 373 +++ L Sbjct: 174 AQLLGL 179 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 140 bits (353), Expect = 6e-32, Method: Composition-based stats. Identities = 46/182 (25%), Positives = 83/182 (45%), Gaps = 4/182 (2%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +LL+ ++ PD R ++ L +L L + ++ ++ +EDF E L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 DN-GIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRK 123 P D T RV+ ID +F W+ + + + +DGK+I+ + + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 124 GA--IHMVSAFSNENGVVLG-QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDI 180 I++VS FS + GV + Q + +EI + LL L L+ + T+D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 Query: 181 AS 182 S Sbjct: 184 YS 185 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 138 bits (348), Expect = 2e-31, Method: Composition-based stats. Identities = 57/388 (14%), Positives = 121/388 (31%), Gaps = 33/388 (8%) Query: 10 ISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIP 69 + PD+R + + L+ IL + ++AGA E E+ ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VDDTIARVVSNIDSLAFEKMFIEWMQE-------CHEITDGEIIAIDGK-----TIRGSF 117 D T + + ++ ++A+DGK T+ Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 118 DKG--------KRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKN- 167 + + S + V A++NE +L L Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 168 --LITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFST 225 ++T DA + + + DY+ A+K + + E + + + D Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARREDVLDN 259 Query: 226 QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 + + S+ E W + S ++ R ++ Sbjct: 260 ATTATREIQILAVDPSHGYGAGKGPEESVWSHARTFLRVTSTVRRSG--VVIERDSRLFV 317 Query: 286 SSKDMD---AKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISG--IKKMA 340 SS+ D ++ +RAHW +E++ H LD ED +A +++ ++++A Sbjct: 318 SSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDERPWIAADANGMLAVLLLRRIA 377 Query: 341 LNLLRDCKDIKGEEEKKEGCVKHRERSS 368 LL + + + Sbjct: 378 YTLLALFRAVTLRSDDHRAMRWLALLRW 405 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 138 bits (347), Expect = 3e-31, Method: Composition-based stats. Identities = 48/182 (26%), Positives = 80/182 (43%), Gaps = 3/182 (1%) Query: 20 GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG-IPVDDTIARVV 78 H L A+L L AV+ Q I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGV 138 ID E W+ +A+DGK +RGS + H V+A++ Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGS--RDGDVPGPHRVAAYAPHAAA 119 Query: 139 VLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGN 198 VLGQ++ +A++NE A LL ++ + +++T A C +D+A+ + D Y+ +G Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQ 179 Query: 199 QG 200 Sbjct: 180 PT 181 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 138 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 69/174 (39%), Positives = 94/174 (54%), Gaps = 3/174 (1%) Query: 97 CHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIP 156 G+IIA+DGKT+RGS+D+ K AIHMVSA+S N +VLGQ+KTE KSNE TAIP Sbjct: 1 MAARIPGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIP 60 Query: 157 ELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFS 216 +L LL L+ +TIDA+G Q+DIA +I DK ADYLL VK NQ LH + + Sbjct: 61 KLFTLLALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAK 120 Query: 217 NYKGDS---FSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSF 267 + D + + HGR + V++ + + + Sbjct: 121 GFTEDFTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALADKTRPNSLWVAEVYRY 174 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. Identities = 66/142 (46%), Positives = 86/142 (60%), Gaps = 4/142 (2%) Query: 106 IAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLK 165 +AIDGK +RGS D + IH+VSA+S+ + LGQV+T KSNEITAIPELL L ++ Sbjct: 1 MAIDGKCLRGSHD--GARSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDS--F 223 + ITIDAMGCQ DIA +I + ADY+L VKGNQ L A + F + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 STQEISHGRKETRLHIVSNVTR 245 S + +HGR ETR + +N Sbjct: 119 SQTDKNHGRIETRRCVATNDVA 140 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 5/180 (2%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLK-KY 61 + +L + PD R+ L +L ++ A+++GA ++ I F H L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGK 121 G P +I + +D A F E +IA+DGKT+RGS D+ + Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARLAE--GAAVIALDGKTLRGSLDRFE 118 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTE--AKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 + A ++SAF+ E +VLGQ+ E K +EI A L+ L L L T+DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 136 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 53/384 (13%), Positives = 112/384 (29%), Gaps = 60/384 (15%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 + D R+Q ++K AI + + ++++ + ++ ++ Sbjct: 40 GFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLVPK 95 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI--------TDGEIIAIDGKTIRGS 116 + +P DT+ + + D + +Q E + AIDG + + Sbjct: 96 NIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELFHT 155 Query: 117 FD-----------KGKRKGAIHMVSAFSNENG---------VVLGQVKTEAKSNEITAIP 156 + K H V G + Q + E T Sbjct: 156 KAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTVAQ 215 Query: 157 ELLNLLYLK----KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV 212 L+ + ++ T+DA+ + + D A ++ +K + ++ F Sbjct: 216 RLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACF-- 273 Query: 213 NVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKE 272 +N DS + G + +W ++ + + Sbjct: 274 ---ANRLPDSTWEERDGKGNTVYVQAW--------DEEGLAQWPQVRVPMRIVKIIRHTN 322 Query: 273 DKSAEGVSIRYYI-----------SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDA 321 E + SS+ D + A A W IE+ L D Sbjct: 323 KTVIEANKEVFVTDVVERWIATTCSSEKADTQTIAQIAAARWDIENIGFRNLKTFNALDH 382 Query: 322 SRIRRGNAAEIISGIKKMALNLLR 345 + A + + G + +A NL R Sbjct: 383 CFVHDSVAIKAMIGFQVLAFNLKR 406 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 82/206 (39%), Gaps = 11/206 (5%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEE--------KFPVNVFSNYKGDSFST 225 M Q D+ + ++++ DY+L K NQG L E FP + + D+ + Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 QEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYI 285 E+S G +++ + ++ W G++++ RQ + E V + Sbjct: 61 CEVSKGHGWVERRTMTST--IWLNEYLTRWPGVQQVFRLTRTRQVGGKTTVEVVYGISSL 118 Query: 286 SSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 SS R HW IE H + D + ED R+RRG A +++ ++ +A+ LLR Sbjct: 119 SSVAAAPDALLRYTRTHWGIESR-HHIRDATLGEDRCRVRRGAAPRVLAVLRNVAVYLLR 177 Query: 346 DCKDIKGEEEKKEGCVKHRERSSEVH 371 + K + H Sbjct: 178 RLGTGTIAAVIRTVGAKPELALAAAH 203 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 61/348 (17%), Positives = 109/348 (31%), Gaps = 64/348 (18%) Query: 26 LSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLA 85 L+++L L V+AG + + ++ + L GIP + T R+V D +A Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FEKMFIEW--MQECHEITDGEIIAIDGKTIRG--SFDKGKRKGAIHMVSAFSNENGVVLG 141 ++ W +A DGKT++G SF + ++ A ++ G+ G Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITAG 167 Query: 142 QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGK 201 + +EI A+ L L L L+T +K Sbjct: 168 HQRVVG-GDEIAALEALAGRLDLTDVLVT--------------TAEKGH----------- 201 Query: 202 LHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDF--EFEWKGLK 259 GR E R VT F + L+ Sbjct: 202 -----------------------------GRVEVRSLKALTVTTPKLVGFWGTKQVIELR 232 Query: 260 KLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF--AHAIRAHWLIEHSLHWVLDVKM 317 + S E + + ++ ++ R HW +E +H V D + Sbjct: 233 RRTRRKKTVTAAPTVSEEVFYLVTSLPAEQAHPRDLAARARARGHWTVEA-IHHVRDRVL 291 Query: 318 NEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 +ED R NA + + A++ LR + + + Sbjct: 292 DEDRHTARTANAPLAWAIARDTAISALRLTGHRSIAKALRTTARQPER 339 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. Identities = 63/331 (19%), Positives = 114/331 (34%), Gaps = 42/331 (12%) Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQE-------CHEITDGEIIAIDGKTIR 114 G P D T+ R+++ E+ +++ +++ +++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 GSFDKGKRKGAIHMVSAFSNENG------------------VVLGQVKTEAKSNEITAIP 156 D K KGA SA+ E +GQ +K E TA Sbjct: 153 SRTDGEKVKGAQQ--SAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFR 210 Query: 157 ELL----NLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV 212 LL L + ++T DA C ++ A + Y+ +K NQ LH + Sbjct: 211 RLLPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHDIARDYGQY 270 Query: 213 NVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKE 272 ++ + T E G R +V E + + ++ E Sbjct: 271 DLGTPL----ARTAERYRGHTIVRELYARDVAGNPAAAIEAAQQL--WYVCQTTTDRRGE 324 Query: 273 DKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAA-- 330 + E I + + + +R HW IE+ HW +DV + ED + + A Sbjct: 325 IVAVEQRYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASRASI 384 Query: 331 EIISGIKKMALNLLRDCKDIKGEEEKKEGCV 361 E +S ++ + N + +K+G Sbjct: 385 ETVSWLRLIGYN---AVSAWRTLAPRKDGRP 412 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 54/189 (28%), Positives = 90/189 (47%), Gaps = 12/189 (6%) Query: 155 IPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNV 214 + +L +KK + T+DA+ CQK I K+ Y++ VK NQ L A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDT----- 56 Query: 215 FSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDK 274 N +++S + HG + + T + +W GL++ S R++ Sbjct: 57 AKNSPLNAWSWTQKGHGHESHCRLKIWEATESM----KMQWAGLERFI---SIRRQGFRH 109 Query: 275 SAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIIS 334 + S Y+I+S+ + + A IR H IE++LHW DV +NED IR + A I+ Sbjct: 110 HKKFDSTTYHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIKKMALNL 343 ++ +A NL Sbjct: 170 ILRNIAFNL 178 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 131 bits (329), Expect = 4e-29, Method: Composition-based stats. Identities = 58/146 (39%), Positives = 76/146 (52%), Gaps = 7/146 (4%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN---YKGDSFSTQEISH 230 MGCQK+IA I +++ADY+ AVK NQ LH A ++ F +N Y D T SH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 GR E+R V L D W+GL+ + + S R KE + RYYISS Sbjct: 61 GRIESRRCWVG-YDALPLTDDSQNWEGLQTIVMVESERTLKEKT---TIEHRYYISSTMA 116 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVK 316 A ++ R HW IE+SLHW LD+ Sbjct: 117 TAAYLLNSSREHWGIENSLHWRLDIA 142 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 58/167 (34%), Positives = 84/167 (50%), Gaps = 13/167 (7%) Query: 128 MVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDK 187 MVS ++ N +VLGQVK SNEITAIPELL +L L ++ I A+ C KDI I + Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 KADYLLAVKGNQGKLHHAFEEKFPVNV---FSNYKGDSFSTQEISHGRKETRLHIVSNVT 244 ADY++ +K NQG L+ + E+ F + F + ++ +E HG E R Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIR-------N 113 Query: 245 RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD 291 D + W LK + + Q + + E RY+ISS D + Sbjct: 114 FGFQLDPDSVWSNLKSVGMVEPIGQVDDKTTVET---RYFISSLDSN 157 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 75/189 (39%), Gaps = 6/189 (3%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L+ ++ PD R + V+ +L + V +++ + +++E F L + + Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 66 -NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI--IAIDGKTIRGSFDK--G 120 P D +D A +W G++ + DGKT+RGS + G Sbjct: 73 LKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTSG 132 Query: 121 KRKGAIHMVSAFSNENGVVLGQ-VKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKD 179 I V+ +S GV + Q + +E + +LL L L+ LI DA+ Q+ Sbjct: 133 GGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQA 192 Query: 180 IASKIKDKK 188 + + Sbjct: 193 FFGSSQSRG 201 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 129 bits (323), Expect = 2e-28, Method: Composition-based stats. Identities = 55/195 (28%), Positives = 78/195 (40%), Gaps = 10/195 (5%) Query: 186 DKKADYLLAV--KGNQGKLHHAFEEKFPVNVFSNYKGDS---FSTQEISHGRKETRLHIV 240 D+ + L +G L HA + F Y T + HGR ETR Sbjct: 91 DRGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTA 150 Query: 241 S-NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAI 299 + ++ L + WK + + S R RY ISS D++ HA+ Sbjct: 151 AGDLDWLATLGLKERWKKITSVAGIDSSRVIGSKT---ETDRRYVISSLPADSERILHAV 207 Query: 300 RAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE- 358 R HW IE+ LHW LDV EDA IR NAA S +++ A+NL R KK Sbjct: 208 RMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFRADHSRAMGLPKKRK 267 Query: 359 GCVKHRERSSEVHFL 373 + + + + L Sbjct: 268 AAAWNPDYLANILHL 282 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 123 bits (308), Expect = 1e-26, Method: Composition-based stats. Identities = 40/122 (32%), Positives = 59/122 (48%), Gaps = 9/122 (7%) Query: 253 FEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWV 312 W+ L+ + + S R +K + RYYISS A R HW IE SLHW Sbjct: 5 ENWEELQTIVMVESER---AEKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHWC 61 Query: 313 LDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVHF 372 LD+ ED SRI +GN AE + ++ +ALNLL+ K + K ++ + F Sbjct: 62 LDIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRL------KAGGMEF 115 Query: 373 LY 374 ++ Sbjct: 116 IF 117 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 37/118 (31%), Positives = 66/118 (55%), Gaps = 3/118 (2%) Query: 247 NFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIE 306 D + W LK + + S Q + + E RY+ISS D + ++ A+++R+HW IE Sbjct: 7 FQLDPDSVWSNLKSVGMVESIGQVDDKTTVET---RYFISSLDSNGEQLANSVRSHWAIE 63 Query: 307 HSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHR 364 +SLHWVLDV + +D +IR+ NA + + ++++A++LL +K + K+ Sbjct: 64 NSLHWVLDVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVD 121 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 122 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 42/188 (22%), Positives = 76/188 (40%), Gaps = 13/188 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L +S PD R ++ L +L L + A ++ D + +E F L G Sbjct: 2 TLRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG-- 58 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 P I ++ +D + Q E GE++ +DGK +RGS + Sbjct: 59 LRKAPGHTAITLLLHRLDPEKLQAALG---QVFPEADLGEVLVVDGKHLRGSGK--GKSP 113 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL---YLKKNLITIDAMGCQKDIA 181 + +V + L Q + E + E A ELL+ L L+ ++ DA ++A Sbjct: 114 QVKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVA 171 Query: 182 SKIKDKKA 189 ++++ K Sbjct: 172 ARVRKKGG 179 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 122 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 43/166 (25%), Positives = 67/166 (40%), Gaps = 5/166 (3%) Query: 207 EEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALS 266 + K +S+ T+E HGRKE R V +W +K + + Sbjct: 1 MQFQDYWALPEDKQESYITEEKGHGRKEVREVYVLPAAFSEAL--RQKWCLVKSIVAVVR 58 Query: 267 FRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRR 326 R K S E YYI + + + + A R HW IE+ HW LDV ED RI Sbjct: 59 DRSVKGKGSYETS---YYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRIYA 115 Query: 327 GNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVHF 372 G++A ++ ++ NL R + K +++ +V F Sbjct: 116 GDSALNMACCRRFVQNLFRKSEGNLSVPRKMNQAAWNKDYREKVLF 161 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 122 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 52/167 (31%), Positives = 80/167 (47%), Gaps = 9/167 (5%) Query: 3 IQSLLDYISVTPDIRQQ--GKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++ L ++ S PD R+ G ++HKL ++ L + ++ EI +FG L+ +K Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDG-----EIIAIDGKTIRG 115 NGIP + T+ R+ ID A + + H+ G EI+ IDGK RG Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL 162 + K R I VSA S + L E KSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 118 bits (294), Expect = 4e-25, Method: Composition-based stats. Identities = 57/157 (36%), Positives = 79/157 (50%), Gaps = 5/157 (3%) Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 H+VSA++ E+GV LG V TE KSNEITAI LL L KK ++TIDAMGCQKDIA I Sbjct: 3 PRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARNI 62 Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVN---VFSNYKGDSFSTQEISHGRKETRLHIVS 241 D++LAV+ NQ KL A + + + T HGR++ R + + Sbjct: 63 VAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWGA 122 Query: 242 NVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEG 278 V E+ W +K + A+ + + Sbjct: 123 QVPPDFAAKGEWPW--IKAIGTAVRITTHPDGTQTDE 157 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 117 bits (292), Expect = 8e-25, Method: Composition-based stats. Identities = 41/188 (21%), Positives = 77/188 (40%), Gaps = 13/188 (6%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 +L + +S PD R ++ L +L L + A ++ D + +E F L G Sbjct: 2 TLREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG-- 58 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 P + ++ +D ++ +Q GE++ +DGK ++GS + Sbjct: 59 LRKPPGHTILTLLLHRLDPEKLQEAL---LQVFPGADLGEVLVVDGKHLKGSGK--GKSP 113 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL---YLKKNLITIDAMGCQKDIA 181 + +V + L Q K E + + A+ ELL+ L LK ++ DA ++A Sbjct: 114 QVRLVEVLALHLLTTLAQAKAEGRED--QALLELLDRLGAEGLKGKVVVGDAGYLYPELA 171 Query: 182 SKIKDKKA 189 K+ K Sbjct: 172 GKVVQKGG 179 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 111 bits (276), Expect = 5e-23, Method: Composition-based stats. Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 5/133 (3%) Query: 224 STQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRY 283 +T E HGR E R + + + +WKGLK+ R K K+ E V Sbjct: 2 TTSEKGHGRIEKRTLETTPIVTVG-----QKWKGLKQGLRITRERAVKGKKTVEVVYGIT 56 Query: 284 YISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNL 343 +S +A +R HW IE+ LH+V DV + EDA R+R+G A ++++ ++ + ++L Sbjct: 57 SLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVVVHL 116 Query: 344 LRDCKDIKGEEEK 356 L + E Sbjct: 117 LASVEAKSRPEAI 129 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 110 bits (275), Expect = 8e-23, Method: Composition-based stats. Identities = 46/211 (21%), Positives = 94/211 (44%), Gaps = 14/211 (6%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 SI S L Y++ PD R+ K +H+ +L + + AV +G + + ++ +L Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 62 GDFDNG-----IPVDDTIARVVSNI--DSLAFEKMFIEWMQECHEITDGEI-----IAID 109 +P T+ R+ ++ D +K + W +E + E +A+D Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVD 124 Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLG-QVKTEAKSNEITAIPELLNLLYLKKNL 168 GK +RG+ + + A+ +SA G+ LG Q + ++ + + L + + Sbjct: 125 GKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGV-DWV 183 Query: 169 ITIDAMGCQKDIASKIKDKKADYLLAVKGNQ 199 +T DA C +++A+ + ++K A KG + Sbjct: 184 LTGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 48/187 (25%), Positives = 76/187 (40%), Gaps = 15/187 (8%) Query: 175 GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISH-GRK 233 K + D L+ +KGN KL A S + T ++ R Sbjct: 3 STFKKTVETVLATGNDLLVQLKGNHPKLLAAVRTL----CQSRAHAEQSYTVDLGRRNRI 58 Query: 234 ETRLHIVSNVTRLNFC-----DFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSK 288 E R + + + F+ +G +++ V + ++ E + S YY+++ Sbjct: 59 EQRTVRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPRQE---SPAYYLATC 115 Query: 289 DMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCK 348 A A IR HW IE+ LH VLDV + ED+SRIRR + + ++ ALNLLR Sbjct: 116 TASAATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRN--PGVFALLRHFALNLLRHNG 173 Query: 349 DIKGEEE 355 Sbjct: 174 QANIRSA 180 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 108 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 60/144 (41%), Gaps = 5/144 (3%) Query: 228 ISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISS 287 HGR E R + ++ W G++++ R+ + E V +S Sbjct: 3 KGHGRVERRSITTTT----WLNEYLTRWPGVQQVFRLERQRRADGKTTVEVVYGISSLSP 58 Query: 288 KDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDC 347 R+HW IE SLH+V DV ++ED R+RRG A +++ ++ +A+ LLR Sbjct: 59 VAAPPDTVLGYTRSHWGIE-SLHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYLLRRL 117 Query: 348 KDIKGEEEKKEGCVKHRERSSEVH 371 + + + ++ Sbjct: 118 GAGTIAAAVRTVVARPELALAALN 141 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 108 bits (268), Expect = 5e-22, Method: Composition-based stats. Identities = 35/99 (35%), Positives = 55/99 (55%), Gaps = 1/99 (1%) Query: 3 IQSLL-DYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 ++ L + S PD R KH I+ L + +V+AGA + EIEDF ++WLK Y Sbjct: 1 MEGLFVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTY 60 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI 100 + NGIP DT +RV S I+ +F+ F+ W++ ++ Sbjct: 61 FNLPNGIPSHDTFSRVFSAINPASFQDSFLIWLKAINDA 99 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 104 bits (259), Expect = 5e-21, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 69/214 (32%), Gaps = 34/214 (15%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCA-VIAGADEWQEIEDFGHERLEWLK 59 +S + + + + D R + + + +C+ AG + + + Sbjct: 19 VSREGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAER 78 Query: 60 KYGDFDN------GIPVDDTIARVVSNIDSLAF------------------EKMFIEWMQ 95 +P TI + +D + Sbjct: 79 ARLRLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEP 138 Query: 96 ECHEITDG-------EIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAK 148 + G +A+DGKT R + K +H+V S+ +G +L QV+ EAK Sbjct: 139 SAAPVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAK 196 Query: 149 SNEITAIPELLNLLYLKKNLITIDAMGCQKDIAS 182 +NE LL L L L+T DA+ + Sbjct: 197 TNETAVFRRLLRPLDLTNVLVTADALHTVRANLD 230 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 47/210 (22%), Positives = 79/210 (37%), Gaps = 18/210 (8%) Query: 95 QECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITA 154 IA+DGK ++ S H++SA ++ V L +V+ AK+NE T Sbjct: 123 GTSATAGPRRAIAVDGKALKAS--ARLTSPRRHLLSAVTHGRVVTLARVEVGAKTNETTH 180 Query: 155 IPELLNLLYLKKNLITIDAMG-CQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN 213 LL L L ++T DA+ + +I+ ++ KKA Y+ +K NQ HH + Sbjct: 181 FKPLLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLATLPWRD 240 Query: 214 VFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED 273 + + E+ HGR+E+ + + R+ + Sbjct: 241 IPVQ-----HAASEVGHGRRESSSIKTCAIPDELGGIAYPHAR-----LAIRVHRRCQPT 290 Query: 274 KSAEGVSIRYYISSKDMDAKEFAHAIRAHW 303 E Y ++S D A R W Sbjct: 291 GKRESRESVYAVTSLDAH-----QATRPIW 315 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 61/149 (40%), Gaps = 9/149 (6%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++SL Y + PD + +H+L +L L A + G ++ + ++ + ++ Sbjct: 8 MRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRRF 67 Query: 63 DFDNG-----IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 +P I + + A ++ W ++ E +A+DGK ++G Sbjct: 68 GCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAWQAA--QLNSEEALAMDGKIMKGGV 125 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVKTE 146 D + H+VS +E+ + Q K+ Sbjct: 126 DHTGAQT--HIVSLIGHESKHCVAQKKSA 152 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. Identities = 41/172 (23%), Positives = 69/172 (40%), Gaps = 6/172 (3%) Query: 185 KDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNV- 243 L+ +K NQ LH A E + F + + + + R E R V ++ Sbjct: 2 IATGNHLLVQLKRNQPLLHDAMVEYTRGHPFVD---EHHTHEIGRRNRIEKRAVHVWHLH 58 Query: 244 TRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHW 303 L + ++ L ++ + YY+ + A F+ AIR HW Sbjct: 59 PSLGSAPWYDHFRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRNHW 118 Query: 304 LIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 +E+ H+V D + EDASRIRR + ++ ALNL+R + + Sbjct: 119 RVENRAHYVRDTRFQEDASRIRRN--PCTFALLRSFALNLMRFNRVENISQG 168 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 101 bits (250), Expect = 5e-20, Method: Composition-based stats. Identities = 53/364 (14%), Positives = 98/364 (26%), Gaps = 56/364 (15%) Query: 8 DYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG 67 + PD R L +L + A F L+ ++ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLA---FQRRTLDHNLRHVFGLTG 81 Query: 68 IPVDDTIARVVSNIDSLAFEKMFI--------EWMQECHEITDGEIIAIDG--------- 110 P D + V+ ++D +F + + + + ++A+DG Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 111 ---KTIRGSFDKGKRKGAIHMVSA--FSNENGVVLG------QVKTEAKSN--EITAIPE 157 + G M+ A + VL Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNLL----YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVN 213 L L+ DA ++ + +LL VK A + Sbjct: 202 WLGRFREEHPDLAVLVVEDARSSNAPHVRDLQKARCHFLLGVK-------AADHAHLFAH 254 Query: 214 VFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKED 273 V + +F E + R R + + L + D Sbjct: 255 VCARQDQHAFEVVEDADPRTGLRRSYLWIADLPLNESNDDVRVNFVHLV------ELDPD 308 Query: 274 KSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRR-GNAAEI 332 + + ++ + ++ A A RA W IE+ L N+ G+ Sbjct: 309 GTPREWTWVADMAVTGANVRQLARAGRARWRIENETFNTLK---NQGYHFAHNFGHGDNN 365 Query: 333 ISGI 336 +S + Sbjct: 366 LSVV 369 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 101 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 63/147 (42%), Gaps = 9/147 (6%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 ++SL DY D R+ +H++S +L + A + G ++ I + ++ + ++ Sbjct: 215 MESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQRF 274 Query: 63 DFD-----NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 IP I V+ D + + + ++ + + +A DGKT++ + Sbjct: 275 RCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNED--QGLEDTCLAFDGKTMKNAI 332 Query: 118 DKGKRKGAIHMVSAFSNENGVVLGQVK 144 D+ R+ H+ S +E+ Q K Sbjct: 333 DENARQT--HIASVVGHESKTTHTQKK 357 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. Identities = 32/129 (24%), Positives = 61/129 (47%), Gaps = 6/129 (4%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 +SL ++ PD R ++ L +IL + VCAV+AGA + I D+ ++ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE------IIAIDGKTIRGSF 117 F + +P T+ R++ ID+ ++ W++ +IA+DGK +RG+ Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKGKRKGAI 126 + A+ Sbjct: 149 LRAAGPSAL 157 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 7/124 (5%) Query: 232 RKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQ--KKEDKSAEGVSIRYYISSKD 289 R ET+ VS+ ++ L+++ + KK E +SS Sbjct: 1 RIETQTIRVSS-----LLKGYSDFPHLEQVFRIDRVTRFKKKGKTRKETALGVTSLSSGQ 55 Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 +E +R HW IE+ LHW+ D ED R GN A +++ ++ M ++LLR Sbjct: 56 ASPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGAHVMATLRNMTISLLRVAGS 115 Query: 350 IKGE 353 Sbjct: 116 KSIA 119 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 99.6 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 37/113 (32%), Positives = 53/113 (46%), Gaps = 8/113 (7%) Query: 262 CVALSFRQKKEDKSAEGVSIRYYISSKDMD-AKEFAHAIRAHWLIEHSLHWVLDVKMNED 320 S R +RYY++S D ++ A AIR HW I ++LHW LDV ED Sbjct: 1 VRIKSERTIV-AIGEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFRED 59 Query: 321 ASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVHFL 373 S+ + NAA S KMAL +L++ K KG K + + ++L Sbjct: 60 YSK-KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRL-----KAGWDENYL 106 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 99.6 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 31/102 (30%), Positives = 58/102 (56%), Gaps = 2/102 (1%) Query: 248 FCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEH 307 + + +++W GLK + S ++ E R+YISS D++A++ ++R HW +E Sbjct: 7 WLNNKYQWVGLKSIIKVTSDV-HEKTTGKETTETRWYISSLDLNAEQALSSVRNHWQVE- 64 Query: 308 SLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 S+HWVL++ ED SR R+G + ++K+A+ L + + Sbjct: 65 SMHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 99.2 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 31/143 (21%), Positives = 53/143 (37%), Gaps = 7/143 (4%) Query: 222 SFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSI 281 + S GR+E R V + EW+ ++ + + ++ Sbjct: 3 EHTHSIQSRGREEHRCIQVYEPVGIAL----QEWEAIRSVLCVQRWGTRQGKAYHNTA-- 56 Query: 282 RYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMAL 341 YYISS + +R HW IE+ LHW DV ED R+ A S ++ + + Sbjct: 57 -YYISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVI 115 Query: 342 NLLRDCKDIKGEEEKKEGCVKHR 364 N+LR + + + Sbjct: 116 NILRLNGYQSLKTAMTKLANRVD 138 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 97.2 bits (240), Expect = 8e-19, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 76/202 (37%), Gaps = 51/202 (25%) Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF---PVNVFSNYKGDSFSTQEISH 230 MGCQK+IA I +KADY+LA+KG+ L E + F+ D +T + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM 290 GR ETR V + ++ + +++W GLK + S E + E Sbjct: 61 GRIETRRCQQVLVNK-SWLNNKYQWVGLKSIIKVTS--DVHEKTTTE------------- 104 Query: 291 DAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDI 350 SRIR+G + ++K+A+ L + + Sbjct: 105 -------------------------------SRIRKGRGPLAFNVMRKIAMTLFKQEQTK 133 Query: 351 KGEE-EKKEGCVKHRERSSEVH 371 + KK+ E S + Sbjct: 134 RASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 96.5 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 37/83 (44%), Positives = 52/83 (62%) Query: 279 VSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKK 338 + RYYISS + A+EFA +RAHW IE+ LHWVLDV + ED I RG+AA+ ++ + Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 MALNLLRDCKDIKGEEEKKEGCV 361 +ALN +R K I +K+ Sbjct: 61 VALNQIRREKTIDASVNRKQKMA 83 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 51/128 (39%), Gaps = 6/128 (4%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 +++L Y + PD R+ +H+L + LT A + G ++ + ++ + ++ Sbjct: 60 MRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQRF 119 Query: 63 DFDNG-----IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRGSF 117 +P I + + A ++ W +D E +A+DGK ++G Sbjct: 120 GCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGGV 178 Query: 118 DKGKRKGA 125 D + Sbjct: 179 DHTGAQTQ 186 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 29/135 (21%), Positives = 57/135 (42%), Gaps = 3/135 (2%) Query: 238 HIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH 297 + + + + GLK + + + + R+ ISS D+ + + Sbjct: 12 IHLRTLIDKKWLAKAYRRSGLKSIIKVHTQV-HDKSTGKDTAETRWNISSLDLHVVQALN 70 Query: 298 AIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK-GEEEK 356 A+R+HW +E S+HW+LD+ D SRI R + + ++K+A+ L + K Sbjct: 71 AVRSHWQVE-SIHWMLDMTFRVDESRICRKQGPHVFNVMRKIAMTLFKQDTTKLVSMARK 129 Query: 357 KEGCVKHRERSSEVH 371 K+ + S + Sbjct: 130 KKMAGLDDDYRSNLL 144 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 51/131 (38%), Gaps = 2/131 (1%) Query: 224 STQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRY 283 +T HGR+E R V +V+ ++ + ++ + K + Sbjct: 6 TTDRGRHGRQEHRWVEVFDVSGRLGPTWDGLIAAVARVTRLTWHKDTKSGLWHKTQETAL 65 Query: 284 YISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNL 343 Y ++ A AIR HW +E H+V DV ED SRIR + ++ ALN+ Sbjct: 66 YACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIRTK--PGHFARLRSFALNI 123 Query: 344 LRDCKDIKGEE 354 LR Sbjct: 124 LRANGTNNISR 134 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 41/166 (24%), Positives = 65/166 (39%), Gaps = 14/166 (8%) Query: 195 VKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFE 254 +K NQ L ++ D+ ++ R+E R V V E Sbjct: 1 MKANQSNLFETACAI----AANDAPADTAFSRNKGRSRQEDRTVEVFPVGDALAGT---E 53 Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIR-----YYISSKDMDAKEFAHAIRAHWLIEHSL 309 W+ K + ++ R + R Y S+ + A +A AIR HW IE+ Sbjct: 54 WQPFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRN 113 Query: 310 HWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 H+V DV +ED SRIR I++ + ALN++R + Sbjct: 114 HYVRDVSCDEDKSRIRDN--PGIMARARSFALNIMRKNGIANVAQA 157 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 94.9 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 41/94 (43%), Positives = 60/94 (63%), Gaps = 1/94 (1%) Query: 279 VSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKK 338 +++RYYISS D A++F AIR HW +E++L+W LDV MNED +IRRGNAAE SGI+ Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 MALNLLRDCKDIKGEEEKK-EGCVKHRERSSEVH 371 +A+N+L + + K +K + + V Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVL 94 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 93.8 bits (231), Expect = 8e-18, Method: Composition-based stats. Identities = 48/228 (21%), Positives = 89/228 (39%), Gaps = 37/228 (16%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 L + I+ D R + VK +S I F+ + + + +E + + KK Sbjct: 18 HLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKALPK 73 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKM--------FIEWMQECHEITDGEIIAIDG------ 110 +P DTI RV+SN D ++ + I +++AIDG Sbjct: 74 KTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELFES 133 Query: 111 --KTIRGSFDKGKRKGAIH------MVSAFSNENGVVLGQVKTEAKSN-------EITAI 155 K + ++ G H + S +++ ++LGQ E K + EITA Sbjct: 134 TKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEITAG 193 Query: 156 PELLNLLYLK----KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQ 199 L+ L+ + ++I DA+ C+ ++ D ++ VK + Sbjct: 194 KRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKDER 241 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 93.8 bits (231), Expect = 8e-18, Method: Composition-based stats. Identities = 30/117 (25%), Positives = 50/117 (42%), Gaps = 6/117 (5%) Query: 4 QSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGD 63 +LL S D R+ + L+ +L TV A++AGA +++++ F L+ L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 FD-NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITD-----GEIIAIDGKTIR 114 P T+ ++ ID+ E+ F + + IAIDGKT Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWC 119 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 93.8 bits (231), Expect = 9e-18, Method: Composition-based stats. Identities = 42/88 (47%), Positives = 57/88 (64%) Query: 271 KEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAA 330 E K ++ RYY S D+ A++FA A R HW +E+ LHW LDV MN+D +IRRGNAA Sbjct: 18 TEQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAA 77 Query: 331 EIISGIKKMALNLLRDCKDIKGEEEKKE 358 E+ SGI+K+A+N+L K +K K Sbjct: 78 ELFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 93.4 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 36/182 (19%), Positives = 66/182 (36%), Gaps = 15/182 (8%) Query: 195 VKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFE 254 +K NQ + + + HGR+E+R + Sbjct: 2 IKRNQPTTYRQLAALPWPDSAVQ-----HTASSAGHGRRESRSIKTCGIADELGGIAFPH 56 Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD---AKEFAHAIRAHWLIEHSLHW 311 + ++ R++K+ E Y ++S D E A A+R HW +E H Sbjct: 57 GRLALRV-----HRRRKQTGGCESRETVYAVTSLDAHETTPAELAAAVRGHWTVEALRH- 110 Query: 312 VLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSEVH 371 V DV E+AS + G A ++ + +A+ LL+ I + + ER+ + Sbjct: 111 VRDVTYAEEASTLHTGTAPRAMATFRNLAVGLLKTLGAINIAKTTR-AIRDQPERALPLL 169 Query: 372 FL 373 + Sbjct: 170 GI 171 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 34/146 (23%), Positives = 58/146 (39%), Gaps = 5/146 (3%) Query: 227 EISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYIS 286 + HGR R + + G+K Q + + YYI+ Sbjct: 34 DEGHGRLVRRRYFAFPLPEEL---HNHALSGIKSCIAVERIVQ-EGKGEPKTSHFSYYIT 89 Query: 287 SKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRD 346 + + A +R HW IE S HW+LDV N+D + N+AE + IK++ LNL++ Sbjct: 90 NHPASDPKLADYVRQHWEIE-SYHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLVKA 148 Query: 347 CKDIKGEEEKKEGCVKHRERSSEVHF 372 ++ K + ++ F Sbjct: 149 KDWAGKKKSVKSEADEWVKKLISSLF 174 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 92.2 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 36/84 (42%), Positives = 52/84 (61%), Gaps = 1/84 (1%) Query: 124 GAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASK 183 A+H++SAF + GVVL Q+ KSNEI A ELL L + +T DAM Q++ A Sbjct: 7 KAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARF 66 Query: 184 -IKDKKADYLLAVKGNQGKLHHAF 206 ++DK+AD+++ VK NQ +L A Sbjct: 67 AVEDKRADFVMTVKDNQPELREAL 90 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 91.5 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 39/85 (45%) Query: 7 LDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDN 66 L + D R L +I+ + + AV+ GAD + IE +G + WL+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVDDTIARVVSNIDSLAFEKMFI 91 GIP DT RV+ ++ + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 89.5 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 40/86 (46%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 ++++S+ Y D R KH+ I+ + VC V+ G D I + R EWL+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFDNGIPVDDTIARVVSNIDSLAF 86 + + NG+P D I + + AF Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 89.5 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 50/101 (49%) Query: 259 KKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMN 318 K+ R + + E +S++ DA + +R HW IE+ LH+V DV + Sbjct: 2 KQGFQLTRERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLG 61 Query: 319 EDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEG 359 ED R+R G+A ++++ ++ ++L R+ K + E + Sbjct: 62 EDVCRVRMGHAPQVLAALRNAVVHLWREVKAVSCPEAIERL 102 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 89.2 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 25/99 (25%), Positives = 40/99 (40%), Gaps = 2/99 (2%) Query: 258 LKKLCVALSFRQK--KEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDV 315 +K++ K K+ E V ++ + K R HW IE+ LH+V D Sbjct: 26 VKQVFCIHRIFTKVKTGKKTEEIVYGITSLTQQKASPKTILKFSRGHWSIENGLHYVRDT 85 Query: 316 KMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 ED S+IR NA ++ +K + + L E Sbjct: 86 AFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPNITE 124 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 89.2 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 27/118 (22%), Positives = 51/118 (43%), Gaps = 9/118 (7%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 L ++++ PD R +H L +IL + + A+ +GA+ + + ++ + L + Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 DNG-------IPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRG 115 P T+ RV+ I LA E+ W+ +A+DGKT+ G Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSL--GLSPAALAVDGKTLAG 130 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 89.2 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 49/128 (38%), Positives = 73/128 (57%), Gaps = 1/128 (0%) Query: 175 GCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKE 234 + ++ KI +K DYLLAVKGNQG L AF++ F ++ +N + ++T+E S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 TRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKE 294 +R VS+ + D EW GLK + +S +KE + +RYYISSK ++A+E Sbjct: 71 SRAAFVSHDLSV-LGDISDEWPGLKSMAFVVSMNSEKEVAEEADIYVRYYISSKQLNAEE 129 Query: 295 FAHAIRAH 302 A R H Sbjct: 130 LLTASRLH 137 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 88.8 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 36/108 (33%), Positives = 60/108 (55%), Gaps = 2/108 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 + L + +T D R + KH L I+ L + AV++G++ W++IE+FGH +L+WL++Y F Sbjct: 6 TFLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPF 65 Query: 65 DNGIPVDDTIARVVSNI--DSLAFEKMFIEWMQECHEITDGEIIAIDG 110 GIP DTIARV+ + D K+ ++ + G + G Sbjct: 66 KAGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQG 113 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 88.0 bits (216), Expect = 4e-16, Method: Composition-based stats. Identities = 34/121 (28%), Positives = 58/121 (47%), Gaps = 4/121 (3%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 + L + D R + KH L I+ L + AV++G++ W+ IE+FGH +L+WL ++ F Sbjct: 6 TFLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPF 65 Query: 65 DNGIPVDDTIARVVSNI--DSLAFEKMFIEWMQECHEITDGEIIAIDG--KTIRGSFDKG 120 GIP DTIARV+ + D K+ ++ + G + G + + Sbjct: 66 KAGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQRE 125 Query: 121 K 121 Sbjct: 126 G 126 Score = 56.8 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 37/83 (44%), Gaps = 3/83 (3%) Query: 178 KDIASKIKDKKADYLLAVKGNQGKLHHAFEEKF---PVNVFSNYKGDSFSTQEISHGRKE 234 K+IA I +KADY+LA+KG+ L E + F+ D +T + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 TRLHIVSNVTRLNFCDFEFEWKG 257 TR V + + + G Sbjct: 147 TRRCQQVLVNKSWLNNKYRKRPG 169 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 88.0 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 42/143 (29%), Positives = 62/143 (43%), Gaps = 5/143 (3%) Query: 161 LLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKG 220 + LK +L+T+DAMGCQ+ IA ++++ AD +L++KGNQGK A F + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEG 278 E SHGR R V +T W ++ L V RQ ++ Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLT--PETKHSGSWPDIQALLVTEKIRQAHYSETV-T 117 Query: 279 VSIRYYISSKDMDAKEFAHAIRA 301 RYY+S + H A Sbjct: 118 SDFRYYLSRCQEARPDIGHTTHA 140 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 35/94 (37%), Positives = 53/94 (56%) Query: 273 DKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEI 332 E +RYYI SK + + FA A+R HW IE+SLHW LDV E SRIR+G+A Sbjct: 13 QNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQSRIRKGHADIN 72 Query: 333 ISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRER 366 S +++ +L+LL++ K + + K ++ Sbjct: 73 FSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDK 106 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 33/131 (25%), Positives = 53/131 (40%), Gaps = 2/131 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF- 64 L Y+ PD R+ + L+ +L ++ AV++GA +++I+ F E L Sbjct: 3 LKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQLH 62 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC-HEITDGEIIAIDGKTIRGSFDKGKRK 123 PV +I + +D+ A E F E IA+DGKT+R + R Sbjct: 63 WKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSRT 122 Query: 124 GAIHMVSAFSN 134 SA Sbjct: 123 ARPLRYSAHWP 133 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 36/132 (27%), Positives = 58/132 (43%), Gaps = 6/132 (4%) Query: 243 VTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAH 302 RL G+K + + K + A RYY++S + + + +R H Sbjct: 7 AYRLPKTINTGSLVGIKSIIATETISSKTNET-AISAEWRYYVTSHETEKSDLHLYVRNH 65 Query: 303 WLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 W IE+ LHW LDV +N+DA + R A S IK+M L+L++ K KK Sbjct: 66 WSIENELHWHLDVHLNDDADKKRDDTTAINFSSIKRMLLSLVK----TKLPPGKKRSVRS 121 Query: 363 H-RERSSEVHFL 373 ++ + +L Sbjct: 122 RLKQVGWDTEYL 133 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 85.7 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 43/94 (45%) Query: 272 EDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAE 331 +AE V + + + A +AHW IE+ LHWV DV +ED R R GNA + Sbjct: 70 GPATAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNAPQ 129 Query: 332 IISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 +++ ++ +A+ +LR + + Sbjct: 130 VMTSLRNLAITILRLTGAKNIAKALRHHARHPER 163 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 85.3 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 33/76 (43%), Positives = 47/76 (61%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 +I+ +++ + D R G+ H L IL L +CAV++GA W +IED+GH R WL++Y Sbjct: 6 TIEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRY 65 Query: 62 GDFDNGIPVDDTIARV 77 NGIP DTI RV Sbjct: 66 LKLRNGIPGHDTIRRV 81 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 48/77 (62%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M+ +S+LD+ S D RQ +V + L I L +CA ++G +++ EI +G RLE+L++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFDNGIPVDDTIARV 77 + ++ G+P DT+ + Sbjct: 77 FLPYERGLPAHDTLKGL 93 >UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus pneumoniae RepID=B2ISL1_STRPS Length = 535 Score = 84.5 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 51/350 (14%), Positives = 113/350 (32%), Gaps = 62/350 (17%) Query: 18 QQGKVKHKLSAILFLTVCAVIAGA-DEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIAR 76 Q+ ++ S IL + ++ G ++ E L + G T++R Sbjct: 142 QRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGGQL----ASQPTLSR 197 Query: 77 VVSNIDSL----------AFEKMFIEW--MQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 +S D + F+++ + + D GK +++ R Sbjct: 198 FLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTYGKQEGVAYNAHYRAH 257 Query: 125 AIHMVSAFSNENGVVL-GQVKTEAK--SNE----ITAIPELLNLLYLKKNLITIDAMGCQ 177 H + AF + G Q++ + S E IT + E N L L +D+ Sbjct: 258 GYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQL-----LFRMDSGFAT 312 Query: 178 KDIASKIKDKKADYLLAVKGNQ--GKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKET 235 + I+ YL+ +K N +L + ++S Sbjct: 313 PKLYDLIEKTGQYYLIKLKKNTVLSRLGDLSLPCPQDEDLTILPHSAYSET--------- 363 Query: 236 RLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEF 295 W +++C +K+ + + +S+ ++S +++ Sbjct: 364 -------------LYQAGSWSHKRRVCQF--SERKEGNLFYDVISLVTNMTS--GTSQDQ 406 Query: 296 AHAIRAHWLIEHSLHWVLDVKMNE--DASRIRRGNAAEIISGIKKMALNL 343 R E+ + + + + D+S + + ++S +A NL Sbjct: 407 FQLYRGRGQAENFIKEMKEGFFGDKTDSSTLIKNEVRMMMSC---IAYNL 453 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 84.1 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 48/113 (42%), Gaps = 3/113 (2%) Query: 246 LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 + F G + +R+ + + + +Y+SS + A E IR HW + Sbjct: 1 MKAFPPLFSGNGRTRSIRLERYRELRGIVTVKT---HWYLSSIEASASELGRRIRGHWGV 57 Query: 306 EHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 E+ +H+ DV ED SRIR ++ S + ALNL R + ++ Sbjct: 58 ENQVHYPKDVTFGEDRSRIRTLPLVQVWSVARSFALNLYRSLLMANRAQAQRR 110 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 34/131 (25%), Positives = 55/131 (41%), Gaps = 13/131 (9%) Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQ----ECHEITDGEIIAIDGKTIRGSFDKGK 121 PV ++ ++ ID A F + C IAIDGKT+R SFD Sbjct: 9 RRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFS 68 Query: 122 RKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELL---------NLLYLKKNLITID 172 A +++SAF+ ++ ++L + KSNEI A L+ + + + +D Sbjct: 69 DTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVMLD 128 Query: 173 AMGCQKDIASK 183 AM I + Sbjct: 129 AMTFAPAIRNH 139 >UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK19_ACIF5 Length = 104 Score = 82.6 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 39/96 (40%) Query: 270 KKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNA 329 K + ++ + R HW IE+ H V D +ED S+IR N Sbjct: 2 KDGTLREDCAFGLTSLTKDRTTPENLLGIARGHWEIENRNHHVRDTTYHEDLSQIRTENG 61 Query: 330 AEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRE 365 +++ ++ +A+++LR + ++ R+ Sbjct: 62 PHMMATLRGLAMSILRLIGVKNIAQAGRDFAASARK 97 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 81.8 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 84/279 (30%), Gaps = 29/279 (10%) Query: 51 GHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI---TDGEIIA 107 G + L +Y +FDN P + + + + I AFE +F E+ + + +IA Sbjct: 53 GKALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFLFQEFTKSFTDNVTYNGLRLIA 112 Query: 108 IDGKTIRGSFDK------------GKRKGAIHMVSAFS-NENGVVLGQVKTEAKSNEITA 154 DG + + + K +H+ + + ++ +NE A Sbjct: 113 CDGSDLCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDAIIQPSRLANERRA 172 Query: 155 IPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKG-NQGKLHHAFEEKFPVN 213 + E+++ + D +I + ++ K YL+ VK + Sbjct: 173 MCEMIDRYNDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKDITSNGITSKLTMLPESG 232 Query: 214 VFSNY---KGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQK 270 F + T E+ K+ R+ F + R Sbjct: 233 EFDEWVNVTLTKKQTNEVKANPKKYRVIDKKTPFDYLDLHFNN--------FYEMKMRVI 284 Query: 271 KEDKSAEGVSI-RYYISSKDMDAKEFAHAIRAHWLIEHS 308 + + ++ E W IE S Sbjct: 285 RFPIPQGSYECIITNLPQDKFNSDEIKRLYAKRWGIETS 323 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 81.5 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 59/159 (37%), Gaps = 12/159 (7%) Query: 197 GNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWK 256 G+Q L+ ++ + + + EI HGR + + W Sbjct: 8 GDQKTLYRQIADQL---LGKRHIPLMATDHEIGHGR---DILWTLRAKEAPQ-HIKANWH 60 Query: 257 GLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVK 316 G + ++ + + +I+S +R W +E HW+ D + Sbjct: 61 GTSWIAEVIATGTRDRK---PFKATHRFITSLRTTPDALLRLVRERWSVESW-HWIRDTQ 116 Query: 317 MNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEE 355 ++ED R R GN A +++ ++ A+NLLR E Sbjct: 117 LHEDDHRYR-GNGAGVMAALRTAAMNLLRLTGFGSIREG 154 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 81.1 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 49/133 (36%), Gaps = 13/133 (9%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + L + ++ D R++ +H A+L + AV+ GA + I ++ + + + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFDNGI-------PVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIRG 115 P TI RV+ + + H+ + +AIDGK+ RG Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SFDKGKRKGAIHM 128 S R Sbjct: 115 SRLGSTRPPIYWP 127 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 80.3 bits (196), Expect = 9e-14, Method: Composition-based stats. Identities = 32/97 (32%), Positives = 41/97 (42%), Gaps = 5/97 (5%) Query: 172 DAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPV---NVFSNYKGDSFSTQEI 228 D +GCQK IA I +++ADYLLAVK NQ LH A F F+ Y D Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKINK 67 Query: 229 SHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVAL 265 GR E R V + W L+ + + Sbjct: 68 GPGRLEQRRCWV--GYEIPDTINSQNWAKLETIVMVE 102 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 58/189 (30%), Gaps = 11/189 (5%) Query: 45 QEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHE---IT 101 + + R G + P + I R++ ID W+ Sbjct: 224 SALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAPAPG 283 Query: 102 DGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPE---- 157 IA+DGKT+RGS + + A H+++A G+VL + K+NEIT Sbjct: 284 SRRAIAVDGKTLRGS--RTRDSAARHVLAAADQHTGIVLASTDVDTKTNEITRFTASGSH 341 Query: 158 --LLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVF 215 LL+ ++ +++ A S A G + Sbjct: 342 ADLLSSRCIRSGVVSPAASARASRSCSPAATATRSGSSASAGAAPPARTGARTAPTDHPE 401 Query: 216 SNYKGDSFS 224 Sbjct: 402 PPASPRCHR 410 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 4/108 (3%) Query: 29 ILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNG-IPVDDTIARVVSNIDSLAFE 87 +L L + AV+AG + I FG R + L F NG +P +TIA ++ +D+ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 KMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNE 135 ++ W+ + H + IA+DGK + GS + H+++A++ + Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGS--RDGAVPGTHLLAAYAPQ 107 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 77.6 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 37/77 (48%), Positives = 47/77 (61%), Gaps = 1/77 (1%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 S+ + D R KH I+FL V AVI+GA+ W EI+ FG L+WL+KY F Sbjct: 2 SVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPF 60 Query: 65 DNGIPVDDTIARVVSNI 81 + GIPVDDTIARV+ I Sbjct: 61 ECGIPVDDTIARVIKRI 77 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 24/62 (38%), Positives = 42/62 (67%) Query: 295 FAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEE 354 A+++R+HW IE+SLHWVLDV + +D RIR+ NA + + ++++A++LL +K Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 EK 356 + Sbjct: 61 KI 62 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 23/112 (20%), Positives = 46/112 (41%), Gaps = 6/112 (5%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF- 64 L Y+S PD R+ + L+ +L ++ A+++GA +++I+ F E L Sbjct: 3 LKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQLH 62 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWM-----QECHEITDGEIIAIDGK 111 P +I + +D+ A E F ++ + +I + K Sbjct: 63 RKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 48/113 (42%), Gaps = 1/113 (0%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD 314 W GL + + R + +R+ + S ++ A AIR H + WVL+ Sbjct: 7 WPGLTTVLATETLRG-GNGTDSVPAQVRHSLGSSTAPSEVLAQAIRRHGALATGEPWVLE 65 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERS 367 V E+ SR+R AA ++ ++++AL+ R + ++ + R Sbjct: 66 VSFGEERSRVRERCAARHLALLRRVALDRRRADASLTASRPAQDRGLGRRRHG 118 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 25/69 (36%), Positives = 38/69 (55%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKK 60 M +++ + + D RQ KV + L +LF+T+C VIAGA+ W EI D+ W K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFDNGIP 69 G G+P Sbjct: 72 KGILTEGVP 80 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 39/106 (36%), Gaps = 5/106 (4%) Query: 227 EISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYIS 286 + HGR ETR + W GLK R K + E V Sbjct: 2 DPGHGRIETRTVRATP-----LLTCHDRWTGLKHGFRITRTRTVKGVTTVEVVHGITSRP 56 Query: 287 SKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEI 332 + DA+ +R+HW IE+ H V DV + ED R R A Sbjct: 57 VERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAGRA 102 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 73.0 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 58/158 (36%), Gaps = 10/158 (6%) Query: 179 DIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEIS------HGR 232 ++A+++ D+ + L +G+Q L A + + + + I+ G Sbjct: 38 ELAAQVPDRISQPRLVTEGDQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTAGS 97 Query: 233 KETRLHIVSNVT-RLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMD 291 ++TR V L F + + +K E K + Y I + Sbjct: 98 RQTRALKAVTVPAGLGFPHAAQAIQLTRTSRPINKNTKKTEGKRRQRRETVYAICTLPAH 157 Query: 292 ---AKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRR 326 E A IR HW IE L WV DV + ED + R Sbjct: 158 DALPAELATWIRGHWSIEVRLRWVRDVTLGEDLHQART 195 Score = 43.7 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 12/35 (34%), Positives = 25/35 (71%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIA 39 +LL+ ++ PD+R++ V+H +A+L + VCA++ Sbjct: 60 ALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLT 94 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 26/107 (24%), Positives = 51/107 (47%), Gaps = 3/107 (2%) Query: 26 LSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLA 85 ++++L VCAV+AGA + D+ + F + +PV T+ R++ +D+ Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FEKMFIEWMQECHEITD---GEIIAIDGKTIRGSFDKGKRKGAIHMV 129 ++ +W+ + +IA+DGK +RG+ R A+ M Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWMP 107 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 74/197 (37%), Gaps = 45/197 (22%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIED------FGHERLE 56 ++ LL+ S PD R+ VKH+L+ +L + + + +E F Sbjct: 80 LKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSRPAFLQALQG 139 Query: 57 WLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI--------IAI 108 + +G DT+ARV+ I+ E+ FI ++ + IAI Sbjct: 140 LFPELETLPHG----DTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIAI 195 Query: 109 DG--KTIR-------------GSFDKGKRKGAIHMVSA-FSNENGVVLGQVKT------- 145 DG K +R + D K + I+++ A F +NG+ + + Sbjct: 196 DGTQKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTIPIMSEFLSYSED 255 Query: 146 ---EAKSN-EITAIPEL 158 E K + EI A L Sbjct: 256 DSKEVKQDCEIKAFKRL 272 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 36/81 (44%) Query: 11 SVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPV 70 PD R + V+H+ S IL + A AGA + I ++ H+ +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 DDTIARVVSNIDSLAFEKMFI 91 + T R ++ +D+ A +++ Sbjct: 109 ESTSRRFLAGVDATALDQVLG 129 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 72.2 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 20/100 (20%), Positives = 40/100 (40%), Gaps = 1/100 (1%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWL-KKYGDF 64 D S D+R+ ++ L +L V ++++G+ ++++ F E+L L + +G Sbjct: 8 FGDVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTS 67 Query: 65 DNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGE 104 P I + +D E+ F E G Sbjct: 68 WRKAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RIX6_9ACTO Length = 90 Score = 70.7 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 26/68 (38%) Query: 285 ISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLL 344 + + + R W IE+ LHWV DV E R R G + + ++ A+ Sbjct: 6 LPAAYAQPADLQQWARLEWHIENRLHWVRDVTFGEGTHRARTGTGPAVAAVLRNTAIGFH 65 Query: 345 RDCKDIKG 352 R + Sbjct: 66 RGNGETNI 73 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 70.7 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 36/76 (47%), Gaps = 1/76 (1%) Query: 299 IRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKE 358 +R H LHW LDV+ N+D SR+RRG AA ++ + LNLLR K + K Sbjct: 15 VRLPRPTRHQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKR 74 Query: 359 GCV-KHRERSSEVHFL 373 + E+ L Sbjct: 75 LLACMEDDFREELLGL 90 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 50/130 (38%), Gaps = 9/130 (6%) Query: 229 SHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSK 288 G T S + + G ++ +K ++ Y ++S Sbjct: 18 RDGEVWTYRVWASP----YLPEEMRAFPGCGQVVRMEREVVRKG-TGEVRRTVSYALTSL 72 Query: 289 ---DMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 DA+ + + W +E+ WV D ++EDA ++R G A++++ ++ ++LL Sbjct: 73 GPEVADARRLGELLLSRWEVENRSFWVRDFLLHEDACQVR-GVGAQVLAALRAFLVSLLH 131 Query: 346 DCKDIKGEEE 355 + + Sbjct: 132 RQGVREKKAA 141 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 68.4 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 44/91 (48%), Gaps = 3/91 (3%) Query: 255 WKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD 314 W+G ++ + + R +++ Y ++S AK R HW +E+ LH D Sbjct: 4 WRG-SRMALRMRRRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKRD 62 Query: 315 VKMNEDASRIRRGNAAEIISGIKKMALNLLR 345 + EDASR R+G A + ++ + LNLL Sbjct: 63 TVLGEDASRSRKGAAG--LMYLRDVILNLLH 91 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 68.0 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 1/74 (1%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 + PD R +H L++ILF+ + A++ GA+ ++ DFG + +WLK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFDNGIPVDDTIAR 76 I + Sbjct: 60 PLPYASRCWRDIRK 73 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 4/96 (4%) Query: 84 LAFEKMFIEWMQECHEITDG-EIIAIDGKTIRGSFD--KGKRKGAIHMVSAFSNENGVVL 140 AFE + ++WM + + DG + + DGKT+RGS D G I VS +S GV + Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQVKTE-AKSNEITAIPELLNLLYLKKNLITIDAMG 175 Q +S+E ++ LL+ + L L+ D +G Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQADEVG 97 >UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3536 Length = 130 Score = 64.9 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 21/67 (31%), Positives = 31/67 (46%) Query: 253 FEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWV 312 +WKGLK+ R + E V +S+ +A +R HW IE+ LH+V Sbjct: 9 QDWKGLKQGFQITRERTVNGVTTVEVVHGITSLSADRANAGALLSLLRDHWRIENQLHYV 68 Query: 313 LDVKMNE 319 DV + E Sbjct: 69 PDVTLGE 75 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 64.5 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 21/108 (19%), Positives = 40/108 (37%), Gaps = 6/108 (5%) Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIE-WMQECHEITDGEIIAIDGKTIRGSFDKGKRKG 124 G P + + ++D + + ++ +DG T+R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLL-YLKKNLITI 171 +H+ ++ GV+L QV + K+NE + L + L LIT Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLITA 134 >UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C435B Length = 133 Score = 63.7 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 42/138 (30%), Gaps = 20/138 (14%) Query: 192 LLAVKGNQGKLHHAFEEKFP---------------VNVFSNYKGDSFSTQEISHGRKETR 236 +L K NQ L E + G HGR ETR Sbjct: 1 MLTAKDNQPGLVADIEAGLGFEDAARGLAAATSPLTGPDARATGAPGHVGGPGHGRIETR 60 Query: 237 LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFA 296 + W GLK R K + E + ++ + DA+ Sbjct: 61 TVRATP-----LLTCHDRWTGLKHGSRITRARTVKGVTTVEVLHGITSLTVERADARALL 115 Query: 297 HAIRAHWLIEHSLHWVLD 314 +R+HW IE+ H V D Sbjct: 116 GLVRSHWRIENQRHDVRD 133 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 61.8 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 13/66 (19%), Positives = 29/66 (43%) Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 + A +R +W IE+ +H+ D EDA+ GN ++ + +A+ ++ Sbjct: 89 VTAAYLHTHVRGNWGIENEVHYTRDAAWREDANPTYTGNTNHALASFRNLAIGVIGLNGT 148 Query: 350 IKGEEE 355 ++ Sbjct: 149 RNIKKP 154 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 33/86 (38%), Gaps = 1/86 (1%) Query: 53 ERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKT 112 RL + + P T+ + ID A + F W+ +AIDGK Sbjct: 20 ARLGAPLDHFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCAQIAR-GRVALAIDGKV 78 Query: 113 IRGSFDKGKRKGAIHMVSAFSNENGV 138 +RG++ + A ++ + G+ Sbjct: 79 LRGAWSGDESVTAAYLHTHVRGNWGI 104 >UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQC4_9GAMM Length = 96 Score = 61.4 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 3/78 (3%) Query: 290 MDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKD 349 M ++ R HW I SLH++ D NED +IR G+ ++ + + A+ +L+ Sbjct: 1 MTPQQVLAINRGHWSIA-SLHYISDWNYNEDRGQIRTGHGPANVTRLCRFAIGVLKHFPK 59 Query: 350 --IKGEEEKKEGCVKHRE 365 E ++ + R+ Sbjct: 60 PGQYIPEMMRQLARRPRQ 77 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 61.4 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 26/60 (43%), Positives = 39/60 (65%) Query: 94 MQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEIT 153 + + + ++ DGKT+R S D+ K AIH+VSA+++ N +VLGQVKT+ KSNE Sbjct: 14 QKVYQKSLKEKSLSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltaproteobacteria RepID=A5GAF0_GEOUR Length = 439 Score = 61.0 bits (146), Expect = 7e-08, Method: Composition-based stats. Identities = 48/402 (11%), Positives = 106/402 (26%), Gaps = 53/402 (13%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L + PD R K+ L+ +L ++ R + Sbjct: 17 LRCCLEHIPDQRDGAKI--SLADVLMSGYAMFDLKDPSLLAFDE-RRCRDAANLQRIYGI 73 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIA---------IDGKTIRGS 116 + D + V+ +D F + + +A +DG GS Sbjct: 74 GKVACDTQLRTVIDPVDPAGLRPGFKTIVATLQRGKALQQLAYYEGYYLLSLDGTGSFGS 133 Query: 117 ------------FDKGKRKGAIHMVSA--FSNENGVVLG--------QVKTEAKSNEITA 154 GK+ ++ A ++ VV+ Q E A Sbjct: 134 ENLSSASCLVKNKSNGKKLYYQQVLGAALVHPDSRVVIPLAPEMIIPQDGATKNDCERNA 193 Query: 155 IPELL----NLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVK-GNQGKLHHAFEEK 209 L ++ D + ++ ++L K G+ L + Sbjct: 194 SKRFLPNFREDFPRLPVIVVEDGLSSNGPHIRDLQQHNMRFILGAKPGDHPLLFENLTDA 253 Query: 210 FPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQ 269 + +F+ + + + +++ LK + Sbjct: 254 IKKKTAT-----TFAQIDPKNPQIMHSYCFLNDTPLNQAN------PDLKVNFLVYEEHN 302 Query: 270 KKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVK--MNEDASRIRRG 327 K K+ + S + + +A R+ W IE+ L + E + + Sbjct: 303 AKTGKT-QRFSWVTDLPITEENAYILMRGGRSRWKIENETFNTLKNQGYNLEHNYGLGKE 361 Query: 328 NAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVKHRERSSE 369 + +E + +A + + + + R E Sbjct: 362 HLSENFVMLMMLAFLVDQAQQLCSPLFQAALERAGSRRSLWE 403 >UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanobacteria RepID=B2IT45_NOSP7 Length = 435 Score = 60.3 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 46/416 (11%), Positives = 119/416 (28%), Gaps = 61/416 (14%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY- 61 +Q + PD R ++++S + + + Sbjct: 11 VQYFQSILKDLPDKRTGKNKRYQMSDAALSAFSIFFTQSPSFLAHQRSMAHSKGHNNAQS 70 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEKMF---------IEWMQECHEITDGEIIAIDG-- 110 + IP D+ I ++ I+ +F + + + + +IA+DG Sbjct: 71 LFGVHQIPSDNHIRDLLDEIEPTVVFPVFTKIFKALENGKHLSKFRSFKNNLLIALDGTE 130 Query: 111 ----------KTIRGSFDKGKRKGAIHMVS---AFSNENGV-------VLGQVKTEAKSN 150 +F G + +V+ + + V V+ Q + + Sbjct: 131 YFCSNEIHCEHCSSRTFKNGTTQYFHTVVTPVIVCPSNSQVIPLIPEFVVPQDGYQKQDC 190 Query: 151 EITAIPELLNLLYLK----KNLITIDAMGCQKDIASKIKDKKADYLLAVK-GNQGKLHHA 205 E A + + I D + C + + + +K +++L + + L+ Sbjct: 191 ENAAAKRWIQKYAKQYASLGITILGDDLYCHQPLCELLLQEKLNFILVCRSKSHKTLYEW 250 Query: 206 FEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVAL 265 E + K EI R ++ + ++ L +L + Sbjct: 251 LEGMPLDTF--SVKHWKGKVYEIYTYRYVNQIPLRNSEDALL--------VNWCELAITR 300 Query: 266 SFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLH-------WVLDVKMN 318 S + I + + R+ W IE+ + + L+ Sbjct: 301 SDGTIIYKNTFATNHRITDI-----NVEAIVSDGRSRWKIENENNNTLKTKGYNLEHNFG 355 Query: 319 EDASRIRRGNAAEI-ISGIKKMALNLL-RDCKDIKGEEEKKEGCVKHRERSSEVHF 372 + + A ++ + L+++ + I+ ++ + + Sbjct: 356 HGKTHLSSLLATFNILAFLFHTLLDIIDEKYQFIRQHLPTRKTFFDDLRALTRYLY 411 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 60.3 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 1/66 (1%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 I+ L D+R+ H+L AIL + VCAVIA A+ ++I +G + WL+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFDNGI 68 Sbjct: 61 PLPCAT 66 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 58.7 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 21/61 (34%), Positives = 32/61 (52%) Query: 159 LNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNY 218 L++ L ++ + IDA+G Q IA +I + ADY+LA+K NQ A F + Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAESVDL 76 Query: 219 K 219 K Sbjct: 77 K 77 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 58.0 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 38/275 (13%), Positives = 83/275 (30%), Gaps = 31/275 (11%) Query: 56 EWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEIT---DGEIIAIDGKT 112 + L K+ DF P + S I AF +F + ++ + ++AIDG Sbjct: 21 DELLKFNDFSITTPSASAFVQARSKIKPEAFRTLFDGFNKKTFKKKLYHGYRLLAIDGSE 80 Query: 113 --------------IRGSFDKGKRKGAIHMVSAFSNENGVVLG-QVKTEAKSNEITAIPE 157 +R K A H+ +++ ++ EAK +E A + Sbjct: 81 LPIDNTIFDDETTVLRHGTLA-KTFSAYHLNASYDLMERTYDDIIIQGEAKRDEHGAFCQ 139 Query: 158 LLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSN 217 L++ +K + D + + YL+ V E + + Sbjct: 140 LVDRYDGQKAIFIADRGYESYNGFEHVVHSGHKYLIRV--------RDIESQSSITKSLG 191 Query: 218 YKGDSFSTQEISHGRKETRLHIVSNVTRLNFC---DFEFEWKGLKKLCVALSFRQKKEDK 274 D ++S + ++ + + F++ + + R + Sbjct: 192 PFPDGEFDVDVSRMLTLKQTKMIKACPDVYKFVPKNMRFDFMNKQNPWYEFNCRVVRLKI 251 Query: 275 SAEGVSIR-YYISSKDMDAKEFAHAIRAHWLIEHS 308 + +S + ++ W E S Sbjct: 252 TENTYETVITNLSRNEFSMEDICEIYNMRWGEETS 286 >UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MIZ4_ALKOO Length = 218 Score = 56.0 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 62/186 (33%), Gaps = 33/186 (17%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 + + I+ D R + VK +S I F+ + + + + + KK Sbjct: 19 IGEKINTLKDKRVKSPVK--VSTISFVVLFGFMLQIRSFNRLNHWIE--KGKFKKVVPKK 74 Query: 66 NGIPVDDTIARVVSNIDSLAFEKMFI--------EWMQECHEITDGEIIAIDG------- 110 +P D++ R +++ D + M + + ++ AIDG Sbjct: 75 TKMPCIDSVRRFLADFDLHGLKNMHSHIVKTSIKNKVFRSGTVDGLKVAAIDGVELFEST 134 Query: 111 -KTIRGSFDKGKRKGAIH------MVSAFSNENGVVLGQVKTEAK-------SNEITAIP 156 K + + H + S ++ ++LGQ E K E+T Sbjct: 135 KKCCNNCLTRVHKDEITHYFHRSVICSTVGSDPHLILGQEMLEPKRDGSNKDEGEVTGGK 194 Query: 157 ELLNLL 162 L+ L Sbjct: 195 RLIKKL 200 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 55.3 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 28/62 (45%), Positives = 36/62 (58%) Query: 55 LEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTIR 114 LK+YG F+ GI DTI +VS I + F+K FI+WM C E+ A DGKT+R Sbjct: 9 RGLLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVR 68 Query: 115 GS 116 S Sbjct: 69 RS 70 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 54.9 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 19/63 (30%), Gaps = 1/63 (1%) Query: 7 LDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDN 66 + PD R V+H+L +L L AV+ G + + Sbjct: 69 AECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAAEGPGDPTGEGCRW 127 Query: 67 GIP 69 P Sbjct: 128 PRP 130 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 54.5 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 37/87 (42%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 S SL +S PD R+ ++ L ++L L + AV+ GA I F + L++ Sbjct: 42 SRTSLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQ 101 Query: 62 GDFDNGIPVDDTIARVVSNIDSLAFEK 88 + P T+ + +N+ + Sbjct: 102 LGLASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36C2 Length = 109 Score = 54.1 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 2/71 (2%) Query: 263 VALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDAS 322 R+ + E V +S DA R HW IE+ LH+ DV + ED Sbjct: 3 RLERRRKANGKATVEVVYGITSLSRLAADAAALLGYSRRHWGIENGLHYTRDVTLGEDRC 62 Query: 323 RI--RRGNAAE 331 + R Sbjct: 63 PVGARSRPTPR 73 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 53.7 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 26/57 (45%), Positives = 37/57 (64%), Gaps = 3/57 (5%) Query: 263 VALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNE 319 + +FR +K + RYYISSK++ A++ A+ + HW IE S+HWVLDV MNE Sbjct: 1 MVENFRFVIGNK--LVLEYRYYISSKELTAEQAANTVSEHWGIE-SMHWVLDVSMNE 54 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 53.3 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 15/66 (22%), Positives = 34/66 (51%), Gaps = 4/66 (6%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEI----EDFGHERLEWLKKY 61 L+ S+ PD R ++ L ++ +T+ AV+ GAD W ++ + +G ++ +++ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDVPVGSKKYGDSCMQVVREK 61 Query: 62 GDFDNG 67 +G Sbjct: 62 CCLTSG 67 >UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillaceae RepID=C5D2E6_GEOSW Length = 437 Score = 52.6 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 48/354 (13%), Positives = 108/354 (30%), Gaps = 76/354 (21%) Query: 3 IQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDF-----GHERLEW 57 + L+D + D R Q + + IL+ + + G + + + E + Sbjct: 28 FKDLVDQLKKVKDKRHQSYITYGPETILYTILLKSVFGIKSMRSMTELFNKDECIENIRV 87 Query: 58 LKKYGDFDNGIPVDDTIARVVSNIDSLAFEKM---FIEWMQECHEITDGEIIAIDG---- 110 + + + +P DTI ++ ++ E + I+ + E + I+ Sbjct: 88 VLGLKELNE-LPHYDTINDFLAKLEPKELETIRIYLIKKLFEKRCLESFRILNKYWPIVF 146 Query: 111 --------------KTIRGSFD-----KGKRKGAIHMVSA--FSNENGVVLGQVKTEAKS 149 +R + + K H++ A + + + E +S Sbjct: 147 DGTGIHTFKEKHCEHCLRREYKDKETGETKVVYMHHVLEAKLVVGDMVLSIATEFIENES 206 Query: 150 N-------EITAIPELLNLLY-----LKKNLITIDAMGCQKDIASKIKDKKAD-YLLAVK 196 E+ A L++ L L LI D++ + +I DK Y+ K Sbjct: 207 ENVPKQDCELKAFMRLVDKLKKTFKRLPICLI-ADSLYAC-EPVFEICDKHNWKYIFRFK 264 Query: 197 GNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWK 256 ++ K + + F +I++ RL + ++ + + E+ Sbjct: 265 EDRIKTVSQEFRAIQSLETNGKSSEYFWVNDIAYN---DRLVNLVEKVKVTENEKKQEFL 321 Query: 257 GLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLH 310 + + +A+ A R W IE+ Sbjct: 322 FITNFRITER------------------------NAEILVQAGRRRWKIENEGF 351 >UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM Length = 437 Score = 52.2 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 61/431 (14%), Positives = 121/431 (28%), Gaps = 87/431 (20%) Query: 1 MSIQSLLD----YISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERL- 55 +S+ LL Y P + K LS L + + + F +++ Sbjct: 9 LSMPGLLSEIKNYFEKIPSPVVKQKDSISLSDCLMSGLAIFSLK---YPSLLQFDNDKRT 65 Query: 56 ---EWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI---------TDG 103 E K IP D + + + + + +++ D Sbjct: 66 PVVEHNLKSLYKIGIIPSDTYMRERLDELPTSELRGAYTTLIRQAQRGKVLEKFTYYNDY 125 Query: 104 EIIAIDGKTIRGSFD-----------KGKRKGAIHM---VSAFSNENGVVLG-------- 141 ++++DG S D + + H ++ + VL Sbjct: 126 YLVSMDGTGYFSSHDIHCDQCCEKHHRNGKITYHHQMLGIALVHPNHHHVLPLAPEPIIK 185 Query: 142 QVKTEAKSNEITAIPELLNLLYLK----KNLITIDAMGCQKDIASKIKDKKADYLLAVK- 196 Q E E A LL L + K +IT D + +K Y+L K Sbjct: 186 QDGVEKNDCERNAGKRLLTQLRKEYPKMKMIITEDGLASNGPHIKLLKSLNMSYILGAKP 245 Query: 197 GNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWK 256 + L + ++ + + TQ+ + R + +F Sbjct: 246 KDHTYLFDRIK--------NSSQTKFYQTQDDDGTIHKYRYVNQVPLNESHF-------- 289 Query: 257 GLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVK 316 L + K + S I + + RA W IE+ L Sbjct: 290 DLNVNFLIYQEISPKGKVTN--FSWVTDILLSEQTLEIVMKGGRARWRIENETFNTLK-- 345 Query: 317 MNEDASRIRR-GNAAEIISG--------------IKKMALNLLRDCKDIKGEEEKKEGCV 361 N+ G+ + +S ++ + N+ + ++ K+ Sbjct: 346 -NQGYHFEHNFGHGKQHLSSVFAHLMLLAFLIDQLQGLCCNIFKQA----LKKAKRPLYF 400 Query: 362 KHRERSSEVHF 372 R R+ +F Sbjct: 401 WERFRAIFFNF 411 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 20/59 (33%), Positives = 30/59 (50%) Query: 1 MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLK 59 M ++ L+++IS+ PD RQ KV+HKL IL + FG L++LK Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRGFGETHLDFLK 59 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 49.9 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 13/45 (28%), Positives = 24/45 (53%) Query: 134 NENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQK 178 G+ + Q++ +NEIT LL+ L++ +T DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=A7B831_RUMGN Length = 366 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 43/273 (15%), Positives = 89/273 (32%), Gaps = 29/273 (10%) Query: 53 ERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQEC---HEITDGEIIAID 109 + L Y F P + + + AF+ +F E+ + D +++A D Sbjct: 64 SLKKELLDYFQFSVDTPSASAFCQQRNKLLLEAFQFLFYEFNSCFSFEKKYKDYQLLACD 123 Query: 110 GKTIRGSFDKG------------KRKGAIHMVSAFS-NENGVVLGQVKTEAKSNEITAIP 156 G + + + + IH+ + F E + ++ NE A+ Sbjct: 124 GSDLNIARNPNDAGTYFQSQPTDRGFNQIHLNALFDLCEKRYIDLVIQPARLENESLAMT 183 Query: 157 ELLNLL-YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVF 215 ++++ KK + D +I + +++K YL+ VK G + N F Sbjct: 184 QMIDRYKGEKKTIFIADRGYETYNIFAHVQEKGMYYLIRVKDGGGGSMTGSFDLPDENEF 243 Query: 216 SNYKG----DSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKK 271 + + + +K + S L+ D +F + + A+S Sbjct: 244 DHDMQLILTRKQTKDVKAKPKKFKFIAKSSPFDYLDLYDKKFYTLNFRVVRFAISEDS-- 301 Query: 272 EDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWL 304 SI + +D +E W Sbjct: 302 ------YESIITNLPKEDFPVEEIKKVYAMRWH 328 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 48.7 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 69/215 (32%), Gaps = 20/215 (9%) Query: 4 QSLLDYISVT-PDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 L ++ PD R G+V+H L +L + A+ AG ++ + + G L+ Sbjct: 41 HGLTRRLAKIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLNDHD--GLRHDYALQTAV 98 Query: 63 DFDNGIPVDDTIARVVSNIDSLAFEKMFI---EWMQECHEITDGEII----AID----GK 111 + + T+ R+ D + E H+ EI+ A D G Sbjct: 99 NRLQPLAGKSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGD 158 Query: 112 TIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEIT-AIPELLNLL-----YLK 165 F + F + +V + + AI LL Sbjct: 159 QEGRFFHGYYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPET 218 Query: 166 KNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQG 200 + + D C+ + K+ DY++ + N Sbjct: 219 RIVFRGDGGFCRHRMLDWCDRKQVDYVVGLARNTR 253 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 47.9 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 11/60 (18%) Query: 6 LLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFD 65 L D D RQ K +H L +L +T+ EI + +E+L+WL++Y Sbjct: 35 LADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKLT 83 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 47.6 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 1/55 (1%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLE 56 ++ L Y D R +HKL ++ + +CAVIAGAD IE + RL+ Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE-WLAGRLQ 72 >UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6C2C4_9GAMM Length = 77 Score = 47.6 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 9/57 (15%), Positives = 21/57 (36%), Gaps = 1/57 (1%) Query: 316 KMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKK-EGCVKHRERSSEVH 371 ED R+ A + ++K+A++LL + K + + ++ Sbjct: 21 SFREDECRVHDPMAGGNFALLRKIAISLLVRDRSNKTSLRGRCRKVAWDNDYMRQLF 77 >UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P672_AZOSE Length = 47 Score = 47.2 bits (110), Expect = 9e-04, Method: Composition-based stats. Identities = 15/31 (48%), Positives = 20/31 (64%) Query: 302 HWLIEHSLHWVLDVKMNEDASRIRRGNAAEI 332 HW +E+ LHW L+V+ NED SR+R A Sbjct: 1 HWGVENWLHWCLNVQFNEDRSRVRSAYAVNN 31 >UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUW9_9BACT Length = 61 Score = 47.2 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 10/50 (20%), Positives = 19/50 (38%) Query: 313 LDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 D ED S+IR NA ++ +K + + L + + + Sbjct: 2 RDTSFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPNIAKTLRNFAAR 51 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 47.2 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 1/59 (1%) Query: 4 QSLLDYISVTPDIRQQG-KVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 + + ++ PD R++ ++HK IL + +CA+I GAD W + +FG + +W + + Sbjct: 36 RVIEEHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 23/48 (47%) Query: 47 IEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWM 94 + F + + +K D G P DT+ RV + I+ F +MF W+ Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWI 48 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 63/422 (14%), Positives = 134/422 (31%), Gaps = 72/422 (17%) Query: 2 SIQSLLDYISVTPDIRQQGKVKHK----LSAILFLTVCAVIAGADEWQEIEDFGHERLEW 57 + +LL + PD R K +HK L L + V + + +E+ + L Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEI--------IAID 109 L++ +P DT+ R++ +ID E+ ++ ++ IAID Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 110 GK------TIR---------GSFDKGKRKGAIHMVSA-FSNENGVV-----------LGQ 142 G T+ G + + ++++ A NG+V LG Sbjct: 193 GSQKLAGDTLWAEELLQRHVGKDETRHTQYFVYVLEASLVFHNGLVIPLLSEFLEHALGD 252 Query: 143 VKTEAKSNEITAIPELLNLL----YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKG- 197 + + + E+ L + L L+ +D + + + +++ +K Sbjct: 253 SEAQKQDCELRGFARLSDRLKRLFPRLPILLLLDGLYANGPVMQRCLRAHWQFMIVLKDK 312 Query: 198 NQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKG 257 + + F P + T + GR++ V+++ + K Sbjct: 313 DLPTVWEEFRALQPRQLP---------TLQQDWGRRQQHFSWVNDIEYAYGSNGRCRLKL 363 Query: 258 LKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAH----AIRAHWLIEHSL---- 309 +C +E + + ++SS+ + + R W IE Sbjct: 364 HVVVCEERWQGVDQEARIVTETARHAWLSSQPLSRENVHERCNLGARHRWGIEAGFLVEK 423 Query: 310 ----HW----VLDVK-MNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGC 360 H+ LD M +R + ++ + +L R + C Sbjct: 424 HQGYHYEHAFALDWNAMRGYHLLMRLAHVFNTLARFTRQLRDLYRQFGVRGAIAFIRNSC 483 Query: 361 VK 362 Sbjct: 484 AG 485 >UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVT6_9GAMM Length = 120 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 10/94 (10%), Positives = 28/94 (29%), Gaps = 4/94 (4%) Query: 3 IQSLLDYISVTPDIRQQGK-VKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKY 61 ++++ D R + ++ L+ L + + + +D E + Sbjct: 14 LRTVRACFEALDDPRSRPNSTRYTLADALSSALAMFLLKYPSLLQFDDSARAADEVTRHN 73 Query: 62 GDFDNGI---PVDDTIARVVSNIDSLAFEKMFIE 92 G+ P D + ++ + F Sbjct: 74 LGTLYGVEQVPCDTQMRAILDPLKPSTLRGAFRA 107 >UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LF21_FRASN Length = 420 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 48/312 (15%), Positives = 93/312 (29%), Gaps = 37/312 (11%) Query: 18 QQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARV 77 +Q K +++ T+ + D++ E+ L WL D +P I++ Sbjct: 47 EQRKRLLPARVVVYFTMAMCLFFDDDYDEVMRRLVGTLRWLGS-WKGDWKVPSTGAISQA 105 Query: 78 VSNIDSLAFEKMFIEWMQECHEIT-------DGEIIAIDGKTI-------------RGSF 117 + + + +F + ++A+DG + R S Sbjct: 106 RTRLGPEPLKLLFERVAVPVAGLGTKGAWLGSRRLVAVDGVHLDTADTPENADAFGRFSH 165 Query: 118 DKGKRKGA-IHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGC 176 +H+V+ V S+E + L + L+T D Sbjct: 166 GPKTAAFPQVHVVALAECGTHAVFAAAIGAYTSDERSLAATLFDACE-PGMLLTADRNFY 224 Query: 177 QKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETR 236 + + AD L V N L P + + D R Sbjct: 225 GYGLWQQALATGADLLWRVNAN---LTLPVIRALPDGSYLSLLIDPKIPVAR-------R 274 Query: 237 LHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISS-KDMDAKEF 295 ++++ + E + S +E+ ++E + + I D+ A E Sbjct: 275 GQLIADARAGHAPPTESALPVR---VIEYSVPDHEENGTSELICLITNILDPTDVAAIEL 331 Query: 296 AHAIRAHWLIEH 307 A A W IE Sbjct: 332 ATAYHERWEIES 343 >UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteobacteria RepID=A6X872_OCHA4 Length = 330 Score = 44.9 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 23/231 (9%), Positives = 70/231 (30%), Gaps = 22/231 (9%) Query: 17 RQQGK--VKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTI 74 R+ ++ AI +C + + + + L + + + +P T Sbjct: 53 RKTRGGQCRYSDLAIETTLICGKV-----FNQPLRQTEGLMASLLRLLNVELPVPDHTTF 107 Query: 75 ARVVSNIDSLAFEKMFIEWMQECHEITDGEII------AIDGKTIRGSFDKGKRKGAIHM 128 +R +N+ + + + + A + ++ +H+ Sbjct: 108 SRRCANLVVSSLTRCTRRDGTDEPLHVIVDSTGMKIYEAGQWLEEKHGAKSARKWLKLHL 167 Query: 129 VSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKK 188 A ++ V+ + T+ +++++ +P+LL+++ D ++ Sbjct: 168 --AIDADSNQVIAETLTDQNTSDLSQVPDLLDMIDRPIACFMADGAYDSDQTYQALRSHS 225 Query: 189 ADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHI 239 + + + + + D S GR E + Sbjct: 226 PGVSIIIPP-------RIRDLQEASYGPPDQRDWHSRTNAQRGRMEWQNLT 269 >UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=A7BZU6_9GAMM Length = 270 Score = 44.9 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 57/194 (29%), Gaps = 24/194 (12%) Query: 18 QQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARV 77 + L + +AG + F L+ L T Sbjct: 34 KHHNQIFNYYDFFILLMYYFVAGKQS---VGLFVKTELKLLP--ITLGLRQVAYSTFNDA 88 Query: 78 VSNIDSLAFEKMF------IEWMQECHEITDGEIIAIDG-------KTIRGSFDKGKRKG 124 F+++F I + Q T G + IDG + + Sbjct: 89 FERFSPNLFQEVFKYILSTIPFKQISELSTLGVLYCIDGSLFPVINSMLWAEYTSKHCAL 148 Query: 125 AIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKI 184 +H+ F +V+ + T A +E A+ E+L D ++ + Sbjct: 149 KLHLC--FELNRMIVVEFLVTAANGSERKALQEMLK----AGVTYIGDRGYMSFELCHLM 202 Query: 185 KDKKADYLLAVKGN 198 K+A ++ +K N Sbjct: 203 MQKEAYFVFRLKRN 216 >UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID=Q877V8_PSEPK Length = 433 Score = 44.1 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 37/301 (12%), Positives = 82/301 (27%), Gaps = 41/301 (13%) Query: 14 PDIRQQGKVKHKLSAILFLTVCAVIAGAD-EWQEIEDFGHERLEWLKKYGDFDNGIPVDD 72 + RQ+ + L + + + V G + L D + + Sbjct: 20 EEHRQRQYSRELLFSTIIKLMSLVSLGLKPSLHAAARQLDDLPVSLAALYDKISR--TEP 77 Query: 73 TIARVVSNIDSLAFEKMFIEWMQECHEIT------DGEIIAIDGKTIRGSFDKGKRKGAI 126 + R + + E DG +A K + + Sbjct: 78 ALLRALVTGCAQRLAPTIHELGCSAMLPDWQVRVVDGSHLASTEKRLGALRQERGAARPG 137 Query: 127 HMVSAFSNENGVVLG-QVKTEAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIK 185 V + + V+ Q +A ++E + LL + D + C + + Sbjct: 138 FSVVVYDPDLDQVIDLQPCEDAYASERVCVLPLLAEAK-TNQVWIADRLYCTLPVMEACE 196 Query: 186 DKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTR 245 K +++ + +L E + P+ V + + + H R+ + + Sbjct: 197 QVKTSFVIRQQAKHPRLIQEGEWQAPMPVATGTVREQSIEVKGGHR--WRRVELTLHSPN 254 Query: 246 LNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLI 305 + + W L + + A++ A R W I Sbjct: 255 DSGDNSLMFWSNLP----------------------------ESISAQQIADFYRRRWSI 286 Query: 306 E 306 E Sbjct: 287 E 287 >UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001905F7C Length = 367 Score = 44.1 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 42/289 (14%), Positives = 82/289 (28%), Gaps = 29/289 (10%) Query: 27 SAILFLTVCAVIAGADEWQEIEDFGHER-LEWLKKYGDFDNGIPVDDTIARVVSNIDSLA 85 + +L L V+ G + + + +R L + D + +VS + + Sbjct: 43 ADLLRLCFAYVLGGF-SLRTLAAWADQRGLASMSDVAMLKRLKASADWVGYLVSELLAER 101 Query: 86 FEKMFIEWMQECHEITDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKT 145 + F +D ++A+D K MV + + + L V+ Sbjct: 102 CPEAFA------GVHSDLRLMAVDATV----VAPPGPKRDYWMVHTVFDLSRLKLSSVEV 151 Query: 146 EAKSNEITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHA 205 + E + + L D + + + AD+L+ N +L Sbjct: 152 TDRR-EAERLSRGVK----AGELRIADRAHAKATDLAAVVKAGADFLVRAPSNYPRLLDG 206 Query: 206 FEEKFPVNVFSNYKG-----DSFSTQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKK 260 + G D + + E + V L Sbjct: 207 DGQLLERLALCREAGDKGVLDRSVRIQDGKSKVEV----AARVVILPLPPEAAAKARRAA 262 Query: 261 LCVALSFRQKKEDKSAEGVSIRYYISSKDMD---AKEFAHAIRAHWLIE 306 +A R K + E ++S + D + A R W IE Sbjct: 263 RRLAAKARYKPSEAGIEMAGYLVLLTSLNADDWPPERLASTYRLRWQIE 311 >UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z8_THET2 Length = 77 Score = 43.3 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 12/58 (20%), Positives = 29/58 (50%), Gaps = 1/58 (1%) Query: 305 IEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIKGEEEKKEGCVK 362 +E+ WV DV + E+A ++R G A++++ ++ ++LL + ++ Sbjct: 1 MENRSFWVRDVLLYEEACQVR-GVGAQVLAALRAFLVSLLHRRGVREKVTRQRTLKAA 57 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 42.9 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 37/108 (34%), Gaps = 7/108 (6%) Query: 41 ADEWQEIEDFGHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI 100 D + +E F L G + ++ +D ++ Q E Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLL--PLLLHRLDPKKLQEALH---QVFPEA 55 Query: 101 TDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAK 148 G ++ +DGK +RGS + + +V + L Q + E K Sbjct: 56 DLGGVLVVDGKHLRGSGK--GKSPQVRLVEVLALHLKTTLAQARVEGK 101 >UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3YV03_9SYNE Length = 113 Score = 42.5 bits (98), Expect = 0.024, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 30/70 (42%), Gaps = 2/70 (2%) Query: 276 AEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISG 335 + +++S K +R W IE+ H+ + +++E A + N A ++ Sbjct: 12 KPFKATHLFLTSLSSTPKTLLQLVRDRWSIENW-HFFRNTQLHESAH-GYQDNGACAMTT 69 Query: 336 IKKMALNLLR 345 K NLLR Sbjct: 70 QKTGTQNLLR 79 >UniRef50_A1TX01 Transposase, IS4 family protein n=5 Tax=Marinobacter aquaeolei VT8 RepID=A1TX01_MARAV Length = 433 Score = 42.5 bits (98), Expect = 0.026, Method: Composition-based stats. Identities = 40/283 (14%), Positives = 92/283 (32%), Gaps = 26/283 (9%) Query: 4 QSLLDYISVT-PDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYG 62 L ++ D R V+HKL ++ V V AG ++ + E + L+ Sbjct: 41 HQLTQRLATVLDDTRNPVLVRHKLQTMIRQRVFGVAAGYEDLNDHETL--RADQALQTAT 98 Query: 63 DFDNGIPVDDTIARVVSNIDSLAF----EKMFIEWMQECHEITDGEIIAIDGKTI----- 113 + + T+ R+ +D A E ++ ++++ ++ DG + Sbjct: 99 GEEAILAGKSTLCRMEQRVDRQAVVKAHELLWHHFIEQHETPPKEIVLDFDGTDVPVHGD 158 Query: 114 --RGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEIT--AIPELLNLL-----YL 164 F+ + F +++ ++T +S+ AI LL Sbjct: 159 QPGKFFNAYYDHHCYFPLYVF-CGRHLLVSYLRTSNRSDSRHSWAILALLVKFIRQYWPD 217 Query: 165 KKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFS 224 + + D+ + + S DYL+ + N L +E ++ Sbjct: 218 TRIVFRGDSGFYRPRLLSWCDRNNVDYLVGISKNSRLL----KEVDVPSMLVRRAHGELG 273 Query: 225 TQEISHGRKETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSF 267 + + R + + + + E E + V+ + Sbjct: 274 EKVSATYRFQYQARTWKHPRWVIARLEEGELGANPRFIVSSRY 316 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 42.2 bits (97), Expect = 0.036, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 69/218 (31%), Gaps = 24/218 (11%) Query: 5 SLLDYISV-TPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIED--------FGHERL 55 L + ++ D R +V+H L+ IL + A+ G ++ +++ RL Sbjct: 45 KLAEKLAAAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDLDRLRNDPAFKLACGRL 104 Query: 56 EWLKKYGDFDNGI------PVDDTIARVVSNIDSLAFEKMFIEWMQECHEITDGEIIAID 109 + P T+ R+ + L + + D + + Sbjct: 105 PDSGQDLCSQPTCSRLENLPDLRTVIRLGRVLVDLWLSS-YPAPPKSVTLDIDDTLDVVH 163 Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAK---SNEITAIPELLNLL---- 162 G F+ + + + G + + K EI L Sbjct: 164 GHQQLSLFNGHHDERCFLPIHIYDAATGRPVAMILRPGKTPSGKEIRGHLRRLARCIRAR 223 Query: 163 -YLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQ 199 + L+ D+ + ++ + ++ DY+ + GN+ Sbjct: 224 WPDTRILVRGDSHYGRVEVMAWCEENAIDYVFGLAGNK 261 >UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW00_RHOSR Length = 98 Score = 41.8 bits (96), Expect = 0.039, Method: Composition-based stats. Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 2/48 (4%) Query: 110 GKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPE 157 GKT RG+ D H+++A ++ GVVL QV A+ NEI + + Sbjct: 18 GKTWRGAKD--GSGHLTHLLAAVDHDAGVVLRQVAVGARINEIPLLLD 63 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 41.8 bits (96), Expect = 0.039, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 23/64 (35%) Query: 5 SLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDFGHERLEWLKKYGDF 64 L + PD + V H+L+ +L +CAV + I ++ + G Sbjct: 14 GLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGGH 73 Query: 65 DNGI 68 G Sbjct: 74 RPGP 77 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 41.8 bits (96), Expect = 0.040, Method: Composition-based stats. Identities = 17/29 (58%), Positives = 24/29 (82%) Query: 128 MVSAFSNENGVVLGQVKTEAKSNEITAIP 156 MV+A + NG+ +GQ+K ++KSNEITAIP Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIP 29 >UniRef50_A3CU17 AAA ATPase n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CU17_METMJ Length = 516 Score = 41.4 bits (95), Expect = 0.057, Method: Composition-based stats. Identities = 50/310 (16%), Positives = 105/310 (33%), Gaps = 33/310 (10%) Query: 9 YISVTPDIRQQGKVKHKLSA-ILFLTVCAVIAGA-DEWQEIEDFGHE-----------RL 55 +++ D R V H L IL C GA + +++ + + Sbjct: 209 HLASNHDRRLLRDVVHDLPPGILLAFTCRTEDGAGSGYATMQEEIRDLGIHEVQLSGMQR 268 Query: 56 EWLKKYGDFDNGIPVDDTIARVV--SNIDSLAFEKMFIEWMQECHEITDGEIIAIDGKTI 113 +++ G+ + ++D A ++ S D F + + + I Sbjct: 269 HEIRELGERRFHLSIEDAAAGLLEESAGDPFRLIACFNALRNRGLAPSRENVAEV----I 324 Query: 114 RGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSNEITAIPELLNLLYLKKNLITIDA 173 G+ D A+ + + G+ + N +P + +L L+ +T A Sbjct: 325 AGATDPAGLVFAVLPEAMKAWTEGLCI--------LNPPFPVPIMACMLDLQGAGVTAMA 376 Query: 174 MGCQKDIASKIKDKKADYLLAVKGNQGKLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRK 233 Q+ + K +Y A L + P + + E S R Sbjct: 377 NRLQESGMFR-KLPDGEYAFA----HPLLQGHCRRELPEDARVALNARAADCFERSMHRL 431 Query: 234 ETRLHIVSNVTRLNFCDFEFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDM-DA 292 RL+++ ++ F E+ L + L F +++ +A ++ R +S++ + D Sbjct: 432 PGRLYVLLSLAGHLFHAREYGKAADLNLEIGLRFHHREDHDTALMLTERAALSAERLGDD 491 Query: 293 KEFAHAIRAH 302 A A R Sbjct: 492 ALLAAAERQR 501 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.118 0.295 Lambda K H 0.267 0.0359 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,854,754,363 Number of Sequences: 3077464 Number of extensions: 67647162 Number of successful extensions: 213258 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 556 Number of HSP's successfully gapped in prelim test: 100 Number of HSP's that attempted gapping in prelim test: 211449 Number of HSP's gapped (non-prelim): 704 length of query: 374 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 244 effective length of database: 640,326,036 effective search space: 156239552784 effective search space used: 156239552784 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 93 (40.6 bits)