BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (253 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 493 e-138 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 247 2e-64 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 227 3e-58 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 212 1e-53 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 192 6e-48 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 191 2e-47 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 187 2e-46 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 184 2e-45 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 183 5e-45 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 177 3e-43 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 175 1e-42 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 175 2e-42 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 167 2e-40 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 164 4e-39 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 163 5e-39 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 162 7e-39 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 162 8e-39 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 162 8e-39 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 161 2e-38 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 159 1e-37 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 159 1e-37 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 158 1e-37 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 158 2e-37 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 157 4e-37 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 155 1e-36 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 154 2e-36 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 153 5e-36 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 152 1e-35 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 148 2e-34 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 148 2e-34 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 147 3e-34 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 147 4e-34 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 146 5e-34 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 145 2e-33 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 144 3e-33 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 143 5e-33 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 143 6e-33 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 140 3e-32 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 139 7e-32 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 139 9e-32 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 137 5e-31 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 136 8e-31 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 134 3e-30 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 133 7e-30 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 130 4e-29 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 129 7e-29 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 124 2e-27 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 124 4e-27 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 122 1e-26 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 122 2e-26 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 118 2e-25 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 117 3e-25 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 115 1e-24 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 114 2e-24 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 111 2e-23 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 109 7e-23 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 109 1e-22 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 105 2e-21 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 104 3e-21 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 103 4e-21 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 103 6e-21 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 100 4e-20 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 97 5e-19 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 96 1e-18 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 94 3e-18 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 92 2e-17 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 92 2e-17 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 92 2e-17 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 91 3e-17 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 91 5e-17 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 90 7e-17 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 89 2e-16 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 86 1e-15 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 85 2e-15 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 85 3e-15 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 82 1e-14 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 81 3e-14 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 80 8e-14 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 80 9e-14 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 80 9e-14 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 77 5e-13 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 76 1e-12 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 76 1e-12 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 74 4e-12 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 74 5e-12 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 73 9e-12 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 72 2e-11 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 72 2e-11 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 71 3e-11 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 70 5e-11 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 70 7e-11 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 70 9e-11 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 69 2e-10 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 68 3e-10 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 68 4e-10 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 67 5e-10 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 67 5e-10 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 64 3e-09 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 64 4e-09 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 64 4e-09 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 62 2e-08 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 60 5e-08 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 60 8e-08 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 59 1e-07 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 59 1e-07 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 59 2e-07 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 57 7e-07 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 57 8e-07 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 57 8e-07 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 55 2e-06 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 55 2e-06 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 54 5e-06 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 54 6e-06 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 54 7e-06 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 53 1e-05 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 53 1e-05 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 51 3e-05 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 49 2e-04 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 48 3e-04 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 45 0.002 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 44 0.004 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 44 0.005 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 43 0.010 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 43 0.011 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 41 0.041 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 40 0.074 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 40 0.092 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 493 bits (1269), Expect = e-138, Method: Compositional matrix adjust. Identities = 239/253 (94%), Positives = 242/253 (95%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 MELKKLMEHISIIPDYRQ WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETH DFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 YGDFENGIPVHDTIARVVSCI PAKFHE FINWM D HSSDDKDVIAIDGK RHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 RR+GAIHVISAFSTMHSLVIGQIKTD+KSNEITAIPELLNMLDIKGKII TDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNP+HDSYAISEKSHGREE RLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFE 253 CDVPDELIDFTFE Sbjct: 241 CDVPDELIDFTFE 253 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 247 bits (631), Expect = 2e-64, Method: Compositional matrix adjust. Identities = 121/248 (48%), Positives = 165/248 (66%), Gaps = 1/248 (0%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ +SII D RQ KV H L D+L L I AVISG EGWE+I+DFG D+L++Y F Sbjct: 6 LINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRKYLPFS 65 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 GIP DTI+R+ I P +F + F WM DVIAIDGK R S++K + Sbjct: 66 GGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKKDKSDT 125 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++ DAMGCQ IA+KI Sbjct: 126 IHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKIAKKIV 185 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPD 245 +GGDYL VKGNQ RL A + F ++ L P+ ++Y EK HGRE++R+ +V D + Sbjct: 186 DKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMVADA-N 244 Query: 246 ELIDFTFE 253 E+ D FE Sbjct: 245 EIGDLVFE 252 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 227 bits (579), Expect = 3e-58, Method: Compositional matrix adjust. Identities = 121/248 (48%), Positives = 164/248 (66%), Gaps = 9/248 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+E SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G ++L++ G F+ Sbjct: 7 LVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFFK 66 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 GIPV DTIAR++S + P + FI WM + D +IA+DGK RHSYDK +RK A Sbjct: 67 KGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKSA 126 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 IH++SA++ + +V+GQ KTD KSNEI AIP LL++LDIKG I+ DAMGCQ+ IAEKI Sbjct: 127 IHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKIV 186 Query: 186 KQGGDYLFAVKGNQGRLNKA----FE--EKFPLKELNNPKHDSYAISEKSHGREETRLHI 239 + GDY+ AVK NQ +L++ FE +F K + +HD + S K HGR E R + Sbjct: 187 TKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRV---RHDYFEESHKGHGRVELRRYW 243 Query: 240 VCDVPDEL 247 + D+ L Sbjct: 244 ISDMLSTL 251 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 212 bits (539), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 113/251 (45%), Positives = 152/251 (60%), Gaps = 1/251 (0%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M ++ +H S I D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 F +GIP DTIAR+VS I P F+ F+ WM H + +VIAIDGK R SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++ DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A I +GGDYL AVK NQG L KA + F D I EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGLSDDHVNI-EKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFT 251 DFT Sbjct: 240 LSSAALDGDFT 250 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 192 bits (489), Expect = 6e-48, Method: Compositional matrix adjust. Identities = 107/248 (43%), Positives = 147/248 (59%), Gaps = 8/248 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + + H S I D RQ KV + L +ILLLT+CAV+SGA W I +G FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 +G P HD + + + + F FI+W+ + + V+AIDGK R S DK+ K Sbjct: 84 ADGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTV-TGVVAIDGKTSRRSLDKAGGKA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 AIH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+ DAMGCQ++IA KI Sbjct: 143 AIHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKI 202 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAIS-----EKSHGREETRLHI 239 + DY+ A+KGNQG L K + + + E +D ++ EKSHGR ETR Sbjct: 203 ISKEADYILALKGNQGSLRK--DTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVT 260 Query: 240 VCDVPDEL 247 VC D L Sbjct: 261 VCTDIDWL 268 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 191 bits (485), Expect = 2e-47, Method: Compositional matrix adjust. Identities = 96/249 (38%), Positives = 148/249 (59%), Gaps = 3/249 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++EH S + D R A ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 6 FASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQWI 65 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 NG+P HDT V + + P + + F+NW + ++IAIDGK R + + Sbjct: 66 ALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGEQ 125 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 IH++SA+++ + LV+GQ D+KSNEITAIPELL +L+++G ++ DAMGCQ IAE Sbjct: 126 CSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIAE 185 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPKHDSYAISEKSHGREETRLHI 239 I + GDY+ A+KGNQG L + F + +HDSY EK HGR E R + Sbjct: 186 TIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTYW 245 Query: 240 VCDVPDELI 248 D L+ Sbjct: 246 TMGQTDYLL 254 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 187 bits (476), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 100/196 (51%), Positives = 131/196 (66%), Gaps = 13/196 (6%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L L +H + + D RQA KV +KL D+L L + AVISGAEGWE+IEDFG +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM DK V+A+DGK Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGK--------- 111 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 +H+ISAF+T + +V+GQ +TD+KSNEITA+PELL +L+++G ++ DAM CQK I Sbjct: 112 ----TLHMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVK 196 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVK 183 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 184 bits (467), Expect = 2e-45, Method: Compositional matrix adjust. Identities = 98/244 (40%), Positives = 148/244 (60%), Gaps = 19/244 (7%) Query: 23 EHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCIC 82 +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NGIP HDT RV S + Sbjct: 26 KHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNGIPSHDTFGRVFSLLN 85 Query: 83 PAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQ 142 P + F+ W+ S +++AIDGK RHSYD+S+ K A+ +ISA++T + LV+GQ Sbjct: 86 PEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQMISAWATTNGLVLGQ 145 Query: 143 IKTDKKSNEITAIPE---------------LLNMLDIKGKIIKTDAMGCQKDIAEKIQKQ 187 D+KSNEITAIP+ LL +L + G I+ DA+GCQK+I ++I +Q Sbjct: 146 SIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLDAIGCQKEIVKQITEQ 205 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK---HDSYAISEKSHGREETRLH-IVCDV 243 DY+ +K NQG L + E F ++N + Y + ++ HGR+E R + ++ +V Sbjct: 206 DADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEGHGRQEVRYYQMLSNV 265 Query: 244 PDEL 247 +E+ Sbjct: 266 AEEI 269 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 183 bits (464), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 100/250 (40%), Positives = 148/250 (59%), Gaps = 4/250 (1%) Query: 3 LKKLMEHISIIPD-YRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +K E+ + D R+ H DIL++ +CA+ISGA + +IE FG + ++ + + Sbjct: 6 VKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQTF 65 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSR 121 NGIP HDT V++ + P +F F+ W + + IAID K R S DK Sbjct: 66 LALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKKN 125 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 K +H++SA++T +LVIGQIKT++ SNEITAIPELLN LD+KG ++ DAMGCQ +IA Sbjct: 126 GKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEIA 185 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH---DSYAISEKSHGREETRLH 238 EKI ++ DY+ A+KGNQ +L+++ E F L N + D E S+GREE R Sbjct: 186 EKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRCA 245 Query: 239 IVCDVPDELI 248 + +++I Sbjct: 246 YATNEIEKII 255 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 177 bits (449), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 99/238 (41%), Positives = 142/238 (59%), Gaps = 4/238 (1%) Query: 12 IIPDYRQAWKVE-HKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +IPD R+A + H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NGIP Sbjct: 20 LIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANGIPS 79 Query: 71 HDTIARVVSCICPAKFHESFINWMLDYH-SSDDKDVIAIDGKIHRHSYDKSRRKGAIHVI 129 HDT RV S I P F +F +W D D +A+DGK R S+ S + A+H++ Sbjct: 80 HDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-ALHLL 138 Query: 130 SAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGG 189 A+S L++ Q + D KSNEITAIP++L++ D++G I DA+GCQK +A +I + GG Sbjct: 139 HAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITEAGG 198 Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDEL 247 DY+ A+KGNQ L+ + +P+ + A+ EK HGR ETR V D D L Sbjct: 199 DYVLALKGNQSALHDDVRLFMETQADRHPQGQAEAV-EKDHGRIETRRIWVNDEIDWL 255 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 175 bits (444), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 91/237 (38%), Positives = 142/237 (59%), Gaps = 4/237 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ D+L+++ F+ Sbjct: 3 FITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAFK 62 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 G+PV DTIAR++S + P SFI+W+ + + VIA DGK RHS+D RK A Sbjct: 63 EGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGD-RKTA 121 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 +H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+ DAM C K +A+ I Sbjct: 122 LHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAIN 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---KHDSYAISEKSHGREETRLHI 239 +GGDY+ VK NQG+L F + P K +S ++ HGR E R ++ Sbjct: 182 AKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYV 238 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 175 bits (443), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 100/237 (42%), Positives = 141/237 (59%), Gaps = 14/237 (5%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK ENGIP HD Sbjct: 16 VKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVLQLENGIPSHD 75 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSSD---------DKDVIAIDGKIHRHSYDKSRRK 123 T+ RV + + P E W SD K ++AIDGK R + S ++ Sbjct: 76 TLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTIRG--NGSAKQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 A+H+++A++T + GQ+ T++KSNEITAIPELL+M+ +KG ++ DAMG QK IA+K Sbjct: 134 KALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDAMGTQKAIADK 193 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 I K+ DY AVK NQ L E+ P E++ D Y EK+HG+ ETR + V Sbjct: 194 IIKKKADYCLAVKENQKTL---LEDIVPFFEMSQEADDHYHTVEKAHGQIETRAYEV 247 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 167 bits (424), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 86/207 (41%), Positives = 130/207 (62%), Gaps = 1/207 (0%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 NGIP HDT RV S I +F + FI W+ +++IAIDGK R + +K Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+ DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF 210 I K+ DY+ AVK NQ +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 164 bits (414), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 87/243 (35%), Positives = 147/243 (60%), Gaps = 12/243 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L+EH I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 +GIP HDT RV + + P F + F+ W ++ +++A+DGK R + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQGQSP 126 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 I +SA++ +SLV+GQI+ K+NEITA+P+LL +L++ G I+ DAMGCQK+IA + Sbjct: 127 RVI--VSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLN---KAF-EEKFPLKELNNP-KHDSYAI-----SEKSHGRE 233 I + +Y+ A+KGNQG+ + KA+ E+ + P + ++ A+ +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 ETR 236 ETR Sbjct: 245 ETR 247 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 163 bits (412), Expect = 5e-39, Method: Compositional matrix adjust. Identities = 82/248 (33%), Positives = 146/248 (58%), Gaps = 7/248 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+EH++++ + R +H L D++ L I A++SGAEGW DIE +G++ D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP T+AR++ CI E+ + W+ + + K +IA DGK+ R S+ + K A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++ DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAI--SEKSHGREETR--LHIVC 241 ++ + VK NQ +L +A + +F + L + + + + E HGR+E R + Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQF--QSLFDAQKEKIVVEHKESGHGRQEERYVFQLKA 245 Query: 242 DVPDELID 249 +P EL + Sbjct: 246 KLPPELTE 253 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 162 bits (411), Expect = 7e-39, Method: Compositional matrix adjust. Identities = 92/242 (38%), Positives = 135/242 (55%), Gaps = 8/242 (3%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M K L++++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV--IAIDGKIHRHSYD 118 + GIP HDT R+ + + PA F W+ D DDK V +A+DGK R + Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMG-DDKLVGQLAVDGKALRATA- 118 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 K R A+H+++ +ST + +GQ K KSNEITAIPELL +L++KG ++ DAMG Q Sbjct: 119 KGRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQV 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL----KELNNPKHDSYAISEKSHGREE 234 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 179 KIADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKE 238 Query: 235 TR 236 R Sbjct: 239 HR 240 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 162 bits (411), Expect = 8e-39, Method: Compositional matrix adjust. Identities = 82/205 (40%), Positives = 129/205 (62%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L++H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGK +HS +K K A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K +++ EITAIP L+ +L++ G ++ DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKF 210 +G DY A+KGNQ L + +E F Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVF 213 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 162 bits (411), Expect = 8e-39, Method: Compositional matrix adjust. Identities = 87/234 (37%), Positives = 133/234 (56%), Gaps = 3/234 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ ++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 31 LLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLELP 90 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 +GIP DT RV I P + W+ +S ++I IDGK R SYD++ + A Sbjct: 91 HGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQCA 150 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 ++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G II DAMG Q I ++I Sbjct: 151 LYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQIC 210 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---KHDSYAISEKSHGREETR 236 +Q DY+ +K N L ++ F + N +HD Y K H R E R Sbjct: 211 RQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKR 264 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 161 bits (407), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 102/243 (41%), Positives = 133/243 (54%), Gaps = 26/243 (10%) Query: 23 EHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCIC 82 +H+ S I+L+ I AVI GA+ W IEDFG++ F NGIP HDT R S + Sbjct: 33 KHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSHDTFNRFFSALD 92 Query: 83 PAKFHESFINW---MLDYHSSDDKDVIAIDGKIHRHSY----DKSRRKGAI--------- 126 P KF ES+ W +L +S IAIDGK R +Y DK RK + Sbjct: 93 PLKFEESYRQWVQSILKCYSGH----IAIDGKTIRGAYESEQDKRHRKQGVLPDSNTGKY 148 Query: 127 --HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 HVISAF+T + +GQ+ T +K NEI IPELL+ML IK II DA+GCQ+ IAEK+ Sbjct: 149 KLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDALGCQRTIAEKV 208 Query: 185 QKQGGDYLFAVKGNQGRLNK---AFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC 241 K GDY+F VK NQ +L + + E K + D Y E+ HGR E+R+ C Sbjct: 209 IKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKG-TTVRFDKYETHEEGHGRNESRICYCC 267 Query: 242 DVP 244 + P Sbjct: 268 NDP 270 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 159 bits (401), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 93/246 (37%), Positives = 135/246 (54%), Gaps = 5/246 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L ++ I D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP HDT ARV + + P F +W+ S+ VIAIDGK + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I+ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGNQGRLNKAFE---EKFPLKELNNPKHDSYAISEKSHGREETRLHIVCD 242 KQ DY+ A+KGNQ L K + E+F ++ + E +H R E+R V Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRR--VFQ 251 Query: 243 VPDELI 248 VP E + Sbjct: 252 VPVEQV 257 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 159 bits (401), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 90/247 (36%), Positives = 133/247 (53%), Gaps = 2/247 (0%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +L + E +PD R + H LS++L + +CAV+ GA + D+ +G+++ +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD-VIAIDGKIHRHSYDKS 120 + G+P HDT RV++ I PA F +F+ W+ + D V+AIDGK R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 G +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+ DAMG Q I Sbjct: 125 T-SGPLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A I+ +G DY+ VK N L + + K HGR E R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDEL 247 D +L Sbjct: 244 YDAVSQL 250 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 158 bits (400), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 87/240 (36%), Positives = 135/240 (56%), Gaps = 8/240 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD---KDVIAIDGKIHRHSYDK 119 + +NG P HDT+ RV+ + P + + W + ++ K +I IDGK R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + G H++SA+S +GQ +KSNEITAIPELL + +KG+I+ DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDS---YAISEKSHGREETR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 158 bits (399), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 87/240 (36%), Positives = 135/240 (56%), Gaps = 8/240 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD---KDVIAIDGKIHRHSYDK 119 + +NG P HDT+ RV+ + P + + W + ++ K +I IDGK R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + G H++SA+S +GQ +KSNEITAIPELL + +KG+I+ DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDS---YAISEKSHGREETR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 157 bits (396), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 83/250 (33%), Positives = 142/250 (56%), Gaps = 8/250 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++E+ + + D R+ +H L D+L++ + AVI+GA+G I + E H ++LK + Sbjct: 13 ILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSRLELP 72 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS-----DDKDVIAIDGKIHRHSYDKS 120 +G+P HDTI R+++ + P F + F W+ + D +++IAIDGK R S+D+ Sbjct: 73 SGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRSHDRG 132 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + G + + SA++ + +GQ+ KSNEI PEL+ +D++ I+ DA GCQ+D+ Sbjct: 133 KGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGCQRDV 192 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN---PKHDSYAISEKSHGREETRL 237 AEKI GDY+ A+K NQ RL++ + + N+ K + + K HGR + R Sbjct: 193 AEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRLDKRF 252 Query: 238 HIVCDVPDEL 247 + +PDE+ Sbjct: 253 YYQVKLPDEV 262 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 155 bits (392), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 95/248 (38%), Positives = 143/248 (57%), Gaps = 20/248 (8%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + +E ++ I D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV------IAIDGKIHRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGK S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 S + A HV++AF++ LV+GQIKTD+KSNEITAIPELL + +K ++ DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGR--------LNKAFEEKFPLKELNNPKHDSYAIS-EK 228 K+IA KI ++GGDY+ AVKGNQ + L+ +++ +EL YA++ EK Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDR-STRELKAKGQ--YAVTLEK 237 Query: 229 SHGREETR 236 HGR E R Sbjct: 238 DHGRIEKR 245 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 154 bits (390), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 90/253 (35%), Positives = 142/253 (56%), Gaps = 9/253 (3%) Query: 1 MELKKLMEHISI---IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 M++ KL + + + + D+R A + H+LS++L + +CAV+SGA+ +E+I +G + Sbjct: 1 MDIGKLADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPW 60 Query: 58 LKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD-VIAIDGKIHRHS 116 L+ + + G+ DT RV + + P +F ++F W+ + KD VIAIDGK R + Sbjct: 61 LRGFLRLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRT 120 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 K+ +H++SAF+ +V+GQ T +KSNEITAIPELL +LDI+G I+ DAMG Sbjct: 121 TSKAA-AAPLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGT 179 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRL--NKAFEEKFPLKELNNPKHDSYAISEKSHGREE 234 Q IA I+++G Y+ VK N +L + F + P L ++ + HGR E Sbjct: 180 QTKIARAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLT--PSSTHETTSTGHGRIE 237 Query: 235 TRLHIVCDVPDEL 247 R D D L Sbjct: 238 VRRCTAFDATDRL 250 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 153 bits (387), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 82/210 (39%), Positives = 124/210 (59%), Gaps = 5/210 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + + E++S Y Q +H DI+ L + AVISGA W +I+ FGE H D+L++ Sbjct: 1 MSVFRFFENLSDPRAYNQ----KHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGK RHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFN-P 115 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +I DAM QK I Sbjct: 116 ETQSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 AEKI ++ GDY+ +K N + E F Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYF 205 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 152 bits (384), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 95/251 (37%), Positives = 136/251 (54%), Gaps = 14/251 (5%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L E I D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 16 LREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLALP 75 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINW---MLDYHSSDD-----KDVIAIDGKIHRHSY 117 NGIP HDT +V S + P +F E+F W +L SS+ K VIAIDGK R + Sbjct: 76 NGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRGAV 135 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 DK + I + A+++ SL +GQ+K KSNEI A+PELL ML +KG I+ DAMGCQ Sbjct: 136 DKGQAPAVI--VGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMGCQ 193 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE-LNNPKHDSYAISEKSHGREETR 236 +++A KI +Q GDY+ A+K NQ L++ E L + + + HGR E R Sbjct: 194 REVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHEVR 253 Query: 237 LHIVCDVPDEL 247 C V +E+ Sbjct: 254 R---CWVSEEV 261 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 148 bits (373), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 90/214 (42%), Positives = 124/214 (57%), Gaps = 8/214 (3%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICP 83 H +IL++ I AV+S + EDI + T +L+++ +NGIP +T R++ + P Sbjct: 19 HDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLKNGIPSEETFLRILRALDP 78 Query: 84 AKFHESFINWMLDYHS--SDDKDV---IAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSL 138 +F F W+ SDD + IAIDGK R S S + AIH++SAF+T L Sbjct: 79 KQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GSGGESAIHMVSAFATELGL 136 Query: 139 VIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 V+GQ K KSNEITAIPELL L IKG ++ DAMGCQK IA++I + GDYL VKGN Sbjct: 137 VLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSIAKQIVAKKGDYLLMVKGN 196 Query: 199 QGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGR 232 Q +L +A E F + + D + E+ HGR Sbjct: 197 QPKLLEAIETAF-IDQHGVESVDRSSRVERGHGR 229 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 148 bits (373), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 80/242 (33%), Positives = 126/242 (52%), Gaps = 4/242 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L +H+S++ D R H L D+L L + AV SG +GW +I+ FGE ++L+++ F Sbjct: 3 LFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPFA 62 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP TIAR++ + P +W+ D ++ K +IAIDGK R + Sbjct: 63 NGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLG--CNT 120 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 +H + AF + L + Q K EI + L+ ML+I +I DA+ Q+ E I Sbjct: 121 LHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAIV 180 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPD 245 + GDY+ VK NQ L +A + ++ + ++ + +A SEK HGR E R I +P Sbjct: 181 ARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIPS 238 Query: 246 EL 247 +L Sbjct: 239 KL 240 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 147 bits (371), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 82/254 (32%), Positives = 132/254 (51%), Gaps = 16/254 (6%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L+E + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD------VIAIDGKIHRHS 116 D GIP HDT RV I P F F+NW + D IA+DGK+ RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 +D+ + +H++SA++T LV+ Q D K E A+P +L L + G ++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN-----PKHDSYAISEKSHG 231 ++++A+ I +G YL +K NQ +++ F + P D++ + +HG Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAF---DDTHG 238 Query: 232 REETRLHIVCDVPD 245 R R C PD Sbjct: 239 RLVRRRVFAC--PD 250 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 147 bits (370), Expect = 4e-34, Method: Compositional matrix adjust. Identities = 79/244 (32%), Positives = 131/244 (53%), Gaps = 3/244 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 +++H+ I D R EH + DI L + AVISGA+ W +FG ++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP +I R+ + ++ ++W+ +Y + + IAIDGK+ + S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLK-GAKASASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++ DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC--DV 243 K+GGD + VKGNQ +L +A + +F NNP + + + K HGR E R+ C ++ Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 244 PDEL 247 P E+ Sbjct: 240 PAEI 243 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 146 bits (369), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 83/239 (34%), Positives = 126/239 (52%), Gaps = 9/239 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 L+ L+EH S I D R ++ H L +ILLL +C ++ + +E+I +G H FL+++ Sbjct: 11 RLRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRH 70 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSR 121 + +G+P + +++ I PA F +F W+ D +AIDGK R S+D+ Sbjct: 71 LPYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPG-RADFVAIDGKTSRRSHDRRA 129 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGKIIKTDAMGCQ 177 IH++SAF+T LV+ Q K+NE+ AIP LL+ L + G ++ DA+ Sbjct: 130 GTAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATN 189 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 IA I+ QG DYL AVK NQ L E F + + + HD +K HGR E R Sbjct: 190 PTIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEER 244 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 84/209 (40%), Positives = 119/209 (56%), Gaps = 4/209 (1%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICP 83 H ++L++ I AV+S + EDI +G D+L+Q+ NG+ +T R+ + P Sbjct: 28 HDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLVLLNGVASEETFLRIFRALDP 87 Query: 84 AKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQI 143 +F +F W+ + + +DGK R S S + AIH++SAF+T +V+GQ Sbjct: 88 KQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGGESAIHMVSAFATELGVVLGQE 144 Query: 144 KTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLN 203 K KSNEITAIPELL L I G ++ DAMGCQK+IA +I QGGDYL AVKGNQ L Sbjct: 145 KVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQITDQGGDYLLAVKGNQPTLL 204 Query: 204 KAFEEKFPLKELNNPKHDSYAISEKSHGR 232 A E +F + + + D + SHGR Sbjct: 205 DAIETEF-IDQYQSDDVDRHRQVHPSHGR 232 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 144 bits (362), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 85/246 (34%), Positives = 127/246 (51%), Gaps = 13/246 (5%) Query: 13 IPDYRQ--AWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +PD R A K+ H L+DIL + CAVI+GAEGWEDI ++G + F +++ + +NG+P Sbjct: 12 LPDPRTETANKI-HTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLELKNGVPS 70 Query: 71 HDTIARVVSCICPAKFHESFINWMLDYHSS--------DDKDVIAIDGKIHRHSYDKSRR 122 HDT RV + + P F + F W + + D +A+DGK R S K Sbjct: 71 HDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRSA-KPTF 129 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 G +H++ + +L++GQ + +EIT ++L LD+ G ++ DA GCQ + E Sbjct: 130 SGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGCQTETLE 189 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPKHDSYAISEKSHGREETRLHIVC 241 I+ +GG+Y+ VKGNQ L A F E D + +HGR E R V Sbjct: 190 VIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEERNVTVV 249 Query: 242 DVPDEL 247 PD L Sbjct: 250 HDPDGL 255 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 143 bits (361), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 71/148 (47%), Positives = 97/148 (65%), Gaps = 1/148 (0%) Query: 104 DVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD 163 +V+AIDGK R SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEITAIP L+ MLD Sbjct: 11 EVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEITAIPALIQMLD 70 Query: 164 IKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSY 223 ++G I+ DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F D+ Sbjct: 71 LRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPHRRAPIDRDTC 130 Query: 224 AISEKSHGREETRLHIVCDVPDELIDFT 251 I EK GR E R + V D + DF+ Sbjct: 131 QI-EKQKGRVEARTYHVLSASDLIRDFS 157 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 143 bits (360), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 81/231 (35%), Positives = 122/231 (52%), Gaps = 3/231 (1%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + G+P Sbjct: 23 IKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVNMRCGVPSTL 82 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAF 132 T ARV S I P +F WM + D+I +DGK S + + + A H+++A+ Sbjct: 83 TFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQKATHIVNAY 142 Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYL 192 + +G+++ KSNEI AIP LLN L+++G II DAMG QK IA I+ + DY+ Sbjct: 143 LPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANLIRLKQADYV 202 Query: 193 FAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEK---SHGREETRLHIV 240 A+K N R + E F + + + Y E HGR E R + V Sbjct: 203 LALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 140 bits (354), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 78/231 (33%), Positives = 118/231 (51%), Gaps = 7/231 (3%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R H L D+L + + A I GAE D F +++ + G+P HD Sbjct: 12 LPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLPSHD 71 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAF 132 T +RV + P F F + LD+ D V+AIDGK R S+D++ + A+HV+SAF Sbjct: 72 TFSRVFRLLDPVAFSRCFQQF-LDHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVVSAF 130 Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYL 192 ++ +++GQ NEI A LL + D+KG ++ DA+ Q+ A+ I ++GGD+L Sbjct: 131 ASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGGDWL 190 Query: 193 FAVKGNQGRLNKAFEEKF--PLKELNNPKHDSYAISEKSHGREETRLHIVC 241 F +K N+ L E F P L P + ++ HGR E R H V Sbjct: 191 FPLKDNRPALRAEVERYFADPATVLAVP----HVTTDADHGRIEVRRHWVS 237 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 139 bits (351), Expect = 7e-32, Method: Compositional matrix adjust. Identities = 71/164 (43%), Positives = 102/164 (62%), Gaps = 3/164 (1%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R+ + H+L ++LL IC VISGAE W + + + D+L+ Y + +GI HD Sbjct: 15 LPDPRRR-ECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPYAHGIASHD 73 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAF 132 T RV S + ++F F+ W+ S + +AIDGK R S+D +R IH++SA+ Sbjct: 74 TFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGAR--SPIHLVSAW 131 Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 S+ +L +GQ++T KSNEITAIPELL LDI+G I DAMGC Sbjct: 132 SSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGC 175 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 139 bits (350), Expect = 9e-32, Method: Compositional matrix adjust. Identities = 87/228 (38%), Positives = 125/228 (54%), Gaps = 13/228 (5%) Query: 22 VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+++LL T+ +I A +++IE G D+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 CPAKFHESFINWM--LDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLV 139 P +F W+ L H +AIDGK R S + GA+H++SA++ LV Sbjct: 62 DPKYLETAFSAWVESLRVHVGGG---VAIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLV 118 Query: 140 IGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 IGQ + KSNEITAIPELL+ L + G I+ DAMG QK IA K+ +G DY+ A+KGNQ Sbjct: 119 IGQRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQ 178 Query: 200 GRLNKAFEEKFPLKEL--NNPKHDSYAISEKSHGREETRLHIVCDVPD 245 G L+ + F +L +HD I HGR E R C V D Sbjct: 179 GTLHDDVRDFFADPDLLRECARHDDTCI---GHGRIEER---TCQVAD 220 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 137 bits (344), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 80/246 (32%), Positives = 123/246 (50%), Gaps = 7/246 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + I D RQA KV H++ ++L++ C+ + E + D+ DF ++ +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 ++G P HD V+ I P E W D IAIDGK R +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDLEGRH----IAIDGKALRGTHNAETG 116 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 + +H++ A+ + L GQI +KSNEI AIP LL L +KG + DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPKHDSYAISEKSHGREETRLHI 239 +I G DY+ A+K N R ++ + F E L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPD 245 + + D Sbjct: 237 ITEELD 242 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 136 bits (342), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 84/232 (36%), Positives = 125/232 (53%), Gaps = 10/232 (4%) Query: 15 DYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 D RQ+WK+ + LS IL L ++G E +++EDF E + Y D G P HDT+ Sbjct: 19 DSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPSHDTL 78 Query: 75 ARVVSCICPAKFHESFINWMLDYHSSDD-KDVIAIDGKIHRHSYDKSRRKGAIHVISAFS 133 RV+S + + E + + S D +I++DGK R + K+++ +H+++A+ Sbjct: 79 ERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRGNRGKNQK--PVHIVTAYD 136 Query: 134 TMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLF 193 H L +GQ+ ++KSNEI AIP+LL +DI+ I+ DAMG Q I + I K DY Sbjct: 137 GGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCL 196 Query: 194 AVKGNQGRLNKAFEEKFP----LKELN-NPKHDSYAISEKSHGREETRLHIV 240 AVKGNQ L F L+EL N ++ Y EKS G+ E R + V Sbjct: 197 AVKGNQETLYDDIALYFSDVNLLEELQENAQY--YQTVEKSRGQIEVREYWV 246 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 134 bits (337), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 81/198 (40%), Positives = 115/198 (58%), Gaps = 13/198 (6%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK-QYGDFENGIPVH 71 I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K + D E IP H Sbjct: 12 IEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPDLE-FIPSH 70 Query: 72 DTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHR--------HSYDKSRRK 123 DT R S I P F F NW+ + K V+AIDGK+ R H+ K K Sbjct: 71 DTFNRFFSIIKPEYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTTGKEGFK 129 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 + ++SA+S ++ + +GQ+K D KSNEITAIP L+N L++ G I+ DAMGCQKDI + Sbjct: 130 --LWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDITQT 187 Query: 184 IQKQGGDYLFAVKGNQGR 201 I ++ +Y+ A+K N+ + Sbjct: 188 IIERDANYIIAIKENKKK 205 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 133 bits (334), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 77/238 (32%), Positives = 127/238 (53%), Gaps = 7/238 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF- 64 L+E S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 65 -ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 E G P HDT + + F F +W+ + D V+AIDGK R S K + Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVID-GVVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+ DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPKHDSYAISEKSHGREETRLH 238 I +GGDY+ VK NQ L +A E F + + +EK HGR ETR + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRY 238 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 130 bits (327), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 72/200 (36%), Positives = 103/200 (51%), Gaps = 9/200 (4%) Query: 38 ISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDY 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F F + Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 ++V+A+DGK R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 LL L + G I+ DAMGCQ IAE+I+ +G D L +K N G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 -----PKHDSYAISEKSHGR 232 P D++ + HGR Sbjct: 184 GAAGRPVFDAF----EGHGR 199 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 129 bits (325), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 75/195 (38%), Positives = 108/195 (55%), Gaps = 7/195 (3%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K IP HD Sbjct: 12 IEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPSLEFIPSHD 71 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH------SYDKSRRKGAI 126 T R S I P F F NW+ + K V+AIDGK+ R + + + + Sbjct: 72 TFNRFFSMIKPDYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTRGKEGFKL 130 Query: 127 HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQK 186 ++SA+S + + +GQ+K D KS+EITAIP L+N L++ G I+ DAMGCQKDI + I Sbjct: 131 WMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQKDITQTIIG 190 Query: 187 QGGDYLFAVKGNQGR 201 +Y+ A+K N+ + Sbjct: 191 HDANYIIAIKENKKK 205 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 124 bits (312), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 78/247 (31%), Positives = 134/247 (54%), Gaps = 10/247 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L++H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG D+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 +GIP IA ++ + ++ W+ D K +IA+DGK R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 +H++SAF + + + +KK +E ++++ L + ++ DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKS---HGREETR--LHIV 240 + D++ +KGNQ L A + F ++P + AISE++ HGR+E R + I Sbjct: 182 SKKSDFVIQIKGNQPALLAAVKAAF-AACYDSP---ALAISEQTNTGHGRKECRRVMQIE 237 Query: 241 CDVPDEL 247 ++P EL Sbjct: 238 GNLPPEL 244 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 124 bits (310), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 87/241 (36%), Positives = 124/241 (51%), Gaps = 16/241 (6%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +IIPD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K IP Sbjct: 8 AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKARLPGLVSIPS 67 Query: 71 HDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR-----KGA 125 HDT++R S + F E F W+ D V+AIDGK + DKS + Sbjct: 68 HDTLSRFFSILDIDWFEECFRLWVDDI-CRRIPGVVAIDGKAICDNPDKSSNSKNGVRSK 126 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 ++++SA+S + + +GQ K ++KSNE AIPEL+ LD++ II DA+GCQK I + I Sbjct: 127 LYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIGCQKSITKLII 186 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKE-----LNNPKHDSYAISEKSHGREETRLHIV 240 + DY+ K N L E F L E L + K Y K HGR E R V Sbjct: 187 ENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKR--YFEENKGHGRSEYR-ECV 241 Query: 241 C 241 C Sbjct: 242 C 242 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 122 bits (305), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 64/133 (48%), Positives = 89/133 (66%), Gaps = 4/133 (3%) Query: 106 IAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK 165 +AIDGK R S+D +R IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHDGAR--SPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD--SY 223 G I DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + + + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AISEKSHGREETR 236 + ++K+HGR ETR Sbjct: 119 SQTDKNHGRIETR 131 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 122 bits (305), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 89/252 (35%), Positives = 135/252 (53%), Gaps = 16/252 (6%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL E S IPD+R+A K + HKLSDI++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCICPAK-------FHESFINWMLDYHSSDDKDVIAIDGKIH 113 NGIP T+ R+ I F E+F ++ + +++I IDGK Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCT--QEIICIDGKAE 152 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDA 173 R + K+ R I +SA S + + ++KSNEI A+P L++ +DI GKI+ DA Sbjct: 153 RGTVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADA 210 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGRE 233 M QKDI +KI+++ GD++ +K NQ L E+K +KEL +P + E HGR Sbjct: 211 MSMQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDK--IKEL-SPVYSYCGEPELGHGRI 267 Query: 234 ETRLHIVCDVPD 245 ETR + V D D Sbjct: 268 ETRSYRVFDGTD 279 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 71/254 (27%), Positives = 120/254 (47%), Gaps = 8/254 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + E +PD R A H L++IL + + A + GA D+ F + Sbjct: 5 MDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDVL 63 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS----DDKDVIAIDGKIHRHSYD 118 +NG+P HDT +RV + P F ++F +M + K VIA+DGK R Y+ Sbjct: 64 VLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGYE 123 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 R +++A++ + + ++ +NE +L+ +L +KG ++ DA+ C + Sbjct: 124 SGRSHMPPVMVTAWAAQTRMALANVQA-PNNNEAAGALQLIELLQLKGCVVTADALHCHR 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLH 238 +AE I+ +GGDY+ AVK NQ L + + K ++ S + HGR+E R Sbjct: 183 GMAEAIKARGGDYVLAVKDNQPALMR--DAKAAIRAATRQGKPSTITVDAGHGRKEKRRA 240 Query: 239 IVCDVPDELIDFTF 252 +V VP D F Sbjct: 241 VVAAVPQMAQDHDF 254 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 117 bits (293), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 71/223 (31%), Positives = 107/223 (47%), Gaps = 35/223 (15%) Query: 15 DYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 D RQ KV H+ I++ + V + + W ++ DF DF++++ P HDT+ Sbjct: 29 DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFFPDIQKAPSHDTL 88 Query: 75 ARVVSCICPAKFHESFINWMLDYH----SSDDKDV----------------IAIDGKIHR 114 R +CP + W L+ +S ++ + IAIDGK + Sbjct: 89 RRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKPFRQIAIDGKTIK 148 Query: 115 HSYDKSRRK--------------GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 + ++ RR+ +H++SAFS L +GQ + DKK NEI AIP LL+ Sbjct: 149 KAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKKENEIVAIPRLLD 208 Query: 161 MLDI-KGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 LDI +G ++ DAMG QKDI +I K+ YL VK NQ L Sbjct: 209 DLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATL 251 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 115 bits (288), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 69/233 (29%), Positives = 116/233 (49%), Gaps = 9/233 (3%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R +H L +IL + + AV+ GA ++E F + D L+Q+ E G P HD Sbjct: 10 VPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLERGAPSHD 68 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSSDDKDV----IAIDGKIHRHSYDKSRRKGAIHV 128 T +RV++ + P +E+F+ +M + D +A+DGK R +Y K R V Sbjct: 69 TFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGRSHMPPLV 128 Query: 129 ISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQG 188 ++ F + + Q ++ E+ A L +L +KG + DA+ C + + + ++ G Sbjct: 129 VTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMTKTVRDGG 187 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEK-SHGREETRLHIV 240 G Y+ A+KGNQ +L A E L + K + +E+ +HGR E R V Sbjct: 188 GHYVIAIKGNQSKL--AAEANTALDKAAAGKATKFHQTEEDAHGRHEVRRAFV 238 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 114 bits (286), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 69/230 (30%), Positives = 113/230 (49%), Gaps = 8/230 (3%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R A V H L ++L++ +V+ G+ ++ FG F + + ++ IP HD Sbjct: 22 VPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRNFLKLKHAIPSHD 80 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSS-DDKDVIAIDGKIHRHSYDKSRRKGAIHVISA 131 T + V I P +F + D D D+IAIDGK R + D ++SA Sbjct: 81 TFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDPGESARTRMMVSA 140 Query: 132 FSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDY 191 +++ L + + D+ E++A E L ++D++GK++ DA+ C + I GGD+ Sbjct: 141 YASRLRLTLATVPADR-GTELSAAIEALGLIDLRGKVVTGDALHCNRRTVAAINAGGGDW 199 Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKS-HGREETRLHIV 240 A+KGNQ L F K D A++E + HGR+ETR +V Sbjct: 200 CLALKGNQESLLSDARGCFS----KGHKSDPTAVTENTGHGRKETRKAVV 245 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 111 bits (278), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 69/234 (29%), Positives = 122/234 (52%), Gaps = 10/234 (4%) Query: 10 ISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-- 67 I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ +G Sbjct: 9 IAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGKE 68 Query: 68 -----IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 +P HDT V I P +F E + +++ + + IAIDGK R ++ Sbjct: 69 LKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPR-GIKQTAN 127 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 +++SA+ T H VI I ++ K +E+++I +L+ +L ++ + DA G ++ E Sbjct: 128 SHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVIE 187 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 I +GG+++ VKGNQ +L + E++F N D+ + HGR E R Sbjct: 188 MILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSADTQ--EDIGHGRVEKR 239 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 109 bits (273), Expect = 7e-23, Method: Compositional matrix adjust. Identities = 78/246 (31%), Positives = 123/246 (50%), Gaps = 13/246 (5%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K L E + +PDYR+ K ++KL DILLL I + DI FG+ + + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHS---SDDKDVIAIDGKIHRHSY 117 G +G+P T+ R+ I E + +H D++ IDGK R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 ++ R I +SA+S + + ++KSNEIT++P+LL+ +D+ G I+ DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRL 237 K I +KI+++GGD+L +K NQ L E+ L E + + + HGR ETR Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDVYSEGPFL---EHGRIETR- 251 Query: 238 HIVCDV 243 VC + Sbjct: 252 --VCRI 255 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 109 bits (272), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 83/243 (34%), Positives = 124/243 (51%), Gaps = 21/243 (8%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L E + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 5 LFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEML 64 Query: 66 NG------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH---- 115 G +P HDT+ R +S + F ++ W+ + S+ I IDGK R Sbjct: 65 TGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRGVKKL 124 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 S+D HV+SAFS + Q+ D+K+NEI AI +LL++LD+ G ++ DA+G Sbjct: 125 SFDTQS-----HVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIG 179 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPKHDSY-AISEKSHGRE 233 Q I E+I +GGDY+ VK NQ + E F PL + KH +E SHGR Sbjct: 180 TQTAIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQ----KHILLDEQTELSHGRI 235 Query: 234 ETR 236 ETR Sbjct: 236 ETR 238 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 105 bits (261), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 86/283 (30%), Positives = 129/283 (45%), Gaps = 39/283 (13%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWML----DYHSSDDKDV------------- 105 P HDT+ R I + + W D S +D D Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 106 ---IAIDG----------KIHRHSYDKSRRKGA----IHVISAFSTMHSLVIGQIKTDKK 148 IAIDG K+ + S K ++ A +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDIK-GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP+LL+ +DI+ G ++ DA+G QK I EKI ++ DYL VK N +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPKHDSYAISEKS---HGREETRLHIVCDVPDEL 247 ++ ++D +E++ HG TR I C P L Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRL 299 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 104 bits (260), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 69/184 (37%), Positives = 98/184 (53%), Gaps = 14/184 (7%) Query: 68 IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHR--------HSYDK 119 IP HDT R S I P F F NW+ + K V+AIDGK+ R H+ K Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTTGK 62 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 K + ++SA+S + + +GQ+K D KSNEITAIP L+N L++ G I+ DAMGCQKD Sbjct: 63 EGFK--LWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKD 120 Query: 180 IAEKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 I + I + +Y+ A+K N+ + L K + + K+ + + HGR ETR Sbjct: 121 ITQTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETR 180 Query: 237 LHIV 240 V Sbjct: 181 TCTV 184 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 103 bits (258), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 56/132 (42%), Positives = 87/132 (65%), Gaps = 3/132 (2%) Query: 104 DVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD 163 D+IA+DGK R SYD++ K AIH++SA+ST + LV+GQ+KT++KSNE TAIP+L +L Sbjct: 8 DIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKLFTLLA 67 Query: 164 IKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD-S 222 ++ + DA+G Q+DIA++I + DYL VK NQ L++ + + E D + Sbjct: 68 LEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGFTEDFT 127 Query: 223 YAISEKS--HGR 232 +++E+ HGR Sbjct: 128 DSVTEEGDKHGR 139 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 103 bits (257), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 65/226 (28%), Positives = 105/226 (46%), Gaps = 8/226 (3%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 +PD R A H L ++L++ +V+ GA ++ FG + + ++ +P HD Sbjct: 44 VPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLKHAVPSHD 102 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSS-DDKDVIAIDGKIHRHSYDKSRRKGAIHVISA 131 T + V I P +F + D + D DVIA+DGK R + D ++SA Sbjct: 103 TFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGRTRMMVSA 162 Query: 132 FSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDY 191 ++ L + + D+ + E+ A E L ++ +KGK++ DA+ C + I GGD+ Sbjct: 163 YAARLRLTLASVPADRGT-ELEAAIEALGLIALKGKVVTADALHCNRRTVAAINAGGGDW 221 Query: 192 LFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEK-SHGREETR 236 A+K NQ L F + P A+SE HGR ETR Sbjct: 222 CLALKANQDSLLSDARASFGAE----PDAHPSALSEDIGHGRTETR 263 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 100 bits (250), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 78/256 (30%), Positives = 117/256 (45%), Gaps = 38/256 (14%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHES 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWML----DYHSSDDKDV----------------IAIDGK----------IHRHSYDK 119 + W D S +D D IAIDGK + + S K Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 120 SRRKGA----IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK-GKIIKTDAM 174 ++ A +H++SAF + SL +GQ + K NEI AIP+LL+ +DI+ G ++ DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKS---HG 231 G QK I EKI ++ DYL VK N +L + E ++ ++D +E++ HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REETRLHIVCDVPDEL 247 TR I C P L Sbjct: 241 FMVTRTCISCSEPSRL 256 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 97.1 bits (240), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 56/135 (41%), Positives = 80/135 (59%), Gaps = 3/135 (2%) Query: 105 VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDI 164 VIAI+GK R + + A+H +SA++ + L +GQ+ +KSNEITAI ELL L + Sbjct: 5 VIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLPTLAL 64 Query: 165 KGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPKHDS- 222 +G ++ DA+GCQ +AE+I GGDY+ AVK NQ L A + F L +P + Sbjct: 65 EGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPVRQTC 124 Query: 223 -YAISEKSHGREETR 236 + +K HGR ETR Sbjct: 125 VHETLDKGHGRIETR 139 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 95.9 bits (237), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 49/113 (43%), Positives = 74/113 (65%), Gaps = 1/113 (0%) Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 ++H+++A+ + +L++GQ+K D KSNEITAIP+LL ML ++G I+ DAMGCQK IA++I Sbjct: 2 SLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQI 61 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP-KHDSYAISEKSHGREETR 236 + DY+ AVK NQ L + + F ++N H + + HGR ETR Sbjct: 62 GSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETR 114 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 94.4 bits (233), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 50/127 (39%), Positives = 71/127 (55%), Gaps = 3/127 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 G H++SA++T H + +G + T++KSNEITAI LL L K ++ DAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFE---EKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 I GGD++ AV+ NQ +L A EK E +H ++ HGR + R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 CDVPDEL 247 VP + Sbjct: 122 AQVPPDF 128 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 92.0 bits (227), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 62/211 (29%), Positives = 96/211 (45%), Gaps = 3/211 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + L E +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDK 119 F G P T++R + P + + W+ + IA+DGK R S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRDG 120 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + H+++A++ V+ Q++ D K+NE A LL +L + G ++ DAM CQ+D Sbjct: 121 --QVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +A + G DY+ K NQ L + E Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGL 209 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 92.0 bits (227), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 69/251 (27%), Positives = 121/251 (48%), Gaps = 16/251 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF-GETHPDFLKQ 60 E+ L+E ++ +PD R V H L+ +L LT CAV++GA + ++ E + L++ Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 61 YGDFENGI------PVHDTIARVVSCICPAKFHESFINWM-LDYHSSDDKDVIAIDGKIH 113 G + + P TI RV++ I + W+ + +A+DGK Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIIKTD 172 R + R+ +H+++A + LV+ Q+ +K+NEIT LL+ L D+ G ++ +D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGR 232 A+ Q D A ++ + Y+ VK N +L+ + P +++ P D + HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKS-LPWQQI--PLQDRTRTT--GHGR 270 Query: 233 EETRLHIVCDV 243 E R VC V Sbjct: 271 CEIRRLKVCTV 281 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 91.7 bits (226), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 60/208 (28%), Positives = 99/208 (47%), Gaps = 5/208 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+E ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 NG +P +TIA ++ + P + W+ D H D + +A+DGK R + + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGK--RLCGSRDGQV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIIKTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++ DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +Q +GGD + K NQG L E F Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAF 208 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 91.3 bits (225), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 53/157 (33%), Positives = 89/157 (56%), Gaps = 4/157 (2%) Query: 52 ETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGK 111 +TH + L+++ + GI TI R++ I +F+ W+ + S + +A+DGK Sbjct: 24 KTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALYAFMEWVGEIVDSRNTH-LAVDGK 82 Query: 112 IHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKT 171 + +K++ + +++ T+ L++ Q+ D K+NEIT IPELL +LDI G I+ Sbjct: 83 ALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSKTNEITVIPELLKLLDISGSIVTI 142 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEE 208 DA+G Q I E+I +QGG + VK NQ +A+EE Sbjct: 143 DAVGTQTAIMEQIHEQGGHFALTVKKNQ---PEAYEE 176 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 90.5 bits (223), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 49/105 (46%), Positives = 60/105 (57%), Gaps = 3/105 (2%) Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 + T+ KSNEITAIP LL L+ K ++ DAMGCQKDIA I GGD++ AVK NQ +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVP 244 A EK EL +H +Y HGR + R H V VP Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVP 105 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 90.1 bits (222), Expect = 7e-17, Method: Compositional matrix adjust. Identities = 50/116 (43%), Positives = 67/116 (57%), Gaps = 3/116 (2%) Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYL 192 S +LV+GQ K + KSNEITAIP L+ ML+I+ II DAMGCQK+I I+K+ GDY+ Sbjct: 28 SLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYI 87 Query: 193 FAVKGNQGRLNKAFEEKFPL---KELNNPKHDSYAISEKSHGREETRLHIVCDVPD 245 +K NQ L + +E F + +E + +H Y E H R E R I V Sbjct: 88 ITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSS 143 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 53/148 (35%), Positives = 80/148 (54%), Gaps = 3/148 (2%) Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 + NG P DT RV+ I P + + + S + IAIDGK + S K+ Sbjct: 17 ELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHIAIDGKRLKGSKKKT-- 74 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 G+ H++SA+ L + Q +K NE+ AIPE+L+ LD+ G +I DAMG Q +IAE Sbjct: 75 -GSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSGAVISIDAMGTQTNIAE 133 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +I + DY+ ++KGNQ L + + F Sbjct: 134 QIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 85.5 bits (210), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 72/246 (29%), Positives = 120/246 (48%), Gaps = 24/246 (9%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDF 57 ++ L+ + I D R+A + LS +L + A ++GA G +I DFG+ D Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQ---DL 77 Query: 58 LKQYG---DFENG---IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD--VIAID 109 L + G D G P I + + A +F W+ + + + V+A+D Sbjct: 78 LARLGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMD 137 Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL-NMLDIKGKI 168 K+ R ++ + ++ + ++SA LV GQ++ +NEIT + LL N+ DI G + Sbjct: 138 VKVLRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPV 195 Query: 169 IKT-DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL-NKAFEEKFPLKELNNPKHDSYAIS 226 + T DA+ Q + A + + G DY VKGNQ L K FE+ PL + P+H+ + Sbjct: 196 VATLDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYRKTFEQTLPLLQ-KPPQHE---VE 251 Query: 227 EKSHGR 232 E+ HGR Sbjct: 252 ERGHGR 257 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 45/115 (39%), Positives = 70/115 (60%), Gaps = 9/115 (7%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K + SNEITAIPELL +L++ G I++ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP------LKELNNPKHDSYAISEKSHGREETR 236 DY+ +K NQG L ++ E+ F +EL +H +Y E HG E R Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQEL---QHSTYKPEETGHGLHEIR 112 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 84.7 bits (208), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 63/237 (26%), Positives = 113/237 (47%), Gaps = 8/237 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ P + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-KDVIAIDGKIHRHSYDKS 120 +P TI +V + + +D + +A+DGK R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T KSNEI + LL +DI G ++ DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQG-GDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSDPV---ERGHGREEHR 272 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 51/177 (28%), Positives = 88/177 (49%), Gaps = 4/177 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-YGDF 64 L+ + +PD R+A + L +L+ T+ A++SGA + I F E + L +G Sbjct: 16 LLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTHHFGVD 75 Query: 65 ENGIPVHDTIARVVSCICPAKFHESF---INWMLDYHSSDDKDVIAIDGKIHRHSYDKSR 121 PV +T+ V+ + ++F +L +K V+A+DGK R S+D Sbjct: 76 LKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGSFDHIN 135 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 + A ++AF + ++V+ + D KSNEI A +++ L + G + DAM CQK Sbjct: 136 DRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHCQK 192 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 81.3 bits (199), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 57/202 (28%), Positives = 91/202 (45%), Gaps = 7/202 (3%) Query: 40 GAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHS 99 GA+ +I +F E LK+ +G P HDT +R+ I P + + ++ Sbjct: 37 GAKNCVEIAEFVEGREAELKEIVTLRHGCPSHDTFSRIFRLIDPDELARALGAFLAALRQ 96 Query: 100 S-----DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITA 154 + V+A+DGK R Y+K R ++S + L + K + S+E+ A Sbjct: 97 GLGLGPRPRGVVAVDGKALRRGYEKGRAFMPPVMVSVWDAETRLSVA-TKRAEGSDEVAA 155 Query: 155 IPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 LL +D+KG I+ DA+ C+ D A+ + + Y A+K N+GRL E F + Sbjct: 156 TLALLKSIDLKGCIVTADALHCRPDTAKALIGRKAHYALALKANRGRLFACAEAGFVAAD 215 Query: 215 LNNPKHDSYAISEKSHGREETR 236 + E HGR ETR Sbjct: 216 AAG-DLAFHETRETGHGRLETR 236 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 79.7 bits (195), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 39/89 (43%), Positives = 56/89 (62%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 +E S IPD R +H +I+ L + +V++GA+ + +IEDF E H D+LK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWM 94 NGIP HDT +RV S I PA F +SF+ W+ Sbjct: 65 NGIPSHDTFSRVFSAINPASFQDSFLIWL 93 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 79.7 bits (195), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 35/75 (46%), Positives = 53/75 (70%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++++E + + D R A + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 79.7 bits (195), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 46/105 (43%), Positives = 63/105 (60%), Gaps = 5/105 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++H + D R +H L DI+LL I AV+SG+EGWEDIE+FG D+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDG 110 GIP HDTIARV IC K E I ++ +D ++A+ G Sbjct: 67 AGIPRHDTIARV---ICRLKADEKEIAKLIVKQKAD--YILALKG 106 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 77.0 bits (188), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 60/214 (28%), Positives = 95/214 (44%), Gaps = 12/214 (5%) Query: 40 GAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHS 99 GA+ ++ +F E + L++ +G P HDT +RV + P + +F +M Sbjct: 37 GAKTCVEMAEFSEARQEELREIVALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRG 96 Query: 100 S----DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAI 155 + K V+AIDGK R YDK R ++S + I ++ +EI A Sbjct: 97 ALGLPAPKGVVAIDGKSLRRGYDKGRAFMPPLMVSVWDVETRPSIAAMRA-PGGDEIKAT 155 Query: 156 PELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKE 214 +L L +KG + DA+ C +A+ + Y +K N G L +A E F + + Sbjct: 156 LSVLKALTLKGCTVTADALHCHPAMAQALLAAKAQYALGLKANHGPLFRAAEAGFAAVTD 215 Query: 215 LNNPKHDSYAISEKSHGREETRLHIVCDVPDELI 248 L + E+ HGREE R V V D L+ Sbjct: 216 LA-----VFETRERGHGREEQRRASVLPV-DRLV 243 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 61/236 (25%), Positives = 112/236 (47%), Gaps = 9/236 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-YGDF 64 L+E + + D+R+ H L +L++ I + G G+ ++ +F + + L Q + Sbjct: 4 LIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEFNII 63 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLD-YHSSDDKDVIAIDGKIHRHSYDK--SR 121 +P + TI RV+ + + F W L+ Y DD + + +DGK +++ + Sbjct: 64 PERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNPNNE 123 Query: 122 RKGAIHVISAFSTMHSLVIGQIKT-DKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 ++ I +S FS LV+ + +KK +EI ++ ++ K+ DA+ CQK Sbjct: 124 QQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQKKT 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 I K DY+ VKGNQ L K ++ L + P+ + + SHGR+ +R Sbjct: 184 ISLIAKTKNDYVITVKGNQKNLYKRIQD---LSNSSKPE-SCFLEQDNSHGRKISR 235 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 61/226 (26%), Positives = 106/226 (46%), Gaps = 17/226 (7%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ LM+ +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDF----ENGI---PVHDTIARVVSCICPAKFHESFINW----MLDYHSSDDKDVIAIDG 110 F E I P T+ R + I + W + D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIK 170 K R + K++ IH ++AF +V+ Q D+K+NEI + LL ++I+G+I+ Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKI-QKQGGDYLFAVKGNQGRLNKAFE----EKFP 211 DA+ Q + A I + + DY+F VK NQ + + E E FP Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIESLPWEAFP 445 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 73.9 bits (180), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 56/200 (28%), Positives = 97/200 (48%), Gaps = 25/200 (12%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDFLKQYG-DFE 65 + + D R+A + H +LL+ + V++G +E I +D ++ L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQ---LRRLGCRWS 285 Query: 66 NG-----IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 G P TI R++S P + +++ + S IAIDGK R S Sbjct: 286 PGKERFLPPSEPTIRRILSKADPVELDRILSQYIVAHSSGR---AIAIDGKTIRSS---- 338 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTD-KKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 ++ +++A V+ Q D K +EI A LL LD+ GK++ DA+ Q Sbjct: 339 ----SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNA 394 Query: 180 IAEKIQKQGGDYLFAVKGNQ 199 +A +I+++GGDY+F VK N+ Sbjct: 395 LASRIREKGGDYVFTVKDNR 414 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 73.9 bits (180), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 89/186 (47%), Gaps = 17/186 (9%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-Y 61 + L + + IPD+R+A L +LL +I A++SGA + I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARV-----VSCICP-AKFHESFINWMLDYHSSDDKDVIAIDGKIHRH 115 G P + +I V + P + H + + ++ VIA+DGK R Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARL--------AEGAAVIALDGKTLRG 112 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTD--KKSNEITAIPELLNMLDIKGKIIKTDA 173 S D+ + A V+SAF+T +V+GQI + K +EI A L+ L + G++ DA Sbjct: 113 SLDRFEDRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDA 172 Query: 174 MGCQKD 179 + QK+ Sbjct: 173 LHLQKN 178 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 73.2 bits (178), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 46/170 (27%), Positives = 79/170 (46%), Gaps = 3/170 (1%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVVSCIC 82 H L +L L AV+ G + I FG + L F G P T+++ + I Sbjct: 6 HPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTLRRID 65 Query: 83 PAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQ 142 P + + W+ + D + +A+DGK R S D H ++A++ + V+GQ Sbjct: 66 PQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGD--VPGPHRVAAYAPHAAAVLGQ 123 Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYL 192 I+ D ++NE A LL ++ + G ++ A C +D+A + GG Y+ Sbjct: 124 IRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYV 173 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 40/65 (61%), Positives = 43/65 (66%), Gaps = 12/65 (18%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLMEHISIIPDYRQAWKVEHKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 PDFLK 59 DFLK Sbjct: 55 LDFLK 59 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 71.6 bits (174), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 36/73 (49%), Positives = 48/73 (65%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++H I D R +H L +I+LL I AV+SG+EGWE IE+FG D+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVV 78 GIP HDTIARV+ Sbjct: 67 AGIPRHDTIARVI 79 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 71.2 bits (173), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 58/257 (22%), Positives = 116/257 (45%), Gaps = 15/257 (5%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD+R V ++L+ +L L + I+G + + ++ P + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWM--LDYHSSDDKDVIAI--DGKIHRHSY 117 F +P TI R+V P + ++ W + +D ++A+ DGK+ + + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRKGAIH---VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAM 174 + +G++ V+ A +G + +EI ++ L+N + ++ TD + Sbjct: 144 SRPP-QGSVRQEAVVEAVRHDTGTALGHQRV-VAGDEIASVRRLVNRVCDHNTLVTTDCL 201 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGR-E 233 + +A I+ +GG +LF++KGNQ + +A P E N + EK+HGR E Sbjct: 202 HAHEPLARAIRAKGGHWLFSIKGNQPTV-RAKLAGLPWDEFGN----QHVTREKAHGRIE 256 Query: 234 ETRLHIVCDVPDELIDF 250 E L + L+ F Sbjct: 257 ERALKALTPSAPSLVGF 273 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 70.5 bits (171), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 61/256 (23%), Positives = 112/256 (43%), Gaps = 22/256 (8%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+E ++ +PD R+ V ++ + +L + +CA++SGA + I ++ P + Sbjct: 51 LLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAGLGLT 110 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDK-------------DVIAIDGKI 112 +P TI RV+ + A + W+ + D V+A+DGK Sbjct: 111 GRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAVDGKA 170 Query: 113 HRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIIKT 171 R + + +H++ +V+ Q+ D+K+NEI +L+ + D+ +I Sbjct: 171 MRATRHGTH---PVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDVLITV 227 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHG 231 DAM Q A+ + +G L VK NQ ++ + P K++ + + + HG Sbjct: 228 DAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRL-KTLPWKDVPV----GHTTTGRGHG 282 Query: 232 REETRLHIVCDVPDEL 247 R ETR VP L Sbjct: 283 RIETRTLKAVTVPAGL 298 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 70.1 bits (170), Expect = 7e-11, Method: Composition-based stats. Identities = 33/64 (51%), Positives = 44/64 (68%) Query: 18 QAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARV 77 +A+ +H DI+ L + AVISGA W +I+ FGE H D+L++Y FE GIPV DTIARV Sbjct: 14 RAYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRPFECGIPVDDTIARV 73 Query: 78 VSCI 81 + I Sbjct: 74 IKRI 77 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 69.7 bits (169), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 54/204 (26%), Positives = 95/204 (46%), Gaps = 16/204 (7%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGE--THPDFLKQYGDFENGI-- 68 +PD R +H L IL + + AV++ A+ + + ++ T + F Sbjct: 230 LPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKRIRARFNPRTQR 289 Query: 69 ---PVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 P T+ RV+ + W+L + +A+DGK+ + + R G+ Sbjct: 290 YVAPSEPTLRRVLQGANVTALDAAIGAWLLGIAGFE---AVAVDGKVLKGAV---REDGS 343 Query: 126 -IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE-K 183 +H++SAF I Q + +K+NEI + LL +DI+ K++ DA+ Q+ A Sbjct: 344 QVHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADALHTQRKTARFL 403 Query: 184 IQKQGGDYLF-AVKGNQGRLNKAF 206 ++ + DYLF AVKGNQ +L + Sbjct: 404 VEDKKADYLFTAVKGNQRKLRNSL 427 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 68.6 bits (166), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 62/215 (28%), Positives = 95/215 (44%), Gaps = 16/215 (7%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPD-FLKQYG-DFENGIPVHDTIARVVSCICPAKF 86 +L + + A +G G+ + T D L Q G F P T V+S + PA Sbjct: 3 LLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRR--PSEKTFRAVLSRLDPADL 60 Query: 87 HESFINWMLDYHSSDDKD---VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQI 143 + ++ + +S D IA+DGK+ R + + A H++S F+ LV+GQ+ Sbjct: 61 NARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQL 118 Query: 144 KTDKKSNEITAIPELLNMLDIKGK-IIKTDAMGCQKDIAEKI-QKQGGDYLFAVKGNQGR 201 +KSNEI + LL +L + ++ DAM Q A+ I YL VK NQ + Sbjct: 119 AVAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAK 178 Query: 202 LNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 + A P E+ D + HGR ETR Sbjct: 179 I-LARITALPWAEVPAAATD----DSRGHGRVETR 208 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 67.8 bits (164), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 29/70 (41%), Positives = 43/70 (61%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ H + I D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPV 70 G G+PV Sbjct: 72 KGILTEGVPV 81 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 67.8 bits (164), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 66/257 (25%), Positives = 117/257 (45%), Gaps = 21/257 (8%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPD-FLKQYGDF 64 L+ ++ +PD R V H L +L + AV++GA + ++ P L + G F Sbjct: 29 LVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELGVF 88 Query: 65 EN---GI---PVHDTIARVVSCICPAKFHESFINWMLDYH--SSDDKDVIAIDGKIHRHS 116 + G+ P T R+++ + ++ W+L ++ + V ++DGK R S Sbjct: 89 RDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLRGS 148 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 + +H+++ V+GQ+ D K+NE+T LL LD+ ++ DA+ Sbjct: 149 GPAGEQ---VHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADALHT 205 Query: 177 QKDIAE-KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 Q++ A + + Y+F VK NQ RL + + P ++ P D S + HGR + Sbjct: 206 QREHARWLVDTKKAAYVFTVKKNQPRLYRQL-KTLPWTKI--PIQDE--TSTRGHGRYDI 260 Query: 236 RL--HIVCDVPDELIDF 250 R + C P L DF Sbjct: 261 RRLQAVTCTGPLAL-DF 276 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 67.0 bits (162), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 51/167 (30%), Positives = 84/167 (50%), Gaps = 13/167 (7%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL E S IPD+R+A K + HKL D+++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCICPAK-------FHESFINWMLDYHSSDDKDVIAIDGKIH 113 NGIP T+ R+ I F E+F +L + ++++ IDGK Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCA--QEIVCIDGKAE 152 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 R + K+ R I +SA S + + ++KSNEI A+P L++ Sbjct: 153 RGTVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLID 197 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 67.0 bits (162), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 29/82 (35%), Positives = 53/82 (64%), Gaps = 2/82 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + +++H S + D RQ+W+V + L +I LL +CA +SG E + +I +G+ +FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV--VSC 80 + +E G+P HDT+ + +SC Sbjct: 77 FLPYERGLPAHDTLKGLSGISC 98 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 64.3 bits (155), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 61/215 (28%), Positives = 94/215 (43%), Gaps = 16/215 (7%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPD-FLKQYG-DFENGIPVHDTIARVVSCICPAKF 86 +L + + A + G+ + T D L Q G F P T V+S + PA Sbjct: 3 LLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRR--PSEKTFRAVLSRLDPADL 60 Query: 87 HESFINWMLDYHSSDDKD---VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQI 143 + ++ + +S D IA+DGK+ R + + A H++S F+ LV+GQ+ Sbjct: 61 NARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQL 118 Query: 144 KTDKKSNEITAIPELLNML-DIKGKIIKTDAMGCQKDIAEKI-QKQGGDYLFAVKGNQGR 201 +KSNEI + LL +L D ++ DAM Q A+ I YL VK NQ + Sbjct: 119 AVAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAK 178 Query: 202 LNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 + A P E+ D + HGR +TR Sbjct: 179 I-LARITALPWAEVPAAATD----DSRGHGRVKTR 208 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 64.3 bits (155), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 51/181 (28%), Positives = 86/181 (47%), Gaps = 6/181 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+E ++ +PD+R A + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 -ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS---YDKS 120 P T RV+ I F NW+ ++D + +DGK + + YD++ Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 121 RRKGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + I+V+S FS + I Q +K+ +EI + LL LD++G + D++ CQK Sbjct: 124 -YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKK 182 Query: 180 I 180 + Sbjct: 183 L 183 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 64.3 bits (155), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 38/96 (39%), Positives = 52/96 (54%), Gaps = 5/96 (5%) Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G + DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPKHDSYAISE--KSHGREETRLHIVCDVPDEL 247 + + +E K HGR ETR VC V +++ Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETR---VCRVSEDV 93 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 61.6 bits (148), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 32/48 (66%), Positives = 35/48 (72%) Query: 78 VSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 +SCI KFHE FIN M + HSSDD DVIAIDGK HS DKSRR+ A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 60.5 bits (145), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 32/83 (38%), Positives = 49/83 (59%), Gaps = 1/83 (1%) Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE-K 183 A+H++SAF + +V+ Q+ +KSNEI A ELL LDI G + DAM Q++ A Sbjct: 8 AVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARFA 67 Query: 184 IQKQGGDYLFAVKGNQGRLNKAF 206 ++ + D++ VK NQ L +A Sbjct: 68 VEDKRADFVMTVKDNQPELREAL 90 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 60.1 bits (144), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 62/255 (24%), Positives = 112/255 (43%), Gaps = 31/255 (12%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHP-DFLKQ 60 E++ L + ++ +PD R + H+L IL L+ AV +G + E+I + P L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 61 YGDFENGI------PVHDTIARVVSCICPAKFHES---FINWMLDYHSSDDKDVIAIDGK 111 G + + P DT+ RV+S + + + F + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 IHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGK 167 R + R A H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGR--APHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IIKTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRLN----KAFE-EKFPLKELNNPKHD 221 ++ DA+ + A+ I + G ++F VK N L+ +A + K P+ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI--------- 266 Query: 222 SYAISEKSHGREETR 236 ++ ++HGR E R Sbjct: 267 GHSAEGRAHGRFERR 281 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 59.3 bits (142), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 7/78 (8%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + Y E+S GR E+ Sbjct: 12 VRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHES 71 Query: 236 R-------LHIVCDVPDE 246 R L ++ D+ DE Sbjct: 72 RAAFVSHDLSVLGDISDE 89 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 59.3 bits (142), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 64/266 (24%), Positives = 104/266 (39%), Gaps = 31/266 (11%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFG-ETHPDFLKQ 60 +++ L+E + +PD R+ V L +L L + AV GA G+ +I + + P+ Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWM--------------LDYHSSDDKDVI 106 +G P T RV+ P E+ W VI Sbjct: 91 FG-LVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVI 149 Query: 107 AIDGKIHRHSYDKSRRKGAI---HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML- 162 + DGK R + ++ G I V+ V+ + +EI A+ ++ L Sbjct: 150 SADGKTMRGARRRTG-DGKIAQDQVVEILDHASGAVVA-CEPVNDGDEIGAVRTVMGRLA 207 Query: 163 ----DIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 + G ++ TDA Q + E++ GG +L VK NQ R+ A P ++ Sbjct: 208 DRWGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRI-LAKVRALPWAQVR-- 264 Query: 219 KHDSYAISEKSHGREETRLHIVCDVP 244 D+ K+HGR ETR V P Sbjct: 265 AQDT--CRGKAHGRAETRTVRVVQAP 288 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 64/265 (24%), Positives = 111/265 (41%), Gaps = 34/265 (12%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISG-----AEGWEDIEDFGETHPDFLKQ 60 + E ++ IPD+R A + + L + + +CAV + A E + T L+ Sbjct: 24 IWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLRL 83 Query: 61 YGDFENG--IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVI------------ 106 + +G +P TI R ++ + + ++ L +D D + Sbjct: 84 PWNPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAGDQ 143 Query: 107 -------AIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL 159 A+DGK R + K +H++ + ++GQ + D KSNE T LL Sbjct: 144 AVPVRAYAVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRALL 201 Query: 160 NMLDIKGKIIKTDAM-GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 L++ G + DA+ + ++ + ++ YL K NQ +L +AF P E+ P Sbjct: 202 APLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKL-RAFLAALPWTEI--P 258 Query: 219 KHDSYAISEKSHGREETRLHIVCDV 243 D ++ HGREETR V V Sbjct: 259 TAD--LTRDRGHGREETRTLKVATV 281 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 56.6 bits (135), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 8/91 (8%) Query: 161 MLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN--- 217 M +KG ++ DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 218 --PKHDSYAISEKSHGREETRLHIVCDVPDE 246 P HD + E SHGR R V + E Sbjct: 61 LKPDHDEF---EDSHGRTVRRRGWVLPLTPE 88 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 56.6 bits (135), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 51/194 (26%), Positives = 82/194 (42%), Gaps = 16/194 (8%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+ H+ IPD R V +LL+ + ++S E D+E F H L + E Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 66 NGIPVHDTIARV------VSCICPAKFHESFINWMLDY--HSSDDKDVIAIDGKIHRHSY 117 P D+ R V+ IC A +W L + D D + DGK R S Sbjct: 73 LKRPPSDSAFRYFFLQVDVAAICGA-----IRDWTLAQIPGGAGDLDQLICDGKTLRGSI 127 Query: 118 DKSRRKGA--IHVISAFSTMHSLVIGQ-IKTDKKSNEITAIPELLNMLDIKGKIIKTDAM 174 + + GA I ++ +S + I Q + +E + +LL LD++G +I+ DA+ Sbjct: 128 EPTSGGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADAL 187 Query: 175 GCQKDIAEKIQKQG 188 Q+ Q +G Sbjct: 188 HTQQAFFGSSQSRG 201 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 56.6 bits (135), Expect = 8e-07, Method: Composition-based stats. Identities = 27/55 (49%), Positives = 41/55 (74%) Query: 97 YHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNE 151 Y S + ++ DGK R S+D+S K AIH++SA+++ +SLV+GQ+KTD+KSNE Sbjct: 17 YQKSLKEKSLSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNE 71 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 55.5 bits (132), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 43/125 (34%), Positives = 63/125 (50%), Gaps = 22/125 (17%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVI----SGAEGWEDIEDFGETHPDF 57 +LKKL+E S IPD R+A V+H+L+ +LL + + + S E D+ + P F Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDM-----SRPAF 133 Query: 58 LKQ----YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDY--------HSSDDKDV 105 L+ + + E +P DT+ARV+ I P K ESFI + Y H + Sbjct: 134 LQALQGLFPELET-LPHGDTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYP 192 Query: 106 IAIDG 110 IAIDG Sbjct: 193 IAIDG 197 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 55.1 bits (131), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 26/85 (30%), Positives = 44/85 (51%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 +++H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 27 VLKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLP 86 Query: 66 NGIPVHDTIARVVSCICPAKFHESF 90 GIP HDT RV+ + P + F Sbjct: 87 KGIPSHDTFGRVLRILEPKQLQSGF 111 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 53.9 bits (128), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 39/157 (24%), Positives = 74/157 (47%), Gaps = 10/157 (6%) Query: 99 SSDDKDVIAIDGKIHRHSYD-KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 ++ + IA+DGK + S S R+ H++SA + + + +++ K+NE T Sbjct: 127 TAGPRRAIAVDGKALKASARLTSPRR---HLLSAVTHGRVVTLARVEVGAKTNETTHFKP 183 Query: 158 LLNMLDIKGKIIKTDAM-GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 LL LD+ ++ DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 184 LLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIP 242 Query: 217 NPKHDSYAISEKSHGREETRLHIVCDVPDELIDFTFE 253 +A SE HGR E+ C +PDEL + Sbjct: 243 V----QHAASEVGHGRRESSSIKTCAIPDELGGIAYP 275 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 53.5 bits (127), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 32/68 (47%), Positives = 42/68 (61%), Gaps = 7/68 (10%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAIS-----EK 228 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F +E N +SY I K Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYF--EEANEANFESYNIDFAETYNK 58 Query: 229 SHGREETR 236 SHGR E+R Sbjct: 59 SHGRIESR 66 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats. Identities = 28/60 (46%), Positives = 30/60 (50%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 LKQYG FE GI HDTI +VSCI F + FI WM A DGK R S Sbjct: 11 LLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 52.8 bits (125), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 46/187 (24%), Positives = 87/187 (46%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L E +S IPD R A ++ L +L L + A +S + +E F +P L G + Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLGLRK 61 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 P H + ++ + P K E+ + + +D +V+ +DGK H K + Sbjct: 62 P--PGHTILTLLLHRLDPEKLQEALLQV---FPGADLGEVLVVDGK-HLKGSGKGKSP-Q 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD---IKGKIIKTDAMGCQKDIAE 182 + ++ + + Q K + + ++ A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGREDQ--ALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 52.8 bits (125), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 69/295 (23%), Positives = 114/295 (38%), Gaps = 70/295 (23%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAV-------ISGAEGW------EDIE 48 +++ L+ + D R A V +++S +L L +CA+ I+ A W E++ Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 49 DFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS-------- 100 FG + +Y IP T+ V+ + P + + + + S+ Sbjct: 90 AFGLPYHPLRGRYR-----IPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPL 144 Query: 101 ------------------------DDKDVIAIDGKIHRHSY--DKSRRKGAIHVISAFST 134 + IA+DGK R + D SR + V+SA Sbjct: 145 MPDGGIEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR----VFVLSAVRH 200 Query: 135 MHSLVIGQIKTDKKSNEITAIPEL------LNMLDIKGKIIKTDAMGCQKDIAEKIQKQG 188 + + + K+NEI PE L+ D+KG ++ DA+ Q+D A + ++G Sbjct: 201 GDGITLASREIGAKTNEI---PEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERG 257 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDV 243 YL +K NQ R P KE+ D + HGR E RL V V Sbjct: 258 AHYLLTIKNNQ-RGQARQLHALPWKEIPVIHRD----DARGHGRHEQRLVQVVTV 307 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 51.2 bits (121), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 52/202 (25%), Positives = 79/202 (39%), Gaps = 21/202 (10%) Query: 50 FGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAID 109 FG + +LK GI H T + V C+ F + + Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAAL------------PKPLQRA 90 Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ +T NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQ-QTAPGRNEVQGALDALALLSLEGAIV 149 Query: 170 KTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ-GRLNKAFEEKFPLKELNNPKHDSYAISEK 228 DA+ C+ D A I GGDY A+K NQ G L + ++ L +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLG-----VQTAAEN 204 Query: 229 SHGREETRLHIVCDVPDELIDF 250 H R E R + V D IDF Sbjct: 205 DHDRCERRRACIVAVND--IDF 224 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 47/188 (25%), Positives = 89/188 (47%), Gaps = 15/188 (7%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L + +S +PD R A + L +L L + A +S + +E F +P L G + Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLGLRK 61 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS-YDKSRRKG 124 P H I ++ + P K + + +D +V+ +DGK R S KS + Sbjct: 62 A--PGHTAITLLLHRLDPEKLQAALGQ---VFPEADLGEVLVVDGKHLRGSGKGKSPQVK 116 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD---IKGKIIKTDAMGCQKDIA 181 + V++ +H+ + Q + + + E A ELL+ L+ ++GK++ DA ++A Sbjct: 117 LVEVLALH--LHT-TLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVA 171 Query: 182 EKIQKQGG 189 +++K+GG Sbjct: 172 ARVRKKGG 179 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 48.1 bits (113), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 31/84 (36%), Positives = 44/84 (52%), Gaps = 8/84 (9%) Query: 170 KTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAIS--- 226 + D +GCQK IA+ I +Q DYL AVK NQ L++A F +E N + Y I Sbjct: 6 RCDGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYF--EEANKARFAGYNIDYDE 63 Query: 227 --EKSHGR-EETRLHIVCDVPDEL 247 K GR E+ R + ++PD + Sbjct: 64 KINKGPGRLEQRRCWVGYEIPDTI 87 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust. Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 1/71 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +L ++S IPD+R+A + L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 2 QLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQL 61 Query: 65 -ENGIPVHDTI 74 P H +I Sbjct: 62 HRKRAPAHTSI 72 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 44.3 bits (103), Expect = 0.004, Method: Compositional matrix adjust. Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 9/113 (7%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGET-HPDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T + L++ G Sbjct: 16 LWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGCQ 75 Query: 65 ENG------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGK 111 E+ P T+ RV+ I + NW+L S +A+DGK Sbjct: 76 ESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLGLSP--AALAVDGK 126 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 43.9 bits (102), Expect = 0.005, Method: Compositional matrix adjust. Identities = 25/79 (31%), Positives = 38/79 (48%), Gaps = 5/79 (6%) Query: 155 IPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDTAKNSP 61 Query: 215 LNNPKHDSYAISEKSHGRE 233 LN +++ ++K HG E Sbjct: 62 LN-----AWSWTQKGHGHE 75 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 42.7 bits (99), Expect = 0.010, Method: Compositional matrix adjust. Identities = 23/54 (42%), Positives = 32/54 (59%) Query: 106 IAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL 159 IAIDGK R S+D A +V+SAF+ H +++ D+KSNEI A L+ Sbjct: 53 IAIDGKTLRQSFDAFSDTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALI 106 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 42.7 bits (99), Expect = 0.011, Method: Compositional matrix adjust. Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 5/67 (7%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGN----QGRLNKAFEEKFPLKELNNPKHDSYAISEKS 229 MGCQK+IA+ I KQ DY+ A+KG+ QG L +A+ K + D + + Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGEL-EAWWHKCQREGFTADNFDEHTTIDSG 59 Query: 230 HGREETR 236 HGR ETR Sbjct: 60 HGRIETR 66 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 40.8 bits (94), Expect = 0.041, Method: Compositional matrix adjust. Identities = 29/94 (30%), Positives = 44/94 (46%), Gaps = 6/94 (6%) Query: 158 LLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 +L +++ GK I DA+ QK +AE I + YLF VK NQ L + F + Sbjct: 5 ILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEHR---- 60 Query: 218 PKHDSYAISE-KSHGREETRLHIVCDVPDELIDF 250 K Y + + HGR +TR +E ++F Sbjct: 61 -KEPDYCLQDPPGHGRIDTRSIWTTTELNEYLEF 93 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 40.0 bits (92), Expect = 0.074, Method: Compositional matrix adjust. Identities = 41/151 (27%), Positives = 64/151 (42%), Gaps = 8/151 (5%) Query: 100 SDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTD-KKSNEITAIPEL 158 S +K + DGK R S + +++G V+ I Q D +K +EI + L Sbjct: 51 SQEKQWFSGDGKELRGSIESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRAL 109 Query: 159 LNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP 218 L+ D+ + I DA+ E I K GG +L +K NQ L + + P Sbjct: 110 LSKDDLASQKITLDALHLCPSTTEMITKAGGVFLIGLKENQPTLLAH------MTDCALP 163 Query: 219 KHDSYAISEKSHGREETRLHIVCDVPDELID 249 D + +HGR E R + + DV + D Sbjct: 164 PIDQKTTFDFNHGRVEQRKYWLYDVSKQGFD 194 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 39.7 bits (91), Expect = 0.092, Method: Composition-based stats. Identities = 16/30 (53%), Positives = 24/30 (80%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 +++A +T + + IGQ+K D KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 363 4e-99 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 322 6e-87 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 306 3e-82 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 305 7e-82 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 301 1e-80 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 301 1e-80 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 289 5e-77 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 289 6e-77 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 288 9e-77 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 285 9e-76 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 284 2e-75 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 279 8e-74 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 272 7e-72 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 271 1e-71 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 271 1e-71 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 271 2e-71 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 270 3e-71 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 269 5e-71 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 262 8e-69 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 262 8e-69 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 261 1e-68 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 261 2e-68 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 260 3e-68 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 256 4e-67 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 256 8e-67 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 255 9e-67 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 254 2e-66 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 254 2e-66 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 252 8e-66 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 252 8e-66 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 251 2e-65 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 250 3e-65 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 250 4e-65 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 248 2e-64 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 247 2e-64 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 247 2e-64 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 247 3e-64 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 246 6e-64 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 245 1e-63 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 242 1e-62 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 241 1e-62 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 239 5e-62 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 234 2e-60 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 231 2e-59 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 229 1e-58 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 228 1e-58 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 228 2e-58 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 226 4e-58 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 224 2e-57 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 224 2e-57 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 224 3e-57 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 222 7e-57 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 222 1e-56 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 218 1e-55 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 217 2e-55 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 212 9e-54 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 210 4e-53 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 207 2e-52 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 205 9e-52 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 200 4e-50 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 200 5e-50 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 198 1e-49 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 198 2e-49 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 197 2e-49 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 197 4e-49 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 194 3e-48 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 193 6e-48 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 190 5e-47 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 189 6e-47 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 188 2e-46 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 183 5e-45 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 183 6e-45 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 182 7e-45 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 181 2e-44 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 178 2e-43 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 177 3e-43 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 177 4e-43 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 173 6e-42 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 172 8e-42 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 170 5e-41 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 169 7e-41 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 165 1e-39 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 163 6e-39 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 162 1e-38 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 162 1e-38 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 158 2e-37 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 154 3e-36 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 152 1e-35 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 147 3e-34 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 147 4e-34 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 144 2e-33 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 140 4e-32 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 137 3e-31 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 137 4e-31 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 137 4e-31 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 136 6e-31 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 135 2e-30 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 132 9e-30 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 132 1e-29 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 122 1e-26 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 118 1e-25 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 106 7e-22 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 103 5e-21 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 101 3e-20 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 101 3e-20 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 100 4e-20 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 99 2e-19 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 98 2e-19 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 92 2e-17 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 90 8e-17 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 88 2e-16 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 86 1e-15 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 85 1e-15 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 84 4e-15 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 78 2e-13 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 72 1e-11 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 67 5e-10 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 60 5e-08 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 47 6e-04 Sequences not found previously or not previously below threshold: UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 153 4e-36 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 101 2e-20 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 101 2e-20 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 100 6e-20 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 96 1e-18 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 92 1e-17 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 91 3e-17 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 91 4e-17 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 90 7e-17 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 89 1e-16 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 89 2e-16 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 88 3e-16 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 86 1e-15 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 85 2e-15 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 85 2e-15 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 85 3e-15 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 82 2e-14 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 79 1e-13 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 76 1e-12 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 75 2e-12 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 73 6e-12 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 73 7e-12 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 71 3e-11 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 71 4e-11 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 70 1e-10 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 69 2e-10 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 67 5e-10 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 63 8e-09 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 63 1e-08 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 59 1e-07 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 59 2e-07 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 58 2e-07 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 58 4e-07 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 57 5e-07 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 57 8e-07 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 55 2e-06 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 55 3e-06 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 55 3e-06 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 53 1e-05 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 53 1e-05 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 53 1e-05 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 52 1e-05 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 52 2e-05 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 52 2e-05 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 51 4e-05 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 51 5e-05 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 50 6e-05 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 50 6e-05 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 50 9e-05 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 49 1e-04 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 49 1e-04 UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=... 48 2e-04 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 48 2e-04 UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus p... 48 4e-04 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 47 4e-04 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 47 6e-04 UniRef50_A6FBF2 Putative uncharacterized protein n=1 Tax=Moritel... 47 7e-04 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 46 8e-04 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 46 8e-04 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 46 0.001 UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewane... 45 0.002 UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodoco... 45 0.002 UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromon... 45 0.003 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 45 0.003 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 45 0.003 UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodosp... 44 0.005 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 43 0.008 UniRef50_A4WVT3 Transposase, IS4 family n=63 Tax=Bacteria RepID=... 42 0.017 UniRef50_D1RJD3 Putative uncharacterized protein n=1 Tax=Legione... 42 0.018 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 41 0.026 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 41 0.029 UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b... 41 0.046 UniRef50_A5FU21 Transposase, IS4 family protein n=11 Tax=Alphapr... 40 0.059 UniRef50_C7S7P7 Transposase n=4 Tax=root RepID=C7S7P7_METEA 40 0.080 UniRef50_A0P2Q4 Putative uncharacterized protein n=2 Tax=Labrenz... 40 0.081 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 363 bits (931), Expect = 4e-99, Method: Composition-based stats. Identities = 239/253 (94%), Positives = 242/253 (95%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 MELKKLMEHISIIPDYRQ WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETH DFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 YGDFENGIPVHDTIARVVSCI PAKFHE FINWM D HSSDDKDVIAIDGK RHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 RR+GAIHVISAFSTMHSLVIGQIKTD+KSNEITAIPELLNMLDIKGKII TDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNP+HDSYAISEKSHGREE RLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFE 253 CDVPDELIDFTFE Sbjct: 241 CDVPDELIDFTFE 253 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 322 bits (826), Expect = 6e-87, Method: Composition-based stats. Identities = 122/253 (48%), Positives = 166/253 (65%), Gaps = 1/253 (0%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ +SII D RQ KV H L D+L L I AVISG EGWE+I+DFG D+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 Y F GIP DTI+R+ I P +F + F WM DVIAIDGK R S++K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++ DAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A+KI +GGDYL VKGNQ RL A + F ++ L P+ ++Y EK HGRE++R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 CDVPDELIDFTFE 253 D +E+ D FE Sbjct: 241 AD-ANEIGDLVFE 252 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 306 bits (785), Expect = 3e-82, Method: Composition-based stats. Identities = 96/251 (38%), Positives = 148/251 (58%), Gaps = 3/251 (1%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++EH S + D R A ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSR 121 NG+P HDT V + + P + + F+NW + ++IAIDGK R + Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 + IH++SA+++ + LV+GQ D+KSNEITAIPELL +L+++G ++ DAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPKHDSYAISEKSHGREETRLH 238 E I + GDY+ A+KGNQG L + F + +HDSY EK HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVCDVPDELID 249 D L+ Sbjct: 245 WTMGQTDYLLG 255 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 305 bits (782), Expect = 7e-82, Method: Composition-based stats. Identities = 118/248 (47%), Positives = 161/248 (64%), Gaps = 3/248 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+E SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G ++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 + GIPV DTIAR++S + P + FI WM + D +IA+DGK RHSYDK +RK Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 AIH++SA++ + +V+GQ KTD KSNEI AIP LL++LDIKG I+ DAMGCQ+ IAEKI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPKHDSYAISEKSHGREETRLHIVC 241 + GDY+ AVK NQ +L++ + F +HD + S K HGR E R + + Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 DVPDELID 249 D+ L + Sbjct: 246 DMLSTLGN 253 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 301 bits (771), Expect = 1e-80, Method: Composition-based stats. Identities = 112/251 (44%), Positives = 152/251 (60%), Gaps = 1/251 (0%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M ++ +H S I D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 F +GIP DTIAR+VS I P F+ F+ WM H + +VIAIDGK R SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++ DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A I +GGDYL AVK NQG L KA + F D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGL-SDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFT 251 DFT Sbjct: 240 LSSAALDGDFT 250 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 301 bits (771), Expect = 1e-80, Method: Composition-based stats. Identities = 100/251 (39%), Positives = 148/251 (58%), Gaps = 4/251 (1%) Query: 2 ELKKLMEHISIIPDYRQAW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K E+ + D R+ H DIL++ +CA+ISGA + +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 + NGIP HDT V++ + P +F F+ W + + IAID K R S DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 K +H++SA++T +LVIGQIKT++ SNEITAIPELLN LD+KG ++ DAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPKHDSYAISEKSHGREETRL 237 AEKI ++ DY+ A+KGNQ +L+++ E F L E + D E S+GREE R Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVCDVPDELI 248 + +++I Sbjct: 245 AYATNEIEKII 255 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 289 bits (740), Expect = 5e-77, Method: Composition-based stats. Identities = 107/253 (42%), Positives = 147/253 (58%), Gaps = 5/253 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + + H S I D RQ KV + L +ILLLT+CAV+SGA W I +G FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 +G P HD + + + + F FI+W+ + + V+AIDGK R S DK+ K Sbjct: 84 ADGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 AIH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+ DAMGCQ++IA KI Sbjct: 143 AIHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKI 202 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPKHDSYAISEKSHGREETRLHIVC 241 + DY+ A+KGNQG L K E + + ++ + EKSHGR ETR VC Sbjct: 203 ISKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVC 262 Query: 242 DVPDEL-IDFTFE 253 D L D + Sbjct: 263 TDIDWLKADHNWP 275 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 289 bits (740), Expect = 6e-77, Method: Composition-based stats. Identities = 88/244 (36%), Positives = 134/244 (54%), Gaps = 3/244 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ ++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 +GIP DT RV I P + W+ +S ++I IDGK R SYD++ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 A++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G II DAMG Q I ++I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---KHDSYAISEKSHGREETRLHIVC 241 +Q DY+ +K N L ++ F + N +HD Y K H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 DVPD 245 V Sbjct: 270 PVAA 273 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 288 bits (738), Expect = 9e-77, Method: Composition-based stats. Identities = 90/247 (36%), Positives = 133/247 (53%), Gaps = 2/247 (0%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +L + E +PD R + H LS++L + +CAV+ GA + D+ +G+++ +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD-VIAIDGKIHRHSYDKS 120 + G+P HDT RV++ I PA F +F+ W+ + D V+AIDGK R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 G +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+ DAMG Q I Sbjct: 125 TS-GPLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A I+ +G DY+ VK N L + + K HGR E R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDEL 247 D +L Sbjct: 244 YDAVSQL 250 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 285 bits (730), Expect = 9e-76, Method: Composition-based stats. Identities = 101/264 (38%), Positives = 152/264 (57%), Gaps = 18/264 (6%) Query: 8 EHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 ++ + D R +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIH 127 IP HDT RV S + P + F+ W+ S +++AIDGK RHSYD+S+ K A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPE---------------LLNMLDIKGKIIKTD 172 +ISA++T + LV+GQ D+KSNEITAIP+ LL +L + G I+ D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK---HDSYAISEKS 229 A+GCQK+I ++I +Q DY+ +K NQG L + E F ++N + Y + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGREETRLHIVCDVPDELIDFTFE 253 HGR+E R + + E ID ++ Sbjct: 251 HGRQEVRYYQMLSNVAEEIDPDWQ 274 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 284 bits (728), Expect = 2e-75, Method: Composition-based stats. Identities = 92/246 (37%), Positives = 135/246 (54%), Gaps = 5/246 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L ++ I D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP HDT ARV + + P F +W+ S+ VIAIDGK + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I+ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPKHDSYAISEKSHGREETRLHIVCD 242 KQ DY+ A+KGNQ L K ++ F + ++ + E +H R E+R V Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRR--VFQ 251 Query: 243 VPDELI 248 VP E + Sbjct: 252 VPVEQV 257 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 279 bits (713), Expect = 8e-74, Method: Composition-based stats. Identities = 91/248 (36%), Positives = 134/248 (54%), Gaps = 6/248 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M K L++++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-KDVIAIDGKIHRHSYDK 119 + GIP HDT R+ + + PA F W+ D D +A+DGK R + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 R A+H+++ +ST + +GQ K KSNEITAIPELL +L++KG ++ DAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL----KELNNPKHDSYAISEKSHGREET 235 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVCDV 243 R V V Sbjct: 240 RRCWVLMV 247 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 272 bits (696), Expect = 7e-72, Method: Composition-based stats. Identities = 86/246 (34%), Positives = 134/246 (54%), Gaps = 2/246 (0%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L ++E + D+R A + H+LS++L + +CAV+SGA+ +E+I +G +L+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD-VIAIDGKIHRHSYDKSR 121 + G+ DT RV + + P +F ++F W+ + KD VIAIDGK R + K+ Sbjct: 66 RLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKAA 125 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 +H++SAF+ +V+GQ T +KSNEITAIPELL +LDI+G I+ DAMG Q IA Sbjct: 126 AAP-LHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC 241 I+++G Y+ VK N +L + ++ + HGR E R Sbjct: 185 RAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTAF 244 Query: 242 DVPDEL 247 D D L Sbjct: 245 DATDRL 250 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 271 bits (694), Expect = 1e-71, Method: Composition-based stats. Identities = 86/256 (33%), Positives = 142/256 (55%), Gaps = 12/256 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L+EH I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 +GIP HDT RV + + P F + F+ W ++ +++A+DGK R + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQGQSP 126 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 I +SA++ +SLV+GQI+ K+NEITA+P+LL +L++ G I+ DAMGCQK+IA + Sbjct: 127 RVI--VSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLK----------ELNNPKHDSYAISEKSHGRE 233 I + +Y+ A+KGNQG+ ++ + E N +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 ETRLHIVCDVPDELID 249 ETR + L D Sbjct: 245 ETRRYWQSGDVSWLAD 260 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 271 bits (693), Expect = 1e-71, Method: Composition-based stats. Identities = 87/237 (36%), Positives = 137/237 (57%), Gaps = 3/237 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L++H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGK +HS +K K A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K +++ EITAIP L+ +L++ G ++ DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELN---NPKHDSYAISEKSHGREETRLHI 239 +G DY A+KGNQ L + +E F + +H + EK R E Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAY 245 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 271 bits (693), Expect = 2e-71, Method: Composition-based stats. Identities = 86/211 (40%), Positives = 131/211 (62%), Gaps = 1/211 (0%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 NGIP HDT RV S I +F + FI W+ +++IAIDGK R + +K Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+ DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 I K+ DY+ AVK NQ +L + E++F + Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEFRFGK 214 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 270 bits (690), Expect = 3e-71, Method: Composition-based stats. Identities = 89/249 (35%), Positives = 128/249 (51%), Gaps = 11/249 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L E I D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD--------VIAIDGKIHRH 115 NGIP HDT +V S + P +F E+F W + VIAIDGK R Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 + DK + I + A+++ SL +GQ+K KSNEI A+PELL ML +KG I+ DAMG Sbjct: 134 AVDKGQAPAVI--VGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE-LNNPKHDSYAISEKSHGREE 234 CQ+++A KI +Q GDY+ A+K NQ L++ E L + + + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 TRLHIVCDV 243 R V + Sbjct: 252 VRRCWVSEE 260 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 269 bits (689), Expect = 5e-71, Method: Composition-based stats. Identities = 98/242 (40%), Positives = 137/242 (56%), Gaps = 4/242 (1%) Query: 9 HISIIPDYRQAW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 +IPD R+A H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVHDTIARVVSCICPAKFHESFINWML-DYHSSDDKDVIAIDGKIHRHSYDKSRRKGAI 126 IP HDT RV S I P F +F +W D D +A+DGK R S+ S + A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-AL 135 Query: 127 HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQK 186 H++ A+S L++ Q + D KSNEITAIP++L++ D++G I DA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 QGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDE 246 GGDY+ A+KGNQ L+ F + + EK HGR ETR V D D Sbjct: 196 AGGDYVLALKGNQSALHDDVR-LFMETQADRHPQGQAEAVEKDHGRIETRRIWVNDEIDW 254 Query: 247 LI 248 L Sbjct: 255 LT 256 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 262 bits (670), Expect = 8e-69, Method: Composition-based stats. Identities = 88/251 (35%), Positives = 138/251 (54%), Gaps = 8/251 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD---KDVIAIDGKIHRHSYDK 119 + +NG P HDT+ RV+ + P + + W + ++ K +I IDGK R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + G H++SA+S +GQ +KSNEITAIPELL + +KG+I+ DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD---SYAISEKSHGREETR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDEL 247 + + L Sbjct: 239 EYYQTEKIKWL 249 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 262 bits (670), Expect = 8e-69, Method: Composition-based stats. Identities = 82/243 (33%), Positives = 125/243 (51%), Gaps = 3/243 (1%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 G+P T ARV S I P +F WM + D+I +DGK S + + + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 A H+++A+ + +G+++ KSNEI AIP LLN L+++G II DAMG QK IA Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEK---SHGREETRLHIV 240 I+ + DY+ A+K N R + E F + + + Y E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 CDV 243 + Sbjct: 254 LPM 256 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 261 bits (668), Expect = 1e-68, Method: Composition-based stats. Identities = 87/240 (36%), Positives = 135/240 (56%), Gaps = 8/240 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD---KDVIAIDGKIHRHSYDK 119 + +NG P HDT+ RV+ + P + + W + ++ K +I IDGK R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + G H++SA+S +GQ +KSNEITAIPELL + +KG+I+ DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD---SYAISEKSHGREETR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 261 bits (667), Expect = 2e-68, Method: Composition-based stats. Identities = 82/254 (32%), Positives = 143/254 (56%), Gaps = 8/254 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ ++E+ + + D R+ +H L D+L++ + AVI+GA+G I + E H ++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS-----DDKDVIAIDGKIHRHS 116 + +G+P HDTI R+++ + P F + F W+ + D +++IAIDGK R S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 +D+ + G + + SA++ + +GQ+ KSNEI PEL+ +D++ I+ DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPKHDSYAISEKSHGRE 233 Q+D+AEKI GDY+ A+K NQ RL++ + + + K + + K HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 ETRLHIVCDVPDEL 247 + R + +PDE+ Sbjct: 249 DKRFYYQVKLPDEV 262 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 260 bits (664), Expect = 3e-68, Method: Composition-based stats. Identities = 91/247 (36%), Positives = 144/247 (58%), Gaps = 4/247 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ D+L+++ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 + G+PV DTIAR++S + P SFI+W+ + + VIA DGK RHS+D RK Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFD-GDRKT 120 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 A+H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+ DAM C K +A+ I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK---HDSYAISEKSHGREETRLHIVC 241 +GGDY+ VK NQG+L F + P+ +S ++ HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 DVPDELI 248 + L Sbjct: 241 PITPWLT 247 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 256 bits (655), Expect = 4e-67, Method: Composition-based stats. Identities = 79/245 (32%), Positives = 129/245 (52%), Gaps = 8/245 (3%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY--GD 63 L+E S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 E G P HDT + + F F +W+ + D V+AIDGK R S K + Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+ DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH---DSYAISEKSHGREETRLH-I 239 I +GGDY+ VK NQ L +A E F + + +EK HGR ETR + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVP 244 + DV Sbjct: 241 INDVT 245 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 256 bits (653), Expect = 8e-67, Method: Composition-based stats. Identities = 83/247 (33%), Positives = 125/247 (50%), Gaps = 10/247 (4%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH------SYD 118 IP HDT R S I P F F NW+ V+AIDGK+ R + Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 + + ++SA+S ++ + +GQ+K D KSNEITAIP L+N L++ G I+ DAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 DI + I ++ +Y+ A+K N+ + L K + + ++ + + HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVCD 242 R V Sbjct: 243 RTCTVVS 249 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 255 bits (652), Expect = 9e-67, Method: Composition-based stats. Identities = 99/254 (38%), Positives = 145/254 (57%), Gaps = 14/254 (5%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + ++ + + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD---------DKDVIAIDGKIH 113 ENGIP HDT+ RV + + P E W SD K ++AIDGK Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDA 173 R + S ++ A+H+++A++T + GQ+ T++KSNEITAIPELL+M+ +KG ++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGRE 233 MG QK IA+KI K+ DY AVK NQ L + F + + + D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFEMSQEAD---DHYHTVEKAHGQI 240 Query: 234 ETRLHIVCDVPDEL 247 ETR + V L Sbjct: 241 ETRAYEVIHDVSWL 254 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 254 bits (650), Expect = 2e-66, Method: Composition-based stats. Identities = 86/252 (34%), Positives = 129/252 (51%), Gaps = 9/252 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 L+ L+EH S I D R ++ H L +ILLL +C ++ + +E+I +G H FL+++ Sbjct: 11 RLRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRH 70 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSR 121 + +G+P + +++ I PA F +F W+ D +AIDGK R S+D+ Sbjct: 71 LPYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPGRA-DFVAIDGKTSRRSHDRRA 129 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGKIIKTDAMGCQ 177 IH++SAF+T LV+ Q K+NE+ AIP LL+ L + G ++ DA+ Sbjct: 130 GTAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATN 189 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRL 237 IA I+ QG DYL AVK NQ L E F + + + HD +K HGR E R Sbjct: 190 PTIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEERH 245 Query: 238 HIVCDVPDELID 249 V D L Sbjct: 246 VSVIREVDWLSG 257 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 254 bits (649), Expect = 2e-66, Method: Composition-based stats. Identities = 84/252 (33%), Positives = 126/252 (50%), Gaps = 11/252 (4%) Query: 6 LMEHISIIPDYRQA-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L + +PD R H L+DIL + CAVI+GAEGWEDI ++G + F +++ + Sbjct: 5 LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLEL 64 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS--------DDKDVIAIDGKIHRHS 116 +NG+P HDT RV + + P F + F W + + D +A+DGK R S Sbjct: 65 KNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRS 124 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 K G +H++ + +L++GQ + +EIT ++L LD+ G ++ DA GC Sbjct: 125 A-KPTFSGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGC 183 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP-LKELNNPKHDSYAISEKSHGREET 235 Q + E I+ +GG+Y+ VKGNQ L A F E D + +HGR E Sbjct: 184 QTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEE 243 Query: 236 RLHIVCDVPDEL 247 R V PD L Sbjct: 244 RNVTVVHDPDGL 255 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 252 bits (644), Expect = 8e-66, Method: Composition-based stats. Identities = 90/249 (36%), Positives = 132/249 (53%), Gaps = 6/249 (2%) Query: 4 KKLMEHISIIPDYRQA-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 LM + D R+ H ++L++ I AV+S + EDI +G D+L+Q+ Sbjct: 7 ASLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFL 66 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 NG+ +T R+ + P +F +F W+ + + +DGK R S S Sbjct: 67 VLLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGG 123 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 + AIH++SAF+T +V+GQ K KSNEITAIPELL L I G ++ DAMGCQK+IA Sbjct: 124 ESAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIAR 183 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCD 242 +I QGGDYL AVKGNQ L A E +F + + + D + SHGR ++ V Sbjct: 184 QITDQGGDYLLAVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVAQIASVLP 242 Query: 243 VPDELIDFT 251 + ++D Sbjct: 243 -AEGIVDLA 250 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 252 bits (644), Expect = 8e-66, Method: Composition-based stats. Identities = 77/250 (30%), Positives = 126/250 (50%), Gaps = 8/250 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L+E + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD------KDVIAIDGKIHRHS 116 D GIP HDT RV I P F F+NW + + IA+DGK+ RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 +D+ + +H++SA++T LV+ Q D K E A+P +L L + G ++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK--HDSYAISEKSHGREE 234 ++++A+ I +G YL +K NQ +++ F + + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 TRLHIVCDVP 244 R C Sbjct: 242 RRRVFACPDA 251 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 251 bits (640), Expect = 2e-65, Method: Composition-based stats. Identities = 92/232 (39%), Positives = 128/232 (55%), Gaps = 9/232 (3%) Query: 7 MEHISIIPDYRQA-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 M + I D R+ H +IL++ I AV+S + EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV-----IAIDGKIHRHSYDKS 120 NGIP +T R++ + P +F F W+ + D IAIDGK R S S Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GS 118 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + AIH++SAF+T LV+GQ K KSNEITAIPELL L IKG ++ DAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGR 232 A++I + GDYL VKGNQ +L +A E F + + D + E+ HGR Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAF-IDQHGVESVDRSSRVERGHGR 229 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 250 bits (639), Expect = 3e-65, Method: Composition-based stats. Identities = 70/255 (27%), Positives = 118/255 (46%), Gaps = 8/255 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + + E +PD R A H L++IL + + A + GA D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD----DKDVIAIDGKIHRHSY 117 +NG+P HDT +RV + P F ++F +M + K VIA+DGK R Y Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 + R +++A++ + + ++ NE +L+ +L +KG ++ DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRL 237 + +AE I+ +GGDY+ AVK NQ L + + ++ S + HGR+E R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAA--IRAATRQGKPSTITVDAGHGRKEKRR 239 Query: 238 HIVCDVPDELIDFTF 252 +V VP D F Sbjct: 240 AVVAAVPQMAQDHDF 254 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 250 bits (638), Expect = 4e-65, Method: Composition-based stats. Identities = 93/259 (35%), Positives = 137/259 (52%), Gaps = 14/259 (5%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + +E ++ I D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV------IAIDGKIHRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGK S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 S + A HV++AF++ LV+GQIKTD+KSNEITAIPELL + +K ++ DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAF------EEKFPLKELNNPKHDSYAISEKSHG 231 K+IA KI ++GGDY+ AVKGNQ +L E + K EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REETRLHIVCDVPDELIDF 250 R E R + + Sbjct: 241 RIEKRECYLSNDLSWFEGL 259 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 99/251 (39%), Positives = 131/251 (52%), Gaps = 18/251 (7%) Query: 15 DYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 D R +H+ S I+L+ I AVI GA+ W IEDFG++ F NGIP HDT Sbjct: 25 DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSHDTF 84 Query: 75 ARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSY---------------DK 119 R S + P KF ES+ W+ IAIDGK R +Y D Sbjct: 85 NRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQDKRHRKQGVLPDS 143 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + K +HVISAF+T + +GQ+ T +K NEI IPELL+ML IK II DA+GCQ+ Sbjct: 144 NTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDALGCQRT 203 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP--LKELNNPKHDSYAISEKSHGREETRL 237 IAEK+ K GDY+F VK NQ +L + + + + D Y E+ HGR E+R+ Sbjct: 204 IAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYETHEEGHGRNESRI 263 Query: 238 HIVCDVPDELI 248 C+ P L Sbjct: 264 CYCCNDPGFLG 274 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 247 bits (631), Expect = 2e-64, Method: Composition-based stats. Identities = 80/243 (32%), Positives = 126/243 (51%), Gaps = 4/243 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L +H+S++ D R H L D+L L + AV SG +GW +I+ FGE ++L+++ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 NGIP TIAR++ + P +W+ D ++ K +IAIDGK R + Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLGC--N 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 +H + AF + L + Q K EI + L+ ML+I +I DA+ Q+ E I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVP 244 + GDY+ VK NQ L +A + ++ + ++ + +A SEK HGR E R I +P Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIP 237 Query: 245 DEL 247 +L Sbjct: 238 SKL 240 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 247 bits (631), Expect = 2e-64, Method: Composition-based stats. Identities = 83/228 (36%), Positives = 123/228 (53%), Gaps = 2/228 (0%) Query: 22 VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+++LL T+ +I A +++IE G D+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 CPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIG 141 P +F W+ V AIDGK R S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 Q + KSNEITAIPELL+ L + G I+ DAMG QK IA K+ +G DY+ A+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELID 249 L+ + F +L + + + HGR E R V D L + Sbjct: 181 LHDDVRDFFADPDL-LRECARHDDTCIGHGRIEERTCQVADASAWLTE 227 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 247 bits (631), Expect = 3e-64, Method: Composition-based stats. Identities = 81/246 (32%), Positives = 140/246 (56%), Gaps = 3/246 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+EH++++ + R +H L D++ L I A++SGAEGW DIE +G++ D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP T+AR++ CI E+ + W+ + + K +IA DGK+ R S+ + K A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++ DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV--CDV 243 ++ + VK NQ +L +A + +F + E HGR+E R + Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 244 PDELID 249 P EL + Sbjct: 248 PPELTE 253 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 246 bits (627), Expect = 6e-64, Method: Composition-based stats. Identities = 75/238 (31%), Positives = 117/238 (49%), Gaps = 3/238 (1%) Query: 10 ISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 +PD R H L D+L + + A I GAE D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVI 129 HDT +RV + P F F ++ D+ D V+AIDGK R S+D++ + A+HV+ Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGG 189 SAF++ +++GQ NEI A LL + D+KG ++ DA+ Q+ A+ I ++GG Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDEL 247 D+LF +K N+ L E F + + ++ HGR E R H V L Sbjct: 188 DWLFPLKDNRPALRAEVERYF--ADPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWL 243 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 245 bits (625), Expect = 1e-63, Method: Composition-based stats. Identities = 80/247 (32%), Positives = 124/247 (50%), Gaps = 7/247 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + I D RQA KV H++ ++L++ C+ + E + D+ DF ++ +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 ++G P HD V+ I P E W D + IAIDGK R +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDL----EGRHIAIDGKALRGTHNAETG 116 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 + +H++ A+ + L GQI +KSNEI AIP LL L +KG + DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPKHDSYAISEKSHGREETRLHI 239 +I G DY+ A+K N R ++ + F E L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDE 246 + + D Sbjct: 237 ITEELDW 243 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 242 bits (617), Expect = 1e-62, Method: Composition-based stats. Identities = 90/248 (36%), Positives = 139/248 (56%), Gaps = 6/248 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + + E++S D R A+ +H DI+ L + AVISGA W +I+ FGE H D+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGK RHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNP- 115 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +I DAM QK I Sbjct: 116 ETQSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK-HDSYAISEKSHGREETRLHI 239 AEKI ++ GDY+ +K N + E F + P+ ++Y R + R + Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYR 235 Query: 240 VCDVPDEL 247 V D L Sbjct: 236 KLKVSDWL 243 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 241 bits (616), Expect = 1e-62, Method: Composition-based stats. Identities = 84/248 (33%), Positives = 124/248 (50%), Gaps = 14/248 (5%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M ++ +IIPD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 IP HDT++R S + F E F W+ D V+AIDGK + DKS Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKS 116 Query: 121 RR-----KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 + ++++SA+S + + +GQ K ++KSNE AIPEL+ LD++ II DA+G Sbjct: 117 SNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIG 176 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---KHDSYAISEKSHGR 232 CQK I + I + DY+ K N L E F L E + Y K HGR Sbjct: 177 CQKSITKLIIENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKRYFEENKGHGR 234 Query: 233 EETRLHIV 240 E R + Sbjct: 235 SEYRECVC 242 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 239 bits (611), Expect = 5e-62, Method: Composition-based stats. Identities = 79/244 (32%), Positives = 132/244 (54%), Gaps = 3/244 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 +++H+ I D R EH + DI L + AVISGA+ W +FG ++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP +I R+ + ++ ++W+ +Y + + IAIDGK+ + + S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKGAKA-SASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++ DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC--DV 243 K+GGD + VKGNQ +L +A + +F NNP + + + K HGR E R+ C ++ Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 244 PDEL 247 P E+ Sbjct: 240 PAEI 243 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 234 bits (598), Expect = 2e-60, Method: Composition-based stats. Identities = 67/254 (26%), Positives = 113/254 (44%), Gaps = 6/254 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + + + +PD R A V H L ++L++ +V+ G+ ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD-DKDVIAIDGKIHRHSYDK 119 + ++ IP HDT + V I P +F + D D D+IAIDGK R + D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 ++SA+++ L + + D + E++A E L ++D++GK++ DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHI 239 I GGD+ A+KGNQ L F ++P HGR+ETR + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSDPTA---VTENTGHGRKETRKAV 244 Query: 240 VCDVPDELIDFTFE 253 V F Sbjct: 245 VVSAKALAEYHEFP 258 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 231 bits (589), Expect = 2e-59, Method: Composition-based stats. Identities = 82/237 (34%), Positives = 120/237 (50%), Gaps = 6/237 (2%) Query: 15 DYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 D RQ+WK+ + LS IL L ++G E +++EDF E + Y D G P HDT+ Sbjct: 19 DSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPSHDTL 78 Query: 75 ARVVSCICPAKFHESFINWMLDYHSSDD-KDVIAIDGKIHRHSYDKSRRKGAIHVISAFS 133 RV+S + + E + + S D +I++DGK R ++ + + +H+++A+ Sbjct: 79 ERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG--NRGKNQKPVHIVTAYD 136 Query: 134 TMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLF 193 H L +GQ+ ++KSNEI AIP+LL +DI+ I+ DAMG Q I + I K DY Sbjct: 137 GGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCL 196 Query: 194 AVKGNQGRLNKAFEEKFP---LKELNNPKHDSYAISEKSHGREETRLHIVCDVPDEL 247 AVKGNQ L F L E Y EKS G+ E R + V L Sbjct: 197 AVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWL 253 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 229 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 65/239 (27%), Positives = 111/239 (46%), Gaps = 7/239 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ +PD R +H L +IL + + AV+ GA ++E F + D L+Q+ E Sbjct: 3 FLDVFGEVPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS----DDKDVIAIDGKIHRHSYDKSR 121 G P HDT +RV++ + P +E+F+ +M + K +A+DGK R +Y K R Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 V++ F + + Q ++ E+ A L +L +KG + DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 + ++ GG Y+ A+KGNQ +L + E +HGR E R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKAAAGKATK-FHQTEEDAHGRHEVRRAFV 238 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 228 bits (581), Expect = 1e-58, Method: Composition-based stats. Identities = 100/197 (50%), Positives = 130/197 (65%), Gaps = 13/197 (6%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L L +H + + D RQA KV +KL D+L L + AVISGAEGWE+IEDFG +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM DK V+A+DGK Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TD+KSNEITA+PELL +L+++G ++ DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVKG 197 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVKK 184 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. Identities = 86/254 (33%), Positives = 132/254 (51%), Gaps = 16/254 (6%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL E S IPD+R+A K + HKLSDI++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCICPAK-------FHESFINWMLDYHSSDDKDVIAIDGKIH 113 NGIP T+ R+ I F E+F ++ + +++I IDGK Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCT--QEIICIDGKAE 152 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDA 173 R + K+ R I +SA S + + ++KSNEI A+P L++ +DI GKI+ DA Sbjct: 153 RGTVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADA 210 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGRE 233 M QKDI +KI+++ GD++ +K NQ L E+K +P + E HGR Sbjct: 211 MSMQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKEL---SPVYSYCGEPELGHGRI 267 Query: 234 ETRLHIVCDVPDEL 247 ETR + V D D + Sbjct: 268 ETRSYRVFDGTDLI 281 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 226 bits (577), Expect = 4e-58, Method: Composition-based stats. Identities = 78/284 (27%), Positives = 119/284 (41%), Gaps = 39/284 (13%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDK------------------- 103 P HDT+ R I + + W + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 104 -DVIAIDGKIHRHSYDKSR--------------RKGAIHVISAFSTMHSLVIGQIKTDKK 148 IAIDGK + + + +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDIK-GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP+LL+ +DI+ G ++ DA+G QK I EKI ++ DYL VK N +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPKHDSY---AISEKSHGREETRLHIVCDVPDELI 248 ++ ++D + + HG TR I C P L Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLG 300 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 224 bits (571), Expect = 2e-57, Method: Composition-based stats. Identities = 77/252 (30%), Positives = 124/252 (49%), Gaps = 10/252 (3%) Query: 3 LKKLMEHISIIPDYRQ--AWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K L E + +PDYR+ ++KL DILLL I + DI FG+ + + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHS---SDDKDVIAIDGKIHRHSY 117 G +G+P T+ R+ I E + +H D++ IDGK R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 ++ R I +SA+S + + ++KSNEIT++P+LL+ +D+ G I+ DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRL 237 K I +KI+++GGD+L +K NQ L E+ L E + + + HGR ETR+ Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDVYSEGPFLE---HGRIETRV 252 Query: 238 HIVCDVPDELID 249 + D + D Sbjct: 253 CRIFRGNDLITD 264 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 224 bits (571), Expect = 2e-57, Method: Composition-based stats. Identities = 64/249 (25%), Positives = 106/249 (42%), Gaps = 6/249 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ +PD R A H L ++L++ +V+ GA ++ FG + + + Sbjct: 37 ILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHS-SDDKDVIAIDGKIHRHSYDKSRRKG 124 + +P HDT + V I P +F + D + D DVIA+DGK R + D Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 ++SA++ L + + D + E+ A E L ++ +KGK++ DA+ C + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVP 244 GGD+ A+K NQ L F + +P + HGR ETR V Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEPDAHPSA---LSEDIGHGRTETRKATVVSSK 271 Query: 245 DELIDFTFE 253 F Sbjct: 272 ALAEHHEFP 280 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 224 bits (570), Expect = 3e-57, Method: Composition-based stats. Identities = 74/219 (33%), Positives = 114/219 (52%), Gaps = 7/219 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH------SYD 118 IP HDT R S I P F F NW+ V+AIDGK+ R + Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 + + + ++SA+S + + +GQ+K D KS+EITAIP L+N L++ G I+ DAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 DI + I +Y+ A+K N+ + + ++ + + Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRD 221 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 222 bits (567), Expect = 7e-57, Method: Composition-based stats. Identities = 71/246 (28%), Positives = 126/246 (51%), Gaps = 4/246 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L++H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG D+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 +GIP IA ++ + ++ W+ D K +IA+DGK R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 +H++SAF + + + +KK +E ++++ L + ++ DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV--CDV 243 + D++ +KGNQ A + ++P + HGR+E R + ++ Sbjct: 182 SKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRRVMQIEGNL 240 Query: 244 PDELID 249 P EL + Sbjct: 241 PPELSE 246 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 222 bits (565), Expect = 1e-56, Method: Composition-based stats. Identities = 81/249 (32%), Positives = 123/249 (49%), Gaps = 9/249 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L E + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYD 118 G +P HDT+ R +S + F ++ W+ + S+ I IDGK R Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-VK 122 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 K HV+SAFS + Q+ D+K+NEI AI +LL++LD+ G ++ DA+G Q Sbjct: 123 KLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQT 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLH 238 I E+I +GGDY+ VK NQ + E F + D +E SHGR ETR + Sbjct: 183 AIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRRY 240 Query: 239 IVCDVPDEL 247 P E+ Sbjct: 241 ESILNPLEI 249 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 218 bits (556), Expect = 1e-55, Method: Composition-based stats. Identities = 70/255 (27%), Positives = 124/255 (48%), Gaps = 10/255 (3%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 ++ I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ Sbjct: 3 AEIWNAIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYT 62 Query: 64 FENG-------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 +G +P HDT V I P +F E + +++ + + IAIDGK R Sbjct: 63 KLSGKELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG- 121 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 ++ +++SA+ T H VI I ++ K +E+++I +L+ +L ++ + DA G Sbjct: 122 IKQTANSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGT 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 ++ E I +GG+++ VKGNQ +L + E++F N D + HGR E R Sbjct: 182 YVEVIEMILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSAD--TQEDIGHGRVEKR 239 Query: 237 LHIVCDVPDELIDFT 251 D Sbjct: 240 TVYCITEIKTDDDID 254 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 217 bits (554), Expect = 2e-55, Method: Composition-based stats. Identities = 67/255 (26%), Positives = 116/255 (45%), Gaps = 16/255 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 E+ L+E ++ +PD R V H L+ +L LT CAV++GA + ++ P+ L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFE-------NGIPVHDTIARVVSCICPAKFHESFINWML-DYHSSDDKDVIAIDGKIH 113 P TI RV++ I + W+ + +A+DGK Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIIKTD 172 R + R+ +H+++A + LV+ Q+ +K+NEIT LL+ L D+ G ++ +D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGR 232 A+ Q D A ++ + Y+ VK N +L+ + P +++ HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKS-LPWQQIPLQDR----TRTTGHGR 270 Query: 233 EETRLHIVCDVPDEL 247 E R VC V + L Sbjct: 271 CEIRRLKVCTVNNLL 285 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 212 bits (540), Expect = 9e-54, Method: Composition-based stats. Identities = 72/204 (35%), Positives = 101/204 (49%), Gaps = 1/204 (0%) Query: 38 ISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDY 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F F + Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 ++V+A+DGK R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 LL L + G I+ DAMGCQ IAE+I+ +G D L +K N G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 PKHDSYAISE-KSHGREETRLHIV 240 + HGR R V Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFV 207 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 210 bits (535), Expect = 4e-53, Method: Composition-based stats. Identities = 65/248 (26%), Positives = 117/248 (47%), Gaps = 8/248 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ P + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV-IAIDGKIHRHSYDKS 120 +P TI +V + + +D + +A+DGK R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T KSNEI + LL +DI G ++ DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQ-GGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHI 239 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSDP---VERGHGREEHRSYK 275 Query: 240 VCDVPDEL 247 + V L Sbjct: 276 ILTVARGL 283 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 207 bits (528), Expect = 2e-52, Method: Composition-based stats. Identities = 70/257 (27%), Positives = 107/257 (41%), Gaps = 38/257 (14%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHES 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWMLDYHSSDDK--------------------DVIAIDGKIHRHSYDKSR-------- 121 + W + IAIDGK + + + Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK-GKIIKTDAM 174 +H++SAF + SL +GQ + K NEI AIP+LL+ +DI+ G ++ DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSY---AISEKSHG 231 G QK I EKI ++ DYL VK N +L + E ++ ++D + + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REETRLHIVCDVPDELI 248 TR I C P L Sbjct: 241 FMVTRTCISCSEPSRLG 257 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 205 bits (523), Expect = 9e-52, Method: Composition-based stats. Identities = 81/284 (28%), Positives = 120/284 (42%), Gaps = 39/284 (13%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L+ + + I D RQ KV H+ I++ + V + + W ++ DF DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINW--------------------MLDYHSSDD 102 P HDT+ R +CP + W + + Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 KDVIAIDGKIHRHSYDKSRRK--------------GAIHVISAFSTMHSLVIGQIKTDKK 148 IAIDGK + + ++ RR+ +H++SAFS L +GQ + DKK Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNMLDI-KGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP LL+ LDI +G ++ DAMG QKDI +I K+ YL VK NQ L + Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 208 ---EKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELI 248 F L N + + E HG R VC L Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLHSLG 300 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 200 bits (508), Expect = 4e-50, Method: Composition-based stats. Identities = 65/242 (26%), Positives = 108/242 (44%), Gaps = 8/242 (3%) Query: 7 MEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFEN 66 + + I D R H L+++L L + A + GA+ +I +F E LK+ + Sbjct: 5 LSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-----KDVIAIDGKIHRHSYDKSR 121 G P HDT +R+ I P + + ++ + V+A+DGK R Y+K R Sbjct: 64 GCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGR 123 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 ++S + L + + + S+E+ A LL +D+KG I+ DA+ C+ D A Sbjct: 124 AFMPPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCRPDTA 182 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC 241 + + + Y A+K N+GRL E F + + E HGR ETR V Sbjct: 183 KALIGRKAHYALALKANRGRLFACAEAGFVAADAAG-DLAFHETRETGHGRLETRRASVL 241 Query: 242 DV 243 + Sbjct: 242 PL 243 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 200 bits (508), Expect = 5e-50, Method: Composition-based stats. Identities = 71/173 (41%), Positives = 104/173 (60%), Gaps = 3/173 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ +PD R+ + H+L ++LL IC VISGAE W + + + D+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 +GI HDT RV S + ++F F+ W+ S + +AIDGK R S+D + + Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGA--RS 123 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+G I DAMGC Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCH 176 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 198 bits (504), Expect = 1e-49, Method: Composition-based stats. Identities = 59/246 (23%), Positives = 112/246 (45%), Gaps = 9/246 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ-Y 61 + L+E + + D+R+ H L +L++ I + G G+ ++ +F + + L Q + Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINW-MLDYHSSDDKDVIAIDGKIHRHSYDK- 119 +P + TI RV+ + + F W + +Y DD + + +DGK +++ Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -SRRKGAIHVISAFSTMHSLVIGQIKTDKK-SNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 + ++ I +S FS LV+ + + K +EI ++ ++ K+ DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRL 237 K I K DY+ VKGNQ L K ++ ++ + + SHGR+ +R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDL----SNSSKPESCFLEQDNSHGRKISRK 236 Query: 238 HIVCDV 243 V V Sbjct: 237 IEVFKV 242 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 61/261 (23%), Positives = 113/261 (43%), Gaps = 22/261 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + L+E ++ +PD R+ V ++ + +L + +CA++SGA + I ++ P + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-------------KDVIAI 108 +P TI RV+ + A + W+ + D + V+A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGK 167 DGK R + +H++ +V+ Q+ D+K+NEI +L+ + D+ Sbjct: 167 DGKAMRAT---RHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 IIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISE 227 +I DAM Q A+ + +G L VK NQ ++ + P K++ + + Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKT-LPWKDVPV----GHTTTG 278 Query: 228 KSHGREETRLHIVCDVPDELI 248 + HGR ETR VP L Sbjct: 279 RGHGRIETRTLKAVTVPAGLG 299 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 197 bits (502), Expect = 2e-49, Method: Composition-based stats. Identities = 55/256 (21%), Positives = 108/256 (42%), Gaps = 13/256 (5%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD+R V ++L+ +L L + I+G + + ++ P + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDY--HSSDDKDVIAI--DGKIHRHSY 117 F +P TI R+V P + ++ W +D ++A+ DGK+ + + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRKGAIH--VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 + + V+ A +G + +EI ++ L+N + ++ TD + Sbjct: 144 SRPPQGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLVTTDCLH 202 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 + +A I+ +GG +LF++KGNQ + P E N + EK+HGR E Sbjct: 203 AHEPLARAIRAKGGHWLFSIKGNQPTVRAKL-AGLPWDEFGN----QHVTREKAHGRIEE 257 Query: 236 RLHIV-CDVPDELIDF 250 R L+ F Sbjct: 258 RALKALTPSAPSLVGF 273 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 197 bits (500), Expect = 4e-49, Method: Composition-based stats. Identities = 61/229 (26%), Positives = 101/229 (44%), Gaps = 5/229 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+E ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 NG +P +TIA ++ + P + W+ D H D + +A+DGK S + + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGS--RDGQV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD-IKGKIIKTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++ DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHG 231 +Q +GGD + K NQG L E F + G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRG 229 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 194 bits (492), Expect = 3e-48, Method: Composition-based stats. Identities = 65/184 (35%), Positives = 94/184 (51%), Gaps = 10/184 (5%) Query: 68 IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH------SYDKSR 121 IP HDT R S I P F F NW+ V+AIDGK+ R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D KSNEITAIP L+N L++ G I+ DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLH 238 + I + +Y+ A+K N+ + L K + + K+ + + HGR ETR Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCD 242 V Sbjct: 183 TVVS 186 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 193 bits (490), Expect = 6e-48, Method: Composition-based stats. Identities = 59/264 (22%), Positives = 95/264 (35%), Gaps = 27/264 (10%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L+E + +PD R+ V L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYH--------------SSDDKDVIA 107 P T RV+ P E+ W VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKIHRHSYDKSRRKGAI--HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML--- 162 DGK R + ++ V+ V+ + +EI A+ ++ L Sbjct: 151 ADGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVA-CEPVNDGDEIGAVRTVMGRLADR 209 Query: 163 --DIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH 220 + G ++ TDA Q + E++ GG +L VK NQ R+ P ++ Sbjct: 210 WGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRILAKVRA-LPWAQVRAQD- 267 Query: 221 DSYAISEKSHGREETRLHIVCDVP 244 K+HGR ETR V P Sbjct: 268 ---TCRGKAHGRAETRTVRVVQAP 288 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 190 bits (482), Expect = 5e-47, Method: Composition-based stats. Identities = 57/228 (25%), Positives = 103/228 (45%), Gaps = 14/228 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ LM+ +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENGI-------PVHDTIARVVSCICPAKFHESFINW----MLDYHSSDDKDVIAIDG 110 F P T+ R + I + W + D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIK 170 K R + K++ IH ++AF +V+ Q D+K+NEI + LL ++I+G+I+ Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKIQK-QGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 DA+ Q + A I + + DY+F VK NQ + + E P + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIES-LPWEAFPP 446 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 189 bits (481), Expect = 6e-47, Method: Composition-based stats. Identities = 66/254 (25%), Positives = 114/254 (44%), Gaps = 18/254 (7%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDF 57 ++ L+ + I D R+A + LS +L + A ++GA G +I DFG+ Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQYGDFENGI---PVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD--VIAIDGKI 112 L D G P I + + A +F W+ + + + V+A+D K+ Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 HRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKII-K 170 R ++ + ++ + ++SA LV GQ++ +NEIT + LL L DI G ++ Sbjct: 141 LRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 TDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNK-AFEEKFPLKELNNPKHDSYAISEKS 229 DA+ Q + A + + G DY VKGNQ L + FE+ PL + + + E+ Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYRKTFEQTLPLLQKPP----QHEVEERG 254 Query: 230 HGREETRLHIVCDV 243 HGR + + Sbjct: 255 HGRIKKWQAWTTEA 268 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 188 bits (477), Expect = 2e-46, Method: Composition-based stats. Identities = 61/218 (27%), Positives = 98/218 (44%), Gaps = 3/218 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + L E +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDK 119 F G P T++R + P + + W+ + IA+DGK R S + Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGS--R 118 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + H+++A++ V+ Q++ D K+NE A LL +L + G ++ DAM CQ+D Sbjct: 119 DGQVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 +A + G DY+ K NQ L + E ++ Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFEDAAR 216 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 183 bits (465), Expect = 5e-45, Method: Composition-based stats. Identities = 73/158 (46%), Positives = 99/158 (62%), Gaps = 1/158 (0%) Query: 94 MLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 M H +V+AIDGK R SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+ DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 ELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFT 251 D+ I EK GR E R + V D + DF+ Sbjct: 121 RRAPIDRDTCQI-EKQKGRVEARTYHVLSASDLIRDFS 157 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 183 bits (464), Expect = 6e-45, Method: Composition-based stats. Identities = 60/237 (25%), Positives = 105/237 (44%), Gaps = 18/237 (7%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHE 88 +L+ + G + +TH + L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 SFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKK 148 +F+ W+ + S + +A+DGK + +K++ + +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRNT-HLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEE 208 +NEIT IPELL +LDI G I+ DA+G Q I E+I +QGG + VK NQ + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPLKELNNPK-----------------HDSYAISEKSHGREETRLHIVCDVPDELI 248 E + + ++ EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLT 236 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 182 bits (463), Expect = 7e-45, Method: Composition-based stats. Identities = 67/250 (26%), Positives = 108/250 (43%), Gaps = 11/250 (4%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L+ + + D R H L ++L L + A + GA+ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD----KDVIAIDGKIHRHSYD 118 +G P HDT +RV + P + +F +M + K V+AIDGK R YD Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 K R ++S + I ++ +EI A +L L +KG + DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLH 238 +A+ + Y +K N G L +A E F + E+ HGREE R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGFA----AVTDLAVFETRERGHGREEQRRA 234 Query: 239 IVCDVPDELI 248 V V D L+ Sbjct: 235 SVLPV-DRLV 243 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 181 bits (460), Expect = 2e-44, Method: Composition-based stats. Identities = 54/227 (23%), Positives = 102/227 (44%), Gaps = 15/227 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGE--THPDFLK 59 +++ L + +PD R +H L IL + + AV++ A+ + + ++ T + Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 60 QYGDFENGI-----PVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHR 114 F P T+ RV+ + W+L + +A+DGK+ + Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAWLLGIAGF---EAVAVDGKVLK 335 Query: 115 HSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAM 174 + + + +H++SAF I Q + +K+NEI + LL +DI+ K++ DA+ Sbjct: 336 GAVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADAL 393 Query: 175 GCQKDIAE-KIQKQGGDYLF-AVKGNQGRLNKAFEEKFPLKELNNPK 219 Q+ A ++ + DYLF AVKGNQ +L + P + + Sbjct: 394 HTQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLI-CLPWGDFPPQR 439 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 178 bits (451), Expect = 2e-43, Method: Composition-based stats. Identities = 58/261 (22%), Positives = 111/261 (42%), Gaps = 21/261 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHP-DFLKQ 60 E++ L + ++ +PD R + H+L IL L+ AV +G + E+I + P L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 61 YGDFENGI------PVHDTIARVVSCICPAKFHES---FINWMLDYHSSDDKDVIAIDGK 111 G + + P DT+ RV+S + + + F + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 IHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGK 167 R + R H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGRAP--HLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IIKTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAIS 226 ++ DA+ + A+ I + G ++F VK N L+ + ++ ++ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI----GHSAE 271 Query: 227 EKSHGREETRLHIVCDVPDEL 247 ++HGR E R + + + Sbjct: 272 GRAHGRFERRTIQLAQASEAI 292 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 177 bits (449), Expect = 3e-43, Method: Composition-based stats. Identities = 57/223 (25%), Positives = 98/223 (43%), Gaps = 19/223 (8%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETH-PDFLKQYGDFENG-- 67 + + D R+A + H +LL+ + V++G +E I + + L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 68 ----IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 P TI R++S P + +++ + S IAIDGK R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYIVAHSS---GRAIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTDK-KSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++ DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAI 225 +I+++GGDY+F VK N+ L +P D Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDDLLRT 440 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 177 bits (448), Expect = 4e-43, Method: Composition-based stats. Identities = 61/256 (23%), Positives = 110/256 (42%), Gaps = 20/256 (7%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDF-LKQYG 62 L+ ++ +PD R V H L +L + AV++GA + ++ P L + G Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 63 DFE------NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS--DDKDVIAIDGKIHR 114 F + P T R+++ + ++ W+L + + V ++DGK R Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAM 174 S + +H+++ V+GQ+ D K+NE+T LL LD+ ++ DA+ Sbjct: 147 GSGPAGEQ---VHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIA-EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGRE 233 Q++ A + + Y+F VK NQ RL + + P ++ S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKT-LPWTKIPIQD----ETSTRGHGRY 258 Query: 234 ETRL--HIVCDVPDEL 247 + R + C P L Sbjct: 259 DIRRLQAVTCTGPLAL 274 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 173 bits (438), Expect = 6e-42, Method: Composition-based stats. Identities = 62/268 (23%), Positives = 108/268 (40%), Gaps = 34/268 (12%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISG-----AEGWEDIEDFGETHPDFL 58 + E ++ IPD+R A + + L + + +CAV + A E + T L Sbjct: 22 AGIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRL 81 Query: 59 KQYGDFENG--IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVI---------- 106 + + +G +P TI R ++ + + ++ L +D D + Sbjct: 82 RLPWNPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAG 141 Query: 107 ---------AIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 A+DGK R + K +H++ + ++GQ + D KSNE T Sbjct: 142 DQAVPVRAYAVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRA 199 Query: 158 LLNMLDIKGKIIKTDAMGC-QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 LL L++ G + DA+ + ++ + ++ YL K NQ +L AF P E+ Sbjct: 200 LLAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLR-AFLAALPWTEIP 258 Query: 217 NPKHDSYAISEKSHGREETRLHIVCDVP 244 ++ HGREETR V V Sbjct: 259 TAD----LTRDRGHGREETRTLKVATVT 282 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 172 bits (437), Expect = 8e-42, Method: Composition-based stats. Identities = 53/165 (32%), Positives = 82/165 (49%), Gaps = 3/165 (1%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVI 106 + + L+ + NG P DT RV+ I P + + + S + I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKG 166 AIDGK + S K+ H++SA+ L + Q +K NE+ AIPE+L+ LD+ G Sbjct: 61 AIDGKRLKGSKKKTGS---THILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 KIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP 211 +I DAMG Q +IAE+I + DY+ ++KGNQ L + + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCFT 162 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 170 bits (430), Expect = 5e-41, Method: Composition-based stats. Identities = 63/140 (45%), Positives = 91/140 (65%), Gaps = 4/140 (2%) Query: 106 IAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK 165 +AIDGK R S+D + + IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHDGA--RSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD--SY 223 G I DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + + + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AISEKSHGREETRLHIVCDV 243 + ++K+HGR ETR + + Sbjct: 119 SQTDKNHGRIETRRCVATND 138 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 169 bits (429), Expect = 7e-41, Method: Composition-based stats. Identities = 57/225 (25%), Positives = 93/225 (41%), Gaps = 12/225 (5%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHE 88 +L + + A +G G+ + T D + P T V+S + PA + Sbjct: 3 LLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLNA 62 Query: 89 SFINWMLDYHSSDDKD---VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKT 145 ++ + +S D IA+DGK+ R + + A H++S F+ LV+GQ+ Sbjct: 63 RMGSYFTAHVASSDPSGLVPIALDGKMLRGAL--RAKATATHLVSVFAHRARLVLGQLAV 120 Query: 146 DKKSNEITAIPELLNMLDIKGK-IIKTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRLN 203 +KSNEI + LL +L + ++ DAM Q A+ I YL VK NQ ++ Sbjct: 121 AEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKIL 180 Query: 204 KAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELI 248 P E+ D + HGR ETR + + Sbjct: 181 ARI-TALPWAEVPAAATD----DSRGHGRVETRTLQIITAARGIG 220 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 165 bits (417), Expect = 1e-39, Method: Composition-based stats. Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 3/142 (2%) Query: 101 DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 VIAI+GK R + + A+H +SA++ + L +GQ+ +KSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 MLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF-PLKELNNPK 219 L ++G ++ DA+GCQ +AE+I GGDY+ AVK NQ L A + F L +P Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 220 HDS--YAISEKSHGREETRLHI 239 + + +K HGR ETR Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 163 bits (412), Expect = 6e-39, Method: Composition-based stats. Identities = 62/293 (21%), Positives = 109/293 (37%), Gaps = 50/293 (17%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVI-SGAEGWEDIEDFGE-THPDFLK 59 +++ L+ + D R A V +++S +L L +CA+ +G + ++ P+ L Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 60 QY------GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS------------- 100 + IP T+ V+ + P + + + + S+ Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGG 149 Query: 101 -------------------DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIG 141 + IA+DGK R + + + V+SA + + Sbjct: 150 IEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLA 207 Query: 142 QIKTDKKSNEITAIPELLNMLDI---KGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + K+NEI LL+ LD KG ++ DA+ Q+D A + ++G YL +K N Sbjct: 208 SREIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNN 267 Query: 199 QGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFT 251 Q + P KE+ D + HGR E RL V V L Sbjct: 268 QRGQARQL-HALPWKEIPVIHRD----DARGHGRHEQRLVQVVTVNGLLFPHA 315 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 162 bits (409), Expect = 1e-38, Method: Composition-based stats. Identities = 55/225 (24%), Positives = 92/225 (40%), Gaps = 12/225 (5%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHE 88 +L + + A + G+ + T D + P T V+S + PA + Sbjct: 3 LLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLNA 62 Query: 89 SFINWMLDYHSSDDKD---VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKT 145 ++ + +S D IA+DGK+ R + + A H++S F+ LV+GQ+ Sbjct: 63 RMGSYFTAHVASSDPSGLVPIALDGKMLRGAL--RAKATATHLVSVFAHRARLVLGQLAV 120 Query: 146 DKKSNEITAIPELLNMLDIKGK-IIKTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRLN 203 +KSNEI + LL +L + ++ DAM Q A+ I YL VK NQ ++ Sbjct: 121 AEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKIL 180 Query: 204 KAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELI 248 P E+ D + HGR +TR + + Sbjct: 181 ARI-TALPWAEVPAAATD----DSRGHGRVKTRTLQIITAARGIG 220 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 162 bits (409), Expect = 1e-38, Method: Composition-based stats. Identities = 51/186 (27%), Positives = 90/186 (48%), Gaps = 4/186 (2%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ- 60 L+ + +PD R+A + L +L+ T+ A++SGA + I F E + L Sbjct: 12 PFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTHH 71 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINW---MLDYHSSDDKDVIAIDGKIHRHSY 117 +G PV +T+ V+ + ++F +L +K V+A+DGK R S+ Sbjct: 72 FGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGSF 131 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 D + A ++AF + ++V+ + D KSNEI A +++ L + G + DAM CQ Sbjct: 132 DHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHCQ 191 Query: 178 KDIAEK 183 K + + Sbjct: 192 KKHSRR 197 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 58/153 (37%), Positives = 89/153 (58%), Gaps = 3/153 (1%) Query: 102 DKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNM 161 D+IA+DGK R SYD++ K AIH++SA+ST + LV+GQ+KT++KSNE TAIP+L + Sbjct: 6 PGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKLFTL 65 Query: 162 LDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD 221 L ++ + DA+G Q+DIA++I + DYL VK NQ L++ + + E D Sbjct: 66 LALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGFTED 125 Query: 222 SY-AISEKS--HGREETRLHIVCDVPDELIDFT 251 +++E+ HGR + V L Sbjct: 126 FTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALA 158 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 154 bits (389), Expect = 3e-36, Method: Composition-based stats. Identities = 52/129 (40%), Positives = 78/129 (60%), Gaps = 1/129 (0%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 ++H+++A+ + +L++GQ+K D KSNEITAIP+LL ML ++G I+ DAMGCQK IA++ Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP-KHDSYAISEKSHGREETRLHIVCD 242 I + DY+ AVK NQ L + + F ++N H + + HGR ETR + Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYSTIV 120 Query: 243 VPDELIDFT 251 D L T Sbjct: 121 GDDLLAGIT 129 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 58/293 (19%), Positives = 107/293 (36%), Gaps = 50/293 (17%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVIS-GAEGWEDIEDFGETHPDF------ 57 L++ ++I D R H L+ IL + CA ++ G + IE + + P Sbjct: 29 DLIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLH 88 Query: 58 -LKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD------------ 104 + + P TI RV++ + + ++ + Sbjct: 89 IWRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRR 148 Query: 105 ---------------------VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQI 143 A+DGK + + + +H+IS + + + V Q Sbjct: 149 TEREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQR 206 Query: 144 KTDKKSNEITAIPELLNML---DIKGKIIKTDAMGCQKDIAE-KIQKQGGDYLFAVKGNQ 199 + KS+EI A+ LL D+ G +I DA+ Q+ A I++ Y+ VK NQ Sbjct: 207 QIPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQ 266 Query: 200 GRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFTF 252 L+ + + ++ + + HGR E R+ P + IDF + Sbjct: 267 PTLHATAITALTGTDTDFAAV-THRETHRGHGRTEYRILR--TAPADGIDFPY 316 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 152 bits (383), Expect = 1e-35, Method: Composition-based stats. Identities = 49/180 (27%), Positives = 82/180 (45%), Gaps = 4/180 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+E ++ +PD+R A + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 -ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDK--SR 121 P T RV+ I F NW+ ++D + +DGK + + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 122 RKGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + I+V+S FS + I Q +K+ +EI + LL LD++G + D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 147 bits (371), Expect = 3e-34, Method: Composition-based stats. Identities = 45/174 (25%), Positives = 79/174 (45%), Gaps = 3/174 (1%) Query: 20 WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVV 78 H L +L L AV+ G + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSL 138 I P + + W+ + D + +A+DGK R S + H ++A++ + Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGS--RDGDVPGPHRVAAYAPHAAA 119 Query: 139 VIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYL 192 V+GQI+ D ++NE A LL ++ + G ++ A C +D+A + GG Y+ Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYV 173 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 147 bits (370), Expect = 4e-34, Method: Composition-based stats. Identities = 50/201 (24%), Positives = 76/201 (37%), Gaps = 19/201 (9%) Query: 50 FGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAID 109 FG + +LK GI H T + V C+ F + + Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAAL------------PKPLQRA 90 Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ + NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 KTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKS 229 DA+ C+ D A I GGDY A+K NQ L + E + +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGREETRLHIVCDVPDELIDF 250 H R E R + V D IDF Sbjct: 206 HDRCERRRACIVAVND--IDF 224 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 5/180 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL-KQY 61 + L + + IPD+R+A L +LL +I A++SGA + I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSR 121 G P + +I + + F ++ VIA+DGK R S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAH--AARLAEGAAVIALDGKTLRGSLDRFE 118 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTD--KKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + A V+SAF+T +V+GQI + K +EI A L+ L + G++ DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 140 bits (353), Expect = 4e-32, Method: Composition-based stats. Identities = 43/187 (22%), Positives = 83/187 (44%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L + +S +PD R A + L +L L + A +S + +E F +P L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 P H I ++ + P K + + +D +V+ +DGK R S + Sbjct: 60 RKAPGHTAITLLLHRLDPEKLQAALGQ---VFPEADLGEVLVVDGKHLRGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML---DIKGKIIKTDAMGCQKDIAE 182 + ++ + + Q + + + E A ELL+ L +++GK++ DA ++A Sbjct: 115 VKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAA 172 Query: 183 KIQKQGG 189 +++K+GG Sbjct: 173 RVRKKGG 179 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 137 bits (346), Expect = 3e-31, Method: Composition-based stats. Identities = 50/127 (39%), Positives = 71/127 (55%), Gaps = 3/127 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 G H++SA++T H + +G + T++KSNEITAI LL L K ++ DAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFE---EKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 I GGD++ AV+ NQ +L A EK E +H ++ HGR + R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 CDVPDEL 247 VP + Sbjct: 122 AQVPPDF 128 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 137 bits (345), Expect = 4e-31, Method: Composition-based stats. Identities = 50/116 (43%), Positives = 66/116 (56%), Gaps = 3/116 (2%) Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYL 192 S +LV+GQ K + KSNEITAIP L+ ML+I+ II DAMGCQK+I I+K+ GDY+ Sbjct: 28 SLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYI 87 Query: 193 FAVKGNQGRLNKAFEEKF---PLKELNNPKHDSYAISEKSHGREETRLHIVCDVPD 245 +K NQ L + +E F +E + +H Y E H R E R I V Sbjct: 88 ITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSS 143 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 137 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 6/190 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ H+ IPD R V +LL+ + ++S E D+E F H L + Sbjct: 12 DLISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGI 71 Query: 65 ENGIPVHDT-IARVVSCICPAKFHESFINWMLDY--HSSDDKDVIAIDGKIHRHSYDK-- 119 E P D+ + A + +W L + D D + DGK R S + Sbjct: 72 ELKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTS 131 Query: 120 SRRKGAIHVISAFSTMHSLVIGQ-IKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 I ++ +S + I Q + +E + +LL LD++G +I+ DA+ Q+ Sbjct: 132 GGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 Query: 179 DIAEKIQKQG 188 Q +G Sbjct: 192 AFFGSSQSRG 201 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 136 bits (343), Expect = 6e-31, Method: Composition-based stats. Identities = 51/169 (30%), Positives = 85/169 (50%), Gaps = 13/169 (7%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL E S IPD+R+A K + HKL D+++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCICPAK-------FHESFINWMLDYHSSDDKDVIAIDGKIH 113 NGIP T+ R+ I F E+F +L + ++++ IDGK Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCA--QEIVCIDGKAE 152 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML 162 R + K+ R I +SA S + + ++KSNEI A+P L++ + Sbjct: 153 RGTVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 43/112 (38%), Positives = 67/112 (59%), Gaps = 3/112 (2%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K + SNEITAIPELL +L++ G I++ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPKHDSYAISEKSHGREETR 236 DY+ +K NQG L ++ E+ F +H +Y E HG E R Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIR 112 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 132 bits (333), Expect = 9e-30, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 86/187 (45%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L E +S IPD R A ++ L +L L + A +S + +E F +P L G Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 P H + ++ + P K E+ + + +D +V+ +DGK + S + Sbjct: 60 RKPPGHTILTLLLHRLDPEKLQEAL---LQVFPGADLGEVLVVDGKHLKGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML---DIKGKIIKTDAMGCQKDIAE 182 + ++ + + Q K + + ++ A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGREDQ--ALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 37/157 (23%), Positives = 73/157 (46%), Gaps = 8/157 (5%) Query: 98 HSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 ++ + IA+DGK + S + + H++SA + + + +++ K+NE T Sbjct: 126 ATAGPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKP 183 Query: 158 LLNMLDIKGKIIKTDAMG-CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 LL LD+ ++ DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 184 LLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQL-ATLPWRDIP 242 Query: 217 NPKHDSYAISEKSHGREETRLHIVCDVPDELIDFTFE 253 +A SE HGR E+ C +PDEL + Sbjct: 243 V----QHAASEVGHGRRESSSIKTCAIPDELGGIAYP 275 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 122 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 49/108 (45%), Positives = 60/108 (55%), Gaps = 3/108 (2%) Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 + T+ KSNEITAIP LL L+ K ++ DAMGCQKDIA I GGD++ AVK NQ +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDEL 247 A EK EL +H +Y HGR + R H V VP Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPGF 108 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 41/109 (37%), Positives = 63/109 (57%), Gaps = 3/109 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 +E S IPD R +H +I+ L + +V++GA+ + +IEDF E H D+LK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD---DKDVIAIDGK 111 NGIP HDT +RV S I PA F +SF+ W+ + + + I ++ K Sbjct: 65 NGIPSHDTFSRVFSAINPASFQDSFLIWLKAINDAFMYASQRPICLNFK 113 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. Identities = 36/96 (37%), Positives = 49/96 (51%), Gaps = 2/96 (2%) Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G + DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPKHDS--YAISEKSHGREETRLHIVCDVPDEL 247 + + +K HGR ETR+ V + L Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWL 96 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 103 bits (258), Expect = 5e-21, Method: Composition-based stats. Identities = 33/89 (37%), Positives = 50/89 (56%), Gaps = 1/89 (1%) Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 S A+H++SAF + +V+ Q+ +KSNEI A ELL LDI G + DAM Q++ Sbjct: 3 SETVKAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQRE 62 Query: 180 IAE-KIQKQGGDYLFAVKGNQGRLNKAFE 207 A ++ + D++ VK NQ L +A Sbjct: 63 HARFAVEDKRADFVMTVKDNQPELREALA 91 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 101 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 41/169 (24%), Positives = 64/169 (37%), Gaps = 8/169 (4%) Query: 77 VVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMH 136 ++ + F S +K + DGK R S + +++G V+ Sbjct: 28 LLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGSIESGKKRGQA-VVQIVHHHS 86 Query: 137 SLVIGQIKTDK-KSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAV 195 I Q D K +EI + LL+ D+ + I DA+ E I K GG +L + Sbjct: 87 GEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALHLCPSTTEMITKAGGVFLIGL 146 Query: 196 KGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVP 244 K NQ L + P D + +HGR E R + + DV Sbjct: 147 KENQPTLLAHM------TDCALPPIDQKTTFDFNHGRVEQRKYWLYDVS 189 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 32/150 (21%), Positives = 58/150 (38%), Gaps = 5/150 (3%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAK 85 L+ +L L V++G + + + ++ P L GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FHESFINWMLDYHSSDDKDV--IAIDGKIHRH--SYDKSRRKGAIHVISAFSTMHSLVIG 141 E+ W+ + D +A DGK + S+ ++ V+ A + G Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITAG 167 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIIKT 171 + +EI A+ L LD+ ++ T Sbjct: 168 HQRVVG-GDEIAALEALAGRLDLTDVLVTT 196 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 35/75 (46%), Positives = 53/75 (70%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++++E + + D R A + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 42/105 (40%), Positives = 61/105 (58%), Gaps = 5/105 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++H I D R +H L +I+LL I AV+SG+EGWE IE+FG D+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDG 110 GIP HDTIARV+ + K E I ++ +D ++A+ G Sbjct: 67 AGIPRHDTIARVICRL---KADEKEIAKLIVKQKAD--YILALKG 106 Score = 63.4 bits (153), Expect = 7e-09, Method: Composition-based stats. Identities = 20/67 (29%), Positives = 29/67 (43%), Gaps = 3/67 (4%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPKHDSYAISEKSHGREE 234 K+IA+ I KQ DY+ A+KG+ L E + + D + + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 TRLHIVC 241 TR Sbjct: 147 TRRCQQV 153 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 100 bits (250), Expect = 4e-20, Method: Composition-based stats. Identities = 44/105 (41%), Positives = 63/105 (60%), Gaps = 5/105 (4%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++H + D R +H L DI+LL I AV+SG+EGWEDIE+FG D+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDG 110 GIP HDTIARV+ + K E I ++ +D ++A+ G Sbjct: 67 AGIPRHDTIARVICRL---KADEKEIAKLIVKQKAD--YILALKG 106 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 100 bits (248), Expect = 6e-20, Method: Composition-based stats. Identities = 28/131 (21%), Positives = 54/131 (41%), Gaps = 6/131 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L ++ +PD R + L IL + +CAV++GA + I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLD------YHSSDDKDVIAIDGKIHRHSY 117 F + +P T+ R++ I + W+ VIA+DGK+ R + Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKSRRKGAIHV 128 ++ A+ + Sbjct: 149 LRAAGPSALGL 159 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 98.9 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 43/85 (50%) Query: 7 MEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFEN 66 ++H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSCICPAKFHESFI 91 GIP HDT RV+ + P + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 98.5 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 27/74 (36%), Positives = 49/74 (66%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + +++H S + D RQ+W+V + L +I LL +CA +SG E + +I +G+ +FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTI 74 + +E G+P HDT+ Sbjct: 77 FLPYERGLPAHDTL 90 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 95.8 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 64/151 (42%), Gaps = 9/151 (5%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L ++ + +PD +A H+L +L L A + G +G++ + ++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 62 --GDFENG---IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 NG +P I + + P W +S+ + +A+DGKI + Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAWQAAQLNSE--EALAMDGKIMKGG 124 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDK 147 D + + H++S + Q K+ + Sbjct: 125 VDHTGAQT--HIVSLIGHESKHCVAQKKSAR 153 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 92.3 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 67/208 (32%), Gaps = 34/208 (16%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICA-VISGAEGWEDIEDFGETHPDFLKQYG 62 + + E + + D R + + + +C+ +G + + + Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 63 DFEN------GIPVHDTIARVVSCICPAKFHESFINWML--------------------- 95 +P TI + + + + ++ L Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAA 141 Query: 96 ----DYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNE 151 + + +A+DGK RH+ K +H++ S ++ Q++ + K+NE Sbjct: 142 PVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAKTNE 199 Query: 152 ITAIPELLNMLDIKGKIIKTDAMGCQKD 179 LL LD+ ++ DA+ + Sbjct: 200 TAVFRRLLRPLDLTNVLVTADALHTVRA 227 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 91.9 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 38/81 (46%), Positives = 52/81 (64%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + + E++S D R A+ +H DI+ L + AVISGA W +I+ FGE H D+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCI 81 Y FE GIPV DTIARV+ I Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 91.2 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 60/148 (40%), Gaps = 9/148 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ- 60 +++ L ++ + D R+ H++S +L + A + G +G++ I + +Q Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 61 -YGDFENG---IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 ENG IP I V+ P + + + D D +A DGK +++ Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNEDQGLEDTC--LAFDGKTMKNA 331 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIK 144 D++ R+ H+ S Q K Sbjct: 332 IDENARQT--HIASVVGHESKTTHTQKK 357 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 90.8 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 89/209 (42%), Gaps = 14/209 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK-- 59 + + +++ IPD R+ K +H+ D+LL+ + AV SG + + + FL Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 60 ---QYGDFENGIPVHDTIARVVSCI--CPAKFHESFINWMLDYHSSDDKD-----VIAID 109 + E +P T+ R+ + ++ ++W + + K+ +A+D Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVD 124 Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKI 168 GK R + R + A+ +SA L +G Q D ++ + + L + + Sbjct: 125 GKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGV-DWV 183 Query: 169 IKTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 + DA C +++A + +Q G A KG Sbjct: 184 LTGDAALCTQELAAVVVEQKGGICSASKG 212 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 90.0 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 27/129 (20%), Positives = 56/129 (43%), Gaps = 6/129 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L ++ + +PD R+A H+L + LT A + G +G++ + ++ + +Q Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 62 --GDFENG---IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 NG +P I + + P W +S D + +A+DGKI + Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGG 177 Query: 117 YDKSRRKGA 125 D + + Sbjct: 178 VDHTGAQTQ 186 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 89.6 bits (221), Expect = 8e-17, Method: Composition-based stats. Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 3/79 (3%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH---DSYAISEKSH 230 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F N + D KSH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GREETRLHIVCDVPDELID 249 GR E+R V L D Sbjct: 61 GRIESRRCWVGYDALPLTD 79 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 89.3 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 29/117 (24%), Positives = 49/117 (41%), Gaps = 6/117 (5%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L+ S I D R+ + L+ +LL T+ A+++GA + ++ F TH D L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 F-ENGIPVHDTIARVVSCICPAKFHESFINWMLDYH-----SSDDKDVIAIDGKIHR 114 P + T+ ++ I + +F + L + IAIDGK Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWC 119 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 88.9 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 48/118 (40%), Gaps = 9/118 (7%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH 115 + P T+ RV+ I + NW+L +A+DGK Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSL--GLSPAALAVDGKTLAG 130 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 88.5 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 4/82 (4%) Query: 170 KTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL---KELNNPKHDSYAIS 226 + D +GCQK IA+ I +Q DYL AVK NQ L++A F D Sbjct: 6 RCDGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKI 65 Query: 227 EKSHGREETRLHIV-CDVPDEL 247 K GR E R V ++PD + Sbjct: 66 NKGPGRLEQRRCWVGYEIPDTI 87 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 88.1 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 28/96 (29%), Positives = 42/96 (43%), Gaps = 4/96 (4%) Query: 155 IPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +L +++ GK I DA+ QK +AE I + YLF VK NQ L + F ++ Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEHRK 61 Query: 215 LNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDF 250 D HGR +TR +E ++F Sbjct: 62 EP----DYCLQDPPGHGRIDTRSIWTTTELNEYLEF 93 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 86.2 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 33/129 (25%), Positives = 54/129 (41%), Gaps = 2/129 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF- 64 L ++ IPD+R+A + L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 3 LKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQLH 62 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV-IAIDGKIHRHSYDKSRRK 123 PVH +I + + +F + IA+DGK R + + R Sbjct: 63 WKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSRT 122 Query: 124 GAIHVISAF 132 SA Sbjct: 123 ARPLRYSAH 131 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 85.8 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 2/88 (2%) Query: 161 MLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH 220 M +KG ++ DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSYAISEKSHGREETRLHIVCDVPDE 246 + E SHGR R V + E Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLTPE 88 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 85.4 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 28/69 (40%), Positives = 42/69 (60%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ H + I D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIP 69 G G+P Sbjct: 72 KGILTEGVP 80 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 85.4 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 40/85 (47%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ + + + D R +H+ DI+++ +C V+ G +G I + ++L+ + Sbjct: 7 AVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQGF 66 Query: 62 GDFENGIPVHDTIARVVSCICPAKF 86 + NG+P D I + + P F Sbjct: 67 LELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 85.0 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 4/108 (3%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVVSCICPAKFH 87 +L L + AV++G E I FG P L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTM 135 W+ D H D D IA+DGK S + H+++A++ Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGS--RDGAVPGTHLLAAYAPQ 107 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 84.6 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 47/129 (36%), Gaps = 13/129 (10%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L E ++ + D R+ H +LL+ AV++GA + I ++ P + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENGI-------PVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH 115 P TI RV+ CP + H D +AIDGK R Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SYDKSRRKG 124 S S R Sbjct: 115 SRLGSTRPP 123 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 84.2 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 49/169 (28%), Positives = 74/169 (43%), Gaps = 34/169 (20%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVI----SGAEGWEDIED--FGETHP 55 +LKKL+E S IPD R+A V+H+L+ +LL + + + S E D+ F + Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSRPAFLQALQ 138 Query: 56 DFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDY--------HSSDDKDVIA 107 + +G DT+ARV+ I P K ESFI + Y H + IA Sbjct: 139 GLFPELETLPHG----DTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIA 194 Query: 108 IDG--KIHR-------------HSYDKSRRKGAIHVISA-FSTMHSLVI 140 IDG K+ R + D + + I+V+ A F + L I Sbjct: 195 IDGTQKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTI 243 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 81.5 bits (200), Expect = 2e-14, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 73/221 (33%), Gaps = 37/221 (16%) Query: 58 LKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSY 117 + G P ++T+ +++C+ WM I DGK+ S Sbjct: 14 WRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLG-MPVGGIRADGKVLGGS- 71 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 K A+H + + + + Q + + A+ LL + G+++ DA Sbjct: 72 -KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFLN 129 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP-------------------------- 211 + + I ++ G+YL VKG+Q ++ P Sbjct: 130 AAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPRR 189 Query: 212 -------LKELNNPKHDSYAISEKSHGREETRLHIVCDVPD 245 +EL + E+S GR E R V D D Sbjct: 190 KRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGD 230 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 78.9 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 52/138 (37%), Gaps = 11/138 (7%) Query: 42 EGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS- 100 + + P G + P I R++ I P + W+ + Sbjct: 221 RATSALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAP 280 Query: 101 --DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAI--- 155 + IA+DGK R S ++R A HV++A +V+ D K+NEIT Sbjct: 281 APGSRRAIAVDGKTLRGS--RTRDSAARHVLAAADQHTGIVLASTDVDTKTNEITRFTAS 338 Query: 156 ---PELLNMLDIKGKIIK 170 +LL+ I+ ++ Sbjct: 339 GSHADLLSSRCIRSGVVS 356 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 78.5 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 30/79 (37%), Positives = 42/79 (53%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREE 234 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + Y E+S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 TRLHIVCDVPDELIDFTFE 253 +R V L D + E Sbjct: 71 SRAAFVSHDLSVLGDISDE 89 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 76.2 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 28/112 (25%), Positives = 47/112 (41%), Gaps = 6/112 (5%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF- 64 L ++S IPD+R+A + L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 3 LKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQLH 62 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD-----DKDVIAIDGK 111 P H +I + + +F D VI + K Sbjct: 63 RKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 75.4 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 33/129 (25%), Positives = 50/129 (38%), Gaps = 13/129 (10%) Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLD----YHSSDDKDVIAIDGKIHRHSYDKS 120 PV+ ++ ++ I P +F + IAIDGK R S+D Sbjct: 8 LRRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAF 67 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL---------NMLDIKGKIIKT 171 A +V+SAF+ H +++ D+KSNEI A L+ I + Sbjct: 68 SDTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVML 127 Query: 172 DAMGCQKDI 180 DAM I Sbjct: 128 DAMTFAPAI 136 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 73.5 bits (179), Expect = 6e-12, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + +PD R H L+ IL + I A++ GAE D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFEN 66 Sbjct: 60 PLPY 63 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 73.5 bits (179), Expect = 7e-12, Method: Composition-based stats. Identities = 50/228 (21%), Positives = 86/228 (37%), Gaps = 37/228 (16%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L E I+ + D R V+ +S I + + + + +E + K+ Sbjct: 16 VYHLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKAL 71 Query: 63 DFENGIPVHDTIARVVSCICPAKFHE--------SFINWMLDYHSSDDKDVIAIDG---- 110 + +P DTI RV+S +E S N + + D V+AIDG Sbjct: 72 PKKTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELF 131 Query: 111 ----KIHRHSYDKSRRKGAIH------VISAFSTMHSLVIGQIKTDKKSN-------EIT 153 K + + ++ G H V S + L++GQ + K + EIT Sbjct: 132 ESTKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEIT 191 Query: 154 AIPELLNMLDIK----GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 A L+ L + II DA+ C+ +++ G D + VK Sbjct: 192 AGKRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKD 239 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 72.3 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 27/57 (47%), Positives = 41/57 (71%) Query: 97 YHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 Y S + ++ DGK R S+D+S K AIH++SA+++ +SLV+GQ+KTD+KSNE Sbjct: 17 YQKSLKEKSLSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 71.1 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 24/71 (33%), Positives = 33/71 (46%), Gaps = 3/71 (4%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPKHDSYAISEKSH 230 MGCQK+IA+ I KQ DY+ A+KG+ L E + + D + + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GREETRLHIVC 241 GR ETR Sbjct: 61 GRIETRRCQQV 71 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 70.8 bits (172), Expect = 4e-11, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 28/81 (34%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +PD R V H+ S IL + A +GA + I ++ P +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSCICPAKFHESFI 91 T R ++ + + Sbjct: 109 ESTSRRFLAGVDATALDQVLG 129 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 69/198 (34%), Gaps = 33/198 (16%) Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDY-------HSSDDKDVIAIDGKIHR 114 G P T+ R+++ PA E+ + D + V++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 HSYDKSRRKGAIHVISAFSTMHS------------------LVIGQIKTDKKSNEITAIP 156 D + KGA SA+ S +GQ K E TA Sbjct: 153 SRTDGEKVKGAQQ--SAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFR 210 Query: 157 ELL----NMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL 212 LL L + +I+ DA C ++ AE + G Y+F +K NQ L+ + Sbjct: 211 RLLPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLH-DIARDYGQ 269 Query: 213 KELNNPKHDSYAISEKSH 230 +L P A + H Sbjct: 270 YDLGTPLA-RTAERYRGH 286 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 5/77 (6%) Query: 155 IPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDT----- 56 Query: 215 LNNPKHDSYAISEKSHG 231 N ++++ ++K HG Sbjct: 57 AKNSPLNAWSWTQKGHG 73 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 67.3 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 28/60 (46%), Positives = 30/60 (50%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 LKQYG FE GI HDTI +VSCI F + FI WM A DGK R S Sbjct: 11 LLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 67.3 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 45/106 (42%), Gaps = 3/106 (2%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAK 85 ++ +L +CAV++GA + D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FHESFINWMLDYHSSDDK---DVIAIDGKIHRHSYDKSRRKGAIHV 128 + +W+ VIA+DGK+ R + R A+ + Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWM 106 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 63.1 bits (152), Expect = 8e-09, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 30/56 (53%) Query: 159 LNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 L+M D+ + DA+G Q IAE+I + G DY+ A+K NQ +A F E Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAE 72 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 1/63 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + D R+ +H+L IL++ +CAVI+ AE +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFE 65 Sbjct: 61 PLP 63 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 60.4 bits (145), Expect = 5e-08, Method: Composition-based stats. Identities = 40/65 (61%), Positives = 43/65 (66%), Gaps = 12/65 (18%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLMEHISIIPDYRQAWKVEHKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 PDFLK 59 DFLK Sbjct: 55 LDFLK 59 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 59.2 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 1/49 (2%) Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGR 232 I + GDYL VKGNQ +L +A E F + + + D A+ E+ HGR Sbjct: 2 IIAKKGDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGR 49 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 22/96 (22%), Positives = 41/96 (42%), Gaps = 4/96 (4%) Query: 84 AKFHESFINWMLDYHS-SDDKDVIAIDGKIHRHSYD--KSRRKGAIHVISAFSTMHSLVI 140 F + WM + +D D + DGK R S D I +S +S + I Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQ-IKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 Q +S+E ++ LL+ +++ +++ D +G Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQADEVG 97 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 58.4 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 16/98 (16%), Positives = 34/98 (34%), Gaps = 1/98 (1%) Query: 8 EHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL-KQYGDFEN 66 + S + D R+A + L +L + +++SG+ ++ F E L + +G Sbjct: 10 DVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTSWR 69 Query: 67 GIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD 104 P I + + + +F S Sbjct: 70 KAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 57.7 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 13/55 (23%), Positives = 26/55 (47%), Gaps = 2/55 (3%) Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 ++ DY+ A+KGN + + ++ F + +K HGR E R++ + Sbjct: 7 EKDNDYILALKGNHPLMEQEVKDFF--LSPVTSTRSVHTTFDKGHGRIERRIYTL 59 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 57.3 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 67/176 (38%), Gaps = 21/176 (11%) Query: 43 GWEDIEDF-----GETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDY 97 +E++ F G+ D L +Y +F+N P + + + + I P F F + + Sbjct: 40 SFEEVMKFMLTMEGKALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFLFQEFTKSF 99 Query: 98 HSS---DDKDVIAIDGKIHRHSYDK------------SRRKGAIHVISAFS-TMHSLVIG 141 + + +IA DG +++ + +H+ + + Sbjct: 100 TDNVTYNGLRLIACDGSDLCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDA 159 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 I+ + +NE A+ E+++ + I D +I ++ +G YL VK Sbjct: 160 IIQPSRLANERRAMCEMIDRYNDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKD 215 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 56.5 bits (135), Expect = 8e-07, Method: Composition-based stats. Identities = 14/46 (30%), Positives = 19/46 (41%), Gaps = 1/46 (2%) Query: 8 EHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGET 53 E IPD R V H+L +L L AV+ G G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAA 114 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 16/66 (24%), Positives = 37/66 (56%), Gaps = 4/66 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDFLKQY 61 L++ SI+PD R + L +++++T+ AV+ GA+ W D+ + +G++ +++ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDVPVGSKKYGDSCMQVVREK 61 Query: 62 GDFENG 67 +G Sbjct: 62 CCLTSG 67 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 54.6 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 78/236 (33%), Gaps = 26/236 (11%) Query: 10 ISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 + +PD R L++IL + +++GA + E+ + ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCICPAKFHESFINWMLD-------YHSSDDKDVIAIDGK-----IHRHSY 117 T + + + + V+A+DGK H Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 118 DKSRRK------GAIHVIS--AFSTMHSLVIGQIKTDKKSNEITAIPELL-NMLDIKGK- 167 +++ G ++ S I + ++NE +L +++ G Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 168 --IIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD 221 ++ DA + + G DY+FA+K + + K E E+ + D Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARRED 255 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 54.6 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 29/74 (39%), Gaps = 10/74 (13%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK--------FPLKELNNPKHDSYAI 225 M Q D+ +Q++GGDY+ K NQG L E FP D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SE--KSHGREETRL 237 E K HG E R Sbjct: 61 CEVSKGHGWVERRT 74 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 52.7 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 7/78 (8%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 K + I + G DY+ AVKGNQ RL++ + L +E+ R T Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQIK----LTTEQRLPVSLDITTERRSDRITT 56 Query: 236 RLHIVCDVPDELIDFTFE 253 R V D+L +++ Sbjct: 57 RSVSVF---DDLSGISYD 71 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 52.7 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 5/59 (8%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFTFE 253 +K NQ + P + + S HGR E+R C + DEL F Sbjct: 2 IKRNQPTTYRQL-AALPWPDSAV----QHTASSAGHGRRESRSIKTCGIADELGGIAFP 55 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 52.7 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 20/55 (36%), Gaps = 3/55 (5%) Query: 193 FAVKGNQGRLNKAFEEKFPLKE---LNNPKHDSYAISEKSHGREETRLHIVCDVP 244 AVK NQ L E L + +K HGR ETR + D P Sbjct: 2 LAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFP 56 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 52.3 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 31/56 (55%), Gaps = 1/56 (1%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 EL++L ++ + D R HKL +++L+ +CAVI+GA+G IE + Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE-WLAGRLQL 73 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 1/67 (1%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK-HDSYAISEKSHGREETRLHIV 240 EKI ++ GDY+ +K N + E F + P+ +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDEL 247 V D L Sbjct: 61 LKVSDWL 67 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 51.9 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 36/108 (33%), Gaps = 7/108 (6%) Query: 41 AEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS 100 + +E F +P L G ++ + P K E+ + + Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLL--PLLLHRLDPKKLQEALHQ---VFPEA 55 Query: 101 DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKK 148 D V+ +DGK R S + + ++ + + Q + + K Sbjct: 56 DLGGVLVVDGKHLRGS--GKGKSPQVRLVEVLALHLKTTLAQARVEGK 101 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 50.7 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 12/45 (26%), Positives = 24/45 (53%) Query: 134 TMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 T + + Q++ + +NEIT LL+ D++ + DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 50.7 bits (120), Expect = 5e-05, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 32/84 (38%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L +S IPD R+ + L +L L + AV+ GA I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIARVVSCICPAKFHE 88 + P T+ + + + E Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 50.3 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 27/61 (44%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVI 106 + F + + ++ D + G P DT+ RV + I P KF E F +W+L + Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWILFLMQKRKYKIS 60 Query: 107 A 107 Sbjct: 61 Q 61 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 50.3 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 48/253 (18%), Positives = 77/253 (30%), Gaps = 47/253 (18%) Query: 8 EHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 E IPD R L D+L+ + A + T L+ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLAFQR--RTLDHNLRHVFGL-TG 81 Query: 68 IPVHDTIARVVSCICPAKFHESFIN--------WMLDYHSSDDKDVIAIDG-------KI 112 P + V+ + P F + +LD + D V+A+DG K+ Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 113 HR-HSYDKSRRKGAI-----HVISAFST-MHSLVIG------QIKTDKKSN--EITAIPE 157 H H + GA+ + +A S V+ Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNML-----DIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVK-GNQGRLNKAF----- 206 L D+ ++ DA +QK +L VK + L Sbjct: 202 WLGRFREEHPDLA-VLVVEDARSSNAPHVRDLQKARCHFLLGVKAADHAHLFAHVCARQD 260 Query: 207 EEKFPLKELNNPK 219 + F + E +P+ Sbjct: 261 QHAFEVVEDADPR 273 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 49.6 bits (117), Expect = 9e-05, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 1/55 (1%) Query: 8 EHISIIPDYRQAW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 EH +PD R+ + HK DIL++ ICA+I GA+ W + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 49.2 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 70/241 (29%), Gaps = 36/241 (14%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + + D R+ +++ I + E E ++ + +Q Sbjct: 38 VYGFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLV 93 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDK--------DVIAIDGKIHR 114 +P HDT+ + + + E + Y V AIDG Sbjct: 94 PKNIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELF 153 Query: 115 HSYDKSRRKGAI--HVISAFSTMHSLVIGQI------------------KTDKKSNEITA 154 H+ + H H++V+ Q DK E T Sbjct: 154 HTKAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTV 213 Query: 155 IPELLNML-DIKGK---IIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 L+ + + GK + DA+ + + G + +K + R+ K F Sbjct: 214 AQRLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACF 273 Query: 211 P 211 Sbjct: 274 A 274 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 49.2 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 26/47 (55%), Gaps = 1/47 (2%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVI-SGAEGWEDI 47 + L+E ++ +PD R+ V H + +L + +CA++ +G+ + Sbjct: 57 DQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTAGSRQTRAL 103 >UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7B Length = 481 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 66/172 (38%), Gaps = 28/172 (16%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVI 106 IE G L+++ ++G H+ + I P E+F+ + D ++ +V+ Sbjct: 81 IEHQGSGRQAHLRRHRQPDDG--CHEAFYGKLRRI-PRGLSEAFLRDVTDRFTALFPEVV 137 Query: 107 A--------------IDGKIHRHSYDK----SRRKGAIH---VISAFSTMHSLVIG-QIK 144 A +DGK + + G + ++ A+ LV+ Sbjct: 138 AHRLPTSFDRLEVLILDGKSLKKVAKRLVDTRGTPGKLLGGKLLVAYRPRDGLVLDMAAD 197 Query: 145 TDKKSNEITAIPELLNMLDIKG---KIIKTDAMGCQKDIAEKIQKQGGDYLF 193 D ++NE IP+L+ + +G K++ D + C + K G ++ Sbjct: 198 LDGETNEAKLIPDLMPRVHARGGPAKLVVGDRLFCASKHFAEFTKDNGHFVV 249 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 11/60 (18%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +L + I D RQ K H L +L++TI +I + D+L+QY Sbjct: 34 RLADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 >UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus pneumoniae RepID=B2ISL1_STRPS Length = 535 Score = 47.7 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 67/202 (33%), Gaps = 31/202 (15%) Query: 18 QAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG--IPVHDTIA 75 Q + SDIL+ + +++G D+ + G + T++ Sbjct: 142 QRRYCRYSDSDILVQFLFQLLTGYGT-----DYACKELSADAYFPKLLEGGQLASQPTLS 196 Query: 76 RVVSCICPA----------KFHESFINW--MLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 R +S + E F+ + + D GK +Y+ R Sbjct: 197 RFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTYGKQEGVAYNAHYRA 256 Query: 124 GAIHVISAFSTMHSLVI-GQIKTDKK--SNE----ITAIPELLNMLDIKGKIIKTDAMGC 176 H + AF Q++ + S E IT + E N L + + D+ Sbjct: 257 HGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQL-----LFRMDSGFA 311 Query: 177 QKDIAEKIQKQGGDYLFAVKGN 198 + + I+K G YL +K N Sbjct: 312 TPKLYDLIEKTGQYYLIKLKKN 333 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 43/121 (35%), Gaps = 14/121 (11%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHK----LSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 EL L+ + IPD R K HK L LL+ + S E ++ Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSCICPAKFHES--------FINWMLDYHSSDDKDVIAID 109 L++ +P DT+ R++ I A ++ + + IAID Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 110 G 110 G Sbjct: 193 G 193 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 46.9 bits (110), Expect = 6e-04, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 24/64 (37%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L + +PD V H+L+ +L+ ICAV + I ++ P G Sbjct: 13 AGLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGG 72 Query: 64 FENG 67 G Sbjct: 73 HRPG 76 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 46.9 bits (110), Expect = 6e-04, Method: Composition-based stats. Identities = 32/48 (66%), Positives = 35/48 (72%) Query: 78 VSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 +SCI KFHE FIN M + HSSDD DVIAIDGK HS DKSRR+ A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_A6FBF2 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6FBF2_9GAMM Length = 65 Score = 46.9 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 17/43 (39%), Positives = 23/43 (53%) Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSY 223 + I QGGDYL AVK NQG+L K E+ F + + + Sbjct: 6 CQSIVNQGGDYLLAVKNNQGKLRKTVEKSFSHQRTTTAQGIEF 48 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 46.5 bits (109), Expect = 8e-04, Method: Composition-based stats. Identities = 16/30 (53%), Positives = 24/30 (80%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 +++A +T + + IGQ+K D KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 46.5 bits (109), Expect = 8e-04, Method: Composition-based stats. Identities = 12/68 (17%), Positives = 25/68 (36%), Gaps = 1/68 (1%) Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 + P T+ + I +F W+ + + +AIDGK+ R ++ Sbjct: 28 HFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCAQI-ARGRVALAIDGKVLRGAWSGD 86 Query: 121 RRKGAIHV 128 A ++ Sbjct: 87 ESVTAAYL 94 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 55/164 (33%), Gaps = 17/164 (10%) Query: 51 GETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS----DDKDVI 106 G T L + DF+ P + + I P F F + + + + ++ Sbjct: 54 GCTLNKELLDFFDFDVNAPTVSAYTQQRAKILPEAFEYLFHAFTEENAQTKNLYEGYQLL 113 Query: 107 AIDG------------KIHRHSYDKSRRKGAIHVISAFSTMHSLVI-GQIKTDKKSNEIT 153 A DG + S +H+ + + ++ I ++T E Sbjct: 114 ACDGSNLTIAPNLNDPETLWKSNQLGATGNHLHLNALYDVLNRTYIDALVQTASTYQEHR 173 Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 A +++ + + I+ D +I ++G +L +K Sbjct: 174 ACIQMIERVTLDKVILIADRGYENYNIMSHAIEKGWKFLIRIKD 217 >UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TLQ7_SHEHH Length = 74 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 19/44 (43%), Positives = 25/44 (56%) Query: 7 MEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF 50 EH+SII R + EH DI+ L A+ S EGW DI++F Sbjct: 4 FEHLSIIKAPRSSINHEHDPVDIMFLVNSAIASDCEGWLDIDEF 47 >UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW00_RHOSR Length = 98 Score = 45.3 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 5/52 (9%) Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNM 161 GK R + D S H+++A +V+ Q+ + NEI P LL+ Sbjct: 18 GKTWRGAKDGSG--HLTHLLAAVDHDAGVVLRQVAVGARINEI---PLLLDP 64 >UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromonas gingivalis ATCC 33277 RepID=B2RI66_PORG3 Length = 87 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 23/47 (48%) Query: 17 RQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 R K + L + L+ + +SG W +IED+ E + + LK + Sbjct: 23 RIESKEVYPLDFLFLIVFLSTLSGDTSWYEIEDYAEEYEEVLKSRYE 69 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 19/120 (15%), Positives = 41/120 (34%), Gaps = 8/120 (6%) Query: 66 NGIPVHDTIARVVSCICPAKFHESFIN-WMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 G P + + + P + + + V+ +DG R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIIKTDAMGCQKDIAEK 183 +H+ + +++ Q+ D+K+NE + L + D+ G +I A A+ Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLIT--AFPAPPSHAQA 144 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 74/218 (33%), Gaps = 24/218 (11%) Query: 5 KLMEHISI-IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 KL E ++ I D R +V H L+DIL I A+ G E D++ P F G Sbjct: 45 KLAEKLAAAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDLDRL-RNDPAFKLACGR 103 Query: 64 FENG---IPVHDTIARVVSCIC---PAKFHESFIN-WMLDYHSSDDKDVIAID------- 109 + + T +R+ + + ++ W+ Y + + ID Sbjct: 104 LPDSGQDLCSQPTCSRLENLPDLRTVIRLGRVLVDLWLSSYPAPPKSVTLDIDDTLDVVH 163 Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKK---SNEITAIPELLNML---- 162 G ++ + I + + I K EI L Sbjct: 164 GHQQLSLFNGHHDERCFLPIHIYDAATGRPVAMILRPGKTPSGKEIRGHLRRLARCIRAR 223 Query: 163 -DIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 +++ D+ + ++ ++ DY+F + GN+ Sbjct: 224 WPDTRILVRGDSHYGRVEVMAWCEENAIDYVFGLAGNK 261 >UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RR82_RHORT Length = 84 Score = 44.2 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 1/46 (2%) Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 A E++ LD+ G++ DA+ CQK E ++ G L K NQ Sbjct: 36 ATQEMIAPLDLTGRLFTLDALHCQK-TFEIARQAGNHLLVQAKINQ 80 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 43.0 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 13/53 (24%), Positives = 17/53 (32%), Gaps = 4/53 (7%) Query: 199 QGR-LNKAFEEKFPLKELNNPKHDS---YAISEKSHGREETRLHIVCDVPDEL 247 Q L A + F + + +K HGR ETR D L Sbjct: 105 QPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTAAGDLDWL 157 >UniRef50_A4WVT3 Transposase, IS4 family n=63 Tax=Bacteria RepID=A4WVT3_RHOS5 Length = 322 Score = 42.3 bits (98), Expect = 0.017, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 50/145 (34%), Gaps = 7/145 (4%) Query: 58 LKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGK--IHRH 115 L + + +P + T+ R + + + S + DGK +H Sbjct: 92 LLRLAGLDWPVPDYSTLCRRQKTLKVQIPYRRADGPLNLLVDSTGIKFLG-DGKWQARKH 150 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKI--IKTDA 173 R+ +H+ A T S V T + + +P+LL+ + + I + D Sbjct: 151 GVQGRRQWRKVHL--AMDTATSDVRAVEFTPSREGDSPVLPDLLDQIRVDEAIGTVTADG 208 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGN 198 I +GG + ++ N Sbjct: 209 AYDTPRCHSAIIARGGTAIIPIRKN 233 >UniRef50_D1RJD3 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RJD3_LEGLO Length = 61 Score = 42.3 bits (98), Expect = 0.018, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 1/55 (1%) Query: 22 VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIAR 76 + L I+ L + + I DFG ++LKQ+ ++NG+PV DT+ R Sbjct: 2 KRYLLIKIMFLLLVLQFMDVKAGT-IRDFGLLKIEWLKQFLTYKNGMPVDDTMTR 55 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 41.5 bits (96), Expect = 0.026, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 67/205 (32%), Gaps = 19/205 (9%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 I+PD R +V+H L +L I A+ +G E D D G H L+ + + Sbjct: 49 KIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLND-HD-GLRHDYALQTAVNRLQPLAG 106 Query: 71 HDTIARVVSCICPA---KFHESFINWMLDYHSSDDKDVI----AID----GKIHRHSYDK 119 T+ R+ + H + H +++ A D G + Sbjct: 107 KSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGDQEGRFFHG 166 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEIT-AIPELLNML-----DIKGKIIKTDA 173 + F H LV ++ + AI LL + + D Sbjct: 167 YYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPETRIVFRGDG 226 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGN 198 C+ + + ++ DY+ + N Sbjct: 227 GFCRHRMLDWCDRKQVDYVVGLARN 251 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 41.5 bits (96), Expect = 0.029, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 22/69 (31%), Gaps = 5/69 (7%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKS-HGREETR 236 K E + G D L +KGN +L A + + + R E R Sbjct: 6 KKTVETVLATGNDLLVQLKGNHPKLLAAVRTL----CQSRAHAEQSYTVDLGRRNRIEQR 61 Query: 237 LHIVCDVPD 245 + +P Sbjct: 62 TVRLWPLPP 70 >UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b RepID=A6FLE0_9RHOB Length = 136 Score = 40.7 bits (94), Expect = 0.046, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP--V 70 +PD R+ KV H L DI+ I + +G E D D + L D E G Sbjct: 41 LPDPREPGKVRHSLEDIIRFRIMMIAAGYEDGNDAGDLRDDPAFKLALERDPETGAALCS 100 Query: 71 HDTIARVVSCIC 82 TI+R+ + Sbjct: 101 QPTISRMENMAD 112 >UniRef50_A5FU21 Transposase, IS4 family protein n=11 Tax=Alphaproteobacteria RepID=A5FU21_ACICJ Length = 448 Score = 40.3 bits (93), Expect = 0.059, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 73/208 (35%), Gaps = 31/208 (14%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHP--DFLKQYGDFENGIPV 70 I D R +V+H L +I+ + + +G E D D P + + Sbjct: 57 IDDPRTPERVQHGLDEIIRFRMLMIAAGYEDGND-ADRLRNDPMFKLAMERLPEAGDLCS 115 Query: 71 HDTIARVVSCICPAKFHESFIN----WMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAI 126 TI+R + P + + + + ++ V+ ID ++D + + Sbjct: 116 QATISRTENLPGPRALLRMGLAMVEHYCASFRTIPNRVVLDID-----DTFDAAHGAQQL 170 Query: 127 HVISAFSTMHSL-----------VIGQI---KTDKKSNEITA-IPELLNML----DIKGK 167 + +A + ++ + K ++I + L++ + Sbjct: 171 CLFNAHHDEYGFQPIVVFDGDGRMLAAVLRPACRPKGSQIVKWLRRLIDAIRSHWPRTAI 230 Query: 168 IIKTDAMGCQKDIAEKIQKQGGDYLFAV 195 +++ D+ C ++ + + DY+F V Sbjct: 231 MLRGDSHYCTPEVLRFCRARRLDYIFGV 258 >UniRef50_C7S7P7 Transposase n=4 Tax=root RepID=C7S7P7_METEA Length = 404 Score = 39.9 bits (92), Expect = 0.080, Method: Composition-based stats. Identities = 14/37 (37%), Positives = 22/37 (59%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDI 47 ++IPD R ++ H L +ILL I A+ G E +D+ Sbjct: 9 AVIPDRRDPSRIVHPLPEILLARILAIACGYEDADDL 45 >UniRef50_A0P2Q4 Putative uncharacterized protein n=2 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2Q4_9RHOB Length = 39 Score = 39.9 bits (92), Expect = 0.081, Method: Composition-based stats. Identities = 11/39 (28%), Positives = 21/39 (53%) Query: 7 MEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWE 45 M ++ PD R+A + ++L +I A++SGA + Sbjct: 1 MSCLAAFPDRRRAEGKMYDQVGVILFSIIAILSGARSYR 39 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 303 5e-81 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 278 1e-73 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 273 4e-72 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 273 6e-72 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 270 4e-71 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 261 1e-68 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 261 1e-68 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 260 4e-68 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 254 3e-66 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 253 3e-66 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 253 4e-66 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 250 4e-65 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 243 4e-63 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 242 7e-63 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 241 2e-62 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 240 3e-62 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 239 5e-62 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 238 2e-61 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 236 6e-61 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 235 1e-60 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 234 3e-60 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 230 4e-59 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 229 6e-59 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 229 6e-59 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 228 1e-58 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 228 1e-58 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 227 3e-58 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 226 8e-58 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 224 2e-57 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 221 2e-56 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 221 2e-56 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 220 3e-56 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 220 3e-56 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 220 3e-56 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 220 4e-56 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 220 4e-56 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 218 1e-55 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 216 5e-55 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 215 9e-55 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 215 1e-54 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 214 3e-54 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 209 9e-53 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 209 9e-53 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 208 2e-52 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 206 8e-52 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 206 9e-52 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 203 4e-51 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 202 9e-51 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 201 1e-50 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 200 3e-50 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 197 2e-49 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 196 6e-49 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 193 5e-48 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 191 2e-47 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 188 1e-46 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 187 3e-46 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 186 5e-46 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 184 2e-45 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 183 4e-45 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 183 6e-45 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 181 2e-44 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 180 3e-44 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 180 4e-44 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 176 6e-43 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 176 7e-43 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 176 8e-43 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 174 3e-42 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 172 6e-42 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 172 7e-42 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 172 8e-42 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 168 2e-40 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 168 2e-40 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 166 5e-40 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 166 7e-40 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 160 5e-38 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 159 8e-38 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 159 8e-38 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 159 9e-38 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 154 3e-36 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 153 4e-36 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 152 1e-35 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 151 2e-35 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 150 3e-35 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 150 3e-35 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 149 8e-35 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 144 3e-33 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 144 4e-33 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 142 2e-32 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 134 3e-30 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 134 4e-30 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 132 1e-29 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 129 6e-29 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 128 2e-28 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 124 3e-27 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 122 8e-27 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 121 2e-26 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 120 6e-26 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 118 2e-25 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 118 2e-25 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 117 4e-25 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 114 4e-24 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 111 2e-23 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 107 3e-22 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 104 2e-21 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 101 2e-20 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 101 2e-20 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 101 2e-20 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 101 3e-20 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 100 6e-20 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 100 8e-20 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 98 3e-19 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 97 4e-19 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 97 5e-19 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 95 2e-18 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 92 2e-17 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 91 4e-17 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 91 4e-17 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 91 5e-17 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 89 1e-16 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 87 6e-16 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 86 1e-15 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 86 1e-15 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 84 3e-15 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 84 4e-15 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 84 5e-15 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 83 8e-15 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 83 1e-14 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 82 1e-14 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 82 2e-14 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 82 2e-14 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 81 3e-14 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 80 4e-14 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 79 2e-13 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 78 3e-13 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 78 3e-13 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 78 4e-13 UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus p... 77 4e-13 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 77 5e-13 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 77 6e-13 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 76 1e-12 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 76 1e-12 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 74 4e-12 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 74 5e-12 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 73 1e-11 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 72 2e-11 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 72 2e-11 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 69 2e-10 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 66 1e-09 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 64 4e-09 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 64 4e-09 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 63 9e-09 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 62 1e-08 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 59 2e-07 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 59 2e-07 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 57 6e-07 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 57 7e-07 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 56 1e-06 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 56 1e-06 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 56 1e-06 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 54 3e-06 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 54 5e-06 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 54 7e-06 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 53 8e-06 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 53 9e-06 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 51 4e-05 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 51 4e-05 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 50 7e-05 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 49 2e-04 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 49 2e-04 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 48 4e-04 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 46 0.001 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 46 0.001 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 45 0.002 Sequences not found previously or not previously below threshold: UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkalip... 56 1e-06 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 55 2e-06 UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostri... 54 7e-06 UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanoba... 50 6e-05 UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteoba... 49 1e-04 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 49 2e-04 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 49 2e-04 UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=... 47 4e-04 UniRef50_C7G6U9 Putative uncharacterized protein (Fragment) n=7 ... 47 6e-04 UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipe... 47 7e-04 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 45 0.002 UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitroco... 44 0.004 UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=... 44 0.005 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 43 0.011 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 43 0.012 UniRef50_A7BZC9 Putative uncharacterized protein n=1 Tax=Beggiat... 43 0.012 UniRef50_B8FDX7 Transposase IS4 family protein n=2 Tax=Desulfati... 42 0.013 UniRef50_A6FBF2 Putative uncharacterized protein n=1 Tax=Moritel... 42 0.016 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 42 0.025 UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewane... 41 0.036 UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM 41 0.037 UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodosp... 41 0.045 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 41 0.046 UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodoco... 41 0.049 UniRef50_Q11ZV5 Transposase, IS4 family n=1 Tax=Polaromonas sp. ... 40 0.063 UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b... 40 0.079 UniRef50_UPI00016AFD66 hypothetical protein Bpse38_17802 n=1 Tax... 40 0.081 UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID... 40 0.089 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 303 bits (775), Expect = 5e-81, Method: Composition-based stats. Identities = 239/253 (94%), Positives = 242/253 (95%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 MELKKLMEHISIIPDYRQ WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETH DFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 YGDFENGIPVHDTIARVVSCI PAKFHE FINWM D HSSDDKDVIAIDGK RHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 RR+GAIHVISAFSTMHSLVIGQIKTD+KSNEITAIPELLNMLDIKGKII TDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 AEKIQKQGGDYLFAVKG QGRLNKAFEEKFPLKELNNP+HDSYAISEKSHGREE RLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFE 253 CDVPDELIDFTFE Sbjct: 241 CDVPDELIDFTFE 253 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 278 bits (711), Expect = 1e-73, Method: Composition-based stats. Identities = 122/253 (48%), Positives = 165/253 (65%), Gaps = 1/253 (0%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ +SII D RQ KV H L D+L L I AVISG EGWE+I+DFG D+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 Y F GIP DTI+R+ I P +F + F WM DVIAIDGK R S++K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++ DAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A+KI +GGDYL VKGNQ RL A + F ++ L P+ ++Y EK HGRE++R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 CDVPDELIDFTFE 253 D E+ D FE Sbjct: 241 ADAN-EIGDLVFE 252 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 273 bits (698), Expect = 4e-72, Method: Composition-based stats. Identities = 112/251 (44%), Positives = 152/251 (60%), Gaps = 1/251 (0%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M ++ +H S I D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 F +GIP DTIAR+VS I P F+ F+ WM H + +VIAIDGK R SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++ DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A I +GGDYL AVK NQG L KA + F D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGLS-DDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFT 251 DFT Sbjct: 240 LSSAALDGDFT 250 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 273 bits (697), Expect = 6e-72, Method: Composition-based stats. Identities = 96/251 (38%), Positives = 148/251 (58%), Gaps = 3/251 (1%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++EH S + D R A ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSR 121 NG+P HDT V + + P + + F+NW + ++IAIDGK R + Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 + IH++SA+++ + LV+GQ D+KSNEITAIPELL +L+++G ++ DAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPKHDSYAISEKSHGREETRLH 238 E I + GDY+ A+KGNQG L + F + +HDSY EK HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVCDVPDELID 249 D L+ Sbjct: 245 WTMGQTDYLLG 255 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 270 bits (690), Expect = 4e-71, Method: Composition-based stats. Identities = 118/250 (47%), Positives = 161/250 (64%), Gaps = 3/250 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+E SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G ++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 + GIPV DTIAR++S + P + FI WM + D +IA+DGK RHSYDK +RK Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 AIH++SA++ + +V+GQ KTD KSNEI AIP LL++LDIKG I+ DAMGCQ+ IAEKI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKE---LNNPKHDSYAISEKSHGREETRLHIVC 241 + GDY+ AVK NQ +L++ + F +HD + S K HGR E R + + Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 DVPDELIDFT 251 D+ L + Sbjct: 246 DMLSTLGNPE 255 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 261 bits (668), Expect = 1e-68, Method: Composition-based stats. Identities = 107/252 (42%), Positives = 146/252 (57%), Gaps = 5/252 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 + H S I D RQ KV + L +ILLLT+CAV+SGA W I +G FLK++ F Sbjct: 25 FLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPFA 84 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 +G P HD + + + + F FI+W+ + + V+AIDGK R S DK+ K A Sbjct: 85 DGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKAA 143 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 IH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+ DAMGCQ++IA KI Sbjct: 144 IHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKII 203 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPKHDSYAISEKSHGREETRLHIVCD 242 + DY+ A+KGNQG L K E + + ++ + EKSHGR ETR VC Sbjct: 204 SKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVCT 263 Query: 243 VPDEL-IDFTFE 253 D L D + Sbjct: 264 DIDWLKADHNWP 275 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 261 bits (667), Expect = 1e-68, Method: Composition-based stats. Identities = 100/251 (39%), Positives = 148/251 (58%), Gaps = 4/251 (1%) Query: 2 ELKKLMEHISIIPDYRQAW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K E+ + D R+ H DIL++ +CA+ISGA + +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 + NGIP HDT V++ + P +F F+ W + + IAID K R S DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 K +H++SA++T +LVIGQIKT++ SNEITAIPELLN LD+KG ++ DAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK---ELNNPKHDSYAISEKSHGREETRL 237 AEKI ++ DY+ A+KGNQ +L+++ E F L E + D E S+GREE R Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVCDVPDELI 248 + +++I Sbjct: 245 AYATNEIEKII 255 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 260 bits (664), Expect = 4e-68, Method: Composition-based stats. Identities = 88/244 (36%), Positives = 134/244 (54%), Gaps = 3/244 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ ++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 +GIP DT RV I P + W+ +S ++I IDGK R SYD++ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 A++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G II DAMG Q I ++I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---KHDSYAISEKSHGREETRLHIVC 241 +Q DY+ +K N L ++ F + N +HD Y K H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 DVPD 245 V Sbjct: 270 PVAA 273 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 254 bits (648), Expect = 3e-66, Method: Composition-based stats. Identities = 89/247 (36%), Positives = 132/247 (53%), Gaps = 2/247 (0%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +L + E +PD R + H LS++L + +CAV+ GA + D+ +G+++ +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD-VIAIDGKIHRHSYDKS 120 + G+P HDT RV++ I PA F +F+ W+ + D V+AIDGK R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+ DAMG Q I Sbjct: 125 TSG-PLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A I+ +G DY+ VK N L + + K HGR E R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDEL 247 D +L Sbjct: 244 YDAVSQL 250 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 253 bits (647), Expect = 3e-66, Method: Composition-based stats. Identities = 89/241 (36%), Positives = 131/241 (54%), Gaps = 3/241 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L ++ I D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP HDT ARV + + P F +W+ S+ VIAIDGK + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I+ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKEL---NNPKHDSYAISEKSHGREETRLHIVCD 242 KQ DY+ A+KGNQ L K ++ F + ++ + E +H R E+R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 V 243 V Sbjct: 254 V 254 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 253 bits (647), Expect = 4e-66, Method: Composition-based stats. Identities = 101/264 (38%), Positives = 151/264 (57%), Gaps = 18/264 (6%) Query: 8 EHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 ++ + D R +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIH 127 IP HDT RV S + P + F+ W+ S +++AIDGK RHSYD+S+ K A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAI---------------PELLNMLDIKGKIIKTD 172 +ISA++T + LV+GQ D+KSNEITAI P LL +L + G I+ D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK---HDSYAISEKS 229 A+GCQK+I ++I +Q DY+ +K NQG L + E F ++N + Y + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGREETRLHIVCDVPDELIDFTFE 253 HGR+E R + + E ID ++ Sbjct: 251 HGRQEVRYYQMLSNVAEEIDPDWQ 274 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 250 bits (638), Expect = 4e-65, Method: Composition-based stats. Identities = 91/248 (36%), Positives = 134/248 (54%), Gaps = 6/248 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M K L++++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-KDVIAIDGKIHRHSYDK 119 + GIP HDT R+ + + PA F W+ D D +A+DGK R + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 R A+H+++ +ST + +GQ K KSNEITAIPELL +L++KG ++ DAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF----PLKELNNPKHDSYAISEKSHGREET 235 IA+ I K+ GDYL AVK NQ LN +E+F E + H + HGR+E Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVCDV 243 R V V Sbjct: 240 RRCWVLMV 247 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 243 bits (620), Expect = 4e-63, Method: Composition-based stats. Identities = 97/248 (39%), Positives = 137/248 (55%), Gaps = 5/248 (2%) Query: 9 HISIIPDYRQA-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 +IPD R+A H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVHDTIARVVSCICPAKFHESFINWML-DYHSSDDKDVIAIDGKIHRHSYDKSRRKGAI 126 IP HDT RV S I P F +F +W D D +A+DGK R S+ + A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSH-RGSAGRAL 135 Query: 127 HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQK 186 H++ A+S L++ Q + D KSNEITAIP++L++ D++G I DA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 QGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDE 246 GGDY+ A+KGNQ L+ F + + EK HGR ETR V D D Sbjct: 196 AGGDYVLALKGNQSALHDDVR-LFMETQADRHPQGQAEAVEKDHGRIETRRIWVNDEIDW 254 Query: 247 LID-FTFE 253 L + Sbjct: 255 LTQKPDWP 262 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 242 bits (618), Expect = 7e-63, Method: Composition-based stats. Identities = 86/246 (34%), Positives = 134/246 (54%), Gaps = 2/246 (0%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L ++E + D+R A + H+LS++L + +CAV+SGA+ +E+I +G +L+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD-VIAIDGKIHRHSYDKSR 121 + G+ DT RV + + P +F ++F W+ + KD VIAIDGK R + K+ Sbjct: 66 RLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKAA 125 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 +H++SAF+ +V+GQ T +KSNEITAIPELL +LDI+G I+ DAMG Q IA Sbjct: 126 AA-PLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKIA 184 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC 241 I+++G Y+ VK N +L + ++ + HGR E R Sbjct: 185 RAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTAF 244 Query: 242 DVPDEL 247 D D L Sbjct: 245 DATDRL 250 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 241 bits (615), Expect = 2e-62, Method: Composition-based stats. Identities = 86/256 (33%), Positives = 142/256 (55%), Gaps = 12/256 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L+EH I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 +GIP HDT RV + + P F + F+ W ++ +++A+DGK R + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQGQSP 126 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 I +SA++ +SLV+GQI+ K+NEITA+P+LL +L++ G I+ DAMGCQK+IA + Sbjct: 127 RVI--VSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLK----------ELNNPKHDSYAISEKSHGRE 233 I + +Y+ A+KGNQG+ ++ + E N +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 ETRLHIVCDVPDELID 249 ETR + L D Sbjct: 245 ETRRYWQSGDVSWLAD 260 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 240 bits (613), Expect = 3e-62, Method: Composition-based stats. Identities = 90/256 (35%), Positives = 130/256 (50%), Gaps = 12/256 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L E I D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD--------VIAIDGKIHRH 115 NGIP HDT +V S + P +F E+F W + VIAIDGK R Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 + DK + I + A+++ SL +GQ+K KSNEI A+PELL ML +KG I+ DAMG Sbjct: 134 AVDKGQAPAVI--VGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE-LNNPKHDSYAISEKSHGREE 234 CQ+++A KI +Q GDY+ A+K NQ L++ E L + + + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 TRLHIVCDVPD-ELID 249 R V + + L Sbjct: 252 VRRCWVSEEVECWLQG 267 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 239 bits (611), Expect = 5e-62, Method: Composition-based stats. Identities = 83/247 (33%), Positives = 125/247 (50%), Gaps = 10/247 (4%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS------YD 118 IP HDT R S I P F F NW+ V+AIDGK+ R + Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 + + ++SA+S ++ + +GQ+K D KSNEITAIP L+N L++ G I+ DAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 DI + I ++ +Y+ A+K N+ + L K + + ++ + + HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVCD 242 R V Sbjct: 243 RTCTVVS 249 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 87/237 (36%), Positives = 137/237 (57%), Gaps = 3/237 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L++H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGK +HS +K K A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K +++ EITAIP L+ +L++ G ++ DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELN---NPKHDSYAISEKSHGREETRLHI 239 +G DY A+KGNQ L + +E F + +H + EK R E Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAY 245 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 236 bits (602), Expect = 6e-61, Method: Composition-based stats. Identities = 86/207 (41%), Positives = 130/207 (62%), Gaps = 1/207 (0%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 NGIP HDT RV S I +F + FI W+ +++IAIDGK R + +K Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGAKA-GGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+ DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKF 210 I K+ DY+ AVK NQ +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 235 bits (600), Expect = 1e-60, Method: Composition-based stats. Identities = 88/252 (34%), Positives = 138/252 (54%), Gaps = 8/252 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD---KDVIAIDGKIHRHSYDK 119 + +NG P HDT+ RV+ + P + + W + ++ K +I IDGK R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + H++SA+S +GQ +KSNEITAIPELL + +KG+I+ DAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN---NPKHDSYAISEKSHGREETR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELI 248 + + L Sbjct: 239 EYYQTEKIKWLS 250 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 234 bits (596), Expect = 3e-60, Method: Composition-based stats. Identities = 82/243 (33%), Positives = 125/243 (51%), Gaps = 3/243 (1%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 G+P T ARV S I P +F WM + D+I +DGK S + + + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 A H+++A+ + +G+++ KSNEI AIP LLN L+++G II DAMG QK IA Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEK---SHGREETRLHIV 240 I+ + DY+ A+K N R + E F + + + Y E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 CDV 243 + Sbjct: 254 LPM 256 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 230 bits (586), Expect = 4e-59, Method: Composition-based stats. Identities = 87/241 (36%), Positives = 135/241 (56%), Gaps = 8/241 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD---KDVIAIDGKIHRHSYDK 119 + +NG P HDT+ RV+ + P + + W + ++ K +I IDGK R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + H++SA+S +GQ +KSNEITAIPELL + +KG+I+ DAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN---NPKHDSYAISEKSHGREETR 236 IAEKI+ + DY+ ++K NQG L + E F E + EK+HG+ ETR Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 L 237 Sbjct: 239 E 239 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 229 bits (584), Expect = 6e-59, Method: Composition-based stats. Identities = 99/254 (38%), Positives = 145/254 (57%), Gaps = 14/254 (5%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + ++ + + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD---------DKDVIAIDGKIH 113 ENGIP HDT+ RV + + P E W SD K ++AIDGK Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDA 173 R + S ++ A+H+++A++T + GQ+ T++KSNEITAIPELL+M+ +KG ++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGRE 233 MG QK IA+KI K+ DY AVK NQ L + F + + + D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFEMSQEAD---DHYHTVEKAHGQI 240 Query: 234 ETRLHIVCDVPDEL 247 ETR + V L Sbjct: 241 ETRAYEVIHDVSWL 254 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 229 bits (584), Expect = 6e-59, Method: Composition-based stats. Identities = 82/253 (32%), Positives = 141/253 (55%), Gaps = 8/253 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ ++E+ + + D R+ +H L D+L++ + AVI+GA+G I + E H ++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS-----DDKDVIAIDGKIHRHS 116 + +G+P HDTI R+++ + P F + F W+ + D +++IAIDGK R S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 +D+ + G + + SA++ + +GQ+ KSNEI PEL+ +D++ I+ DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPKHDSYAISEKSHGRE 233 Q+D+AEKI GDY+ A+K NQ RL++ + + K + + K HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 ETRLHIVCDVPDE 246 + R + +PDE Sbjct: 249 DKRFYYQVKLPDE 261 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 228 bits (581), Expect = 1e-58, Method: Composition-based stats. Identities = 77/246 (31%), Positives = 127/246 (51%), Gaps = 7/246 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL--KQYGD 63 L+E S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 E G P HDT + + F F +W+ + D V+AIDGK R S K + Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+ DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---KHDSYAISEKSHGREETRLHIV 240 I +GGDY+ VK NQ L +A E F + + +EK HGR ETR + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 241 CDVPDE 246 + Sbjct: 241 INDVTW 246 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 228 bits (581), Expect = 1e-58, Method: Composition-based stats. Identities = 91/247 (36%), Positives = 144/247 (58%), Gaps = 4/247 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ D+L+++ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 + G+PV DTIAR++S + P SFI+W+ + + VIA DGK RHS+D RK Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFD-GDRKT 120 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 A+H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+ DAM C K +A+ I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK---HDSYAISEKSHGREETRLHIVC 241 +GGDY+ VK NQG+L F + P+ +S ++ HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 DVPDELI 248 + L Sbjct: 241 PITPWLT 247 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 227 bits (578), Expect = 3e-58, Method: Composition-based stats. Identities = 101/263 (38%), Positives = 137/263 (52%), Gaps = 19/263 (7%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L + + + +I D R +H+ S I+L+ I AVI GA+ W IEDFG++ F Sbjct: 14 LHEFADSLILI-DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKL 72 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSY----- 117 NGIP HDT R S + P KF ES+ W+ IAIDGK R +Y Sbjct: 73 SNFNGIPSHDTFNRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQD 131 Query: 118 ----------DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGK 167 D + K +HVISAF+T + +GQ+ T +K NEI IPELL+ML IK Sbjct: 132 KRHRKQGVLPDSNTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDC 191 Query: 168 IIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP--LKELNNPKHDSYAI 225 II DA+GCQ+ IAEK+ K GDY+F VK NQ +L + + + + D Y Sbjct: 192 IITIDALGCQRTIAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYET 251 Query: 226 SEKSHGREETRLHIVCDVPDELI 248 E+ HGR E+R+ C+ P L Sbjct: 252 HEEGHGRNESRICYCCNDPGFLG 274 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 226 bits (575), Expect = 8e-58, Method: Composition-based stats. Identities = 86/251 (34%), Positives = 129/251 (51%), Gaps = 9/251 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L+ L+EH S I D R ++ H L +ILLL +C ++ + +E+I +G H FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 + +G+P + +++ I PA F +F W+ D +AIDGK R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPGRA-DFVAIDGKTSRRSHDRRAG 130 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML----DIKGKIIKTDAMGCQK 178 IH++SAF+T LV+ Q K+NE+ AIP LL+ L + G ++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLH 238 IA I+ QG DYL AVK NQ L E F + + + HD +K HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEERHV 246 Query: 239 IVCDVPDELID 249 V D L Sbjct: 247 SVIREVDWLSG 257 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 224 bits (571), Expect = 2e-57, Method: Composition-based stats. Identities = 90/252 (35%), Positives = 132/252 (52%), Gaps = 7/252 (2%) Query: 4 KKLMEHISIIPDYRQA-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 LM + D R+ H ++L++ I AV+S + EDI +G D+L+Q+ Sbjct: 7 ASLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFL 66 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 NG+ +T R+ + P +F +F W+ + + +DGK R S S Sbjct: 67 VLLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGG 123 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 + AIH++SAF+T +V+GQ K KSNEITAIPELL L I G ++ DAMGCQK+IA Sbjct: 124 ESAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIAR 183 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCD 242 +I QGGDYL AVKGNQ L A E +F + + D + SHGR ++ V Sbjct: 184 QITDQGGDYLLAVKGNQPTLLDAIETEFID-QYQSDDVDRHRQVHPSHGRIVAQIASVLP 242 Query: 243 VPDELIDF-TFE 253 + ++D + Sbjct: 243 A-EGIVDLADWP 253 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 221 bits (563), Expect = 2e-56, Method: Composition-based stats. Identities = 75/238 (31%), Positives = 117/238 (49%), Gaps = 3/238 (1%) Query: 10 ISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 +PD R H L D+L + + A I GAE D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVI 129 HDT +RV + P F F ++ D+ D V+AIDGK R S+D++ + A+HV+ Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGG 189 SAF++ +++GQ NEI A LL + D+KG ++ DA+ Q+ A+ I ++GG Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDEL 247 D+LF +K N+ L E F + + ++ HGR E R H V L Sbjct: 188 DWLFPLKDNRPALRAEVERYF--ADPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWL 243 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 221 bits (562), Expect = 2e-56, Method: Composition-based stats. Identities = 83/228 (36%), Positives = 123/228 (53%), Gaps = 2/228 (0%) Query: 22 VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+++LL T+ +I A +++IE G D+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 CPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIG 141 P +F W+ V AIDGK R S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGR 201 Q + KSNEITAIPELL+ L + G I+ DAMG QK IA K+ +G DY+ A+KGNQG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELID 249 L+ + F +L + + + HGR E R V D L + Sbjct: 181 LHDDVRDFFADPDL-LRECARHDDTCIGHGRIEERTCQVADASAWLTE 227 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 220 bits (561), Expect = 3e-56, Method: Composition-based stats. Identities = 93/253 (36%), Positives = 132/253 (52%), Gaps = 9/253 (3%) Query: 7 MEHISIIPDYRQA-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 M + I D R+ H +IL++ I AV+S + EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV-----IAIDGKIHRHSYDKS 120 NGIP +T R++ + P +F F W+ + D IAIDGK R S S Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GS 118 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + AIH++SAF+T LV+GQ K KSNEITAIPELL L IKG ++ DAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIV 240 A++I + GDYL VKGNQ +L +A E F + D + E+ HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAFID-QHGVESVDRSSRVERGHGRTVGQIASV 237 Query: 241 CDVPDELIDFTFE 253 + + Sbjct: 238 LSAKGIVDPADWP 250 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 220 bits (561), Expect = 3e-56, Method: Composition-based stats. Identities = 91/261 (34%), Positives = 137/261 (52%), Gaps = 14/261 (5%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + +E ++ I D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV------IAIDGKIHRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGK S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 S + A HV++AF++ LV+GQIKTD+KSNEITAIPELL + +K ++ DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD------SYAISEKSHG 231 K+IA KI ++GGDY+ AVKGNQ +L + + + EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REETRLHIVCDVPDELIDFTF 252 R E R + + Sbjct: 241 RIEKRECYLSNDLSWFEGLED 261 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 220 bits (561), Expect = 3e-56, Method: Composition-based stats. Identities = 90/249 (36%), Positives = 139/249 (55%), Gaps = 6/249 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + + E++S D R A+ +H DI+ L + AVISGA W +I+ FGE H D+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGK RHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNP- 115 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +I DAM QK I Sbjct: 116 ETQSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK-HDSYAISEKSHGREETRLHI 239 AEKI ++ GDY+ +K N + E F + P+ ++Y R + R + Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYR 235 Query: 240 VCDVPDELI 248 V D L Sbjct: 236 KLKVSDWLS 244 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 220 bits (560), Expect = 4e-56, Method: Composition-based stats. Identities = 83/251 (33%), Positives = 123/251 (49%), Gaps = 14/251 (5%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M ++ +IIPD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHR-----H 115 IP HDT++R S + F E F W+ D V+AIDGK Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKS 116 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 S K+ + ++++SA+S + + +GQ K ++KSNE AIPEL+ LD++ II DA+G Sbjct: 117 SNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIG 176 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP---KHDSYAISEKSHGR 232 CQK I + I + DY+ K N L E F L E + Y K HGR Sbjct: 177 CQKSITKLIIENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKRYFEENKGHGR 234 Query: 233 EETRLHIVCDV 243 E R + Sbjct: 235 SEYRECVCISA 245 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 220 bits (560), Expect = 4e-56, Method: Composition-based stats. Identities = 77/250 (30%), Positives = 126/250 (50%), Gaps = 8/250 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L+E + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD------KDVIAIDGKIHRHS 116 D GIP HDT RV I P F F+NW + + IA+DGK+ RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 +D+ + +H++SA++T LV+ Q D K E A+P +L L + G ++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK--HDSYAISEKSHGREE 234 ++++A+ I +G YL +K NQ +++ F + + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 TRLHIVCDVP 244 R C Sbjct: 242 RRRVFACPDA 251 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 218 bits (556), Expect = 1e-55, Method: Composition-based stats. Identities = 69/255 (27%), Positives = 115/255 (45%), Gaps = 8/255 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + + E +PD R A H L++IL + + A + GA D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS----DDKDVIAIDGKIHRHSY 117 +NG+P HDT +RV + P F ++F +M + K VIA+DGK R Y Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 + R +++A++ + + ++ NE +L+ +L +KG ++ DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRL 237 + +AE I+ +GGDY+ AVK NQ L + + + HGR+E R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQGKPSTI--TVDAGHGRKEKRR 239 Query: 238 HIVCDVPDELIDFTF 252 +V VP D F Sbjct: 240 AVVAAVPQMAQDHDF 254 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 216 bits (551), Expect = 5e-55, Method: Composition-based stats. Identities = 84/252 (33%), Positives = 126/252 (50%), Gaps = 11/252 (4%) Query: 6 LMEHISIIPDYRQA-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L + +PD R H L+DIL + CAVI+GAEGWEDI ++G + F +++ + Sbjct: 5 LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLEL 64 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS--------DDKDVIAIDGKIHRHS 116 +NG+P HDT RV + + P F + F W + + D +A+DGK R S Sbjct: 65 KNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRS 124 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 K G +H++ + +L++GQ + +EIT ++L LD+ G ++ DA GC Sbjct: 125 -AKPTFSGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGC 183 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK-ELNNPKHDSYAISEKSHGREET 235 Q + E I+ +GG+Y+ VKGNQ L A F E D + +HGR E Sbjct: 184 QTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEE 243 Query: 236 RLHIVCDVPDEL 247 R V PD L Sbjct: 244 RNVTVVHDPDGL 255 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 215 bits (548), Expect = 9e-55, Method: Composition-based stats. Identities = 80/244 (32%), Positives = 126/244 (51%), Gaps = 4/244 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L +H+S++ D R H L D+L L + AV SG +GW +I+ FGE ++L+++ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 NGIP TIAR++ + P +W+ D ++ K +IAIDGK R + Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLGC--N 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 +H + AF + L + Q K EI + L+ ML+I +I DA+ Q+ E I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVP 244 + GDY+ VK NQ L +A + ++ + ++ + +A SEK HGR E R I +P Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIP 237 Query: 245 DELI 248 +L Sbjct: 238 SKLS 241 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 215 bits (548), Expect = 1e-54, Method: Composition-based stats. Identities = 81/246 (32%), Positives = 139/246 (56%), Gaps = 3/246 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L+EH++++ + R +H L D++ L I A++SGAEGW DIE +G++ D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP T+AR++ CI E+ + W+ + + K +IA DGK+ R S+ + K A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++ DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDV-- 243 ++ + VK NQ +L +A + +F + E HGR+E R Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 244 PDELID 249 P EL + Sbjct: 248 PPELTE 253 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 214 bits (544), Expect = 3e-54, Method: Composition-based stats. Identities = 77/247 (31%), Positives = 122/247 (49%), Gaps = 7/247 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + I D RQA KV H++ ++L++ C+ + E + D+ DF ++ +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRR 122 ++G P HD V+ I P E W + IAIDGK R +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGW----CGDLEGRHIAIDGKALRGTHNAETG 116 Query: 123 KGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 + +H++ A+ + L GQI +KSNEI AIP LL L +KG + DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYA---ISEKSHGREETRLHI 239 +I G DY+ A+K N R ++ + F E + + E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDE 246 + + D Sbjct: 237 ITEELDW 243 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 209 bits (531), Expect = 9e-53, Method: Composition-based stats. Identities = 83/255 (32%), Positives = 125/255 (49%), Gaps = 12/255 (4%) Query: 3 LKKLMEHISIIPD------YRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPD 56 + +++ I I D RQ+WK+ + LS IL L ++G E +++EDF E + Sbjct: 1 MTTMIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEP 60 Query: 57 FLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-KDVIAIDGKIHRH 115 Y D G P HDT+ RV+S + + E + + S D +I++DGK R Sbjct: 61 LFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG 120 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 ++ + + +H+++A+ H L +GQ+ ++KSNEI AIP+LL +DI+ I+ DAMG Sbjct: 121 --NRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMG 178 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL---KELNNPKHDSYAISEKSHGR 232 Q I + I K DY AVKGNQ L F E Y EKS G+ Sbjct: 179 TQTAIVDTIIKGKADYCLAVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQ 238 Query: 233 EETRLHIVCDVPDEL 247 E R + V L Sbjct: 239 IEVREYWVSSDIKWL 253 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 209 bits (531), Expect = 9e-53, Method: Composition-based stats. Identities = 77/239 (32%), Positives = 127/239 (53%), Gaps = 1/239 (0%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 +++H+ I D R EH + DI L + AVISGA+ W +FG ++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 NGIP +I R+ + ++ ++W+ +Y + + IAIDGK+ + S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKG-AKASASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++ DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVP 244 K+GGD + VKGNQ +L +A + +F NNP + + + K HGR E R+ C + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLN 238 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 208 bits (529), Expect = 2e-52, Method: Composition-based stats. Identities = 67/254 (26%), Positives = 113/254 (44%), Gaps = 6/254 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + + + +PD R A V H L ++L++ +V+ G+ ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD-DKDVIAIDGKIHRHSYDK 119 + ++ IP HDT + V I P +F + D D D+IAIDGK R + D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 ++SA+++ L + + D + E++A E L ++D++GK++ DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHI 239 I GGD+ A+KGNQ L F ++P HGR+ETR + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSDPTA---VTENTGHGRKETRKAV 244 Query: 240 VCDVPDELIDFTFE 253 V F Sbjct: 245 VVSAKALAEYHEFP 258 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 206 bits (523), Expect = 8e-52, Method: Composition-based stats. Identities = 81/249 (32%), Positives = 123/249 (49%), Gaps = 9/249 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L E + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYD 118 G +P HDT+ R +S + F ++ W+ + S+ I IDGK R Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-VK 122 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 K HV+SAFS + Q+ D+K+NEI AI +LL++LD+ G ++ DA+G Q Sbjct: 123 KLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQT 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLH 238 I E+I +GGDY+ VK NQ + E F + D +E SHGR ETR + Sbjct: 183 AIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRRY 240 Query: 239 IVCDVPDEL 247 P E+ Sbjct: 241 ESILNPLEI 249 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 206 bits (523), Expect = 9e-52, Method: Composition-based stats. Identities = 75/231 (32%), Positives = 116/231 (50%), Gaps = 10/231 (4%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS------YD 118 IP HDT R S I P F F NW+ V+AIDGK+ R + Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 + + + ++SA+S + + +GQ+K D KS+EITAIP L+N L++ G I+ DAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLN---KAFEEKFPLKELNNPKHDSYAIS 226 DI + I +Y+ A+K N+ + K + + ++ + + Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRDEIINRVIRHVSE 233 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 203 bits (517), Expect = 4e-51, Method: Composition-based stats. Identities = 84/254 (33%), Positives = 128/254 (50%), Gaps = 12/254 (4%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL E S IPD+R+A K + HKLSDI++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDK-----DVIAIDGKIHRH 115 NGIP T+ R+ I + +H ++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 + K+ R I +SA S + + ++KSNEI A+P L++ +DI GKI+ DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 QKDI +KI+++ GD++ +K NQ L E+K +P + E HGR ET Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKEL---SPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELID 249 R + V D D + + Sbjct: 270 RSYRVFDGTDLIAN 283 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 202 bits (514), Expect = 9e-51, Method: Composition-based stats. Identities = 78/284 (27%), Positives = 119/284 (41%), Gaps = 39/284 (13%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDK------------------- 103 P HDT+ R I + + W + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 104 -DVIAIDGKIHRHSYDKSR--------------RKGAIHVISAFSTMHSLVIGQIKTDKK 148 IAIDGK + + + +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDIK-GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFE 207 NEI AIP+LL+ +DI+ G ++ DA+G QK I EKI ++ DYL VK N +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPKHDS---YAISEKSHGREETRLHIVCDVPDELI 248 ++ ++D + + HG TR I C P L Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLG 300 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 201 bits (512), Expect = 1e-50, Method: Composition-based stats. Identities = 67/255 (26%), Positives = 116/255 (45%), Gaps = 16/255 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 E+ L+E ++ +PD R V H L+ +L LT CAV++GA + ++ P+ L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFE-------NGIPVHDTIARVVSCICPAKFHESFINWM-LDYHSSDDKDVIAIDGKIH 113 P TI RV++ I + W+ + +A+DGK Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKIIKTD 172 R + R+ +H+++A + LV+ Q+ +K+NEIT LL+ L D+ G ++ +D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGR 232 A+ Q D A ++ + Y+ VK N +L+ + P +++ HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKS-LPWQQIPLQD----RTRTTGHGR 270 Query: 233 EETRLHIVCDVPDEL 247 E R VC V + L Sbjct: 271 CEIRRLKVCTVNNLL 285 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 200 bits (509), Expect = 3e-50, Method: Composition-based stats. Identities = 65/241 (26%), Positives = 111/241 (46%), Gaps = 7/241 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ +PD R +H L +IL + + AV+ GA ++E F + D L+Q+ E Sbjct: 3 FLDVFGEVPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS----DDKDVIAIDGKIHRHSYDKSR 121 G P HDT +RV++ + P +E+F+ +M + K +A+DGK R +Y K R Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 V++ F + + Q ++ E+ A L +L +KG + DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC 241 + ++ GG Y+ A+KGNQ +L + E +HGR E R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKA-AAGKATKFHQTEEDAHGRHEVRRAFVI 239 Query: 242 D 242 Sbjct: 240 P 240 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 197 bits (502), Expect = 2e-49, Method: Composition-based stats. Identities = 64/249 (25%), Positives = 105/249 (42%), Gaps = 6/249 (2%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 ++ +PD R A H L ++L++ +V+ GA ++ FG + + + Sbjct: 37 ILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDY-HSSDDKDVIAIDGKIHRHSYDKSRRKG 124 + +P HDT + V I P +F + D D DVIA+DGK R + D Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 ++SA++ L + + D + E+ A E L ++ +KGK++ DA+ C + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVP 244 GGD+ A+K NQ L F + +P + HGR ETR V Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEPDAHPSA---LSEDIGHGRTETRKATVVSSK 271 Query: 245 DELIDFTFE 253 F Sbjct: 272 ALAEHHEFP 280 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 196 bits (498), Expect = 6e-49, Method: Composition-based stats. Identities = 77/252 (30%), Positives = 124/252 (49%), Gaps = 10/252 (3%) Query: 3 LKKLMEHISIIPDYRQ--AWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 +K L E + +PDYR+ ++KL DILLL I + DI FG+ + + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD---DKDVIAIDGKIHRHSY 117 G +G+P T+ R+ I E + +H D++ IDGK R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 ++ R I +SA+S + + ++KSNEIT++P+LL+ +D+ G I+ DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRL 237 K I +KI+++GGD+L +K NQ L E+ L E + + + HGR ETR+ Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDVYSEGPFLE---HGRIETRV 252 Query: 238 HIVCDVPDELID 249 + D + D Sbjct: 253 CRIFRGNDLITD 264 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 193 bits (490), Expect = 5e-48, Method: Composition-based stats. Identities = 70/255 (27%), Positives = 124/255 (48%), Gaps = 10/255 (3%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 ++ I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ Sbjct: 3 AEIWNAIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYT 62 Query: 64 FENG-------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 +G +P HDT V I P +F E + +++ + + IAIDGK R Sbjct: 63 KLSGKELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG- 121 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 ++ +++SA+ T H VI I ++ K +E+++I +L+ +L ++ + DA G Sbjct: 122 IKQTANSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGT 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETR 236 ++ E I +GG+++ VKGNQ +L + E++F N D + HGR E R Sbjct: 182 YVEVIEMILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSAD--TQEDIGHGRVEKR 239 Query: 237 LHIVCDVPDELIDFT 251 D Sbjct: 240 TVYCITEIKTDDDID 254 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 191 bits (484), Expect = 2e-47, Method: Composition-based stats. Identities = 100/197 (50%), Positives = 130/197 (65%), Gaps = 13/197 (6%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L L +H + + D RQA KV +KL D+L L + AVISGAEGWE+IEDFG +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM DK V+A+DGK Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TD+KSNEITA+PELL +L+++G ++ DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVKG 197 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVKK 184 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 188 bits (478), Expect = 1e-46, Method: Composition-based stats. Identities = 71/246 (28%), Positives = 126/246 (51%), Gaps = 4/246 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L++H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG D+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 +GIP IA ++ + ++ W+ D K +IA+DGK R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 +H++SAF + + + +KK +E ++++ L + ++ DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC--DV 243 + D++ +KGNQ A + ++P + HGR+E R + ++ Sbjct: 182 SKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRRVMQIEGNL 240 Query: 244 PDELID 249 P EL + Sbjct: 241 PPELSE 246 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 187 bits (475), Expect = 3e-46, Method: Composition-based stats. Identities = 81/286 (28%), Positives = 120/286 (41%), Gaps = 39/286 (13%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 L+ + + I D RQ KV H+ I++ + V + + W ++ DF DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINW--------------------MLDYHSSDD 102 P HDT+ R +CP + W + + Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 KDVIAIDGKIHRHSYDKSRRK--------------GAIHVISAFSTMHSLVIGQIKTDKK 148 IAIDGK + + ++ RR+ +H++SAFS L +GQ + DKK Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNMLDI-KGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAF- 206 NEI AIP LL+ LDI +G ++ DAMG QKDI +I K+ YL VK NQ L + Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDF 250 F L N + + E HG R VC L Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKI 302 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 65/184 (35%), Positives = 94/184 (51%), Gaps = 10/184 (5%) Query: 68 IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS------YDKSR 121 IP HDT R S I P F F NW+ V+AIDGK+ R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D KSNEITAIP L+N L++ G I+ DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGNQGR---LNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLH 238 + I + +Y+ A+K N+ + L K + + K+ + + HGR ETR Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCD 242 V Sbjct: 183 TVVS 186 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 184 bits (467), Expect = 2e-45, Method: Composition-based stats. Identities = 65/248 (26%), Positives = 116/248 (46%), Gaps = 8/248 (3%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ P + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD-VIAIDGKIHRHSYDKS 120 +P TI +V + + +D +A+DGK R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T KSNEI + LL +DI G ++ DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQ-GGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHI 239 A +++Q +Y+ VK NQ L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSDP---VERGHGREEHRSYK 275 Query: 240 VCDVPDEL 247 + V L Sbjct: 276 ILTVARGL 283 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 183 bits (465), Expect = 4e-45, Method: Composition-based stats. Identities = 73/215 (33%), Positives = 103/215 (47%), Gaps = 1/215 (0%) Query: 38 ISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDY 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F F + Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 ++V+A+DGK R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 LL L + G I+ DAMGCQ IAE+I+ +G D L +K N G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 PKHDSYAISE-KSHGREETRLHIVCDVPDELIDFT 251 + HGR R V L + Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFVDAAATALAPLS 218 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 183 bits (464), Expect = 6e-45, Method: Composition-based stats. Identities = 70/257 (27%), Positives = 107/257 (41%), Gaps = 38/257 (14%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHES 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWMLDYHSSDDK--------------------DVIAIDGKIHRHSYDKSR-------- 121 + W + IAIDGK + + + Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK-GKIIKTDAM 174 +H++SAF + SL +GQ + K NEI AIP+LL+ +DI+ G ++ DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDS---YAISEKSHG 231 G QK I EKI ++ DYL VK N +L + E ++ ++D + + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REETRLHIVCDVPDELI 248 TR I C P L Sbjct: 241 FMVTRTCISCSEPSRLG 257 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 181 bits (460), Expect = 2e-44, Method: Composition-based stats. Identities = 58/246 (23%), Positives = 113/246 (45%), Gaps = 9/246 (3%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL-KQY 61 + L+E + + D+R+ H L +L++ I + G G+ ++ +F + + L +++ Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINW-MLDYHSSDDKDVIAIDGKIHRHSYDK- 119 +P + TI RV+ + + F W + +Y DD + + +DGK +++ Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -SRRKGAIHVISAFSTMHSLVIGQIKTDKK-SNEITAIPELLNMLDIKGKIIKTDAMGCQ 177 + ++ I +S FS LV+ + + K +EI ++ ++ K+ DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRL 237 K I K DY+ VKGNQ L K ++ ++ + + SHGR+ +R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDL----SNSSKPESCFLEQDNSHGRKISRK 236 Query: 238 HIVCDV 243 V V Sbjct: 237 IEVFKV 242 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 180 bits (457), Expect = 3e-44, Method: Composition-based stats. Identities = 66/252 (26%), Positives = 109/252 (43%), Gaps = 8/252 (3%) Query: 7 MEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFEN 66 + + I D R H L+++L L + A + GA+ +I +F E LK+ + Sbjct: 5 LSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-----KDVIAIDGKIHRHSYDKSR 121 G P HDT +R+ I P + + ++ + V+A+DGK R Y+K R Sbjct: 64 GCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGR 123 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIA 181 ++S + L + + + S+E+ A LL +D+KG I+ DA+ C+ D A Sbjct: 124 AFMPPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCRPDTA 182 Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVC 241 + + + Y A+K N+GRL E F + + E HGR ETR V Sbjct: 183 KALIGRKAHYALALKANRGRLFACAEAGFVAADAAG-DLAFHETRETGHGRLETRRASVL 241 Query: 242 DVPDELIDFTFE 253 + F Sbjct: 242 PLKAFKQAPAFP 253 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 180 bits (457), Expect = 4e-44, Method: Composition-based stats. Identities = 61/237 (25%), Positives = 105/237 (44%), Gaps = 18/237 (7%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHE 88 +L+ + G + +TH + L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 SFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKK 148 +F+ W+ + S + +A+DGK + +K++ + +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRNT-HLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEE 208 +NEIT IPELL +LDI G I+ DA+G Q I E+I +QGG + VK NQ + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPLKELNNPKH--------------DSYAI---SEKSHGREETRLHIVCDVPDELI 248 E + + + Y EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLT 236 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 176 bits (446), Expect = 6e-43, Method: Composition-based stats. Identities = 61/261 (23%), Positives = 113/261 (43%), Gaps = 22/261 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + L+E ++ +PD R+ V ++ + +L + +CA++SGA + I ++ P + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-------------KDVIAI 108 +P TI RV+ + A + W+ + D + V+A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGK 167 DGK R + +H++ +V+ Q+ D+K+NEI +L+ + D+ Sbjct: 167 DGKAMRAT---RHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 IIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISE 227 +I DAM Q A+ + +G L VK NQ ++ + P K++ + + Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKT-LPWKDVPV----GHTTTG 278 Query: 228 KSHGREETRLHIVCDVPDELI 248 + HGR ETR VP L Sbjct: 279 RGHGRIETRTLKAVTVPAGLG 299 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 176 bits (446), Expect = 7e-43, Method: Composition-based stats. Identities = 61/229 (26%), Positives = 101/229 (44%), Gaps = 5/229 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+E ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 NG +P +TIA ++ + P + W+ D H D + +A+DGK S + + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGS--RDGQV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD-IKGKIIKTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++ DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHG 231 +Q +GGD + K NQG L E F + G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRG 229 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 176 bits (445), Expect = 8e-43, Method: Composition-based stats. Identities = 59/264 (22%), Positives = 95/264 (35%), Gaps = 27/264 (10%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L+E + +PD R+ V L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYH--------------SSDDKDVIA 107 P T RV+ P E+ W VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKIHRHSYDKSRRKGAI--HVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML--- 162 DGK R + ++ V+ V+ + +EI A+ ++ L Sbjct: 151 ADGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVACEPVND-GDEIGAVRTVMGRLADR 209 Query: 163 --DIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH 220 + G ++ TDA Q + E++ GG +L VK NQ R+ P ++ Sbjct: 210 WGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRILAKVRA-LPWAQVRAQD- 267 Query: 221 DSYAISEKSHGREETRLHIVCDVP 244 K+HGR ETR V P Sbjct: 268 ---TCRGKAHGRAETRTVRVVQAP 288 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 174 bits (440), Expect = 3e-42, Method: Composition-based stats. Identities = 56/228 (24%), Positives = 101/228 (44%), Gaps = 14/228 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ LM+ +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENGI-------PVHDTIARVVSCICPAKFHESFINWMLDYHSSDD----KDVIAIDG 110 F P T+ R + I + W V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIK 170 K R + K++ IH ++AF +V+ Q D+K+NEI + LL ++I+G+I+ Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKIQK-QGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 DA+ Q + A I + + DY+F VK NQ + + E P + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIES-LPWEAFPP 446 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 172 bits (437), Expect = 6e-42, Method: Composition-based stats. Identities = 54/256 (21%), Positives = 104/256 (40%), Gaps = 13/256 (5%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L+ + +PD+R V ++L+ +L L + I+G + + ++ P + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS----DDKDVIAIDGKIHRHSY 117 F +P TI R+V P + ++ W +A DGK+ + + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRKGAIH--VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 + + V+ A +G + +EI ++ L+N + ++ TD + Sbjct: 144 SRPPQGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLVTTDCLH 202 Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 + +A I+ +GG +LF++KGNQ + P E N + EK+HGR E Sbjct: 203 AHEPLARAIRAKGGHWLFSIKGNQPTVRAKL-AGLPWDEFGN----QHVTREKAHGRIEE 257 Query: 236 RLHIVC-DVPDELIDF 250 R L+ F Sbjct: 258 RALKALTPSAPSLVGF 273 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 172 bits (437), Expect = 7e-42, Method: Composition-based stats. Identities = 64/253 (25%), Positives = 110/253 (43%), Gaps = 16/253 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDF 57 ++ L+ + I D R+A + LS +L + A ++GA G +I DFG+ Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQYGDFENG---IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD--VIAIDGKI 112 L D G P I + + A +F W+ + + + V+A+D K+ Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 HRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGKII-K 170 R ++ + ++ + +SA LV GQ++ +NEIT + LL L DI G ++ Sbjct: 141 LRGAWSEGNKRVTL--LSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 TDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSH 230 DA+ Q + A + + G DY VKGNQ L + + F K + + E+ H Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYR---KTFEQTLPLLQKPPQHEVEERGH 255 Query: 231 GREETRLHIVCDV 243 GR + + Sbjct: 256 GRIKKWQAWTTEA 268 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 172 bits (437), Expect = 8e-42, Method: Composition-based stats. Identities = 72/179 (40%), Positives = 104/179 (58%), Gaps = 3/179 (1%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ +PD R+ + H+L ++LL IC VISGAE W + + + D+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 +GI HDT RV S + ++F F+ W+ S + +AIDGK R S+D + Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHD--GARS 123 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+G I DAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPARH 182 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 168 bits (425), Expect = 2e-40, Method: Composition-based stats. Identities = 72/158 (45%), Positives = 98/158 (62%), Gaps = 1/158 (0%) Query: 94 MLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 M H +V+AIDGK R SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+ DAM CQ IA+ I ++GGDYL AVKGNQG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 ELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFT 251 D+ EK GR E R + V D + DF+ Sbjct: 121 RRAPIDRDTCQ-IEKQKGRVEARTYHVLSASDLIRDFS 157 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 168 bits (425), Expect = 2e-40, Method: Composition-based stats. Identities = 61/218 (27%), Positives = 98/218 (44%), Gaps = 3/218 (1%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + L E +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDK 119 F G P T++R + P + + W+ + IA+DGK R S + Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGS--R 118 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + H+++A++ V+ Q++ D K+NE A LL +L + G ++ DAM CQ+D Sbjct: 119 DGQVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNN 217 +A + G DY+ K NQ L + E ++ Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFEDAAR 216 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 166 bits (421), Expect = 5e-40, Method: Composition-based stats. Identities = 57/260 (21%), Positives = 104/260 (40%), Gaps = 19/260 (7%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDF------ 57 L+ ++ +PD R V H L +L + AV++GA + ++ P Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 58 -LKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS--DDKDVIAIDGKIHR 114 + + P T R+++ + ++ W+L + + V ++DGK R Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAM 174 S +H+++ V+GQ+ D K+NE+T LL LD+ ++ DA+ Sbjct: 147 GS---GPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIAE-KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGRE 233 Q++ A + + Y+F VK NQ RL + + P ++ S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKT-LPWTKIPIQD----ETSTRGHGRY 258 Query: 234 ETRLHIVCDVPDELIDFTFE 253 + R L F Sbjct: 259 DIRRLQAVTCTGPLA-LDFP 277 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 166 bits (420), Expect = 7e-40, Method: Composition-based stats. Identities = 65/245 (26%), Positives = 105/245 (42%), Gaps = 10/245 (4%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L+ + + D R H L ++L L + A + GA+ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPRDV-NARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD----KDVIAIDGKIHRHSYD 118 +G P HDT +RV + P + +F +M + K V+AIDGK R YD Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 K R ++S + I ++ +EI A +L L +KG + DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLH 238 +A+ + Y +K N G L +A E F + E+ HGREE R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGFA----AVTDLAVFETRERGHGREEQRRA 234 Query: 239 IVCDV 243 V V Sbjct: 235 SVLPV 239 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 160 bits (404), Expect = 5e-38, Method: Composition-based stats. Identities = 52/227 (22%), Positives = 103/227 (45%), Gaps = 15/227 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFG----ETHPDF 57 +++ L + +PD R +H L IL + + AV++ A+ + + ++ + Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 58 LKQYGDFEN---GIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHR 114 ++ + P T+ RV+ + W+L + +A+DGK+ + Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAWLLGIA---GFEAVAVDGKVLK 335 Query: 115 HSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAM 174 + + + +H++SAF I Q + +K+NEI + LL +DI+ K++ DA+ Sbjct: 336 GAVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADAL 393 Query: 175 GCQKDIAEKIQK-QGGDYLF-AVKGNQGRLNKAFEEKFPLKELNNPK 219 Q+ A + + + DYLF AVKGNQ +L + P + + Sbjct: 394 HTQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLI-CLPWGDFPPQR 439 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 159 bits (402), Expect = 8e-38, Method: Composition-based stats. Identities = 57/261 (21%), Positives = 109/261 (41%), Gaps = 21/261 (8%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 E++ L + ++ +PD R + H+L IL L+ AV +G + E+I + P + Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 62 GDFENGI-------PVHDTIARVVSCICPAKFHES---FINWMLDYHSSDDKDVIAIDGK 111 P DT+ RV+S + + + F + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 IHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLD----IKGK 167 R + R A H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGR--APHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IIKTDAMGCQKDIAEKIQ-KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAIS 226 ++ DA+ + A+ I + G ++F VK N L+ + ++ ++ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI----GHSAE 271 Query: 227 EKSHGREETRLHIVCDVPDEL 247 ++HGR E R + + + Sbjct: 272 GRAHGRFERRTIQLAQASEAI 292 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 159 bits (402), Expect = 8e-38, Method: Composition-based stats. Identities = 56/223 (25%), Positives = 97/223 (43%), Gaps = 19/223 (8%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPD-FLKQYGDFENGI- 68 + + D R+A + H +LL+ + V++G +E I + + L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 69 -----PVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRK 123 P TI R++S P + +++ + IAIDGK R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYIVAH---SSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTDK-KSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++ DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAI 225 +I+++GGDY+F VK N+ L +P D Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDDLLRT 440 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 159 bits (402), Expect = 9e-38, Method: Composition-based stats. Identities = 60/275 (21%), Positives = 106/275 (38%), Gaps = 34/275 (12%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISG-----AEGWEDIEDFGETHPDFL 58 + E ++ IPD+R A + + L + + +CAV + A E + T L Sbjct: 22 AGIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRL 81 Query: 59 KQYGDFENG--IPVHDTIARVVSCICPAKFHESFINWMLDYHSSD--------------- 101 + + +G +P TI R ++ + + ++ L +D Sbjct: 82 RLPWNPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAG 141 Query: 102 ----DKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 A+DGK R + + +H++ + ++GQ + D KSNE T Sbjct: 142 DQAVPVRAYAVDGKTSRGAKRADGSQ--VHLLGVAAHGAGALLGQREIDAKSNETTEFRA 199 Query: 158 LLNMLDIKGKIIKTDAMGC-QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 LL L++ G + DA+ + ++ + ++ YL K NQ +L AF P E+ Sbjct: 200 LLAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLR-AFLAALPWTEIP 258 Query: 217 NPKHDSYAISEKSHGREETRLHIVCDVPDELIDFT 251 ++ HGREETR V V Sbjct: 259 TADL----TRDRGHGREETRTLKVATVTHLDFPHA 289 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 154 bits (389), Expect = 3e-36, Method: Composition-based stats. Identities = 58/293 (19%), Positives = 107/293 (36%), Gaps = 50/293 (17%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVIS-GAEGWEDIEDFGETHPDF------ 57 L++ ++I D R H L+ IL + CA ++ G + IE + + P Sbjct: 29 DLIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLH 88 Query: 58 -LKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD------------ 104 + + P TI RV++ + + ++ + Sbjct: 89 IWRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRR 148 Query: 105 ---------------------VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQI 143 A+DGK + + + +H+IS + + + V Q Sbjct: 149 TEREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQR 206 Query: 144 KTDKKSNEITAIPELLNM---LDIKGKIIKTDAMGCQKDIAE-KIQKQGGDYLFAVKGNQ 199 + KS+EI A+ LL D+ G +I DA+ Q+ A I++ Y+ VK NQ Sbjct: 207 QIPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQ 266 Query: 200 GRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFTF 252 L+ + + ++ + + HGR E R+ P + IDF + Sbjct: 267 PTLHATAITALTGTD-TDFAAVTHRETHRGHGRTEYRILR--TAPADGIDFPY 316 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 53/164 (32%), Positives = 82/164 (50%), Gaps = 3/164 (1%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVI 106 + + L+ + NG P DT RV+ I P + + + S + I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKG 166 AIDGK + S K+ H++SA+ L + Q +K NE+ AIPE+L+ LD+ G Sbjct: 61 AIDGKRLKGSKKKTGS---THILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 KIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 +I DAMG Q +IAE+I + DY+ ++KGNQ L + + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 54/142 (38%), Positives = 76/142 (53%), Gaps = 3/142 (2%) Query: 101 DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLN 160 VIAI+GK R + + A+H +SA++ + L +GQ+ +KSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 MLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH 220 L ++G ++ DA+GCQ +AE+I GGDY+ AVK NQ L A + F Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 221 DS---YAISEKSHGREETRLHI 239 + +K HGR ETR Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. Identities = 63/142 (44%), Positives = 90/142 (63%), Gaps = 4/142 (2%) Query: 106 IAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIK 165 +AIDGK R S+D + IH++SA+S+ +L +GQ++T KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHD--GARSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDS--Y 223 G I DAMGCQ DIAE+I ++G DY+ VKGNQ L +A + F + + + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AISEKSHGREETRLHIVCDVPD 245 + ++K+HGR ETR + + Sbjct: 119 SQTDKNHGRIETRRCVATNDVA 140 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 150 bits (380), Expect = 3e-35, Method: Composition-based stats. Identities = 57/226 (25%), Positives = 92/226 (40%), Gaps = 12/226 (5%) Query: 28 DILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFH 87 +L + + A +G G+ + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ESFINWMLDYHSSDDKD---VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIK 144 ++ + +S D IA+DGK+ R + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGAL--RAKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDKKSNEITAIPELLNMLDIK-GKIIKTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRL 202 +KSNEI + LL +L ++ DAM Q A+ I YL VK NQ ++ Sbjct: 120 VAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELI 248 P E+ D + HGR ETR + + Sbjct: 180 LARI-TALPWAEVPAAATD----DSRGHGRVETRTLQIITAARGIG 220 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 150 bits (380), Expect = 3e-35, Method: Composition-based stats. Identities = 60/293 (20%), Positives = 105/293 (35%), Gaps = 50/293 (17%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVI-SGAEGWEDIEDFGETHPDFLKQ 60 +++ L+ + D R A V +++S +L L +CA+ +G + ++ Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 61 YGDFEN-------GIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD----------- 102 IP T+ V+ + P + + + + S+ Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGG 149 Query: 103 ---------------------KDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIG 141 + IA+DGK R + + + V+SA + + Sbjct: 150 IEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLA 207 Query: 142 QIKTDKKSNEITAIPELLNMLDI---KGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 + K+NEI LL+ LD KG ++ DA+ Q+D A + ++G YL +K N Sbjct: 208 SREIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNN 267 Query: 199 QGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFT 251 Q + P KE+ D + HGR E RL V V L Sbjct: 268 QRGQARQLHA-LPWKEIPVIHRDDA----RGHGRHEQRLVQVVTVNGLLFPHA 315 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 149 bits (376), Expect = 8e-35, Method: Composition-based stats. Identities = 49/187 (26%), Positives = 89/187 (47%), Gaps = 4/187 (2%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + L+ + +PD R+A + L +L+ T+ A++SGA + I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 61 YGDFE-NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD---KDVIAIDGKIHRHS 116 + + PV +T+ V+ + ++F + K V+A+DGK R S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 +D + A ++AF + ++V+ + D KSNEI A +++ L + G + DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIAEK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 144 bits (363), Expect = 3e-33, Method: Composition-based stats. Identities = 55/226 (24%), Positives = 91/226 (40%), Gaps = 12/226 (5%) Query: 28 DILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFH 87 +L + + A + G+ + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ESFINWMLDYHSSDDKD---VIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIK 144 ++ + +S D IA+DGK+ R + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGAL--RAKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDKKSNEITAIPELLNMLDIK-GKIIKTDAMGCQKDIAEKIQKQ-GGDYLFAVKGNQGRL 202 +KSNEI + LL +L ++ DAM Q A+ I YL VK NQ ++ Sbjct: 120 VAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELI 248 P E+ D + HGR +TR + + Sbjct: 180 LARI-TALPWAEVPAAATD----DSRGHGRVKTRTLQIITAARGIG 220 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 144 bits (362), Expect = 4e-33, Method: Composition-based stats. Identities = 52/129 (40%), Positives = 78/129 (60%), Gaps = 1/129 (0%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 ++H+++A+ + +L++GQ+K D KSNEITAIP+LL ML ++G I+ DAMGCQK IA++ Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNP-KHDSYAISEKSHGREETRLHIVCD 242 I + DY+ AVK NQ L + + F ++N H + + HGR ETR + Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYSTIV 120 Query: 243 VPDELIDFT 251 D L T Sbjct: 121 GDDLLAGIT 129 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 142 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 58/154 (37%), Positives = 87/154 (56%), Gaps = 3/154 (1%) Query: 102 DKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNM 161 D+IA+DGK R SYD++ K AIH++SA+ST + LV+GQ+KT++KSNE TAIP+L + Sbjct: 6 PGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKLFTL 65 Query: 162 LDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD 221 L ++ + DA+G Q+DIA++I + DYL VK NQ L++ + + E D Sbjct: 66 LALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGFTED 125 Query: 222 SYAI-SEKS--HGREETRLHIVCDVPDELIDFTF 252 +E+ HGR + V L Sbjct: 126 FTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALAD 159 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 134 bits (337), Expect = 3e-30, Method: Composition-based stats. Identities = 49/180 (27%), Positives = 82/180 (45%), Gaps = 4/180 (2%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+E ++ +PD+R A + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 -ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDK--SR 121 P T RV+ I F NW+ ++D + +DGK + + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 122 RKGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDI 180 + I+V+S FS + I Q +K+ +EI + LL LD++G + D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 134 bits (336), Expect = 4e-30, Method: Composition-based stats. Identities = 47/183 (25%), Positives = 82/183 (44%), Gaps = 4/183 (2%) Query: 20 WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGI-PVHDTIARVV 78 H L +L L AV+ G + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSL 138 I P + + W+ + D + +A+DGK R S + H ++A++ + Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGS--RDGDVPGPHRVAAYAPHAAA 119 Query: 139 VIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGN 198 V+GQI+ D ++NE A LL ++ + G ++ A C +D+A + GG Y+ +G Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQG- 178 Query: 199 QGR 201 Q Sbjct: 179 QPT 181 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 53/180 (29%), Positives = 83/180 (46%), Gaps = 5/180 (2%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL-KQY 61 + L + + IPD+R+A L +LL +I A++SGA + I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSR 121 G P + +I + + F VIA+DGK R S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARLAE--GAAVIALDGKTLRGSLDRFE 118 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTD--KKSNEITAIPELLNMLDIKGKIIKTDAMGCQKD 179 + A V+SAF+T +V+GQI + K +EI A L+ L + G++ DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 129 bits (325), Expect = 6e-29, Method: Composition-based stats. Identities = 43/190 (22%), Positives = 72/190 (37%), Gaps = 6/190 (3%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L+ H+ IPD R V +LL+ + ++S E D+E F H L + Sbjct: 12 DLISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGI 71 Query: 65 E-NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDK--DVIAIDGKIHRHSYDK-- 119 E P + A + +W L D + DGK R S + Sbjct: 72 ELKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTS 131 Query: 120 SRRKGAIHVISAFSTMHSLVIGQ-IKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 I ++ +S + I Q + +E + +LL LD++G +I+ DA+ Q+ Sbjct: 132 GGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 Query: 179 DIAEKIQKQG 188 Q +G Sbjct: 192 AFFGSSQSRG 201 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats. Identities = 50/117 (42%), Positives = 66/117 (56%), Gaps = 3/117 (2%) Query: 133 STMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYL 192 S +LV+GQ K + KSNEITAIP L+ ML+I+ II DAMGCQK+I I+K+ GDY+ Sbjct: 28 SLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYI 87 Query: 193 FAVKGNQGRLNKAFEEKF---PLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDE 246 +K NQ L + +E F +E + +H Y E H R E R I V Sbjct: 88 ITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSSL 144 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 124 bits (312), Expect = 3e-27, Method: Composition-based stats. Identities = 42/185 (22%), Positives = 69/185 (37%), Gaps = 8/185 (4%) Query: 70 VHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVI 129 + ++ + F S +K + DGK R S + +++G V+ Sbjct: 21 SRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGSIESGKKRGQA-VV 79 Query: 130 SAFSTMHSLVIGQIKTDK-KSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQG 188 I Q D K +EI + LL+ D+ + I DA+ E I K G Sbjct: 80 QIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALHLCPSTTEMITKAG 139 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELI 248 G +L +K NQ L + P D + +HGR E R + + DV + Sbjct: 140 GVFLIGLKENQPTLLAHMTDC------ALPPIDQKTTFDFNHGRVEQRKYWLYDVSKQGF 193 Query: 249 DFTFE 253 D ++ Sbjct: 194 DPRWD 198 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 3/126 (2%) Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 H++SA++T H + +G + T++KSNEITAI LL L K ++ DAMGCQKDIA I Sbjct: 3 PRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARNI 62 Query: 185 QKQGGDYLFAVKGNQGRLNKAFEE---KFPLKELNNPKHDSYAISEKSHGREETRLHIVC 241 GGD++ AV+ NQ +L A K E +H ++ HGR + R + Sbjct: 63 VAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWGA 122 Query: 242 DVPDEL 247 VP + Sbjct: 123 QVPPDF 128 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 48/167 (28%), Positives = 81/167 (48%), Gaps = 9/167 (5%) Query: 3 LKKLMEHISIIPDYRQAWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 L+KL E S IPD+R+A K + HKL D+++L I +S +I +FG+ + ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD-----DKDVIAIDGKIHRH 115 NGIP T+ R+ I + +H ++++ IDGK R Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML 162 + K+ R I +SA S + + ++KSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 120 bits (300), Expect = 6e-26, Method: Composition-based stats. Identities = 43/187 (22%), Positives = 83/187 (44%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L + +S +PD R A + L +L L + A +S + +E F +P L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 P H I ++ + P K + + +D +V+ +DGK R S + Sbjct: 60 RKAPGHTAITLLLHRLDPEKLQAALG---QVFPEADLGEVLVVDGKHLRGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML---DIKGKIIKTDAMGCQKDIAE 182 + ++ + + Q + + + E A ELL+ L +++GK++ DA ++A Sbjct: 115 VKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAA 172 Query: 183 KIQKQGG 189 +++K+GG Sbjct: 173 RVRKKGG 179 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 48/204 (23%), Positives = 74/204 (36%), Gaps = 21/204 (10%) Query: 50 FGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAID 109 FG + +LK GI H T + V C+ F + + Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAAL------------PKPLQRA 90 Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ + NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 KTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKS 229 DA+ C+ D A I GGDY A+K NQ L + E + +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGREETRLHIVCDVPDELIDFTFE 253 H R E R + V D F Sbjct: 206 HDRCERRRACIVAVN----DIDFP 225 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 43/112 (38%), Positives = 67/112 (59%), Gaps = 3/112 (2%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K + SNEITAIPELL +L++ G I++ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGNQGRLNKAFEEKFP---LKELNNPKHDSYAISEKSHGREETR 236 DY+ +K NQG L ++ E+ F +H +Y E HG E R Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIR 112 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. Identities = 37/157 (23%), Positives = 73/157 (46%), Gaps = 8/157 (5%) Query: 98 HSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 ++ + IA+DGK + S + + H++SA + + + +++ K+NE T Sbjct: 126 ATAGPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKP 183 Query: 158 LLNMLDIKGKIIKTDAMG-CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELN 216 LL LD+ ++ DA+ + +I+ ++ + Y+ +K NQ + P +++ Sbjct: 184 LLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQL-ATLPWRDIP 242 Query: 217 NPKHDSYAISEKSHGREETRLHIVCDVPDELIDFTFE 253 +A SE HGR E+ C +PDEL + Sbjct: 243 V----QHAASEVGHGRRESSSIKTCAIPDELGGIAYP 275 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 114 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 85/187 (45%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFE 65 L E +S IPD R A ++ L +L L + A +S + +E F +P L G Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 P H + ++ + P K E+ + + +D +V+ +DGK + S + Sbjct: 60 RKPPGHTILTLLLHRLDPEKLQEAL---LQVFPGADLGEVLVVDGKHLKGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML---DIKGKIIKTDAMGCQKDIAE 182 + ++ + + Q K + + + A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGRED--QALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 49/108 (45%), Positives = 60/108 (55%), Gaps = 3/108 (2%) Query: 143 IKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRL 202 + T+ KSNEITAIP LL L+ K ++ DAMGCQKDIA I GGD++ AVK NQ +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDEL 247 A EK EL +H +Y HGR + R H V VP Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPGF 108 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 107 bits (268), Expect = 3e-22, Method: Composition-based stats. Identities = 44/211 (20%), Positives = 90/211 (42%), Gaps = 14/211 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK-- 59 + + +++ IPD R+ K +H+ D+LL+ + AV SG + + + FL Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 60 ---QYGDFENGIPVHDTIARVVSCI--CPAKFHESFINWMLDYHSSDDKD-----VIAID 109 + E +P T+ R+ + ++ ++W + + K+ +A+D Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVD 124 Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIPELLNMLDIKGKI 168 GK R + R + A+ +SA L +G Q D ++ + + L + + Sbjct: 125 GKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGV-DWV 183 Query: 169 IKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 + DA C +++A + +Q G A KG + Sbjct: 184 LTGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 42/113 (37%), Positives = 65/113 (57%), Gaps = 4/113 (3%) Query: 3 LKKLM-EHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 ++ L E S IPD R +H +I+ L + +V++GA+ + +IEDF E H D+LK Y Sbjct: 1 MEGLFVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTY 60 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS---DDKDVIAIDGK 111 + NGIP HDT +RV S I PA F +SF+ W+ + + + I ++ K Sbjct: 61 FNLPNGIPSHDTFSRVFSAINPASFQDSFLIWLKAINDAFMYASQRPICLNFK 113 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 60/150 (40%), Gaps = 9/150 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L ++ + +PD +A H+L +L L A + G +G++ + ++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 62 GDFENG-----IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 +P I + + P W + ++ +A+DGKI + Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAWQAA--QLNSEEALAMDGKIMKGG 124 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTD 146 D + + H++S + Q K+ Sbjct: 125 VDHTGAQT--HIVSLIGHESKHCVAQKKSA 152 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 36/96 (37%), Positives = 49/96 (51%), Gaps = 2/96 (2%) Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLK 213 AIPELL LD++G + DA+G Q IA I + G DY+ AVK NQ RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPKHDS--YAISEKSHGREETRLHIVCDVPDEL 247 + + +K HGR ETR+ V + L Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWL 96 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 28/129 (21%), Positives = 53/129 (41%), Gaps = 6/129 (4%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 + L ++ +PD R + L IL + +CAV++GA + I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSCICPAKFHESFINWMLDY------HSSDDKDVIAIDGKIHRHSY 117 F + +P T+ R++ I + W+ VIA+DGK+ R + Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKSRRKGAI 126 ++ A+ Sbjct: 149 LRAAGPSAL 157 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 56/150 (37%), Gaps = 5/150 (3%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAK 85 L+ +L L V++G + + + ++ P L GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FHESFINWM--LDYHSSDDKDVIAIDGKIHRH--SYDKSRRKGAIHVISAFSTMHSLVIG 141 E+ W+ +A DGK + S+ ++ V+ A + G Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITAG 167 Query: 142 QIKTDKKSNEITAIPELLNMLDIKGKIIKT 171 + +EI A+ L LD+ ++ T Sbjct: 168 HQRVVG-GDEIAALEALAGRLDLTDVLVTT 196 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 100 bits (248), Expect = 6e-20, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 73/222 (32%), Gaps = 37/222 (16%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 + G P ++T+ +++C+ WM I DGK+ S Sbjct: 13 RWRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGG-IRADGKVLGGS 71 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGC 176 K A+H + + + + Q + + A+ LL + G+++ DA Sbjct: 72 --KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFL 128 Query: 177 QKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFP------------------------- 211 + + I ++ G+YL VKG+Q ++ P Sbjct: 129 NAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPR 188 Query: 212 --------LKELNNPKHDSYAISEKSHGREETRLHIVCDVPD 245 +EL + E+S GR E R V D D Sbjct: 189 RKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGD 230 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 99.8 bits (247), Expect = 8e-20, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 57/148 (38%), Gaps = 9/148 (6%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L ++ + D R+ H++S +L + A + G +G++ I + +Q Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 62 GDFE-----NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 IP I V+ P + + + D D +A DGK +++ Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNEDQGLEDTC--LAFDGKTMKNA 331 Query: 117 YDKSRRKGAIHVISAFSTMHSLVIGQIK 144 D++ R+ H+ S Q K Sbjct: 332 IDENARQT--HIASVVGHESKTTHTQKK 357 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 97.8 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 68/212 (32%), Gaps = 34/212 (16%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICA-VISGAEGWEDIEDFGETHPDFLKQYG 62 + + E + + D R + + + +C+ +G + + + Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 63 DFEN------GIPVHDTIARVVSCICPAKFHESFINWML--------------------- 95 +P TI + + + + ++ L Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAA 141 Query: 96 ----DYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNE 151 + + +A+DGK RH+ K +H++ S ++ Q++ + K+NE Sbjct: 142 PVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAKTNE 199 Query: 152 ITAIPELLNMLDIKGKIIKTDAMGCQKDIAEK 183 LL LD+ ++ DA+ + + Sbjct: 200 TAVFRRLLRPLDLTNVLVTADALHTVRANLDT 231 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 97.4 bits (241), Expect = 4e-19, Method: Composition-based stats. Identities = 48/230 (20%), Positives = 81/230 (35%), Gaps = 37/230 (16%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L E I+ + D R V+ +S I + + + + +E + K+ Sbjct: 16 VYHLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKAL 71 Query: 63 DFENGIPVHDTIARVVSCICPAKFHES--------FINWMLDYHSSDDKDVIAIDGKIHR 114 + +P DTI RV+S +E N + + D V+AIDG Sbjct: 72 PKKTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELF 131 Query: 115 HSYDKSRRKG--------------AIHVISAFSTMHSLVIGQIKTDKKSN-------EIT 153 S K V S + L++GQ + K + EIT Sbjct: 132 ESTKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEIT 191 Query: 154 AIPELLNMLDIK----GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 A L+ L + II DA+ C+ +++ G D + VK + Sbjct: 192 AGKRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKDER 241 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 97.1 bits (240), Expect = 5e-19, Method: Composition-based stats. Identities = 25/129 (19%), Positives = 54/129 (41%), Gaps = 6/129 (4%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 +++ L ++ + +PD R+A H+L + LT A + G +G++ + ++ + +Q Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 62 GDFENG-----IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHS 116 +P I + + P W +S D + +A+DGKI + Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGG 177 Query: 117 YDKSRRKGA 125 D + + Sbjct: 178 VDHTGAQTQ 186 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 70/241 (29%), Gaps = 36/241 (14%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + + D R+ +++ I + E E ++ + +Q Sbjct: 38 VYGFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLV 93 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDK--------DVIAIDGKIHR 114 +P HDT+ + + + E + Y V AIDG Sbjct: 94 PKNIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELF 153 Query: 115 HSYDKSRRKGAI--HVISAFSTMHSLVIGQIK------------------TDKKSNEITA 154 H+ + H H++V+ Q DK E T Sbjct: 154 HTKAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTV 213 Query: 155 IPELLNML-DIKGKI---IKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF 210 L+ + + GKI DA+ + + G + +K + R+ K F Sbjct: 214 AQRLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACF 273 Query: 211 P 211 Sbjct: 274 A 274 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 92.1 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 29/117 (24%), Positives = 48/117 (41%), Gaps = 6/117 (5%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L+ S I D R+ + L+ +LL T+ A+++GA + ++ F TH D L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 FE-NGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDD-----KDVIAIDGKIHR 114 P + T+ ++ I + +F + L IAIDGK Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWC 119 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 33/90 (36%), Positives = 50/90 (55%), Gaps = 1/90 (1%) Query: 119 KSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 S A+H++SAF + +V+ Q+ +KSNEI A ELL LDI G + DAM Q+ Sbjct: 2 ASETVKAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQR 61 Query: 179 DIAE-KIQKQGGDYLFAVKGNQGRLNKAFE 207 + A ++ + D++ VK NQ L +A Sbjct: 62 EHARFAVEDKRADFVMTVKDNQPELREALA 91 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 90.5 bits (223), Expect = 4e-17, Method: Composition-based stats. Identities = 33/131 (25%), Positives = 54/131 (41%), Gaps = 2/131 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF- 64 L ++ IPD+R+A + L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 3 LKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQLH 62 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDV-IAIDGKIHRHSYDKSRRK 123 PVH +I + + +F + IA+DGK R + + R Sbjct: 63 WKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSRT 122 Query: 124 GAIHVISAFST 134 SA Sbjct: 123 ARPLRYSAHWP 133 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 90.5 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 48/118 (40%), Gaps = 9/118 (7%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH 115 + P T+ RV+ I + NW+L +A+DGK Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLG--LSPAALAVDGKTLAG 130 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 3/79 (3%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH---DSYAISEKSH 230 MGCQK+IAE I +Q DY+ AVK NQ L++A ++ F N + D KSH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GREETRLHIVCDVPDELID 249 GR E+R V L D Sbjct: 61 GRIESRRCWVGYDALPLTD 79 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 87.0 bits (214), Expect = 6e-16, Method: Composition-based stats. Identities = 28/97 (28%), Positives = 42/97 (43%), Gaps = 4/97 (4%) Query: 155 IPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 +L +++ GK I DA+ QK +AE I + YLF VK NQ L + F ++ Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEHRK 61 Query: 215 LNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFT 251 D HGR +TR +E ++F Sbjct: 62 EP----DYCLQDPPGHGRIDTRSIWTTTELNEYLEFP 94 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 86.3 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 62/163 (38%), Gaps = 16/163 (9%) Query: 51 GETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS---DDKDVIA 107 G+ D L +Y +F+N P + + + + I P F F + + + + +IA Sbjct: 53 GKALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFLFQEFTKSFTDNVTYNGLRLIA 112 Query: 108 IDGKIHRHSYDK------------SRRKGAIHVISAFS-TMHSLVIGQIKTDKKSNEITA 154 DG +++ + +H+ + + I+ + +NE A Sbjct: 113 CDGSDLCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDAIIQPSRLANERRA 172 Query: 155 IPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 + E+++ + I D +I ++ +G YL VK Sbjct: 173 MCEMIDRYNDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKD 215 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 29/87 (33%), Positives = 39/87 (44%), Gaps = 4/87 (4%) Query: 170 KTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPL---KELNNPKHDSYAIS 226 + D +GCQK IA+ I +Q DYL AVK NQ L++A F D Sbjct: 6 RCDGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKI 65 Query: 227 EKSHGREETRLHIV-CDVPDELIDFTF 252 K GR E R V ++PD + + Sbjct: 66 NKGPGRLEQRRCWVGYEIPDTINSQNW 92 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 84.3 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 4/122 (3%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 ++H I D R +H L +I+LL I AV+SG+EGWE IE+FG D+L Q+ Sbjct: 5 ATFLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRP 64 Query: 64 FENGIPVHDTIARVVSCI--CPAKFHESFINWMLDYHSSDDKDVIAIDG--KIHRHSYDK 119 F+ GIP HDTIARV+ + + + + DY + + G + H + Sbjct: 65 FKAGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQR 124 Query: 120 SR 121 Sbjct: 125 EG 126 Score = 68.6 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 22/74 (29%), Positives = 31/74 (41%), Gaps = 4/74 (5%) Query: 178 KDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPKHDSYAISEKSHGREE 234 K+IA+ I KQ DY+ A+KG+ L E + + D + + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 TRLHIVCDVP-DEL 247 TR V L Sbjct: 147 TRRCQQVLVNKSWL 160 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 84.3 bits (207), Expect = 4e-15, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 41/86 (47%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 + ++ + + + D R +H+ DI+++ +C V+ G +G I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSCICPAKF 86 + + NG+P D I + + P F Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 83.6 bits (205), Expect = 5e-15, Method: Composition-based stats. Identities = 33/128 (25%), Positives = 50/128 (39%), Gaps = 13/128 (10%) Query: 66 NGIPVHDTIARVVSCICPAKFHESFINWMLD----YHSSDDKDVIAIDGKIHRHSYDKSR 121 PV+ ++ ++ I P +F + IAIDGK R S+D Sbjct: 9 RRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFS 68 Query: 122 RKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELL---------NMLDIKGKIIKTD 172 A +V+SAF+ H +++ D+KSNEI A L+ I + D Sbjct: 69 DTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVMLD 128 Query: 173 AMGCQKDI 180 AM I Sbjct: 129 AMTFAPAI 136 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 83.2 bits (204), Expect = 8e-15, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 43/85 (50%) Query: 7 MEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFEN 66 ++H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSCICPAKFHESFI 91 GIP HDT RV+ + P + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 82.8 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 35/75 (46%), Positives = 53/75 (70%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 +++++E + + D R A + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 75/236 (31%), Gaps = 26/236 (11%) Query: 10 ISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP 69 + +PD R L++IL + +++GA + E+ + ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCICPAKFHESFINWMLD-------YHSSDDKDVIAIDGK-----IHRHSY 117 T + + + + V+A+DGK H Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 118 DKSRRKG--------AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNML-DIKGK- 167 +++ + S I + ++NE +L L + G Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 168 --IIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD 221 ++ DA + + G DY+FA+K + + K E E+ + D Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARRED 255 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 41/110 (37%), Positives = 60/110 (54%), Gaps = 2/110 (1%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 ++H + D R +H L DI+LL I AV+SG+EGWEDIE+FG D+L+QY Sbjct: 5 ATFLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRP 64 Query: 64 FENGIPVHDTIARVVSCI--CPAKFHESFINWMLDYHSSDDKDVIAIDGK 111 F+ GIP HDTIARV+ + + + + DY + + G+ Sbjct: 65 FKAGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGE 114 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 81.7 bits (200), Expect = 2e-14, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 67/196 (34%), Gaps = 29/196 (14%) Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYH-------SSDDKDVIAIDGKIHR 114 G P T+ R+++ PA E+ + D V++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 HSYDKSRRKGAIH----------------VISAFSTMHSLVIGQIKTDKKSNEITAIPEL 158 D + KGA + S+ +GQ K E TA L Sbjct: 153 SRTDGEKVKGAQQSAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFRRL 212 Query: 159 L----NMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 L L + +I+ DA C ++ AE + G Y+F +K NQ L+ + + Sbjct: 213 LPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLH-DIARDYGQYD 271 Query: 215 LNNPKHDSYAISEKSH 230 L P A + H Sbjct: 272 LGTP-LARTAERYRGH 286 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 81.3 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 27/74 (36%), Positives = 49/74 (66%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + +++H S + D RQ+W+V + L +I LL +CA +SG E + +I +G+ +FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTI 74 + +E G+P HDT+ Sbjct: 77 FLPYERGLPAHDTL 90 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 80.5 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 29/138 (21%), Positives = 48/138 (34%), Gaps = 11/138 (7%) Query: 42 EGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD 101 + + P G + P I R++ I P + W+ + Sbjct: 221 RATSALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAP 280 Query: 102 DK---DVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE- 157 IA+DGK R S + A HV++A +V+ D K+NEIT Sbjct: 281 APGSRRAIAVDGKTLRGSRTRDSA--ARHVLAAADQHTGIVLASTDVDTKTNEITRFTAS 338 Query: 158 -----LLNMLDIKGKIIK 170 LL+ I+ ++ Sbjct: 339 GSHADLLSSRCIRSGVVS 356 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 78.6 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 31/133 (23%), Positives = 47/133 (35%), Gaps = 13/133 (9%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + L E ++ + D R+ H +LL+ AV++GA + I ++ P + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENG-------IPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRH 115 P TI RV+ CP + H D +AIDGK R Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SYDKSRRKGAIHV 128 S S R Sbjct: 115 SRLGSTRPPIYWP 127 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 77.8 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 28/112 (25%), Positives = 47/112 (41%), Gaps = 6/112 (5%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF- 64 L ++S IPD+R+A + L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 3 LKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQLH 62 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD-----VIAIDGK 111 P H +I + + +F D VI + K Sbjct: 63 RKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 77.8 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 55/166 (33%), Gaps = 17/166 (10%) Query: 51 GETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS----DDKDVI 106 G T L + DF+ P + + I P F F + + + + ++ Sbjct: 54 GCTLNKELLDFFDFDVNAPTVSAYTQQRAKILPEAFEYLFHAFTEENAQTKNLYEGYQLL 113 Query: 107 AIDG------------KIHRHSYDKSRRKGAIHVISAFSTMHSLVI-GQIKTDKKSNEIT 153 A DG + S +H+ + + ++ I ++T E Sbjct: 114 ACDGSNLTIAPNLNDPETLWKSNQLGATGNHLHLNALYDVLNRTYIDALVQTASTYQEHR 173 Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 A +++ + + I+ D +I ++G +L +K Sbjct: 174 ACIQMIERVTLDKVILIADRGYENYNIMSHAIEKGWKFLIRIKDVH 219 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 77.8 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 4/108 (3%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG-IPVHDTIARVVSCICPAKFH 87 +L L + AV++G E I FG P L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTM 135 W+ D H D D IA+DGK S + H+++A++ Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGS--RDGAVPGTHLLAAYAPQ 107 >UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus pneumoniae RepID=B2ISL1_STRPS Length = 535 Score = 77.4 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 79/231 (34%), Gaps = 31/231 (13%) Query: 18 QAWKVEHKLSDILLLTICAVISGAEGWEDIEDF-GETHPDFLKQYGDFENGIPVHDTIAR 76 Q + SDIL+ + +++G ++ + + L + G T++R Sbjct: 142 QRRYCRYSDSDILVQFLFQLLTGYGTDYACKELSADAYFPKLLEGGQL----ASQPTLSR 197 Query: 77 VVSCICPA----------KFHESFINW--MLDYHSSDDKDVIAIDGKIHRHSYDKSRRKG 124 +S + E F+ + + D GK +Y+ R Sbjct: 198 FLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTYGKQEGVAYNAHYRAH 257 Query: 125 AIHVISAFSTMHSLVI-GQIKTDKK--SNE----ITAIPELLNMLDIKGKIIKTDAMGCQ 177 H + AF Q++ + S E IT + E N L + + D+ Sbjct: 258 GYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQL-----LFRMDSGFAT 312 Query: 178 KDIAEKIQKQGGDYLFAVKGNQ--GRLNKAFEEKFPLKELNNPKHDSYAIS 226 + + I+K G YL +K N RL ++L H +Y+ + Sbjct: 313 PKLYDLIEKTGQYYLIKLKKNTVLSRLGDLSLPCPQDEDLTILPHSAYSET 363 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 77.0 bits (188), Expect = 5e-13, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 69/252 (27%), Gaps = 45/252 (17%) Query: 8 EHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 E IPD R L D+L+ + A + T L+ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLAFQR--RTLDHNLRHVFGL-TG 81 Query: 68 IPVHDTIARVVSCICPAKFHESFIN--------WMLDYHSSDDKDVIAIDGK-------- 111 P + V+ + P F + +LD + D V+A+DG Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 112 ------IHRHSYDKSRRKGAIHVISAFSTMHSLVIG------QIKTDKKSN--EITAIPE 157 RH+ + + S V+ Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNML----DIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVK-GNQGRLNKAF-----E 207 L ++ DA +QK +L VK + L + Sbjct: 202 WLGRFREEHPDLAVLVVEDARSSNAPHVRDLQKARCHFLLGVKAADHAHLFAHVCARQDQ 261 Query: 208 EKFPLKELNNPK 219 F + E +P+ Sbjct: 262 HAFEVVEDADPR 273 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 77.0 bits (188), Expect = 6e-13, Method: Composition-based stats. Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 26/165 (15%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFG--ETHPDFLK 59 +LKKL+E S IPD R+A V+H+L+ +LL + + + + L+ Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSRPAFLQALQ 138 Query: 60 QYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLD--------YHSSDDKDVIAIDG- 110 +P DT+ARV+ I P K ESFI + H + IAIDG Sbjct: 139 GLFPELETLPHGDTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIAIDGT 198 Query: 111 -KIHR-------------HSYDKSRRKGAIHVISA-FSTMHSLVI 140 K+ R + D + + I+V+ A F + L I Sbjct: 199 QKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTI 243 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 26/78 (33%), Positives = 35/78 (44%), Gaps = 4/78 (5%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKF---PLKELNNPKHDSYAISEKSH 230 MGCQK+IA+ I KQ DY+ A+KG+ L E + + D + + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GREETRLHIVCDVP-DEL 247 GR ETR V L Sbjct: 61 GRIETRRCQQVLVNKSWL 78 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 2/88 (2%) Query: 161 MLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKH 220 M +KG ++ DAMGCQ+ IA+++++ G D + ++KGNQG+ A F ++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSYAISEKSHGREETRLHIVCDVPDE 246 + E SHGR R V + E Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLTPE 88 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 73.9 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 30/79 (37%), Positives = 42/79 (53%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREE 234 + ++ +KI ++ DYL AVKGNQG L AF++ F LNN + Y E+S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 TRLHIVCDVPDELIDFTFE 253 +R V L D + E Sbjct: 71 SRAAFVSHDLSVLGDISDE 89 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 73.6 bits (179), Expect = 5e-12, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 45/106 (42%), Gaps = 3/106 (2%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAK 85 ++ +L +CAV++GA + D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FHESFINWMLDYHSSDDK---DVIAIDGKIHRHSYDKSRRKGAIHV 128 + +W+ VIA+DGK+ R + R A+ + Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWM 106 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 72.8 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 38/81 (46%), Positives = 52/81 (64%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M + + E++S D R A+ +H DI+ L + AVISGA W +I+ FGE H D+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCI 81 Y FE GIPV DTIARV+ I Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 72.4 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 28/69 (40%), Positives = 42/69 (60%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQ 60 M L+ H + I D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIP 69 G G+P Sbjct: 72 KGILTEGVP 80 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 28/81 (34%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 +PD R V H+ S IL + A +GA + I ++ P +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSCICPAKFHESFI 91 T R ++ + + Sbjct: 109 ESTSRRFLAGVDATALDQVLG 129 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 68.6 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 34/100 (34%), Gaps = 1/100 (1%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFL-KQYGDF 64 + S + D R+A + L +L + +++SG+ ++ F E L + +G Sbjct: 8 FGDVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTS 67 Query: 65 ENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKD 104 P I + + + +F S Sbjct: 68 WRKAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 65.9 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 19/73 (26%), Positives = 28/73 (38%), Gaps = 1/73 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + +PD R H L+ IL + I A++ GAE D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFENGIPVHDTIA 75 I Sbjct: 60 PLPYASRCWRDIR 72 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 64.3 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 5/77 (6%) Query: 155 IPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 + +L IK I DA+ CQK E I ++ Y+ VK NQ L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDT----- 56 Query: 215 LNNPKHDSYAISEKSHG 231 N ++++ ++K HG Sbjct: 57 AKNSPLNAWSWTQKGHG 73 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 63.9 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 22/96 (22%), Positives = 40/96 (41%), Gaps = 4/96 (4%) Query: 84 AKFHESFINWMLDY-HSSDDKDVIAIDGKIHRHSYD--KSRRKGAIHVISAFSTMHSLVI 140 F + WM +D D + DGK R S D I +S +S + I Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQ-IKTDKKSNEITAIPELLNMLDIKGKIIKTDAMG 175 Q +S+E ++ LL+ +++ +++ D +G Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQADEVG 97 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 63.2 bits (152), Expect = 9e-09, Method: Composition-based stats. Identities = 27/57 (47%), Positives = 41/57 (71%) Query: 97 YHSSDDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEIT 153 Y S + ++ DGK R S+D+S K AIH++SA+++ +SLV+GQ+KTD+KSNE Sbjct: 17 YQKSLKEKSLSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 62.4 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 14/64 (21%), Positives = 27/64 (42%), Gaps = 2/64 (3%) Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPD 245 ++ DY+ A+KGN + + ++ F + +K HGR E R++ + Sbjct: 7 EKDNDYILALKGNHPLMEQEVKDFF--LSPVTSTRSVHTTFDKGHGRIERRIYTLDTNIG 64 Query: 246 ELID 249 D Sbjct: 65 WFED 68 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 28/68 (41%), Gaps = 1/68 (1%) Query: 182 EKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK-HDSYAISEKSHGREETRLHIV 240 EKI ++ GDY+ +K N + E F + P+ +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELI 248 V D L Sbjct: 61 LKVSDWLS 68 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 30/56 (53%) Query: 159 LNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKE 214 L+M D+ + DA+G Q IAE+I + G DY+ A+K NQ +A F E Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAE 72 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 57.0 bits (136), Expect = 6e-07, Method: Composition-based stats. Identities = 16/66 (24%), Positives = 37/66 (56%), Gaps = 4/66 (6%) Query: 6 LMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHPDFLKQY 61 L++ SI+PD R + L +++++T+ AV+ GA+ W D+ + +G++ +++ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDVPVGSKKYGDSCMQVVREK 61 Query: 62 GDFENG 67 +G Sbjct: 62 CCLTSG 67 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 57.0 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 45/123 (36%), Gaps = 14/123 (11%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHK----LSDILLLTICAVISGAEGWEDIEDFGETHPDF 57 EL L+ + IPD R K HK L LL+ + S E ++ Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS--------DDKDVIAID 109 L++ +P DT+ R++ I A ++ ++ + + IAID Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 110 GKI 112 G Sbjct: 193 GSQ 195 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 20/63 (31%), Gaps = 1/63 (1%) Query: 8 EHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENG 67 E IPD R V H+L +L L AV+ G G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAAEGPGDPTGEGCRWP 128 Query: 68 IPV 70 P Sbjct: 129 RPG 131 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 1/71 (1%) Query: 183 KIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCD 242 I + GDYL VKGNQ +L +A E F + + D A+ E+ HGR ++ V Sbjct: 1 MIIAKKGDYLLMVKGNQPKLLEAIEIAFID-QHDVKSVDRSALVERGHGRTVGQIASVLS 59 Query: 243 VPDELIDFTFE 253 + + Sbjct: 60 AKGIINPGDWP 70 >UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MIZ4_ALKOO Length = 218 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 60/189 (31%), Gaps = 33/189 (17%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 + + E I+ + D R V+ +S I + + + + + + E K+ Sbjct: 16 VYDIGEKINTLKDKRVKSPVK--VSTISFVVLFGFMLQIRSFNRLNHWIE--KGKFKKVV 71 Query: 63 DFENGIPVHDTIARVVSCICPAKFHESFI--------NWMLDYHSSDDKDVIAIDGKIHR 114 + +P D++ R ++ N + + D V AIDG Sbjct: 72 PKKTKMPCIDSVRRFLADFDLHGLKNMHSHIVKTSIKNKVFRSGTVDGLKVAAIDGVELF 131 Query: 115 HSYDKSRRKG--AIH------------VISAFSTMHSLVIGQIKTDKK-------SNEIT 153 S K +H + S + L++GQ + K E+T Sbjct: 132 ESTKKCCNNCLTRVHKDEITHYFHRSVICSTVGSDPHLILGQEMLEPKRDGSNKDEGEVT 191 Query: 154 AIPELLNML 162 L+ L Sbjct: 192 GGKRLIKKL 200 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 7/78 (8%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREET 235 K + I + G DY+ AVKGNQ RL++ + L +E+ R T Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQIK----LTTEQRLPVSLDITTERRSDRITT 56 Query: 236 RLHIVCDVPDELIDFTFE 253 R V D+L +++ Sbjct: 57 RSVSVF---DDLSGISYD 71 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 61/182 (33%), Gaps = 22/182 (12%) Query: 55 PDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSD---DKDVIAIDGK 111 D L ++ DF P + S I P F F + ++AIDG Sbjct: 20 KDELLKFNDFSITTPSASAFVQARSKIKPEAFRTLFDGFNKKTFKKKLYHGYRLLAIDGS 79 Query: 112 I--------------HRHSYDKSRRKGAIHVISAFSTMHSLVIG-QIKTDKKSNEITAIP 156 RH A H+ +++ M I+ + K +E A Sbjct: 80 ELPIDNTIFDDETTVLRHGTLAKTFS-AYHLNASYDLMERTYDDIIIQGEAKRDEHGAFC 138 Query: 157 ELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKG--NQGRLNKAFEEKFPLKE 214 +L++ D + I D + E + G YL V+ +Q + K+ FP E Sbjct: 139 QLVDRYDGQKAIFIADRGYESYNGFEHVVHSGHKYLIRVRDIESQSSITKSL-GPFPDGE 197 Query: 215 LN 216 + Sbjct: 198 FD 199 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 54.3 bits (129), Expect = 3e-06, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 33/64 (51%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYG 62 ++ L + D R+ +H+L IL++ +CAVI+ AE +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFEN 66 Sbjct: 61 PLPC 64 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 28/62 (45%), Positives = 30/62 (48%) Query: 55 PDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHR 114 LKQYG FE GI HDTI +VSCI F + FI WM A DGK R Sbjct: 9 RGLLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVR 68 Query: 115 HS 116 S Sbjct: 69 RS 70 >UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=A7B831_RUMGN Length = 366 Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 60/186 (32%), Gaps = 20/186 (10%) Query: 53 THPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINW---MLDYHSSDDKDVIAID 109 + L Y F P + + + F F + D ++A D Sbjct: 64 SLKKELLDYFQFSVDTPSASAFCQQRNKLLLEAFQFLFYEFNSCFSFEKKYKDYQLLACD 123 Query: 110 GKIHRHSYDK------------SRRKGAIHVISAFS-TMHSLVIGQIKTDKKSNEITAIP 156 G + + R IH+ + F + I+ + NE A+ Sbjct: 124 GSDLNIARNPNDAGTYFQSQPTDRGFNQIHLNALFDLCEKRYIDLVIQPARLENESLAMT 183 Query: 157 ELLNMLDIKGK-IIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKEL 215 ++++ + K I D +I +Q++G YL VK G + F L + Sbjct: 184 QMIDRYKGEKKTIFIADRGYETYNIFAHVQEKGMYYLIRVKDGGGG---SMTGSFDLPDE 240 Query: 216 NNPKHD 221 N HD Sbjct: 241 NEFDHD 246 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 21/64 (32%), Gaps = 3/64 (4%) Query: 193 FAVKGNQGRLNKAFEEKFPLKE---LNNPKHDSYAISEKSHGREETRLHIVCDVPDELID 249 AVK NQ L E L + +K HGR ETR + D P Sbjct: 2 LAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFEP 61 Query: 250 FTFE 253 + Sbjct: 62 DLWP 65 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 53.1 bits (126), Expect = 8e-06, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 32/84 (38%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 L +S IPD R+ + L +L L + AV+ GA I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIARVVSCICPAKFHE 88 + P T+ + + + E Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 53.1 bits (126), Expect = 9e-06, Method: Composition-based stats. Identities = 12/79 (15%), Positives = 27/79 (34%), Gaps = 1/79 (1%) Query: 60 QYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDK 119 + P T+ + I +F W+ + + +AIDGK+ R ++ Sbjct: 27 DHFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCAQI-ARGRVALAIDGKVLRGAWSG 85 Query: 120 SRRKGAIHVISAFSTMHSL 138 A ++ + + Sbjct: 86 DESVTAAYLHTHVRGNWGI 104 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 29/74 (39%), Gaps = 10/74 (13%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEK--------FPLKELNNPKHDSYAI 225 M Q D+ +Q++GGDY+ K NQG L E FP D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SE--KSHGREETRL 237 E K HG E R Sbjct: 61 CEVSKGHGWVERRT 74 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 5/59 (8%) Query: 195 VKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDFTFE 253 +K NQ + P + + S HGR E+R C + DEL F Sbjct: 2 IKRNQPTTYRQL-AALPWPDSAV----QHTASSAGHGRRESRSIKTCGIADELGGIAFP 55 >UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanobacteria RepID=B2IT45_NOSP7 Length = 435 Score = 50.4 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 76/242 (31%), Gaps = 37/242 (15%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIED-FGETHPDFLKQY 61 ++ + +PD R +++SD L + + + + + Q Sbjct: 11 VQYFQSILKDLPDKRTGKNKRYQMSDAALSAFSIFFTQSPSFLAHQRSMAHSKGHNNAQS 70 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESF---------INWMLDYHSSDDKDVIAIDGKI 112 + IP + I ++ I P F + + S + +IA+DG Sbjct: 71 LFGVHQIPSDNHIRDLLDEIEPTVVFPVFTKIFKALENGKHLSKFRSFKNNLLIALDGTE 130 Query: 113 HRHSYD-----------KSRRKGAIHVISA---FSTMHS--------LVIGQIKTDKKSN 150 + S + K+ H + +S V+ Q K+ Sbjct: 131 YFCSNEIHCEHCSSRTFKNGTTQYFHTVVTPVIVCPSNSQVIPLIPEFVVPQDGYQKQDC 190 Query: 151 EITAIPELLNMLDIK----GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVK-GNQGRLNKA 205 E A + + G I D + C + + E + ++ +++ + + L + Sbjct: 191 ENAAAKRWIQKYAKQYASLGITILGDDLYCHQPLCELLLQEKLNFILVCRSKSHKTLYEW 250 Query: 206 FE 207 E Sbjct: 251 LE 252 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 50.1 bits (118), Expect = 7e-05, Method: Composition-based stats. Identities = 31/54 (57%), Positives = 36/54 (66%), Gaps = 6/54 (11%) Query: 1 MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTI------CAVISGAEGWEDIE 48 MELKKLMEHISIIPDYRQAWKVEHKL DIL + C ++ G+ + Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRGFGETH 54 >UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteobacteria RepID=A6X872_OCHA4 Length = 330 Score = 49.3 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 73/226 (32%), Gaps = 22/226 (9%) Query: 17 RQAWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTI 74 R+ + I IC + + E L + + E +P H T Sbjct: 53 RKTRGGQCRYSDLAIETTLICG-----KVFNQPLRQTEGLMASLLRLLNVELPVPDHTTF 107 Query: 75 ARVVSCICPAKFHESFIN-----WMLDYHSSDDKDVI-AIDGKIHRHSYDKSRRKGAIHV 128 +R + + + + S + A +H +R+ +H+ Sbjct: 108 SRRCANLVVSSLTRCTRRDGTDEPLHVIVDSTGMKIYEAGQWLEEKHGAKSARKWLKLHL 167 Query: 129 ISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQG 188 A + VI + TD+ +++++ +P+LL+M+D D + ++ Sbjct: 168 --AIDADSNQVIAETLTDQNTSDLSQVPDLLDMIDRPIACFMADGAYDSDQTYQALRSHS 225 Query: 189 GDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSHGREE 234 + R+ E + + D ++ + GR E Sbjct: 226 PGVSIII---PPRIRDLQEASY----GPPDQRDWHSRTNAQRGRME 264 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 48.5 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 15/66 (22%), Positives = 21/66 (31%), Gaps = 4/66 (6%) Query: 186 KQGGDYLFAVKGNQGR-LNKAFEEKFPLKELNNPKHDS---YAISEKSHGREETRLHIVC 241 + L A + Q L A + F + + +K HGR ETR Sbjct: 92 RGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTAA 151 Query: 242 DVPDEL 247 D L Sbjct: 152 GDLDWL 157 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 48.5 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 69/207 (33%), Gaps = 19/207 (9%) Query: 11 SIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPV 70 I+PD R +V+H L +L I A+ +G E D G H L+ + + Sbjct: 49 KIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLND--HDGLRHDYALQTAVNRLQPLAG 106 Query: 71 HDTIARVVSCICPAKFHESFI----NWMLDYHSSDDKDVIAID-------GKIHRHSYDK 119 T+ R+ ++ +++ + + + V+ D G + Sbjct: 107 KSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGDQEGRFFHG 166 Query: 120 SRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEIT-AIPELLNMLDIKGK-----IIKTDA 173 + F H LV ++ + AI LL + + + D Sbjct: 167 YYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPETRIVFRGDG 226 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGNQG 200 C+ + + ++ DY+ + N Sbjct: 227 GFCRHRMLDWCDRKQVDYVVGLARNTR 253 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 48.5 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 24/65 (36%) Query: 4 KKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 L + +PD V H+L+ +L+ ICAV + I ++ P G Sbjct: 13 AGLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGG 72 Query: 64 FENGI 68 G Sbjct: 73 HRPGP 77 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 48.5 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 18/50 (36%), Positives = 30/50 (60%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFG 51 EL++L ++ + D R HKL +++L+ +CAVI+GA+G IE Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIEWLA 68 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 47.8 bits (112), Expect = 4e-04, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 27/50 (54%), Gaps = 1/50 (2%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVI-SGAEGWEDIEDF 50 + L+E ++ +PD R+ V H + +L + +CA++ +G+ ++ Sbjct: 57 DQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTAGSRQTRALKAV 106 >UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=A7BZU6_9GAMM Length = 270 Score = 47.4 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 58/194 (29%), Gaps = 24/194 (12%) Query: 18 QAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARV 77 + D +L + ++G + + F +T L + T Sbjct: 34 KHHNQIFNYYDFFILLMYYFVAGKQS---VGLFVKTELKLLPITLGLRQV--AYSTFNDA 88 Query: 78 VSCICPAKFHESF------INWMLDYHSSDDKDVIAIDG-------KIHRHSYDKSRRKG 124 P F E F I + S + IDG + Y Sbjct: 89 FERFSPNLFQEVFKYILSTIPFKQISELSTLGVLYCIDGSLFPVINSMLWAEYTSKHCAL 148 Query: 125 AIHVISAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKI 184 +H+ F +V+ + T +E A+ E+L G D ++ + Sbjct: 149 KLHLC--FELNRMIVVEFLVTAANGSERKALQEMLK----AGVTYIGDRGYMSFELCHLM 202 Query: 185 QKQGGDYLFAVKGN 198 ++ ++F +K N Sbjct: 203 MQKEAYFVFRLKRN 216 >UniRef50_C7G6U9 Putative uncharacterized protein (Fragment) n=7 Tax=Clostridiales RepID=C7G6U9_9FIRM Length = 212 Score = 47.0 bits (110), Expect = 6e-04, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 66/214 (30%), Gaps = 36/214 (16%) Query: 2 ELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 + K+ + I + D R+ + L +I++ + ++ E + I E+ LK Sbjct: 3 SVYKIPQKIKCLTDERKRKSI--PLFNIVMPVLLFLMLQYESFHTIFSAPESMSKRLKN- 59 Query: 62 GDFENGIPVHDTIARVVSCICPAKFHESF--------INWMLDYHSSDDKDVIAIDGKIH 113 IP D + ++S I P + N + + V +DG Sbjct: 60 -CISGRIPKVDAVRDLLSRINPDEIRSIHEEMIDIIKRNRVFREGTIGGYVVAGLDGVEL 118 Query: 114 RHSYDKSRRKG---AIHVISAFSTMHSLV-----------IGQIK------TDKKSNEIT 153 S KS H S+V +GQ + K E+T Sbjct: 119 FSSTKKSCPNCLSRKKHTGETEYFYRSVVCMIIGKSPHVILGQEMLKPRDGSGKDEGELT 178 Query: 154 AIPELLNMLDIK----GKIIKTDAMGCQKDIAEK 183 L+ L + +I DA+ Sbjct: 179 GGKRLIERLKKRHGHFADVIVADALYLNAPFINT 212 >UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipelotrichaceae RepID=B7C7E2_9FIRM Length = 446 Score = 46.6 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 62/203 (30%), Gaps = 38/203 (18%) Query: 20 WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVS 79 K +H L+ + G + D L + + P + + Sbjct: 49 RKRKHLFGSTLMNVLL-------------LEGGSLKDELYKLFGYNLDTPTVSSFIQARD 95 Query: 80 CICPAKFHESFINWMLDYHSS---DDKDVIAIDGKIH-------------RHSYDKSRRK 123 I P FH F + + ++A+DG + + + + Sbjct: 96 KIKPDTFHILFNLFNGRTRKPKLYNGYRLLAVDGSTLPITSEIKDKKTTIQKANNSDKPF 155 Query: 124 GAIHVISAFS----TMHSLVI-GQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 A H+ +++ T +++ GQ +E A+ +++ I D Sbjct: 156 SAFHLNTSYDILEYTYDDVILQGQAV----QDERDALNKMVERYKGDKAIFIADRGYESI 211 Query: 179 DIAEKIQKQGGDYLFAVKGNQGR 201 + EKI G YL VK Sbjct: 212 NSFEKIHLSGNKYLVRVKDIHST 234 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 12/45 (26%), Positives = 24/45 (53%) Query: 134 TMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQK 178 T + + Q++ + +NEIT LL+ D++ + DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 27/61 (44%) Query: 47 IEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVI 106 + F + + ++ D + G P DT+ RV + I P KF E F +W+L + Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWILFLMQKRKYKIS 60 Query: 107 A 107 Sbjct: 61 Q 61 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 11/61 (18%) Query: 5 KLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDF 64 +L + I D RQ K H L +L++TI +I + D+L+QY Sbjct: 34 RLADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 Query: 65 E 65 Sbjct: 83 T 83 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 80/256 (31%), Gaps = 29/256 (11%) Query: 5 KLMEHISI-IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGD 63 KL E ++ I D R +V H L+DIL I A+ G E D++ P F G Sbjct: 45 KLAEKLAAAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDLDRL-RNDPAFKLACGR 103 Query: 64 FENGIP---VHDTIARVVSCICPA---KFHESFIN-WMLDYHSSDDKDVIAID------- 109 + T +R+ + + ++ W+ Y + + ID Sbjct: 104 LPDSGQDLCSQPTCSRLENLPDLRTVIRLGRVLVDLWLSSYPAPPKSVTLDIDDTLDVVH 163 Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKK---SNEITAIPELLNML---- 162 G ++ + I + + I K EI L Sbjct: 164 GHQQLSLFNGHHDERCFLPIHIYDAATGRPVAMILRPGKTPSGKEIRGHLRRLARCIRAR 223 Query: 163 -DIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHD 221 +++ D+ + ++ ++ DY+F + GN K + + Sbjct: 224 WPDTRILVRGDSHYGRVEVMAWCEENAIDYVFGLAGN-----KVLKRLVDASADDIRTRR 278 Query: 222 SYAISEKSHGREETRL 237 + G ETR Sbjct: 279 ALEQKPVLRGYVETRY 294 >UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVT6_9GAMM Length = 120 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 11/94 (11%), Positives = 28/94 (29%), Gaps = 4/94 (4%) Query: 3 LKKLMEHISIIPDYRQAWK-VEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQY 61 L+ + + D R + L+D L + + +D + + Sbjct: 14 LRTVRACFEALDDPRSRPNSTRYTLADALSSALAMFLLKYPSLLQFDDSARAADEVTRHN 73 Query: 62 GDFENG---IPVHDTIARVVSCICPAKFHESFIN 92 G +P + ++ + P+ +F Sbjct: 74 LGTLYGVEQVPCDTQMRAILDPLKPSTLRGAFRA 107 >UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=Streptococcus RepID=A4W4J4_STRS2 Length = 440 Score = 43.9 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 49/167 (29%), Gaps = 19/167 (11%) Query: 70 VHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGK------------IHRHSY 117 + + F F N + D ++A+DG + Sbjct: 77 SQSAFVQRRYQLKHQAFKALFANITSKIPTFKDLPILAVDGSDVVLPRNRSDKTTTFQTG 136 Query: 118 DKSRRKGAIHVISAFSTMHSLVIGQIKTDKKS-NEITAIPELLNMLDIKGKIIKTDAMGC 176 IH+ + ++ + + + +E A +++ + ++ D Sbjct: 137 PHHTPYTLIHINALYNLEQEIYHDLRIQNNREVDERAAFIDMMESCPFEQALVIMDRGYE 196 Query: 177 QKDIAEKIQKQGGDYLFAVKG-NQGRLNKAFEEKFPLKELNNPKHDS 222 ++ Q++ Y+ ++ N + + F L + + Sbjct: 197 SYNVMAHCQERNWSYIIRIRDGNH-----SMKSGFNLPDTPCFDEEF 238 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 42.7 bits (99), Expect = 0.011, Method: Composition-based stats. Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 1/53 (1%) Query: 8 EHISIIPDYRQAW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLK 59 EH +PD R+ + HK DIL++ ICA+I GA+ W + +FG+ D+ + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFR 92 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 42.7 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 36/108 (33%), Gaps = 7/108 (6%) Query: 41 AEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS 100 + +E F +P L G ++ + P K E+ + + Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLL--PLLLHRLDPKKLQEALH---QVFPEA 55 Query: 101 DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKK 148 D V+ +DGK R S + + ++ + + Q + + K Sbjct: 56 DLGGVLVVDGKHLRGS--GKGKSPQVRLVEVLALHLKTTLAQARVEGK 101 >UniRef50_A7BZC9 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC9_9GAMM Length = 61 Score = 42.7 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 16/47 (34%), Gaps = 3/47 (6%) Query: 193 FAVKGNQGRLNKAFEEKFPLKELNNPKHD---SYAISEKSHGREETR 236 F K N A + F N + D +K HGR E R Sbjct: 10 FESKDNHPYRYHAIQNYFVEAFDANFEGDEIDFAETFDKGHGRLELR 56 >UniRef50_B8FDX7 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDX7_DESAA Length = 395 Score = 42.4 bits (98), Expect = 0.013, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 62/189 (32%), Gaps = 24/189 (12%) Query: 27 SDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKF 86 D + + I+ A+ +I L+ G G P T++ F Sbjct: 39 WDHFVAMLFCQIAQAKSLREICSGMACCLGKLRHLG--VKGAPKRSTLSYANQKRTWKLF 96 Query: 87 HESFIN--WMLDYHSSDDK-------DVIAIDGKIH--------RHSYDKSRRKGAIHVI 129 + F + + S K ++++D Y +++ +H++ Sbjct: 97 QDVFYDTLHLCRQAPSPGKTKFRFRNKLMSLDSSTISLCLSLFPWAEYRQTKGAVKLHLL 156 Query: 130 SAFSTMHSLVIGQIKTDKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGG 189 L + TD K++++T +L KG I+ D + + + Sbjct: 157 --LDHDGYLPVFACITDGKTHDVTMARQLALS---KGSIVVMDRGYNDYKLYAEWVEDEV 211 Query: 190 DYLFAVKGN 198 ++ +K N Sbjct: 212 YFVTRLKDN 220 >UniRef50_A6FBF2 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6FBF2_9GAMM Length = 65 Score = 42.4 bits (98), Expect = 0.016, Method: Composition-based stats. Identities = 17/44 (38%), Positives = 22/44 (50%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPK 219 + I QGGDYL AVK NQG+L K E+ F + + Sbjct: 1 MPDQNCQSIVNQGGDYLLAVKNNQGKLRKTVEKSFSHQRTTTAQ 44 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 41.6 bits (96), Expect = 0.025, Method: Composition-based stats. Identities = 32/48 (66%), Positives = 35/48 (72%) Query: 78 VSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIHRHSYDKSRRKGA 125 +SCI KFHE FIN M + HSSDD DVIAIDGK HS DKSRR+ A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TLQ7_SHEHH Length = 74 Score = 41.2 bits (95), Expect = 0.036, Method: Composition-based stats. Identities = 19/64 (29%), Positives = 27/64 (42%) Query: 7 MEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFEN 66 EH+SII R + EH DI+ L A+ S EGW DI++F + Sbjct: 4 FEHLSIIKAPRSSINHEHDPVDIMFLVNSAIASDCEGWLDIDEFDRIDDRKNAERMALIR 63 Query: 67 GIPV 70 + Sbjct: 64 RMLS 67 >UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM Length = 437 Score = 41.2 bits (95), Expect = 0.037, Method: Composition-based stats. Identities = 35/257 (13%), Positives = 69/257 (26%), Gaps = 39/257 (15%) Query: 3 LKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDI--EDFGETHPDFLKQ 60 L ++ + IP K LSD L+ + + LK Sbjct: 15 LSEIKNYFEKIPSPVVKQKDSISLSDCLMSGLAIFSLKYPSLLQFDNDKRTPVVEHNLKS 74 Query: 61 YGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS---------DDKDVIAIDGK 111 IP + + + ++ ++ + +D ++++DG Sbjct: 75 LYKIGI-IPSDTYMRERLDELPTSELRGAYTTLIRQAQRGKVLEKFTYYNDYYLVSMDGT 133 Query: 112 IHRHSYDKSRRKGA--------------IHVISAFSTMHSLVI--------GQIKTDKKS 149 + S+D + + I+ H V+ Q +K Sbjct: 134 GYFSSHDIHCDQCCEKHHRNGKITYHHQMLGIALVHPNHHHVLPLAPEPIIKQDGVEKND 193 Query: 150 NEITAIPELLNMLDIK----GKIIKTDAMGCQKDIAEKIQKQGGDYLFAVK-GNQGRLNK 204 E A LL L + II D + + ++ Y+ K + L Sbjct: 194 CERNAGKRLLTQLRKEYPKMKMIITEDGLASNGPHIKLLKSLNMSYILGAKPKDHTYLFD 253 Query: 205 AFEEKFPLKELNNPKHD 221 + K D Sbjct: 254 RIKNSSQTKFYQTQDDD 270 >UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RR82_RHORT Length = 84 Score = 40.8 bits (94), Expect = 0.045, Method: Composition-based stats. Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%) Query: 154 AIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQ 199 A E++ LD+ G++ DA+ CQ E ++ G L K NQ Sbjct: 36 ATQEMIAPLDLTGRLFTLDALHCQ-KTFEIARQAGNHLLVQAKINQ 80 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 40.8 bits (94), Expect = 0.046, Method: Composition-based stats. Identities = 16/29 (55%), Positives = 23/29 (79%) Query: 128 VISAFSTMHSLVIGQIKTDKKSNEITAIP 156 +++A +T + + IGQ+K D KSNEITAIP Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIP 29 >UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW00_RHOSR Length = 98 Score = 40.8 bits (94), Expect = 0.049, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 22/48 (45%), Gaps = 2/48 (4%) Query: 110 GKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 157 GK R + D S H+++A +V+ Q+ + NEI + + Sbjct: 18 GKTWRGAKDGSG--HLTHLLAAVDHDAGVVLRQVAVGARINEIPLLLD 63 >UniRef50_Q11ZV5 Transposase, IS4 family n=1 Tax=Polaromonas sp. JS666 RepID=Q11ZV5_POLSJ Length = 441 Score = 40.4 bits (93), Expect = 0.063, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 67/242 (27%), Gaps = 20/242 (8%) Query: 5 KLMEHISI-IPDYRQAWKVEHKLSDILLLTICAVISGAEGWED----------IEDFGET 53 LME + I D R ++H ++D+L + + G E D G Sbjct: 49 GLMEAAARCIADPRSPLLIKHGVADMLRQRVYGLALGWEDLNDHGALRDDVAMQTAVGVD 108 Query: 54 HPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSSDDKDVIAIDGKIH 113 T R + + +F SF + D + G+ Sbjct: 109 REVASAPTLCRLEKWADRATAWR-LHQVLVEQFIASFKTAPEELVLDFDATDNPLYGQQE 167 Query: 114 RHSYDKSRRKGAIHVISAFSTMHSLVIGQI--KTDKKSNEITAIPELLNMLDIK----GK 167 + + F L + D + L+ L + Sbjct: 168 GRFFHGYYDCYCYLPLYVFCGQQLLCAYLRPSRIDGAKHAGAIFKLLVTRLRQQWPQVRI 227 Query: 168 IIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISE 227 + + D+ C++ I ++ G Y+ + N + E L ++ + E Sbjct: 228 VFRGDSGFCRQRIINYCERAGVHYIVGLARNAR--LEQITEFLELSMKDDYQSSGVKQRE 285 Query: 228 KS 229 Sbjct: 286 VG 287 >UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b RepID=A6FLE0_9RHOB Length = 136 Score = 40.0 bits (92), Expect = 0.079, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 13 IPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIP--V 70 +PD R+ KV H L DI+ I + +G E D D + L D E G Sbjct: 41 LPDPREPGKVRHSLEDIIRFRIMMIAAGYEDGNDAGDLRDDPAFKLALERDPETGAALCS 100 Query: 71 HDTIARVVSCIC 82 TI+R+ + Sbjct: 101 QPTISRMENMAD 112 >UniRef50_UPI00016AFD66 hypothetical protein Bpse38_17802 n=1 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016AFD66 Length = 58 Score = 40.0 bits (92), Expect = 0.081, Method: Composition-based stats. Identities = 12/44 (27%), Positives = 19/44 (43%), Gaps = 2/44 (4%) Query: 206 FEEKFPLKELNNPKHDS--YAISEKSHGREETRLHIVCDVPDEL 247 F + +H + +K+HGR ETR+ V + D L Sbjct: 1 MRRWFAEARQDQLEHSYWEHVEHDKAHGRLETRICRVGEDVDWL 44 >UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID=Q877V8_PSEPK Length = 433 Score = 39.7 bits (91), Expect = 0.089, Method: Composition-based stats. Identities = 29/225 (12%), Positives = 60/225 (26%), Gaps = 11/225 (4%) Query: 14 PDYRQAWKVEHKL-SDILLLTICAVISGAEGWEDIEDFGETHPDFLKQYGDFENGIPVHD 72 ++RQ L S I+ L + + P L D + Sbjct: 20 EEHRQRQYSRELLFSTIIKLMSLVSLGLKPSLHAAARQLDDLPVSLAALYDKISR--TEP 77 Query: 73 TIARVVSCICPAKFHESFINWMLDYHSSD------DKDVIAIDGKIHRHSYDKSRRKGAI 126 + R + C + + D D +A K + Sbjct: 78 ALLRALVTGCAQRLAPTIHELGCSAMLPDWQVRVVDGSHLASTEKRLGALRQERGAARPG 137 Query: 127 HVISAFSTMHSLVIGQIKT-DKKSNEITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQ 185 + + VI D ++E + LL ++ D + C + E + Sbjct: 138 FSVVVYDPDLDQVIDLQPCEDAYASERVCVLPLLAEAK-TNQVWIADRLYCTLPVMEACE 196 Query: 186 KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPKHDSYAISEKSH 230 + ++ + RL + E + P+ + + H Sbjct: 197 QVKTSFVIRQQAKHPRLIQEGEWQAPMPVATGTVREQSIEVKGGH 241 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.134 0.355 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,396,748,805 Number of Sequences: 3077464 Number of extensions: 50104978 Number of successful extensions: 167504 Number of sequences better than 1.0e-01: 210 Number of HSP's better than 0.1 without gapping: 449 Number of HSP's successfully gapped in prelim test: 86 Number of HSP's that attempted gapping in prelim test: 166225 Number of HSP's gapped (non-prelim): 570 length of query: 253 length of database: 1,040,396,356 effective HSP length: 126 effective length of query: 127 effective length of database: 652,635,892 effective search space: 82884758284 effective search space used: 82884758284 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 91 (39.7 bits)