BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (370 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P11901 Transposase for insertion sequence element IS421... 748 0.0 UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium... 259 1e-67 UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001... 210 6e-53 UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfati... 178 2e-43 UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0... 152 1e-35 UniRef50_Q1Q5J6 Putative uncharacterized protein n=6 Tax=Candida... 134 6e-30 UniRef50_A6DTQ2 Putative transposase insL for insertion sequence... 131 3e-29 UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DS... 129 2e-28 UniRef50_C4ZTT5 Predicted divalent heavy-metal cations transport... 106 1e-21 UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW 78 4e-13 UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosi... 71 6e-11 UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliph... 69 3e-10 UniRef50_P12249 Transposase for insertion sequence element IS231... 64 9e-09 UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=... 60 1e-07 UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostrid... 58 5e-07 UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candida... 55 5e-06 UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae Rep... 54 1e-05 UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coproco... 53 2e-05 UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=... 53 2e-05 UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geoba... 52 2e-05 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 52 3e-05 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 52 3e-05 UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=... 52 3e-05 UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Ta... 52 4e-05 UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium m... 50 9e-05 UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipe... 50 1e-04 UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 Rep... 50 1e-04 UniRef50_A9DNS7 Transposase n=1 Tax=Shewanella benthica KT99 Rep... 50 1e-04 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 49 2e-04 UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3... 49 2e-04 UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID... 49 3e-04 UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastop... 48 5e-04 UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium... 48 6e-04 UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_... 47 0.001 UniRef50_C3BTW8 Transposase for insertion sequence element IS231... 47 0.001 UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfoto... 47 0.001 UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostri... 46 0.003 UniRef50_Q9X6I5 Putative uncharacterized protein n=2 Tax=Bacillu... 46 0.003 UniRef50_C5EN31 Putative uncharacterized protein n=1 Tax=Clostri... 45 0.004 UniRef50_Q3M8C5 Transposase, IS4 n=15 Tax=Cyanobacteria RepID=Q3... 45 0.005 UniRef50_C3FBK7 Transposase for insertion sequence element IS231... 45 0.006 UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostrid... 45 0.006 UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobac... 45 0.006 UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipe... 44 0.007 UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanoth... 44 0.008 UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula mar... 44 0.008 UniRef50_Q1PWW4 Putative uncharacterized protein n=2 Tax=Candida... 44 0.009 UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacter... 44 0.011 UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneo... 44 0.012 UniRef50_C9R546 Transposase (IS4 family) protein n=1 Tax=Aggrega... 44 0.013 UniRef50_Q05309 Transposase for insertion sequence element IS115... 43 0.015 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 43 0.015 UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM ... 43 0.016 UniRef50_UPI00016C3BAC transposase n=1 Tax=Gemmata obscuriglobus... 43 0.021 UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae R... 43 0.021 UniRef50_B2IXJ5 Putative uncharacterized protein n=1 Tax=Nostoc ... 43 0.021 UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacil... 43 0.024 UniRef50_Q55566 Putative transposase for insertion sequence elem... 42 0.025 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 42 0.027 UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitroco... 42 0.036 UniRef50_D1XZ52 Transposase, IS4 family n=1 Tax=Prevotella bivia... 42 0.047 UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID... 42 0.049 UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus a... 41 0.054 UniRef50_Q093Y3 Isrso13-transposase protein n=7 Tax=Stigmatella ... 41 0.058 UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4... 41 0.079 UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanob... 41 0.083 UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungat... 41 0.083 >UniRef50_P11901 Transposase for insertion sequence element IS421 n=41 Tax=cellular organisms RepID=T421_ECOLX Length = 371 Score = 748 bits (1931), Expect = 0.0, Method: Compositional matrix adjust. Identities = 370/371 (99%), Positives = 370/371 (99%), Gaps = 1/371 (0%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LRE 59 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS LRE Sbjct: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSSLRE 60 Query: 60 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI 119 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI Sbjct: 61 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI 120 Query: 120 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC 179 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC Sbjct: 121 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC 180 Query: 180 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 239 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG Sbjct: 181 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 240 Query: 240 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE Sbjct: 241 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 300 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF Sbjct: 301 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 360 Query: 360 PPRSAGSEKKN 370 PPRSAGSEKKN Sbjct: 361 PPRSAGSEKKN 371 >UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium RepID=C6AUF2_RHILS Length = 372 Score = 259 bits (662), Expect = 1e-67, Method: Compositional matrix adjust. Identities = 156/364 (42%), Positives = 208/364 (57%), Gaps = 6/364 (1%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 D+W + + +L+ +AR GA TR REI++A TLLRL LAYG GMSLRE AWA+ Sbjct: 9 DHWPEVRERLPAGFDLEATARLRGAFTRVREIKNAETLLRLALAYGGLGMSLRETCAWAE 68 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAV-TGCTSGKRLRLVDGTAISAPGG 124 +A LSD +LL+RL AA W G + A +A +A V TG +G RLR++DGT+I PG Sbjct: 69 AGGIARLSDPSLLERLCKAAPWLGDIVAALIAEQAKVPTGRFAGYRLRVLDGTSICHPGA 128 Query: 125 GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLA 184 WRLH+GYD T Q ELTD AE L R +I +ADR + +RP +R + Sbjct: 129 DRTTWRLHVGYDLATAQVDQLELTDIHGAENLQRLTYAPGDIVLADRYY-ARPRDLRPVI 187 Query: 185 FGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMI--GNSGNKKAGAPF 242 AD+IVR W LR L G FD+ L + GE V + G +G Sbjct: 188 DAGADFIVRTGWNSLRLLQTNGEPFDLFAAL-AAQQEQEGEVQVRVHEGMTGTPPPPP-L 245 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 RLI P++A + RLL + R++G+ +LEAA ++LLLTSLP + + Sbjct: 246 VLRLIVRRKDPQQAQAEQERLLKDARKRGKKPDPRSLEAAKYILLLTSLPTATFPPADIL 305 Query: 303 DCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPR 362 YR RWQIELAFKR KSL LD+L AK+PELA+AW++A L+ A + + I D PP Sbjct: 306 TLYRFRWQIELAFKRFKSLAGLDSLPAKKPELARAWLYARLIVAIIAEQIAGQVPDSPPS 365 Query: 363 SAGS 366 G+ Sbjct: 366 GCGN 369 >UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001905F7C Length = 367 Score = 210 bits (535), Expect = 6e-53, Method: Compositional matrix adjust. Identities = 143/356 (40%), Positives = 205/356 (57%), Gaps = 12/356 (3%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL 66 +W ++ +G EEL+ SAR AGAL R+R++ AA LLRL AY GG SLR + AWA Sbjct: 9 DWGELVERLGSAEELEASAREAGALLRKRQVGGAADLLRLCFAYVLGGFSLRTLAAWADQ 68 Query: 67 HDVATLSDVALLKRLRNAADWFGILAAQTLAVRA--AVTGCTSGKRLRLVDGTAISAPGG 124 +A++SDVA+LKRL+ +ADW G L ++ LA R A G S RL VD T ++ PG Sbjct: 69 RGLASMSDVAMLKRLKASADWVGYLVSELLAERCPEAFAGVHSDLRLMAVDATVVAPPGP 128 Query: 125 GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLA 184 W +H +D + + E+TD R+AERL R + A E+RIADR + + Sbjct: 129 KRDYWMVHTVFDLSRLKLSSVEVTDRREAERLSRGVK-AGELRIADRAHAKATDLAAVVK 187 Query: 185 FGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG--ETTVMIGNSGNKKAGAPF 242 G AD++VR R L +G + + R + G G + +V I + +K A Sbjct: 188 AG-ADFLVRAPSNYPRLLDGDGQLLERLALCR--EAGDKGVLDRSVRIQDGKSKVEVA-- 242 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 AR++ + LPPE A ++ + +E AG+++LLTSL D++ E++A Sbjct: 243 -ARVVILPLPPEAAAKARRAARRLAAKARYKPSEAGIEMAGYLVLLTSLNADDWPPERLA 301 Query: 303 DCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 YRLRWQIELAFKR+KSL+ L+ LRAK+ +LA+ WI LLAA L +D + P+LD Sbjct: 302 STYRLRWQIELAFKRMKSLIGLEGLRAKDADLARLWINIALLAALLAEDDL-PALD 356 >UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F976_DESAA Length = 371 Score = 178 bits (452), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 121/360 (33%), Positives = 184/360 (51%), Gaps = 15/360 (4%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 D+W AIL + P + A+ GAL+RRR LLR+ L + G SLR +A ++ Sbjct: 11 DDWQAILTFL--PHGWEEKAKELGALSRRRNFDGPEALLRVLLIHLVQGCSLRVTSALSK 68 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAV---RAAVTGCTSGKRLRLVDGTAISAP 122 +A+ SDVALLKRL+ + +W +A + + + G+ +R+VDG+ +S P Sbjct: 69 AGGLASASDVALLKRLKASGEWMRWMAVELMKQWFGKQPEKILGMGRTVRVVDGSTVSEP 128 Query: 123 GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS 182 G W++H + Q + +TD + E L F ++ +ADRG+ R + Sbjct: 129 GSTGTTWKIHYSIQLPSLQCDEVYVTDPKTGEDLKNFNVHPGDVFLADRGYYHRTGMLHV 188 Query: 183 LAFGEADYIVR-VHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAP 241 + G D IVR +H L G F ++ LR L + G+ I + +G Sbjct: 189 VK-GGGDLIVRMIHQYKL--YDINGQEFGLIKNLRSLTVNQIGDWDAFIHHKKEVISG-- 243 Query: 242 FPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQV 301 R+ A+ E A +K +L EN +KG + ETL AA +V + T+L E+ A QV Sbjct: 244 ---RVCAIKKSKEAAEKAKRAILRENSKKGHKTKPETLVAAEYVFVFTTLSR-EWKASQV 299 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 + YR RWQ+ELAFKRLKSL+ L L+ + E AKAW+ + AAFL++ +I F P Sbjct: 300 LEAYRGRWQVELAFKRLKSLIGLGHLKKTDFEGAKAWLHGKIFAAFLVEAMIAACDSFSP 359 >UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0AF19_9BACT Length = 362 Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 110/347 (31%), Positives = 166/347 (47%), Gaps = 12/347 (3%) Query: 18 PEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVAL 77 PE + +AR GA + + IR A LLRL L + G+SLR A + +SDVAL Sbjct: 18 PEGWEVAAREQGAFKQAKGIRTAEELLRLILMHAGSGLSLRHAVARGAAAGLPEVSDVAL 77 Query: 78 LKRLRNAADWFGILAAQTLAVRAAV---TGCTSGKRLRLVDGTAISAPGGGSAEWRLHMG 134 LKRLRNA W ++ + L +A + G VD T I G +WRLH Sbjct: 78 LKRLRNAEGWLRWMSVRLLEQQAGQPRWSRLPEGWTAVAVDSTTIEESGASGTDWRLHYA 137 Query: 135 YDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRV 194 + ELTD++ E L R+ ++ + DR F P+ IR + + ++R Sbjct: 138 IGLPSLFCEQAELTDNKGGESLCRYKVRKGDLFLGDRNFCRAPQ-IRHVMDHQGAVLLRW 196 Query: 195 HWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPE 254 H L +G D+ +L L + E V + K G RL A+ + P+ Sbjct: 197 HSTSLPLFDQQGHALDVPAWLAQLRSRQCSELPVFL------KDGTAL--RLCALRVSPQ 248 Query: 255 KALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELA 314 A + ++ ++ GR + L A +++++TSLP + + YRLRWQIELA Sbjct: 249 AAQRERAKIRLSAKKNGRKPSCQCLCMADYIVVVTSLPSSCLDSRGILQLYRLRWQIELA 308 Query: 315 FKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 FKRLKSLL+ + K+P +++W+ A LL LI+ + S F P Sbjct: 309 FKRLKSLLNTGHVPKKDPLSSRSWLQAKLLTCLLIEKSLLQSEVFSP 355 >UniRef50_Q1Q5J6 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q5J6_9BACT Length = 367 Score = 134 bits (337), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 101/348 (29%), Positives = 161/348 (46%), Gaps = 8/348 (2%) Query: 18 PEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVAL 77 P + + A + AL R+ + LLR L + G SLRE A+ ++A LSDVAL Sbjct: 14 PNDWKSLAVDTNALKGLRKDKSEEKLLRTLLIHLGCGYSLRETVVRAKRANLADLSDVAL 73 Query: 78 LKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDP 137 LKRL+ + +W L R + LRL D T + PG + WR+H + Sbjct: 74 LKRLKKSKEWLYKLCLSLFRERGLQINKRNNFHLRLFDATTVKEPGKTGSLWRIHYSIEV 133 Query: 138 HTCQFTDFELTDSRDAERLDRFAQ---TADEIRIADRGFGSRPECIRSLAFGEADYIVRV 194 + F+LT + + F Q D+ IADRG+ + + I A VRV Sbjct: 134 PSLSCDFFKLTGTEGEGTGESFRQFPMKKDDYIIADRGYCT-GQGIHHATRKGAYLSVRV 192 Query: 195 HWRGLRWLTAEGMRFDMMGFLRGLDCGKNGET-TVMIGNSGNKKAGAPFPARLIAVSLPP 253 + + LR E F ++ ++ L ++ V I N N + L + Sbjct: 193 NSQSLRIFGEEKKPFPLLKEIQYLKRPLAIKSWNVFIPNVDNTEY---VKGSLCIIRKTE 249 Query: 254 EKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIEL 313 E I+ +L +KG ++ ETL A +V++ T+ PE++++A + + YR+RWQIEL Sbjct: 250 EAIKIAHKKLKRHASKKGIELKPETLIYAKYVIVFTTFPENQFTAFDILEWYRVRWQIEL 309 Query: 314 AFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 FKR K + L + + +KAW++ L A L + +I + F P Sbjct: 310 VFKRFKQIAQFGHLPKYDDDSSKAWLYGKLFVALLTEKLIDFATSFSP 357 >UniRef50_A6DTQ2 Putative transposase insL for insertion sequence IS186 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTQ2_9BACT Length = 375 Score = 131 bits (330), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 109/329 (33%), Positives = 161/329 (48%), Gaps = 17/329 (5%) Query: 41 ATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRA 100 + LLR L + G +SLR A A+ ++ +SDVALLKRL+ +++WF Q L Sbjct: 45 SKLLRTLLIHLGGNLSLRSTCALAKEGNIIDVSDVALLKRLQKSSEWFNWCTTQLLDKMK 104 Query: 101 AVT--GCTSGK--RLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERL 156 G + R VDG+ + PG + W LH + T + +TD + E L Sbjct: 105 PKNPQGLPEQEEYNFRYVDGSIVREPGATGSTWMLHYSMNAKTLAPDEITITDQKKGESL 164 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE-GMRFDMMGFL 215 ++ +++ I DR + R I + G YI+ L L + G F ++ L Sbjct: 165 KNYSVKPNDVFIGDRVYPRRNGIIHVHSNG--GYILCRFPPSLTPLHNDNGTPFKLLSKL 222 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKT---RLLSENRRKGR 272 R L G GE V+I K AR+ A+ E L ++ R S+N RKG Sbjct: 223 RKLKLGDIGEYNVVI-----KHNEGQINARVCAMKKDHESTLKAQKAIHRKASKNSRKGS 277 Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 + ETLE AG++L+LT+L E S E++ + YR RWQIEL FKRLKS++ L K Sbjct: 278 T-RPETLEYAGYILILTTLAE-SVSPEKILNIYRSRWQIELLFKRLKSIIGAAPLYKKND 335 Query: 333 ELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 ++W+ +L A LI+ II+ DF P Sbjct: 336 IGMRSWLAGKILVATLIEYIIRCGEDFFP 364 >UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017465B5 Length = 382 Score = 129 bits (323), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 108/352 (30%), Positives = 168/352 (47%), Gaps = 8/352 (2%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 +NW +L+ + PE ++ A+ GA+ R R ++LLR+ L + G SLR + + Sbjct: 7 ENWDYLLSLL--PENWESLAKTTGAVQRLRGAESLSSLLRVLLLHAGHGCSLRTASVVGK 64 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLA-VRAAVTGCTSGKRLRLVDGTAISAPGG 124 ++SDVAL KR W L A A R + G +LRLVDGT I PG Sbjct: 65 AAGWISMSDVALHKRFALCEGWLQQLCAGLFAQSRLQLPAAYRGLKLRLVDGTTIKEPGA 124 Query: 125 GSAEWRLHMGY---DPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIR 181 ++WR+H D H F + S + E L F + +ADRGF S I Sbjct: 125 TGSQWRIHYSLRVPDWHCDFFRLNPVRGSGNGESLKHFEVAPGDCFLADRGF-SHLLGIE 183 Query: 182 SLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL-DCGKNGETTVMIGNSGNKKAGA 240 + G A I+R++ + +G ++ +LR L G + + Sbjct: 184 HVYRGGAHVIMRLNEQNTPLEDEQGRPVVLLPWLRKLKQPGAAAGLDLWVRPRKEDSLEK 243 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 P RL AV E A +++ ++ ++ ++A TLE +++LT++P D S + Sbjct: 244 RVPVRLCAVRKSVEAAALAQRKVQRRAQQDQTKLRAATLEHTAWIVVLTTVPRDTLSDVE 303 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 V YR+RWQIELAFKRLKSL + L + ++AW++A LL A L + + Sbjct: 304 VLQWYRVRWQIELAFKRLKSLGDVGHLPKSDERSSRAWVYAKLLIALLSEKM 355 >UniRef50_C4ZTT5 Predicted divalent heavy-metal cations transporter n=1 Tax=Escherichia coli BW2952 RepID=C4ZTT5_ECOBW Length = 60 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 52/54 (96%), Positives = 52/54 (96%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGG 54 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY P G Sbjct: 3 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYRPRG 56 >UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW Length = 417 Score = 78.2 bits (191), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 93/384 (24%), Positives = 160/384 (41%), Gaps = 46/384 (11%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM---SL 57 M W + + PE L A+ G + R+R + A L L A+G G + SL Sbjct: 1 MENQMQAWMKTIRQLFSPETLTHLAQETGFIQRKRAL-TAEAFLTL-CAWGDGSLAQQSL 58 Query: 58 REVTAWAQLHDVATLSDVALLKRLRNAAD------WFGILAAQTLAVRAAV-TGCTSGKR 110 + + L +LS L +R A +F +L Q + + + T T R Sbjct: 59 QRLCTSLTLRHDCSLSSEGLNQRFTERAVAFLREVFFLLLQRQPPLLWSTIQTYRTCFTR 118 Query: 111 LRLVDGTAISAPGGGSAEWR--------LHMGYDPHTCQFTDFELTDSRDAERLDRFAQT 162 LR++D T+ P ++R + YD + + D++ RFA Sbjct: 119 LRILDSTSFLVPADYGEDYRGSVSSGAKIQFEYDLLSGACLQLCAQSANDSD--ARFAYH 176 Query: 163 ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL---- 218 A + + CIR L F + + RG ++T +R DM +++ Sbjct: 177 AQHTILPN------DLCIRDLGFFSVAALTEIDARGAYYITR--LRSDMKVYIKENSQWK 228 Query: 219 --------DCGKNGETTVMIG-NSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 + K GE+ M G+++ P RLI L E+ + +R Sbjct: 229 EWDWESLGNQLKEGESVEMEHVYIGHERLYIP---RLIFRRLTEEEWQKRMAYVRKREKR 285 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 KG+ + +TLE + +LLT+LP++ + +QV + Y LRWQIEL FK KS+ L+ ++ Sbjct: 286 KGKALTRQTLEQKKYHILLTNLPQESFDGQQVYELYSLRWQIELLFKAWKSVFDLEKVKK 345 Query: 330 KEPELAKAWIFANLLAAFLIDDII 353 + E + ++ L+A + + Sbjct: 346 MKKERFECHVYGTLIAILVTQTFL 369 >UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AZS8_HERA2 Length = 442 Score = 71.2 bits (173), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 63/228 (27%), Positives = 103/228 (45%), Gaps = 11/228 (4%) Query: 120 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERL---DRFAQTADEIRIADRGFGSR 176 +A G+A + + D T +LTD R ++++ R A +R+AD GF + Sbjct: 140 NASARGTAGLKCGVQLDLLTGTLCGIDLTDGRASDQVLSVQRAPLPAGSLRLADLGFYN- 198 Query: 177 PECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK 236 R LA E ++ RV L + + ++ + GL + E TV++G+ Sbjct: 199 IRIFRELAAAEVYWLSRVQSHSRIRLPGQKEQ-SILEVVTGLGDADHWEGTVLVGSKER- 256 Query: 237 KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY 296 ARL+ +P A + R+ E K R V ++ A +++T+ PED+ Sbjct: 257 -----LAARLLVQRVPDAVAAQRRQRVQDEAHDKCRPVSNAAMDLAAWTVVITNAPEDKL 311 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 + ++RWQIEL FK KS H+D R K+P I+A LL Sbjct: 312 GLTEAMVLLKMRWQIELLFKLWKSHGHVDEWRTKKPARILCEIYAKLL 359 >UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TN04_ALKMQ Length = 454 Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 64/257 (24%), Positives = 119/257 (46%), Gaps = 29/257 (11%) Query: 109 KRLRLVDGTAISAP----------GGGSAEWRLHMG--YDPHTCQFTDFELTDS--RDAE 154 K +++ D T I+ P GG +A+ L + Y +F+ E+T + D Sbjct: 117 KDVKICDSTKITLPDKLVALYPGLGGRNAKSSLKVQGIYSLIPARFSSLEITKAPGADTT 176 Query: 155 RLDRFAQTAD--EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGM--RFD 210 D+ + E+ I D G+ S+ L+ + Y+ R+ + ++ G + D Sbjct: 177 YNDKLLAMVNPGELLITDLGYFSKA-FFEKLSTKGSYYLTRIKKNSIVYVEKSGQLTKVD 235 Query: 211 MMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR- 269 + L+G +T V +G + K+ R +A+ LP EK + + R ++ + Sbjct: 236 LTDLLKGTVV----DTEVFLGIAHKKQ----LKCRFVAIRLP-EKVVNQRRRKANQQAKA 286 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 +G+ + A+ E +++T++ +D+ S E D YR RWQIEL FK LKS L++D + + Sbjct: 287 QGKQLSAKETELLAWNIIVTNVTKDKLSPEAACDLYRARWQIELVFKSLKSYLNIDKIGS 346 Query: 330 KEPELAKAWIFANLLAA 346 + I+ L+A Sbjct: 347 CGKYQLECLIYGRLIAV 363 >UniRef50_P12249 Transposase for insertion sequence element IS231A n=411 Tax=Bacillus RepID=T231A_BACTB Length = 478 Score = 63.9 bits (154), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 82/379 (21%), Positives = 153/379 (40%), Gaps = 63/379 (16%) Query: 18 PEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVA--TLSDV 75 P L+ A+ G + R+R+ + L + + S V +QLH +S Sbjct: 22 PSFLEELAKKLGFVKRKRKF-SGSELATICIWISQRTASDSLVRLCSQLHAATGTLMSPE 80 Query: 76 ALLKRL-RNAADW----FGILAAQTLAVRAAV--TGCTSGKRLRLVDGTAISAP------ 122 L KR + A ++ F IL L +A+ T T +R+R++D T P Sbjct: 81 GLNKRFDKKAVEFLKYIFSILWKGKLCKTSAISSTALTHFQRIRILDATIFQIPKHLASI 140 Query: 123 ---GGGSAE---WRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSR 176 GG A+ ++ + YD H+ QF +F++ ++ ++ + D +R D Sbjct: 141 YPGSGGCAQTAGIKIQLEYDLHSGQFLNFQVGPGKNNDKTFG-TECLDTLRPGDL----- 194 Query: 177 PECIRSLAFGEADYIVRVHWRGLRWLT--------------------------AEGMRFD 210 CIR L + + + ++ RG +++ ++ ++ D Sbjct: 195 --CIRDLGYFSLEDLDQMDQRGAYYISRLKLNHTVYIKNPSPEYFRNGTVKKQSQYIQVD 252 Query: 211 MMGFLRGLDCGKNGETT-VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 + + L G+ E IG N+K R+I L ++ + + + Sbjct: 253 LEHIMNHLKPGQTYEIKEAYIGK--NQK----LFTRVIIYRLTEKQIQERRKKQAYTESK 306 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 KG ++ G + +++ PE EQ+ D Y LRWQIE+ FK KSL + + Sbjct: 307 KGITFSEKSKRLTGINIYVSNTPEGIVPMEQIHDFYSLRWQIEIIFKTWKSLFQIHHWQN 366 Query: 330 KEPELAKAWIFANLLAAFL 348 + E + ++ L+A F+ Sbjct: 367 IKQERLECHVYGRLIAIFI 385 >UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=Q73IB8_WOLPM Length = 442 Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 69/303 (22%), Positives = 123/303 (40%), Gaps = 43/303 (14%) Query: 75 VALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWR---- 130 V +KR+ N + +L L V + ++L+D + I+ P ++ Sbjct: 84 VEFMKRMYNES---VLLFKNILQVDCKILQ--QFNSVKLLDSSYITLPNSMEEMYKGYGT 138 Query: 131 LHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRI-----ADRGFGSRPECIRS--L 183 + GY+ +T +L D Q D++ + +D+G+ I S L Sbjct: 139 SYSGYESNTKSGIKLQLV-------FDYMNQIIDQLNLTEGVRSDQGYRKHLSNILSNDL 191 Query: 184 AFGEADYIVRVHWRGLRWLTAEGMR--------FDM-----MGFLRGLDCGKNGETTVMI 230 + Y V ++ + + A + +D+ M L L+ E V++ Sbjct: 192 LISDLGYFVPSSFKQINEIGAYFISRYKSDTNIYDVETNQKMELLECLEDKLFLENEVLL 251 Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 G A R+I L E+++ + + R +G + + +T+ Sbjct: 252 GKE------AKIRVRIICQKLTEEQSMARRRKANRLARSQGYTSSKRNQKLLNWSIFITN 305 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 +PE++ SAEQV YR+RWQIEL FK KS + LD L+ K P ++A L A + Sbjct: 306 VPENKISAEQVLTIYRVRWQIELLFKLYKSHIRLDKLKGK-PCRVLCELYAKLCAILIFH 364 Query: 351 DII 353 I+ Sbjct: 365 GIV 367 >UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1E5_CLOB8 Length = 460 Score = 58.2 bits (139), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 50/240 (20%), Positives = 104/240 (43%), Gaps = 15/240 (6%) Query: 121 APGGGSAEWRLHMGYDPHTCQFTDFELTD--SRDAERLDRFAQ--TADEIRIADRGFGSR 176 + ++E ++ Y + Q FE D + D + A +EI + D G+ + Sbjct: 147 SEDKSASEMKIQTVYSFKSKQIETFEFEDGTTNDNSYMKTLADKINTNEILLVDLGYFDK 206 Query: 177 PECIRSLAFGEADYIVRVHWRGL----RWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGN 232 +C + L A ++ ++ + + + +M+ FL+ +T + +G Sbjct: 207 -KCFKMLEKKSAFFLSKIKYNTALYKENYKKGNFEKVEMIDFLKK--SSGVIDTYLYVGM 263 Query: 233 SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLP 292 N + R+I LP E + R + + +GR + E V+++T++ Sbjct: 264 KQNNRE----EFRVIGKRLPEEIVNLRIRRAREKAKAQGRAPKKIDKELMSWVIMITNIE 319 Query: 293 EDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 +++ + + D YRLRWQIEL FK KS +D +++ + ++ L+ LI+ + Sbjct: 320 KEQADVDMLLDIYRLRWQIELLFKCWKSYGKIDHVKSAGIDYLNCLLYGRLIITLLINTV 379 >UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PXV1_9BACT Length = 449 Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 16/117 (13%) Query: 244 ARLIAVSLPPEKALISKTRLLSENRRK--------GRVVQAETLEAAGHVLLLTSLPEDE 295 +RLIA P +++E RRK G+ + E LE + +T++ + Sbjct: 260 SRLIAYRAPGH--------VINERRRKAKRAVQKSGKTLSREYLEWLDYSFYITNVGAEI 311 Query: 296 YSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 +S E V YR+RWQIEL FK+ K L +D +R E + ++ L+ ++ I Sbjct: 312 WSPEVVGTIYRIRWQIELVFKQWKQLFRMDVMRGTREERIRCLLYGRLIMICIVTRI 368 >UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae RepID=B0R9A9_HALS3 Length = 424 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 25/72 (34%), Positives = 41/72 (56%) Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 S + ++ RVV +A + L +T+LP DE+ E +A YR RW++E F+ LK+ L Sbjct: 260 SLDTKRFRVVGVRDSDADDYHLYITNLPRDEFFPEDLATLYRCRWEVETLFRELKTQYEL 319 Query: 325 DALRAKEPELAK 336 D +P++ K Sbjct: 320 DEFNTSDPDVVK 331 >UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coprococcus comes ATCC 27758 RepID=C0BDH6_9FIRM Length = 204 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 24/68 (35%), Positives = 40/68 (58%) Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 + + + ++T+LP DE+ E++ Y RW IE +F++LK + L A +PE K Sbjct: 136 ISTSTYECIVTNLPRDEFPVERIKTLYNARWSIESSFRKLKYTIGLSNFHAYKPEYVKQE 195 Query: 339 IFANLLAA 346 I+A LLA+ Sbjct: 196 IWARLLAS 203 >UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=Q648P8_9ARCH Length = 464 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 27/82 (32%), Positives = 48/82 (58%), Gaps = 3/82 (3%) Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 V+ T L E++ E++ + Y RW IE+ F+ +K++L +D LR K P++ I+ +LL Sbjct: 302 VITTTLLDPKEFTREEIDELYAKRWLIEVDFRFIKTVLQMDILRCKTPDMVCKEIWVHLL 361 Query: 345 AAFLIDDIIQPS---LDFPPRS 363 A LI ++ + + PPR+ Sbjct: 362 AYNLIRTVMAQAAHRYNLPPRT 383 >UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geobacillus kaustophilus RepID=Q5L3A2_GEOKA Length = 453 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 88/367 (23%), Positives = 149/367 (40%), Gaps = 51/367 (13%) Query: 19 EELDTSARNAGALTRRREIR--DAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVA 76 EEL+ AR+ + R+ ++R D L L G G SL ++ + L +LS Sbjct: 25 EELEHMARDHQFIQRKGKLRAHDFVALCTF-LQEGGGQKSLVQLCSALALKQNTSLSAEG 83 Query: 77 LLKRLRNAADWF------GILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP-------- 122 L +R A F +L QT R + R+R++D T+ P Sbjct: 84 LNQRFHEKAVSFLKAVFEKLLIHQTQEARRLCPRHSLFLRIRILDSTSFQLPPEIQGIYE 143 Query: 123 GGGSAEWRLHMGYDPHTCQFTDFELTDSR--DAERLDRFAQTADE--IRIADRGFGSRPE 178 G ++ + Y+ + ++ D+R DA T E + + D G+ S E Sbjct: 144 GCTGPGVKIQLEYEWLEGKVLHVDVEDARHHDAAYGASLLSTIQEGDLCLKDLGYFSL-E 202 Query: 179 CIRSLAFGEADYIVRV-HWRGLRWLTAEGMRF---DMMGFLRGLDCGKNGETTVMIGNSG 234 ++++ A YI R+ H G+ EG RF + FL L G+ E + Sbjct: 203 GLQAIHDAGAFYISRLKHNVGI--YQKEGDRFRKWEPEDFLAVLQPGETME--LEHAYVS 258 Query: 235 NKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV--------L 286 KK P RLI L E+ E +++G+ Q + A +V + Sbjct: 259 GKKVHQP---RLIVYRLTEEQ----------ERQKEGQWKQKAKQKGAAYVTRRPHPIYV 305 Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 +T++P S ++ Y LRWQIE+ FK KSL H+ + + + ++ L+A Sbjct: 306 YITNIPAIYTSLHEIHTLYSLRWQIEVVFKTWKSLFHIHRFKPMKGARFQCHLYGTLIAL 365 Query: 347 FLIDDII 353 + ++ Sbjct: 366 LISSTVM 372 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 28/71 (39%), Positives = 39/71 (54%) Query: 269 RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 R R+V E + L LT+L D+YSA +A YR RW++EL FK LKS LD + Sbjct: 276 RTFRLVGLRNEETEEYHLYLTNLGNDDYSAPDIAQLYRARWEVELLFKELKSRFGLDEIN 335 Query: 329 AKEPELAKAWI 339 + + +A I Sbjct: 336 TTDAYIIEALI 346 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 1/83 (1%) Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 RVV+ + E + ++T+L +E+S E + + Y +RW E +F+ LK + L+AL AK+ Sbjct: 245 RVVRLKITENT-YETVITNLSRNEFSMEDICEIYNMRWGEETSFRELKYAIGLNALHAKK 303 Query: 332 PELAKAWIFANLLAAFLIDDIIQ 354 EL + I+A +L I+Q Sbjct: 304 RELIQQEIYARMLMYNFCQRIVQ 326 >UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=Gammaproteobacteria RepID=Q7MLW1_VIBVY Length = 445 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 31/79 (39%), Positives = 47/79 (59%), Gaps = 3/79 (3%) Query: 274 VQAETLEAAG-HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAK 330 V+A T + G H + TSLP +EY AE VA+ Y RW+I+L ++ +KS + +A LR+K Sbjct: 279 VRAVTYQVQGKHKTVFTSLPREEYDAESVAELYHERWEIKLGYRDIKSSMQHNALVLRSK 338 Query: 331 EPELAKAWIFANLLAAFLI 349 EL ++ LL L+ Sbjct: 339 TVELVYQELWGLLLGYNLV 357 >UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Tax=Gammaproteobacteria RepID=A6UXI0_PSEA7 Length = 423 Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 65/272 (23%), Positives = 106/272 (38%), Gaps = 55/272 (20%) Query: 108 GKRLRLVDGTAISAP--GGGSAEWRLHMGYDPHTCQFTDFELTD--------------SR 151 G R+ VDG+ + P + + H G+ P T +E+ D R Sbjct: 110 GLRVLAVDGSTVHLPLESTMATFFGSHSGF-PMARLSTLYEVADGQTLHSLIVPLTVGER 168 Query: 152 DAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDM 211 D L AD + + DRG+ HW L L A+ R Sbjct: 169 DCAHLHLEHLPADSLTLFDRGYPG-------------------HW--LFALFAQQQR--- 204 Query: 212 MGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKG 271 FL L CG N + + +G +L + P + S+ + ++ + Sbjct: 205 -HFLMRLPCGYNAQVKAFL------HSGQVEDTQLFVANHPEARLFCSEAGVDPASQIEL 257 Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD-----A 326 R+++ E VLL + L + + AE A+ Y RW IE F+RLK L LD + Sbjct: 258 RLIRVELANGESEVLLTSLLDREAFPAEVFAELYHRRWGIETDFRRLKQTLTLDNFSGRS 317 Query: 327 LRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 + A + + A + NL A L+ ++QP ++ Sbjct: 318 VTAVKQDFHAAQLLKNL--ALLMQHLLQPVIE 347 >UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TD95_HELMI Length = 441 Score = 50.4 bits (119), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 24/56 (42%), Positives = 33/56 (58%) Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDAL 327 RVV E + +T+LP + + AE +A YR RW IEL FK LKS HL+++ Sbjct: 282 RVVGILNEETKDYHFYITNLPAERFPAEDIATLYRARWTIELLFKELKSYYHLESI 337 >UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipelotrichaceae RepID=B7C7E2_9FIRM Length = 446 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 77/337 (22%), Positives = 132/337 (39%), Gaps = 41/337 (12%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 TR+R+ +TL+ + L G G + + D T+S + R + D F I Sbjct: 47 FTRKRKHLFGSTLMNVLLLEG-GSLKDELYKLFGYNLDTPTVSSF-IQARDKIKPDTFHI 104 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDG-------------TAISAPGGGS---AEWRLHMG 134 L R +G RL VDG T I + + L+ Sbjct: 105 LF-NLFNGRTRKPKLYNGYRLLAVDGSTLPITSEIKDKKTTIQKANNSDKPFSAFHLNTS 163 Query: 135 YDPHTCQFTDFELT-----DSRDA--ERLDRFAQTADEIRIADRGFGSRPECIRSLAFGE 187 YD + D L D RDA + ++R+ + I IADRG+ E I S Sbjct: 164 YDILEYTYDDVILQGQAVQDERDALNKMVERY-KGDKAIFIADRGY----ESINSFE--- 215 Query: 188 ADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 ++H G ++L G LR + E +++ + K A Sbjct: 216 -----KIHLSGNKYL-VRVKDIHSTGMLRSFGPFLDDEFDLIVKRTLTTKQTNEIKAHPE 269 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 P+ + RVV+ + E + ++T+L ++E+S + + + Y L Sbjct: 270 IYKFVPQNQRFDYFEDAPFYDFECRVVRFKITEDT-YECIVTNLDKNEFSMQDIKELYHL 328 Query: 308 RWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 RW+IE +++ LK L L+ L +K+ L + I+A ++ Sbjct: 329 RWEIETSYRELKYDLDLNTLHSKKRNLIEQEIYAKMI 365 >UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 RepID=A9DPK2_9GAMM Length = 269 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 24/62 (38%), Positives = 40/62 (64%) Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 L+T+L ++SA++V+ Y LRWQIEL FK LKS L ++ +A++ ++A++L Sbjct: 128 LITNLKRAQFSADKVSKLYGLRWQIELFFKELKSYSGLKTFNTRDKSIAESLVWASMLTL 187 Query: 347 FL 348 L Sbjct: 188 LL 189 >UniRef50_A9DNS7 Transposase n=1 Tax=Shewanella benthica KT99 RepID=A9DNS7_9GAMM Length = 190 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 24/62 (38%), Positives = 41/62 (66%) Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 L+T+L ++SA++V++ Y LRWQIEL FK LKS L ++ +A++ ++A++L Sbjct: 49 LITNLKRAQFSADKVSELYGLRWQIELFFKELKSYSGLKTFNTRDKSIAESLVWASMLTL 108 Query: 347 FL 348 L Sbjct: 109 LL 110 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 57/239 (23%), Positives = 99/239 (41%), Gaps = 55/239 (23%) Query: 131 LHMG--YDPHTCQFTDFELTDSRDA-------ERLDRFAQTADEIRIADRGFGSRPECIR 181 LH+ YD + Q+TD + SR A E +DR+ T+ I IADRG+ + Sbjct: 143 LHLNAFYDLCSRQYTDAIIQPSRLANERRAMCEMIDRYNDTS-AIFIADRGYEN------ 195 Query: 182 SLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT--VMIGNSG----- 234 + V +G+ +L +R D NG T+ M+ SG Sbjct: 196 ------YNIFAHVEHKGMYYL------------IRVKDITSNGITSKLTMLPESGEFDEW 237 Query: 235 ----------NKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 N+ P R+I P + + K RV++ + + Sbjct: 238 VNVTLTKKQTNEVKANPKKYRVIDKKTPFDYLDLHFNNFYE---MKMRVIRF-PIPQGSY 293 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANL 343 ++T+LP+D+++++++ Y RW IE +F+ LK L L +K+PE I++ + Sbjct: 294 ECIITNLPQDKFNSDEIKRLYAKRWGIETSFRELKYALGLTRFHSKKPEYIMQEIWSRM 352 >UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3VMZ1_KLEPN Length = 421 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 28/64 (43%), Positives = 37/64 (57%) Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 V L T+L + A V + YRLRWQIEL FK KSL L+ + +A+ I+ +LL Sbjct: 276 VCLCTNLDRHTFPAATVGEWYRLRWQIELLFKEWKSLNSLNKFNTEYSTIAETLIWGSLL 335 Query: 345 AAFL 348 AA L Sbjct: 336 AATL 339 >UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID=Q64B41_9ARCH Length = 439 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 28/81 (34%), Positives = 45/81 (55%) Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 S +R+ R+V A ++ + LT++ D SAE++A Y RW+IEL FK LKS + Sbjct: 280 STVKRRFRMVCAFNSDSGKYHSYLTNIRVDILSAEEIALLYGARWEIELIFKELKSHYRM 339 Query: 325 DALRAKEPELAKAWIFANLLA 345 D + + P + K I+ +L Sbjct: 340 DQIPSANPNIVKCLIWIAILT 360 >UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZQ0_9PLAN Length = 457 Score = 48.1 bits (113), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 24/65 (36%), Positives = 35/65 (53%) Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 L T L D Y AE +AD YR RWQ EL + LK + +D LR K P + + + +++ Sbjct: 303 TLATTLLQGDVYRAEDLADLYRRRWQAELHIRSLKIQMQMDHLRCKSPAMVRKELHCHMI 362 Query: 345 AAFLI 349 L+ Sbjct: 363 GYNLV 367 >UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XCY0_9BACT Length = 481 Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 23/69 (33%), Positives = 43/69 (62%) Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +L+ T L Y E++A+ Y RW+IEL+F+ LK+ L L+ LR + P + + ++ +L+ Sbjct: 309 MLVTTLLDPVRYPVEELAELYLRRWEIELSFRDLKTTLGLEVLRCQSPAMVEKEVWMHLI 368 Query: 345 AAFLIDDII 353 A L+ ++ Sbjct: 369 AFNLLRRVM 377 >UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_METBF Length = 435 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 29/96 (30%), Positives = 50/96 (52%), Gaps = 3/96 (3%) Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 R+V E + + +T++ +D +A+ +A+ Y RW IEL FK LKS LD L K Sbjct: 281 RLVAVYNDEDEKYHIYITNIQKDILNAKDIANLYGARWDIELLFKELKSKYSLDVLETKN 340 Query: 332 PELAKAWIFANLLAAFL---IDDIIQPSLDFPPRSA 364 ++ +A I+ +L + I +++ S P + A Sbjct: 341 VQVIEALIWTAILTLIVSRRIYSLVRKSTTHPEKMA 376 >UniRef50_C3BTW8 Transposase for insertion sequence element IS231B n=13 Tax=Bacillus RepID=C3BTW8_9BACI Length = 387 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 31/109 (28%), Positives = 52/109 (47%), Gaps = 6/109 (5%) Query: 243 PARLIAVSLPPEKALISKTRLLSEN---RRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 P R+I L E+ + RL + ++KG A + +G + +T+ P D Sbjct: 191 PTRVIVHRLTKEQ---QQKRLQDQTVREKKKGMKYSARSKRLSGINVYMTNTPTDIVPMG 247 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 Q+ D Y LRWQIE+ FK KS ++ + + E + ++ L+A L Sbjct: 248 QLHDWYSLRWQIEILFKTWKSFFYIHHCKKIKRERLECHLYGQLIAILL 296 >UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W6S4_DESAS Length = 465 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 27/67 (40%), Positives = 39/67 (58%) Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 G LLT+ D SA ++ YR R QIE+ FK LK LL L+ + + PE +A++F Sbjct: 303 GIFALLTNYDADRVSANKLIKKYRERNQIEVNFKDLKGLLDLERIFLQLPERIEAYVFPK 362 Query: 343 LLAAFLI 349 LA F++ Sbjct: 363 TLAYFVL 369 >UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostridium nexile DSM 1787 RepID=B6FTH4_9CLOT Length = 224 Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust. Identities = 28/104 (26%), Positives = 51/104 (49%), Gaps = 4/104 (3%) Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 P + IA S P + + + + N R R +E + +LT+LP++++ E+ Sbjct: 47 PQKFKFIAKSSPFDYLDLYDKKFYTLNFRVVRFAISED----SYESILTNLPKEDFPVEE 102 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 + Y +RW IE +F+ LK + L +K+ E I+A L+ Sbjct: 103 IKKVYAMRWGIETSFRELKYAIGLCCFHSKKVEYIMQEIYARLI 146 >UniRef50_Q9X6I5 Putative uncharacterized protein n=2 Tax=Bacillus thuringiensis RepID=Q9X6I5_BACTU Length = 118 Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 21/50 (42%), Positives = 30/50 (60%) Query: 299 EQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 +QV + Y LRWQIE+ FK KSL +D R + E + ++ L+A FL Sbjct: 2 KQVHELYSLRWQIEIVFKTWKSLFDIDHCRTVKQERIECHLYGKLIAIFL 51 >UniRef50_C5EN31 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EN31_9FIRM Length = 148 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 5/99 (5%) Query: 245 RLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADC 304 R I ++P + LI+ +R E R + RVV+ + E G+ ++T+LP DE+S EQ+ Sbjct: 36 RYICKAVPFD--LITDSR--PEYRMQLRVVRFQIAEG-GYENIITNLPADEFSLEQIKHI 90 Query: 305 YRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANL 343 Y L W E +F+ LK + + + P+ + I + Sbjct: 91 YHLLWGQETSFRDLKHTIGTENFHSGSPKYIEFEILCRM 129 >UniRef50_Q3M8C5 Transposase, IS4 n=15 Tax=Cyanobacteria RepID=Q3M8C5_ANAVT Length = 340 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 3/94 (3%) Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPED---EYSAEQVADCYRLRWQIELAFKRLKSL 321 S++ + RV+ LE L+T+LP D S + + D Y LRW +EL +K LK Sbjct: 213 SDDAQAYRVINFCDLETKTEFRLVTNLPADGEATVSDDDIRDIYLLRWGVELLWKFLKMH 272 Query: 322 LHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 L LD L K I+ +L+A ++ + P Sbjct: 273 LKLDKLITKNVNGITIQIYVSLIAYLILQLVSIP 306 >UniRef50_C3FBK7 Transposase for insertion sequence element IS231B n=3 Tax=Bacillus thuringiensis RepID=C3FBK7_BACTU Length = 180 Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 40/80 (50%) Query: 269 RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 +KG ++ G + +T+ P + EQ+ D Y LRWQIE+ FK KSL + + Sbjct: 13 KKGITFSEKSKRLTGINIYVTNAPWEVVPMEQIHDFYSLRWQIEIIFKTWKSLFQMHHWQ 72 Query: 329 AKEPELAKAWIFANLLAAFL 348 + E + ++ L+A L Sbjct: 73 TIKQERLECHVYEKLIAILL 92 >UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostridiales RepID=C7GFW6_9FIRM Length = 436 Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust. Identities = 47/190 (24%), Positives = 85/190 (44%), Gaps = 12/190 (6%) Query: 156 LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 +DR+A A I IADRGF S ++ D+++R ++ G D + Sbjct: 178 MDRYAYGASPIFIADRGFSSYNVFAHAIE-NNVDFLIRAKDLNVQRFLGGGTLPDKLD-- 234 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS-LPPEKALISKTRLLSENRRKGRVV 274 ++ + K++ + + IA L P A IS LL K R+V Sbjct: 235 TTIELILTRTQSKKKHKHPEKESQYRYIGKNIAFDYLNP--ADISDEYLL-----KLRIV 287 Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 + E + ++ T+L E++++ + + CY LRW IE +F+ LK + L +K+ E Sbjct: 288 RVEVSDGVFENII-TTLSEEDFTPDDIKYCYNLRWGIETSFRDLKHTIGATNLHSKKTEY 346 Query: 335 AKAWIFANLL 344 +++ L+ Sbjct: 347 VAFELWSKLI 356 >UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobacteria RepID=C5T3Q2_ACIDE Length = 436 Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust. Identities = 29/81 (35%), Positives = 46/81 (56%), Gaps = 6/81 (7%) Query: 275 QAETLE--AAGHV--LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD--ALR 328 QA +E A G V + + L ++++A +A YR RW+IEL F+ +K L LR Sbjct: 269 QARLIEVRAGGKVRRFITSMLDPEQFAAAPLAQLYRQRWEIELGFREIKQSLQQGQAVLR 328 Query: 329 AKEPELAKAWIFANLLAAFLI 349 +K+PEL K ++ L+A L+ Sbjct: 329 SKQPELVKQEVWGVLIAYTLL 349 >UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipelotrichaceae RepID=B7CEB8_9FIRM Length = 431 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 22/68 (32%), Positives = 38/68 (55%) Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 L+T+L DE+ ++ Y +RW IE AFK LK ++ + A +K+ + I+A +L Sbjct: 292 LVTNLTRDEFDLNELKKMYHMRWDIETAFKVLKYIIGMMAFHSKKRNFIQQEIYAAILLH 351 Query: 347 FLIDDIIQ 354 L + I + Sbjct: 352 CLTNIITE 359 >UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IS08_9CHRO Length = 472 Score = 44.3 bits (103), Expect = 0.008, Method: Compositional matrix adjust. Identities = 21/70 (30%), Positives = 40/70 (57%), Gaps = 1/70 (1%) Query: 285 VLLLTSLPED-EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANL 343 ++++T+L + EY + + D Y RWQ E+ + +K+ L +D L + PE+ + I+ L Sbjct: 313 IIVVTTLIDAIEYPSSDILDLYDQRWQAEVNLRNIKTTLGMDILTCQTPEMVRKEIYVYL 372 Query: 344 LAAFLIDDII 353 LA + I+ Sbjct: 373 LAYNFLRSIM 382 >UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZNH0_9PLAN Length = 451 Score = 43.9 bits (102), Expect = 0.008, Method: Compositional matrix adjust. Identities = 33/125 (26%), Positives = 60/125 (48%), Gaps = 13/125 (10%) Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 E M GN G K G+ P R I + P ++ + R+GRV +T Sbjct: 302 ELITMGGNCGASKIGSDHPMRRIKLIPPADR---------PSSARQGRVRTDQT--GRDE 350 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 ++L T+L + +AE++ Y RW++EL F+ LK +L L + + + ++ ++ Sbjct: 351 LVLATTLMD--LTAEEIVRLYEHRWEVELFFRFLKQVLGCKKLLSAKTAGVQIQLYCAII 408 Query: 345 AAFLI 349 A+ L+ Sbjct: 409 ASLLL 413 >UniRef50_Q1PWW4 Putative uncharacterized protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PWW4_9BACT Length = 166 Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust. Identities = 25/63 (39%), Positives = 36/63 (57%), Gaps = 1/63 (1%) Query: 286 LLLTSLPEDEY-SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +LLTSLP D + A V +CY RWQIE+ FK LKS ++ + + E K I ++ Sbjct: 1 MLLTSLPADTFRQACLVVECYLCRWQIEIYFKVLKSGCKIEERQLETAERIKPCIALYMI 60 Query: 345 AAF 347 A+ Sbjct: 61 VAW 63 >UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacterium Mst37 RepID=Q8VV93_9GAMM Length = 423 Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust. Identities = 19/52 (36%), Positives = 32/52 (61%) Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIF 340 T+L +++SAE+V Y+LRWQIEL FK KS +L ++ + + ++ Sbjct: 282 TNLDREQFSAEKVMKLYQLRWQIELLFKEWKSYCNLQKFNTRKATMMEGLVW 333 >UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH7_9BACT Length = 382 Score = 43.5 bits (101), Expect = 0.012, Method: Compositional matrix adjust. Identities = 26/75 (34%), Positives = 45/75 (60%), Gaps = 2/75 (2%) Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +LL+TS ++ +A ++ YR RWQIE+ FK LKS+L L A+ +++ L+ Sbjct: 279 ILLVTSEAPEKLNAAIISTIYRQRWQIEVFFKWLKSILGCRKLLAESSNGVAIQMYSALI 338 Query: 345 AAFLIDDII--QPSL 357 AA ++ D+ +P+L Sbjct: 339 AAIMLFDLFGKKPTL 353 >UniRef50_C9R546 Transposase (IS4 family) protein n=1 Tax=Aggregatibacter actinomycetemcomitans D11S-1 RepID=C9R546_AGGAD Length = 382 Score = 43.5 bits (101), Expect = 0.013, Method: Compositional matrix adjust. Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 2/84 (2%) Query: 266 ENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD 325 E R R++ A E + LT+ +YSA ++A+ Y+ RW+IE+ FK +K L Sbjct: 252 ETRHIFRLIIARLKEKDEEIYFLTN--HADYSATEIAELYKRRWEIEVFFKFIKQHLDFS 309 Query: 326 ALRAKEPELAKAWIFANLLAAFLI 349 L ++ K ++ L+ A L+ Sbjct: 310 HLLSRNENGMKVEMYMTLITAILL 333 >UniRef50_Q05309 Transposase for insertion sequence element IS1151 n=16 Tax=Clostridium perfringens RepID=T1151_CLOPE Length = 473 Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust. Identities = 40/164 (24%), Positives = 69/164 (42%), Gaps = 25/164 (15%) Query: 203 TAEGMRFDMMGFLRGLDCGKNGETT-VMIGNSGNKKAGAPFPARLIAVSLPPEKAL---- 257 ++E ++ D++ L G+ E T + IG + +RLI L E Sbjct: 241 SSEYIKIDIIKLAEPLAAGETIELTDIYIG------SKKELKSRLIITKLTEENKSKRIF 294 Query: 258 -----ISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 I K RL RR L+ +T++ + + QV + Y LRWQIE Sbjct: 295 NHIEGIKKKRLTLNQRR---------LDFNSINAYITNVSSNIITMNQVHELYSLRWQIE 345 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 + FK KS+ ++ ++ + E +++ L+A L I+ S Sbjct: 346 IIFKVWKSIFKINQVKKVKLERFMCFLYGRLIALLLSSTIVFTS 389 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust. Identities = 22/57 (38%), Positives = 35/57 (61%) Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANL 343 ++T+L +SAE++ + Y LRW IE +F+ LK + L + AK+ + K IFA L Sbjct: 295 VITNLDRFCFSAEKLKELYHLRWGIETSFRELKYAIGLTSFHAKKVDYIKQEIFARL 351 >UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A1U3_PELCD Length = 489 Score = 43.1 bits (100), Expect = 0.016, Method: Compositional matrix adjust. Identities = 20/71 (28%), Positives = 38/71 (53%) Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 G ++ T + Y AE +A+ Y RW +EL F+ +K+ + +D LR P++ + I + Sbjct: 320 GFYIVTTLIDAARYPAEDLAELYFKRWDVELFFRDIKTTMGMDVLRCLTPDMIRKEILMH 379 Query: 343 LLAAFLIDDII 353 +A + +I Sbjct: 380 FIAYNCVRRLI 390 >UniRef50_UPI00016C3BAC transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3BAC Length = 218 Score = 42.7 bits (99), Expect = 0.021, Method: Compositional matrix adjust. Identities = 20/58 (34%), Positives = 34/58 (58%) Query: 296 YSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 Y+ E +A Y RW++EL + +K L +D L K PE+ + I+ +LLA ++ +I Sbjct: 44 YTREDLAQLYHHRWRVELWIRDIKQTLAMDVLGGKTPEMLRREIWCHLLAYNVVRHVI 101 >UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae RepID=Q7ULM3_RHOBA Length = 458 Score = 42.7 bits (99), Expect = 0.021, Method: Compositional matrix adjust. Identities = 73/340 (21%), Positives = 127/340 (37%), Gaps = 47/340 (13%) Query: 39 DAATLLRLGLAYGPGGMSLREVTAWAQLHDVATL--SDVALLKRLRNAADWFGI--LAAQ 94 D +L L P SLR ++ ++L V ++ A L L A F L Sbjct: 86 DQYCMLVLLYVLNPTVSSLRAISQASELTKVRNKLSNEKASLGSLSEAGGLFSADHLKPV 145 Query: 95 TLAVRAAVTGCTSGKRLR-------LVDGTAISA-------------PGGGSAEWRLHMG 134 A+ A V RL VDG+ ++A G WRLH Sbjct: 146 IEALSAEVNDAAPDPRLSSIQQTITAVDGSLVNALPSLIAASILKQTTGSALVRWRLHTH 205 Query: 135 YDPHTCQFTDFELTD----SRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADY 190 ++ + ++T D + + D + + DRG+ ++ S+ + Y Sbjct: 206 FEVNNLLPARVDVTPDGGGQHDERAVLKRVLEEDRLYVMDRGY-AKFSLFNSIVASSSSY 264 Query: 191 IVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS 250 + R+ + T E ++ R G +T V +G S + P RLI + Sbjct: 265 VCRLRDNTVYETTQE---LELTEGDRA--AGVLSDTIVKLGGSSSSSNSPDHPIRLIQIR 319 Query: 251 LPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQ 310 P +NR G+ ++ + G + + T+L AE +A Y RW Sbjct: 320 CTPH-----------QNRTGGKARGSKAPNSDGILRIATNLLN--VPAEIIALIYAYRWT 366 Query: 311 IELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 IE+ F+ K L+ D L + + ++ +++A LI+ Sbjct: 367 IEIFFRFYKQLMGGDHLISHNANGIQIQVYCSVIACLLIN 406 >UniRef50_B2IXJ5 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2IXJ5_NOSP7 Length = 238 Score = 42.7 bits (99), Expect = 0.021, Method: Compositional matrix adjust. Identities = 22/64 (34%), Positives = 33/64 (51%) Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLA 345 L+ T L YS + Y RW +E+ + LK+ L +D LR K P + + I+ LLA Sbjct: 125 LITTLLDITTYSTLDIVGLYGKRWDVEIDLRHLKTTLGMDVLRCKTPSMVRKEIYVYLLA 184 Query: 346 AFLI 349 L+ Sbjct: 185 YNLL 188 >UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacillus sp. SG-1 RepID=A6CHG0_9BACI Length = 381 Score = 42.7 bits (99), Expect = 0.024, Method: Compositional matrix adjust. Identities = 32/108 (29%), Positives = 53/108 (49%), Gaps = 15/108 (13%) Query: 256 ALISKTRLLSENRRKGRVV-----QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQ 310 A I K ++NR RVV + ++ A ++++++ PED +A Y+LRWQ Sbjct: 237 AFIGKNSRKTKNR--FRVVTFTDNEGNRIKVATNLMMMS--PED------IAYIYKLRWQ 286 Query: 311 IELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 IEL F+ +K L L L P ++ L++ FL+ I + D Sbjct: 287 IELFFRWVKGNLDLSNLFGNSPNSVYIQVYGTLISYFLLRWIYNETKD 334 >UniRef50_Q55566 Putative transposase for insertion sequence element IS4SA n=10 Tax=Synechocystis sp. PCC 6803 RepID=T4SA_SYNY3 Length = 338 Score = 42.4 bits (98), Expect = 0.025, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 5/74 (6%) Query: 287 LLTSLPEDE-----YSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFA 341 L+T+LP + S E++A+ Y+ RWQIEL +K LK L L+ L AK I+ Sbjct: 231 LVTNLPIESKEIEGVSDEKIAEIYKKRWQIELLWKFLKMHLKLNRLIAKNENAIGIQIYT 290 Query: 342 NLLAAFLIDDIIQP 355 ++A ++ ++ P Sbjct: 291 CIIAYLILKLLVIP 304 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 42.4 bits (98), Expect = 0.027, Method: Compositional matrix adjust. Identities = 26/88 (29%), Positives = 45/88 (51%), Gaps = 1/88 (1%) Query: 266 ENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD 325 E R R+V+ + E VL T+L ++++SA+ + Y +RW IE AF +LK L Sbjct: 283 EYRLNLRIVKIKLSETTTEVLF-TNLSKEKFSADDLKRLYHMRWGIETAFDQLKYALGAA 341 Query: 326 ALRAKEPELAKAWIFANLLAAFLIDDII 353 ++ +K EL ++ L+ I+ Sbjct: 342 SVHSKNSELIIQELYGKLIMFNFCKTIV 369 >UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitrococcus mobilis Nb-231 RepID=A4BL98_9GAMM Length = 426 Score = 42.0 bits (97), Expect = 0.036, Method: Compositional matrix adjust. Identities = 25/71 (35%), Positives = 37/71 (52%) Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 L G V + T Y +A+ YR RW IEL + +K+ + ++ LR K PE + Sbjct: 257 LAVNGIVYVTTLSNPKRYPRRALAEHYRSRWTIELDLRSIKTDMAMERLRCKSPERVRKE 316 Query: 339 IFANLLAAFLI 349 I A+LLA L+ Sbjct: 317 IAAHLLAYNLV 327 >UniRef50_D1XZ52 Transposase, IS4 family n=1 Tax=Prevotella bivia JCVIHMP010 RepID=D1XZ52_9BACT Length = 241 Score = 41.6 bits (96), Expect = 0.047, Method: Compositional matrix adjust. Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 1/64 (1%) Query: 295 EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 E +AEQVA Y+ RW++EL FK LK L + K I+A ++A L+ I+Q Sbjct: 131 EVTAEQVALLYKYRWRVELFFKWLKQHLRIKEFYGTSENAVKIQIYAAIIAYCLV-VIVQ 189 Query: 355 PSLD 358 +D Sbjct: 190 ECMD 193 >UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID=Q4V248_BACCZ Length = 140 Score = 41.6 bits (96), Expect = 0.049, Method: Compositional matrix adjust. Identities = 20/56 (35%), Positives = 30/56 (53%) Query: 269 RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 +KG ++ G + +T+ P + EQ+ D Y LRWQIE+ FK KSL + Sbjct: 78 KKGITYSEKSKRLTGINIYVTNTPWEIVPMEQIHDFYSLRWQIEITFKTWKSLFQI 133 >UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7AA71_THEAQ Length = 393 Score = 41.2 bits (95), Expect = 0.054, Method: Compositional matrix adjust. Identities = 51/168 (30%), Positives = 74/168 (44%), Gaps = 27/168 (16%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHW-RGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 + +ADRGF R + LA GE +++VRV+ R L EG + L CG+ Sbjct: 169 VYVADRGFDDRKVFGQVLALGE-EFVVRVYRDRKL----GEGGSLAKVASSLALPCGE-- 221 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV-QAETLEAAG 283 E + +G ++ F R + V E RR VV + L G Sbjct: 222 EVELRVGGR-YQRVRLHFGWREVEV----------------EGRRLHLVVCRVPALGRRG 264 Query: 284 HVLLLTSLP-EDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAK 330 LLTSLP A QV + YR RW++E F+ LK+ L L+ + + Sbjct: 265 EWWLLTSLPVRGREEAAQVVEAYRRRWEVERFFRLLKTGLGLETFQVR 312 >UniRef50_Q093Y3 Isrso13-transposase protein n=7 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093Y3_STIAU Length = 457 Score = 41.2 bits (95), Expect = 0.058, Method: Compositional matrix adjust. Identities = 19/39 (48%), Positives = 27/39 (69%) Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH 323 + LLT++P + A +VA+ YR RW IE F RL+S+LH Sbjct: 285 IRLLTNVPAERMGALEVAELYRRRWSIEGMFGRLESVLH 323 >UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4SUB1_AERS4 Length = 420 Score = 40.8 bits (94), Expect = 0.079, Method: Compositional matrix adjust. Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 4/114 (3%) Query: 245 RLIAVSLPPEKALISKTRLLSENRRKG----RVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 +L +SL E + +L + + G R+++ E + +T+L + + AE+ Sbjct: 236 KLTGMSLKEEGRRHCRAEVLDMDVKSGKYEYRLIRRWFAEETRFCVWMTNLARETWPAER 295 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 V YR RWQ+EL FK KS +L + + + ++ +LL+ L + Q Sbjct: 296 VMRLYRCRWQVELLFKERKSYNNLKGFVTGQKAITEGLVWDSLLSLVLKRRVAQ 349 >UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanobacteria RepID=B0CC46_ACAM1 Length = 482 Score = 40.8 bits (94), Expect = 0.083, Method: Compositional matrix adjust. Identities = 22/73 (30%), Positives = 45/73 (61%), Gaps = 2/73 (2%) Query: 284 HVLLLTSLPEDE-YSAEQVADCYRLRWQI-ELAFKRLKSLLHLDALRAKEPELAKAWIFA 341 H++++T+L + + YSA Q+ Y RW + E+ + LK+ L ++ L AK P++ + I+ Sbjct: 312 HIIVVTTLLDAQRYSAGQLTRLYGWRWPVAEVNLRHLKTTLKMEMLSAKTPDMVRKDIWV 371 Query: 342 NLLAAFLIDDIIQ 354 +LL L+ +++ Sbjct: 372 HLLGYNLLRSLME 384 >UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FU81_METHJ Length = 452 Score = 40.8 bits (94), Expect = 0.083, Method: Compositional matrix adjust. Identities = 22/68 (32%), Positives = 36/68 (52%) Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLA 345 L +T+L ++ +SA+ + + YR RW IEL FK LK L + +A I++ LL Sbjct: 308 LYITNLGKEVFSADDIYELYRFRWVIELIFKELKGDYDLGKMLLNNEPMAFIHIYSMLLR 367 Query: 346 AFLIDDII 353 + D+ Sbjct: 368 FIISRDLF 375 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P11901 Transposase for insertion sequence element IS421... 457 e-127 UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium... 363 6e-99 UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfati... 343 7e-93 UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0... 326 6e-88 UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DS... 319 9e-86 UniRef50_Q1Q5J6 Putative uncharacterized protein n=6 Tax=Candida... 312 2e-83 UniRef50_A6DTQ2 Putative transposase insL for insertion sequence... 298 2e-79 UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001... 290 6e-77 UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW 261 3e-68 UniRef50_P12249 Transposase for insertion sequence element IS231... 258 2e-67 UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geoba... 242 1e-62 UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipe... 229 2e-58 UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosi... 205 2e-51 UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostrid... 200 7e-50 UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliph... 195 2e-48 UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=... 190 6e-47 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 171 4e-41 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 166 8e-40 UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candida... 155 2e-36 UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Ta... 144 7e-33 UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID... 143 9e-33 UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_... 135 2e-30 UniRef50_C3BTW8 Transposase for insertion sequence element IS231... 128 4e-28 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 127 6e-28 UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae Rep... 122 1e-26 UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium m... 122 2e-26 UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3... 114 5e-24 UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coproco... 111 6e-23 UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=... 110 1e-22 UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium... 103 1e-20 UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastop... 102 2e-20 UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=... 94 6e-18 UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 Rep... 86 2e-15 UniRef50_A9DNS7 Transposase n=1 Tax=Shewanella benthica KT99 Rep... 81 9e-14 UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfoto... 74 7e-12 UniRef50_C4ZTT5 Predicted divalent heavy-metal cations transport... 74 8e-12 Sequences not found previously or not previously below threshold: UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepI... 163 1e-38 UniRef50_Q05309 Transposase for insertion sequence element IS115... 151 4e-35 UniRef50_A7C1C1 IS231-related transposase n=6 Tax=Beggiatoa sp. ... 130 9e-29 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 128 3e-28 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 126 1e-27 UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales Rep... 124 4e-27 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 121 4e-26 UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostrid... 120 1e-25 UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostri... 116 2e-24 UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuri... 111 3e-23 UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungat... 111 5e-23 UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipe... 110 1e-22 UniRef50_C3FBK7 Transposase for insertion sequence element IS231... 107 6e-22 UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID... 104 4e-21 UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostri... 104 4e-21 UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM ... 104 4e-21 UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostri... 100 1e-19 UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=... 95 4e-18 UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4... 93 1e-17 UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanoth... 93 2e-17 UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacte... 91 8e-17 UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacil... 90 9e-17 UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacter... 90 1e-16 UniRef50_A5II18 Transposase, IS4 n=1 Tax=Legionella pneumophila ... 88 4e-16 UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_... 88 7e-16 UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae R... 87 9e-16 UniRef50_B6FLV1 Putative uncharacterized protein (Fragment) n=1 ... 87 1e-15 UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobac... 87 1e-15 UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces ... 87 1e-15 UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_A... 86 1e-15 UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC... 86 3e-15 UniRef50_Q648P7 Transposase n=2 Tax=environmental samples RepID=... 85 4e-15 UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitroco... 84 6e-15 UniRef50_A1HQH6 Transposase, IS4 family protein n=2 Tax=Thermosi... 84 7e-15 UniRef50_P55729 Putative transposase y4zB n=4 Tax=Rhizobiaceae R... 84 7e-15 UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobiu... 84 8e-15 UniRef50_A4J2U7 Transposase, IS4 family protein n=3 Tax=Desulfot... 84 9e-15 UniRef50_Q9X6I5 Putative uncharacterized protein n=2 Tax=Bacillu... 83 2e-14 UniRef50_C5EN31 Putative uncharacterized protein n=1 Tax=Clostri... 83 2e-14 UniRef50_B0NZ84 Putative uncharacterized protein n=1 Tax=Clostri... 80 1e-13 UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanob... 80 1e-13 UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangiu... 80 1e-13 UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria R... 80 1e-13 UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=V... 80 1e-13 UniRef50_D1K7L7 Transposase n=3 Tax=Bacteroidales RepID=D1K7L7_9... 79 2e-13 UniRef50_D1N0Z4 Transposase IS4 family protein n=3 Tax=Bacteria ... 79 2e-13 UniRef50_Q877R2 Transposase n=51 Tax=Bacteroidales RepID=Q877R2_... 79 2e-13 UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomyc... 79 3e-13 UniRef50_C9KS84 Transposase domain protein n=5 Tax=Bacteroidales... 79 3e-13 UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria R... 79 3e-13 UniRef50_C3R0J9 Transposase n=4 Tax=Bacteroidales RepID=C3R0J9_9... 79 3e-13 UniRef50_Q4V0X3 Possible transposase n=1 Tax=Bacillus cereus E33... 78 4e-13 UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultu... 78 4e-13 UniRef50_UPI0001C4271A transposase, IS4 family protein n=1 Tax=B... 78 5e-13 UniRef50_B6FVR6 Putative uncharacterized protein (Fragment) n=2 ... 78 5e-13 UniRef50_C5V7Z6 Transposase IS4 family protein n=3 Tax=root RepI... 78 7e-13 UniRef50_B3JNI1 Putative uncharacterized protein n=3 Tax=Bactero... 78 8e-13 UniRef50_Q3M8C5 Transposase, IS4 n=15 Tax=Cyanobacteria RepID=Q3... 77 1e-12 UniRef50_C4Z764 Putative uncharacterized protein n=4 Tax=Clostri... 77 1e-12 UniRef50_UPI00016C3BAC transposase n=1 Tax=Gemmata obscuriglobus... 77 1e-12 UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax... 76 2e-12 UniRef50_Q1VPP4 ISPg4, transposase n=7 Tax=Bacteria RepID=Q1VPP4... 76 2e-12 UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula mar... 76 2e-12 UniRef50_B3E6V4 Transposase IS4 family protein n=8 Tax=Proteobac... 76 2e-12 UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholde... 75 3e-12 UniRef50_C9C7H0 Transposase n=5 Tax=Enterococcus faecium RepID=C... 75 3e-12 UniRef50_C6J7R2 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 75 3e-12 UniRef50_C6JEA3 Putative uncharacterized protein n=1 Tax=Ruminoc... 75 4e-12 UniRef50_Q45620 Probable transposase for insertion sequence elem... 75 4e-12 UniRef50_D1XZ52 Transposase, IS4 family n=1 Tax=Prevotella bivia... 74 8e-12 UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q0... 74 9e-12 UniRef50_C6N0W0 Putative uncharacterized protein n=1 Tax=Legione... 74 1e-11 UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Coryneb... 74 1e-11 UniRef50_Q0F098 ISGsu1, transposase n=6 Tax=Mariprofundus ferroo... 74 1e-11 UniRef50_D1Q0M9 ISGsu1 transpoase n=7 Tax=Bacteroidales RepID=D1... 73 1e-11 UniRef50_B8FDX7 Transposase IS4 family protein n=2 Tax=Desulfati... 73 1e-11 UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanel... 73 1e-11 UniRef50_A6L0R8 Transposase n=13 Tax=Bacteroidales RepID=A6L0R8_... 73 1e-11 UniRef50_P03835 Transposase insG for insertion sequence element ... 73 2e-11 UniRef50_C3KKH4 Putative transposase Y4ZB n=2 Tax=Rhizobium sp. ... 72 3e-11 UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminoc... 72 3e-11 UniRef50_Q11ZL6 Transposase, IS4 family n=22 Tax=Bacteria RepID=... 71 5e-11 UniRef50_B2IXJ5 Putative uncharacterized protein n=1 Tax=Nostoc ... 71 6e-11 UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_A... 71 6e-11 UniRef50_A6DKD2 ISPg4, transposase n=7 Tax=Chlamydiae/Verrucomic... 71 7e-11 UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=St... 71 7e-11 UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkhold... 71 8e-11 UniRef50_B3PC11 ISCja2, transposase n=5 Tax=Proteobacteria RepID... 71 8e-11 UniRef50_B8FEP3 Transposase IS4 family protein n=1 Tax=Desulfati... 70 1e-10 UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholde... 70 1e-10 UniRef50_A8YU85 Transposase n=21 Tax=Lactobacillus RepID=A8YU85_... 70 1e-10 UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia R... 70 1e-10 UniRef50_A1ZPG0 Transposase of, putative n=3 Tax=Microscilla mar... 70 2e-10 UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammapro... 69 2e-10 UniRef50_B2PVI2 Putative uncharacterized protein n=1 Tax=Provide... 68 4e-10 UniRef50_C9R546 Transposase (IS4 family) protein n=1 Tax=Aggrega... 68 5e-10 UniRef50_A6CCZ3 Transposase, IS4 (Fragment) n=7 Tax=Planctomyces... 68 5e-10 UniRef50_A4BSI0 Putative uncharacterized protein n=1 Tax=Nitroco... 68 5e-10 UniRef50_UPI0000F70487 putative IS4 transposase n=1 Tax=Aeromona... 68 6e-10 UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria R... 68 7e-10 UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneo... 68 8e-10 UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfati... 67 9e-10 UniRef50_B8FXQ3 Transposase IS4 family protein n=8 Tax=Desulfito... 67 9e-10 UniRef50_C4XGQ6 Putative transposase for insertion sequence elem... 66 2e-09 UniRef50_C8W0R5 Transposase-like protein n=12 Tax=Desulfotomacul... 66 2e-09 UniRef50_Q8ABH9 Putative transposase n=1 Tax=Bacteroides thetaio... 66 2e-09 UniRef50_A7C4E9 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 66 2e-09 UniRef50_A3ZQJ1 Probable transposase n=4 Tax=Blastopirellula mar... 66 2e-09 UniRef50_Q55566 Putative transposase for insertion sequence elem... 66 2e-09 UniRef50_A3EIG1 FOG: Transposase and inactivated derivatives n=3... 66 3e-09 UniRef50_C9LFX6 Transposase domain protein n=14 Tax=Bacteroidale... 65 4e-09 UniRef50_C6J0N9 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 65 5e-09 UniRef50_C7PAE4 Transposase IS4 family protein n=4 Tax=Chitinoph... 65 5e-09 UniRef50_Q18EK5 Probable transposase (ISH8/ISH26) n=5 Tax=Haloqu... 64 6e-09 UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomyc... 64 7e-09 UniRef50_Q7UY96 Similar to transposase n=1 Tax=Rhodopirellula ba... 64 7e-09 UniRef50_Q46310 Transposase n=1 Tax=Carnobacterium maltaromaticu... 64 1e-08 UniRef50_C0VKK7 ISCja2 transposase n=8 Tax=Acinetobacter RepID=C... 64 1e-08 UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula balt... 63 2e-08 UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio ... 62 3e-08 UniRef50_A1APW2 Transposase, IS4 family n=6 Tax=Deltaproteobacte... 62 4e-08 UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 ... 61 5e-08 UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithio... 60 2e-07 UniRef50_Q093Y3 Isrso13-transposase protein n=7 Tax=Stigmatella ... 60 2e-07 UniRef50_C3AUM2 Transposase for insertion sequence element IS231... 59 2e-07 UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii ... 59 3e-07 UniRef50_Q8QNB6 EsV-1-170 n=2 Tax=Ectocarpus siliculosus virus 1... 59 3e-07 UniRef50_B5WFI6 Putative uncharacterized protein n=1 Tax=Burkhol... 59 3e-07 UniRef50_UPI0001AF03EF IS4 family transposase n=1 Tax=Streptomyc... 58 4e-07 UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocac... 58 4e-07 UniRef50_A1BCF6 Transposase, IS4 family protein n=1 Tax=Chlorobi... 58 6e-07 UniRef50_A7C2A8 Transposase of IS641 n=1 Tax=Beggiatoa sp. PS Re... 58 8e-07 UniRef50_A4A0C3 Probable transposase n=1 Tax=Blastopirellula mar... 57 9e-07 UniRef50_UPI00016C37A0 transposase, IS4 n=2 Tax=Gemmata obscurig... 57 1e-06 UniRef50_Q3M9Z5 Transposase, IS4 n=10 Tax=Cyanobacteria RepID=Q3... 57 1e-06 UniRef50_D0DW10 Transposase IS4 family protein n=5 Tax=Lactobaci... 57 1e-06 UniRef50_C3FCZ5 Transposase for insertion sequence element IS231... 57 1e-06 UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae Re... 56 2e-06 UniRef50_A7GMF1 Transposase IS4 family protein n=15 Tax=Bacillus... 56 2e-06 UniRef50_A3ZMM8 Transposase insG for insertion sequence element-... 56 2e-06 UniRef50_Q64E61 Transposase n=1 Tax=uncultured archaeon GZfos14B... 56 3e-06 UniRef50_C6DY52 Transposase IS4 family protein n=1 Tax=Geobacter... 56 3e-06 UniRef50_Q67PW6 Transposase-like protein n=14 Tax=Symbiobacteriu... 55 4e-06 UniRef50_A5D1X0 Transposase n=1 Tax=Pelotomaculum thermopropioni... 55 4e-06 UniRef50_UPI0000164DB3 hypothetical protein TVN0693 n=1 Tax=Ther... 55 5e-06 UniRef50_Q737L2 IS231-related transposase n=3 Tax=Bacillus cereu... 54 8e-06 UniRef50_C5VJA1 Transposase domain protein n=15 Tax=Prevotella R... 54 9e-06 UniRef50_A6DM44 Putative uncharacterized protein n=2 Tax=Lentisp... 54 1e-05 UniRef50_B0R9V4 Transposase (TCE33) n=2 Tax=Halobacterium salina... 54 1e-05 UniRef50_Q8DM76 Tlr0247 protein n=2 Tax=Thermosynechococcus elon... 54 1e-05 UniRef50_C6JAL6 Transposase (Fragment) n=1 Tax=Ruminococcus sp. ... 53 2e-05 UniRef50_A5WBL3 Transposase, IS4 family n=2 Tax=Bacteria RepID=A... 52 3e-05 UniRef50_A6DG92 ISPg4, transposase n=1 Tax=Lentisphaera araneosa... 52 3e-05 UniRef50_C0GNX3 Transposase IS4 family protein n=3 Tax=Desulfona... 52 3e-05 UniRef50_C8VXW5 Transposase IS4 family protein n=3 Tax=Desulfoto... 52 4e-05 UniRef50_C8XGG2 Transposase IS4 family protein n=2 Tax=Nakamurel... 52 4e-05 UniRef50_Q1PWW4 Putative uncharacterized protein n=2 Tax=Candida... 52 5e-05 UniRef50_C3RGR4 Putative uncharacterized protein n=2 Tax=Bactero... 52 5e-05 UniRef50_C2V5D0 Transposase for insertion sequence element IS231... 51 5e-05 UniRef50_Q1VPU4 Putative uncharacterized protein n=7 Tax=Psychro... 51 5e-05 UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2... 51 6e-05 UniRef50_Q978C6 TVG1544340 protein n=2 Tax=Thermoplasma volcaniu... 51 7e-05 UniRef50_Q82R33 Putative IS4 family ISFsp6-like transposase n=1 ... 51 7e-05 UniRef50_C3BDU8 Transposase for insertion sequence element IS231... 51 8e-05 UniRef50_C0WV66 Transposase IS4 family protein n=5 Tax=Lactobaci... 51 9e-05 UniRef50_Q5GUK2 ISxac1 transposase n=1 Tax=Xanthomonas oryzae pv... 51 1e-04 UniRef50_Q9R3J0 Transposase, putative n=10 Tax=Deinococcus radio... 50 1e-04 UniRef50_A0LAZ1 Transposase, IS4 family protein n=4 Tax=Magnetoc... 50 1e-04 UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia s... 50 1e-04 UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Ra... 50 1e-04 UniRef50_Q1Q2K2 Putative uncharacterized protein n=5 Tax=Candida... 50 1e-04 UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE 50 2e-04 UniRef50_Q647P2 Transposase n=1 Tax=uncultured archaeon GZfos9E5... 49 2e-04 UniRef50_Q73GX5 Conserved domain protein n=5 Tax=Wolbachia RepID... 49 2e-04 UniRef50_D1VZM3 Transposase, IS4 family n=3 Tax=Prevotella RepID... 49 2e-04 UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 1... 49 3e-04 UniRef50_C6I0E1 Transposase, IS4 family protein n=4 Tax=Leptospi... 49 3e-04 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 49 3e-04 UniRef50_B1IL33 Putative uncharacterized protein n=1 Tax=Clostri... 49 3e-04 UniRef50_A3H586 Putative uncharacterized protein (Fragment) n=2 ... 49 4e-04 UniRef50_D1JFQ9 Putative uncharacterized protein n=1 Tax=uncultu... 49 4e-04 UniRef50_A3H523 Transposase (IS4 family) protein (Fragment) n=1 ... 48 5e-04 UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC 48 7e-04 UniRef50_Q6LRT4 Similar to transposase n=38 Tax=Photobacterium p... 47 7e-04 UniRef50_B4B8T5 Transposase IS4 family protein n=1 Tax=Cyanothec... 47 7e-04 UniRef50_Q6MS13 Transposase IS1634BQ n=39 Tax=Mycoplasma RepID=Q... 47 7e-04 UniRef50_B2AJ60 Transposase, IS4 family n=4 Tax=Proteobacteria R... 47 8e-04 UniRef50_Q1VRR5 Putative uncharacterized protein n=8 Tax=Bactero... 47 0.001 UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 ... 47 0.001 UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus a... 47 0.001 UniRef50_A4A0C6 Probable transposase n=2 Tax=Blastopirellula mar... 47 0.001 UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobact... 47 0.001 UniRef50_B7JAV1 Transposase, putative n=3 Tax=Acidithiobacillus ... 47 0.001 UniRef50_C7TBQ5 Transposase n=4 Tax=Lactobacillus rhamnosus RepI... 47 0.001 UniRef50_Q7NBK2 Predicted transposase n=10 Tax=Mycoplasma RepID=... 47 0.001 UniRef50_A9F243 Transposase, IS4 family n=4 Tax=Sorangium cellul... 47 0.001 UniRef50_Q4A8Q4 ISMHp1 transposase n=19 Tax=Mycoplasma RepID=Q4A... 47 0.001 UniRef50_B5CN98 Putative uncharacterized protein n=5 Tax=Clostri... 46 0.002 UniRef50_C9LZT7 Putative uncharacterized protein n=1 Tax=Lactoba... 46 0.002 UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID... 46 0.002 UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=... 46 0.002 UniRef50_C3IAJ8 Putative uncharacterized protein n=1 Tax=Bacillu... 46 0.002 UniRef50_Q04QP0 Transposase, ISLbp11 n=2 Tax=Leptospira borgpete... 46 0.002 UniRef50_B2AKB8 Transposase, IS4 family n=40 Tax=cellular organi... 46 0.002 UniRef50_C6PFH6 Transposase IS4 family protein n=2 Tax=Thermoana... 46 0.002 UniRef50_Q1PW38 Putative uncharacterized protein n=4 Tax=Candida... 46 0.002 UniRef50_Q64EL6 Putative uncharacterized protein n=1 Tax=uncultu... 46 0.002 UniRef50_C9KIM9 Transposase, IS4 family protein n=1 Tax=Mitsuoke... 46 0.002 UniRef50_A5CYT3 FOG: transposase and inactivated derivatives n=1... 46 0.002 UniRef50_A1WHR7 Transposase, IS4 family n=11 Tax=Proteobacteria ... 46 0.003 UniRef50_Q8VVL2 TRANSPOSASE ISMmy1G n=9 Tax=Mycoplasma RepID=Q8V... 46 0.003 UniRef50_C8VXQ2 Transposase (IS4 family protein) n=3 Tax=Desulfo... 46 0.003 UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromon... 45 0.004 UniRef50_B9P933 Predicted protein n=16 Tax=cellular organisms Re... 45 0.004 >UniRef50_P11901 Transposase for insertion sequence element IS421 n=41 Tax=cellular organisms RepID=T421_ECOLX Length = 371 Score = 457 bits (1176), Expect = e-127, Method: Composition-based stats. Identities = 370/371 (99%), Positives = 370/371 (99%), Gaps = 1/371 (0%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LRE 59 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS LRE Sbjct: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSSLRE 60 Query: 60 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI 119 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI Sbjct: 61 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI 120 Query: 120 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC 179 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC Sbjct: 121 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC 180 Query: 180 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 239 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG Sbjct: 181 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 240 Query: 240 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE Sbjct: 241 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 300 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF Sbjct: 301 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 360 Query: 360 PPRSAGSEKKN 370 PPRSAGSEKKN Sbjct: 361 PPRSAGSEKKN 371 >UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium RepID=C6AUF2_RHILS Length = 372 Score = 363 bits (931), Expect = 6e-99, Method: Composition-based stats. Identities = 154/364 (42%), Positives = 205/364 (56%), Gaps = 4/364 (1%) Query: 5 HDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWA 64 D+W + + +L+ +AR GA TR REI++A TLLRL LAYG GMSLRE AWA Sbjct: 8 LDHWPEVRERLPAGFDLEATARLRGAFTRVREIKNAETLLRLALAYGGLGMSLRETCAWA 67 Query: 65 QLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVT-GCTSGKRLRLVDGTAISAPG 123 + +A LSD +LL+RL AA W G + A +A +A V G +G RLR++DGT+I PG Sbjct: 68 EAGGIARLSDPSLLERLCKAAPWLGDIVAALIAEQAKVPTGRFAGYRLRVLDGTSICHPG 127 Query: 124 GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSL 183 WRLH+GYD T Q ELTD AE L R +I +ADR + +RP +R + Sbjct: 128 ADRTTWRLHVGYDLATAQVDQLELTDIHGAENLQRLTYAPGDIVLADR-YYARPRDLRPV 186 Query: 184 AFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGN-KKAGAPF 242 AD+IVR W LR L G FD+ L + GE V + P Sbjct: 187 IDAGADFIVRTGWNSLRLLQTNGEPFDLFAALAA-QQEQEGEVQVRVHEGMTGTPPPPPL 245 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 RLI P++A + RLL + R++G+ +LEAA ++LLLTSLP + + Sbjct: 246 VLRLIVRRKDPQQAQAEQERLLKDARKRGKKPDPRSLEAAKYILLLTSLPTATFPPADIL 305 Query: 303 DCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPR 362 YR RWQIELAFKR KSL LD+L AK+PELA+AW++A L+ A + + I D PP Sbjct: 306 TLYRFRWQIELAFKRFKSLAGLDSLPAKKPELARAWLYARLIVAIIAEQIAGQVPDSPPS 365 Query: 363 SAGS 366 G+ Sbjct: 366 GCGN 369 >UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F976_DESAA Length = 371 Score = 343 bits (879), Expect = 7e-93, Method: Composition-based stats. Identities = 120/369 (32%), Positives = 184/369 (49%), Gaps = 18/369 (4%) Query: 1 MNYSH-----DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM 55 M D+W AIL + P + A+ GAL+RRR LLR+ L + G Sbjct: 1 MENQILLSEGDDWQAILTFL--PHGWEEKAKELGALSRRRNFDGPEALLRVLLIHLVQGC 58 Query: 56 SLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVR---AAVTGCTSGKRLR 112 SLR +A ++ +A+ SDVALLKRL+ + +W +A + + G+ +R Sbjct: 59 SLRVTSALSKAGGLASASDVALLKRLKASGEWMRWMAVELMKQWFGKQPEKILGMGRTVR 118 Query: 113 LVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRG 172 +VDG+ +S PG W++H + Q + +TD + E L F ++ +ADRG Sbjct: 119 VVDGSTVSEPGSTGTTWKIHYSIQLPSLQCDEVYVTDPKTGEDLKNFNVHPGDVFLADRG 178 Query: 173 FGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGN 232 + R + + G D IVR+ + + G F ++ LR L + G+ I + Sbjct: 179 YYHRTGMLHVVK-GGGDLIVRMIHQY-KLYDINGQEFGLIKNLRSLTVNQIGDWDAFIHH 236 Query: 233 SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLP 292 +G R+ A+ E A +K +L EN +KG + ETL AA +V + T+L Sbjct: 237 KKEVISG-----RVCAIKKSKEAAEKAKRAILRENSKKGHKTKPETLVAAEYVFVFTTLS 291 Query: 293 EDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 E+ A QV + YR RWQ+ELAFKRLKSL+ L L+ + E AKAW+ + AAFL++ + Sbjct: 292 R-EWKASQVLEAYRGRWQVELAFKRLKSLIGLGHLKKTDFEGAKAWLHGKIFAAFLVEAM 350 Query: 353 IQPSLDFPP 361 I F P Sbjct: 351 IAACDSFSP 359 >UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0AF19_9BACT Length = 362 Score = 326 bits (836), Expect = 6e-88, Method: Composition-based stats. Identities = 109/359 (30%), Positives = 168/359 (46%), Gaps = 14/359 (3%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 + W + + PE + +AR GA + + IR A LLRL L + G+SLR A Sbjct: 8 EEWGLVKGLL--PEGWEVAAREQGAFKQAKGIRTAEELLRLILMHAGSGLSLRHAVARGA 65 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTG---CTSGKRLRLVDGTAISAP 122 + +SDVALLKRLRNA W ++ + L +A G VD T I Sbjct: 66 AAGLPEVSDVALLKRLRNAEGWLRWMSVRLLEQQAGQPRWSRLPEGWTAVAVDSTTIEES 125 Query: 123 GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS 182 G +WRLH + ELTD++ E L R+ ++ + DR F P+ IR Sbjct: 126 GASGTDWRLHYAIGLPSLFCEQAELTDNKGGESLCRYKVRKGDLFLGDRNFCRAPQ-IRH 184 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPF 242 + + ++R H L +G D+ +L L + E V + + Sbjct: 185 VMDHQGAVLLRWHSTSLPLFDQQGHALDVPAWLAQLRSRQCSELPVFLKDGT-------- 236 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 RL A+ + P+ A + ++ ++ GR + L A +++++TSLP + + Sbjct: 237 ALRLCALRVSPQAAQRERAKIRLSAKKNGRKPSCQCLCMADYIVVVTSLPSSCLDSRGIL 296 Query: 303 DCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 YRLRWQIELAFKRLKSLL+ + K+P +++W+ A LL LI+ + S F P Sbjct: 297 QLYRLRWQIELAFKRLKSLLNTGHVPKKDPLSSRSWLQAKLLTCLLIEKSLLQSEVFSP 355 >UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017465B5 Length = 382 Score = 319 bits (817), Expect = 9e-86, Method: Composition-based stats. Identities = 109/361 (30%), Positives = 171/361 (47%), Gaps = 8/361 (2%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 +NW +L+ + PE ++ A+ GA+ R R ++LLR+ L + G SLR + + Sbjct: 7 ENWDYLLSLL--PENWESLAKTTGAVQRLRGAESLSSLLRVLLLHAGHGCSLRTASVVGK 64 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAV-RAAVTGCTSGKRLRLVDGTAISAPGG 124 ++SDVAL KR W L A A R + G +LRLVDGT I PG Sbjct: 65 AAGWISMSDVALHKRFALCEGWLQQLCAGLFAQSRLQLPAAYRGLKLRLVDGTTIKEPGA 124 Query: 125 GSAEWRLHMGY---DPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIR 181 ++WR+H D H F + S + E L F + +ADRGF S I Sbjct: 125 TGSQWRIHYSLRVPDWHCDFFRLNPVRGSGNGESLKHFEVAPGDCFLADRGF-SHLLGIE 183 Query: 182 SLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL-DCGKNGETTVMIGNSGNKKAGA 240 + G A I+R++ + +G ++ +LR L G + + Sbjct: 184 HVYRGGAHVIMRLNEQNTPLEDEQGRPVVLLPWLRKLKQPGAAAGLDLWVRPRKEDSLEK 243 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 P RL AV E A +++ ++ ++ ++A TLE +++LT++P D S + Sbjct: 244 RVPVRLCAVRKSVEAAALAQRKVQRRAQQDQTKLRAATLEHTAWIVVLTTVPRDTLSDVE 303 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 V YR+RWQIELAFKRLKSL + L + ++AW++A LL A L + + + + Sbjct: 304 VLQWYRVRWQIELAFKRLKSLGDVGHLPKSDERSSRAWVYAKLLIALLSEKMQRHAAALS 363 Query: 361 P 361 P Sbjct: 364 P 364 >UniRef50_Q1Q5J6 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q5J6_9BACT Length = 367 Score = 312 bits (798), Expect = 2e-83, Method: Composition-based stats. Identities = 101/360 (28%), Positives = 162/360 (45%), Gaps = 10/360 (2%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 ++W + P + + A + AL R+ + LLR L + G SLRE A+ Sbjct: 4 EDWDLLRTFF--PNDWKSLAVDTNALKGLRKDKSEEKLLRTLLIHLGCGYSLRETVVRAK 61 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGG 125 ++A LSDVALLKRL+ + +W L R + LRL D T + PG Sbjct: 62 RANLADLSDVALLKRLKKSKEWLYKLCLSLFRERGLQINKRNNFHLRLFDATTVKEPGKT 121 Query: 126 SAEWRLHMGYDPHTCQFTDFEL---TDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS 182 + WR+H + + F+L E +F D+ IADRG+ + + I Sbjct: 122 GSLWRIHYSIEVPSLSCDFFKLTGTEGEGTGESFRQFPMKKDDYIIADRGYCT-GQGIHH 180 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE-TTVMIGNSGNKKAGAP 241 A VRV+ + LR E F ++ ++ L + V I N N + Sbjct: 181 ATRKGAYLSVRVNSQSLRIFGEEKKPFPLLKEIQYLKRPLAIKSWNVFIPNVDNTEY--- 237 Query: 242 FPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQV 301 L + E I+ +L +KG ++ ETL A +V++ T+ PE++++A + Sbjct: 238 VKGSLCIIRKTEEAIKIAHKKLKRHASKKGIELKPETLIYAKYVIVFTTFPENQFTAFDI 297 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 + YR+RWQIEL FKR K + L + + +KAW++ L A L + +I + F P Sbjct: 298 LEWYRVRWQIELVFKRFKQIAQFGHLPKYDDDSSKAWLYGKLFVALLTEKLIDFATSFSP 357 >UniRef50_A6DTQ2 Putative transposase insL for insertion sequence IS186 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTQ2_9BACT Length = 375 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 109/365 (29%), Positives = 166/365 (45%), Gaps = 20/365 (5%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREI---RDAATLLRLGLAYGPGGMSLREVTAW 63 +W + P+ D G L R+ + LLR L + G +SLR A Sbjct: 10 DWDYFKTFL--PDGWDGMMAETGMLKFGRKFSGEDGPSKLLRTLLIHLGGNLSLRSTCAL 67 Query: 64 AQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVT--GCTSG--KRLRLVDGTAI 119 A+ ++ +SDVALLKRL+ +++WF Q L G R VDG+ + Sbjct: 68 AKEGNIIDVSDVALLKRLQKSSEWFNWCTTQLLDKMKPKNPQGLPEQEEYNFRYVDGSIV 127 Query: 120 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC 179 PG + W LH + T + +TD + E L ++ +++ I DR + R Sbjct: 128 REPGATGSTWMLHYSMNAKTLAPDEITITDQKKGESLKNYSVKPNDVFIGDRVYPRRNGI 187 Query: 180 IRSLAFGEADYIVRVHWRGLRWLTA-EGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKA 238 I + G YI+ L L G F ++ LR L G GE V+I K Sbjct: 188 IHVHSNGG--YILCRFPPSLTPLHNDNGTPFKLLSKLRKLKLGDIGEYNVVI-----KHN 240 Query: 239 GAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV--VQAETLEAAGHVLLLTSLPEDEY 296 AR+ A+ E L ++ + + + R + ETLE AG++L+LT+L + Sbjct: 241 EGQINARVCAMKKDHESTLKAQKAIHRKASKNSRKGSTRPETLEYAGYILILTTL-AESV 299 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 S E++ + YR RWQIEL FKRLKS++ L K ++W+ +L A LI+ II+ Sbjct: 300 SPEKILNIYRSRWQIELLFKRLKSIIGAAPLYKKNDIGMRSWLAGKILVATLIEYIIRCG 359 Query: 357 LDFPP 361 DF P Sbjct: 360 EDFFP 364 >UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001905F7C Length = 367 Score = 290 bits (742), Expect = 6e-77, Method: Composition-based stats. Identities = 129/335 (38%), Positives = 188/335 (56%), Gaps = 7/335 (2%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL 66 +W ++ +G EEL+ SAR AGAL R+R++ AA LLRL AY GG SLR + AWA Sbjct: 9 DWGELVERLGSAEELEASAREAGALLRKRQVGGAADLLRLCFAYVLGGFSLRTLAAWADQ 68 Query: 67 HDVATLSDVALLKRLRNAADWFGILAAQTLAVRAA--VTGCTSGKRLRLVDGTAISAPGG 124 +A++SDVA+LKRL+ +ADW G L ++ LA R G S RL VD T ++ PG Sbjct: 69 RGLASMSDVAMLKRLKASADWVGYLVSELLAERCPEAFAGVHSDLRLMAVDATVVAPPGP 128 Query: 125 GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLA 184 W +H +D + + E+TD R+AERL R A E+RIADR + + ++ Sbjct: 129 KRDYWMVHTVFDLSRLKLSSVEVTDRREAERLSRG-VKAGELRIADRAHAKATD-LAAVV 186 Query: 185 FGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPA 244 AD++VR R L +G + + R + +V I + +K A Sbjct: 187 KAGADFLVRAPSNYPRLLDGDGQLLERLALCREAGDKGVLDRSVRIQDGKSKVE---VAA 243 Query: 245 RLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADC 304 R++ + LPPE A ++ + +E AG+++LLTSL D++ E++A Sbjct: 244 RVVILPLPPEAAAKARRAARRLAAKARYKPSEAGIEMAGYLVLLTSLNADDWPPERLAST 303 Query: 305 YRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 YRLRWQIELAFKR+KSL+ L+ LRAK+ +LA+ WI Sbjct: 304 YRLRWQIELAFKRMKSLIGLEGLRAKDADLARLWI 338 >UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW Length = 417 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 91/395 (23%), Positives = 162/395 (41%), Gaps = 46/395 (11%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM---SL 57 M W + + PE L A+ G + R+R + A L L A+G G + SL Sbjct: 1 MENQMQAWMKTIRQLFSPETLTHLAQETGFIQRKRAL-TAEAFLTLC-AWGDGSLAQQSL 58 Query: 58 REVTAWAQLHDVATLSDVALLKRLRNAAD------WFGILAAQTLAVRAAV-TGCTSGKR 110 + + L +LS L +R A +F +L Q + + + T T R Sbjct: 59 QRLCTSLTLRHDCSLSSEGLNQRFTERAVAFLREVFFLLLQRQPPLLWSTIQTYRTCFTR 118 Query: 111 LRLVDGTAISAPGGGSAEWR--------LHMGYDPHTCQFTDFELTDSRDAERLDRFAQT 162 LR++D T+ P ++R + YD + + D++ RFA Sbjct: 119 LRILDSTSFLVPADYGEDYRGSVSSGAKIQFEYDLLSGACLQLCAQSANDSD--ARFAYH 176 Query: 163 ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGK 222 A + + CIR L F + + RG ++T +R DM +++ K Sbjct: 177 AQHTILPN------DLCIRDLGFFSVAALTEIDARGAYYITR--LRSDMKVYIKENSQWK 228 Query: 223 NGETTVM-------------IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 + + G+++ P RLI L E+ + +R Sbjct: 229 EWDWESLGNQLKEGESVEMEHVYIGHERLYIP---RLIFRRLTEEEWQKRMAYVRKREKR 285 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 KG+ + +TLE + +LLT+LP++ + +QV + Y LRWQIEL FK KS+ L+ ++ Sbjct: 286 KGKALTRQTLEQKKYHILLTNLPQESFDGQQVYELYSLRWQIELLFKAWKSVFDLEKVKK 345 Query: 330 KEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSA 364 + E + ++ L+A + + + + ++A Sbjct: 346 MKKERFECHVYGTLIAILVTQTFLFQARTYWQQTA 380 >UniRef50_P12249 Transposase for insertion sequence element IS231A n=411 Tax=Bacillus RepID=T231A_BACTB Length = 478 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 82/394 (20%), Positives = 149/394 (37%), Gaps = 49/394 (12%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 + +S L P L+ A+ G + R+R+ + L + + S V Sbjct: 5 IQDELQLFSEELCRHLTPSFLEELAKKLGFVKRKRKF-SGSELATICIWISQRTASDSLV 63 Query: 61 TAWAQLHDVA--TLSDVALLKRLRNAAD-----WFGILAAQTLAVRAAV--TGCTSGKRL 111 +QLH +S L KR A F IL L +A+ T T +R+ Sbjct: 64 RLCSQLHAATGTLMSPEGLNKRFDKKAVEFLKYIFSILWKGKLCKTSAISSTALTHFQRI 123 Query: 112 RLVDGTAISAP--------GGGS----AEWRLHMGYDPHTCQFTDFELTDSRDAERL--- 156 R++D T P G G A ++ + YD H+ QF +F++ ++ ++ Sbjct: 124 RILDATIFQIPKHLASIYPGSGGCAQTAGIKIQLEYDLHSGQFLNFQVGPGKNNDKTFGT 183 Query: 157 -DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL------------- 202 ++ I D G+ S E + + A YI R+ ++ Sbjct: 184 ECLDTLRPGDLCIRDLGYFS-LEDLDQMDQRGAYYISRLKLNHTVYIKNPSPEYFRNGTV 242 Query: 203 --TAEGMRFDMMGFLRGLDCGKNGET-TVMIGNSGNKKAGAPFPARLIAVSLPPEKALIS 259 ++ ++ D+ + L G+ E IG R+I L ++ Sbjct: 243 KKQSQYIQVDLEHIMNHLKPGQTYEIKEAYIGK------NQKLFTRVIIYRLTEKQIQER 296 Query: 260 KTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK 319 + + +KG ++ G + +++ PE EQ+ D Y LRWQIE+ FK K Sbjct: 297 RKKQAYTESKKGITFSEKSKRLTGINIYVSNTPEGIVPMEQIHDFYSLRWQIEIIFKTWK 356 Query: 320 SLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 SL + + + E + ++ L+A F+ + Sbjct: 357 SLFQIHHWQNIKQERLECHVYGRLIAIFICSSTM 390 >UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geobacillus kaustophilus RepID=Q5L3A2_GEOKA Length = 453 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. Identities = 83/372 (22%), Positives = 148/372 (39%), Gaps = 47/372 (12%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIR--DAATLLRLGLAYGPGGMSLREVTAWAQLHDV 69 L + EEL+ AR+ + R+ ++R D L L G G SL ++ + L Sbjct: 18 LRSVLSCEELEHMARDHQFIQRKGKLRAHDFVALCTF-LQEGGGQKSLVQLCSALALKQN 76 Query: 70 ATLSDVALLKRLRNAADWF------GILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP- 122 +LS L +R A F +L QT R + R+R++D T+ P Sbjct: 77 TSLSAEGLNQRFHEKAVSFLKAVFEKLLIHQTQEARRLCPRHSLFLRIRILDSTSFQLPP 136 Query: 123 -------GGGSAEWRLHMGYDPHTCQFTDFELTDSR--DAERLDRFAQTA--DEIRIADR 171 G ++ + Y+ + ++ D+R DA T ++ + D Sbjct: 137 EIQGIYEGCTGPGVKIQLEYEWLEGKVLHVDVEDARHHDAAYGASLLSTIQEGDLCLKDL 196 Query: 172 GFGSRPECIRSLAFGEADYIVRVHWR-GLRWLTAE-GMRFDMMGFLRGLDCGKNGETTVM 229 G+ S E ++++ A YI R+ G+ + +++ FL L G+ E Sbjct: 197 GYFS-LEGLQAIHDAGAFYISRLKHNVGIYQKEGDRFRKWEPEDFLAVLQPGETMELE-- 253 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV---- 285 KK P RLI L E+ E +++G+ Q + A +V Sbjct: 254 HAYVSGKKVHQP---RLIVYRLTEEQ----------ERQKEGQWKQKAKQKGAAYVTRRP 300 Query: 286 ----LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFA 341 + +T++P S ++ Y LRWQIE+ FK KSL H+ + + + ++ Sbjct: 301 HPIYVYITNIPAIYTSLHEIHTLYSLRWQIEVVFKTWKSLFHIHRFKPMKGARFQCHLYG 360 Query: 342 NLLAAFLIDDII 353 L+A + ++ Sbjct: 361 TLIALLISSTVM 372 >UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipelotrichaceae RepID=B7C7E2_9FIRM Length = 446 Score = 229 bits (583), Expect = 2e-58, Method: Composition-based stats. Identities = 76/345 (22%), Positives = 130/345 (37%), Gaps = 41/345 (11%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 TR+R+ +TL+ + L G G + + D T+S + R + D F I Sbjct: 47 FTRKRKHLFGSTLMNVLLLEG-GSLKDELYKLFGYNLDTPTVSSF-IQARDKIKPDTFHI 104 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDG-------------TAISAPGGGS---AEWRLHMG 134 L R +G RL VDG T I + + L+ Sbjct: 105 LF-NLFNGRTRKPKLYNGYRLLAVDGSTLPITSEIKDKKTTIQKANNSDKPFSAFHLNTS 163 Query: 135 YDPHTCQFTDFE-----LTDSRDA--ERLDRFAQTADEIRIADRGFGSRPECIRSLAFGE 187 YD + D + D RDA + ++R+ + I IADRG+ S + Sbjct: 164 YDILEYTYDDVILQGQAVQDERDALNKMVERY-KGDKAIFIADRGYES-INSFEKIHLSG 221 Query: 188 ADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 Y+VRV G LR + E +++ + K A Sbjct: 222 NKYLVRV------------KDIHSTGMLRSFGPFLDDEFDLIVKRTLTTKQTNEIKAHPE 269 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 P+ + RVV+ + E + ++T+L ++E+S + + + Y L Sbjct: 270 IYKFVPQNQRFDYFEDAPFYDFECRVVRFKITEDT-YECIVTNLDKNEFSMQDIKELYHL 328 Query: 308 RWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 RW+IE +++ LK L L+ L +K+ L + I+A ++ I Sbjct: 329 RWEIETSYRELKYDLDLNTLHSKKRNLIEQEIYAKMILYNFCSRI 373 >UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AZS8_HERA2 Length = 442 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 87/356 (24%), Positives = 140/356 (39%), Gaps = 39/356 (10%) Query: 29 GALTRR-REIRDAATLLRLGL--AYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAA 85 G + R R +TL++ + SL ++ A AT+S A+ +R A Sbjct: 27 GYVKRADRATFTPSTLVQTLVYGWLANPTASLGQLAQMAARVG-ATVSPQAIDRRFTLA- 84 Query: 86 DWFGILAAQTLAVRA--------AVTGCTSGKRLRLVDGTAISAPGGGSAEWR------- 130 +L LA AV+ +R+ D T I P + +R Sbjct: 85 -TVDLLHHVLLASMEYAISADPVAVSILQRFTSVRIHDSTTIGLPDALATTYRGCGNASA 143 Query: 131 -------LHMGYDPHTCQFTDFELTDSRDAE---RLDRFAQTADEIRIADRGFGSRPECI 180 + D T +LTD R ++ + R A +R+AD GF + Sbjct: 144 RGTAGLKCGVQLDLLTGTLCGIDLTDGRASDQVLSVQRAPLPAGSLRLADLGFYN-IRIF 202 Query: 181 RSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA 240 R LA E ++ RV L + + ++ + GL + E TV++G+ Sbjct: 203 RELAAAEVYWLSRVQSHSRIRLPGQKEQ-SILEVVTGLGDADHWEGTVLVGSKER----- 256 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 ARL+ +P A + R+ E K R V ++ A +++T+ PED+ + Sbjct: 257 -LAARLLVQRVPDAVAAQRRQRVQDEAHDKCRPVSNAAMDLAAWTVVITNAPEDKLGLTE 315 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 ++RWQIEL FK KS H+D R K+P I+A LL I+ S Sbjct: 316 AMVLLKMRWQIELLFKLWKSHGHVDEWRTKKPARILCEIYAKLLGLVFQQWILVAS 371 >UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1E5_CLOB8 Length = 460 Score = 200 bits (508), Expect = 7e-50, Method: Composition-based stats. Identities = 65/378 (17%), Positives = 143/378 (37%), Gaps = 37/378 (9%) Query: 4 SHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGL-AYGPGGMSLREVTA 62 S D I+ + + +A G R ++ Y SLR + Sbjct: 10 SMDKIKKIIN-LFSKRLITKTAVTTGFTQRNSKLDGFTFFKAFTFGVYSLENPSLRNIAN 68 Query: 63 WAQLHD-VATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTS-------GKRLRLV 114 + + + +S A+ +L+ +++ + + + + + +++ Sbjct: 69 FCEDINPNLKVSRQAIENKLKAGSNFLKTILTNIIEDKIIKSIKHNHIEIFKAFNDIKIC 128 Query: 115 DGTAISA------------PGGGSAEWRLHMGYDPHTCQFTDFELTDS--RDAERLDRFA 160 D + I ++E ++ Y + Q FE D D + A Sbjct: 129 DSSLIKLNDSLRDSYKGFSEDKSASEMKIQTVYSFKSKQIETFEFEDGTTNDNSYMKTLA 188 Query: 161 QTAD--EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGL----RWLTAEGMRFDMMGF 214 + EI + D G+ + +C + L A ++ ++ + + + +M+ F Sbjct: 189 DKINTNEILLVDLGYFDK-KCFKMLEKKSAFFLSKIKYNTALYKENYKKGNFEKVEMIDF 247 Query: 215 LRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV 274 L+ +T + +G N + R+I LP E + R + + +GR Sbjct: 248 LK--KSSGVIDTYLYVGMKQNNREE----FRVIGKRLPEEIVNLRIRRAREKAKAQGRAP 301 Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 + E V+++T++ +++ + + D YRLRWQIEL FK KS +D +++ + Sbjct: 302 KKIDKELMSWVIMITNIEKEQADVDMLLDIYRLRWQIELLFKCWKSYGKIDHVKSAGIDY 361 Query: 335 AKAWIFANLLAAFLIDDI 352 ++ L+ LI+ + Sbjct: 362 LNCLLYGRLIITLLINTV 379 >UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TN04_ALKMQ Length = 454 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 69/360 (19%), Positives = 138/360 (38%), Gaps = 36/360 (10%) Query: 14 HIGKPEELDTSARNAGALTRRREI--RDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT 71 + ++ A G L R++ + + GL + + T Sbjct: 13 RLFDNNKIMEIAIGTGLLKRQKGMLPDTILKVFTFGLLNIANPSLNQIASKCQAFQPGLT 72 Query: 72 LSDVALLKRLRNAADWFGILAAQTLAVR-------AAVTGCTSGKRLRLVDGTAISAPG- 123 +S A+ KRL+ ++ + + K +++ D T I+ P Sbjct: 73 ISKEAVYKRLKKSSLFLQETFKHMMQKSMNSVIPVKTAAILEQFKDVKICDSTKITLPDK 132 Query: 124 -----------GGSAEWRLHMGYDPHTCQFTDFELT--DSRDAERLDRFA--QTADEIRI 168 + ++ Y +F+ E+T D D+ E+ I Sbjct: 133 LVALYPGLGGRNAKSSLKVQGIYSLIPARFSSLEITKAPGADTTYNDKLLAMVNPGELLI 192 Query: 169 ADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGM--RFDMMGFLRGLDCGKNGET 226 D G+ S+ L+ + Y+ R+ + ++ G + D+ L+ G +T Sbjct: 193 TDLGYFSKA-FFEKLSTKGSYYLTRIKKNSIVYVEKSGQLTKVDLTDLLK----GTVVDT 247 Query: 227 TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVL 286 V +G + K+ R +A+ LP + + + + + +G+ + A+ E + Sbjct: 248 EVFLGIAHKKQ----LKCRFVAIRLPEKVVNQRRRKANQQAKAQGKQLSAKETELLAWNI 303 Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 ++T++ +D+ S E D YR RWQIEL FK LKS L++D + + + I+ L+A Sbjct: 304 IVTNVTKDKLSPEAACDLYRARWQIELVFKSLKSYLNIDKIGSCGKYQLECLIYGRLIAV 363 >UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=Q73IB8_WOLPM Length = 442 Score = 190 bits (483), Expect = 6e-47, Method: Composition-based stats. Identities = 73/369 (19%), Positives = 140/369 (37%), Gaps = 44/369 (11%) Query: 19 EELDTSARNAGALTRRREIRDAATLLRLGLA-YGPGGMSLREVTAWAQLHDVATLSDVAL 77 E+ D + + R+R+++ ++ + + L G S+ + D ++ L Sbjct: 17 EKADKISITTRFIKRKRKLKGSSFVKAMVLGNIGVDNCSVETMCQLL-NEDSIDITKQGL 75 Query: 78 LKRLRNAADWF--------GILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEW 129 R A F +L L V + ++L+D + I+ P + Sbjct: 76 DFRFTEEAVEFMKRMYNESVLLFKNILQVDCKI--LQQFNSVKLLDSSYITLPNSMEEMY 133 Query: 130 R------------------LHMGYDPHTCQFTDFELTDSRDAERLDRFAQT---ADEIRI 168 + L + +D LT+ +++ R + ++++ I Sbjct: 134 KGYGTSYSGYESNTKSGIKLQLVFDYMNQIIDQLNLTEGVRSDQGYRKHLSNILSNDLLI 193 Query: 169 ADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTV 228 +D G+ P + + A +I R + + M L L+ E V Sbjct: 194 SDLGYF-VPSSFKQINEIGAYFISRYKSDTNIY---DVETNQKMELLECLEDKLFLENEV 249 Query: 229 MIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLL 288 ++G A R+I L E+++ + + R +G + + + Sbjct: 250 LLGK------EAKIRVRIICQKLTEEQSMARRRKANRLARSQGYTSSKRNQKLLNWSIFI 303 Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 T++PE++ SAEQV YR+RWQIEL FK KS + LD L+ K P ++A L A + Sbjct: 304 TNVPENKISAEQVLTIYRVRWQIELLFKLYKSHIRLDKLKGK-PCRVLCELYAKLCAILI 362 Query: 349 IDDIIQPSL 357 I+ + Sbjct: 363 FHGIVGCTE 371 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 171 bits (432), Expect = 4e-41, Method: Composition-based stats. Identities = 71/388 (18%), Positives = 141/388 (36%), Gaps = 39/388 (10%) Query: 1 MNYSHDNWSAILAHIGKPEE--LDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR 58 MNYS + +L+ I K + + +R+++ +++ L + Sbjct: 1 MNYSTEVKQKLLSIITKMDSYYWLFTKHPKTDFSRKKKW-SFEEVMKFMLTMEGKALRDE 59 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTA 118 + + + + S +R + + F L + +G RL DG+ Sbjct: 60 LLEYFEFDNTTPSNSSFN-QRRAQILPEAFEFLFQE-FTKSFTDNVTYNGLRLIACDGSD 117 Query: 119 IS--------------APGGGS-AEWRLHMGYDPHTCQFTDFELTDSRDA-------ERL 156 + P L+ YD + Q+TD + SR A E + Sbjct: 118 LCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDAIIQPSRLANERRAMCEMI 177 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 DR+ T+ I IADRG+ + + Y++RV +T+ G+ + L Sbjct: 178 DRYNDTS-AIFIADRGYENYN-IFAHVEHKGMYYLIRVKD-----ITSNGITSKLT-MLP 229 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 + N+ P R+I P + + K RV++ Sbjct: 230 ESGEFDEWVNVTLTKKQTNEVKANPKKYRVIDKKTPFDYLDLHFNNFYE---MKMRVIRF 286 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 + + ++T+LP+D+++++++ Y RW IE +F+ LK L L +K+PE Sbjct: 287 -PIPQGSYECIITNLPQDKFNSDEIKRLYAKRWGIETSFRELKYALGLTRFHSKKPEYIM 345 Query: 337 AWIFANLLAAFLIDDIIQPSLDFPPRSA 364 I++ + + I + + Sbjct: 346 QEIWSRMTLYNFCEIIATNVVINEKKGC 373 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 166 bits (421), Expect = 8e-40, Method: Composition-based stats. Identities = 67/340 (19%), Positives = 119/340 (35%), Gaps = 40/340 (11%) Query: 39 DAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAV 98 D T +++ L G + + + S + R + + F L Sbjct: 3 DFETTMKIVLCTGASPIKDELLKFNDFSITTPSASAF-VQARSKIKPEAFRTLFDG-FNK 60 Query: 99 RAAVTGCTSGKRLRLVDG-------------TAISAPG---GGSAEWRLHMGYDPHTCQF 142 + G RL +DG T + G + + L+ YD + Sbjct: 61 KTFKKKLYHGYRLLAIDGSELPIDNTIFDDETTVLRHGTLAKTFSAYHLNASYDLMERTY 120 Query: 143 TDFELTDSRD-------AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVH 195 D + + +DR+ I IADRG+ S + Y++RV Sbjct: 121 DDIIIQGEAKRDEHGAFCQLVDRY-DGQKAIFIADRGYESYN-GFEHVVHSGHKYLIRV- 177 Query: 196 WRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEK 255 R + ++ + L +GE V + K A P+ Sbjct: 178 ----RDIESQ------SSITKSLGPFPDGEFDVDVSRMLTLKQTKMIKACPDVYKFVPKN 227 Query: 256 ALISK-TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELA 314 + RVV+ + E + ++T+L +E+S E + + Y +RW E + Sbjct: 228 MRFDFMNKQNPWYEFNCRVVRLKITENT-YETVITNLSRNEFSMEDICEIYNMRWGEETS 286 Query: 315 FKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 F+ LK + L+AL AK+ EL + I+A +L I+Q Sbjct: 287 FRELKYAIGLNALHAKKRELIQQEIYARMLMYNFCQRIVQ 326 >UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepID=Q74P20_BACC1 Length = 460 Score = 163 bits (411), Expect = 1e-38, Method: Composition-based stats. Identities = 72/362 (19%), Positives = 135/362 (37%), Gaps = 40/362 (11%) Query: 16 GKPEELDTSARNAGALTRRREIRDAATLLRLG--LAYGPGGMSLREVTAWAQLHDVATLS 73 L A G + R+R+ R A L+ L L+ G SL + A LS Sbjct: 27 FSIHHLQLLAVKTGMIRRKRKCR-AQDLVSLCVFLSQAIGTESLVSLCAKLTRATGIQLS 85 Query: 74 DVALLKRLRNAAD-WFGILAAQTLAVR-AAVTGCTS-GKRLRLVDGTAISAPGGGSAEWR 130 L +R + L Q + + +T ++ R+R++D TA P ++ ++ Sbjct: 86 SQGLNERFNAQTVQFLKELFLQVFRKKFSPMTPLSNRFTRIRILDSTAFQLPAQYASSYK 145 Query: 131 ------------LHMGYDPHTCQFTDFELTD--SRDAERLDRFAQT--ADEIRIADRGFG 174 + + Y+ + +F + + D S D QT E+ + D G+ Sbjct: 146 GVGGGGSEAGVKIQLEYELISGEFLETAVRDGTSSDCRYGQERTQTLEPGELSLRDLGYF 205 Query: 175 SRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMR---FDMMGFLRGLDCGKNGET-TVMI 230 S + +A +A Y+ R+ W + +G + D+ + L G+ E + I Sbjct: 206 S-IYDLEKIADRKAFYVSRIRWNTQVYQKEKGGKWTLLDLEKLTKDLSEGQILELPEIYI 264 Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 G RL+ L + ++ + LL+T+ Sbjct: 265 G------LHQKHKTRLVIYRLTQTEWTKRLEHHKKAKKKMPKYASR-------INLLITN 311 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 + +V + Y LRWQIE+ FK KS+ + ++ + E + ++ L+ L+ Sbjct: 312 VSSKHLPHNEVYELYSLRWQIEIIFKTWKSIFKIHEVKPVKLERFQCHLYGQLIGLCLVA 371 Query: 351 DI 352 I Sbjct: 372 SI 373 >UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PXV1_9BACT Length = 449 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 63/366 (17%), Positives = 129/366 (35%), Gaps = 41/366 (11%) Query: 19 EELDTSARNAGALTRRREIRDAATLLRLGLA--YGPGGMSLREVTAWAQLHD-VATLSDV 75 + LD AR + R + L A + G +SL ++ + + ++ Sbjct: 12 DNLDRIARETCFVQRSTNKVSGRDFVELLSAGHFDSGIISLEGLSDVLREKSPESDITPQ 71 Query: 76 ALLKRLR--NAADWFGILAAQTLAVRA-------AVTGCTSGKRLRLVDGTAISA----- 121 AL K++ A + + + L D T I+ Sbjct: 72 ALSKKINSDKAVSFLERTFEAIYKEQVCPKLEKIPFVALEQFSNVYLQDSTQIALNEHLA 131 Query: 122 ---PGGGSAEWRLHMGYDPH---------TCQFTDFELTDSRDAERLDRFAQTADEIRIA 169 G G + + + D T D ++ ++ + ++ + Sbjct: 132 EEFKGTGGSASKSSVKIDLLYEAVHHILKEVSITKGTYPDQKNGAKVLK-HIGERDLLLR 190 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE--GMRFDMMGFLRG-LDCGKNGET 226 D G+ + + A Y+ R +L+A+ D++ +++ + + Sbjct: 191 DLGYFD-LSVLGDIEGKGAYYLSRFFKSTKVYLSADPGAEAIDLVSYVKKHIGNKGLADM 249 Query: 227 TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVL 286 V +G +RLIA P + + ++ G+ + E LE + Sbjct: 250 EVYLGEE-------RICSRLIAYRAPGHVINERRRKAKRAVQKSGKTLSREYLEWLDYSF 302 Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 +T++ + +S E V YR+RWQIEL FK+ K L +D +R E + ++ L+ Sbjct: 303 YITNVGAEIWSPEVVGTIYRIRWQIELVFKQWKQLFRMDVMRGTREERIRCLLYGRLIMI 362 Query: 347 FLIDDI 352 ++ I Sbjct: 363 CIVTRI 368 >UniRef50_Q05309 Transposase for insertion sequence element IS1151 n=16 Tax=Clostridium perfringens RepID=T1151_CLOPE Length = 473 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 73/394 (18%), Positives = 147/394 (37%), Gaps = 49/394 (12%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGP--GGMSLR 58 MN S + E++ A+++ + R+ I A L + YG L Sbjct: 1 MNKKLKYLSKSIKESFDINEINKIAKDSKFIQRKGSI-TAKDFLMFNVFYGSDICTAPLS 59 Query: 59 EVTAWAQLHDVATLSDVALLKRLRN-AADWFGILAAQTLAVR------AAVTGCTSGKRL 111 ++ A + L AL KR + ++ + + L + T T R+ Sbjct: 60 QLAAKYDMIFSKQLPKQALDKRFNKYSVEFMKEIFIKFLYSQNNTLTNLERTLRTYFDRV 119 Query: 112 RLVDGTAISAP--------GGGS----AEWRLHMGYDPHTCQFTDFELTDS--RDAERLD 157 + D + + P G G + ++ + Y+ T F + ++ D E L Sbjct: 120 IINDSISFTLPKEFKKKFPGSGGVASPSSIKVQLQYELLTGSFMNIDIFSGIKNDVEYLK 179 Query: 158 RFAQTADEIRI--ADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL------------- 202 + D + AD G+ + + ++ L +I +V ++ Sbjct: 180 TMKKYKDYKDLKLADLGYF-KIDYLKRLDKSGTAFISKVKSNTSLYIKNPSPEKYKVGTI 238 Query: 203 --TAEGMRFDMMGFLRGLDCGKNGE-TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALIS 259 ++E ++ D++ L G+ E T + IG+ +RLI L E Sbjct: 239 KKSSEYIKIDIIKLAEPLAAGETIELTDIYIGSKKE------LKSRLIITKLTEENKSKR 292 Query: 260 KTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK 319 + ++K + L+ +T++ + + QV + Y LRWQIE+ FK K Sbjct: 293 IFNHIEGIKKKRLTLNQRRLDFNSINAYITNVSSNIITMNQVHELYSLRWQIEIIFKVWK 352 Query: 320 SLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 S+ ++ ++ + E +++ L+A L I+ Sbjct: 353 SIFKINQVKKVKLERFMCFLYGRLIALLLSSTIV 386 >UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Tax=Gammaproteobacteria RepID=A6UXI0_PSEA7 Length = 423 Score = 144 bits (362), Expect = 7e-33, Method: Composition-based stats. Identities = 82/378 (21%), Positives = 137/378 (36%), Gaps = 63/378 (16%) Query: 9 SAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHD 68 I + + P + +N TRRR++ L L L P E+ + ++ + Sbjct: 5 QKITSPLDCPAFIAAHRQNPQDFTRRRQLTFKN--LVLFLLNQPRTALQTELDQFYRVLN 62 Query: 69 VAT-----LSDVALLK-RLRNAADWFGILAAQTLAVRAAVTGCTS---GKRLRLVDGTAI 119 A+ ++ A K R + + F L + L + G G R+ VDG+ + Sbjct: 63 QASTETQMVTAQAFCKARKKLNPEVFESL-NRLLQQQIDCFGLRQKWRGLRVLAVDGSTV 121 Query: 120 SAP--GGGSAEWRLHMGYDPHTCQFTDFELTD--------------SRDAERLDRFAQTA 163 P + + H G+ P T +E+ D RD L A Sbjct: 122 HLPLESTMATFFGSHSGF-PMARLSTLYEVADGQTLHSLIVPLTVGERDCAHLHLEHLPA 180 Query: 164 DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKN 223 D + + DRG+ L L A+ R FL L CG N Sbjct: 181 DSLTLFDRGYPGHW---------------------LFALFAQQQR----HFLMRLPCGYN 215 Query: 224 GETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAG 283 + + +G +L + P + S+ + ++ + R+++ E Sbjct: 216 AQVKAFL------HSGQVEDTQLFVANHPEARLFCSEAGVDPASQIELRLIRVELANGES 269 Query: 284 HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANL 343 VLL + L + + AE A+ Y RW IE F+RLK L LD + K A Sbjct: 270 EVLLTSLLDREAFPAEVFAELYHRRWGIETDFRRLKQTLTLDNFSGRSVTAVKQDFHAAQ 329 Query: 344 L---AAFLIDDIIQPSLD 358 L A L+ ++QP ++ Sbjct: 330 LLKNLALLMQHLLQPVIE 347 >UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID=Q64B41_9ARCH Length = 439 Score = 143 bits (361), Expect = 9e-33, Method: Composition-based stats. Identities = 72/385 (18%), Positives = 130/385 (33%), Gaps = 37/385 (9%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LREV 60 S D + + + L +AR G + R R+I L +G +S +R + Sbjct: 10 ENSVDPTVQTIVEMFPEDFLRNTARETGVVKRERKIDVVILFWVTTLGFGVRFLSTIRGL 69 Query: 61 TAWAQLHDVATLSDVALLKRLR-NAADWFGILAAQTLAVRAAVTG------CTSGKRLRL 113 + TLS + R A++ +A +A TG K L + Sbjct: 70 KRKYEEKAKTTLSISSFHDRFTPEMAEFLRKCVLHAIAFQAQQTGRVLDDKLKRFKDLVI 129 Query: 114 VDGTAISA--------PGGG----SAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRF 159 D T I P +A ++ + R +E L Sbjct: 130 QDSTIIRLHESLAKIWPAARTKKIAAGVKVSCIVSAVADSPKSVRIYPERTSEAKILRLG 189 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 D I + D G+ + ++ R+ + ++ Sbjct: 190 PWLRDRILLIDLGYFKYL-FFDRIDGYGGYFVSRLKGNANP-------------LIVRVN 235 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 G + ++G ++ V + E S +R+ R+V A Sbjct: 236 RKCRGNSVDVVGKKLRDVL-PRLKREILDVEVEVEFKRRKYKGKQSTVKRRFRMVCAFNS 294 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 ++ + LT++ D SAE++A Y RW+IEL FK LKS +D + + P + K I Sbjct: 295 DSGKYHSYLTNIRVDILSAEEIALLYGARWEIELIFKELKSHYRMDQIPSANPNIVKCLI 354 Query: 340 FANLLAAFLIDDIIQPSLDFPPRSA 364 + +L I++ + P +A Sbjct: 355 WIAILTLMCSRRILRLIRNANPENA 379 >UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_METBF Length = 435 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 65/381 (17%), Positives = 122/381 (32%), Gaps = 45/381 (11%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVA- 70 L + E L +A+ G + R R+I L L++G L+ A + Sbjct: 13 LREMFPEEWLRQTAKETGLIVRERKIDPVIIFWVLTLSFGVR---LQRTLASLKREYETE 69 Query: 71 ---TLSDVALLKRLR-NAADWFGILAAQTLAVRAAVTG------CTSGKRLRLVDGTAIS 120 T+SD + R ++ + A G + + + + D T + Sbjct: 70 SQKTISDSSWYYRFTPELVEFLHQCVIHGMEELAKEPGRKLSKKLETFQDVVIQDSTIVR 129 Query: 121 A--------PGGG----SAEWRLHMGYDPHTCQFTDFELTDSRDAE--RLDRFAQTADEI 166 P +A ++ + L + AE L D I Sbjct: 130 LHSSLADRFPAARSRTVAAGVKVGVMVSAIANGPRTIALYSEKTAEIKTLKIGPWIKDHI 189 Query: 167 RIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGET 226 + D GF + + + ++ R+ L + L + Sbjct: 190 LLVDLGFY-KTQMFARVEENGGYFVSRIRKNMDPIL------VSIEEELSKTKSKEFAGK 242 Query: 227 TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVL 286 V ++ ++ R+V E + + Sbjct: 243 PVSECIKQLSGKDIDAVVKIEFKR-------REYKGKQKQDEMIVRLVAVYNDEDEKYHI 295 Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 +T++ +D +A+ +A+ Y RW IEL FK LKS LD L K ++ +A I+ +L Sbjct: 296 YITNIQKDILNAKDIANLYGARWDIELLFKELKSKYSLDVLETKNVQVIEALIWTAILTL 355 Query: 347 FLIDDI---IQPSLDFPPRSA 364 + I ++ S P + A Sbjct: 356 IVSRRIYSLVRKSTTHPEKMA 376 >UniRef50_A7C1C1 IS231-related transposase n=6 Tax=Beggiatoa sp. PS RepID=A7C1C1_9GAMM Length = 445 Score = 130 bits (326), Expect = 9e-29, Method: Composition-based stats. Identities = 68/360 (18%), Positives = 118/360 (32%), Gaps = 36/360 (10%) Query: 22 DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRL 81 D G + R+R+ + + L + + E A + + +S L KR Sbjct: 20 DALGETTGFIKRKRKFTGSTFIKTLVFGWMQTPQATLEELVQAGVLNDIEISAQGLDKRF 79 Query: 82 R-NAADWFGILAAQTLAVRAAVTG------CTSGKRLRLVDGTAISAPG----------- 123 +AD + Q +A + L D T ++ P Sbjct: 80 TPKSADLARAVLEQAVAEAVRAPNAVPIELLNRFSSVTLFDTTILNLPDELYQVWAGTGG 139 Query: 124 ---GGSAEWRLHMGYDPHTCQFTDFELTDSR---DAERLDRFAQTADEIRIADRGFGSRP 177 + + +GYD T Q L + +A +L + ++IAD G+ S Sbjct: 140 NGPTSRSALKGEIGYDLKTGQLIGPLLLPGKTHDNAGKLPQMELEECSLQIADLGYFSIA 199 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMM--GFLRGLDCGKNGETTVMIGNSGN 235 + + + R+ + FD+ + E V++ Sbjct: 200 KMAENF-DANVFCLSRLRHD-AVLFDEQEEEFDLSLYTLFMKKNNRLRAELNVLL----- 252 Query: 236 KKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA--GHVLLLTSLPE 293 P RL +P + + + +K + A + LL+T+ P Sbjct: 253 -VRYEKLPVRLFIERVPEMISSKRRRQANKGASKKKKGKTASKKSLSLCDFTLLVTTAPS 311 Query: 294 DEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 + S ++ Y RWQIEL FK KS LD P +I+ LLA + II Sbjct: 312 VQLSFDEALVLYGARWQIELLFKLWKSHAKLDTSIRPNPWRICRYIYIKLLACLVQHWII 371 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. Identities = 62/365 (16%), Positives = 124/365 (33%), Gaps = 45/365 (12%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 +R R+I + T + + + G ++ + + + T+S +R + + F Sbjct: 34 FSRNRKI-NFKTCVGITMNSGGCTLNKELLDFFDFDVNAPTVSAYT-QQRAKILPEAFEY 91 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP---------------GGGSAEWRLHMGY 135 L A G +L DG+ ++ G L+ Y Sbjct: 92 LFHAFTEENAQTKNLYEGYQLLACDGSNLTIAPNLNDPETLWKSNQLGATGNHLHLNALY 151 Query: 136 DPHTCQFTDFELTDS------RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEAD 189 D + D + + R ++ I IADRG+ + ++ G Sbjct: 152 DVLNRTYIDALVQTASTYQEHRACIQMIERVTLDKVILIADRGYENYNIMSHAIEKGWKF 211 Query: 190 YI--VRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 I VH G+ A G+ + + +++ + K Sbjct: 212 LIRIKDVHSNGI----ASGLELPQTAVF-------DMDINLILTRNQTKSKKQA------ 254 Query: 248 AVSLPPEKALISKTRLLSENR--RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCY 305 P + S+ R+ + + + + ++T+L +SAE++ + Y Sbjct: 255 GYKFMPTVQTFDYLPIGSKEDYPISFRIARFKIADD-SYETVITNLDRFCFSAEKLKELY 313 Query: 306 RLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAG 365 LRW IE +F+ LK + L + AK+ + K IFA L + I ++ + Sbjct: 314 HLRWGIETSFRELKYAIGLTSFHAKKVDYIKQEIFARLALYNYCELITTYVVEHTENISK 373 Query: 366 SEKKN 370 + N Sbjct: 374 KNQVN 378 >UniRef50_C3BTW8 Transposase for insertion sequence element IS231B n=13 Tax=Bacillus RepID=C3BTW8_9BACI Length = 387 Score = 128 bits (320), Expect = 4e-28, Method: Composition-based stats. Identities = 29/114 (25%), Positives = 49/114 (42%) Query: 242 FPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQV 301 P R+I L E+ ++KG A + +G + +T+ P D Q+ Sbjct: 190 VPTRVIVHRLTKEQQQKRLQDQTVREKKKGMKYSARSKRLSGINVYMTNTPTDIVPMGQL 249 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 D Y LRWQIE+ FK KS ++ + + E + ++ L+A L + Sbjct: 250 HDWYSLRWQIEILFKTWKSFFYIHHCKKIKRERLECHLYGQLIAILLCSSTMFQ 303 Score = 78.3 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 52/342 (15%), Positives = 117/342 (34%), Gaps = 43/342 (12%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGP--GGMSLR 58 ++ + L P L AR+ G + R + + A L+ L + SL Sbjct: 9 VSDELQLFGQELQSFLSPHILRDLARDVGFVQRTSKYQ-AKDLVALCVWMSQNVATTSLT 67 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAAD-WFGILAAQTLAVRAAV------TGCTSGKRL 111 ++++ + +S L +R +A + + A+ L + + KR+ Sbjct: 68 QLSSCLEASTEVLISPEGLNQRFNKSAVQFLQHILAELLNQKLTSSMPISSPYTSVFKRI 127 Query: 112 RLVDGTAISA--------PGGGS----AEWRLHMGYDPHTCQFTDFELTDSRDAER---- 155 R++D TA PG G A ++ + YD + QF + +R Sbjct: 128 RILDSTAFQLPDPFSFVYPGAGGCSHTAGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGS 187 Query: 156 --------LDRFAQTADEIRIADRGFGSRPECI----RSLAFGEADYIVRVHWRGLRWLT 203 + R + + R+ D+ + + + RS + + + + Sbjct: 188 LCVPTRVIVHRLTKEQQQKRLQDQTVREKKKGMKYSARSKRLSGINVYMTNTPTDIVPMG 247 Query: 204 AEGMRFDMMGFLRGLDCGKNGETTVMIGN---SGNKKAGAPFPARLIAVSLPPEKALISK 260 + + + L K ++ I + ++ +LIA+ L + Sbjct: 248 QLHDWYSLRWQIEIL--FKTWKSFFYIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMR 305 Query: 261 TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 LL + +R+ +A + +LL ++ +D ++ Sbjct: 306 QLLLMKKKRELSEYKAIYMIKDYFLLLFQAIQKDTQELSKIL 347 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 127 bits (319), Expect = 6e-28, Method: Composition-based stats. Identities = 81/375 (21%), Positives = 127/375 (33%), Gaps = 45/375 (12%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL----H 67 + +EL AR + R R+ A L + G S R + A+ + Sbjct: 16 IQRAFPSDELRERARATNLVERERKFDIVALFYTLSFGFAAG--SDRSLQAFLERYVEMA 73 Query: 68 DVATLSDVALLKRLRNAADWFGIL-------AAQTLAVRAAVTG-CTSGKRLRLVDGTAI 119 D LS A + +L RA ++G + + + D T + Sbjct: 74 DCDDLSYAAFHDWF--EPGFVALLREILDDAIENLDTGRADLSGRLERFRDVLIADATIV 131 Query: 120 SA----------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIR 167 S G AE +LH+ T T F TD ER L AD + Sbjct: 132 SLYQDAADVYAATGEDQAELKLHLIESLSTGLPTRFRTTDGTTHERSQLPTGEWVADALI 191 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 + D GF R + ++ RV A + + RG GE+ Sbjct: 192 LLDLGFYDFWLFDR-IDQNGGWFVSRVKDN------ANFEIVEELRTWRGNSIPLEGES- 243 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 + + R+ + K + R R+V E + L Sbjct: 244 --LQAVLDDLQRQEIDVRITL-------SFERKRGSGASATRTFRLVGLRNEETEEYHLY 294 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 LT+L D+YSA +A YR RW++EL FK LKS LD + + + +A I ++ Sbjct: 295 LTNLGNDDYSAPDIAQLYRARWEVELLFKELKSRFGLDEINTTDAYIIEALIIMAAISLM 354 Query: 348 LIDDIIQPSLDFPPR 362 + I+ R Sbjct: 355 MSRVIVDELRSLEAR 369 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. Identities = 57/327 (17%), Positives = 101/327 (30%), Gaps = 38/327 (11%) Query: 51 GPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKR 110 G +S + AT S + +R + + +L + + + R Sbjct: 2 GGNSLSKELYDWLGYSSETATASAF-VQQRDKIRPEALKLLFHEFTRLTVSENSL-QDYR 59 Query: 111 LRLVDGTAISAPGGGSAEW---------------RLHMGYDPHTCQFTDFELTDSRDAER 155 L VDG+ + P + L YD + D + + Sbjct: 60 LLAVDGSDLRLPSNSKDGFSSIRNSEDSKNYNLVHLDAMYDLMGKVYVDASVQSKKGMNE 119 Query: 156 LDRFAQTADE-------IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMR 208 D+ I I DRG+ S I YI+R R Sbjct: 120 HKALVSMVDQSEINGNVIAIMDRGYESFN-NIAHFQEKSWYYIIRAKESYGII-----SR 173 Query: 209 FDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS--LPPEKALISKTRLLSE 266 + + + E + + K+ A P K + Sbjct: 174 LSLPDY-----PEYDEEIMLTLTRRQTKETLPLLKAYPHRYRWIQPHTTFDFIKPKDSKF 228 Query: 267 NRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA 326 R V+ + + + T+L +++ E++ Y LRW IE +FK LK + L + Sbjct: 229 YDLHFRAVRFAIADGV-YETVYTNLNAEDFPPEKLKQLYNLRWGIETSFKELKYAVGLAS 287 Query: 327 LRAKEPELAKAWIFANLLAAFLIDDII 353 L +K+ + IFA L+ I+ Sbjct: 288 LHSKKKDFILQEIFARLILYNYSSIIM 314 >UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales RepID=A2RJ55_LACLM Length = 439 Score = 124 bits (312), Expect = 4e-27, Method: Composition-based stats. Identities = 66/357 (18%), Positives = 121/357 (33%), Gaps = 46/357 (12%) Query: 22 DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRL 81 + A++ +R R++ T +++ L++G +S ++ + T S + + R Sbjct: 33 EIYAQSPFDFSRNRKL-SFETTIKIILSFGGQSLSSELLSHFNFTLKTPTASAL-VQARS 90 Query: 82 RNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP---------------GGGS 126 + F L +T+ G R+ DG+ ++ P G Sbjct: 91 KIKLKAFEQLFYRTIPSAQP-NKLYKGYRIFAHDGSDLNIPYNEKESDTHYRVGKFGKHV 149 Query: 127 AEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADE-------IRIADRGFGSRPEC 179 L+ YDP + + + Q D+ I IADRG+ S Sbjct: 150 GSLHLNALYDPLNKHYVAVDFQKIKQLNERKSLCQIVDDFDFTSPTIIIADRGYESFN-V 208 Query: 180 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 239 + +++R G L GLD +G I ++ Sbjct: 209 YEHIKKSGQKFLIRAKDTKSN------------GLLNGLDLPSDGTFDKKITLQLTRRQT 256 Query: 240 APFP----ARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDE 295 + + I + RVV+ + E + L+T+L Sbjct: 257 NKVKKDKHYHFLHKRANFDYLPIRSKETYPIS---LRVVRIKLNEDT-YESLVTNLDPFL 312 Query: 296 YSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 +++E + Y LRW IE +F+ LK L L +K+ + IFA L+ I Sbjct: 313 FTSEDLKVLYHLRWGIETSFRELKYALGLSHFHSKKLDFIIQEIFARLIMYNFSMTI 369 >UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae RepID=B0R9A9_HALS3 Length = 424 Score = 122 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 61/343 (17%), Positives = 118/343 (34%), Gaps = 33/343 (9%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPG-GMSLREVTAWAQLHDVA 70 L + + L+ A G + R +++ + L + G +L Sbjct: 4 LTTLFPSKFLEEHAEELGVVEREGKLQIPVLVWALVFGFAAGESRTLAGFRRCYNSTADE 63 Query: 71 TLSDVALLKRLRNA-ADWFGILAAQTLAV----RAAVTGCTSGKRLRLVDGTA------- 118 T+S RL A++ L L + + + DGT Sbjct: 64 TISPGGFYHRLTPTLAEYLRDLVEHGLDEVAVPDTVDADIDRFRDVMIADGTVLRLHEFL 123 Query: 119 ---ISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSR--DAERLDRFAQTADEIRIADRGF 173 A A +LH+ ++ ++TD + D+ + + + + DR + Sbjct: 124 SDEFQARHEEQAGAKLHLLHNATDETIERIDVTDEKTHDSTLFKTGSWLQERLVLFDRAY 183 Query: 174 GSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNS 233 + + ++ R+ +T E + RG G+ I + Sbjct: 184 FKYRR-FALIDENDGYFVSRLKENANPLITEELREW------RGRAIPLEGKQ---IHDV 233 Query: 234 GNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPE 293 + + + A E S + ++ RVV +A + L +T+LP Sbjct: 234 VDDISRKYIDVEVEA-----EFKRGQYEGTRSLDTKRFRVVGVRDSDADDYHLYITNLPR 288 Query: 294 DEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 DE+ E +A YR RW++E F+ LK+ LD +P++ K Sbjct: 289 DEFFPEDLATLYRCRWEVETLFRELKTQYELDEFNTSDPDVVK 331 >UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TD95_HELMI Length = 441 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 63/379 (16%), Positives = 132/379 (34%), Gaps = 41/379 (10%) Query: 10 AILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYG---PGGMSLREVTAWAQL 66 L+ + E + A G + R R+I L L L +G + + Q Sbjct: 13 EALSKLFPKEWVSEVAAETGFVKRERKISPVVFLWALVLGFGVGVQRTLGDLRRSYMEQA 72 Query: 67 HDVATLSDVALLKRLR-NAADWFGILAAQTLAVRAAVTGCTSGKRLR------LVDGTAI 119 ++ A R ++ + + G +RL+ ++D + + Sbjct: 73 GH--SVVPSAFYDRFTPELVEFLKRCVEKAIGHLVVEPGQVMSERLKDILDIAVIDSSLV 130 Query: 120 SAPGGGSAEW------------RLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADE 165 + +W +++M + ++ + E L + D Sbjct: 131 RLHDQLAKKWPGPRTNHSPAAAKVNMLVSVFGATRSQVQIVEGTRGESKLLSIGSWVKDR 190 Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 I + D G+ S + + + ++ R+ + + ++ E Sbjct: 191 ILLFDLGYFS-FKHFGKIMNEKGYFVSRLKSNSNPLI--------LRSLIQHRGRTIAVE 241 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 ++ G+ + +I + + S L+ + RVV E + Sbjct: 242 GKRLLDIKGSLRRE------IIDFEVLVSNSQSSNMDLVKRTALQLRVVGILNEETKDYH 295 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLA 345 +T+LP + + AE +A YR RW IEL FK LKS HL+++ + + + +A ++ LL Sbjct: 296 FYITNLPAERFPAEDIATLYRARWTIELLFKELKSYYHLESISSGKDCIVEALLYTALLT 355 Query: 346 AFLIDDIIQPSLDFPPRSA 364 + I+ + P A Sbjct: 356 LIVSRRILGLLREQFPEHA 374 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 121 bits (303), Expect = 4e-26, Method: Composition-based stats. Identities = 64/355 (18%), Positives = 110/355 (30%), Gaps = 49/355 (13%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL--------HDVATLSDVALLKRLR 82 +R R++ ++R L M +++ + S + R + Sbjct: 34 FSRNRKLP-FEEVIRFLLPLQGQCMDQELFRHFSKKPLFFSTDYSGIPH-SSAMIQARQK 91 Query: 83 NAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP---------------GGGSA 127 + L G +L +DG+ S P G Sbjct: 92 LSDSAMPALFHS-FTETCKKGALFQGYQLLAIDGSQFSVPENLKEPLCWRKIPNISKGRN 150 Query: 128 EWRLHMGYDPHTCQFTDFELTD-------SRDAERLDRFAQTADEIRIADRGFGSRPECI 180 L+ Y + F D A+ +DR + I +ADRG+ S Sbjct: 151 VIHLNAMYHLQSGIFEDVVFQPICECNEHKALAQMVDRRSSAFPAIFMADRGYESYNT-F 209 Query: 181 RSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA 240 + Y+VR G G GL+ E + KK Sbjct: 210 AHIEQKGDKYVVRGRESG-------------TGICSGLNLPDTEEYDIEKELYICKKHSK 256 Query: 241 PFPARLIAVSLPPEKALISKTRLL-SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 A E R R+V+ + E +L T+L ++++SA+ Sbjct: 257 KVKTNPRKYKRIRSDATFDFFTDDCEEYRLNLRIVKIKLSETTT-EVLFTNLSKEKFSAD 315 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 + Y +RW IE AF +LK L ++ +K EL ++ L+ I+ Sbjct: 316 DLKRLYHMRWGIETAFDQLKYALGAASVHSKNSELIIQELYGKLIMFNFCKTIVG 370 >UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostridiales RepID=C7GFW6_9FIRM Length = 436 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 67/350 (19%), Positives = 130/350 (37%), Gaps = 36/350 (10%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 R+R++ D ++ L ++ G ++ + + V T S +R + + F Sbjct: 33 FIRKRKL-DFKKMMHLIISMESGSLNHELLKFFEYDSSVPTGSAF-YQQRSKLSVSAFRH 90 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGT------------AISAP-GGGSAEW-RLHMG-- 134 L + ++ + L DG+ P G + + +H Sbjct: 91 LLKE-FNLKFPLEKFRGKYYLIACDGSEFNIARNLKDADTFHEPNGKSVSGFNMVHTISL 149 Query: 135 YDPHTCQFTDFELTDSRD-------AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGE 187 Y+ + ++ D E+ R +DR+A A I IADRGF S ++ Sbjct: 150 YEVCSKRYLDLEVQPGRLKNEFQAICNLMDRYAYGASPIFIADRGFSSYNVFAHAIEN-N 208 Query: 188 ADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 D+++R ++ G D LD T +K R I Sbjct: 209 VDFLIRAKDLNVQRFLGGGTLPD------KLDTTIELILTRTQSKKKHKHPEKESQYRYI 262 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 ++ + + + E K R+V+ E + ++T+L E++++ + + CY L Sbjct: 263 GKNIAFDYLNPA--DISDEYLLKLRIVRVEVSDGV-FENIITTLSEEDFTPDDIKYCYNL 319 Query: 308 RWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 RW IE +F+ LK + L +K+ E +++ L+ II Sbjct: 320 RWGIETSFRDLKHTIGATNLHSKKTEYVAFELWSKLILYNFCSIIILHVP 369 >UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostridium sp. SS2/1 RepID=B0NXD2_9CLOT Length = 439 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 56/351 (15%), Positives = 113/351 (32%), Gaps = 44/351 (12%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 TR+R++ + + Y G + ++ T S L ++ + F Sbjct: 35 FTRKRKL-SFQDTINTIVTYDAGSIGRCIKRYIPKVEKTPTTSAF-LQQQKKLKLSAFQT 92 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP-----------------GGGSAEWRLHM 133 L + + VDGT ++ P ++ H+ Sbjct: 93 LFYR-FNDPFPDKTLYH-LHILSVDGTGVTVPMDRINENKEYARVRTNKDCTRPAYQFHV 150 Query: 134 G--YDPHTCQFTDFELTDSRD-------AERLDRFAQTADEIRIADRGFGSRPECIRSLA 184 YD ++ D + R + L+R + IADRG+ S Sbjct: 151 SCIYDLINERYCDAYIEPFRTHSETHVFSVMLERKNFPQKALFIADRGYESYL------- 203 Query: 185 FGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPA 244 + ++ G +L F ++G ++G + K A Sbjct: 204 -----LMAQIQHDGNYFLIRAREDFGQGSMIKGYPFPRDGTFDKTVTYIYTKTQNKRTKA 258 Query: 245 RLIAV-SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVAD 303 + + + + R V L+T+LP +++ +E + Sbjct: 259 NPELYKRVATRNSPYFINKEHPYVKMTLRFVMIVLPNGQK-ECLITNLPANKFPSETLKK 317 Query: 304 CYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 Y +RW+IE +F+ +K +L +K+ E + I+A ++ I Q Sbjct: 318 LYCIRWKIETSFRLIKYSANLLEFHSKKIEFLQQEIWAKMIFYNFTTTITQ 368 >UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3VMZ1_KLEPN Length = 421 Score = 114 bits (285), Expect = 5e-24, Method: Composition-based stats. Identities = 80/380 (21%), Positives = 125/380 (32%), Gaps = 66/380 (17%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVT 61 +S I + ++L++ + R R I A L A G G Sbjct: 3 QNQITAFSTIFEALFSEQQLNSLGVQTHMIERFRLITPAKLCLAFVCALGSGNAR-TIAD 61 Query: 62 AWAQLHDVATLSDVALLKRLRNA------ADWFGILAAQTLAVRAAV------TGCTSGK 109 + + ++S LK N ++ + Q LA+ K Sbjct: 62 IHRYFNHLHSMS--VRLKPFHNQLVKLGTPEFMRQVFEQALALHLPAMHTFSDAYRGHFK 119 Query: 110 RLRLVDGTAI--------SAPGG----GSAEWRLHMGYDPHTCQFTDFELTDSRDAE--R 155 ++ L DGT+ PG A LH+ YD Q L++ +E Sbjct: 120 QVLLQDGTSFAVHDGLSLHFPGRFSTHSPAAVELHVTYDLEKAQPVRVSLSEDTASERDY 179 Query: 156 LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRF--DMMG 213 L + +AD G+ S+ I SL A +++R+ T + Sbjct: 180 LPVAQSLRGCLLMADAGYFSKA-YIESLQNEAASFVLRMPASVNPMATCNQTGLCQPLRS 238 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 +L L K+GE + + A P Sbjct: 239 WLAVLP--KHGELDLDVQWPDGPVYRCVLFASTDHKDKP--------------------- 275 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 V L T+L + A V + YRLRWQIEL FK KSL L+ + Sbjct: 276 -----------VCLCTNLDRHTFPAATVGEWYRLRWQIELLFKEWKSLNSLNKFNTEYST 324 Query: 334 LAKAWIFANLLAAFLIDDII 353 +A+ I+ +LLAA L +I Sbjct: 325 IAETLIWGSLLAATLKRWLI 344 >UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuringiensis serovar pakistani str. T13001 RepID=C3EBZ9_BACTU Length = 221 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 50/228 (21%), Positives = 85/228 (37%), Gaps = 26/228 (11%) Query: 133 MGYDPHTCQFTDFELTD--SRDAERLDRF--AQTADEIRIADRGFGSRPECIRSLAFGEA 188 M YD + F ++T+ S DA+ ++ I D G+ P+ + A Sbjct: 1 MEYDVISGDFLQLDITNGISHDAKYGQELIHTVEKRDLCIRDLGYFYLPD-FHEINQKGA 59 Query: 189 DYIVRVHWRGLRWLTAE--GMRFDMMGFLRGLDCGKNGET-TVMIGNSGNKKAGAPFPAR 245 Y+ R+ + R + F++ + GK E V I P R Sbjct: 60 YYLSRLPINTQVYRKKGILYERLYLEDFIKKVSEGKTIEWFDVYIRKQH------KVPTR 113 Query: 246 LIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCY 305 LI L A +S + R V +L+T++P D E++ Y Sbjct: 114 LIIYKLTG--AGYDGKNNVSTATKYKRQVS----------ILMTNIPSDILQKEEIYPLY 161 Query: 306 RLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 +R QIE+ FK KSL + + + E + ++ L A L ++ Sbjct: 162 TVRGQIEILFKTWKSLCGIHLCKHVKLERFQCHLYGQLTAILLHSMLM 209 >UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FU81_METHJ Length = 452 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 63/364 (17%), Positives = 124/364 (34%), Gaps = 42/364 (11%) Query: 19 EELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM---SLREV-TAWAQLHDVATLSD 74 + ++ AR G + R+R++ LL L +G +L E+ + L D + Sbjct: 23 DFIEKKARETGFMQRKRKLD--PVLLIFSLIFGVSSHLKPTLEEIHRHYVDLDDNPKIET 80 Query: 75 VALLKRLRNA-----ADWFGILAAQTLAVRAAVTGCT------SGKRLRLVDGTAIS--- 120 L + R D+ L + + K + + D + I Sbjct: 81 SILNQSFRKRFNYKLVDFLKSLMDHYIDQIVHQSPAHLKGIVEDFKDILVQDSSIIRISK 140 Query: 121 -----APGGGS----AEWRLHMGYDPHTCQFTDFELTDSR--DAERLDRFAQTADEIRIA 169 P S A ++H Y + +T R D + L + + I Sbjct: 141 KLYDLHPAARSRDDSAGLKIHAVYSVVYHSVKNAIITTERVHDYKMLKIGPDVENILLIN 200 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLR-WLTAEGMRFDMMGFLRGLDCGKNGETTV 228 D G+ S + + + RV + ++ ++ + +C K+ Sbjct: 201 DLGYYS-LKTFSKIQEYGGFFASRVKSNAVFKVVSINSGPPEITSIVDH-NCFKSINGDD 258 Query: 229 MIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLL 288 + + LI ++ + ++ RV+ + L + Sbjct: 259 FL-----DRMPKKGVYDLIC---SFHIGDKHINKIKTPIFQEFRVICSWNPLTEKWHLYI 310 Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 T+L ++ +SA+ + + YR RW IEL FK LK L + +A I++ LL + Sbjct: 311 TNLGKEVFSADDIYELYRFRWVIELIFKELKGDYDLGKMLLNNEPMAFIHIYSMLLRFII 370 Query: 349 IDDI 352 D+ Sbjct: 371 SRDL 374 >UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coprococcus comes ATCC 27758 RepID=C0BDH6_9FIRM Length = 204 Score = 111 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 73/189 (38%), Gaps = 13/189 (6%) Query: 161 QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDC 220 + I DRG+ S + G+ ++ R + L D F Sbjct: 27 DGIKSVYIGDRGYCSYNNMAHVVEQGQ-YFLFRTKDIHSKGLVGNFNFPDAESF------ 79 Query: 221 GKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK--TRLLSENRRKGRVVQAET 278 + +V++ S +KK A + A R+++ Sbjct: 80 --DINVSVILVRSHSKKILADIHTEGYI-RFVDQSAAFDYIEYGSYDTYELSFRILRF-P 135 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 + + + ++T+LP DE+ E++ Y RW IE +F++LK + L A +PE K Sbjct: 136 ISTSTYECIVTNLPRDEFPVERIKTLYNARWSIESSFRKLKYTIGLSNFHAYKPEYVKQE 195 Query: 339 IFANLLAAF 347 I+A LLA+ Sbjct: 196 IWARLLASL 204 >UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipelotrichaceae RepID=B7CEB8_9FIRM Length = 431 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 53/343 (15%), Positives = 113/343 (32%), Gaps = 38/343 (11%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 TR+R++ TL+ + ++ + + + T S + +R + F Sbjct: 32 FTRKRKLP-VETLIHFIIQMQSKSLNSELCEYFNDIDFLPTASALC-QQRDKLDISAFQR 89 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP--------------GGGSAEWRLHMGYD 136 + G + DG+ ++ +++ ++ YD Sbjct: 90 IMH-LFVNAFDDYKTWKGYHVLACDGSDVNIAYDEKDEDTKRQNGNNKPFSQFHINGLYD 148 Query: 137 PHTCQFTDFELTDSRDA-------ERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEAD 189 F D + + E + + + I ADRG+ + + Sbjct: 149 CINHVFWDTSIDTANKTRECAALMEMIMKHDYPENSIITADRGYEKYNLIACCIENNQKF 208 Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 D+ G + + E + + +K A Sbjct: 209 VF-------------RIKDIDVFGSILSNLNLPDEEFDLDVTKILTRKQTNETKANKHKY 255 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRW 309 + K+ + + RVV+ + + + L+T+L DE+ ++ Y +RW Sbjct: 256 TFISNKSEFNYFGTKEFYKMNLRVVRFKITDDT-YECLVTNLTRDEFDLNELKKMYHMRW 314 Query: 310 QIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 IE AFK LK ++ + A +K+ + I+A +L L + I Sbjct: 315 DIETAFKVLKYIIGMMAFHSKKRNFIQQEIYAAILLHCLTNII 357 >UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=Q648P8_9ARCH Length = 464 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 60/315 (19%), Positives = 108/315 (34%), Gaps = 66/315 (20%) Query: 79 KRLRNAADWFGILAAQ---TLAVRAAVTGCTSGKRLRLVDGTAISAP------------- 122 RLR + L + L +++ G+ ++LVDGT +S P Sbjct: 105 ARLRLPINLVRRLVRETGKLLHLKSEEAWKWKGRSVKLVDGTTVSMPDTPENQKMYPQPE 164 Query: 123 ----GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQT-------ADEIRIADR 171 G G RL D + + E + +I + DR Sbjct: 165 GQKEGVGFPIARLVAIISLSCGAVLDIAIGPYKGKETGEHALLRQILGSISTGDILLGDR 224 Query: 172 GFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIG 231 + S I L AD + R+H + F L D Sbjct: 225 YYCSYFL-IVMLQQLGADSVFRIH-------GSRKKDFRRGKHLGKKD------------ 264 Query: 232 NSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSL 291 ++ P ++ + + + ++ V+ T L Sbjct: 265 -------------HIVIWKKPKQRPNWMTESMYLQMPD---TLTIREIKINRKVITTTLL 308 Query: 292 PEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDD 351 E++ E++ + Y RW IE+ F+ +K++L +D LR K P++ I+ +LLA LI Sbjct: 309 DPKEFTREEIDELYAKRWLIEVDFRFIKTVLQMDILRCKTPDMVCKEIWVHLLAYNLIRT 368 Query: 352 IIQPS---LDFPPRS 363 ++ + + PPR+ Sbjct: 369 VMAQAAHRYNLPPRT 383 >UniRef50_C3FBK7 Transposase for insertion sequence element IS231B n=3 Tax=Bacillus thuringiensis RepID=C3FBK7_BACTU Length = 180 Score = 107 bits (267), Expect = 6e-22, Method: Composition-based stats. Identities = 24/99 (24%), Positives = 44/99 (44%) Query: 257 LISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFK 316 + + + +KG ++ G + +T+ P + EQ+ D Y LRWQIE+ FK Sbjct: 1 MERRKKQSYTESKKGITFSEKSKRLTGINIYVTNAPWEVVPMEQIHDFYSLRWQIEIIFK 60 Query: 317 RLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 KSL + + + E + ++ L+A L + Sbjct: 61 TWKSLFQMHHWQTIKQERLECHVYEKLIAILLCFSTMFQ 99 >UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID=Q4V248_BACCZ Length = 140 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 39/92 (42%) Query: 240 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 R+I L ++ + + +KG ++ G + +T+ P + E Sbjct: 49 QKLFTRVIIYRLTEKQIQERRKKQNYTESKKGITYSEKSKRLTGINIYVTNTPWEIVPME 108 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 Q+ D Y LRWQIE+ FK KSL + + Sbjct: 109 QIHDFYSLRWQIEITFKTWKSLFQIHHWHNIK 140 >UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=A7B831_RUMGN Length = 366 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 53/315 (16%), Positives = 116/315 (36%), Gaps = 45/315 (14%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 +R R++ +T+ + L+ G + + + D + S +R + + F Sbjct: 42 FSRNRKLDFVSTI-QFLLSMESGSLKKELLDYFQFSVDTPSASAFC-QQRNKLLLEAFQF 99 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGTAISA---PGGGSAEWR----------LHMG--Y 135 L + + +L DG+ ++ P ++ +H+ + Sbjct: 100 LFYE-FNSCFSFEKKYKDYQLLACDGSDLNIARNPNDAGTYFQSQPTDRGFNQIHLNALF 158 Query: 136 DPHTCQFTDFELTDSRD-------AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEA 188 D ++ D + +R + +DR+ I IADRG+ + + Sbjct: 159 DLCEKRYIDLVIQPARLENESLAMTQMIDRYKGEKKTIFIADRGYETYN-IFAHVQEKGM 217 Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK-KAGAPFPARLI 247 Y++RV G +T D F + + +++ K P + I Sbjct: 218 YYLIRVKDGGGGSMTGSFDLPDENEF--------DHDMQLILTRKQTKDVKAKPKKFKFI 269 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 A S P + + + + N RVV+ E + ++T+LP++++ E++ Y + Sbjct: 270 AKSSPFDYLDLYDKKFYTLN---FRVVRFAISED-SYESIITNLPKEDFPVEEIKKVYAM 325 Query: 308 RW------QIELAFK 316 RW IE+ ++ Sbjct: 326 RWHRNIVQGIEICYR 340 >UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A1U3_PELCD Length = 489 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 64/361 (17%), Positives = 126/361 (34%), Gaps = 55/361 (15%) Query: 28 AGALTRRREIRDAATLLRLG--LAYGPGGMS--LREVTAWAQLHDV--ATLSDVAL-LKR 80 +GA++RRR T + GG +R++ ++A + + + S + R Sbjct: 60 SGAMSRRRLFSKENTFWAFFSQVLDADGGCKEVIRKLQSYASIKGIKVPSSSTASYCTAR 119 Query: 81 LRNAADWFGILAAQTLA--VRAAVTGCTSGKRLRLVDGTAIS-----------------A 121 + A + A T + TG + +R+ + DGT +S Sbjct: 120 KKLAEPMLADILAHTAEQLEKMPATGMLNNRRVIVADGTGVSMPDTPENQAAWPQSSALK 179 Query: 122 PGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAE----RLDRFAQTADEIRIADRGFGSRP 177 PG G R+ + + + + + ++ E R +I + D+GF S Sbjct: 180 PGCGFPSARICACFSLDSGALLSYAIGNKKNNELPLFRQQWETFNPGDIFLGDKGFCSYF 239 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKK 237 + I L D +V + R + + L + K + ++ Sbjct: 240 D-IAKLQDRGVDSVVTLAKRAPVRAASSLKKLGPDDLLITWERPKYAQILSYSKDAWAN- 297 Query: 238 AGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYS 297 R I V +P G ++ T + Y Sbjct: 298 LPKKLTLRQIKVKVPHPGFRTR-----------------------GFYIVTTLIDAARYP 334 Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 AE +A+ Y RW +EL F+ +K+ + +D LR P++ + I + +A + +I + Sbjct: 335 AEDLAELYFKRWDVELFFRDIKTTMGMDVLRCLTPDMIRKEILMHFIAYNCVRRLIYEAA 394 Query: 358 D 358 + Sbjct: 395 E 395 >UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XCY0_9BACT Length = 481 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 75/372 (20%), Positives = 124/372 (33%), Gaps = 49/372 (13%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAW------AQ 65 L + P + AGA +R R T G + REV AQ Sbjct: 31 LEALFAPFIPEQLLSRAGANSRERFYTLRQTFWAFLWQALHPGTACREVVRQLLSDWQAQ 90 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGG 125 + A R R L A A G ++LVDGT S P Sbjct: 91 AGRTRAQAGTAAYCRARQRLP-LERLQAILQATLGPEPPRWRGHAVKLVDGTTFSLPDTA 149 Query: 126 SAEWRL-HMGYDPHTCQFTDFELTD------------SRDAERLDRFAQ--------TAD 164 + + + G C F ++ +R + R+ Sbjct: 150 ANQKKFPQSGAQKPGCGFPTLKVVALFSLASGLALNWARGSLRVHEIPLFRKLWSGLRRR 209 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 ++ I DRGF S L D + R+H + +R L+ Sbjct: 210 DLIIGDRGFSSYTNLALLLGR-GVDCLFRLHQ-------GKKVRHPRRSRLQRKQKLGPR 261 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 + V K P R + P + + + +V + Sbjct: 262 QWLV----QWKKPYQKPEYMRPKEWAAVPSEMQVRVFEV---------IVCTRGMRTRKL 308 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +L+ T L Y E++A+ Y RW+IEL+F+ LK+ L L+ LR + P + + ++ +L+ Sbjct: 309 MLVTTLLDPVRYPVEELAELYLRRWEIELSFRDLKTTLGLEVLRCQSPAMVEKEVWMHLI 368 Query: 345 AAFLIDDIIQPS 356 A L+ ++ S Sbjct: 369 AFNLLRRVMLQS 380 >UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZQ0_9PLAN Length = 457 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 67/330 (20%), Positives = 105/330 (31%), Gaps = 63/330 (19%) Query: 48 LAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTS 107 LA G S + + +AL +RL A+ Sbjct: 81 LAQGLSRCSGDTTSYCQARRRLP----IALFQRL------LAW-TARKCDEAGLGDWRYQ 129 Query: 108 GKRLRLVDGTAI-----------------SAPGGGSAEWRLHMGYDPHTCQFTDFELTDS 150 G+ + +VDGT + PG G R+ + T T F + Sbjct: 130 GREVIIVDGTTVTMADTRANQTAFPQIENQKPGCGFPLARIVQVFSLATGAATMFAMGRY 189 Query: 151 RDAERLDRFAQ-------TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLT 203 E + + EI +ADR + S S D + R H R Sbjct: 190 AGKETGETSLLRTLLSQFHSGEIVLADRYYASFWLLALSDLR-GIDIVARAHHR------ 242 Query: 204 AEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRL 263 + F RGL G + ++G + ++ E + L Sbjct: 243 ------RKIDFRRGLRQGDCDQ---IVGYAKPQRPTWM---------TTDEYDQYPSSIL 284 Query: 264 LSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH 323 + R V L T L D Y AE +AD YR RWQ EL + LK + Sbjct: 285 VRHLR---YEVTQRGFRTRRITLATTLLQGDVYRAEDLADLYRRRWQAELHIRSLKIQMQ 341 Query: 324 LDALRAKEPELAKAWIFANLLAAFLIDDII 353 +D LR K P + + + +++ L+ + Sbjct: 342 MDHLRCKSPAMVRKELHCHMIGYNLVRAAM 371 >UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostridium nexile DSM 1787 RepID=B6FTH4_9CLOT Length = 224 Score = 99.9 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 38/165 (23%), Positives = 70/165 (42%), Gaps = 13/165 (7%) Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK-KAGAPFPARLI 247 Y++RV G +T D F + + +++ K P + I Sbjct: 2 YYLIRVKDGGGGSMTGSFDLPDDNEF--------DHDMQLILTRKQTKDVKANPQKFKFI 53 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 A S P + + + + N RVV+ E + +LT+LP++++ E++ Y + Sbjct: 54 AKSSPFDYLDLYDKKFYTLN---FRVVRFAISED-SYESILTNLPKEDFPVEEIKKVYAM 109 Query: 308 RWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 RW IE +F+ LK + L +K+ E I+A L+ + I Sbjct: 110 RWGIETSFRELKYAIGLCCFHSKKVEYIMQEIYARLILYNYCELI 154 >UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=Streptococcus RepID=A4W4J4_STRS2 Length = 440 Score = 95.3 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 55/360 (15%), Positives = 108/360 (30%), Gaps = 52/360 (14%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 +R+ ++ T+++ L G ++ + D+ + +R + F Sbjct: 42 FSRKSQL-TMETMIQAILTMGGNTLAKELLDL-----DLPVSQSAFVQRRYQLKHQAFKA 95 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGS---------------AEWRLHMGY 135 L A + + VDG+ + P S ++ Y Sbjct: 96 LFANITSKIPTFKDLP----ILAVDGSDVVLPRNRSDKTTTFQTGPHHTPYTLIHINALY 151 Query: 136 DPHTCQFTDFELTDSRD----AERLDRFAQTA--DEIRIADRGFGSRPECIRSLAFGEAD 189 + + D + ++R+ A +D + I DRG+ S Sbjct: 152 NLEQEIYHDLRIQNNREVDERAAFIDMMESCPFEQALVIMDRGYESYNVMAHCQER---- 207 Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA-----PFPA 244 +W + + L C E + I P Sbjct: 208 -----NWSYIIRIRDGNHSMKSGFNLPDTPCFDE-EFDLNICRKQTNVMKELYRDFPNQY 261 Query: 245 RLIAVSLPPEKALISKTRLLS--ENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 + + + S + R+V+ E L+T+ YS E++ Sbjct: 262 HFLPHNASFDLLPNSSRKSDPISFYDLHFRMVRLEIKPGF-FETLVTNTD---YSPEKLK 317 Query: 303 DCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPR 362 D Y RW IE +F+ LK + L AK+ E I+A+ + + + P + Sbjct: 318 DLYAYRWGIETSFRDLKYSIGLTHFHAKKKEGILQEIYAHFINFNVCKWLTSHVAIKPSK 377 >UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=Gammaproteobacteria RepID=Q7MLW1_VIBVY Length = 445 Score = 94.5 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 71/363 (19%), Positives = 123/363 (33%), Gaps = 35/363 (9%) Query: 18 PEELDTSARNAG--ALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT--LS 73 P+E A A RRR + + L L G + A+ +V L+ Sbjct: 29 PDEWVAKAATLSDKATIRRRRLPSD---MVLWLIVGMAFFRNESIAEVARRMNVCAEGLA 85 Query: 74 DVALL---------KRL-RNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPG 123 D LL +RL + A +W + T + G ++ +DG Sbjct: 86 DEELLAKSALTQARQRLGKAAPEWLFRQCSHTWGLERYPEDTWQGLQVFAIDGALFRT-- 143 Query: 124 GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSL 183 ++E R H G + + T + + I A R E ++ Sbjct: 144 ADTSELREHFG----SGNTSSERQTPHPVLRVVTMMNVRSHVIVDAAISPYRRGEIPLAM 199 Query: 184 AFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFP 243 +I + + L D++ L+ ++ G Sbjct: 200 P-----FIDSLPDNSVTLLDKGFYGADLLLSLQNSGSNRHWLLPAKKGVKFRLLDDEESD 254 Query: 244 ARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVAD 303 L+ + + P+ + K +V H + TSLP +EY AE VA+ Sbjct: 255 DMLVEMKVSPQA-----RKKNPNLPEKWQVRAVTYQVQGKHKTVFTSLPREEYDAESVAE 309 Query: 304 CYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 Y RW+I+L ++ +KS + +A LR+K EL ++ LL L+ + Sbjct: 310 LYHERWEIKLGYRDIKSSMQHNALVLRSKTVELVYQELWGLLLGYNLVRREASQAAVAHG 369 Query: 362 RSA 364 R A Sbjct: 370 RMA 372 >UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4SUB1_AERS4 Length = 420 Score = 92.9 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 74/372 (19%), Positives = 126/372 (33%), Gaps = 61/372 (16%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAW-----AQL 66 + + P E + AR R R I L+ L GG QL Sbjct: 10 INQLLTPAETECIARLCKFCLRLRAI-TPWMLVTSLLRAFGGGKVGAIACLHQHFNGLQL 68 Query: 67 HDVATLSDVALLKRLRNAA--DWFGILAAQTLAVRAAV----TGCTSGKRLRLVDGTAIS 120 +S +LR A + L + +A+R + K++ L DGT+ + Sbjct: 69 AHTHQVSYKPFHNQLRKPAFAQFMKALVERAIALRIGQQVTDVAQGAFKQVLLQDGTSFA 128 Query: 121 A--------PGG----GSAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEI 166 PG A HM + +L+ +ER L + + Sbjct: 129 VHKRLATVFPGRFKTISPAAIECHMTMSLLEQKPLCMQLSADTASERQFLPDAKKLTGSL 188 Query: 167 RIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGET 226 +AD G+ R Y V+ G +L + D G+ E Sbjct: 189 LLADAGYIDR------------AYFAEVNKAGCFYLVRGRKGLNPKILRAWRDDGRAVE- 235 Query: 227 TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKG----RVVQAETLEAA 282 +L +SL E + +L + + G R+++ E Sbjct: 236 ------------------KLTGMSLKEEGRRHCRAEVLDMDVKSGKYEYRLIRRWFAEET 277 Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 + +T+L + + AE+V YR RWQ+EL FK KS +L + + + ++ + Sbjct: 278 RFCVWMTNLARETWPAERVMRLYRCRWQVELLFKERKSYNNLKGFVTGQKAITEGLVWDS 337 Query: 343 LLAAFLIDDIIQ 354 LL+ L + Q Sbjct: 338 LLSLVLKRRVAQ 349 >UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IS08_9CHRO Length = 472 Score = 92.9 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 53/286 (18%), Positives = 102/286 (35%), Gaps = 36/286 (12%) Query: 97 AVRAAVTGCTSGKRLRLVDGTAISAP-----------------GGGSAEWRLHMGYDPHT 139 + G+ ++ +DG+ +S P G G ++ + + T Sbjct: 118 EEKVDKKHLWHGRCVKSIDGSTVSMPDSLKNQEAYPQHGSQKKGCGFPLAKIGVLFSYAT 177 Query: 140 CQF-----TDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRV 194 F+ D + A +L + A +I + DR F S + I S D ++R+ Sbjct: 178 GSVVGIVIDIFKTHDIKLARKLTDYLD-AGDILLGDRAFCSYID-IYSWKKKGIDSVMRL 235 Query: 195 HWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPE 254 H L+ F + K +++ +K + Sbjct: 236 HQGRLQKGKKRPKYTVSPPFKKKKKTRKCPHDRLILWEKPKRKPKDISK---------ED 286 Query: 255 KALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELA 314 + K +L E + T E +++ T + EY + + D Y RWQ E+ Sbjct: 287 FYSLPKDLVLREVHCYICIPGFRTKE---IIVVTTLIDAIEYPSSDILDLYDQRWQAEVN 343 Query: 315 FKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 + +K+ L +D L + PE+ + I+ LLA + I+ + D Sbjct: 344 LRNIKTTLGMDILTCQTPEMVRKEIYVYLLAYNFLRSIMYDAGDIF 389 >UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RFU1_9FIRM Length = 443 Score = 90.6 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 56/337 (16%), Positives = 110/337 (32%), Gaps = 35/337 (10%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAW-AQLHDVATLSDVALLKRLRNAADWFG 89 TR R I TL++ L +S + + D+ ++S V+ +R + F Sbjct: 42 FTRSR-ILTPKTLIKFILGLQAHSLSGEVSDYFTSSNIDIPSISAVS-QRRDLLYPEIFK 99 Query: 90 ILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP------------GGGSAEWRLHMGYDP 137 + + L+ ++ +G + DG+ I+ P ++ L+ YD Sbjct: 100 SINRRFLSSIDNLSTL-NGYYILAQDGSDINLPFWHDDTQISYGQDSIVCQYHLNALYDC 158 Query: 138 HTCQFTDFEL-------TDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADY 190 F + + S + ++ + I ADRG+ S + + Sbjct: 159 INHVFWESRIDLPTKKSEKSALIDFINHRNYPENSIITADRGYESYNLIAHCIENNQKFV 218 Query: 191 IVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS 250 R + M + T + N+ + Sbjct: 219 F--------RVKDIDTRSGIMTSISLPDETFDITVTRTLTNLQTNEVKKNENNQFVFV-- 268 Query: 251 LPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQ 310 P R+V+ + + + L+T+L E+E+ D Y LRW Sbjct: 269 -PSTSVFDYLDACNRFYNLSFRIVRFKIADD-KYETLVTNLDENEFGLSDFKDLYHLRWN 326 Query: 311 IELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 E AF LK + + K+ + + I+A++L Sbjct: 327 EETAFYYLKHAVGMLYFHCKKRQHIQQEIYASILFYN 363 >UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacillus sp. SG-1 RepID=A6CHG0_9BACI Length = 381 Score = 90.3 bits (222), Expect = 9e-17, Method: Composition-based stats. Identities = 75/368 (20%), Positives = 121/368 (32%), Gaps = 59/368 (16%) Query: 9 SAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGG-MSLREVTAWAQLH 67 S +L EE++ G R+ A L+R + S R+ Sbjct: 8 SEVLQTFITDEEVENLCEKWGYRDTARKF-SAKDLVRFFVISSAKDWKSFRDAETKIPQE 66 Query: 68 D-VATLSDVALLKRLRNAA-DWFGILAAQTLA--VRAAVTGCTSGKRLRLVDGTAI--SA 121 D + ++ L K+ +N L ++ + R +L VD T I Sbjct: 67 DSLPSVDHSTLAKKAQNVPYQILQELFSRLVNRLGRGMRRALFKPYKLFAVDSTTITFQH 126 Query: 122 PGGGSAEW-------RLHMGYDPHTCQFTDFELTDSRDAERL---DRFAQT-ADEIRIAD 170 P A + RLH +D Q T T R + + + T I AD Sbjct: 127 PDMSWAGYTRTRHAIRLHTKFDVEEGQPTQVIPTTGRHHDVMVAPKLYEDTEPLSIITAD 186 Query: 171 RGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMI 230 RG+ +R L +++R+ + E +V + Sbjct: 187 RGY-ARTRDFEDLQEDNQFFVIRIAS--------------------SFSLSEEMEHSVPL 225 Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 GN K E + + + + RVV E + T+ Sbjct: 226 DEDGNVK----------------EDLTAFIGKNSRKTKNRFRVVTFTDNEGNRIKVA-TN 268 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 L S E +A Y+LRWQIEL F+ +K L L L P ++ L++ FL+ Sbjct: 269 L--MMMSPEDIAYIYKLRWQIELFFRWVKGNLDLSNLFGNSPNSVYIQVYGTLISYFLLR 326 Query: 351 DIIQPSLD 358 I + D Sbjct: 327 WIYNETKD 334 >UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacterium Mst37 RepID=Q8VV93_9GAMM Length = 423 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 60/347 (17%), Positives = 109/347 (31%), Gaps = 64/347 (18%) Query: 23 TSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLR 82 + R+R I+ L + L G S+ VT + T +DV K Sbjct: 22 EIGKKVNFCNRKRIIKPFE--LVMSLITALGDKSVDTVTDLHRYFVKLTETDVQY-KPFH 78 Query: 83 N--AADWFGILAAQTL---------AVRAAVTGCTSGKRLRLVDGTAISAPGG------- 124 N + F L + + V ++ K + L DG++ + Sbjct: 79 NQLSKPEFVGLIKELIGVAVNDWQQQVLGTEVELSAFKGIVLQDGSSFAVHDSLKDIFTG 138 Query: 125 -----GSAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIRIADRGFGSRP 177 A +H+ +D ++ AE L + +ADRG+ + Sbjct: 139 RFTKISPAAIEVHVSWDVLKGYPEQVSISPDSQAEYDFLPDADALEGRLLLADRGYF-KL 197 Query: 178 ECIRSLAFGEADYIVRVHWRGLRW----LTAEGMRFDMMGFLRGLDCGKNGETTVMIGNS 233 + + Y+VR G ++ K+ + ++ Sbjct: 198 SYLDEIDQAGGAYVVRAKTTVNPMVVAGFNKAGKPLKRFQKIKQKAVKKHIRRSGIVDMD 257 Query: 234 GNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPE 293 RLIA E + + T+L Sbjct: 258 ----VEGKTNYRLIA-------------------------SWPEGKDEPTY--WATNLDR 286 Query: 294 DEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIF 340 +++SAE+V Y+LRWQIEL FK KS +L ++ + + ++ Sbjct: 287 EQFSAEKVMKLYQLRWQIELLFKEWKSYCNLQKFNTRKATMMEGLVW 333 >UniRef50_A5II18 Transposase, IS4 n=1 Tax=Legionella pneumophila str. Corby RepID=A5II18_LEGPC Length = 379 Score = 88.3 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 62/336 (18%), Positives = 112/336 (33%), Gaps = 63/336 (18%) Query: 36 EIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDV--------ATLSDVALLKRLRNAADW 87 + + L + + SLR + D+ +TLSD +R AD Sbjct: 33 KFKTYEHLQSMLYVHLNQISSLRTLETAINSQDLGLSAKICRSTLSDA--NRR--RKADC 88 Query: 88 FGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPG-----------GGSAEWRLHMGYD 136 F + Q L + K +R++D + I G +LH+ YD Sbjct: 89 FLWILEQLLEMLPKKQKKEFSKIVRVLDSSPIQLKGYGYEWAKHNATRRCEGLKLHVEYD 148 Query: 137 PHTCQFTDFELT--DSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRV 194 T L+ + D+ ++ D I + D+G+ S+ +A ++ R+ Sbjct: 149 LGLESPTRVALSFPNFNDSSMGKQWPIETDIIYVFDKGYCDYDWWW-SIHQKKAFFVSRL 207 Query: 195 HWRGLRWLTAEGMRFDMMGFLR-GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPP 253 + + + L GL N + G K R+ Sbjct: 208 KVNAAISIEQKFETNENSPILEDGLFRFSNPK-----PRGGKKNLYTSLARRISVQR--- 259 Query: 254 EKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIEL 313 E ++L+T+L ++ AE +A Y+ RW+IEL Sbjct: 260 --------------------------EDKDPLILVTNLLDE--PAEMIAQLYKSRWEIEL 291 Query: 314 AFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 FK +K L + + K K + ++A L+ Sbjct: 292 FFKWIKQRLKIKKILGKSENAVKIQLITAIIAYLLV 327 >UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JHT2_9FIRM Length = 424 Score = 87.6 bits (215), Expect = 7e-16, Method: Composition-based stats. Identities = 68/360 (18%), Positives = 127/360 (35%), Gaps = 62/360 (17%) Query: 27 NAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLK-RLRNAA 85 N TR R++ LL + ++L H ++S LK R++ Sbjct: 14 NKNHFTRIRKMP-LQDLLFTMINRKGLTLALELRNYMKLAHPGVSISKPGYLKQRMKLNP 72 Query: 86 DWFGILAA---QTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSA--------------- 127 D F L + + + + + DG+ I+ P Sbjct: 73 DAFLELYKYHNRNFYADSTFSTYKNHL-ILAADGSDINIPTTTETLKLYGSASRKNTKPQ 131 Query: 128 -EWRLHMGYDPHTCQ-----FTDFELTDSRDAE-RLDRFAQTADEI---RIADRGFGSRP 177 + L YD + + R AE +++R +T I I DRG+ S P Sbjct: 132 AQIGLGCIYDVMNRMILESDCNKVKFDEMRLAEKQMERIPETIGNIPYIIIMDRGYPSTP 191 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKK 237 I + + +IVR+ + + D + ++ Sbjct: 192 AFIHMM-DKDLKFIVRLKSSDYKKEQSSLTENDQLVKIK--------------------- 229 Query: 238 AGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYS 297 +R+ P+ R+ R+V+ LE +L T+L + E+ Sbjct: 230 ---LDKSRIRHYEGTPDG-----ERMKELGEISLRMVKI-LLENGNLEVLATNLSQTEFH 280 Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 E++ + Y +RW IE A++ LK+ L L+ +P L I++ + + L++DII + Sbjct: 281 TEEIKELYHMRWGIETAYETLKNRLQLENFTGTKPILLLQDIYSTIYLSNLVEDIILDAE 340 >UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae RepID=Q7ULM3_RHOBA Length = 458 Score = 87.2 bits (214), Expect = 9e-16, Method: Composition-based stats. Identities = 53/269 (19%), Positives = 103/269 (38%), Gaps = 37/269 (13%) Query: 100 AAVTGCTS-GKRLRLVDGTAIS-APG---------GGSAE---WRLHMGYDPHTCQFTDF 145 A +S + + VDG+ ++ P + WRLH ++ + Sbjct: 157 APDPRLSSIQQTITAVDGSLVNALPSLIAASILKQTTGSALVRWRLHTHFEVNNLLPARV 216 Query: 146 ELTDSRDAERLDRFAQT----ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRW 201 ++T + +R D + + DRG+ ++ S+ + Y+ R+ + Sbjct: 217 DVTPDGGGQHDERAVLKRVLEEDRLYVMDRGY-AKFSLFNSIVASSSSYVCRLRDNTVYE 275 Query: 202 LTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKT 261 T ++ R G +T V +G S + P RLI + P + Sbjct: 276 TT---QELELTEGDRA--AGVLSDTIVKLGGSSSSSNSPDHPIRLIQIRCTPHQ------ 324 Query: 262 RLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL 321 NR G+ ++ + G + + T+L AE +A Y RW IE+ F+ K L Sbjct: 325 -----NRTGGKARGSKAPNSDGILRIATNLLN--VPAEIIALIYAYRWTIEIFFRFYKQL 377 Query: 322 LHLDALRAKEPELAKAWIFANLLAAFLID 350 + D L + + ++ +++A LI+ Sbjct: 378 MGGDHLISHNANGIQIQVYCSVIACLLIN 406 >UniRef50_B6FLV1 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FLV1_9CLOT Length = 135 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 24/92 (26%), Positives = 42/92 (45%), Gaps = 1/92 (1%) Query: 264 LSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH 323 RV++ E+ ++T+L E+++ +++ Y RW IE +F+ LK + Sbjct: 8 NPFYTLHFRVLRFPITEST-MECIITNLEEEDFPMKEIKKLYEWRWGIERSFRELKYTIG 66 Query: 324 LDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 L AK+ E IFA L+ + II Sbjct: 67 LTNFHAKKVEYILQEIFARLIIYNFCERIITK 98 >UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobacteria RepID=C5T3Q2_ACIDE Length = 436 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 70/364 (19%), Positives = 124/364 (34%), Gaps = 43/364 (11%) Query: 12 LAHIGKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSL----REVTAWAQL 66 L+ + P + + + G A RRR++ + + M L +E+ Sbjct: 23 LSALLDPAWIAQALQATGKASMRRRKLPAEHAVWLVIGLALFRHMPLWQVVQEMALTLDG 82 Query: 67 HDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG-KRLRLVDGTAISAPGGG 125 ++ S V++ R R A+ + +G R+ VDG A SAP Sbjct: 83 QELPAPS-VSVQVRQRLGAEPMEHMFGLLANAWGRAHAVHAGALRVLAVDGVAWSAPDSK 141 Query: 126 SAEWRL---HMGYDPHT----CQFTDFELTDSRDAERLDRFAQTADEIRIA-DRGFGSRP 177 L Y P + TDS + E+ +A D Sbjct: 142 DNRQELGSGQTQYGPQPWPMVRAVCLLD-TDSHELLDAQLGDYGCGELTLAADLHGLDHS 200 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKK 237 + A+ A +++ G + +L E + Sbjct: 201 ITLFDRAYFSAAFLLAWSQAGQQR-----------HWLMRAKDNLRYEV---VQTLDEGD 246 Query: 238 AGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYS 297 P A L P+ + RL+ R G V+ + + L ++++ Sbjct: 247 WLIRMPVSPRARKLHPQLPSHWQARLIEV--RAGGKVRR---------FITSMLDPEQFA 295 Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQP 355 A +A YR RW+IEL F+ +K L LR+K+PEL K ++ L+A L+ ++ Sbjct: 296 AAPLAQLYRQRWEIELGFREIKQSLQQGQAVLRSKQPELVKQEVWGVLIAYTLLRRWMRL 355 Query: 356 SLDF 359 + Sbjct: 356 MAEH 359 >UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZMB0_PLALI Length = 497 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 53/344 (15%), Positives = 99/344 (28%), Gaps = 68/344 (19%) Query: 41 ATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRA 100 A + R+ + Y G+ + A A + +++RL A Sbjct: 92 AAVQRVAVYYALSGIRISSTNTGAYCRARAKI-PEGVVQRLAVGVG--QRCEAAVPDKW- 147 Query: 101 AVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDF--------------- 145 G R ++DGT S P Y + Q Sbjct: 148 ----RWHGFRTLVIDGTTCSMPDTQEN----QAEYPQPSSQGKGLGFPILRAVALTSLAT 199 Query: 146 ----------ELTDSRDAERLDRFA---QTADEIRIADRGFGSRPECIRSLAFGEADYIV 192 + L R A ++ ++DR + + L +++ Sbjct: 200 GMILALVTGPCAGKATGETALFRTLFDQLKAGDLVLSDR-YYGGWFMLALLQELGVEFVT 258 Query: 193 RVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLP 252 R+H + + + + + + R I V +P Sbjct: 259 RLHQFRIADFHQGKRLGQRDHVVAWAKP----QKPAWLDQATYDRLPDQLEVREIEVQVP 314 Query: 253 PEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 V T A V++ + + E++A YR RW +E Sbjct: 315 --------------------VPGFRT---ASLVVVTSLRDHRRFPREELALLYRRRWTVE 351 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 L + +K+ + L LR +P + ++ LLA LI + S Sbjct: 352 LELRDIKATMDLAVLRCTKPAWVRQELWTGLLAYNLIRQSMLQS 395 >UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_ACIJO Length = 443 Score = 86.4 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 62/365 (16%), Positives = 111/365 (30%), Gaps = 73/365 (20%) Query: 22 DTSARNAGALTRRREIRDAATLLRLG---------LAYGPGGMSLREVTAWAQLHDVATL 72 D R A R+R++ + + + Y + L + A Sbjct: 42 DCLKRTGKASVRKRKLPAEHAVWLVIGLALFRDQPIWYVVQQLQL----VFGTAESCAPS 97 Query: 73 SDVALLKRLRNAA--DWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWR 130 + V +RL F L+ G + VDG S P Sbjct: 98 ASVQARQRLGLEPLNVLFNTLSQTWFEDSQPQYSAFHGLSICAVDGAVWSMPHTDENFRH 157 Query: 131 LHMG----YDPHTCQFTDFELTDSRDAERLD---------------RFAQTADEIRIADR 171 Q L ++ E +D + A+ + + DR Sbjct: 158 FGSSKGKTIAAPWPQARAVCLINTNTHEVIDAGIGSMDQGELTLAKKLKVPANSLTLFDR 217 Query: 172 GFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIG 231 + S AD++ R + +L E +I Sbjct: 218 AYFS------------ADFLSGWQSR------------ENCHWLMRAKDNLRYE---IIR 250 Query: 232 NSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSL 291 + P A L P+ + RL+ + GRV + + + L Sbjct: 251 KNSAHDFQIRMPVSPRAKKLNPDLGDYWEARLIETEQS-GRVRR----------YVTSLL 299 Query: 292 PEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD-ALRAKEPELAKAWIFANLLAAFLID 350 Y E+V+ Y RW+IE+ ++ +KS L LR+K+PEL ++ L+A ++ Sbjct: 300 DSKAYPLEEVSTLYAQRWEIEMCYREIKSDLQDGMHLRSKQPELVYQELWGVLIAYNILR 359 Query: 351 DIIQP 355 ++ Sbjct: 360 RQMKF 364 >UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 RepID=A9DPK2_9GAMM Length = 269 Score = 86.0 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 83/224 (37%), Gaps = 29/224 (12%) Query: 135 YDPHTCQFTDFELTDSRDAERLDR-FAQTADE-IRIADRGFGSRPECIRSLAFGEADYIV 192 D + + ++ ++ER FA + + + D G+ + C ++ I+ Sbjct: 1 MDLMSGHYNYLGISPDSESERHYNPFAYEIQDTLLLMDAGYFNIDYCYQA-DKHGGHVIM 59 Query: 193 RVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLP 252 R + + + A G E +IG + Sbjct: 60 RTNGKINPDIKAAFDS-----------QGLAIEG--LIGKKLKQLKWHR----------- 95 Query: 253 PEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 + + + A G+ L+T+L ++SA++V+ Y LRWQIE Sbjct: 96 EQIIDLDVQWKSKPGTHRLIAFWDRNKSAIGY--LITNLKRAQFSADKVSKLYGLRWQIE 153 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 L FK LKS L ++ +A++ ++A++L L I + S Sbjct: 154 LFFKELKSYSGLKTFNTRDKSIAESLVWASMLTLLLKRFIARAS 197 >UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC 25996 RepID=UPI0001BC4BB6 Length = 403 Score = 85.6 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 68/389 (17%), Positives = 119/389 (30%), Gaps = 63/389 (16%) Query: 3 YSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA 62 YS + I+ I + A + + L+ + A+ SLR + Sbjct: 2 YSISRFQQIIKPIMHGRFQKHV-QQHQADKYSKGFNCHSLLISMVYAHLTHCNSLRTLEQ 60 Query: 63 WAQLH-------DVA-----TLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKR 110 H ++ + AL KR F + + +A + + Sbjct: 61 SFNAHSHHHYHLNLCRRIRHSTLSEALAKRDTRP---FTDMLRELMATCSRTLRKHTQDT 117 Query: 111 ---LRLVDGTAISAPGGGSAEW----------RLHMGYDPHTCQFTDFELTDSRDAERLD 157 L L+D T I G G +W ++H+ + T +T++ + Sbjct: 118 ADLLYLLDSTPIILKGRGFNQWVSSNGRISGLKVHVLMNHANGCPTVQSITEASVNDIDQ 177 Query: 158 RFAQTA--DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 R + D+G+ L A ++ R+ + + Sbjct: 178 RHIVQPEKGATYVFDKGYCDYNWWAE-LDRAGAYFVTRLKANAAVEVIEQFSP------- 229 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKT----RLLSENRRKG 271 ET NS N P L E R + + Sbjct: 230 --------SETQNAHENSRNDNKNTPI--------LTDEYIRFKHKSNSTRPNHYHNKTL 273 Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 R + E VL+ +L SA+++A+ Y+ RWQIEL FK LK L L + Sbjct: 274 RRITVEREGTEALVLVSNNLTA---SAQEIAENYKRRWQIELLFKWLKQHLKLKRFLGRS 330 Query: 332 PELAKAWIFANLLAAFLIDDIIQPSLDFP 360 K + ++A L+ + Q Sbjct: 331 ANAVKLQLLCAMMAYLLL-KLYQQCTTHS 358 >UniRef50_Q648P7 Transposase n=2 Tax=environmental samples RepID=Q648P7_9ARCH Length = 281 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 36/266 (13%), Positives = 90/266 (33%), Gaps = 46/266 (17%) Query: 105 CTSGKRLRLVDGTAISAPG----------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAE 154 + K VDG+ I A +L++ + T ++T+ + + Sbjct: 41 LSRFKDCFAVDGSIIRLNKTLEKIFKSTCKSQAALKLNVKFSIVNLAVTKLQVTEGKRHD 100 Query: 155 RLDRF-AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 RF + + + + D G+ S + + + ++ ++ R+ + Sbjct: 101 NRFRFITKDPNILYLFDLGYWS-FKNFKKIVDAKSFFVSRLKKSCDPLIVTVSDPKWSHL 159 Query: 214 FLRGLDC-----GKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENR 268 + L E + + S ++K+ RL+ + Sbjct: 160 AGKRLSQINGALKGMVELDMKVQLSKSEKSPLKDDLRLVGI------------------- 200 Query: 269 RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 L +T++ + ++ + + + Y RW IE+ F +K +L L+ + Sbjct: 201 ----------LYEGKWRFYVTNIFDTLFTPQVIYELYSERWTIEIFFNDIKHVLKLEHIF 250 Query: 329 AKEPELAKAWIFANLLAAFLIDDIIQ 354 ++ I++ L+ L+ +I Sbjct: 251 SQNKNGIMVEIYSALIFYLLVRIMIA 276 >UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitrococcus mobilis Nb-231 RepID=A4BL98_9GAMM Length = 426 Score = 84.5 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 55/293 (18%), Positives = 88/293 (30%), Gaps = 60/293 (20%) Query: 94 QTLAVRAAVTGCTSGKRLRLVDGTAI-----------------SAPGGGSAEWRLHMGYD 136 QTL RA G R+ L DGT PG G R+ Sbjct: 84 QTLHQRAPSAWGWRGHRVVLADGTTALMPDTLDNQREFPQQGNQQPGLGFPIVRIVALIS 143 Query: 137 PHTCQFTDFELTDSRDAERLDR-------FAQTADEIRIADRGFGSRPECIRSLAFGEAD 189 D+ L + + ++ +ADR + + + Sbjct: 144 LGAGAVLDYALGPYQGKGSGESSLFSTLLHTLQPGDLLLADRYYCT---------YAIMA 194 Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 +V +GL A+ + RG G + + Sbjct: 195 LLVHHGVQGLFQKHAQRKP----HWHRGERLGAKDHLIKWAKPPRKPVWMSAQDY----L 246 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRW 309 LPP + L G V + T Y +A+ YR RW Sbjct: 247 KLPP-------------------TLTIRELAVNGIVYVTTLSNPKRYPRRALAEHYRSRW 287 Query: 310 QIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPR 362 IEL + +K+ + ++ LR K PE + I A+LLA L+ + + + Sbjct: 288 TIELDLRSIKTDMAMERLRCKSPERVRKEIAAHLLAYNLVRANLNRAAQCFEK 340 >UniRef50_A1HQH6 Transposase, IS4 family protein n=2 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HQH6_9FIRM Length = 400 Score = 84.1 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 57/332 (17%), Positives = 110/332 (33%), Gaps = 62/332 (18%) Query: 43 LLRLGLAYGPGGMSLREVT--AWAQLHDVATLSDVALLKRLRN--AADWFGIL--AAQTL 96 + + ++ R +L + ++S L +RLRN W + + + Sbjct: 39 VAQFLRLDSLRDIANRLTCDKQLQKLLHLTSISASTLSRRLRNIDHRVWEQVFAEVKRQI 98 Query: 97 AVRAAVTGCTSGKRLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQ--FTDF 145 +A TG +L ++D + I+ + +LH H Sbjct: 99 WQQANKTGAVRQYQLNVIDSSTITLCLRKYLWADYRKTKSGIKLHQRITIHDGNSYPDSA 158 Query: 146 ELTDSRDAERL---DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL 202 LT +R A++ + + D + + DRG+ + ++ R+ + Sbjct: 159 VLTSARKADKTVMDELVVTSPDALNVFDRGYVDYAK-WDDYCRKGIRFVSRLKSNAV--- 214 Query: 203 TAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTR 262 D++ E V +GN+ + P RLI Sbjct: 215 ------IDVLEEKSVETNQVLAEKIVRLGNAYTTQMTHPV--RLIETR------------ 254 Query: 263 LLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL 322 V+++T+ E A +++D YR+RWQIEL FK +K L Sbjct: 255 ----------------DNQGNAVIIVTN--ELTLPAAEISDIYRMRWQIELFFKWIKQHL 296 Query: 323 HLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 + I+ L+ L+ ++ Q Sbjct: 297 VVKEFFGTSQNAVYGQIWLALIGYCLLQNLQQ 328 >UniRef50_P55729 Putative transposase y4zB n=4 Tax=Rhizobiaceae RepID=Y4ZB_RHISN Length = 356 Score = 84.1 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 56/276 (20%), Positives = 98/276 (35%), Gaps = 51/276 (18%) Query: 85 ADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI--------SAPGGGSAEWRLHMGYD 136 A+ FG+LA Q LRL+D T I + G ++H+ YD Sbjct: 65 AETFGLLAGQLDRQTRREGRAM----LRLIDSTPIPLGKLCGWAKSNGRIRGMKMHVVYD 120 Query: 137 PHTCQFTDFELTDSR--DAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRV 194 P + ++TD+ DA+ A + I D+G+ ++A +A ++ R Sbjct: 121 PDSDCPRLLDITDANVNDAQIGRTIAIESGATYIFDKGYC-HYGWWTAIAEAKAFFVTRP 179 Query: 195 HWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPE 254 + + +R + G + TV + + G+ K P RL + Sbjct: 180 KSN----MGLKVVRQRRIKVAEGDGFTVIDDATVRLASKGDSKLPIPLR-RLTVKRADGD 234 Query: 255 KALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELA 314 + LLT+ A +A Y+ RWQIEL Sbjct: 235 T-----------------------------ITLLTN-DRKR-PAVAIAALYKGRWQIELL 263 Query: 315 FKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 F+ +K L + + + +FA ++A L+ Sbjct: 264 FRWIKQHLKIRSFLGNNDNAVRLQLFAAMIAYALLR 299 >UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobium RepID=B5ZZ25_RHILW Length = 381 Score = 84.1 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 67/390 (17%), Positives = 126/390 (32%), Gaps = 76/390 (19%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M + + + +L I + A R + + L+ L G +SLRE+ Sbjct: 1 MRHDNSVFHDVLKRIPWAVF-ERLVDEHQADKHVRRLSTKSQLIALLYGQLAGAVSLREI 59 Query: 61 TAWAQLHDV---------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRL 111 + H + S A LR + F L AQ +A G+ + Sbjct: 60 VGSLESHSARLYHLGARPVSRSTFADANGLRPSTV-FAELFAQMVARAGRGLKRAIGEAV 118 Query: 112 RLVDGTAISAPGGGSAEW----------RLHMGYDPHTCQFTDFELTDSRDAERL--DRF 159 L+DG+++S G ++W ++H+ YD + + +T + + Sbjct: 119 YLIDGSSLSLAGA-GSQWARFSDQACGAKMHVVYDANAERPIYAAVTPANVNDITAAKEM 177 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE-------GMRFDMM 212 A + D G+ L + R+ ++AE G+ FD + Sbjct: 178 PIEAGATYVFDLGYYD-FGWWAKLNAAGCRIVSRLKSHTKLTVSAEQAANADAGILFDRI 236 Query: 213 GFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR 272 G L + +++ P R I V + G+ Sbjct: 237 GLLPQRQ-------------AKSRRNPMNRPVREIGVRI-----------------ETGK 266 Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 V++ + + AE++A Y+ RW IEL F+ +K L + Sbjct: 267 VLRIFSNDLTA-------------PAEEIAALYKRRWAIELFFRWVKQTLKIRHFLGNSE 313 Query: 333 ELAKAWIFANLLAAFLID-DIIQPSLDFPP 361 + + L+A L+ + P Sbjct: 314 NAVRIQVAVALIAYLLLQMAKADQATVTSP 343 >UniRef50_A4J2U7 Transposase, IS4 family protein n=3 Tax=Desulfotomaculum reducens MI-1 RepID=A4J2U7_DESRM Length = 413 Score = 83.7 bits (205), Expect = 9e-15, Method: Composition-based stats. Identities = 50/305 (16%), Positives = 104/305 (34%), Gaps = 59/305 (19%) Query: 63 WAQLHDVATLSDVALLKRLRNAAD-----WFGILAAQTLAVRAAVTGCTSGKRLRLVDGT 117 ++ + ++S L ++LR+ + F + Q + R+ L+D + Sbjct: 70 FSSAVGLDSISASQLSRKLRDLSPELTQSLFSDIVHQFGTEIGFKSIRQELGRIYLIDSS 129 Query: 118 AISAP---------GGGSAEWRLHMGYDPHTC--QFTDFELTDSRDAE--RLDRFAQTAD 164 IS + +LH+ + ++ A+ ++D D Sbjct: 130 TISLCLSRYRWAEFRKTKSGVKLHLRIQLLEQGVLPDKAIIKPAKSADKTQMDALVVEKD 189 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 + + DRG+ + + ++ R+ + + F D Sbjct: 190 ALNVFDRGYLDYKR-FDNYSNNGTRFVSRLKSNAIVE--------TLEEFPTNQDSLIKK 240 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 + V++G G K P RL+ +G+ V Sbjct: 241 DHKVILGKDGTTKMQNPL-------------------RLIETEDTEGKPV---------- 271 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +++T+ + E SAE+++D YR RWQIEL FK +K + + + + L+ Sbjct: 272 -IIITN--DFELSAEEISDIYRYRWQIELFFKWIKQHFCVKHFYGLSQQAVENQLMIALI 328 Query: 345 AAFLI 349 L+ Sbjct: 329 TYCLM 333 >UniRef50_Q9X6I5 Putative uncharacterized protein n=2 Tax=Bacillus thuringiensis RepID=Q9X6I5_BACTU Length = 118 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 31/56 (55%) Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 +QV + Y LRWQIE+ FK KSL +D R + E + ++ L+A FL + Sbjct: 1 MKQVHELYSLRWQIEIVFKTWKSLFDIDHCRTVKQERIECHLYGKLIAIFLCSSTM 56 >UniRef50_C5EN31 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EN31_9FIRM Length = 148 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 33/146 (22%), Positives = 57/146 (39%), Gaps = 8/146 (5%) Query: 215 LRGLDCGKNGETTVMIG---NSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKG 271 L G G T+ + + R I ++P + S+ E R + Sbjct: 3 LTACQIGWTGGRTLSLPGQMPGKRLHPESEPLYRYICKAVPFDLITDSR----PEYRMQL 58 Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 RVV+ + E G+ ++T+LP DE+S EQ+ Y L W E +F+ LK + + + Sbjct: 59 RVVRFQIAE-GGYENIITNLPADEFSLEQIKHIYHLLWGQETSFRDLKHTIGTENFHSGS 117 Query: 332 PELAKAWIFANLLAAFLIDDIIQPSL 357 P+ + I + I Sbjct: 118 PKYIEFEILCRMTLYNFCTIITMEVP 143 >UniRef50_A9DNS7 Transposase n=1 Tax=Shewanella benthica KT99 RepID=A9DNS7_9GAMM Length = 190 Score = 80.6 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 27/85 (31%), Positives = 48/85 (56%) Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 R++ + L+T+L ++SA++V++ Y LRWQIEL FK LKS L ++ Sbjct: 34 RLIAFWDRNKSAIGYLITNLKRAQFSADKVSELYGLRWQIELFFKELKSYSGLKTFNTRD 93 Query: 332 PELAKAWIFANLLAAFLIDDIIQPS 356 +A++ ++A++L L I + S Sbjct: 94 KSIAESLVWASMLTLLLKRFIARAS 118 >UniRef50_B0NZ84 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0NZ84_9CLOT Length = 244 Score = 80.2 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 55/147 (37%), Gaps = 11/147 (7%) Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFP--------ARLIAVSLPPEKALISKTR--LLS 265 GL+ + E + + +K R IA S + + Sbjct: 26 MGLELPRRNEFDLDVSLKLTRKQTNDVKKLLKDKNHYRYIASSATFDFLPSHSRKSEQTR 85 Query: 266 ENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD 325 R+V+ E + +LTSL ++Y +++ Y LRW E +F+ LK + + Sbjct: 86 FYEINFRIVRFEITP-GNYETVLTSLDVNKYPPKELKRLYALRWGTETSFRDLKYTVGML 144 Query: 326 ALRAKEPELAKAWIFANLLAAFLIDDI 352 +K+ I+A+L+ + I Sbjct: 145 NFHSKKVMCIHQEIYAHLIIYNFSEMI 171 >UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanobacteria RepID=B0CC46_ACAM1 Length = 482 Score = 80.2 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 67/388 (17%), Positives = 125/388 (32%), Gaps = 77/388 (19%) Query: 8 WSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVT----AW 63 W+ IL L+ + R R TL + SLR W Sbjct: 37 WTDIL----PASRLEELLKEEAFSYRNRIYSPIVTLWAMLYQVLSADKSLRNTVKCITTW 92 Query: 64 AQLHDVATLS-------------DVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKR 110 + S +LL+RL + A+ LA + G+ Sbjct: 93 LTAAGIQPPSSDTGAYSKARSRFPESLLQRLIPES-------AECLAQPLSPEHLWCGRP 145 Query: 111 LRLVDGTAI-----------------SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDA 153 +++ DGT + G G RL + + T + + Sbjct: 146 VKVYDGTTVLMADSAANQASYPQHGNQTAGCGFPIARLVVFFCLVTGAVASACIASWDTS 205 Query: 154 E----RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRF 209 E RL ++ +AD+ +GS + + + AD ++R H Sbjct: 206 EIVMSRLLYQDLEVGDVVMADQAYGSYVD-LAIIQQHRADGVLRKHH------------A 252 Query: 210 DMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 F +G G + + + LI +L + + Sbjct: 253 RKTDFRKGNKHGIGDHQVTWHKPAQRPEHMSEQDFALIPQTLVVREVCLR---------- 302 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQI-ELAFKRLKSLLHLDALR 328 + + +++ T L YSA Q+ Y RW + E+ + LK+ L ++ L Sbjct: 303 ----LSLKGFRDQHIIVVTTLLDAQRYSAGQLTRLYGWRWPVAEVNLRHLKTTLKMEMLS 358 Query: 329 AKEPELAKAWIFANLLAAFLIDDIIQPS 356 AK P++ + I+ +LL L+ +++ + Sbjct: 359 AKTPDMVRKDIWVHLLGYNLLRSLMELA 386 >UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LI35_HALO1 Length = 449 Score = 80.2 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 73/360 (20%), Positives = 113/360 (31%), Gaps = 33/360 (9%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT 71 LA PE ++ + G T RR L + L G R +T D+A Sbjct: 22 LARDVAPEWIEQALEATGTATLRRRRLPMEQL--VWLVIGMALFRDRPITEVVTSLDLAL 79 Query: 72 LSD--------VALLKRLRNAADWFGILAAQTLAVRAAVTG---CTSGKRLRLVDGTAIS 120 S R R L A + A + G L VDGT + Sbjct: 80 PSPGHPEVAPSAVAQARDRLGESPMAWLFAHSADRWAHQSAADDRWRGLALYGVDGTTLR 139 Query: 121 APGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRG---FGSRP 177 P E R H G + + A R A +G + Sbjct: 140 VPDS--EENRDHFGLANGGARGSSGYPVVRLAALMALRSHLLAAVSFGPYQGHGEYWYAA 197 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKK 237 + L + R +W + + + +L G N +G S Sbjct: 198 DLWPCLPDNSLVIVDRHYWAANVLIPLQQDGLNR-HWLIRGRKGLNYRVVEQLGPSD--- 253 Query: 238 AGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYS 297 A A S PE R++ R+ R + L + L Y Sbjct: 254 ELAEVKVSPQARSKNPELPRTWTVRIIHYQRKGFRPQR----------LFTSLLDPVAYP 303 Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDA-LRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 A+++ Y RW+IEL + +KS + + LR+K + A+ I+ L+A LI + Sbjct: 304 ADELVALYHERWEIELGYDEVKSKMLANVPLRSKSVDRARQEIWGLLIAYNLIRLEMARV 363 >UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria RepID=B9BXQ1_9BURK Length = 446 Score = 80.2 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 62/370 (16%), Positives = 115/370 (31%), Gaps = 30/370 (8%) Query: 1 MNYSHDNWSAILAHIGK--PEELDTSARN---AGALTRRREIRDAATLLRLGLA----YG 51 + + D L+ + + P A ++ RRR + L + LA + Sbjct: 12 LTFMLDAEPTDLSRLAEHLPHAWIEQAIEATGTASIRRRRLPAEQVVWLVIALAIYRHWS 71 Query: 52 PGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTS---G 108 + + S V R R L QT G G Sbjct: 72 VSEVVDSLELVLPNETTFVSKSAVT-QARQRLGHAPIAWLFEQTAQAWCKQDGARHAFKG 130 Query: 109 KRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRI 168 L +DGT + P + R H G + A L + Sbjct: 131 LSLWAMDGTTLRTPDSAAN--REHFG--SQSYASGKVASYPQMRAVTLTSIPTH----LV 182 Query: 169 ADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTV 228 A+ FG + + ++ L + +++ L + ++ Sbjct: 183 ANIAFGRYDTN---EMIYAKNLLAQIPDHSLTLFDKGFLAAEILCGLNSGERNRHFLIPA 239 Query: 229 MIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLL 288 +G P A L+ + + P+ + R V+ + + VLL Sbjct: 240 KSNTRWEVLSGKPDDA-LVRMRVSPQA---RQKCPDLPEWWTARAVRIQDAQGRERVLLT 295 Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLK-SLLHLD-ALRAKEPELAKAWIFANLLAA 346 + + + CY RWQIE +++ LK S+L + LR++ + I+ L+A Sbjct: 296 SLTDRRRFKLADLVACYERRWQIEASYRELKQSMLGSELTLRSRTVDGIYQEIWGALIAY 355 Query: 347 FLIDDIIQPS 356 LI + + Sbjct: 356 NLIRREMACA 365 >UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=Vibrio vulnificus RepID=Q7MGY3_VIBVY Length = 441 Score = 80.2 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 45/288 (15%), Positives = 94/288 (32%), Gaps = 34/288 (11%) Query: 77 LLKRLRNAADWFGILAAQTLAVRAAVTGCT--SGKRLRLVDGTAISAPGGGSAEWRLHMG 134 + R R AD + Q+ ++ G +L VDG P Sbjct: 93 VQARQRLGADAMKEVFHQSQSLWNETADHPTWCGLKLLAVDGVVWRTPDTKENRDAFQSA 152 Query: 135 YD-------PHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGE 187 + P ELT + +E+ +A++ Sbjct: 153 SNQNGEGSFPQVRMVCQMELTSHMLVASAF-ASYKTNEMILAEQ---------------- 195 Query: 188 ADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 I L ++ ++ + + R++ Sbjct: 196 --LIETTPDYSLTMFDRGFYSLSLLHRWANTGNERHWLMPMRKNTQFTEVRKLGRNDRIV 253 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 + P+ K L + R+++ +T++ +L + Y ++A+ Y Sbjct: 254 ELKTTPQA---RKKSLSLPETIEVRLIK-KTIKGKEVSILTSMTDHRRYPPAEIAELYSH 309 Query: 308 RWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDII 353 RW+IE+ ++ +KS L + LR+K+PE+ K ++ LL+ +I + Sbjct: 310 RWEIEVGYREMKSSLLNNEFTLRSKKPEMVKQELWGLLLSYNIIRYQM 357 >UniRef50_D1K7L7 Transposase n=3 Tax=Bacteroidales RepID=D1K7L7_9BACE Length = 389 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 60/333 (18%), Positives = 107/333 (32%), Gaps = 70/333 (21%) Query: 46 LGLAYGPGGM--SLREVTAWAQLHDV--------ATLSDVALLKRLRNAADWFGILAAQT 95 L + +G S+R++ + H AT+S L K RN A T Sbjct: 43 LCMIFGQLTARDSMRDLMLSLEAHKNKYYHLGFGATVSRTNLGKANRNRDYRIYEEFAYT 102 Query: 96 LAVRAAVTGCTSGKRLR------LVDGTAISAPGGGSAEW-----------RLHMGYDPH 138 L A + ++ D + I + W +LH YD Sbjct: 103 LIAEARNNYNKNDFEVKVDSNVYAFDSSTIDL--CLNVFWWAEFRKHKGGIKLHTLYDVK 160 Query: 139 TCQFTDFELTDSR--DAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHW 196 T T +T+++ D LD + I D+G+ + L A ++ R Sbjct: 161 TSIPTIVLVTNAKVHDVNMLDELSYEKGSFYIMDKGYVDFTR-LHKLHTCGAYFVTRAK- 218 Query: 197 RGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKA 256 +R+ D ++ G Sbjct: 219 NNMRFRRMYSCEVDKTTGIKCDQIG----------------------------------- 243 Query: 257 LISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFK 316 ++ + L K R ++ E + +T+ E SAE++A Y+ RWQ+EL FK Sbjct: 244 MLETYKSLKAYPNKLRRLKYYDEELDREFVFITN--NMELSAEEIALLYKNRWQVELFFK 301 Query: 317 RLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 +K L + + K ++ ++ L+ Sbjct: 302 WIKQHLKVKSFWGTTMNAVKTQVYCAIITYCLV 334 >UniRef50_D1N0Z4 Transposase IS4 family protein n=3 Tax=Bacteria RepID=D1N0Z4_9BACT Length = 384 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 63/375 (16%), Positives = 111/375 (29%), Gaps = 67/375 (17%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVT 61 N S + +L I K E + R++ A + + SLRE+ Sbjct: 3 NTSVSLFRQVLDLIPKREF-EEIVMKHNGDKRKQSFDSWAHFVSMIFCQLAQANSLREIC 61 Query: 62 AWAQLHD----------VATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG--- 108 + T S+++ + FG + L A+ Sbjct: 62 GGLKTCGGKLNHLGVESAPTKSNLSY-ANAHRSPKMFGDIFHMLLGHCHAIAPRHEFSFP 120 Query: 109 KRLRLVDGTAISA-----------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLD 157 K+L +D T I G+ + + + +D H F DF D + Sbjct: 121 KKLYSLDATLIELCVKVFPWATYRQTKGAIKLNMLLDHDGHLPVFVDFTNGDVHEVNSAR 180 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 R D + + DRG+ + D++ R+ + Sbjct: 181 RMELPRDSMVVCDRGYVD-FSMLYKWNLSGVDFVTRLKTNATYDIP-------------- 225 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 E V + G +I + + + R V Sbjct: 226 -------EYDV------KQYPGTVLSDEVIFLR-----------GSQDKYPERLRKVVVC 261 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 +E + LLT+ E A+ + D Y+ RWQIE FK LK + + Sbjct: 262 DVENHRTLTLLTN--NFELDAQTIGDIYKARWQIESFFKMLKQNFKIKTFIGTSENAVRI 319 Query: 338 WIFANLLAAFLIDDI 352 ++ L+A L + Sbjct: 320 QVWTALIAILLTKYL 334 >UniRef50_Q877R2 Transposase n=51 Tax=Bacteroidales RepID=Q877R2_BACTN Length = 387 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 60/333 (18%), Positives = 108/333 (32%), Gaps = 67/333 (20%) Query: 42 TLLRLGLAYGPGGMSLREVTAWAQLHD----------VATLSDVAL--LKRLRNAADWFG 89 LL L SLR++ + H + S +A R + + + Sbjct: 41 QLLALMFGQLSNRESLRDLIVALEAHHSKCYHLGMGKNVSKSSLARANQDRDYHIFEEYA 100 Query: 90 ILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEW-----------RLHMGYDPH 138 + A G + D T I S W ++H YD Sbjct: 101 YYLVSEARQKCANHIFKLGGNVYAFDSTTIDL--CLSVFWWAKFRKKKGGIKVHTLYDVE 158 Query: 139 TCQFTDFELTDS--RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHW 196 T F +T++ D++ + I DRG+ + + + + EA ++VR Sbjct: 159 TQIPAFFHITEASVHDSKVMIEIPYEPSSYYIFDRGY-NNFKMLYKIHQIEAYFVVRAKK 217 Query: 197 RGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKA 256 +++ + + R L + +V++ K+ Sbjct: 218 N---------LQYKSIQWKRRLPKNVLSDASVLLTGFYPKQYYP---------------- 252 Query: 257 LISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFK 316 + R+V+ E +T+ SA QVA+ Y+ RWQ+EL FK Sbjct: 253 ------------KPLRLVKYWDEEQEREFTFITN--AMHISALQVAELYKNRWQVELFFK 298 Query: 317 RLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 LK L + + I+A + A L+ Sbjct: 299 WLKQHLKIKRFWGTTENAVRIQIYAAICAYCLV 331 >UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomycetales RepID=A8M893_SALAI Length = 451 Score = 79.1 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 73/367 (19%), Positives = 105/367 (28%), Gaps = 73/367 (19%) Query: 23 TSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVA--TLSDVAL-LK 79 + R R R LL G + G A L + SD AL Sbjct: 39 AATRRTQRRVRLLPARVVVYLLLAGCLFADCGYRQVWAKLVAGLRGLPVADPSDSALRQA 98 Query: 80 RLRNAADWFGILA---AQTLAVRAAVTGCTSGKRLRLVDGTAISAP-------------- 122 R R L A A G +VDGT I+ Sbjct: 99 RQRLGPAPLRALFDLLRGPAATSAVAAVRWRGLLPVVVDGTMIAVADSPANLGRYGKHRC 158 Query: 123 ---GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQT----ADEIRIADRGFGS 175 G G RL T D S E T A + +ADR + + Sbjct: 159 NNGGSGYPTLRLSALLTCGTRSVIDAVFDPSTTGEITQAHRLTRSLRAGMLLLADRNY-A 217 Query: 176 RPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGN 235 + I + AD ++R G + M R G+ + Sbjct: 218 AADLIGAFTATGADLLIRCKS---------GRKLPMTRRCRD-------------GSWLS 255 Query: 236 KKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDE 295 G P R+I + + + L+ T L Sbjct: 256 VIDGQPV--RIIEARIS--------------------ITTTAGSHTGDYRLITTLLDPRR 293 Query: 296 YSAEQVADCYRLRWQIELAFKRLKS-LLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 Y A + Y RW+IE A+ LKS +L LRA+ P+ I A L+ ++ + Sbjct: 294 YPAADLVRLYHQRWEIETAYLELKSTILGGRVLRARTPDGVDQEIHALLIVYQVLRTAMV 353 Query: 355 PSLDFPP 361 + D P Sbjct: 354 DATDSRP 360 >UniRef50_C9KS84 Transposase domain protein n=5 Tax=Bacteroidales RepID=C9KS84_9BACE Length = 407 Score = 78.7 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 66/363 (18%), Positives = 113/363 (31%), Gaps = 75/363 (20%) Query: 23 TSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLRE--VTAWAQLHDVATLSDVALLKR 80 +R G + LL + A SLRE T ++ + + + KR Sbjct: 27 EISRKHGGERYVKSFDGYTHLLTMLYAVIMRFDSLREIETTMITEVRKLHHVGIERIPKR 86 Query: 81 -------LRNAADWFGILAAQTLAVRAAVT---GCTSG-----KRLRLVDGTAISA---- 121 R + +F + +G KRLR++D T IS Sbjct: 87 STLSDANARRSEKFFEEVYHNLYEANKEKLTSDSRRNGTEEWIKRLRIIDSTTISLFSNA 146 Query: 122 ----------PGGGSAEWRLHMGYDPHTCQFTDFELTDS--RDAERLDRFAQTADEIRIA 169 G ++H + D + T + D+ L DEI Sbjct: 147 IFKGVGRHPKTGRKKGGIKVHSVIHANEGVHCDVKFTSAATNDSFMLAPNHFRHDEIVAL 206 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 DR + + + +L Y+ ++ + + + NGE Sbjct: 207 DRAYINYAK-FEALTERNVVYVTKMKKNLVY-----------DTLVDCMYQNNNGEMEYR 254 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 +K G AR+I + +KG+ + + LLT Sbjct: 255 EQVVVFRKDGINHIARIITY----------------VDVKKGKQPKLIS--------LLT 290 Query: 290 SLPEDEYSAE--QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 + ++ E + YR RWQIE FK++K L + K I+ L+A Sbjct: 291 N----DFDMELETIVAIYRRRWQIESLFKQIKQNFPLRYFYGESANAIKIQIWVTLIANL 346 Query: 348 LID 350 L+ Sbjct: 347 LLS 349 >UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria RepID=Q12AI7_POLSJ Length = 458 Score = 78.7 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 53/287 (18%), Positives = 90/287 (31%), Gaps = 61/287 (21%) Query: 94 QTLAVRAAVTGCTSGKRLRLVDGTAI-----------------SAPGGGSAEWRLHMGYD 136 + L +A G+ ++LVDGT I APG G RL M Sbjct: 120 RLLHEKALAQWLWRGRAVKLVDGTGISMPDTPENQERYPQPSTQAPGVGFPLARLVMVIC 179 Query: 137 PHTCQFTDFELTDSRDAERLDRFAQ-------TADEIRIADRGFGSRPECIRSLAFGEAD 189 T D + + ++ +AD + + I SL D Sbjct: 180 LATGAALDMAVGPHSGKGSGELGLVRRLLAGFCPGDVMLADALYCNYFL-IASLMAAGVD 238 Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 + G R + L + P P R Sbjct: 239 VL----------FEQNGSRITDFRRGQSLGPRDHI-------------VRWPKPPR---- 271 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRW 309 P T E + ++ A VL+ T L + S ++ Y RW Sbjct: 272 --PEWMTPEQYTGFPDE-------LTVREVKVAHQVLVTTLLDYRKVSKNDLSALYARRW 322 Query: 310 QIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 +EL + LK+ +D L + P++ + ++ +LLA +I ++ + Sbjct: 323 NVELDLRNLKTTTGMDVLSCQTPQMNEKQLWVHLLAYNVIRLLMAQA 369 >UniRef50_C3R0J9 Transposase n=4 Tax=Bacteroidales RepID=C3R0J9_9BACE Length = 424 Score = 78.7 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 61/339 (17%), Positives = 109/339 (32%), Gaps = 66/339 (19%) Query: 41 ATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLK----------------RLRNA 84 + LL L G +S+R++ + H +++ + + K R+ Sbjct: 40 SQLLHLLFGQITGCVSIRDICLCLEAHG-SSIYHLGIRKSVNQSNLCRANEKRDYRIYEG 98 Query: 85 ADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP--------GGGSAE-WRLHMGY 135 + I + + VT T L +D T IS G S ++H Sbjct: 99 LGMYLISIVRPMYSNTKVTEITIDNVLYALDSTTISTSIVLAAWALGKYSKGAVKMHTLL 158 Query: 136 DPHTCQFTDFELTDSR--DAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVR 193 D + +TD + D+ LD A + D+ + R A +I R Sbjct: 159 DLRGSIPANIHITDGKWHDSNELDEIVPEAFAFYMMDKAYVDFIALFR-FHKAGAYWISR 217 Query: 194 VHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPP 253 + + FD G G+ + + +KK P P R++ Sbjct: 218 PKDNMRYEVVNHRLDFDP-------STGICGDFIIKLTTHKSKKLY-PEPIRMVTY---- 265 Query: 254 EKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIEL 313 V +T+ E SA +V + YR RW IE+ Sbjct: 266 -----------------------HDSVTGNDVEFITN--NFEISAIEVTNLYRHRWDIEV 300 Query: 314 AFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 FK +K + + L + ++ ++A +I I Sbjct: 301 FFKWIKQNIVVKNLWGYSENAVRTHLWVAIIAYLIIAKI 339 >UniRef50_Q4V0X3 Possible transposase n=1 Tax=Bacillus cereus E33L RepID=Q4V0X3_BACCZ Length = 89 Score = 78.3 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 1/77 (1%) Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 +V + +G + + + D QV D Y LRWQI++ FK K H+ R + Sbjct: 10 MVTPHSKRLSGINVYMINTSTDIVPMGQVHDWYSLRWQIKILFKTWKLFFHIHHCRKIKQ 69 Query: 333 ELAKAWIFA-NLLAAFL 348 E + +++ +L+A +L Sbjct: 70 ERLEYYLYGLSLIAIWL 86 >UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultured bacterium BLR12 RepID=C0ING1_9BACT Length = 337 Score = 78.3 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 54/286 (18%), Positives = 89/286 (31%), Gaps = 56/286 (19%) Query: 107 SGKRLRLVDGTAISAPG------------------GGSAEWRLHMGYDPHTCQFTDFELT 148 +G RL +DG+ PG + R + YD D ++ Sbjct: 16 NGLRLLAIDGSTAVLPGHKSITEEFGITNFGPYANSPRSVARTSVLYDVLNLTVLDGQID 75 Query: 149 DSRDAERL---DRFAQ--TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLT 203 ER FAQ A ++ + DRG+ S Y++R+ Sbjct: 76 RYDSCERNLARQHFAQVKPATDLLLFDRGYPSLGLMFEM-QAQGIHYLIRMREDW----- 129 Query: 204 AEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRL 263 +L NGET + LP + + Sbjct: 130 ----------WLDVRKMLANGETDKEV-----------------TFKLPATERDLLNKYA 162 Query: 264 LSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH 323 ++ K R+V + E VL + + ++ E AD Y RW IE A+K K + Sbjct: 163 TKNDKFKCRLVAVQLPEGGTEVLCTSIINKEILPYECFADLYHCRWNIEEAYKLFKCRVQ 222 Query: 324 LDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKK 369 L+A K K FA + + P + + + + Sbjct: 223 LEAFSGKTAIAVKQDFFAKIFMMTTTAVLAFPVEEQIKQECQNSTR 268 >UniRef50_UPI0001C4271A transposase, IS4 family protein n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C4271A Length = 399 Score = 78.3 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 52/274 (18%), Positives = 83/274 (30%), Gaps = 54/274 (19%) Query: 88 FGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP---------GGGSAEWRLHMGYDP- 137 F L A RL ++D T +S A RLH+ Sbjct: 84 FHYLVLNIQAKMKQSPIIREIGRLHVIDSTTMSMSVSQYPWATFRKTKAGIRLHLRVVVT 143 Query: 138 ----HTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVR 193 + + + D +D I + DRG+ + L + +I R Sbjct: 144 KELTLPDKGILLPAKHADRTQMGDLIEMDSDAIHLFDRGYIDYKQ-FDHLCLHDVRFITR 202 Query: 194 VHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPP 253 + + +E + + V +GNS N P RLI Sbjct: 203 LKKNAQVEVLSEQIP--------QAGSPIVKDQEVFLGNSQNGTKMTH-PLRLI------ 247 Query: 254 EKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIEL 313 +G VV +++T+ + SAE++ D YR RW+IE Sbjct: 248 -----------ETQDSQGNVV-----------MIVTNC--FDLSAEEIGDLYRYRWKIET 283 Query: 314 AFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 FK +K L K I+ L+ Sbjct: 284 FFKWMKQHLTFKTFYGKSENAVCNQIWVALITYC 317 >UniRef50_B6FVR6 Putative uncharacterized protein (Fragment) n=2 Tax=Clostridium nexile DSM 1787 RepID=B6FVR6_9CLOT Length = 286 Score = 77.9 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 47/279 (16%), Positives = 88/279 (31%), Gaps = 34/279 (12%) Query: 48 LAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTS 107 L++G + + T+S + +R + + L + Sbjct: 1 LSFGSNSLGHEIGEFFEYRKGFPTVSAF-VQQRKKLSYTALEHLFYRFNECTFKKPVLYK 59 Query: 108 GKRLRLVDGTAISAPGGGS----------AEWRLHMGYDPHTCQFTDFELTDS------- 150 RL +DG+ S P + L+ +D + F D + Sbjct: 60 NYRLLAIDGSDFSLPYNSQEDNVMGDNHFSTLHLNALFDVCSKSFLDVIVQKGLHENETG 119 Query: 151 RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFD 210 E +DR ++ I +ADRG+ + + DY+VRV D Sbjct: 120 AACELVDRISEKHPVIIMADRGYENYNL-FAHIEERLFDYVVRV------------RDSD 166 Query: 211 MMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK--TRLLSENR 268 + GL+ K E + + P +K+ + Sbjct: 167 NSCMVSGLNLPKTVEYDITKRVVLTRHFSGPAAINTEKYKYLSKKSRFDYIENSKSPDYE 226 Query: 269 RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 R V+ L+ + ++ TSLPE+ +S E + + Y Sbjct: 227 ITIRFVRF-LLDDNTYEVIATSLPEEIFSMEDLKEIYHR 264 >UniRef50_C5V7Z6 Transposase IS4 family protein n=3 Tax=root RepID=C5V7Z6_9PROT Length = 389 Score = 77.5 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 66/378 (17%), Positives = 116/378 (30%), Gaps = 70/378 (18%) Query: 6 DNWSAILAHIGKPEELDTSARNAGA-------LTRRREIRDAATLLRLG---LAYGPGGM 55 +++ + + + + LTRR +RD L L + Sbjct: 18 NHFEYLTERFAANHGIKHFSAWSQFICMAYAQLTRRDGLRDLVACLNSQKSKLYHIG--- 74 Query: 56 SLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVD 115 +R + + L D D L + L + +++ R G + L +D Sbjct: 75 -IRSKVSRSTLADANERRDWRLFEALGH-----RLISIALELYRDEDIGLGLKEPLYAMD 128 Query: 116 GTAI---------SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSR--DAERLDRFAQTAD 164 T I + A + H D +T + D LD A Sbjct: 129 STTIDLCLTLFPWAEFRSTKAAVKAHTIIDLRGSIPVFLSITTGKVHDVNLLDVIPFPAG 188 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 I + DRG+ + +L + +++R Sbjct: 189 TIVVIDRGYLHFAR-LYALHQRQVTFVIRAKNNLRF------------------------ 223 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 I + KA + I ++ P K + R V E H Sbjct: 224 ---TWIASREVDKATGLRCDQTILLATPKSKTAYPER---------LRRVSFRDPETGKH 271 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 ++ LT+ + A +A+ Y+ RWQIEL FK LK L + K+ I+ + Sbjct: 272 LVFLTN--RFDLPALTIANIYKNRWQIELFFKWLKQNLAIKHFYGNSLNAVKSQIWIAIC 329 Query: 345 AAFLIDDIIQPSLDFPPR 362 L+ I + L+ P Sbjct: 330 VYLLV-SIAKKQLNLPAS 346 >UniRef50_B3JNI1 Putative uncharacterized protein n=3 Tax=Bacteroides coprocola DSM 17136 RepID=B3JNI1_9BACE Length = 389 Score = 77.5 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 67/374 (17%), Positives = 107/374 (28%), Gaps = 65/374 (17%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 MN +S + + + K + + T+ I LL L G SLRE+ Sbjct: 1 MNIKKYVFSQMTSFLPK-RYFERLVEKSNDRTKSWSISFWNQLLVLIFGQLDGCNSLREL 59 Query: 61 TAWAQLHDV--------------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCT 106 T H +TLS +L R + F + Sbjct: 60 TDITIAHSSKSYHLGFGKTPITRSTLSKANML-RNYRVFESFAYHMVNLAQQKRIDKEFD 118 Query: 107 SGKRLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDS--RDAER 155 D T I + ++H D T T F +TD+ D Sbjct: 119 LNGTFYAFDSTTIDLCLSLYDWARFRSTKSGIKVHTQLDIRTEISTSFTITDAVVHDVNA 178 Query: 156 LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 +D A I DRG+ + + + +++R R +TA L Sbjct: 179 MDSIAYEPFACYIFDRGYFDLRR-LYHINEVSSFFVIREKRRPKYEITAG------EDVL 231 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 G D +T G + R I P Sbjct: 232 EGTDNVLQDQTIRFTGERNCTNYPSEI--RRIVYYSP----------------------- 266 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 E T+ A +A Y+ RW++EL FK LK L + + Sbjct: 267 ----EMNRTFTYYTN--NFYLKASDIALLYKNRWKVELFFKFLKQHLRVKSFWGNSENAV 320 Query: 336 KAWIFANLLAAFLI 349 + I+ ++ L+ Sbjct: 321 RIQIYVAIITYCLV 334 >UniRef50_Q3M8C5 Transposase, IS4 n=15 Tax=Cyanobacteria RepID=Q3M8C5_ANAVT Length = 340 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 54/304 (17%), Positives = 96/304 (31%), Gaps = 66/304 (21%) Query: 64 AQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPG 123 D++T S L + + + + L L A + + +D T I+ Sbjct: 57 GFELDISTFSKANLHRSQKPFQEIYQKL--NKLVQNKAENKLHNKYAICPIDSTVITLTS 114 Query: 124 G-----GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDR----FAQTADEIRIADRGFG 174 G + +L + T D + D + + + + DRGF Sbjct: 115 KLLWVLGHHQVKLFSSLNLATGSPEDNLINFGHDHDYKFGSKMIANLPTNAVGVMDRGFA 174 Query: 175 SRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSG 234 + I+ L +++R+ Sbjct: 175 G-LKFIQELVQENKYFVLRIKNN------------------------------------- 196 Query: 235 NKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPED 294 L E++ S++ + RV+ LE L+T+LP D Sbjct: 197 --------------WKLEFEESSGLIKVGASDDAQAYRVINFCDLETKTEFRLVTNLPAD 242 Query: 295 ---EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDD 351 S + + D Y LRW +EL +K LK L LD L K I+ +L+A ++ Sbjct: 243 GEATVSDDDIRDIYLLRWGVELLWKFLKMHLKLDKLITKNVNGITIQIYVSLIAYLILQL 302 Query: 352 IIQP 355 + P Sbjct: 303 VSIP 306 >UniRef50_C4Z764 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=C4Z764_EUBE2 Length = 236 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 67/185 (36%), Gaps = 15/185 (8%) Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 +AD G+ S L + +++R+ + + D T Sbjct: 1 MADSGYESFNT-FAHLIWKGMYFVIRMKDINSNGILSSYDLPDSE--------FDTHIRT 51 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 + + G P ++ S + + + R+V+ L+ ++ + Sbjct: 52 TLTRRHTKETLGNPNTYTILQPSTDFDFLDENCMY----YDIEFRIVRVH-LDNGTYICI 106 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 T+L +E+ E++ Y +RW E +F+ LK + L + + + I A+++ Sbjct: 107 ATNLS-EEFPLEEINKLYLMRWSEETSFRELKYTIGLINWHSSKYDGILQEINAHMILYN 165 Query: 348 LIDDI 352 + + Sbjct: 166 FCELV 170 >UniRef50_UPI00016C3BAC transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3BAC Length = 218 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 1/87 (1%) Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 + RV + V+ T Y+ E +A Y RW++EL + +K L +D L Sbjct: 19 RFRVKRP-GYRTREIVVATTLTDATAYTREDLAQLYHHRWRVELWIRDIKQTLAMDVLGG 77 Query: 330 KEPELAKAWIFANLLAAFLIDDIIQPS 356 K PE+ + I+ +LLA ++ +I + Sbjct: 78 KTPEMLRREIWCHLLAYNVVRHVIAQA 104 >UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B70E Length = 479 Score = 76.4 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 55/312 (17%), Positives = 106/312 (33%), Gaps = 55/312 (17%) Query: 71 TLSDVALLKRLRN-AADWFGILAAQTLAV--RAAVTGCTSGKRLRLVDGTAISAPGGGSA 127 +S AL K +R + F L Q ++ ++ L DGT + P Sbjct: 79 RISKQALNKAIRKLNPNVFTYLINQFASIYYSTSLPKKYRDHLLIAEDGTYMEIPYNMLN 138 Query: 128 ------EWRLHMG---------------YDPHTCQFTDFELTDSRDAE---------RLD 157 H+ YD F DF L + +E R Sbjct: 139 INEFQFALGCHVRNMFDVKKVQSKAGGLYDVTNGLFIDFSLRQAPYSETPLAFAHLYRTR 198 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 + I +ADR + E I L Y++R F + Sbjct: 199 EMLENQKVIYLADR-YYGSAEIISHLEDLRYSYVIRGKSN----------------FYKK 241 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 G + I ++K F A L E + + +R+ R + Sbjct: 242 QVAGMESD-DEWIEVEVDEKWLKRFRFSPEAKKLRKENPTLKIRVI----KREYRYTDNK 296 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 E +++ T+L + ++ +++ + Y RW IE+++K +K+ ++ + + ++A+ Sbjct: 297 NKEHCENLIYFTNLSSESFTTDEIMEIYSRRWDIEVSYKTMKTTQEVERHISSDGDVARN 356 Query: 338 WIFANLLAAFLI 349 I+A +L + Sbjct: 357 DIYAKVLFHNIA 368 >UniRef50_Q1VPP4 ISPg4, transposase n=7 Tax=Bacteria RepID=Q1VPP4_9FLAO Length = 411 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 88/256 (34%), Gaps = 46/256 (17%) Query: 108 GKRLRLVDGTAISAPGGG---------SAEWRLHMGYDPHTCQFTDFELTDSRDAER--L 156 GK ++L+D + IS +LH +D + +T+++ +R L Sbjct: 132 GKVVKLIDSSTISLCLAMFDWAEFRTAKGGIKLHTSWDYNLMIPDVVNITEAKVHDRYGL 191 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 + D I + DR + + + ++ R+ L + Sbjct: 192 KQLIFPKDTIIVEDRAYFDFELMLNRIKAENV-FVTRIKSNTLY------------ETIE 238 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 L+ + + ++ +G + ++ K R+V Sbjct: 239 ELELADDVDQHILKDEIIQLTSGRAIETGI--------------------SKHKLRLVHV 278 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 + + ++T+ + ++ +A Y+ RW IEL FK LK L + K Sbjct: 279 YKEDENKVIAIITN--QLDWEYNTIAALYKKRWDIELFFKALKQNLQVKTFWGTSENAVK 336 Query: 337 AWIFANLLAAFLIDDI 352 + I+ L+ L++ I Sbjct: 337 SQIYVALINYLLLELI 352 >UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZNH0_9PLAN Length = 451 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 54/264 (20%), Positives = 94/264 (35%), Gaps = 30/264 (11%) Query: 109 KRLRLVDGTAISA--------PGGGSAEWRLHMGYDPHTCQFTDFELTDS--------RD 152 +RL VDG+ ++A +WR H Q +LT+ RD Sbjct: 157 ERLIAVDGSVLTALPQIVGRIAAKEKGQWRFHALVHVLDGQPVASKLTEEPSAKGRAERD 216 Query: 153 -------AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE 205 A+++D + + DRG+ S E + DYI R++ + L Sbjct: 217 VLAEMIAADQIDIPQSDEGHLFLMDRGYRSA-ELFNKIHTAGHDYICRLNRTDGKLLKPP 275 Query: 206 GMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLS 265 L + I G A I P + + Sbjct: 276 KKGEVREPI--QLPPLSAEAIAMGIVADELITMGGNCGASKIGSDHPMRRIKLIPPADRP 333 Query: 266 ENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD 325 + R+GRV +T VL T + +AE++ Y RW++EL F+ LK +L Sbjct: 334 SSARQGRVRTDQTGRDE-LVLATTLMD---LTAEEIVRLYEHRWEVELFFRFLKQVLGCK 389 Query: 326 ALRAKEPELAKAWIFANLLAAFLI 349 L + + + ++ ++A+ L+ Sbjct: 390 KLLSAKTAGVQIQLYCAIIASLLL 413 >UniRef50_B3E6V4 Transposase IS4 family protein n=8 Tax=Proteobacteria RepID=B3E6V4_GEOLS Length = 372 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 56/362 (15%), Positives = 102/362 (28%), Gaps = 72/362 (19%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDV-- 69 + K E +T AR + R + + + G SLR++ ++ Sbjct: 11 VVRFFKRHEFETLARKHHVGQQFRSFSRWSQFTAMLVGQLTGRKSLRDLVDNLKVQGHKL 70 Query: 70 ----ATLSDVALLKRLRNAADWFGILAAQTL-------AVRAAVTGCTSGKRLRLVDGTA 118 + L R+ L + A +L L+D T Sbjct: 71 YHLGTRDVPRSTLARVNEEQP--HQLYKELFHKLLGRCQAIAPKNRFKLDAKLYLLDATV 128 Query: 119 ISA-----PGGG----SAEWRLHMGYDPHTCQFTDFELTDSRDAERL--DRFAQTADEIR 167 I+ P +LH+G F++T ++ E Sbjct: 129 INLCLKVFPWASYQKAKGAIKLHVGLSADGYLPEFFDVTTGKEHEINWARLLKLPTGSFV 188 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 + DRG+ + ++L ++ R+ L + G L + NG Sbjct: 189 VFDRGYTDY-DWYQALMDSSIFFVARLKDNALVEYFKKRPGRRSQGVLTDQEISLNGI-- 245 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 + R+V + + + Sbjct: 246 ----------------------------------------KGSLRLVHFVAEDGNEYRFV 265 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 T+ + A VA+ Y+ RWQIEL FK +K L + A I+ L Sbjct: 266 -TN--ANHIPAALVAELYKERWQIELFFKWIKQNLKIKAFYGTSENAVLTQIWIALCVYL 322 Query: 348 LI 349 ++ Sbjct: 323 VL 324 >UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1T817_9BURK Length = 448 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 64/363 (17%), Positives = 114/363 (31%), Gaps = 31/363 (8%) Query: 7 NWSAILAHIGKPEELDTSARNAG--ALTRRREIRDAATLLRLGLAYGPGGMSLREVT--- 61 W + H+ P E A A A RRR + + + S+ EV Sbjct: 23 EWGRLGQHL--PYEWIEYAVQASGSASVRRRRLPAQQVVWLVIALALYRHQSISEVVDEL 80 Query: 62 -AWAQLHDVATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAVTG---CTSGKRLRLVDG 116 D + +S A+ R R A L ++ A A G L +DG Sbjct: 81 DLALPAADASFVSKSAIAQARQRIGAAPLAWLFHESAANWVAQDQAKHLFKGFSLFAMDG 140 Query: 117 TAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSR 176 T + + H + +L + D + G Sbjct: 141 TTLRTADSAANRRHFGASAAAHGRIGSYPQLRAVTLTALATHLVR--DAVF----GPYDI 194 Query: 177 PECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK 236 E I + I RV + + ++ L ++ Sbjct: 195 NEMI-----WARELIARVPANSITVFDKGFLSAQLLCNLVSGGENRHFIIPAKANTCWEV 249 Query: 237 KAGAPFPARLIAVSLPPEKALISKTRLLSENRR-KGRVVQAETLEAAGHVLLLTSLPEDE 295 +G P + + + + P+ ++ + + R V A +LL + Sbjct: 250 VSGGPGD-QTVRMRVSPQ----ARAKCPDLPEFWQARAVLALDARGRQRILLTSLTDRRR 304 Query: 296 YSAEQVADCYRLRWQIELAFKRLK-SLLHLD-ALRAKEPELAKAWIFANLLAAFLIDDII 353 + A + CY RWQIE ++ LK S+L ++ LR++ E + L+A LI + Sbjct: 305 FKAVDIVSCYERRWQIETSYHELKQSMLGMELTLRSQTVEGVYQEFWGALIAYNLIRLEM 364 Query: 354 QPS 356 + Sbjct: 365 AKA 367 >UniRef50_C9C7H0 Transposase n=5 Tax=Enterococcus faecium RepID=C9C7H0_ENTFC Length = 373 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 59/374 (15%), Positives = 118/374 (31%), Gaps = 63/374 (16%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY-------GPG 53 + S W + + G E + + +++ + + +L + Sbjct: 6 VKTSFQKWFSAINFSGLSENSQSLISDFDFYSKKLDFQTTLKVLLHAVYEELPSYREIDR 65 Query: 54 GMSLREVTAWAQLHDVATLSDVALLKRL-RNAADWFGILAAQTLAVRAAVTGCTSGKRLR 112 + + + + +L +L +R + + Q +A +A + L+ Sbjct: 66 AFLDQRLC---KELGIDSLCYSSLSRRAPEIKQEVLMEIFTQLVARISAQQPSSKTTSLQ 122 Query: 113 LVDGTAISAPGG---------GSAEWRLHMGYDPHTCQ---FTDFELT--DSRDAERLDR 158 L+D T I + +LH+ F +T D L+ Sbjct: 123 LIDSTTIPLNKAWFPWAKFRKTKSGIKLHLNLCYLDKTNQYPESFTMTNASEHDRNHLEV 182 Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 + DRG+ + + L ++ R + L Sbjct: 183 LVDKTQATYVVDRGYFDY-KLLDKLNRDGYFFVTRTKSNT---------------KITIL 226 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 D + +TT G + + ++ + R+V T Sbjct: 227 DQIEVADTTTRDGTIISDQQVILVGG-------------------VNHVTERFRLVTVLT 267 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 + + ++T+L + S +VAD Y+ RWQIEL FK LK L + L + + A Sbjct: 268 -KGQKILRMVTNL--FDVSPNEVADMYQARWQIELLFKHLKQNLTIKRLYSHSEQGAINQ 324 Query: 339 IFANLLAAFLIDDI 352 + L+A L I Sbjct: 325 VILTLIATLLTYVI 338 >UniRef50_C6J7R2 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J7R2_9BACL Length = 399 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 61/342 (17%), Positives = 110/342 (32%), Gaps = 74/342 (21%) Query: 33 RRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ--------LHDVATLSDVALLKRLRNA 84 R R++ ++LL A S E++ + L + ++S L ++++ Sbjct: 30 RARKLFVGSSLLLFIEAQLQQRESYAEMSEHLEANEDFQAILGGLESISPSQLSRKMKKL 89 Query: 85 A-DWFGILAAQTLAVRAAVTGCTSGK-----RLRLVDGTAISAP---------GGGSAEW 129 + +L Q +T G +L ++D T I+ P + Sbjct: 90 PLENLHLLFMQVTRQIQQLTENKPGITTKIGKLAIMDSTQITLPAILSKWAYCSASNHGV 149 Query: 130 RLHMGY---DPHTCQFTDFELT--DSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLA 184 ++H D T + D D E F + + DRG+ + ++ Sbjct: 150 KMHTSLLVVDAKTMVPDKIIASTKDVADHEVAPNFTVDKEVTYVMDRGYQVH-KHFQAWV 208 Query: 185 FGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPA 244 ++ RV + L+ K G+ I ++ G Sbjct: 209 DQGMKFVARVKDNT------------RLTILKERALPKRGD---FIRDADVTLPGQQMKL 253 Query: 245 RLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADC 304 RLI ++GR+ + T + S Q+AD Sbjct: 254 RLI-----------------EFQDQQGRLYRLVTSRM-------------DLSVHQIADV 283 Query: 305 YRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 YR RWQIEL FK +K L L E ++ L++ Sbjct: 284 YRHRWQIELFFKWIKQHLRLVKPHGYTAEAIWNQMYIALISY 325 >UniRef50_C6JEA3 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JEA3_9FIRM Length = 329 Score = 75.2 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 51/312 (16%), Positives = 96/312 (30%), Gaps = 51/312 (16%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFG- 89 R R++ +L + + + D + + +R + D F Sbjct: 33 FIRNRKLGFKDYMLMFLTMEADC-IREELYRFFGRTIDAPSKAAF-YRQRKKIREDAFRN 90 Query: 90 -ILAAQTLAVRAAVTGCTSGKRLRLVDGT------------AISAPGGGSA----EWRLH 132 +LA + G DG+ P G S + ++ Sbjct: 91 LLLAFNRKLPKKLYNGKYEFW---ACDGSSCDIFLNPEDKDTYFEPNGKSTRGFNQIHIN 147 Query: 133 MGYDPHTCQFTDFELTDSRDAERLDRFAQTADE---------IRIADRGFGSRPECIRSL 183 + +FTD + +R F D I DRG+ S + Sbjct: 148 AMFSLFDKRFTDILVQPARKRNEYSAFCSMVDSADIPEHYKVIFFGDRGYTSYNNFAHVI 207 Query: 184 AFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFP 243 G+ I + MMG+ + + ++++ S + Sbjct: 208 EKGQYFLIRC----------NDKRASGMMGYPVDTLPAFDEDISLILTRSKAVSKYSRPE 257 Query: 244 ----ARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 R I + P + +T E R+++ + L+ + + T+LPE+E+ AE Sbjct: 258 LFSSYRYIYQNAPMDYLNDQRT----EYDLALRLLRIQ-LDDGSYENIATNLPEEEFKAE 312 Query: 300 QVADCYRLRWQI 311 Y LRW I Sbjct: 313 DFKALYHLRWGI 324 >UniRef50_Q45620 Probable transposase for insertion sequence element IS5377 n=12 Tax=Bacillaceae RepID=T5377_BACST Length = 377 Score = 75.2 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 61/367 (16%), Positives = 112/367 (30%), Gaps = 53/367 (14%) Query: 3 YSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA 62 H ++ + EE+ A G R + LA S R Sbjct: 2 NKHTTLPNLMQKLVSDEEIQLIAEAVGYRDSSRTFTLRELIHFFLLAAMHQWKSFRHGAD 61 Query: 63 WAQLHDVATLSDVALLKRLRNAA-DWFGILAAQTLAVRAAVTGCTSGKR--LRLVDGTAI 119 L+ + + K+ + D L A ++ T + LR+VD T + Sbjct: 62 VGPLYGLPRFHYSTVSKKAKEVPYDIMKRLLALIISKCNRQTRRSLRFPKPLRVVDSTTV 121 Query: 120 SAPGGG---------SAEWRLHMGYDPH-TCQFTDFELTDSRDAERLDRFAQTADEIRIA 169 + A +LH+ Y P + E T R + A ++ + Sbjct: 122 TVGKNRLPWAPYHGERAGVKLHVAYSPEFSLPADVVETTGLRHDGPVGEQLTNAQQVLVE 181 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 DR + R + + I + + L L + T Sbjct: 182 DRAYFKIERLDRFVEQHQLFVIR----------MKDNIELHQKKSLNRLSSTSSSVQTDF 231 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 G K+ + R++ GR ++ ++T Sbjct: 232 TCQLGTKQCRSTKRHRVVIFR-----------------DANGRDIR-----------VVT 263 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 +L SAE +AD Y+ RW +E+ F+ +K L++ L +FA +A L+ Sbjct: 264 NL--FHASAETIADMYQQRWAVEVFFRWVKQYLNVPTLFGTTENAVYNQLFAAFIAYVLL 321 Query: 350 DDIIQPS 356 + + Sbjct: 322 RWLYDQT 328 >UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W6S4_DESAS Length = 465 Score = 74.1 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 27/70 (38%), Positives = 40/70 (57%) Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 + G LLT+ D SA ++ YR R QIE+ FK LK LL L+ + + PE +A++ Sbjct: 300 KLDGIFALLTNYDADRVSANKLIKKYRERNQIEVNFKDLKGLLDLERIFLQLPERIEAYV 359 Query: 340 FANLLAAFLI 349 F LA F++ Sbjct: 360 FPKTLAYFVL 369 >UniRef50_C4ZTT5 Predicted divalent heavy-metal cations transporter n=1 Tax=Escherichia coli BW2952 RepID=C4ZTT5_ECOBW Length = 60 Score = 74.1 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 52/54 (96%), Positives = 52/54 (96%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGG 54 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY P G Sbjct: 3 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYRPRG 56 >UniRef50_D1XZ52 Transposase, IS4 family n=1 Tax=Prevotella bivia JCVIHMP010 RepID=D1XZ52_9BACT Length = 241 Score = 74.1 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 48/233 (20%), Positives = 84/233 (36%), Gaps = 41/233 (17%) Query: 129 WRLHMGYDPHTCQFTDFELTDS--RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFG 186 +LH YD T T +TD+ D++ ++ + I DR + + + + + Sbjct: 1 MKLHELYDVKTDIPTFSVITDASVHDSQVMELIPYEKESFYIFDRAYMATNK-LYIIEEA 59 Query: 187 EADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARL 246 EA ++VR M F+++ +I G+ Sbjct: 60 EAYFVVR---------EKHKMSFEVIEDKEYNTPSSGIMADQIIRFKGH----------- 99 Query: 247 IAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYR 306 + + K R V + + T+ E +AEQVA Y+ Sbjct: 100 ---------------KTKKQYPNKLRRVVFYDYDGNRTFVFYTN--NFEVTAEQVALLYK 142 Query: 307 LRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 RW++EL FK LK L + K I+A ++A L+ I+Q +D Sbjct: 143 YRWRVELFFKWLKQHLRIKEFYGTSENAVKIQIYAAIIAYCLV-VIVQECMDL 194 >UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q04V25_LEPBJ Length = 423 Score = 74.1 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 71/205 (34%), Gaps = 34/205 (16%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRW--LTAEGMRFDMMGFLRGLDCGKN 223 I + D+G+ S E I L +I+R + R L+ + E +D Sbjct: 176 ILLFDKGYPS-MELIGKLMANGIHFIIRSNTRWLKEAKIAGEYKEYD--------KVKNI 226 Query: 224 GETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAG 283 T M+ K A L ++ + + Sbjct: 227 LITNNMLKKKEWLKEYANTKGNLFSLRFVGSRYKDGQVG--------------------- 265 Query: 284 HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANL 343 + +T LP+ E+S E + Y RW IE F+ K L L+ + K +A + Sbjct: 266 --IFVTDLPDSEFSREDIVFLYGKRWNIETHFRFEKYSLELENVAPKTSIRFLQEYYAKI 323 Query: 344 LAAFLIDDIIQPSLDFPPRSAGSEK 368 L L +IQ + + +S ++K Sbjct: 324 LTFNLASLLIQEAQEEYDQSIQNKK 348 >UniRef50_C6N0W0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N0W0_9GAMM Length = 453 Score = 73.7 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 60/366 (16%), Positives = 109/366 (29%), Gaps = 59/366 (16%) Query: 32 TRRREIRD---AATLLRLGLAYGPGGMS-----LREVTAWAQLHDVATLSDVALLKRLRN 83 TR+R I +L+L L+ G L E + ++ L + + R + Sbjct: 29 TRKRVINTQFLVTFILKLVLSKNSQGYKILLNELWETSEFSALQEQPVSASSICEARQKM 88 Query: 84 AADWFGILAAQTLAVRAAVTGCT--SGKRLRLVDGTAISAP-----GGGSAEWRLHMG-- 134 F ++ + LA+R R+ VDG+ I+ P G A + Sbjct: 89 PETIFTLINQKVLAMREESDTLPLWRNHRVFGVDGSRINVPHELLEAGYKAPIKQQYYPQ 148 Query: 135 ------YDPHTCQFTDFELTD---SRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAF 185 Y + D L R T ++ + DRG+ S ++++ Sbjct: 149 GLMSTLYHLGSGLIYDGILEPVKGERICLLSHMEKLTLGDVLVLDRGYFSYLILVKAIER 208 Query: 186 GEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPAR 245 I R+ + D + + G P R Sbjct: 209 -GIHLICRMQSGPVNKAVQAFWDSDKEDEVISYIPSSPVKYES--KKQGYDIELNPIELR 265 Query: 246 LIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCY 305 LI T++ +V T L ++Y + Y Sbjct: 266 LIKY----------------------------TIDNETYVCCTTLL-GEQYPLNEFPAVY 296 Query: 306 RLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL-DFPPRSA 364 RW IE +K K + ++ ++ + +A++L L + PP S Sbjct: 297 HGRWGIEELYKISKEFVDVEDFHSRSERGVRQECYAHMLLINLARIFEAEADKQLPPPSE 356 Query: 365 GSEKKN 370 + N Sbjct: 357 PDNRDN 362 >UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Corynebacterineae RepID=A4T2G5_MYCGI Length = 401 Score = 73.7 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 74/389 (19%), Positives = 117/389 (30%), Gaps = 77/389 (19%) Query: 11 ILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGL---AYGPGGMSLRE-------- 59 +L + P +D G R A + + Y G Sbjct: 23 VLTRVFPPAMVDEVIEATGRTQVRHRALPARVMAYFAIGMGLYSDGSYEDVLSQLTDGLA 82 Query: 60 -VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQT--LAVRAAVTGCT-SGKRLRLVD 115 + W + + + S + R R + L A+ A G +G+R+ +D Sbjct: 83 WASGWREQYQLPGKSAI-FQARERLGSQPLAALFARVARPLGAADTPGTWVAGRRVVAID 141 Query: 116 GT------------AISAPG------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLD 157 GT PG + RL + T + RDAE Sbjct: 142 GTCLDVADNPVNEEFFGRPGVNKGEKSAFPQARLLAVAECGTHAIFAATIGAYRDAESTM 201 Query: 158 RFAQ----TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 T + + +ADRGF S R+ + AD + RV Sbjct: 202 VEHVLDALTPEMLVLADRGFFSYAL-WRNASDTGADLLWRVSTGRNGPTPTHVEDLADGS 260 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 +L L K+ + G P AR+I ++ GR Sbjct: 261 WLAHLRAAKD-------------RHGEPMLARVIDYTVDD-----------------GR- 289 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKE 331 + LL T D A ++A Y RW+IE F LK+ LR+K Sbjct: 290 -----DNPVAYRLLTTLTDPDTAPAVELAAAYAQRWEIESVFDELKTHQRGSKVVLRSKS 344 Query: 332 PELAKAWIFANLLAAFLIDDIIQPSLDFP 360 P+L I+ L + I ++ + Sbjct: 345 PDLVLQEIWGYLCCHYAIRSLMSQAAHHS 373 >UniRef50_Q0F098 ISGsu1, transposase n=6 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F098_9PROT Length = 383 Score = 73.7 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 63/374 (16%), Positives = 120/374 (32%), Gaps = 71/374 (18%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M +++ + +L I + + R R R + + A SLR+V Sbjct: 1 MKHANTVFHQLLRVIPR-HRFEEVVRRYDGDRRIRSLSCWTQFCVMLYAQLCSRQSLRDV 59 Query: 61 TAWAQLHDV------------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG 108 + + H +TL+D + + + F L Q G Sbjct: 60 VSAWESHASRHYHLGAGSVRRSTLADANVKRSAGMYLELFYWLLHQF-----RGKGIHRK 114 Query: 109 KRLRLVDGTAISAPG---------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERL--D 157 +RL+D T I G + ++H YDP T F +T ++ ++ + Sbjct: 115 DAVRLIDSTTIDLCKHQFEWASFRTGKSGVKVHTVYDPDAQVPTFFSITAAKKHDKKAAE 174 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 + DR + + L + ++ R+ F+++ L Sbjct: 175 HMPLLPGATYVFDRAY-NDYAWFHDLTQRDIRFVSRMKRNA---------EFEVVATLPV 224 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 D G + + + ++ +K R+ V Sbjct: 225 SDDGVLEDQHIRLSSAKGRKECPTILRRICFVH--------------------------- 257 Query: 278 TLEAAGHVLLLTS-LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 E ++ +T+ L SA +A Y+ RWQIEL F+ +K L + K Sbjct: 258 -EEDGKKLVFITNDLKR---SAGAIAALYKQRWQIELFFRWIKQNLKIKRFIGTSENAVK 313 Query: 337 AWIFANLLAAFLID 350 I ++A L+ Sbjct: 314 IQIIIAMIAYLLLH 327 >UniRef50_D1Q0M9 ISGsu1 transpoase n=7 Tax=Bacteroidales RepID=D1Q0M9_9BACT Length = 412 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 60/374 (16%), Positives = 103/374 (27%), Gaps = 65/374 (17%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 MN + S +++ I E + G RR L + A SLR + Sbjct: 27 MNVGNTVLSQLMSLIPDYELRKCVDKYRGDFHARR-FTCRDQFLVMSYAQFTSSASLRSI 85 Query: 61 TAWAQL------HDVATLSDVALLKRLRNAADW-FGILAAQTLAVRAAVTGCTSGKRLRL 113 A H + + L + +W A RA + RL + Sbjct: 86 EAQLTAFNSKLYHAGLKIMPKSTLADMNEKKNWRIYQDYAMIFVDRAKALYKDNYYRLNI 145 Query: 114 ------VDGTAISAPGGGSA---------EWRLHMGYDPHTCQFTDFELTDSR--DAERL 156 D + I+ +++H D LT D++ + Sbjct: 146 DNMVYAFDSSTINLCLQLCPWAKFLHDKGAFKMHTLVDVKNSIPNFVLLTPGNVHDSQAM 205 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 D + D+G+ R L A ++ R Sbjct: 206 DMLPIETGAYYLMDKGYVDFDRLFRILQQQHAYFVTRAK--------------------- 244 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 N + V R++ L ++ V++ Sbjct: 245 -----DNMKYNVF-------------ETRVVDRQTGVISDETISLSGLLTAKKHPDVLRL 286 Query: 277 ETLEAAGHVLLLTSLPED-EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 T E ++ L D A +A+ YR RW IE FK +K LH+ + Sbjct: 287 VTYEDYAQNVVYRFLTNDFILPAITIAELYRERWTIETFFKWIKQHLHIKSFYGTTQNAV 346 Query: 336 KAWIFANLLAAFLI 349 I+ + L+ Sbjct: 347 FTQIWIAICDYLLL 360 >UniRef50_B8FDX7 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDX7_DESAA Length = 395 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 61/366 (16%), Positives = 105/366 (28%), Gaps = 68/366 (18%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA-----WAQL 66 L + + G+ + + + SLRE+ + +L Sbjct: 11 LTGLFNRNQFYALVLRHGSEKHAKGFSSWDHFVAMLFCQIAQAKSLREICSGMACCLGKL 70 Query: 67 HDVA-------TLSDVALLKR-LRNAADWFGILAAQTLAVRAAVTGCTSG---KRLRLVD 115 + + A KR + D F L +A G T +L +D Sbjct: 71 RHLGVKGAPKRSTLSYANQKRTWKLFQDVFYDTLH--LCRQAPSPGKTKFRFRNKLMSLD 128 Query: 116 GTAISA-----PGG----GSAEWRLHMGYDPHTCQFTDFELTDSR--DAERLDRFAQTAD 164 + IS P +LH+ D +TD + D + A + Sbjct: 129 SSTISLCLSLFPWAEYRQTKGAVKLHLLLDHDGYLPVFACITDGKTHDVTMARQLALSKG 188 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 I + DRG+ + E ++ R+ + A+ + Sbjct: 189 SIVVMDRGYNDYKLYAEWVED-EVYFVTRLKDNAAFMVLADFP-------VPKNRNILVD 240 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 +T + G K R++ E+ Sbjct: 241 QTILFTGAVAAKNCPYALR-RVVVWDKEQER---------------------------KI 272 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 LL L + A +A Y+ RW+IEL FK LK L + + I+ L+ Sbjct: 273 ELLTNHLD---FGATTIAAIYKDRWEIELFFKALKQNLKVKTFVGTSENALQIQIWTALI 329 Query: 345 AAFLID 350 A LI Sbjct: 330 AMLLIK 335 >UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanella RepID=A6WTA0_SHEB8 Length = 446 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 63/180 (35%), Gaps = 6/180 (3%) Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 I + L ++ + + + G + + + Sbjct: 200 LIPSIPNHSLTLFDRGFYSLGLLHAWQQAQPDSHWLLPLKKGTQYEVVRTLGKHDQWVKL 259 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRW 309 + P+ K + + R++ +T++ +L + Y +E + D Y RW Sbjct: 260 TTTPQA---RKKWPQLPDTLEARLLT-KTVKGKSVAILTSLTDPMRYPSEDIVDLYAHRW 315 Query: 310 QIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSE 367 +IEL ++ +K L LR++ PEL ++ LLA LI + P ++ Sbjct: 316 EIELGYREMKQHLLESRFTLRSQLPELVTQELWGVLLAYNLIRYKMLLMAKSLPSVHPNQ 375 >UniRef50_A6L0R8 Transposase n=13 Tax=Bacteroidales RepID=A6L0R8_BACV8 Length = 411 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 48/257 (18%), Positives = 87/257 (33%), Gaps = 50/257 (19%) Query: 109 KRLRLVDGTAISAPGG--------------GSAEWRLHMGYDPHTCQFTDFELTDS--RD 152 RL+++D T IS ++H + +D T + D Sbjct: 130 NRLQIIDSTTISLFSNLIFTGVGRHPKTGKKKGGIKVHTNIHANEGVSSDIRFTSAATND 189 Query: 153 AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMM 212 + L T+ +I DR + + L+ Y+ ++ + ++A Sbjct: 190 SFMLKPSNYTSGDIVALDRAYIDYAK-FEELSRAGVIYVTKMKKNLVYEVSA-------- 240 Query: 213 GFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR 272 + + G + + K G R+++ +K R Sbjct: 241 DTIYMTESGLMALRERHVTFTKKVKDGDDIK---------------HHARIVTYVDQKKR 285 Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 A + LLT+ + E SAE + YR RW+IEL FK++K L + Sbjct: 286 --------GAKLISLLTN--DMEMSAEDIVAIYRKRWEIELLFKQIKQNFPLRYFYGESA 335 Query: 333 ELAKAWIFANLLAAFLI 349 K I+ L+A L+ Sbjct: 336 NAIKIQIWITLIANLLL 352 >UniRef50_P03835 Transposase insG for insertion sequence element IS4 n=377 Tax=root RepID=INSG_ECOLI Length = 442 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 60/172 (34%), Gaps = 6/172 (3%) Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 I + L + ++ ++ + G + L+ + Sbjct: 196 LIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIRKLGKGDHLVKL 255 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRW 309 P+ A L +E + V + LL + + ++ D Y RW Sbjct: 256 KTSPQ-ARKKWPGLGNEVTARLLTVTRKGKVC---HLLTSMTDAMRFPGGEMGDLYSHRW 311 Query: 310 QIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 +IEL ++ +K + LR+K+PEL + ++ LLA L+ + + Sbjct: 312 EIELGYREIKQTMQRSRLTLRSKKPELVEQELWGVLLAYNLVRYQMIKMAEH 363 >UniRef50_C3KKH4 Putative transposase Y4ZB n=2 Tax=Rhizobium sp. NGR234 RepID=C3KKH4_RHISN Length = 493 Score = 72.1 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 53/291 (18%), Positives = 90/291 (30%), Gaps = 53/291 (18%) Query: 71 TLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI--------SAP 122 TLSD + + A+ F ++A Q T K LRL+D T I + Sbjct: 188 TLSDANARRPVAVFAETFALVAGQL----DRQTRRDGSKMLRLIDSTPIPLGKLCDWAKS 243 Query: 123 GGGSAEWRLHMGYDPHTCQFTDFELTDSR--DAERLDRFAQTADEIRIADRGFGSRPECI 180 G +LH+ YDP ++TD+ DA+ + D+G+ Sbjct: 244 NGRIRGMKLHVVYDPKADCPRLLDITDANVNDAQIGRTVTIEKGATYVFDKGYCHYGWWT 303 Query: 181 RSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA 240 A L+ + + L+ + V + + G+ K Sbjct: 304 AIAAAKAVFVTRPKVNMALKVVRKRRITAAEGDGFTVLE-----DARVRLASKGDSKLPI 358 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPED-EYSAE 299 R+ + +T L D + A Sbjct: 359 GLR-RITVKRADGD--------------------------------TITLLTNDLKRPAV 385 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 + Y+ RWQIEL F+ +K L + + I A ++A L+ Sbjct: 386 AIGQLYKGRWQIELLFRWIKQHLKIRKFLGNNDNAIRLQILAAMVAYALLR 436 >UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminococcus torques ATCC 27756 RepID=A5KKC4_9FIRM Length = 422 Score = 72.1 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 53/309 (17%), Positives = 98/309 (31%), Gaps = 80/309 (25%) Query: 72 LSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP--------- 122 +S A L+ R + F +G + VDG+ I P Sbjct: 91 ISHKAFLELFRLSVKQFYFQPVNL--------RTWNGFHIYAVDGSTIQIPESKENYEVF 142 Query: 123 GGGSAEWRL-------HMGYDPHTCQFTDFELTDSRDAE------RLDRFAQTADEIRIA 169 GG + ++ + YD D L R E +D + + I + Sbjct: 143 GGNPNKTKIISPLASASVLYDVINDILIDVSLHPYRYNERESAKAHVDFLPRFPNSIILF 202 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 DRG+ S + L +++RV + ++ + Sbjct: 203 DRGYPSE-DMFHYLNSKGILFLMRVPKTFKKAISEQ------------------------ 237 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 E AL + ++ R + LE L+T Sbjct: 238 ------------------------EDALFTYPASCNKESLTLRSIHF-LLEDGSTEYLVT 272 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 +L ++ + E D Y+ RW +E ++ LK+ L ++A + +P + FA + + L+ Sbjct: 273 NLMPEQIAKENFPDLYQFRWGVESKYRELKNRLEIEAFNSIKPASIQQEFFAAMYLSNLV 332 Query: 350 DDIIQPSLD 358 I S Sbjct: 333 AVIKSESDS 341 >UniRef50_Q11ZL6 Transposase, IS4 family n=22 Tax=Bacteria RepID=Q11ZL6_POLSJ Length = 389 Score = 71.4 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 47/284 (16%), Positives = 85/284 (29%), Gaps = 57/284 (20%) Query: 84 AADWFGIL-AAQTLAVRAAVTGCTSGK------RLRLVDGTAISA--------P-GGGSA 127 + DW A L RA + +D T I P A Sbjct: 90 SRDWRIWSDLAALLIRRARKLYREEDLGLDLTNTVYALDATTIDLCLSLFDWAPFRSTKA 149 Query: 128 EWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIRIADRGFGSRPECIRSLAF 185 ++H D ++D + + LD A + DRG+ + + Sbjct: 150 AVKMHTLLDLRGSIPAFIHISDGKMGDVNVLDFLPVEAGAFYVMDRGYLDFAR-LYKMHQ 208 Query: 186 GEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPAR 245 A ++ R K G + ++ +A + Sbjct: 209 AGAFFVTR---------------------------AKRGMNARRVYSAQTDRATGVICDQ 241 Query: 246 LIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCY 305 IA++ + + + R ++ + E ++ LT+ A +A Y Sbjct: 242 SIAMN---------GFYVCKDYPEQLRRIRFKDPETGKTLVFLTN--NTTLPALTIAALY 290 Query: 306 RLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 + RWQ+EL FK +K L + K I+ + LI Sbjct: 291 KSRWQVELFFKWIKQHLRIKKFLGTSENAVKTQIWCAVCTYVLI 334 >UniRef50_B2IXJ5 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2IXJ5_NOSP7 Length = 238 Score = 71.4 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 22/68 (32%), Positives = 35/68 (51%) Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLA 345 L+ T L YS + Y RW +E+ + LK+ L +D LR K P + + I+ LLA Sbjct: 125 LITTLLDITTYSTLDIVGLYGKRWDVEIDLRHLKTTLGMDVLRCKTPSMVRKEIYVYLLA 184 Query: 346 AFLIDDII 353 L+ ++ Sbjct: 185 YNLLRGLM 192 >UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_ALISL Length = 441 Score = 71.0 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 57/346 (16%), Positives = 107/346 (30%), Gaps = 62/346 (17%) Query: 32 TRRREIRDAATLLRLGLAYGPGGMSLREVTA-WAQLHDVATLSDVALLKRLRNAADW-FG 89 R+ + LL Y M A ++ AL +R +N + Sbjct: 50 KRKLTLESMVWLLVGMAIYNNKSMKDLVNQLDIVDRTGKAFVAPSALTQRRKNLGEAAMK 109 Query: 90 ILAAQTLAVRAAVTGCT--SGKRLRLVDGTAISAP-------------GGGSAEWRLHMG 134 + + + +G L VDG AP G + R+ Sbjct: 110 AVFERMTSSWLKSANLPKWNGLTLLGVDGVVWRAPDNQKNEEAFSRQKGTQYPQVRMVCQ 169 Query: 135 YDPHTCQFTDFELTDSRDAERL---DRFAQTADE-IRIADRGFGSRPECIR-SLAFGEAD 189 + + T + E + T D + + D+GF S + + E Sbjct: 170 MELSSHLITASAFDNYNTNEMILAEKLIDSTPDHSVTMFDKGFYSLGLLHKWQMTGSERH 229 Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 +++ + + R D + LR AR + Sbjct: 230 WLIPLKKNTQYEIIRSLGRNDKLVILRSNP-----------------------RARKLFS 266 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRW 309 +LP R+V + ++ + +L + + Y + + Y RW Sbjct: 267 NLPE--------------TMTARLVTRK-IKGKDYQVLTSMIDPLRYPLKDIVGLYEHRW 311 Query: 310 QIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDII 353 +IEL ++ K + + LR++ PEL K ++ LL LI + Sbjct: 312 EIELGYREQKQYMLGNRLTLRSRLPELVKQELWGILLTYNLIRYQM 357 >UniRef50_A6DKD2 ISPg4, transposase n=7 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DKD2_9BACT Length = 412 Score = 71.0 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 53/373 (14%), Positives = 111/373 (29%), Gaps = 75/373 (20%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVT--------AW 63 + + P ++ A+ G + R++ + ++ L +SL +V + Sbjct: 14 ICQLIPPHIVNKLAKKHGI--KTRKLSSWSHVVSLLYTQLSHALSLNDVCDGLHYHSSSL 71 Query: 64 AQLHDV--ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKR----------- 110 Q+ + + R RNAA + ++++ + K+ Sbjct: 72 FQIRGATAPKRNTFSNANRTRNAAMAEDLFWEVLKSLQSQLPSFGLDKQNSNFPQRFKRA 131 Query: 111 LRLVDGTAISAPG---------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDR--- 158 + VD T I A + HM + T + + ++ + + Sbjct: 132 VYAVDSTTIQLVAHCLDWAKHRRRKAAAKCHMQLNLQTFLPSYAIVKEANTHDSTEAKEM 191 Query: 159 -FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 EI + D+ + V +R L L + G + ++ Sbjct: 192 CANIKDGEIVVFDKAY--------------------VDFRHLYHLDSRG-----VNWVTR 226 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 + GN + I + R+V A Sbjct: 227 SKDNMVYDIIEERPTKGNIISDQIIKLNGI--------------NTEKHYSQNLRLVTAN 272 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 +L+ +++ +A Y+ RW IE+ FK+LK L L + Sbjct: 273 IEVDGKMKVLMFLTNNLQWAPSSIASIYQSRWGIEVFFKQLKQNLKLADFLGHNKNAIQW 332 Query: 338 WIFANLLAAFLID 350 ++ LL L+ Sbjct: 333 QVWTALLTYVLLR 345 >UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4D726 Length = 464 Score = 71.0 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 53/273 (19%), Positives = 82/273 (30%), Gaps = 48/273 (17%) Query: 107 SGKRLRLVDGTAISAPGGGS------------------AEWRLHMGYDPHTCQFTDFELT 148 G R+ DGT++ + + RL + T Sbjct: 108 HGLRVVAWDGTSVEVADSAANVAHYGRHGKATSRPAGYPQVRLTALVECGTRALMGAVFG 167 Query: 149 DSRDAE--RLDRFA--QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTA 204 D E + R + +ADRG+ E IR A AD + RV + R L Sbjct: 168 PMHDKELPQARRLLPVLRPGILLLADRGYDGY-EAIRDAASTGADLLWRV--QSGRLLPV 224 Query: 205 EGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA--PFPARLIAVSLPPEKALISKTR 262 D + LD A R+I + A + Sbjct: 225 IQPLPDGSHLSQILDRRSGDRLAAWQRRKRPTPPPALTAMAVRVIRYQVTVTTADGRQH- 283 Query: 263 LLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL 322 ++ L+ T L + A ++A+ Y RW+IE A+ LK L Sbjct: 284 ------------------SSTVRLITTLLDPARHPAAELAELYHQRWEIETAYYGLKVTL 325 Query: 323 HLD--ALRAKEPELAKAWIFANLLAAFLIDDII 353 LR+ + + I+A L L I Sbjct: 326 RGSDRVLRSHTVQGVEQEIYALLTVFQLTRTAI 358 >UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkholderiaceae RepID=A4JGL4_BURVG Length = 402 Score = 70.6 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 46/254 (18%), Positives = 82/254 (32%), Gaps = 23/254 (9%) Query: 119 ISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTA--DEIRIADRGFGSR 176 ++APG A +R G + ++ D + + + R G Sbjct: 124 LAAPGAPGAWYR---GLRVMALDGSCMDVADEAANAKFFGYPGASRGQSAFPQARVLGLV 180 Query: 177 PECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM-IGNSGN 235 ++ + A+ + + + L + + Sbjct: 181 ECGTHAVVAAGIAPYGHSEQ----VMAAQLLPAKLTPEMLVLADRNFYGFKLWQTACATG 236 Query: 236 KKAGAPFPARL---IAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAG--------H 284 K + L + LP L RR G+ V+ G + Sbjct: 237 AKLAWRVKSNLKLPVEQMLPDGSYLSRVFDSDDRARRAGQTVRVIDYALEGSATPAQGSY 296 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFAN 342 LL L D A ++A Y RW+IE F K+ L ++ LR+K PEL + ++ Sbjct: 297 RLLTNLLDPDAAPALELAALYHERWEIEGVFDEFKTHLRANSTVLRSKTPELVQQELWGL 356 Query: 343 LLAAFLIDDIIQPS 356 LLA F I ++ + Sbjct: 357 LLAHFAIRQLMAQA 370 >UniRef50_B3PC11 ISCja2, transposase n=5 Tax=Proteobacteria RepID=B3PC11_CELJU Length = 383 Score = 70.6 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 57/371 (15%), Positives = 120/371 (32%), Gaps = 62/371 (16%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR-- 58 M++S+ + +L + E + A+ + R + + ++ G SLR Sbjct: 1 MSHSNTAFHQLLKPL-SRHEFEAEAKKHHVGQKLRSATRWDQFVGMAMSQLSGRQSLRDI 59 Query: 59 EVTAWAQLHDVATLSDVAL----LKRLRN--AADWFGILAAQTLAVRAAVTGCTSGK--- 109 + AQ H + L + L R+ A+ + + A+ L ++ G + Sbjct: 60 QSNLEAQQHKLYHLGAKPIARSTLARINEVQPAELYKHVFARLLHRCKSMQGKHKFQFKN 119 Query: 110 RLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLD--R 158 L +D +AI +A +L +G + T L+D ++ + ++ + Sbjct: 120 PLYSLDASAIDLSLSVFPWAAHRDDTANVKLSVGLNHGTQVPEFVALSDGQENDMIEGRK 179 Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 F I D+G+ + L ++ R+ + + + G + Sbjct: 180 FDFPKGSIVAFDKGYVDY-RWFKLLTDKGVFFVTRLRAKAVYRVEERRYADSSKGII--- 235 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 + +S + K R + Sbjct: 236 ---------------------------------SDQVIQLSSAHAIKRGAPKLRRIGYRD 262 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 LT+ + +A +A Y+ RWQ+EL FK +K L + A Sbjct: 263 ATTGKFYEFLTN--NFQLAAATIAAIYKDRWQVELFFKAIKQNLKIKAFVGTSRNAVLTQ 320 Query: 339 IFANLLAAFLI 349 I+ ++ L+ Sbjct: 321 IWIAMITYLLL 331 >UniRef50_B8FEP3 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FEP3_DESAA Length = 422 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 60/269 (22%), Positives = 83/269 (30%), Gaps = 52/269 (19%) Query: 107 SGKRLRLVDGTAISAP---------GGGSAEWRLHMG----YDPHTCQFTDFELTDSRDA 153 G+R+ +DGT I P G S W YD D + Sbjct: 111 RGRRVLAIDGTKIMLPRTKELLDAFGKCSHGWFPQTHACVLYDVLAGLPLDVAWGHYKSG 170 Query: 154 E------RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGM 207 E D F +I + DRGF L D+IVR L +G Sbjct: 171 ERGLARDMFDGFL--PGDILVLDRGFPGFA-FFLDLMEQGIDFIVR--------LRGDGQ 219 Query: 208 RFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSEN 267 + FL+ + E P R+ E A K Sbjct: 220 FAALRPFLQENRRDQIIEIP---------------PTRVAI----EEYARQGKPAPGPVT 260 Query: 268 RRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDAL 327 R V+ + + T + Y +++ Y LRWQ E FK +K LL + + Sbjct: 261 ---LRFVKVSLGKGKSALYATTLVDRKRYKFKELKHLYHLRWQEEEFFKHMKDLLEAENI 317 Query: 328 RAKEPELAKAWIFANLLAAFLIDDIIQPS 356 R K L I A L L +I S Sbjct: 318 RGKSEALVDQEIVAVHLYHLLARILIMES 346 >UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholderia RepID=B2JV26_BURP8 Length = 442 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 69/374 (18%), Positives = 119/374 (31%), Gaps = 53/374 (14%) Query: 7 NWSAILAHIGKPEEL-DTSARNAGALTRRREIRDAATLLRLGLA-----YGPGGMSLREV 60 + S + H+ P E + + + GA + RR A ++ L +A + L + Sbjct: 17 DLSRLAEHL--PYEWIERAVQATGAASIRRRRLPAEQVVWLVIALAMYRHWSISEVLDSL 74 Query: 61 TAWAQLHDVATLSDVALLK-RLRNAADWFGILAAQTLAVRAAVTGCTS---GKRLRLVDG 116 +S A+++ R R L QT G L +DG Sbjct: 75 DLALPNEAAPFVSKSAVVQARQRIGEAPMAWLFEQTARAWTTQDAAHHAFKGLSLWAMDG 134 Query: 117 TAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEI---RIADRGF 173 T + P + R H G + A A T I +AD F Sbjct: 135 TTLRTPDSAAN--REHFGAQ---------GYASGKVASYPQVRAVTLTAIPTHLVADINF 183 Query: 174 GSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNS 233 G A + ++ L + +++ L ++ Sbjct: 184 GCYDTNEMVYAKS---LLPQIPDDSLTVFDKGFLAAEILCGLTMNGRNRHFLIPAKSNTC 240 Query: 234 GNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKG---------RVVQAETLEAAGH 284 AG A + + R+ + R+K R ++A Sbjct: 241 WEVIAGTADDAMV-------------RMRVSQQARKKCPALPEFWNARAIRAIDARGRER 287 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK-SLLHLD-ALRAKEPELAKAWIFAN 342 VLL + + + CY RW+IE ++ LK S+L + LR++ E I+ Sbjct: 288 VLLTSLGDRRRFKPADIVACYERRWRIETSYGELKQSMLGSELTLRSRTVEGVYQEIWGA 347 Query: 343 LLAAFLIDDIIQPS 356 L+A LI I + Sbjct: 348 LIAYNLIRREIASA 361 >UniRef50_A8YU85 Transposase n=21 Tax=Lactobacillus RepID=A8YU85_LACH4 Length = 194 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 3/90 (3%) Query: 270 KGRVVQAETLE---AAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA 326 K RV + +L+T+L +E+ ++ + Y L W IE +F+ LK L Sbjct: 31 KFRVCKFRINPPGSDDEWEVLITNLDRNEFPLARMKEIYHLSWGIETSFRELKYDLSGIQ 90 Query: 327 LRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 +K+ + I+A+ + + S Sbjct: 91 FHSKKDQFVYMEIYAHFAMYNAVSLSVATS 120 >UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LF21_FRASN Length = 420 Score = 69.8 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 66/365 (18%), Positives = 114/365 (31%), Gaps = 57/365 (15%) Query: 6 DNWSAILAHIGKPEEL------DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLRE 59 D++ ++ + D + GA+++ R L L L Sbjct: 71 DDYDEVMRRLVGTLRWLGSWKGDWKVPSTGAISQARTRLGPEPLKLLFERVAVPVAGLGT 130 Query: 60 VTAWAQLHDVATLSDVALLKRLR-NAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTA 118 AW + + V L AD FG + T + + Sbjct: 131 KGAWLGSRRLVAVDGVHLDTADTPENADAFGRFSHG------PKTAAFPQVHVVAL---- 180 Query: 119 ISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPE 178 AE H + +T D R A + ADR F Sbjct: 181 --------AECGTHAVFAAAIGAYTS----DERSLAATLFDACEPGMLLTADRNFYGYGL 228 Query: 179 CIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL--RGLDCGKNGETTVMIGNSGNK 236 ++LA AD + RV+ + + L + + G+ Sbjct: 229 WQQALAT-GADLLWRVNANLTLPVIRALPDGSYLSLLIDPKIPVARRGQLIADARAGHAP 287 Query: 237 KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS-LPEDE 295 + P R+I S+P + + + L+T+ L + Sbjct: 288 PTESALPVRVIEYSVPDHEENG----------------------TSELICLITNILDPTD 325 Query: 296 YSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDII 353 +A ++A Y RW+IE F +K+ + LR+K PEL K I+A LL + I ++ Sbjct: 326 VAAIELATAYHERWEIESTFDEIKTHQRGEKRVLRSKNPELVKQEIWALLLTHYAIRSLM 385 Query: 354 QPSLD 358 + D Sbjct: 386 IEAAD 390 >UniRef50_A1ZPG0 Transposase of, putative n=3 Tax=Microscilla marina ATCC 23134 RepID=A1ZPG0_9SPHI Length = 395 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 54/141 (38%), Gaps = 9/141 (6%) Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPP-----EKALISKTRLLSENRRK 270 R + + P +L+ P + + R N + Sbjct: 205 RKFKDFDEASIQFVTNIGKKPRYQVNRPHQLLDRHHPDLDFIQDSVVQLFERGQPTNSME 264 Query: 271 --GRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 R+++ E H+ +L++L + AE VA Y +RW IE+ F+ LK ++L Sbjct: 265 HEFRLIEFRVKETGKHLFILSNL--WDLPAEVVAQVYLMRWDIEVIFRFLKQEMNLTHFV 322 Query: 329 AKEPELAKAWIFANLLAAFLI 349 + K I+ L+AA +I Sbjct: 323 CNDLNAIKVMIYVKLIAAMMI 343 >UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammaproteobacteria RepID=C6CF98_DICZE Length = 441 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 57/349 (16%), Positives = 91/349 (26%), Gaps = 32/349 (9%) Query: 21 LDTSARN--AGALTRRREIRDAATLLRLGLAYGPGGMSLREVT------AWAQLHDVATL 72 A N A RRR + + + + +S+++V Sbjct: 28 WIHQALNACHKASIRRRRLPAEQAVWLVLMMGLLRDLSIKDVCHHLDIVLQPDEGYQPLA 87 Query: 73 SDVALLKRLRNAADWFGILAAQTLAVRAAV---TGCTSGKRLRLVDGTAISAPGGGSAEW 129 V R R L + G + VDGT P Sbjct: 88 PSVLTAARQRLGEAPLRYLFHACNEGWLPTVLGSDTFHGLHVLSVDGTLFRTPDSPDNAA 147 Query: 130 RLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEAD 189 DP F + + D FG E +LA Sbjct: 148 AFGF-IDPVHGTFPQVRMVG----------LMATHSHMLLDAAFGGVAEGELTLAHR--- 193 Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 + L + R + T V LI + Sbjct: 194 LVSSAPDHSLTLFDRCYFSASFLLEWRQAGVETHWLTPVKRKLRYRVIERYSDYDMLIEM 253 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV--LLLTSLPEDEYSAEQVADCYRL 307 + P+ K + R+V G + L + Y E + Y Sbjct: 254 PVSPQA---RKAAPHLPAVWQARMVSYINGSGKGKITGFLTSMTDPVAYPLEDLLRIYWT 310 Query: 308 RWQIELAFKRLK--SLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 RW+IEL + LK L LR++ PE K ++ L++ L+ + Sbjct: 311 RWEIELGYGELKQRQLKGEVTLRSRFPEGVKQELWGILVSYNLLRKEMA 359 >UniRef50_B2PVI2 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PVI2_PROST Length = 144 Score = 68.3 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 39/81 (48%), Gaps = 2/81 (2%) Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPEL 334 + + G +L + Y +AD YR RW+IE F+ +K + + LR+K+P L Sbjct: 10 KNINGKGVQILTSMSEPLRYPKADIADLYRHRWEIEHGFREMKQHMLNNELTLRSKKPAL 69 Query: 335 AKAWIFANLLAAFLIDDIIQP 355 ++ +LA L+ ++ Sbjct: 70 VNQELWGIVLAYNLLRFMMAQ 90 >UniRef50_C9R546 Transposase (IS4 family) protein n=1 Tax=Aggregatibacter actinomycetemcomitans D11S-1 RepID=C9R546_AGGAD Length = 382 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 53/292 (18%), Positives = 98/292 (33%), Gaps = 55/292 (18%) Query: 76 ALLKRLR-NAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGG---GSAEWRL 131 ++ RLR AD+F + L R + + + D T ++ + Sbjct: 79 SISDRLRTINADYFKAIFESVL-SRYSDSYLKPRDNIIAFDSTIVTLSSKLLKTGMKVGS 137 Query: 132 HMGYD------------PHTCQFTD--FELTDSRDAERLDRFAQTADEIRIADRGFGSRP 177 + G + + FT + D E + + D I + D G SR Sbjct: 138 YQGVNGIKFSVAFSSVPVKSKLFTQRVYSSEDVALKELIVEHPLSRDNILLFDMGIQSRN 197 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKK 237 + ++ RV + + E V I + + Sbjct: 198 T-FDEFSDKHFTFVTRVREIARYRVMS--------------------ENPVEIRETASMV 236 Query: 238 AGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYS 297 + + RL + E R R++ A E + LT+ +YS Sbjct: 237 IQSDYNVRLF-------------NKENKETRHIFRLIIARLKEKDEEIYFLTN--HADYS 281 Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 A ++A+ Y+ RW+IE+ FK +K L L ++ K ++ L+ A L+ Sbjct: 282 ATEIAELYKRRWEIEVFFKFIKQHLDFSHLLSRNENGMKVEMYMTLITAILL 333 >UniRef50_A6CCZ3 Transposase, IS4 (Fragment) n=7 Tax=Planctomyces maris DSM 8797 RepID=A6CCZ3_9PLAN Length = 531 Score = 67.9 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 45/279 (16%), Positives = 74/279 (26%), Gaps = 51/279 (18%) Query: 102 VTGCTSGKRLRLVDGTAI-----------------SAPGGGSAEWRLHMGYDPHTCQFTD 144 V ++G R+ LVDG I PG G R T D Sbjct: 194 VKSRSTGGRILLVDGFTITAADTPENQRAYPQNPAQKPGLGFPVLRCVSLISMTTGLLVD 253 Query: 145 FELTDSRDAERLDRFAQ-------TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWR 197 + + +AD + + + + +++ H Sbjct: 254 LVSGPYSGKGSGETALLWQMLDVLRPGDTLVADSYYCTYWL-VSACHARGVQILMKNHHL 312 Query: 198 GLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKAL 257 + L + ++ RL+ V Sbjct: 313 RDDHPQTARRLNKRERLVTWLRPPV---RPAWMARQEYRRQPLTLTLRLVDV-------- 361 Query: 258 ISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKR 317 V + T A +A Y+ RW IEL + Sbjct: 362 ---------------QVSQPGCRTKTFTIATTITDRKACPARWIAAVYQSRWLIELDIRS 406 Query: 318 LKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 +K L +D LRAK P + +++ LLA LI + S Sbjct: 407 IKCSLGMDILRAKSPAMVLTELWSCLLAYNLIRLKMLQS 445 >UniRef50_A4BSI0 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BSI0_9GAMM Length = 406 Score = 67.9 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 28/122 (22%), Positives = 51/122 (41%), Gaps = 6/122 (4%) Query: 245 RLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADC 304 +I + P ++ + + L+A G +L+ T + + Sbjct: 208 HVIVIHKPKKRPQWMSETEYAAAPA---TLTLRELKAGGKLLVTTLRCPNTAPKGALKAL 264 Query: 305 YRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS---LDFPP 361 Y+ RW +EL + +K L +D L K P++ + I+ LLA LI ++ S D P Sbjct: 265 YQSRWHVELDIRHIKETLGMDVLSCKTPDMTRKEIWVYLLAYNLIRLMMVQSARLADIAP 324 Query: 362 RS 363 R+ Sbjct: 325 RT 326 >UniRef50_UPI0000F70487 putative IS4 transposase n=1 Tax=Aeromonas salmonicida subsp. salmonicida RepID=UPI0000F70487 Length = 168 Score = 67.9 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 18/98 (18%), Positives = 40/98 (40%), Gaps = 11/98 (11%) Query: 267 NRRKGRVVQA---------ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKR 317 +R+ ++ T+ +L + + + + + Y RW+IEL ++ Sbjct: 4 AQRRCSTIRISYLEARLLTRTINGKERQVLTSMVDPMRFPGADIVELYGHRWEIELGYRE 63 Query: 318 LKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDII 353 +K L LR+K+ + ++ LLA L+ + Sbjct: 64 MKHCLQQHRLTLRSKKAAGIRQELWGVLLAYNLLRSQM 101 >UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria RepID=D2TH14_CITRO Length = 438 Score = 67.5 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 64/350 (18%), Positives = 102/350 (29%), Gaps = 76/350 (21%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 L R R I D T L L L+ G + A + LSD L + Sbjct: 61 LYRDRSITDVVTKLDLVLSSQEG----ETLAASSVARARQRLSDEPLRELFT-------- 108 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGS------------------AEWRLH 132 L A + G RL VDGT P RL Sbjct: 109 LTASHWTQQEDKDDLWYGLRLFAVDGTLFRTPDTPELAEHFEYIKHRPDRHTEYPMVRLC 168 Query: 133 MGYDPHTCQFTDFELTDSRDAE--RLDRFAQTADEIRIADRGFGSRPECIR-SLAFGEAD 189 + + + E + + A + + DR + S I EA Sbjct: 169 AMMSLRSRLIHGVKFGPANTGEVSYAKQLSPQAKSLTLFDRCYLSAELLINWQRRQQEAH 228 Query: 190 YIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 ++V + + D + ++ + ++++ + ARLI Sbjct: 229 WLVPLKGNTKYRIVETFAGGDHLVEMQVSPQARKQDSSL----------PENWQARLIEY 278 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPE-DEYSAEQVADCYRLR 308 E+ + +TSL E +Y AE + Y+ R Sbjct: 279 ----------------------------EDESGDYKGFITSLTEPGQYPAEALRYVYQER 310 Query: 309 WQIELAFKRLKSLLHLDA---LRAKEPELAKAWIFANLLAAFLIDDIIQP 355 W IE + LK L LR+++ I+ L A LI + Sbjct: 311 WSIENGYGELKQ-FQLSTATLLRSQKVSGIYQEIWGLLTAYNLIRMEMSQ 359 >UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH7_9BACT Length = 382 Score = 67.5 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 64/339 (18%), Positives = 117/339 (34%), Gaps = 58/339 (17%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 L R + A+ L ++ G +SL + + D +L L+++L + Sbjct: 63 LKSMRGLCAASELKKVQEHVTLGKISLGSFSEAQHVFDATSLQ--HLVQKLSSKIP---- 116 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGTAIS-APGGGSAEW--------RLHMGYDPHTCQ 141 + R+ + K L VDG+ AEW +LH+G+ Sbjct: 117 --INKIQDRSLLAAV---KDLVAVDGSLFQTLTRVLWAEWLDENHKAAKLHLGFSLLKQS 171 Query: 142 FTDFELTDSRDAERLDRFA-QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLR 200 D +T ER + + DR +G L A + +R+ + Sbjct: 172 AVDAVITAGNSCERKALLKMVQPGVMYVCDRYYGLDYSYFEELQQRGALFTIRIRNKPKL 231 Query: 201 WLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK 260 + E + + G + V +G++ + + L A K Sbjct: 232 TVIKEYEITE-----KDRKEGVISDQLVYLGDTDRELKP---------IRLVRTGAFNDK 277 Query: 261 TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKS 320 +LL+TS ++ +A ++ YR RWQIE+ FK LKS Sbjct: 278 E-----------------------ILLVTSEAPEKLNAAIISTIYRQRWQIEVFFKWLKS 314 Query: 321 LLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 +L L A+ +++ L+AA ++ D+ Sbjct: 315 ILGCRKLLAESSNGVAIQMYSALIAAIMLFDLFGKKPTL 353 >UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FI31_DESAA Length = 386 Score = 67.1 bits (162), Expect = 9e-10, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 67/228 (29%), Gaps = 43/228 (18%) Query: 124 GGSAEWRLHMGYDPHTCQFTDFELTDSR--DAERLDRFAQTADEIRIADRGFGSRPECIR 181 A ++H D +TD++ D E + I + DR + Sbjct: 144 STKAGIKIHTVLDHSGYIPAFVRITDAKTSDIEIARTLSLPKGSILVEDRAYVDFT---- 199 Query: 182 SLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAP 241 + H ++T R G T+ I KKA Sbjct: 200 --------WFKNWHENKQFFVTRLKKNIKYKVLERRDVPQNKGVTSDQIIKLTGKKAADC 251 Query: 242 FPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQV 301 R + H + LT+L + SA + Sbjct: 252 PNLRRVGY---------------------------WDKTTKKHYVYLTNLTK--LSARTI 282 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 AD Y+ RWQIEL FK +K L + + I+ +++ ++ Sbjct: 283 ADIYKDRWQIELFFKWIKQNLRIKSFLGNSRNAVLTQIWTAMISMLIL 330 >UniRef50_B8FXQ3 Transposase IS4 family protein n=8 Tax=Desulfitobacterium hafniense RepID=B8FXQ3_DESHD Length = 414 Score = 67.1 bits (162), Expect = 9e-10, Method: Composition-based stats. Identities = 56/373 (15%), Positives = 112/373 (30%), Gaps = 74/373 (19%) Query: 6 DNWSAILAHIGKPEE----LDTSARNAGALTRRREIRDAATLLRLGLAYGPGG--MSLRE 59 D + + +P + L + R + L L +++ +LR+ Sbjct: 5 DTTQSTFTQVFQPFFSKDLWKKIDQEVPNLDQ-RNYKLKTNQLTLLISHAQLQEYKALRK 63 Query: 60 VTA------WAQLHDVATLSDVALLKRLRNAADWFG-ILAAQTLAVRAAVTGCTSGK--- 109 +++ +++ + ++S + +RLR +L L A G + Sbjct: 64 ISSNVQSNDFSEAIGLESISHSQISRRLRTLPIKVSEMLFKGVLNKVAQKKGDGKIQQRL 123 Query: 110 -RLRLVDGTAISA-----------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLD 157 +L ++D + IS + L + +D + +T ++ A+R Sbjct: 124 GKLYMIDASVISLCLSRFPWAVFRKIKAGVKMHLRLSFDEMAI-PDEVIITPAKTADRKK 182 Query: 158 R---FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGF 214 D + I DRG+ E ++ R+ + T + G Sbjct: 183 LDELIVVDKDALTIFDRGYIDYLL-FDEYCEKEIRFVTRLKNNAVIEFTGVERPVEEEGS 241 Query: 215 LRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV 274 + + +++G K R V Sbjct: 242 IEE-------DVDIILGTGTRKMKHT------------------------------LREV 264 Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 + +L SAE++ + YR RWQIEL FK LK + Sbjct: 265 TIDDNVNEPFTILTNDFD---LSAEELGEVYRYRWQIELFFKWLKQHAQIKHFYGTSEAA 321 Query: 335 AKAWIFANLLAAF 347 I +L+ Sbjct: 322 VINQIRLDLMTYC 334 >UniRef50_C4XGQ6 Putative transposase for insertion sequence element n=2 Tax=Desulfovibrio magneticus RS-1 RepID=C4XGQ6_DESMR Length = 376 Score = 66.4 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 66/224 (29%), Gaps = 45/224 (20%) Query: 128 EWRLHMGYDPHTCQFTDFELTDSRDAERL--DRFAQTADEIRIADRGFGSRPECIRSLAF 185 ++H D +T+++ E I + DRG+ R L Sbjct: 147 GIKMHTVMDHDGYLPAVVTVTEAKCHEVNIAKLLKLPKGSIVVFDRGYNDYTW-FRHLCK 205 Query: 186 GEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPAR 245 + R+ + I +A Sbjct: 206 SGVFLVTRLKSNARFRV---------------------------IERHRTDQATGVTSDH 238 Query: 246 LIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCY 305 +I V++ + + + V E + LT+ A +AD Y Sbjct: 239 IIQVAVGEKTMTLRR-------------VGYRDQETGNRLDFLTN--HMTLPARTIADIY 283 Query: 306 RLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 + RWQ+E+ F+ +K L + + + ++ L+A L+ Sbjct: 284 KERWQVEIFFRFIKQNLKIKSFLGNSKNAVLSQVYVALIAYLLL 327 >UniRef50_C8W0R5 Transposase-like protein n=12 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W0R5_DESAS Length = 604 Score = 66.4 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 28/130 (21%), Positives = 57/130 (43%) Query: 221 GKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLE 280 GK + ++ + K F + L +++ + + K + E + Sbjct: 387 GKLNKRNLITKEACEKVVDNIFKGQPDMRRLFNVTIKLNQHNAIVMSWSKDEAIIPELEK 446 Query: 281 AAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIF 340 G +LLT+ +++ A ++ YR R IE++F+ LK L L + + PE A+ F Sbjct: 447 TDGIFVLLTNHDKEKVDANELLTRYRGRNDIEISFRFLKGSLDLQQIFLRNPERVDAYCF 506 Query: 341 ANLLAAFLID 350 +LA +++ Sbjct: 507 LKVLAMLVLN 516 >UniRef50_Q8ABH9 Putative transposase n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8ABH9_BACTN Length = 310 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 40/190 (21%) Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 I DRG+ + + + + EA ++VR +++ + + R L Sbjct: 105 PYEPSSYYIFDRGY-NNFKMLYKIHQIEAYFVVRAKKN---------LQYKSIQWKRRLP 154 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 + +V++ K+ + R+V+ Sbjct: 155 KNVLSDASVLLTGFYPKQYYP----------------------------KPLRLVKYWDE 186 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 E +T+ SA QVA+ Y+ RWQ+EL FK LK L + + I Sbjct: 187 EQEREFTFITN--AMHISALQVAELYKNRWQVELFFKWLKQHLKIKRFWGTTENAVRIQI 244 Query: 340 FANLLAAFLI 349 +A + A L+ Sbjct: 245 YAAICAYCLV 254 >UniRef50_A7C4E9 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C4E9_9GAMM Length = 216 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 52/141 (36%), Gaps = 3/141 (2%) Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 FL + T ++ ++K + + + + +L + R Sbjct: 12 FLSRIKSKTVIYITEIVQGKISQKYIGTKLLSVPIKNKRSDILEVIVEKLCDKGTLCCRA 71 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 + + +T+L +A + YRLRWQIEL FK K L+ + L + Sbjct: 72 IGFWNPVDKCYHWYITNL---SVAAHLIYPLYRLRWQIELIFKACKQSLNANRLTSNNKH 128 Query: 334 LAKAWIFANLLAAFLIDDIIQ 354 + + + A++ A ++ Sbjct: 129 IIENLLLASIAAQLASHTVLD 149 >UniRef50_A3ZQJ1 Probable transposase n=4 Tax=Blastopirellula marina DSM 3645 RepID=A3ZQJ1_9PLAN Length = 432 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 55/335 (16%), Positives = 107/335 (31%), Gaps = 49/335 (14%) Query: 35 REIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQ 94 R +R + +S++++ TLSD L + L Q Sbjct: 71 RTLRTIEDFSQT--QQVQRHLSIQKICR-------TTLSDFHRLVDPQRLEPILQALREQ 121 Query: 95 TLAVRAAVTGCTSGK-----RLRLVDGTAISAP-------------GGGSAEWRLHMGYD 136 A + + R VDGT + A G ++ RL Sbjct: 122 LSRKEAGLGRAANDLSELLKRTVAVDGTFLEAAAEVAWAVRGSNQHGRENSYIRLDFQVG 181 Query: 137 PHTCQFTDFELTDSRDAERL-DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVH 195 + + + +E + + DRGF + R Sbjct: 182 VTSWAPEMIVVAEPGHSESASAAANVQDGRLYLYDRGFSGFDVINAHYHLQNESWTPRAQ 241 Query: 196 WRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEK 255 + +R+ A G + + + + + + + AR I + +P Sbjct: 242 FV-IRYKPAGGNAPHLAD--ADENPLSEKDLAAGVVSDRRGRFRSSKAARHIVLDVP--- 295 Query: 256 ALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAF 315 + + + + + + + V+ L+T+L + SAE +A Y+ RWQIEL F Sbjct: 296 --LREVIIEYQEQDETKTVR-----------LITNL--LDVSAEVIAQLYQQRWQIELFF 340 Query: 316 KRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 + LK + + L + + ++AA L Sbjct: 341 RWLKCFANFNHLISHHRSGVLLSFYVAVIAALLTY 375 >UniRef50_Q55566 Putative transposase for insertion sequence element IS4SA n=10 Tax=Synechocystis sp. PCC 6803 RepID=T4SA_SYNY3 Length = 338 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 52/322 (16%), Positives = 106/322 (32%), Gaps = 59/322 (18%) Query: 46 LGLAYGPGGMSLREVTAWAQLHD-VATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTG 104 LGL S+R + L +S + + R+ + I+ + + Sbjct: 34 LGLVLDQSQTSMRSMFKRLNLRGETVDISTFSKASKKRDVGVFREIIFSLKKELSKRKEI 93 Query: 105 CTSGKRLRLVDGTAISAPGG-----GSAEWRLHMGYDPHTCQFTDFEL--TDSRDAERLD 157 + +D T +S G + ++ G + T + D + + Sbjct: 94 KQGELEIFPLDSTIVSITSKLMWNLGFHQVKVFSGINLSTGIPGGIVIHFGQGHDNKYGN 153 Query: 158 R--FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 + + + DRGF + ++ L E ++ ++ Sbjct: 154 ETIEETPENGVAVMDRGFC--------------------DLQRIKRLQKENNKYHVLRIK 193 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 + K M+G NK +R++ + + R+V Sbjct: 194 NNIKLEKLANDNYMVGTGKNK-----IESRVVIFTHD---------------NSEFRLVT 233 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 +E+ + S E++A+ Y+ RWQIEL +K LK L L+ L AK Sbjct: 234 NLPIESKEI---------EGVSDEKIAEIYKKRWQIELLWKFLKMHLKLNRLIAKNENAI 284 Query: 336 KAWIFANLLAAFLIDDIIQPSL 357 I+ ++A ++ ++ P Sbjct: 285 GIQIYTCIIAYLILKLLVIPKE 306 >UniRef50_A3EIG1 FOG: Transposase and inactivated derivatives n=3 Tax=Vibrio cholerae V51 RepID=A3EIG1_VIBCH Length = 264 Score = 65.6 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 56/159 (35%), Gaps = 6/159 (3%) Query: 199 LRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALI 258 L ++ + ++ + G LI +SL +A Sbjct: 27 LTLFDKGFYALGLLHRWQSQGKERHWLIPLRKGAQYKTLRKLGRGDGLIELSLT-AQAKK 85 Query: 259 SKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRL 318 + + + E LL + Y +A+ Y RW+IEL ++ + Sbjct: 86 KWADAPDTLEARLITTKVKGKEVQ---LLTSMTDPKRYIGADIAELYSHRWEIELGYREM 142 Query: 319 KSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQP 355 K + ++ LR+K P L K ++ LLA L+ ++ Sbjct: 143 KQYMLQNSLTLRSKTPALVKQELWGMLLAYNLLRFMMCQ 181 >UniRef50_C9LFX6 Transposase domain protein n=14 Tax=Bacteroidales RepID=C9LFX6_9BACT Length = 424 Score = 65.2 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 64/378 (16%), Positives = 124/378 (32%), Gaps = 73/378 (19%) Query: 8 WSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLH 67 +S I+ + K E L S+ GA + LL + A SLRE+TA Q Sbjct: 25 YSQIIKLLNKSEILRISS-EQGAERYVKSFDAWTHLLVMLYAVIMRFDSLREITASLQAE 83 Query: 68 D----------VATLSDVALLKRLRNAADW---FGILAAQ---TLAVRAAVTGCTSG--- 108 + + S +A + R+ A + + L A+ L+ + + + Sbjct: 84 ACKLRHLGIFMMTSRSTLADGNKRRSEAVFEAVYRDLYAKHRHLLSSDSRLCTRKNEPKW 143 Query: 109 -KRLRLVDGTAISAPGG--------------GSAEWRLHMGYDPHTCQFTDFELTDSRDA 153 KRL+++D T I+ ++H + +D T + Sbjct: 144 MKRLKIIDSTTITLFSNLLFKGVGRHPKTGKKKGGIKVHSIIQANEGVPSDIRFTSAATN 203 Query: 154 ERLDRFAQTA--DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDM 211 + T +I DR + + + + Y+ ++ + + M Sbjct: 204 DSFMLLPATLNRGDIIAMDRAYIDYAK-FQQMTERGVVYVTKMKKNLQYTIEEDVMCQTP 262 Query: 212 MGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKG 271 G + ++ + K L + ++ Sbjct: 263 EGVM-----------------------------QVRVQRVTFRKKLKGGSSIVHHA---- 289 Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 R+V ++ + LLT+ + ++ D Y RW IEL FK++K L + Sbjct: 290 RIVTYVDVQKRKLISLLTN--DMTSDPLEIMDIYHKRWAIELLFKQIKQNFPLKYFYGES 347 Query: 332 PELAKAWIFANLLAAFLI 349 K I+ L+A L+ Sbjct: 348 ANAIKIQIWVTLIANLLL 365 >UniRef50_C6J0N9 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J0N9_9BACL Length = 402 Score = 64.8 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 55/272 (20%), Positives = 95/272 (34%), Gaps = 53/272 (19%) Query: 92 AAQTLAVRAAVTGCTSGKRLRLVDGTAISAP--GGGSAEW-------RLHMGY---DPHT 139 A+ + G + +LR++D T ++ P G A W ++H D T Sbjct: 100 IARIQEITKQKQGIPNIGKLRILDSTVLTLPTLAGRWAYWSKEQNAVKIHTQLVVADRET 159 Query: 140 CQFTDFELTDSR--DAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWR 197 + + D E D I + DRG+ E SL + ++ R+ + Sbjct: 160 VFPGKIINSTAAVSDQEVALDLVVADDAIHVMDRGYIQY-ELYESLIHQQMRFVARLQTK 218 Query: 198 GLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKAL 257 + + + + V I + +K ARLI Sbjct: 219 NKVTILHQRAVPEGFPI--------TIDADVEIQWNDKQKQTHYLQARLI---------- 260 Query: 258 ISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKR 317 E +R R LLT++ + SA+++++ YR RW IEL FK Sbjct: 261 ----EFTDEQKRTYR--------------LLTNVQDR--SAQEISEIYRYRWLIELFFKW 300 Query: 318 LKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 +K L L + + I+ L+A L+ Sbjct: 301 IKQHLRLVKIYSANQTAIWNQIYLALIAYSLV 332 >UniRef50_C7PAE4 Transposase IS4 family protein n=4 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PAE4_CHIPD Length = 412 Score = 64.8 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 57/370 (15%), Positives = 112/370 (30%), Gaps = 69/370 (18%) Query: 21 LDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL----------HDVA 70 +D R A + + L+ + + SLRE+ Sbjct: 25 IDKVCRETNADYYYKHFKAFDHLVTMLFSSFHQCTSLRELHTGLLANQHRLHHLGIKHTP 84 Query: 71 TLSDVALLKRLRNAADWFGILAAQTLAVRAA------VTGCTSGKRLRLVDGTAISA--- 121 S ++ R R A +F L + + RL +VD T +S Sbjct: 85 RRSTISDANRTRPVA-FFEKLYHRLYNHHYQAFSPDSRKRKSLVDRLFIVDSTTVSLFSN 143 Query: 122 ----------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIRIA 169 G + H+ T + LT++ +R + + I Sbjct: 144 VMKGAGVIRMDGRKKGGIKAHVLMTAKTELPSFTILTEAAKNDRIIMPQLELLPGSIIAM 203 Query: 170 DRGFGSRPECIRSLAFGEADYIVRV-HWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTV 228 DR + + + ++ E ++ RV ++ LT ++ + G + + Sbjct: 204 DRAYVNY-KLMKEWTEKEITWVTRVTKSMKIKLLTRNRLK------ILHKRKGILKDWVI 256 Query: 229 MIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLL 288 +GN ++ AR+I + N +K LL Sbjct: 257 QLGNPLTEEKSPVQTARVI--------------SIYDRNTKK------------KIHLLT 290 Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 + Y+ + Y+ RW IE+ FKR+K L+ + ++ L+ L Sbjct: 291 NNF---TYTPTTIRKLYQKRWAIEMLFKRIKQNSQLNNFLGENKNAISIQLWCTLIKDLL 347 Query: 349 IDDIIQPSLD 358 + + Sbjct: 348 TKIVKDKLTE 357 >UniRef50_Q18EK5 Probable transposase (ISH8/ISH26) n=5 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18EK5_HALWD Length = 417 Score = 64.4 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 2/101 (1%) Query: 251 LPPEKALISKTRLLSENRRKGRVVQAETLEA--AGHVLLLTSLPEDEYSAEQVADCYRLR 308 E++ E G + LE + LT+L EY V + Y LR Sbjct: 285 HSDEESTRRVRDERIELAETGEEFRRIVLETPDGEEIEYLTTLASSEYDPIDVINIYTLR 344 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 IE+ F+ K L+++ +K +F L+ L+ Sbjct: 345 TVIEILFREWKQYLNIENFHSKSLNGVLFELFCALIGYMLV 385 >UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomycetales RepID=A8KXP7_FRASN Length = 421 Score = 64.1 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 51/306 (16%), Positives = 82/306 (26%), Gaps = 64/306 (20%) Query: 79 KRLRNAADWFGILAAQTL----AVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRL-HM 133 R R + +L +T R V G R+ VDGT + P Sbjct: 102 ARDRLGVEPVKLLFERTAVPMALPRRTVGAFYRGWRVCTVDGTTLLVPDTDENAAAFGKP 161 Query: 134 GYDPHTCQFTDFELTD-------------------SRDAERLDRFAQ-----TADEIRIA 169 G D + S+ A F + +A Sbjct: 162 GNDQGEGALPQVRVLGLVECGTRALLGAGFGGTGGSKAASEQALFPDLLGALRPGMLVLA 221 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 DR F E A AD + R + E + L Sbjct: 222 DRNFLG-FELFAKAAATGADLLWRAKSDRRLPIDTELADGSYLSHL-------------- 266 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + G R I V + L +++ + L+ T Sbjct: 267 ------VEPGTRDKGRKITVRVVEYTLDRDPDSPLPAGKKE------------TYRLVTT 308 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAF 347 L D A +A Y RW++E +K LR++ P+ + ++ LL Sbjct: 309 ILDPDAAPATDLAALYSDRWEVETLLDEIKVHQQDGRLVLRSRAPDRVEQEVWGVLLLHR 368 Query: 348 LIDDII 353 + +I Sbjct: 369 ALRKLI 374 >UniRef50_Q7UY96 Similar to transposase n=1 Tax=Rhodopirellula baltica RepID=Q7UY96_RHOBA Length = 403 Score = 64.1 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 37/178 (20%), Positives = 69/178 (38%), Gaps = 9/178 (5%) Query: 180 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 239 +RS ++VR R +R + ++ L C + ++ Sbjct: 158 LRSWHAKGHLFLVRCDDRRVRCEGRSVLLSELNDELDS-QCEYADAGKALYHGKKVQRQV 216 Query: 240 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVL----LLTSLPEDE 295 A L P + + + + ++ + R V ++A G +L LLT++P D+ Sbjct: 217 AEKTVTL--YR-PHSEVIDGEKKAVTGEPIEVRTVFVRLVDADGWILAEWTLLTNVPADQ 273 Query: 296 YSAEQVADCYRLRWQIELAFKRLKSLLH-LDALRAKEPELAKAWIFANLLAAFLIDDI 352 +A V Y RW+IE FK LKS L+ + + E + +A L+ + Sbjct: 274 ANASDVGRWYYFRWRIESFFKLLKSHGQELEYWQQESGEAITKRLLMASMACVLVKQL 331 >UniRef50_Q46310 Transposase n=1 Tax=Carnobacterium maltaromaticum RepID=Q46310_CARML Length = 152 Score = 63.7 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 47/135 (34%), Gaps = 10/135 (7%) Query: 4 SHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY--GPGGMSLREVT 61 ++ ++ P + +R + + R+R LL L + G SL ++ Sbjct: 3 QFKKFAEHISSCFSPSAIQEFSRKSKIVKRKRMF-TIDHLLWLCVWQEKNMGDSSLIDMC 61 Query: 62 AWAQLHDVATLSDVALLKRLRNAA-DWFGILAAQTLAVRAAVTGC------TSGKRLRLV 114 A +S L +R + + +L L + T R+R++ Sbjct: 62 ASLWQQFGIKISPEGLNQRFNEKSTAFLKLLFHSILEKQTPDLAAIQHAYSTHFNRIRIL 121 Query: 115 DGTAISAPGGGSAEW 129 D T+ P S ++ Sbjct: 122 DSTSFQLPNTFSDKY 136 >UniRef50_C0VKK7 ISCja2 transposase n=8 Tax=Acinetobacter RepID=C0VKK7_9GAMM Length = 385 Score = 63.7 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 54/371 (14%), Positives = 115/371 (30%), Gaps = 63/371 (16%) Query: 5 HDNWSAILAHIGKPE---ELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVT 61 + + + + KP + + A+ + R + + ++ SLR++ Sbjct: 1 MSHQNTVFHQLLKPISRCDFERLAKQHHCGQKLRSATRWDQFIAILMSQLSCRQSLRDIQ 60 Query: 62 AWAQLHDV------ATLSDVALLKRLRNA--ADWFGILAAQTL--AVRAAVTGCTSGKR- 110 + + A + L R+ A + L Q L + K Sbjct: 61 SNLESQQEKLYHLGAKTIARSTLARINQEQPASLYQQLFTQLLRHCENTKIAHKFRFKNP 120 Query: 111 LRLVDGTAISAPGGGSAEWRLH---------MGYDPHTCQFTDFELTDSRDAERLDR--F 159 L +D + I ++H +G + L D + + + Sbjct: 121 LYSLDASHIDLSLSLCEWAKVHESKASIKLTVGLNHSNTIPEFVALGDGIENDMVQGRLL 180 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 I + D+G+ + + + ++ R+ + + + ++ + G L Sbjct: 181 KFPPGSIVVFDKGYVDY-QWFAEMTDRKVSFVTRLRPKTVYEVKSKREVYACKGIL---- 235 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 + + + + KK GAP R I E + K R Sbjct: 236 ----ADEYIELSSDYAKKRGAPKRLRRI------EFYDVEKKRTFEFLSNNFH------- 278 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 +A +A Y+ RW++EL FK +K L L + + + I Sbjct: 279 ----------------LAASTIAAIYKDRWKVELFFKAIKQNLKLKSFLGRSRNAIQTQI 322 Query: 340 FANLLAAFLID 350 + L+A L+ Sbjct: 323 WIALIAYLLVS 333 >UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula baltica RepID=Q7UPU9_RHOBA Length = 656 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 67/222 (30%), Gaps = 40/222 (18%) Query: 129 WRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEA 188 W + + E L+ + D GF E +S+ G Sbjct: 268 WHVATQLTWCWKLGPSNASERAHVQEMLENGEFPEKTLFTGDAGFVGY-EFWKSIIDGGH 326 Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIA 248 ++VRV +++ L ++ P R+I Sbjct: 327 HFLVRVGAN-----------VNLLHSLGYDVEPDEDNLVYCWPKDKRREGMRPLKLRMI- 374 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 + + + + VLL + L E + + +Q Y+ R Sbjct: 375 ------QIQLGRKKA---------------------VLLTSVLDEKKLTDKQALVIYKSR 407 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 W IEL F+ LK LR ++ A + ++L+ ++ Sbjct: 408 WGIELEFRNLKQTYGRRQLRCRQSVRALVELHWSILSILIVK 449 >UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio RepID=B2LS82_9VIBR Length = 440 Score = 62.1 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 49/352 (13%), Positives = 110/352 (31%), Gaps = 44/352 (12%) Query: 18 PEEL-DTSARNAGALT-RRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL------HDV 69 P E + + + G ++ R+R + + + S+++V +L ++ Sbjct: 28 PWEWVEEAVQQTGRVSLRKRRLPAEQAVWLVLGIGLQRNRSIQDVCDKLELAFPDVDGEL 87 Query: 70 ATLSDVALLK---RLRNAADWFGILAAQTLAVRAAVTGC--TSGKRLRLVDGTAISAPGG 124 ++ +++K RL + L T + G +L VDGT Sbjct: 88 TPMATSSIIKGKERLGDKP--MRYLFKTTAQQWEQQSDFDEVCGLKLLSVDGTYFKTHNT 145 Query: 125 GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLA 184 + H G+ A++ F ++ R + Sbjct: 146 EENQ---HFGF-----------------AQKGASFPSVLAVTLMSTRSHLVSDAAFGPVT 185 Query: 185 FGEADYIVRV----HWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA 240 E Y ++ L ++ +G + T + + Sbjct: 186 NSEISYAQQLVGSAPDDSLTLFDRGFTSAELFTSWQGASSNSHWLTPIKTKMRYDIIESY 245 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 LI + + P+ A L + + ++ E G + + L + Y + Sbjct: 246 TDYDHLIEMPVSPQ-AQKQTPYLGKRWQARLILIPTPKGEIKG--FITSCLCPERYLFDD 302 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLID 350 + Y RW+IE ++ LK + LR+K+ ++ L + ++ Sbjct: 303 LVKVYWERWEIERSYGELKQYQLQNKPTLRSKKKVGIYQELWGILTSYNIVR 354 >UniRef50_A1APW2 Transposase, IS4 family n=6 Tax=Deltaproteobacteria RepID=A1APW2_PELPD Length = 391 Score = 61.7 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 56/286 (19%), Positives = 100/286 (34%), Gaps = 56/286 (19%) Query: 75 VALLKR-LRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGS-AEWRL- 131 A+ R L ++ FG L Q A + L +DG+ I A A++R Sbjct: 93 EAINNRGLEQLSEVFGHLVKQ--AGKVLPAEYAHLGNLVSIDGSLIDAVLSMEWADYRSG 150 Query: 132 ------HMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIRIADRGFGSRPECIRSL 183 H+G+D + L+D ++ ER +D+ E + DRG+ S Sbjct: 151 SKKAKAHVGFDINRGIPRKIYLSDGKEGERPFVDKIIDK-GETGVMDRGYQSHDH-FDKW 208 Query: 184 AFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFP 243 E ++ R+ ++ + E + + + VM+G G + Sbjct: 209 QAAEKFFVCRIRENTIKIVIRE-NAVNPDSII-------FYDRIVMLGTKGVNQTEKEL- 259 Query: 244 ARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVAD 303 RL+ + + + + T+ + +AEQVA+ Sbjct: 260 -RLVGYRV----------------------------DGKDYWIA-TN--RYDLTAEQVAE 287 Query: 304 CYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 Y+LRW IE F K L + L A+ + L+ L+ Sbjct: 288 VYKLRWNIETFFGWWKRHLKVYHLIARSKYGLMVQLLGGLITYLLL 333 >UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 Tax=Streptomyces avermitilis RepID=Q82R31_STRAW Length = 542 Score = 61.4 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 2/93 (2%) Query: 273 VVQAETLEAAG-HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD-ALRAK 330 V A+ L G + L T Y A ++ + Y RW+IE AF L+ L LR++ Sbjct: 276 VTTAKGLRLEGHYRLATTLTDHRRYPAVELVELYHERWEIESAFYSLRHTLQCGLVLRSQ 335 Query: 331 EPELAKAWIFANLLAAFLIDDIIQPSLDFPPRS 363 + + ++A+L + + +++ P + Sbjct: 336 DVAGIQQELWAHLTVYQALRRAMVEAVETLPGT 368 >UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EK95_ACIF5 Length = 369 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 38/138 (27%), Positives = 61/138 (44%), Gaps = 18/138 (13%) Query: 239 GAPFPARLI-AVSLPPEKALISKTRLLS-----ENRRKGRV-VQAETLEAA--------- 282 GA RL + LP EK L + L + ++R+KGR ++ +E Sbjct: 200 GARLLFRLSSVLKLPREKILADGSYLSTIYSSTQDRKKGRGGIRVRIIEYTLDGIPDAEP 259 Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL--HLDALRAKEPELAKAWIF 340 + L+ + E A ++A Y RW IE + LK+ L LR+K PEL + + Sbjct: 260 SYRLITNWMDPTEAPALELAALYHRRWTIESSLDELKTHLADRQVVLRSKRPELVEQEFY 319 Query: 341 ANLLAAFLIDDIIQPSLD 358 A LLA + ++ + D Sbjct: 320 ALLLAHAAVRHLMTEAAD 337 >UniRef50_Q093Y3 Isrso13-transposase protein n=7 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093Y3_STIAU Length = 457 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 64/373 (17%), Positives = 122/373 (32%), Gaps = 62/373 (16%) Query: 11 ILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVA 70 ++ E ++ + +E+ +A + + L SL A AQ + Sbjct: 23 LMQRALSAEWMEGLFQEHRQRQYTKELLFSAEVGLMELVALGLRPSLH---AAAQDSEEL 79 Query: 71 TLSDVALLKRL-RNAADWFGILAAQTLAVRAAVTGC--------TSGKRLRLVDGTAISA 121 +S AL +++ + L + + +G R+R++DG ++A Sbjct: 80 KVSQQALYEKVNHTEPELVRALVQGSGERLTPIVKQLKLQQEPWAAGYRVRVLDGNKLAA 139 Query: 122 P--------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDR-------FAQTADEI 166 G A ++ + DA +R E+ Sbjct: 140 SEKRLKPLRGFRGAAMPGQSLVVYAPEWDLVVDILPAEDAHAQERALMGPILERVQPGEL 199 Query: 167 RIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGET 226 +ADR F ++ + A ++VR H + + + G E Sbjct: 200 WLADRNFSTKNILF-GIEETGAAFLVREHAQTPHP-----KEVGTLKEVGRSKTGVVFEQ 253 Query: 227 TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVL 286 V I G K+ R+ + T + Sbjct: 254 AVEIEAEGGKRLALR---RVEVH------------------------LDEPTENGDTCIR 286 Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFA-NLLA 345 LLT++P + A +VA+ YR RW IE F RL+S+LH + A A F +++A Sbjct: 287 LLTNVPAERMGALEVAELYRRRWSIEGMFGRLESVLH-SEVHALGHPRAALLAFGVSVMA 345 Query: 346 AFLIDDIIQPSLD 358 ++ ++ + Sbjct: 346 YNVLAVLLAAVEE 358 >UniRef50_C3AUM2 Transposase for insertion sequence element IS231B n=3 Tax=Bacillus RepID=C3AUM2_BACMY Length = 192 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 54/166 (32%), Gaps = 26/166 (15%) Query: 92 AAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAE---WRLHMGYDPHTCQFTDFELT 148 Q LA S R D I GG A+ ++ + YD H+ +F +F++ Sbjct: 24 ILQQLAQETGFVKRKSKYGAR--DLAPIYPSSGGCAQTAGIKIQLEYDLHSGKFLNFQME 81 Query: 149 DSRDAERL----DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRW--- 201 + ++ ++ I D G + ++ + A YI + + Sbjct: 82 PGENNDKTFGTDCLDTLCPGDLCIRDLG-CFHLKDLQHIQDKMAYYISGIKSNTRIYQKN 140 Query: 202 ------------LTAEGMRFDMMGFLRGLDCGKNGE-TTVMIGNSG 234 E ++ DM + L G+ E + +G Sbjct: 141 PNPDYFQDGRIKKGTEYIQIDMEVLMNSLQPGQTCEISNAFVGMVD 186 >UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii DJ RepID=C1DIQ1_AZOVD Length = 400 Score = 59.1 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 44/184 (23%), Positives = 71/184 (38%), Gaps = 17/184 (9%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE-GMRFDMMGFLRGLDCGKNG 224 + + D GF +++ Y+ RV R L L + + L + G Sbjct: 155 VLVTDAGFQRPW--FQAVEIRGWHYVGRVRNRDLCRLGEQPWGPVKSLYALASASPKRLG 212 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 + AP+ +L V P R+ R R Q+ E+ Sbjct: 213 CVEM--------TRSAPWSTQLCVVKHAPR--GRQHRRITGTLARDKRSRQSAQRESEPW 262 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELAKAWIFA 341 LL ++LPE +++A QV YR R QIE F+ LKS + L R++ P + + Sbjct: 263 -LLASNLPEAQWNAAQVVAIYRRRTQIEEGFRDLKSHRLGIGLGLHRSRCPRRIEILLLI 321 Query: 342 NLLA 345 +LA Sbjct: 322 AVLA 325 >UniRef50_Q8QNB6 EsV-1-170 n=2 Tax=Ectocarpus siliculosus virus 1 RepID=Q8QNB6_ESV1 Length = 383 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 63/362 (17%), Positives = 110/362 (30%), Gaps = 94/362 (25%) Query: 17 KPEELDTSAR-NAGALTRRREIRDAATLLRL--GLAYGPGGMSLREVTAWAQLHDVATLS 73 PE D + + + RRR++ ++ L G G ++ V + Sbjct: 6 SPEIYDAFKKCDQQWMQRRRKMDTSSLFYTLTRCCVQGRG------------VNHVLKME 53 Query: 74 DVALLKRLRNAADWFGILAAQTLAVRAAVTGC-TSGKRLRLVDGTAIS-APGGGSAEWRL 131 D A + ++A L R+ VDG+ + P +A ++ Sbjct: 54 DEAYSSQAVHSAR--KKLPMGAFKEVNRFLHRGPHEPRVFAVDGSKVHVHPSFINAGYKT 111 Query: 132 HMG------------------YDPHTCQFTDFELTD---SRDAERLDRFAQTADEIRIAD 170 D T DFELT R A + + + D Sbjct: 112 RTNDQPVSRPAKRPLVMLSSMVDVKTKACIDFELTKHFNERRAATSMLRSVQKGDTLLFD 171 Query: 171 RGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMI 230 RG+ S+ + + S+ A + R+ + RG N Sbjct: 172 RGYYSK-DLLHSVHGSHAFGVWRLK----------------IDAFRGTRSFFNS------ 208 Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 + ARL+ ++ +V L T Sbjct: 209 CRTEATCLILGVKARLLKY----------------------------FIDGKTYVCLTTD 240 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 S ++ Y RW++E +FKRLKS L L+ A+ P+L + A +L + Sbjct: 241 ---PSLSRLKIKTMYASRWRVEESFKRLKSNLRLEKAHARTPDLYIQEVEARVLLDTITL 297 Query: 351 DI 352 + Sbjct: 298 RM 299 >UniRef50_B5WFI6 Putative uncharacterized protein n=1 Tax=Burkholderia sp. H160 RepID=B5WFI6_9BURK Length = 256 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 26/109 (23%), Positives = 46/109 (42%), Gaps = 11/109 (10%) Query: 260 KTRLLSENRRKG---------RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQ 310 + R+ + R+K R ++ VLL + + + CY RWQ Sbjct: 31 RMRVSPQARKKCPALSEFWNARAIRTIDARGRERVLLTSLEDRRRFKPADIVACYERRWQ 90 Query: 311 IELAFKRLK-SLLHLD-ALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 +E +++ LK S+L + LR++ E I L+A+ LI I + Sbjct: 91 LETSYRGLKQSMLGSELTLRSRTVEGVYQEISGALIASNLIRREIANAT 139 >UniRef50_UPI0001AF03EF IS4 family transposase n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF03EF Length = 374 Score = 58.3 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 70/203 (34%), Gaps = 45/203 (22%) Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 T D + +ADR F E + ++A A ++VR + L L Sbjct: 35 HLTPDMLLLADRAF-DGNELLAAIARQGAQFLVRCTSTRRPPV------------LALLP 81 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 G + GN R+I E + + R V T Sbjct: 82 DGS------YLTRIGN------LSLRVI------------------EAKVEARTVDGSTF 111 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK-SLLHLDALRAKEPELAKAW 338 + LL T A ++ Y RW+IE A+ L+ +LL LR+K+P Sbjct: 112 -GDAYRLLTTLTDHRTDPAARLMRLYHERWEIETAYLALRHTLLQGRVLRSKDPVGLCQE 170 Query: 339 IFANLLAAFLIDDIIQPSLDFPP 361 ++ L + I+ +++ P Sbjct: 171 VWGLLTLYQALRSIMVTAVETEP 193 >UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocaceae RepID=B2J1G3_NOSP7 Length = 381 Score = 58.3 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 49/308 (15%), Positives = 96/308 (31%), Gaps = 62/308 (20%) Query: 78 LKRLRNAADWFGILAAQTLAVRAAVTGCTSGK---RLRLVDGTAI--------------- 119 R R A L Q + A + R+ ++DG+ Sbjct: 70 QARQRLGARVMCKLFHQLVKPMATQETLGAFLQELRIVVIDGSCFDVPDSDENARVFGRP 129 Query: 120 -SAPGGGSAEWRLHMGYDPHTCQFTDFE--LTDSRDAERLDRF----AQTADEIRIADRG 172 S PG +A ++ + F+ + R ER+ + T + + DRG Sbjct: 130 GSRPGTKAAFPKVRLVILVEAGTHIIFDALMWPYRIGERVRALRLLRSVTPGMLLMWDRG 189 Query: 173 FGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGN 232 S +++ DY+ R+ + ++ Sbjct: 190 LHSYA-MVQATVTKGCDYLGRIPANIKFIAEKPLEDGSYLSWI-------------YPSG 235 Query: 233 SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLP 292 KKA P R+I ++ E + L+ + L Sbjct: 236 KLRKKASQPILVRIIEYTIEHPD---------------------NPTEQLTYRLITSLLN 274 Query: 293 EDEYSAEQVADCYRLRWQIELAFKRLKSLL--HLDALRAKEPELAKAWIFANLLAAFLID 350 +++ AE +A Y RW++E LK L +R+++P ++A LL + + Sbjct: 275 IEKFPAELLAREYHQRWEVENTIDELKIHLLGRKTHVRSQKPREVVQEVYAWLLGHWTVR 334 Query: 351 DIIQPSLD 358 ++ + Sbjct: 335 LLMFQAAT 342 >UniRef50_A1BCF6 Transposase, IS4 family protein n=1 Tax=Chlorobium phaeobacteroides DSM 266 RepID=A1BCF6_CHLPD Length = 252 Score = 57.9 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 71/232 (30%), Gaps = 46/232 (19%) Query: 128 EWRLHMGYDPHTCQFTDFELTDSRDAE-RLDR------FAQTADEIRIADRGFGSRPECI 180 +LH YD + +TD + ++ R+ R F D I DR + E + Sbjct: 24 AIKLHYLYDHRSSLPAFMVMTDGKKSDIRVARSQEKLDFHLLPDSIVSFDRAYID-FEWL 82 Query: 181 RSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA 240 +L + ++ R + + R Sbjct: 83 YTLDQRKVWFVTRSKANIQYRIIGQHQPIKNKQVTR------------------------ 118 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 E+ + + ++ + R+V E +T+ + +A Sbjct: 119 ------------DERIELIIEKSRAKYLKPLRLVCYTDQETGKAYEFITN--NIKLAAST 164 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 +A Y+ RWQIE F+ +K L + + + + + + + I Sbjct: 165 IAAIYKSRWQIETFFRWIKQNLKIKSFQGTSQNAVLSQTWIAMCYYLRLSYI 216 >UniRef50_A7C2A8 Transposase of IS641 n=1 Tax=Beggiatoa sp. PS RepID=A7C2A8_9GAMM Length = 304 Score = 57.5 bits (137), Expect = 8e-07, Method: Composition-based stats. Identities = 47/230 (20%), Positives = 84/230 (36%), Gaps = 40/230 (17%) Query: 130 RLHMGYDPHTCQFTDFELTDSRDAERLDRFA-QTADEIRIADRGFGSRPECIRSLAFGEA 188 +LH+ ++ + +F +T + +ER A IADRG+ S L A Sbjct: 14 KLHLCFELNRMLAVEFLVTAANFSERAALIKMLKAGVTYIADRGYMSFKVGDEVLKAK-A 72 Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIA 248 ++ RV GLR + + + + + N T +I + ++ RL+ Sbjct: 73 HFVFRVK-TGLRLTVTKTLLVQLPKTVAAI---FNNVTDELIRYTNDEFKHI---YRLVC 125 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 S+ ++ +L + + QV Y LR Sbjct: 126 FSIGFDQ----------------------------FHILT---DRHDLTTFQVIMLYALR 154 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 WQIEL F+ LK ++ L ++ + L+AA L + Q + D Sbjct: 155 WQIELLFRFLKRTINGIHLIKQDERGVTIQFYTMLIAALLELRLKQMTAD 204 >UniRef50_A4A0C3 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A0C3_9PLAN Length = 445 Score = 57.1 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 56/254 (22%), Positives = 81/254 (31%), Gaps = 71/254 (27%) Query: 108 GKRLRLVDGTAISAPGGGSAEWRL------------------HMGYDPHTCQFTDFELTD 149 G + VDG+ AP + E L H+G DF + Sbjct: 131 GWVVMAVDGSRFEAPRTRANEAGLGCAGREKTTPQIYQTTLQHVGTSLP----WDFRIGP 186 Query: 150 SRDAERLDRFAQTAD----EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE 205 +ER D + IAD GF S C R L G D+++RV Sbjct: 187 GTASERRQLDEMLPDLPGKSLLIADAGFISYDLC-RVLLMGRHDFLLRVGGN-------- 237 Query: 206 GMRFDMMGFLRGLDCG-KNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLL 264 L L + E TV + A P RLI + ++ Sbjct: 238 ------THLLEKLGFACETRERTVYLWP-LRFHAIPPVVLRLIVLRDANKEP-------- 282 Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 V L+T++ + E + YRLRW +E F+ LK L Sbjct: 283 --------------------VYLVTNIDSESLPEEIASQIYRLRWGLETHFRGLKQTLGR 322 Query: 325 DALRAKEPELAKAW 338 D + ++ P A A Sbjct: 323 DRVLSRTPATALAE 336 >UniRef50_UPI00016C37A0 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C37A0 Length = 334 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 74/247 (29%), Gaps = 57/247 (23%) Query: 96 LAVRAAVTGCTSGKRLRLVDGTAISAP-----------------GGGSAEWRLHMGYDPH 138 L RA +G+R+ + DGT ++ P G G + RL + Sbjct: 109 LHDRAPGNWRWNGRRVLIADGTTVTMPDTPKNQNEYPHPGSQADGIGFPQIRLVALFCLA 168 Query: 139 TCQFTDFELTDSRDAERLDRF-------AQTADEIRIADRGFGSRPECIRSLAFGEADYI 191 D L SR + + + + + +ADR + + D + Sbjct: 169 CGAVLDAALGPSRGKQSGETALRRQIAGSVGSGTVLLADR-YFGGWFDLVLWRERGIDVV 227 Query: 192 VRVHWRGLRWLTAEGMRFDMMGFLR--GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV 249 R+H + +R + + + Sbjct: 228 TRIHQKRATDFRRGRRLGRDDHVVRWPKGQRPEWMDRDTYV------------------- 268 Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRW 309 LP E + + ++G + +++ T+L + A + + YR RW Sbjct: 269 RLPDE---LDIREVRVRVAQRGFRTRV--------LVVATTLTDPSIRATDLGERYRQRW 317 Query: 310 QIELAFK 316 IE+ + Sbjct: 318 SIEVDLR 324 >UniRef50_Q3M9Z5 Transposase, IS4 n=10 Tax=Cyanobacteria RepID=Q3M9Z5_ANAVT Length = 439 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 50/321 (15%), Positives = 84/321 (26%), Gaps = 78/321 (24%) Query: 71 TLSDVALLKRL-----RNAADWFGILAAQTLAVRAAVTG----------CTSGKRLRLVD 115 +S A+ +R + F L A T +++ +VD Sbjct: 75 EVSQQAISQRFLTFPAQLFEKVFKDLLPHLQASWQRRNQRKIPPSVQFTLTKFEKIWIVD 134 Query: 116 GTAI----------------SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRF 159 + + G L + D++ + Sbjct: 135 CSILEALFQKLDSLKDAPQGQLAGKIGTVINLVNLLPVEIWFCENPRTADTKFEADILNL 194 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEG-MRFDMMGFLRGL 218 T + + DRGF ++ L ++I R+ + F + L L Sbjct: 195 -VTPHTLLLLDRGFYHFNFWLQ-LIAQNVNFITRLKKGAAIHVQQVFTDSFALRDRLVRL 252 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 G K RL+ + Sbjct: 253 GSG--------------TKKTTFITLRLVEIRSDK------------------------- 273 Query: 279 LEAAGHVLLLTS-LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 LTS L + VAD YR RW+IE AF +K LL L L + Sbjct: 274 ----TWHSYLTSVLDPEVLPPYVVADLYRRRWRIEDAFNTVKRLLGLSYLWTGSVNGVQL 329 Query: 338 WIFANLLAAFLIDDIIQPSLD 358 ++ L ++ D+ D Sbjct: 330 QVWGTWLFYAVLVDLGDAVAD 350 >UniRef50_D0DW10 Transposase IS4 family protein n=5 Tax=Lactobacillus RepID=D0DW10_LACFE Length = 452 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 58/166 (34%), Gaps = 10/166 (6%) Query: 201 WLTAEGMRFDMMGFLRGLDCGKNG----ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKA 256 + M L L G + V G + + RL A P++A Sbjct: 221 LFDSWYSSPKMFYELTKLGLNGVGMLKRSSKVYYQYRGRQYSVKALYKRLQASKYQPKQA 280 Query: 257 LISKTRLLSE---NRRKGRVVQAET-LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 + + + K R+V +++L T+ + +++ Y RWQIE Sbjct: 281 YQYSCFVEAHVGNQKFKLRLVFVANRARQDDYLVLATT--QLSLQPQEIIQLYARRWQIE 338 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 FK K L LD + + + + ++A L+ + + D Sbjct: 339 NYFKVAKQYLRLDKSQVQSYDGLCGHLAIVMIAYNLLAWQERQNED 384 >UniRef50_C3FCZ5 Transposase for insertion sequence element IS231B n=2 Tax=Bacillus thuringiensis RepID=C3FCZ5_BACTU Length = 136 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 41/117 (35%), Gaps = 10/117 (8%) Query: 16 GKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM--SLREVTAWAQLHDVATLS 73 P+ L A+ + R + R A L L + G SL + + +S Sbjct: 21 FSPQALTELAKQTQFVQRTSKFR-AQDLSSLCIGMGQDTASHSLARLCGILESETGVLIS 79 Query: 74 DVALLKRL-RNAADWFGILAAQTLAVRA----AVTGCTS--GKRLRLVDGTAISAPG 123 L RL A ++ L ++ L + + S +R+R++D T Sbjct: 80 PEGLNLRLNTKAVEFLRSLFSRLLQKQLLSTMPLPSSFSAYFRRIRILDATTFQVSD 136 >UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae RepID=Q6LJK0_PHOPR Length = 394 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 57/186 (30%), Gaps = 17/186 (9%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 I + D GF R R + + Y+ RV + + + G Sbjct: 156 IIVTDAGF--RNTWFRQVDDMDWCYLGRVRGDVNVLIKNQWQHIKQLFIKANSKPKYVGF 213 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 T + P L K + + R A Sbjct: 214 TQL--------AKRKPLQCHLHLYKK----QTPKKRKDRPKGREHFSAQAVHKKSALEPW 261 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELAKAWIFAN 342 +L T+LP D +S+ + Y R QIE F+ LKS L R +P+ + Sbjct: 262 VLATNLPTDIFSSRCIVRLYTKRMQIEETFRDLKSPQYGFGLRQSRTHDPKRFDILLLIG 321 Query: 343 LLAAFL 348 LLA + Sbjct: 322 LLAFMV 327 >UniRef50_A7GMF1 Transposase IS4 family protein n=15 Tax=Bacillus RepID=A7GMF1_BACCN Length = 294 Score = 56.0 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 45/112 (40%), Gaps = 5/112 (4%) Query: 252 PPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQI 311 E +I + + R ++ + L LT+ + A+++A+ ++ RW I Sbjct: 159 SDEMIVIGIGTTQNRSENAFRFIKVLDSKGNELHL-LTN--RFDLGADEIAELHKSRWAI 215 Query: 312 ELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRS 363 EL FK +K L++ + + I+ ++ L ++ R+ Sbjct: 216 ELFFKWMKQHLNIKKFYGQNEQAVHNQIYIAMIVYCL--HVLAQLSSQSKRT 265 >UniRef50_A3ZMM8 Transposase insG for insertion sequence element-like protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZMM8_9PLAN Length = 464 Score = 56.0 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 3/74 (4%) Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 LE + L+TS+ + A +A+ YR R+ IE + LK + + LRAK E+ Sbjct: 299 LEDGATLALVTSM---SFDALCLAELYRRRYDIEFDIRDLKVTMDTENLRAKSVEMVMKE 355 Query: 339 IFANLLAAFLIDDI 352 + +++A L+ + Sbjct: 356 LMGSVIAYNLVSQL 369 >UniRef50_Q64E61 Transposase n=1 Tax=uncultured archaeon GZfos14B8 RepID=Q64E61_9ARCH Length = 622 Score = 55.6 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 38/254 (14%), Positives = 77/254 (30%), Gaps = 45/254 (17%) Query: 129 WRLHMGYDPHTCQFTDFELTDSRDA------ERLDRFAQTAD----EIRIADRGFGSRPE 178 ++L+ ++ + F++ + E +R + EI + D+GF + Sbjct: 335 YKLYAAFELKSNYPVCFKIEPGNTSDSTMLVEMCERAKKVVGKENIEIVMFDKGFYNAKS 394 Query: 179 CIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGET--TVMIGNSGNK 236 + G+ + +M + G++ K +T I + Sbjct: 395 FNK--IKGDLTFNTPAKKYKT-----------IMDAIAGIEPEKFKQTGYNRWISETRVA 441 Query: 237 KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY 296 G RLI V +A K + + LT+ Sbjct: 442 LEGYDGKLRLIVVKKVEPRAKKDKETGEKSWTME-----------DVYYSYLTN--NKTL 488 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 Y RW+IE FK L++ ++ + + ++ I A I ++ Sbjct: 489 GTIDAPKLYSKRWRIENFFKELRNHWNIRNFPSTSLDAVRSHI-----ALLFIQFMVLSL 543 Query: 357 LDFPPRSAGSEKKN 370 G E +N Sbjct: 544 FKHY--VLGGEYRN 555 >UniRef50_C6DY52 Transposase IS4 family protein n=1 Tax=Geobacter sp. M21 RepID=C6DY52_GEOSM Length = 394 Score = 55.6 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 54/287 (18%), Positives = 96/287 (33%), Gaps = 59/287 (20%) Query: 75 VALLKR-LRNAADWFGILAAQTLAVRAAVTGCTSGK-RLRLVDGTAISAP--------GG 124 A+ R L A+ F +L + + + L +DG+ I A Sbjct: 97 EAVNNRGLEQLAELFKLL---LKDAKNVIPAEFADIGNLVAIDGSYIDAVMSMDWADYSS 153 Query: 125 GSAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIRIADRGFGSRPECIRS 182 + + H+ +D + D LTD ER ++R DE + DRG+ Sbjct: 154 THNKAKAHVAFDINRGIPKDLILTDGNQTERQFVERM-IGPDETAVLDRGY-QCNANFDQ 211 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPF 242 E +I R+ R + + E + + V++G + A Sbjct: 212 WQENEKKFICRIQARSNKKVIRE-NPIARGSII-------FYDAVVLLGAPSTR---AKK 260 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 R++A + + I+ T+ + +A Q+A Sbjct: 261 EVRVVAYRVEGKDFWIA-----------------------------TN--RHDLTALQIA 289 Query: 303 DCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 + Y+LRW IE F K L + L A+ I + L+ L+ Sbjct: 290 EAYKLRWHIESFFAWWKRHLSVYHLIARSQYGLTVQILSGLITYLLL 336 >UniRef50_Q67PW6 Transposase-like protein n=14 Tax=Symbiobacterium thermophilum RepID=Q67PW6_SYMTH Length = 552 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 44/275 (16%), Positives = 82/275 (29%), Gaps = 36/275 (13%) Query: 90 ILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTD 149 + A +A V + +S G + +L + + Sbjct: 250 WVEACLAQQKARVRQLYQQLQ-------TVSGKGSARRKQKLQREFQEEVQHLREVN-QR 301 Query: 150 SRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRF 209 R + +R I + PE I+ L ++ ++ + Sbjct: 302 LRQYRQENRTNLAPLRILLRADSAFGTPEVIQRLLELGYEFTIKSYSGSNVAYKHLFDAV 361 Query: 210 DMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 ++ E + G + AP+P RL+A+ Sbjct: 362 PAENWVEVEKNRFASEAVTVPGPT----LLAPYPVRLVAMR---------------RWDA 402 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 GR V+ ++LT+L +E + +V Y R IE F+ K H R Sbjct: 403 DGREVR---------SVILTTLQPEELTTTEVVKLYHGRQTIEAGFQEWKGTFHFGTPRL 453 Query: 330 KEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSA 364 ++ E A+ L A L+ + P+ A Sbjct: 454 RKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLA 488 >UniRef50_A5D1X0 Transposase n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D1X0_PELTS Length = 547 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 67/412 (16%), Positives = 125/412 (30%), Gaps = 69/412 (16%) Query: 2 NYSHDNWSAIL-AHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 NW ++ I +L R A R+ + + + + S ++ Sbjct: 70 EAQIKNWGYVVYQKIWNQFDLPNLLRKISA-QRKVQFDLNNAAFLMAVQHLLEPRS--KL 126 Query: 61 TAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGK---RLRLVDGT 117 + H A+L +V+L L + D + L V + D T Sbjct: 127 GTYTHQHRYASLPNVSLNH-LYRSLD-LLWEHKELLEVEIFKKNHHLFNMQVDVVFYDVT 184 Query: 118 AISAPGGGSAEWR------------LHMGYDP---HTCQFTDFELTDSR--DAERLDRFA 160 S + R + + + +EL D + L++ Sbjct: 185 TFSFASVEADSLRNFGFSKDGKFNEVQVVLGLLIDCEGRPIGYELFPGNTFDGKTLEKAL 244 Query: 161 QTADE-------IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL----------- 202 +E I +ADRG S+ R + G + + + Sbjct: 245 VKLEERFGLRRVIIVADRGINSKLNLKRIVDRGYSYIFAARIKNMKKEITDEILSENGYQ 304 Query: 203 ----TAEGMRFDMMGFLRGLDC-GKNGETTVMIGNSGNKKAGAPFPA---RLIAVSLPPE 254 E +R+ ++ +L G+ + + + + + A RLIA + Sbjct: 305 EINDGEEVIRYKVIEYLNEFTAEGQKYQLPEKLIVTYSSRRAEKDRADRERLIAK---AQ 361 Query: 255 KALISKTRLLSENRRKGRVVQAETLEAAGHVL--------------LLTSLPEDEYSAEQ 300 L SK ++ + N+R G+ E +L E E SA Sbjct: 362 NLLESKAKIQASNKRGGKKYLKEIDCTGTWILDEEAIAREEQFDGYYGIQTSEKEMSARD 421 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 + Y W+IE +F+ +KS L + + K LA L + Sbjct: 422 ILAAYHNLWRIEESFRVMKSTLEVRPVFHWTERRIKGHFVICFLAFLLERTL 473 >UniRef50_UPI0000164DB3 hypothetical protein TVN0693 n=1 Tax=Thermoplasma volcanium GSS1 RepID=UPI0000164DB3 Length = 83 Score = 54.8 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 17/62 (27%), Positives = 32/62 (51%), Gaps = 2/62 (3%) Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA--FLIDDIIQPSLD 358 + Y RW I++ F+ +K+ L +D L +++ IF ++A +I D+I S+ Sbjct: 3 IHWIYSQRWNIDIFFRTMKTYLKIDHLISRKINSIMVQIFTAMIAYIVLMIQDMISCSMS 62 Query: 359 FP 360 P Sbjct: 63 IP 64 >UniRef50_Q737L2 IS231-related transposase n=3 Tax=Bacillus cereus group RepID=Q737L2_BACC1 Length = 167 Score = 54.0 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 53/149 (35%), Gaps = 24/149 (16%) Query: 122 PGGGS----AEWRLHMGYDPHTCQFTDFELTDSRDAERL----DRFAQTADEIRIADRGF 173 PG G A ++ + YD H+ +F +F++ ++ ++ +++ I D G+ Sbjct: 10 PGSGGCAQTAGIKIQLEYDLHSGEFLNFQVGPGKNNDKTFGTECLDTLRPEDLCIRDLGY 69 Query: 174 GSRPECIRSLAFGEADYIVRVHWRGLRWL---------------TAEGMRFDMMGFLRGL 218 S E + + YI R+ ++ +E + DM L+ L Sbjct: 70 FS-LEDLDQMDQRGTYYISRLKLNTNVYMKNSNPEYFKNSAIKKQSEYIHIDMKQILKQL 128 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLI 247 T M + N P LI Sbjct: 129 QLYLEVCTPEMNASFINIVYFHPLFFVLI 157 >UniRef50_C5VJA1 Transposase domain protein n=15 Tax=Prevotella RepID=C5VJA1_9BACT Length = 405 Score = 54.0 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 67/235 (28%), Gaps = 46/235 (19%) Query: 131 LHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIRIADRGFGSRPECIRSLAFGEA 188 +H H +LT + + L D DR + + + L Sbjct: 166 VHTVMKYHVGVPMVVQLTSAATHDHYLLKEVHLPKDATLTMDRAYVDYAQ-FQRLTEEGV 224 Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIA 248 Y+ ++ + G + D Sbjct: 225 CYVTKMKKNLTYTELSSVTYVSPDGLVTHTD----------------------------- 255 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 + EK I R V+ + + V+LLT+ E + + + Y+ R Sbjct: 256 KKIVFEKGEIRHQA---------RRVELWSDNSHKSVVLLTN--NLELDVKDLEEIYKRR 304 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL---IDDIIQPSLDFP 360 W IE +K+LK L + + L+A L I +I+ + F Sbjct: 305 WAIESLYKQLKQNFPLHFFYGDSVNAIQIQTWVVLIANLLCTVISRMIKRHVSFS 359 >UniRef50_A6DM44 Putative uncharacterized protein n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM44_9BACT Length = 272 Score = 53.7 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 48/84 (57%), Gaps = 1/84 (1%) Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 T + VL+ T L + ++S ++++ YR R+ +E+A++ LK L+L+++R ++ K Sbjct: 88 TSKLKSQVLITTLLDDAKFSWKELSGLYRQRYLVEVAYRHLKVNLNLESIRKRKFSRIKK 147 Query: 338 WIFANLLAAFLIDDIIQPSLDFPP 361 +++A + L +++ + P Sbjct: 148 FMYAAIALYNLA-AVLRNRIKLPE 170 >UniRef50_B0R9V4 Transposase (TCE33) n=2 Tax=Halobacterium salinarum RepID=B0R9V4_HALS3 Length = 475 Score = 53.7 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 1/60 (1%) Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 + T+L +E E++ YR RW IE F+++K L +K P L Sbjct: 338 SGANEPMDDYTYFYTNLHPEEVPPEELGKAYRRRWGIETDFRKIKRDF-LAKSGSKNPAL 396 >UniRef50_Q8DM76 Tlr0247 protein n=2 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DM76_THEEB Length = 166 Score = 53.7 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 3/88 (3%) Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 N + RVV+ E L T+L S E+V+ YR RW IE +K LK L L Sbjct: 41 KFNDHRYRVVEFYD-ENQREFCLATNL--KHLSDEEVSQLYRHRWAIENLWKFLKMHLSL 97 Query: 325 DALRAKEPELAKAWIFANLLAAFLIDDI 352 D L AK + I+ L+ +++ + Sbjct: 98 DRLIAKSLKGMVNQIYMFLIVYLILELV 125 >UniRef50_C6JAL6 Transposase (Fragment) n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JAL6_9FIRM Length = 237 Score = 52.9 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 60/177 (33%), Gaps = 14/177 (7%) Query: 207 MRFDMMGFLRGLDCGKNGETTVMIGNSGNKKA----GAPFPARLIAVSLPP-----EKAL 257 R + L+ L+ +++I + G L + L +K Sbjct: 39 ERAAALEHLKVLEDMGLYNNSIIIFDRGYYSEDMFRYCVEHGHLCVMRLKEGINLSKKCN 98 Query: 258 ISKTRLLSENRRKG-----RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 +L ++G V L+ L T+L + + + + Y RW +E Sbjct: 99 GDMISILQGTSKEGTSDVPIRVLEIPLDDGTKEYLATNLFDPAVTKDMFRELYFYRWPVE 158 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKK 369 L +K LKS ++ + + N+L + L I + + SA S K Sbjct: 159 LKYKELKSRFAMEEFSGATAVSIQQEFYINMLLSNLASLIKNEADEEIQISAKSTNK 215 >UniRef50_A5WBL3 Transposase, IS4 family n=2 Tax=Bacteria RepID=A5WBL3_PSYWF Length = 427 Score = 52.1 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 49/247 (19%), Positives = 78/247 (31%), Gaps = 23/247 (9%) Query: 130 RLHMGY-DPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEA 188 +H Y D Q ELT E L++ I I DR S + R + Sbjct: 137 GVHSSYQDTPAKQTHLNELTTR--IEYLEQQGFDKPLIHIIDREADSAYQM-RQWDEHDY 193 Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIA 248 +I RV + R + N + G A+++ Sbjct: 194 KFITRVKAGSYLSYEGKSQR------CSQIAGQLNFSYQRQVNYKGKAAKQYIATAKVVL 247 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQA----------ETLEAAGHVLLLTSLPEDEYSA 298 +A I KG+ + + A LL++L E + Sbjct: 248 TRSAKPQA-IDPATGKRIAPIKGKPLSLLLTVSRIYDDQDKRLATWY-LLSNLQEPSVNG 305 Query: 299 EQVADCYRLRWQIELAFKRLKSL-LHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 ++ Y RWQIE FK LKS L L++ + + + A L+ I+Q + Sbjct: 306 ADISQWYYWRWQIESYFKLLKSAGLQLESWLQQSGDAYFKRLLIASQACTLVWRIMQKTD 365 Query: 358 DFPPRSA 364 A Sbjct: 366 KQSKEFA 372 >UniRef50_A6DG92 ISPg4, transposase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG92_9BACT Length = 189 Score = 52.1 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 15/56 (26%), Positives = 26/56 (46%) Query: 295 EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 +++ +A Y+ RW IE+ FK+LK L L + ++ LL L+ Sbjct: 67 QWAPSSIASIYQSRWGIEVFFKQLKQNLKLADFLGHNKNAIQWQVWTALLTYVLLR 122 >UniRef50_C0GNX3 Transposase IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GNX3_9DELT Length = 851 Score = 52.1 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 42/103 (40%), Gaps = 2/103 (1%) Query: 253 PEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 + + SK +L E ++ R V + E + ++ +YS ++ D Y +RW +E Sbjct: 663 EQYRIASKDFILPETKKPFRFVVKQNKETSEIRCFGST--HTDYSPTKILDAYHIRWPVE 720 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 K L L+ PE +A + +LA +D Sbjct: 721 TGIKDLIENYFLNKPTGTSPEKVEAHYYCIMLARLAVDYFRSQ 763 >UniRef50_C8VXW5 Transposase IS4 family protein n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VXW5_DESAS Length = 587 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 18/70 (25%), Positives = 33/70 (47%) Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 G +T+L ++A + YR + ++E AF +K L L + + A + Sbjct: 442 GITCFITNLDVTSHTAIDIIQWYRRKNKVEEAFHEIKDHLDLRPIYLTREQRVMAHVIIC 501 Query: 343 LLAAFLIDDI 352 +LA F+ +DI Sbjct: 502 VLAYFIFNDI 511 >UniRef50_C8XGG2 Transposase IS4 family protein n=2 Tax=Nakamurella multipartita DSM 44233 RepID=C8XGG2_NAKMY Length = 457 Score = 51.7 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 43/241 (17%), Positives = 73/241 (30%), Gaps = 22/241 (9%) Query: 137 PHTCQFTDFELTDSRDAERLDRFAQT--ADEIRI-ADRGFGSRPECIRSLAFGEADYIVR 193 + D E + R Q +I + AD GF A AD Sbjct: 186 LRRGRSNDATAAPLFINETISRLRQAGATGQIVLRADSGFYLHDV---VAACRAADVRFS 242 Query: 194 VHWRGLRWLTAEGMRFDMMGF--LRGLDCGK---NGETTVMIGNSGNKKAGAPFPARLIA 248 + R + L + + + G T ++ + P RL+ Sbjct: 243 IGARMIGHLRGQIEAIPDEQWQPIEYFLPGAGVAEIPYTPFAQDTHGRDRTDTVPLRLMV 302 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 PP +A + ++ QA + ++T D E R Sbjct: 303 RRTPPTQAQVRNRGQDTD--------QAALFPVYDYHPIITDRDGDLRDLEADH---RRH 351 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEK 368 ++EL + LK + L L K AW+ N +A L I + D R + + Sbjct: 352 AEVELTIRDLKHGMGLAHLPTKSFGGNAAWLILNTIAHNLTRWITRLGFDQGHRMTKNIR 411 Query: 369 K 369 + Sbjct: 412 R 412 >UniRef50_Q1PWW4 Putative uncharacterized protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PWW4_9BACT Length = 166 Score = 51.7 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 27/79 (34%), Positives = 39/79 (49%), Gaps = 4/79 (5%) Query: 286 LLLTSLPEDEYS-AEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +LLTSLP D + A V +CY RWQIE+ FK LKS ++ + + E K I ++ Sbjct: 1 MLLTSLPADTFRQACLVVECYLCRWQIEIYFKVLKSGCKIEERQLETAERIKPCIALYMI 60 Query: 345 AA---FLIDDIIQPSLDFP 360 A + + D P Sbjct: 61 VAWRVLFVTMFGRECPDLP 79 >UniRef50_C3RGR4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C3RGR4_9BACE Length = 413 Score = 51.7 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 57/163 (34%), Gaps = 10/163 (6%) Query: 201 WLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK 260 + + D++ F+ + + +G + + A +I L EK++ Sbjct: 181 LVDSWFTCADLIRFITSRHLECHLIGMLKMGKTRYRTEAGNLNAPVIIDRLKKEKSVRYS 240 Query: 261 TRL--------LSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 +L RK R+ + L++ ++ + Y +RW IE Sbjct: 241 RKLNCYYAHMDAEYANRKIRIFFCKRGRKGAWNAFLSTDTRLDF--FEAYRIYSMRWAIE 298 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 + F +K LL L + + A I L+ ++ I + Sbjct: 299 VCFSEMKGLLRLGKCQCRNFSSQIASISLTLMQYNILSHIKRF 341 >UniRef50_C2V5D0 Transposase for insertion sequence element IS231B n=3 Tax=Bacillus cereus group RepID=C2V5D0_BACCE Length = 146 Score = 51.3 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 19/123 (15%), Positives = 39/123 (31%), Gaps = 24/123 (19%) Query: 181 RSLAFGEADYIVRVHWRGLRW---------------LTAEGMRFDMMGFLRGLDCGKNGE 225 + + +A YI R+ + A+ + DM + L G+ E Sbjct: 5 QHIQDKKAYYISRIKSNTRIYQKNPNPDYFQDGRIKKGAKYIHIDMEVLMNSLQPGQTYE 64 Query: 226 -TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 + + PAR+I L ++ + + ++KG A +G Sbjct: 65 ISDAYVRMID------KVPARVIVHRLTKQQQRLHDQTVRE--KKKGMKYSARNKRLSGI 116 Query: 285 VLL 287 + Sbjct: 117 NIY 119 >UniRef50_Q1VPU4 Putative uncharacterized protein n=7 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VPU4_9FLAO Length = 477 Score = 51.3 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 40/116 (34%), Gaps = 10/116 (8%) Query: 235 NKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPED 294 K+ P R A I ++ ++G+ L+T+ Sbjct: 292 KKQKAKPKRCRKFNYYYHHYIAEIDGLKVALFISKRGK--------NGKWHTLITTDTSL 343 Query: 295 EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 ++ + + Y +RW IE+ FK K L L ++ ++ A I + L Sbjct: 344 KF--VKAIEVYSIRWSIEVFFKEAKQLFGLGKCQSTNFDVQIAQITIAMTQYLLTS 397 >UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2NZH2_XANOM Length = 407 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 34/156 (21%), Positives = 60/156 (38%), Gaps = 5/156 (3%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 I + D GF R R+++ D++ R+ RG + + + D + ++ Sbjct: 159 ILVTDAGF--RTPWFRAVSAMGWDWVGRL--RGRTQVKPQDVPDDAVQWIDSRRLHALAS 214 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 P RL+ + + R ++ R ++A E Sbjct: 215 NRARALPPMQANRSDPLDCRLVLYAKTRQGRQQRNRRSSAKVSRASSSLKAAAREREPW- 273 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL 321 L++ S SA+Q+ + Y R QIELAF+ LKS Sbjct: 274 LIVASPQLHAPSAKQLVNLYARRMQIELAFRDLKSH 309 >UniRef50_Q978C6 TVG1544340 protein n=2 Tax=Thermoplasma volcanium RepID=Q978C6_THEVO Length = 107 Score = 51.0 bits (120), Expect = 7e-05, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 29/57 (50%), Gaps = 2/57 (3%) Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA--FLIDDIIQP 355 + Y RW I++ F+ +K+ L +D L +++ IF ++A +I D++ Sbjct: 20 IFTIYSQRWNIDIFFRTMKTYLKIDHLISRKINSIMVQIFTAMIAYIVLMIQDMLSC 76 >UniRef50_Q82R33 Putative IS4 family ISFsp6-like transposase n=1 Tax=Streptomyces avermitilis RepID=Q82R33_STRAW Length = 333 Score = 51.0 bits (120), Expect = 7e-05, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 34/82 (41%), Gaps = 1/82 (1%) Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA-LRAKEPELAKAWIFA 341 + L+ T Y A + Y RW+ E A+ L+ + LR+ +P + ++A Sbjct: 100 SYRLVTTLTDARRYPAPALVALYHQRWEHESAYFALRHTITDGRVLRSGDPVGVEQEMWA 159 Query: 342 NLLAAFLIDDIIQPSLDFPPRS 363 L + ++ + + P + Sbjct: 160 LLALYQALRTVMVEAAESRPGT 181 >UniRef50_C3BDU8 Transposase for insertion sequence element IS231B n=1 Tax=Bacillus mycoides Rock3-17 RepID=C3BDU8_BACMY Length = 113 Score = 50.6 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 33/85 (38%), Gaps = 7/85 (8%) Query: 209 FDMMGFLRGLDCGKNGET-TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSEN 267 FD+ + L G+ E IG P R++ L E+ + Sbjct: 14 FDLEDLMLQLQLGQTHEIHKAYIG------LYQKLPTRVVLHRLTEEQTRKRWENQALKE 67 Query: 268 RRKGRVVQAETLEAAGHVLLLTSLP 292 ++KG V++ + + + +++LP Sbjct: 68 KKKGIVMKERSKRLSAMNVYISNLP 92 >UniRef50_C0WV66 Transposase IS4 family protein n=5 Tax=Lactobacillus RepID=C0WV66_LACFE Length = 450 Score = 50.6 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 2/67 (2%) Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 +++L T+ + ++ Y RWQIE FK K L D + ++ + + Sbjct: 309 NYLVLATT--KTSLRPNEIIQLYGRRWQIETYFKAAKQYLRFDQTQVQKYDGLCGHLAMV 366 Query: 343 LLAAFLI 349 ++ L+ Sbjct: 367 MMTYDLL 373 >UniRef50_Q5GUK2 ISxac1 transposase n=1 Tax=Xanthomonas oryzae pv. oryzae RepID=Q5GUK2_XANOR Length = 361 Score = 50.6 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 24/93 (25%), Positives = 40/93 (43%), Gaps = 1/93 (1%) Query: 229 MIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLL 288 ++ + P RL+ + P+ R ++ R ++A E L++ Sbjct: 171 LVARTMQANRSDPRDCRLVLYAKTPQGRQQRNRRSPAKVSRASSSLKAAAREREPW-LIV 229 Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL 321 S SA+Q+ + Y R QIELAF+ LKS Sbjct: 230 ASPQLHAPSAKQLVNLYARRMQIELAFRNLKSH 262 >UniRef50_Q9R3J0 Transposase, putative n=10 Tax=Deinococcus radiodurans RepID=Q9R3J0_DEIRA Length = 416 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 60/182 (32%), Gaps = 20/182 (10%) Query: 172 GFGSRPECIRSLAFGEADYIVRVHWRGLR---WLTAEGMRFDMMGFLRGLDCGKNGETTV 228 G ++ + ++ +I R + R G + + Sbjct: 191 GNYAKESMVETVTGHGLPFISRFPRNANLKYLYTGEHPRRRGRPKKFDGKVDFSDLQRFD 250 Query: 229 MIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLL 288 ++ + ++ + + A + ++ + +KG+V G+ +L Sbjct: 251 LVSETSTERVWTQVVWSV-------QWAREVRAVVIQQVGKKGQV--------TGYAVLF 295 Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 ++ A +V YR R++IEL F+ K L ++ + +A LL L Sbjct: 296 ST--AVTMPAHEVIALYRSRFEIELIFRDAKQFLGGQDVQLRSQPGIEAHWNVVLLTLNL 353 Query: 349 ID 350 Sbjct: 354 CR 355 >UniRef50_A0LAZ1 Transposase, IS4 family protein n=4 Tax=Magnetococcus sp. MC-1 RepID=A0LAZ1_MAGSM Length = 563 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 1/85 (1%) Query: 268 RRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDAL 327 RK V+ ET G ++ T+LPE E S+ Y+ Q+E AF+ LKS L + + Sbjct: 384 SRKQAVIDQETA-LDGIYVVRTNLPEKEISSADTIRQYKSLAQVESAFRDLKSSLDIRPI 442 Query: 328 RAKEPELAKAWIFANLLAAFLIDDI 352 + KA +F +LA + ++ Sbjct: 443 FHFRADRIKAHVFLCMLAYMVEREM 467 >UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia sp. EAN1pec RepID=A8L1S1_FRASN Length = 425 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 49/200 (24%), Positives = 76/200 (38%), Gaps = 21/200 (10%) Query: 148 TDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGM 207 R R DE+ ADRGF S +LA G ++ GL Sbjct: 202 AGERTLARGLLMRLNRDEVLTADRGFYSFDNW--ALAAGTGADLIWRAPTGLN------- 252 Query: 208 RFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSEN 267 + +R L G TV+I + G RL+A + ++ + L Sbjct: 253 ----LPVVRVLSDGTF--LTVLINP---EITGGRRRERLLAAAKAGDELDPDEAHLARVV 303 Query: 268 RRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH--LD 325 +A V+L T L + A++VA Y RW+ E A +LK+ L Sbjct: 304 EYD-IPDRAGNGTGELVVVLTTILDPRQARADEVAAGYNERWEEETANDQLKTHLRGPGR 362 Query: 326 ALRAKEPELAKAWIFANLLA 345 LR++ P+LA ++A L+ Sbjct: 363 VLRSRLPDLAVQEMWAWLIV 382 >UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Raphidiopsis brookii D9 RepID=UPI0001C16028 Length = 465 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 82/301 (27%), Gaps = 22/301 (7%) Query: 64 AQLHDVATLSDVALLKRLRNAADWFGILA--AQTLAVRAAVTGCTSG-KRLRLVDGTAIS 120 + S + R R A L G G R+ VDGT Sbjct: 87 GLRLQTPSASSIT-EARQRTGAAVMRRLFELVAKPLATILTPGAFLGELRIMAVDGTVFD 145 Query: 121 APGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECI 180 P + + F RL + + I + R Sbjct: 146 VPDTSTNARVFGYP-GSPKGTYPGFPKV------RLVFLVEAGTHLIIDAFCYPYRMGER 198 Query: 181 RSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA 240 R G + ++ L F M+ + G GN + Sbjct: 199 R----GALKLLRSINSSMLLMWDRGLHSFKMVHTVIKQQGNFLGRVP---GNVKFQVVKT 251 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 ++ P ++ K R V E + L+ + D++ A Sbjct: 252 LADGSYLSWIAPDGQS--RKKGAKRMEVRIIEYVIEEDGTLKTYRLITNLMDVDKFPALL 309 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 +A Y RW+ E LK L +R+K P ++ LLA + + ++ S Sbjct: 310 LAQEYHKRWEAENTLDELKVHLLARKIPIRSKNPREVVQELYGWLLAHYCLRCLMFQSAT 369 Query: 359 F 359 Sbjct: 370 L 370 >UniRef50_Q1Q2K2 Putative uncharacterized protein n=5 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q2K2_9BACT Length = 457 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 57/188 (30%), Gaps = 14/188 (7%) Query: 173 FGSRPECIRSLAFGEADYIVRVHWR---GLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 + + + + + + + + + L L Sbjct: 226 WYASQRFLEHIHAKKKHFFSEIKSNRNISMYHPEKQKYCIIKPDELVTLIKKHYAGKIKY 285 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + + + L + +L + ++ + + +L+T Sbjct: 286 VTLKSADGSEVSYKTYTFDAKLNGCNVPLKFVVILGKWNKE---------DDKKYHVLIT 336 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 + + + S + V Y LRW IE FK LK + D + + + + + L++ + Sbjct: 337 N--QLDASVKTVITNYLLRWGIEHCFKELKDTFYFDHYQVRHIDKIERYWNICLISWTFV 394 Query: 350 DDIIQPSL 357 I Q + Sbjct: 395 YWIKQNAY 402 >UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE Length = 402 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 78/247 (31%), Gaps = 27/247 (10%) Query: 105 CTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTAD 164 KRL ++ + ++ G + P + Q + + D A+ Sbjct: 101 IREQKRLMVLRAS-VALHGRSVTLYEKAF---PLSEQCSK-KAHDQFLADLASILPSNTT 155 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRV--HWRGLRWLTAEGMRFDMMGFLRGLDCGK 222 + ++D GF + +S+ ++ RV + + + Sbjct: 156 PLIVSDAGF--KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKT 213 Query: 223 NGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA 282 +G K+ P +++ + + R + A+ Sbjct: 214 -------LGYKRLTKS-NPISCQILLYK-----SRSKGRKNQRSTRTHCHHPSPKIYSAS 260 Query: 283 GHV--LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELAKA 337 +L T+LP + + +Q+ + Y R QIE F+ LKS L L R E Sbjct: 261 AKEPWVLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI 320 Query: 338 WIFANLL 344 + L+ Sbjct: 321 MLLIALM 327 >UniRef50_Q647P2 Transposase n=1 Tax=uncultured archaeon GZfos9E5 RepID=Q647P2_9ARCH Length = 398 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 43/124 (34%), Gaps = 8/124 (6%) Query: 201 WLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK 260 +L ++ L+ L + IG + ++ + + Sbjct: 219 YLDRGFYAVPIVRMLKRLSVHFIIQAQKSIGIKKVIEENKDKEVIVVDYKM-------KR 271 Query: 261 TRLLSENRRKGRV-VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK 319 R + R+ + L+ V +T+L +E +A+ A YR RW IE +++ K Sbjct: 272 KRKAPSGKEDVRLFIVPHRLKKDKRVCFVTNLDVNEENAKDYAGNYRKRWGIETSYRVKK 331 Query: 320 SLLH 323 Sbjct: 332 DAFR 335 >UniRef50_Q73GX5 Conserved domain protein n=5 Tax=Wolbachia RepID=Q73GX5_WOLPM Length = 158 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 29/64 (45%) Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +L + + E ++ + Y LRW +E F +LK L L+ K E K ++ + Sbjct: 1 MLATSLMDEQKFQTGGFKELYFLRWGVETFFAKLKGRLSLENFTGKSVESVKQDFWSAIF 60 Query: 345 AAFL 348 + L Sbjct: 61 ISNL 64 >UniRef50_D1VZM3 Transposase, IS4 family n=3 Tax=Prevotella RepID=D1VZM3_9BACT Length = 511 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 71/228 (31%), Gaps = 18/228 (7%) Query: 133 MGYDPHTC-QFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYI 191 + Y Q+ F + D + RF D + +AD G ++ G + Sbjct: 223 LSYSLFNGSQYEGFTMIPMID-DFKQRFTLGDDFVIVADSGLMNKNNVALLQNAGYKYIL 281 Query: 192 VRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV-- 249 + + D +NGE ++ + K A R IA Sbjct: 282 GARIRNERNNIRQWILSLDKKDNASYEMYRQNGERLIVCYSERRAKKDAYNRTRGIARLR 341 Query: 250 ------SLPPEKALIS-KTRLLSENRRKGRVVQAETLE----AAGHVLLLTSLPEDEYSA 298 + ++ + L ++ + E +E G +T+ E A Sbjct: 342 KAYKSGRITKQQVNKRGYNKFLEISKDIEVTISQEKIEEDCKWDGWKGYITNT---ELDA 398 Query: 299 EQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 E+V Y W +E +F+ K L + + +A + +A Sbjct: 399 ERVIAQYHGLWVVERSFRISKGTLEMRPMFHFTERRIEAHVCICFIAY 446 >UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0I6N0_VIBHO Length = 345 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 60/344 (17%), Positives = 104/344 (30%), Gaps = 44/344 (12%) Query: 36 EIRDAATLLRLGLAYGPGGMSL----REVTAWAQLHDVATLSDVAL-LKRLRNAADWFGI 90 + R + LL A G ++L R + + D L RL + Sbjct: 20 KKRLQSLLLATESALGGADLTLTKLGRSLNTFTAAKHAIKRVDRLLGNTRLHREKEDIYK 79 Query: 91 LAAQTLAVRAAVT-------GCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFT 143 A+ +A R + + I+ G + Y Q+ Sbjct: 80 WNARLIAGANPCPVILLDWSDVREQLRFMTLRAS-IALDGRAVTLYEQAFEY----AQYN 134 Query: 144 DFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLT 203 + + + +A I I+D GF R R + ++ RV Sbjct: 135 SPKTHQYFLGKLQEILPPSATPIIISDAGF--RNTWFRQVQSKGWFWLGRV--------- 183 Query: 204 AEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLP----PEKALIS 259 R D+ + + ++ + K + +L A P Sbjct: 184 ----RGDVSIKMTQ----SDWQSNKTLYPDATSKPHSLGQCQL-ARRSPLTCNGYVVKQQ 234 Query: 260 KTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK 319 K + S +K + A LL+T++P + +A Q+ Y R QIE AF+ LK Sbjct: 235 KAQRHSRTGQKHTASRLFAKNANEPWLLVTNIPTETLNAVQICRLYAKRMQIEEAFRDLK 294 Query: 320 SL---LHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 S L L R + N + L + +I Sbjct: 295 STAYGLALRHNRTHHNRRLLSESANNFCLSGLSEILITQLSSHS 338 >UniRef50_C6I0E1 Transposase, IS4 family protein n=4 Tax=Leptospirillum ferrodiazotrophum RepID=C6I0E1_9BACT Length = 650 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 47/122 (38%), Gaps = 3/122 (2%) Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 + P R+ ++ R++ K + Q +V+ T Sbjct: 454 HYTVAPVYETTAPPRISRGKKKKASPSLASPRIVDLAWEKSPLRQVRKTLTGAYVIETTH 513 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 E SA + Y Q+E AF+ LKS L + + + + +A +F ++LA FL+ Sbjct: 514 T---ELSASGIWSLYTTLTQVEGAFRALKSDLGVRPVFHQTADRTRAHLFVSVLAYFLLS 570 Query: 351 DI 352 I Sbjct: 571 HI 572 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 49/258 (18%), Positives = 75/258 (29%), Gaps = 27/258 (10%) Query: 102 VTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQ 161 T+G+R+ VDG + G + L D HT D + E L RF Sbjct: 128 QPAATTGRRVYSVDGKTLRGSGPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNE-LTRFQP 186 Query: 162 TADEIRI------ADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 + + AD R + +A Y+ V R + L Sbjct: 187 LLGPLDLTAVVVTADALHTQREHARWLVDTKKAAYVFTVKKNQPRLYRQ-------LKTL 239 Query: 216 RGLDCGKNGETTVM-IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV 274 ET+ G ++ A +A+ P + R N GR Sbjct: 240 PWTKIPIQDETSTRGHGRYDIRRLQAVTCTGPLALDFP-HAVQALRIRRRRLNLATGRWS 298 Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELA--FKRLKSLLHLDALRAKEP 332 V +T+L + ++AD R W IE + LR Sbjct: 299 TVT-------VYAITNLSAAQAGPAELADWLRGHWAIETLHHIRDTTYAEDASRLRTGNA 351 Query: 333 ELAKAWIFANLLAAFLID 350 A A + A L+ Sbjct: 352 PRAMATL--RNTAINLLR 367 >UniRef50_B1IL33 Putative uncharacterized protein n=1 Tax=Clostridium botulinum B1 str. Okra RepID=B1IL33_CLOBK Length = 108 Score = 48.7 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 33/68 (48%) Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 + LL+T++ +E SA ++ Y R IE FK K++ + LR ++ +++ Sbjct: 22 KYSLLITNINLNEMSAVELFHFYNERQTIEAFFKMAKNIYQIKNLRTRKFLGIYGFLWLV 81 Query: 343 LLAAFLID 350 + LI Sbjct: 82 FITHNLIS 89 >UniRef50_A3H586 Putative uncharacterized protein (Fragment) n=2 Tax=Proteobacteria RepID=A3H586_VIBCH Length = 245 Score = 48.7 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 65/174 (37%), Gaps = 30/174 (17%) Query: 175 SRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSG 234 P ++ + YI+ + R D ++ L ++GN G Sbjct: 31 REPLKVKQVQIDGCRYIICQNPR-----QQRKDAADREAIVKALTEKLKKGPKSLVGNKG 85 Query: 235 NKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPED 294 +K + + + A I + R+ E R G+ V L T+ Sbjct: 86 FRKY----------LKVEKDSARIDEKRVTYEARFDGKWV------------LQTNTD-- 121 Query: 295 EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 S E+VA Y+ W++E F+ +KSLL + ++ + + +F + LA L Sbjct: 122 -LSPEKVALKYKELWRVERVFRDVKSLLDTRPIFHQKDQTIRGHVFCSFLALVL 174 >UniRef50_D1JFQ9 Putative uncharacterized protein n=1 Tax=uncultured archaeon RepID=D1JFQ9_9ARCH Length = 483 Score = 48.7 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 70/197 (35%), Gaps = 17/197 (8%) Query: 163 ADEIRIADRGFGSRPECIRSLAFG--EADYIVRVHWRG-----LRWLTAEGMRFDMMGFL 215 + + I D+GF S+ + + E YI+ + + FD FL Sbjct: 242 KNAVLITDKGFYSKTNILALVKEKKDELHYIIPLKRDSSLIDYTKIRQGNRKSFDGY-FL 300 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 + + G+ KK RL EK +S+ Sbjct: 301 FEKRAIWYYKYELEDGDLKGKKVIVFLDERL---RAEEEKDYLSRLEKNDTATLDNF--- 354 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 + G + ++T L + S E++ + + R +IE+ F K++L+ D ++ Sbjct: 355 FKIQHRMGTIAVITDLDK---SGERIYNLLKSRVEIEIMFDAFKNVLNADRTYMRDDYQM 411 Query: 336 KAWIFANLLAAFLIDDI 352 + W+F N +A + Sbjct: 412 EGWMFINFIALVFYYRL 428 >UniRef50_A3H523 Transposase (IS4 family) protein (Fragment) n=1 Tax=Vibrio cholerae B33 RepID=A3H523_VIBCH Length = 371 Score = 47.9 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 31/68 (45%) Query: 295 EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 + S EQ+ + Y RW+IE FK +K + + + + I +++AA +I Sbjct: 252 KLSVEQIIEYYGARWKIESGFKEIKQDIGSSKSQTRNAQAVINHINFSIMAATIIWIYGS 311 Query: 355 PSLDFPPR 362 + P R Sbjct: 312 RLENIPER 319 >UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC Length = 412 Score = 47.9 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 52/259 (20%), Positives = 77/259 (29%), Gaps = 60/259 (23%) Query: 107 SGKRLRLVDGTAISAP---------------GGGSAEWRLHMGY--DPHTCQFTDFELTD 149 G RL +DG+ P GG + ++ + T Sbjct: 144 HGLRLVQIDGSTCDLPDTQANRAFFPGPSNAGGPAPFPKVRWVIAAEAATGALLGASFGP 203 Query: 150 SRDAE-RLDR---FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE 205 E L R + +ADR F S LA A + R + A Sbjct: 204 WSTGEPALARDLLGQLGPGMLTLADRNFLSHRLAGEVLAT-GAHLLWRAK---ATFTLAP 259 Query: 206 GMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLS 265 D +L L + E G P R+I E + S T Sbjct: 260 VHVLDDGSYLAELTPPRGSE-------------GPPLTMRVI------EYTVHSTTAGGD 300 Query: 266 ENRRKGRVVQAETLEAAGHVLLLT-SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 E+ + L+T L +E+S +A Y RW E K+ L Sbjct: 301 ESSSEL-------------FCLVTDLLDPEEWSMLDLARAYPTRWGCETVIGHHKTDLGE 347 Query: 325 DA--LRAKEPELAKAWIFA 341 LR+K+PE ++A Sbjct: 348 GRPVLRSKDPEGVAQEMWA 366 >UniRef50_Q6LRT4 Similar to transposase n=38 Tax=Photobacterium profundum RepID=Q6LRT4_PHOPR Length = 426 Score = 47.5 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 52/175 (29%), Gaps = 20/175 (11%) Query: 186 GEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPAR 245 G ++ R + +++ R + G +I G + Sbjct: 201 GSVHWLTRTKKGSTFRHEDQFKTAEIIS--RTISPDLKG----VISLRGKEGYLFVGETT 254 Query: 246 LIAVSLPPEKALISKTRLLSENRRKGRVVQA-ETLEAAGHVLLLTSLPEDEYSAEQVADC 304 + + A R +V E E A LL L A ++A Sbjct: 255 VELHRKSEKLAS-----AAPTCRFVMSLVTDDEGKELARWYLLSNVLD---VDATEIATW 306 Query: 305 YRLRWQIELAFKRLKSLLH-LDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 Y RW IE FK LKS H L+ + E L+ A + +I Sbjct: 307 YCHRWNIESWFKLLKSDGHQLEKWQQTTAESILK----RLITASVATTLIFKLYS 357 >UniRef50_B4B8T5 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4B8T5_9CHRO Length = 294 Score = 47.5 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 55/233 (23%), Positives = 93/233 (39%), Gaps = 42/233 (18%) Query: 140 CQFTDFELTDSRDAER--LDRFAQTA--DEIRIADRGFGSRPECIRSLAFGEADYIVRVH 195 F + D ER L +T D++ IADR F + + +A + +++R H Sbjct: 2 LAIDVFPIEDGHAQERSLLKEVLKTVEEDDVWIADRNFCT-LSFLSGIAAQKGFFLIRQH 60 Query: 196 WRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEK 255 + L W G +F +G + G GK E T+ + + Sbjct: 61 -QCLPW--HNGEKFHEVGLIEG---GKVFEQTITVSDDD--------------------- 93 Query: 256 ALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAF 315 LS+ R+ V++ T + + ++T+LP E SA VA YR RW +E F Sbjct: 94 ------GYLSKIRQVKIVLEQATRDGDKEIFIVTNLPVTEASAIVVAQLYRKRWTLETLF 147 Query: 316 KRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEK 368 + L + + + K KA +FA A + +I+ L G++K Sbjct: 148 QILTVIFNCEI---KTLGYPKAALFA-FCVALVSYNILAVVLAALKSVHGTQK 196 >UniRef50_Q6MS13 Transposase IS1634BQ n=39 Tax=Mycoplasma RepID=Q6MS13_MYCMS Length = 557 Score = 47.5 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 24/96 (25%), Positives = 39/96 (40%), Gaps = 5/96 (5%) Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 E + G+ + T+ + S ++V + Y +WQIE FK LK L L + Sbjct: 414 EDQKYDGYYVYETN--RTDLSVKEVINLYSKQWQIESNFKTLKGKLSLRPMYLSTWNHIV 471 Query: 337 ---AWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKK 369 F +L+ I I+ L +S +E K Sbjct: 472 GYICLCFISLVFLNYIIYILNSKLGLTGKSKITEHK 507 >UniRef50_B2AJ60 Transposase, IS4 family n=4 Tax=Proteobacteria RepID=B2AJ60_CUPTR Length = 412 Score = 47.5 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 1/74 (1%) Query: 276 AETLEAAGHVLLLTSL-PEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 + A +L+T+L Y A D Y RW++E AFKRLK + L+ L Sbjct: 256 RQISPAGKVRVLITNLLDMHHYPAATFRDLYHQRWRLEEAFKRLKHRMALEHLSGLSQLA 315 Query: 335 AKAWIFANLLAAFL 348 A+ A +L L Sbjct: 316 ARQDFGAKILCDNL 329 >UniRef50_Q1VRR5 Putative uncharacterized protein n=8 Tax=Bacteroidetes RepID=Q1VRR5_9FLAO Length = 372 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 51/157 (32%), Gaps = 17/157 (10%) Query: 203 TAEGMRFDMMGFLRGLDCGKNG----ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALI 258 E + D + FL + V + + + + R E Sbjct: 174 DREFVGKDWLAFLNRNEIRYYIRIRNNFKVFLPHKNKEIKASHLFNRF----KTNEFVYY 229 Query: 259 SK--TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFK 316 K G + + L+ +++ + PE+ Y+ RWQIE+ FK Sbjct: 230 HKIVRVNGELCYLSGCKLNPKNLKQEFLIIVSFNKPEN------AQQDYQKRWQIEMCFK 283 Query: 317 RLKSL-LHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 +KS ++ ++ + + I ++A I Sbjct: 284 AMKSSGFDIEKTHLQDIQRIEKLILLVMIAFVWCYKI 320 >UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 Tax=Streptomyces rishiriensis RepID=Q7BLZ8_9ACTO Length = 341 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 53/282 (18%), Positives = 86/282 (30%), Gaps = 64/282 (22%) Query: 106 TSGKRLRLVDGTAISAPGGGS-------------------AEWRLHMGYDPHTCQFTDFE 146 G RL VDGT P + + RL + T E Sbjct: 1 YRGWRLVAVDGTTFDVPDTEANAAFFGRPGVSRGQEKSAYPQVRLAALAECGTHAVFAAE 60 Query: 147 LTD--SRDAERLDRF--AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL 202 + E R + T + +ADRGF + R+ A AD + RV Sbjct: 61 AGPLAVHETELAQRLFGSLTPGMLLLADRGFRG-FDLWRAAAATGADLLWRVKNDA---- 115 Query: 203 TAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTR 262 ++ L+ G + ++ + P R+I +L Sbjct: 116 --------VLPVRTLLEDGSY--LSEIVAARDKNRRADPARVRVIEYTL----------- 154 Query: 263 LLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL 322 + L+ T L A +A RW+IE +K+ L Sbjct: 155 -------------GRDGSDTVYRLITTILDPKAAPAASLAALAAQRWEIESTLDEIKTHL 201 Query: 323 HLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPR 362 LR++ P A+ IFA LL + D++ + + Sbjct: 202 GGPRLVLRSQHPRGAEQEIFAFLLVHHALRDLMHQAAHQSEQ 243 >UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7AA71_THEAQ Length = 393 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 53/214 (24%), Positives = 81/214 (37%), Gaps = 38/214 (17%) Query: 146 ELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE 205 E+ + +A R + +ADRGF R + LA GE +++VRV+ R L Sbjct: 149 EVEGAIEAARERLGGVGRRLVYVADRGFDDRKVFGQVLALGE-EFVVRVYRD--RKLGEG 205 Query: 206 GMRFDMMGFLRGLDCGKNGETTV-------MIGNSGNKKAGAPFPARLIAVSLPPEKALI 258 G + L L CG+ E V + + L+ +P Sbjct: 206 GSLAKVASSLA-LPCGEEVELRVGGRYQRVRLHFGWREVEVEGRRLHLVVCRVP------ 258 Query: 259 SKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLP-EDEYSAEQVADCYRLRWQIELAFKR 317 R+G LLTSLP A QV + YR RW++E F+ Sbjct: 259 -------ALGRRGEWW------------LLTSLPVRGREEAAQVVEAYRRRWEVERFFRL 299 Query: 318 LKSLLHLDALRAKEPELAKAWIFANL-LAAFLID 350 LK+ L L+ + + + + L LA FL + Sbjct: 300 LKTGLGLETFQVRGLARIRKVVAVLLGLAVFLWE 333 >UniRef50_A4A0C6 Probable transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A4A0C6_9PLAN Length = 442 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 28/74 (37%), Gaps = 1/74 (1%) Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 ++ ++ + L S D S + + YR RW +E+ F+ +K LR P Sbjct: 268 KLRLIKVDTGKETIYLVSSELD-MSDQAACELYRQRWGVEVFFRTVKQSCQRSKLRCCTP 326 Query: 333 ELAKAWIFANLLAA 346 I L+ Sbjct: 327 RNLLTEIHWTLIGV 340 >UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobacteria RepID=Q15UH5_PSEA6 Length = 420 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 68/192 (35%), Gaps = 21/192 (10%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 + + D G+ + R + ++ RV + F + Sbjct: 175 LIVTDAGYRNPW--FREVEKHGWFWLGRVRGDVGFKRDGQASWQSNKSFYPSANSRAKYL 232 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 +G P A L +K R + + + GR A+ AG Sbjct: 233 GCGQLGRKS------PLHAHLHLYK------AKAKHRKDNRSSKAGRNHTAQQSYRAGSK 280 Query: 286 ---LLLTSLPE-DEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELAKAW 338 LL T+LPE D+ +++Q+ Y R QIE F+ +KS + L ++ + Sbjct: 281 EPWLLATNLPENDKLNSKQLVSLYARRMQIEETFRDIKSPQYGMGLRHSNSRCTKRFDIL 340 Query: 339 IFANLLAAFLID 350 + +LA +L+ Sbjct: 341 LLIAMLAEWLLR 352 >UniRef50_B7JAV1 Transposase, putative n=3 Tax=Acidithiobacillus ferrooxidans RepID=B7JAV1_ACIF2 Length = 336 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 31/71 (43%) Query: 293 EDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 E + +A ++ Y RWQIE F LK + L + + + W+ +A ++ + Sbjct: 243 ETDLAASEIVALYSNRWQIEPLFHNLKRWWGIHNLWQQRRAVLERWVQIRCIAWSMVQIL 302 Query: 353 IQPSLDFPPRS 363 + + P + Sbjct: 303 AETVAEDFPMT 313 >UniRef50_C7TBQ5 Transposase n=4 Tax=Lactobacillus rhamnosus RepID=C7TBQ5_LACRG Length = 374 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 53/179 (29%), Gaps = 30/179 (16%) Query: 173 FGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG--ETTVMI 230 + + P+ L I + + G D+ L K + + Sbjct: 147 WFAYPKMFHELLKRGITGIGMIKQTEKVYFRYRGREMDVKRLYATLKQSKRLIHQHYLYS 206 Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 G +L+ V +KG + L T+ Sbjct: 207 PIVQYDMDGTKMAMKLVFV------------------TKKGAKGRFLVLATTK-----TN 243 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 L E++ Y RWQIE FK K L DA + + + A + +++ L+ Sbjct: 244 L-----RPERIIQMYGRRWQIEGYFKVAKQYLRFDATQVRGYDGLCAHMAMVMMSYDLL 297 >UniRef50_Q7NBK2 Predicted transposase n=10 Tax=Mycoplasma RepID=Q7NBK2_MYCGA Length = 348 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 90/239 (37%), Gaps = 41/239 (17%) Query: 156 LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDM--MG 213 + + + + I +AD+G S+ +R L YIV + + L E F + G Sbjct: 52 MQKIYKIKNTIIVADKG-ISQNANLRYLEQKGYKYIV---QKRIDILGKEDKSFIVNEQG 107 Query: 214 FLRGLDCGKNGET--TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKT---------- 261 F++ + +V N K+ F + + S + K Sbjct: 108 FVQENEYFTKSRFVQSVWAKNKNKKRYSNTFRKQFVYFSPSKQTLDKIKRQNLINKLEKK 167 Query: 262 ---------RLLSENRRK-----GRVVQAETLE-------AAGHVLLLTSLPEDEYSAEQ 300 L+ E ++K G+ V +E G ++ T++ ++++ Sbjct: 168 SINGELPLSALVPEYKKKYMDVDGKTVGRLNIEKIKKVANEDGFYMIETNITN--INSKE 225 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 + Y+ +W++E +F+ LKS + + + + E ++ +F L+ ++ I F Sbjct: 226 ANEIYKRQWKVEESFRTLKSAIEVRPMYVYKDEHIQSHVFLCFLSLIVLKYCIYKLKKF 284 >UniRef50_A9F243 Transposase, IS4 family n=4 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F243_SORC5 Length = 461 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 36/157 (22%), Positives = 57/157 (36%), Gaps = 14/157 (8%) Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGN-- 235 L +++R+ L A G + L + E + +GN Sbjct: 187 GLFAQLLSAGHRFVIRLAHNRLVEADALGAEAKLEQALAHVQAVAVREVELSPRPAGNRS 246 Query: 236 ---KKAGAPFPARLIAVSLPPEKALISKTRLLSE---NRRKGRVVQAETLE-----AAGH 284 K+ P RL ++L + + + R RVV+ +E A Sbjct: 247 PQQKRLHPPRAGRLAKLALGSTRVTLRRPRSQPRELPATLSLRVVRVWEIEPPPGEAPVE 306 Query: 285 VLLLTSLPEDEYS-AEQVADCYRLRWQIELAFKRLKS 320 +LLTS P + Q+ D YR RW +E FK LK+ Sbjct: 307 WVLLTSEPVESVEQLTQLVDWYRARWMVEELFKALKT 343 >UniRef50_Q4A8Q4 ISMHp1 transposase n=19 Tax=Mycoplasma RepID=Q4A8Q4_MYCH7 Length = 552 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 73/222 (32%), Gaps = 36/222 (16%) Query: 163 ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGK 222 + IADRG S IR L E ++I+ + + D ++ L+ Sbjct: 265 KNMTIIADRG-MSTAANIRFLESKEYNFIISYRAKIGSQKFKNYL-LDPSDYV-DLNTDF 321 Query: 223 NGETTVMIGNSGNKKAGAPFPARLIAV---------SLPPEKALISKTRLLS-------- 265 + + NK+ R+I E+ + Sbjct: 322 KYKKEEFYSSYKNKRYTENIRRRIITYSKKRAIKDSKAREEQIQSFIKKQNKDGFIEVNK 381 Query: 266 ---ENRRKGRVVQA-----------ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQI 311 + + R + + + G+ + T++ + + + Y+ +W I Sbjct: 382 LFGKKPKYFREISNMKFELDQSKIDKDKQFDGYYVYETNILN--LNVLDIVEKYQKQWNI 439 Query: 312 ELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 E F+ LK LL++ + + E A F ++ ++ II Sbjct: 440 EANFRSLKGLLNIRPVFLRIDEHILAHTFLCFISLVILKTII 481 >UniRef50_B5CN98 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=B5CN98_9FIRM Length = 582 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 34/83 (40%), Gaps = 6/83 (7%) Query: 281 AAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI- 339 G ++T+L D ++ + RW+IE F+ +K+ + + E KA Sbjct: 437 YDGFYAVITNLEGD---VSEIIRINKQRWEIEENFRIMKTEFEARPVYVRREERIKAHFM 493 Query: 340 --FANLLAAFLIDDIIQPSLDFP 360 + +LL L++ + + Sbjct: 494 TCYISLLLYRLLEKKLGDAYTVS 516 >UniRef50_C9LZT7 Putative uncharacterized protein n=1 Tax=Lactobacillus helveticus DSM 20075 RepID=C9LZT7_LACHE Length = 318 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 23/122 (18%), Positives = 49/122 (40%), Gaps = 18/122 (14%) Query: 253 PEKALISKTRLLSENRRKGRVVQAETLEAA----------------GHVLLLTSLPEDEY 296 ++++ + R ++ RK +V A + +++L T+ + + Sbjct: 134 DQRSIAGRRRTQAQRPRKFIIVTAAAFMMSEACMNVWLPPKRGSKGKYLVLATT--QYKL 191 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 +++ Y RWQIE FK K L L+ + + + +I L L+ + S Sbjct: 192 HPQEIIQLYGRRWQIETYFKAAKQYLALNKSQIRSYDGQCGYIAVTALTYDLLAWQERQS 251 Query: 357 LD 358 +D Sbjct: 252 ID 253 >UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID=A3D336_SHEB5 Length = 460 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 52/188 (27%), Gaps = 18/188 (9%) Query: 153 AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMM 212 E + + D GF + R + ++ RV + G +F + Sbjct: 145 NELRKVLPDNITPLIVTDAGFRNPW--FRKVEQLGWYWLGRVRGLSVYRPHPFGRQFSLK 202 Query: 213 GFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR 272 V + P ++ P + + Sbjct: 203 ALYPQARRRAKHVGRVALSVKK------PLLCEMVLFRAPSKG-----RKGQRSTTTDCH 251 Query: 273 VVQAETLEAAGHV--LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDAL 327 T E L+T+L S +++ + Y+ R Q+E F+ LKS L Sbjct: 252 HTAQWTYELTAKEPWALVTNLTMKAMSPQKLVNIYQKRMQMEETFRDLKSPAYGFGLRHS 311 Query: 328 RAKEPELA 335 R + Sbjct: 312 RTRYAARM 319 >UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7B Length = 481 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 50/152 (32%), Gaps = 12/152 (7%) Query: 189 DYIVRVHWRGLRWLTAEGMRF--DMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARL 246 D + RVH RG G R F + + + P P R Sbjct: 210 DLMPRVHARGGPAKLVVGDRLFCASKHFAEFTKDNGH-----FVVRYARTLSFEPDPKRP 264 Query: 247 IAVSLPPEKALISKT-----RLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQV 301 + P + + + + + RR R + +L L Y A + Sbjct: 265 AVTTADPSQRAVVEEWGWAGKPKDKLRRYVRRITVARPVGEAITILTDLLDSAPYPATDL 324 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 D YR+RW IE F+++ ++ L PE Sbjct: 325 LDLYRIRWTIEGTFQKVTAIFALGRFIGSTPE 356 >UniRef50_C3IAJ8 Putative uncharacterized protein n=1 Tax=Bacillus thuringiensis IBL 200 RepID=C3IAJ8_BACTU Length = 47 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 14/30 (46%), Positives = 20/30 (66%) Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKR 317 +T++P + EQV + Y LRWQIE+ KR Sbjct: 1 MTNVPWEWVPMEQVHELYTLRWQIEIVLKR 30 >UniRef50_Q04QP0 Transposase, ISLbp11 n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QP0_LEPBJ Length = 243 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 49/134 (36%), Gaps = 3/134 (2%) Query: 213 GFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR 272 +L L+ TV K +L L + K + Sbjct: 19 SYLETLEPAHTYTITV---PRKKGKEAREAIIQLRFEKLTIKSPQYKKLENIDMYALTAT 75 Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 V E+ L T + +A++V Y+ RW IE+ FK LKS ++++ + K Sbjct: 76 EVDGPKEESIDWKFLTTIPIHNSENAKRVISYYKSRWGIEVFFKVLKSGCNIESTQFKFG 135 Query: 333 ELAKAWIFANLLAA 346 + KA I + + A Sbjct: 136 DRFKACIAVSAIVA 149 >UniRef50_B2AKB8 Transposase, IS4 family n=40 Tax=cellular organisms RepID=B2AKB8_CUPTR Length = 442 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 48/117 (41%), Gaps = 6/117 (5%) Query: 250 SLPPEKALISKTRLLSENRRKGRVVQAETLEAAG------HVLLLTSLPEDEYSAEQVAD 303 ++ + +L + V A +EA L+ +D + ++ + Sbjct: 247 REVKQELRAQRMKLPGLVGAEFTCVAAREIEAPAGVKPVVWRLVTNREAQDADAVNKLVE 306 Query: 304 CYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 YR RW+IE+ F LK+ ++AL+ + + + ++ A+ I +++ P Sbjct: 307 WYRARWEIEMFFHVLKTGCKVEALQLSHMDRVERALALYMVVAWRIARLMRLGRTCP 363 >UniRef50_C6PFH6 Transposase IS4 family protein n=2 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PFH6_CLOTS Length = 398 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 3/77 (3%) Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 +A E A H + T+ E E + + Y RW IE+ F++ K+ L L+ + + + Sbjct: 278 KAFKNENALHAFICTNT---ELDTETILNYYSQRWPIEIFFRQTKNNLGLNTYQVRSTKS 334 Query: 335 AKAWIFANLLAAFLIDD 351 ++ L Sbjct: 335 IDRLLWLISLTYMYCTT 351 >UniRef50_Q1PW38 Putative uncharacterized protein n=4 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PW38_9BACT Length = 467 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 43/305 (14%), Positives = 84/305 (27%), Gaps = 60/305 (19%) Query: 86 DWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWR-LHMGYDP------- 137 + L + + GK L DG + G + LH D Sbjct: 72 EKLTKLWTKLIIKTHPQVKRFRGKLLLCGDGLKVPKEGKKMPGVKSLHQESDSNNKAEYI 131 Query: 138 --HTCQFTDFELTDSRDA---------------------ERLDRFAQTADEI-------R 167 H+CQ + LD+ + + Sbjct: 132 MGHSCQVVSLLAEAGKSCFAIPLVSRIHEGVVFSNRDQRTLLDKMVLLINSLELKELFYF 191 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 IAD + S I + + I RV + + + +G K Sbjct: 192 IADAYYASHA-IINGVVARGSHLISRVRSNAVAYF-----PVEPTPEKKGRGRPKKYGMK 245 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEA---AGH 284 V + N +A + + I+ L + G +++ ++ Sbjct: 246 VKLKTLLNDRASMKEAESPV---YGEQGIKINYRTLDLLWKPVGILIRFVLVDHPQRGKI 302 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFK-------RLKSLLHLDALRAKEPELAKA 337 +L+ T L SA ++ Y LR++IE++FK + ++ + Sbjct: 303 ILMSTDL---TISAMEIICLYGLRFKIEVSFKQALRTLGTYAYHFWMRNMQPIKRRSGNQ 359 Query: 338 WIFAN 342 + Sbjct: 360 HVHKR 364 >UniRef50_Q64EL6 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos11A10 RepID=Q64EL6_9ARCH Length = 237 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 34/80 (42%), Gaps = 5/80 (6%) Query: 276 AETLEAAGHVLLLTSL-----PEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAK 330 + G ++T+ ++ ++ E + YR + QIE AFK +KS + + Sbjct: 143 KKAERTDGLWTIVTNTSDNRDDKNRFTEEDLIQAYRDKNQIEQAFKDVKSFIKIQPFNVW 202 Query: 331 EPELAKAWIFANLLAAFLID 350 P+ +A +L+ + Sbjct: 203 TPKHVRAHYTICILSYLVSS 222 >UniRef50_C9KIM9 Transposase, IS4 family protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KIM9_9FIRM Length = 433 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 34/159 (21%), Positives = 51/159 (32%), Gaps = 29/159 (18%) Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 + D F S P + + D I RV +G D+ G + Sbjct: 227 LFDSWFCS-PASLHQIHEFGYDVIARVKKSEKMHFCFQGRMQDVKTIYLGQKKRRGRSAY 285 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 ++ + K G P RL+ V+ + + VL+ Sbjct: 286 LLSVEAEAVKDGKHLPVRLVY-------------------------VRNKNKRSDYLVLV 320 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA 326 T L S E++ Y RW IE+ FK KS L L Sbjct: 321 STDL---TLSEEEIIQTYGKRWNIEVFFKMCKSYLKLGK 356 >UniRef50_A5CYT3 FOG: transposase and inactivated derivatives n=11 Tax=Bacteria RepID=A5CYT3_PELTS Length = 465 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 60/207 (28%), Gaps = 39/207 (18%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL--DCGKN 223 + D F R I + T +G + + + L GK Sbjct: 226 YLLFDSWFAFPSIICRVREQHLLHVICMLKSMKRVLYTYKGEKVTLDTLYKELLKKPGKA 285 Query: 224 GETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAG 283 + G G P PAR++ + +K + + LE Sbjct: 286 KILASALVQIGIDSEGNPVPARIVFIR-------------DRNRSKKWLALLSTDLELTD 332 Query: 284 HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA-LRAKEPELAKAW---I 339 E++ Y RW IE+ FK KS L+L + + + A + Sbjct: 333 ---------------EEIIRIYGKRWSIEVFFKTTKSFLNLAREFQGRTYDSMVAHTTIV 377 Query: 340 FANLLAAFLIDDIIQPSLDFPPRSAGS 366 F + L + PR+ G Sbjct: 378 FCRYIMLALENR-----ESKDPRTLGD 399 >UniRef50_A1WHR7 Transposase, IS4 family n=11 Tax=Proteobacteria RepID=A1WHR7_VEREI Length = 462 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 37/89 (41%) Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 +A LL + D + D YR RW+IE F LK+ ++AL+ Sbjct: 264 PAGCKAVQWHLLTNRMASDFAEVVEWIDWYRCRWEIETFFNVLKNGCRVEALQLGSVAKL 323 Query: 336 KAWIFANLLAAFLIDDIIQPSLDFPPRSA 364 + + +L A+ + +++ P SA Sbjct: 324 ELALAVYMLVAWRLARLVRLGRTHPDLSA 352 >UniRef50_Q8VVL2 TRANSPOSASE ISMmy1G n=9 Tax=Mycoplasma RepID=Q8VVL2_MYCMS Length = 481 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 22/96 (22%), Positives = 44/96 (45%) Query: 257 LISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFK 316 ++ K +S +KG + + LE L+ + + + Y+ RW+IEL FK Sbjct: 331 MVEKKNFISRAHKKGAYDEIKLLERENLFGLIIFECNYDLDLKDIYVAYKKRWEIELLFK 390 Query: 317 RLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 + K++L + + + A F N L++ ++ I Sbjct: 391 QFKNVLEQNEVNVQGNYRLLATEFINFLSSIMLCRI 426 >UniRef50_C8VXQ2 Transposase (IS4 family protein) n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VXQ2_DESAS Length = 560 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 60/151 (39%), Gaps = 11/151 (7%) Query: 210 DMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 D + +LR + ++G+ ++ R P K + ++S Sbjct: 338 DAISYLRHMKKKQDGQF--YHLDAEVIAEEKKGVGRPKTKEKSPIKIVYRIKAIVSVMDE 395 Query: 270 KGRVVQAETLEAAGHVLLLTSL--PEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDAL 327 + E A +L+T+L P + +AE + Y+ + +E+ F+ LK+ L + Sbjct: 396 TAWQMAK---ERASTFVLITNLKNPREN-TAETILRHYKEQNTVEMRFRFLKNPAILGQV 451 Query: 328 RAKEPELAKAWIF---ANLLAAFLIDDIIQP 355 K+P KA + +L L++ ++ Sbjct: 452 FLKKPSRVKALGYIFLITMLIYALMERRVRQ 482 >UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5E2_9GAMM Length = 397 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 3/67 (4%) Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELAKAWIFAN 342 LL TSL YSA+ + Y R QIE +F+ LK+ L+L R+ E + Sbjct: 267 LLFTSLCNINYSAQDMVKIYSQRMQIEESFRDLKNTSNGLNLRHCRSYEKGRLNVALLIA 326 Query: 343 LLAAFLI 349 L+A F++ Sbjct: 327 LIANFIL 333 >UniRef50_B9P933 Predicted protein n=16 Tax=cellular organisms RepID=B9P933_POPTR Length = 446 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 47/155 (30%), Gaps = 17/155 (10%) Query: 177 PECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK 236 E I +L + + + + G E + + + Sbjct: 225 AELINALTADGVQWTITADKNSAVMELIHRIDEQAWVPVTRSGSGAALEEPCEVAETLHT 284 Query: 237 KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY 296 P RLI + A + + + ++ T+ E+ Sbjct: 285 MKETPEAFRLIVKRVRERSA----------------TPLFRNMVSYRYWVVATNFGP-EW 327 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 S +QV ++LR E FK LK+ + ++ + + Sbjct: 328 SPQQVLGWHQLRGHFENFFKELKNGVGMEYMPTGD 362 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P11901 Transposase for insertion sequence element IS421... 345 2e-93 UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium... 284 5e-75 UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfati... 262 1e-68 UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DS... 245 1e-63 UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0... 244 3e-63 UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001... 235 3e-60 UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW 232 2e-59 UniRef50_P12249 Transposase for insertion sequence element IS231... 229 1e-58 UniRef50_A6DTQ2 Putative transposase insL for insertion sequence... 228 2e-58 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 226 1e-57 UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipe... 218 2e-55 UniRef50_Q1Q5J6 Putative uncharacterized protein n=6 Tax=Candida... 216 9e-55 UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geoba... 208 2e-52 UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID... 199 1e-49 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 196 7e-49 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 193 1e-47 UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliph... 188 2e-46 UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosi... 186 7e-46 UniRef50_D1N0Z4 Transposase IS4 family protein n=3 Tax=Bacteria ... 183 7e-45 UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostrid... 182 2e-44 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 181 3e-44 UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobiu... 180 9e-44 UniRef50_Q0F098 ISGsu1, transposase n=6 Tax=Mariprofundus ferroo... 180 1e-43 UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipe... 179 1e-43 UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales Rep... 178 2e-43 UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=... 178 2e-43 UniRef50_B8FDX7 Transposase IS4 family protein n=2 Tax=Desulfati... 177 4e-43 UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC... 177 5e-43 UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_... 176 8e-43 UniRef50_B3PC11 ISCja2, transposase n=5 Tax=Proteobacteria RepID... 176 9e-43 UniRef50_B3E6V4 Transposase IS4 family protein n=8 Tax=Proteobac... 176 1e-42 UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium m... 176 1e-42 UniRef50_Q877R2 Transposase n=51 Tax=Bacteroidales RepID=Q877R2_... 176 1e-42 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 176 1e-42 UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium... 175 2e-42 UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholde... 173 9e-42 UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM ... 173 1e-41 UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostrid... 173 1e-41 UniRef50_D1K7L7 Transposase n=3 Tax=Bacteroidales RepID=D1K7L7_9... 171 3e-41 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 171 4e-41 UniRef50_Q11ZL6 Transposase, IS4 family n=22 Tax=Bacteria RepID=... 169 1e-40 UniRef50_D1Q0M9 ISGsu1 transpoase n=7 Tax=Bacteroidales RepID=D1... 169 1e-40 UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholde... 169 2e-40 UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacil... 168 2e-40 UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria R... 168 2e-40 UniRef50_C4XGQ6 Putative transposase for insertion sequence elem... 167 6e-40 UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepI... 166 8e-40 UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae Rep... 166 1e-39 UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=... 166 1e-39 UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=V... 165 2e-39 UniRef50_B3JNI1 Putative uncharacterized protein n=3 Tax=Bactero... 165 3e-39 UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Ta... 164 5e-39 UniRef50_C5V7Z6 Transposase IS4 family protein n=3 Tax=root RepI... 164 6e-39 UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungat... 164 6e-39 UniRef50_C0VKK7 ISCja2 transposase n=8 Tax=Acinetobacter RepID=C... 163 7e-39 UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostri... 163 1e-38 UniRef50_A5II18 Transposase, IS4 n=1 Tax=Legionella pneumophila ... 163 1e-38 UniRef50_C3R0J9 Transposase n=4 Tax=Bacteroidales RepID=C3R0J9_9... 163 1e-38 UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candida... 162 2e-38 UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangiu... 161 4e-38 UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostri... 161 5e-38 UniRef50_Q05309 Transposase for insertion sequence element IS115... 160 6e-38 UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfati... 160 7e-38 UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobac... 159 1e-37 UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Coryneb... 159 1e-37 UniRef50_C3KKH4 Putative transposase Y4ZB n=2 Tax=Rhizobium sp. ... 159 1e-37 UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=... 158 2e-37 UniRef50_A6DKD2 ISPg4, transposase n=7 Tax=Chlamydiae/Verrucomic... 158 2e-37 UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3... 158 3e-37 UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammapro... 158 4e-37 UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_A... 156 1e-36 UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacte... 156 1e-36 UniRef50_Q45620 Probable transposase for insertion sequence elem... 154 3e-36 UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanoth... 154 4e-36 UniRef50_B8FXQ3 Transposase IS4 family protein n=8 Tax=Desulfito... 153 9e-36 UniRef50_A7C1C1 IS231-related transposase n=6 Tax=Beggiatoa sp. ... 153 1e-35 UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanel... 152 2e-35 UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomyc... 152 2e-35 UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=... 152 2e-35 UniRef50_A4J2U7 Transposase, IS4 family protein n=3 Tax=Desulfot... 152 2e-35 UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastop... 151 3e-35 UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_A... 148 3e-34 UniRef50_C6N0W0 Putative uncharacterized protein n=1 Tax=Legione... 148 4e-34 UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia R... 148 4e-34 UniRef50_P03835 Transposase insG for insertion sequence element ... 148 4e-34 UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanob... 148 4e-34 UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio ... 147 6e-34 UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces ... 147 6e-34 UniRef50_A1HQH6 Transposase, IS4 family protein n=2 Tax=Thermosi... 147 7e-34 UniRef50_C9KS84 Transposase domain protein n=5 Tax=Bacteroidales... 146 1e-33 UniRef50_P55729 Putative transposase y4zB n=4 Tax=Rhizobiaceae R... 146 1e-33 UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitroco... 144 5e-33 UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria R... 144 7e-33 UniRef50_C9LFX6 Transposase domain protein n=14 Tax=Bacteroidale... 143 1e-32 UniRef50_UPI0001C4271A transposase, IS4 family protein n=1 Tax=B... 141 3e-32 UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=St... 141 5e-32 UniRef50_Q1VPP4 ISPg4, transposase n=7 Tax=Bacteria RepID=Q1VPP4... 140 9e-32 UniRef50_A6L0R8 Transposase n=13 Tax=Bacteroidales RepID=A6L0R8_... 139 1e-31 UniRef50_C6J7R2 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 137 6e-31 UniRef50_C7PAE4 Transposase IS4 family protein n=4 Tax=Chitinoph... 137 8e-31 UniRef50_C9C7H0 Transposase n=5 Tax=Enterococcus faecium RepID=C... 137 8e-31 UniRef50_B8FEP3 Transposase IS4 family protein n=1 Tax=Desulfati... 136 1e-30 UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria R... 136 1e-30 UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4... 134 4e-30 UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocac... 134 6e-30 UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacter... 134 6e-30 UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae R... 132 3e-29 UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithio... 131 5e-29 UniRef50_D1XZ52 Transposase, IS4 family n=1 Tax=Prevotella bivia... 131 5e-29 UniRef50_Q648P7 Transposase n=2 Tax=environmental samples RepID=... 131 6e-29 UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkhold... 131 6e-29 UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_... 130 8e-29 UniRef50_B6FVR6 Putative uncharacterized protein (Fragment) n=2 ... 128 3e-28 UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula mar... 126 9e-28 UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminoc... 126 1e-27 UniRef50_C6J0N9 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 126 1e-27 UniRef50_Q3M8C5 Transposase, IS4 n=15 Tax=Cyanobacteria RepID=Q3... 126 2e-27 UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coproco... 125 3e-27 UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax... 124 4e-27 UniRef50_A1APW2 Transposase, IS4 family n=6 Tax=Deltaproteobacte... 123 8e-27 UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultu... 123 1e-26 UniRef50_A6CCZ3 Transposase, IS4 (Fragment) n=7 Tax=Planctomyces... 122 2e-26 UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneo... 121 4e-26 UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostri... 121 4e-26 UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Ra... 121 4e-26 UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomyc... 121 5e-26 UniRef50_C9R546 Transposase (IS4 family) protein n=1 Tax=Aggrega... 121 5e-26 UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuri... 120 9e-26 UniRef50_C5VJA1 Transposase domain protein n=15 Tax=Prevotella R... 119 2e-25 UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q0... 118 3e-25 UniRef50_A1BCF6 Transposase, IS4 family protein n=1 Tax=Chlorobi... 118 3e-25 UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC 118 4e-25 UniRef50_Q8ABH9 Putative transposase n=1 Tax=Bacteroides thetaio... 116 1e-24 UniRef50_C4Z764 Putative uncharacterized protein n=4 Tax=Clostri... 115 2e-24 UniRef50_C6JEA3 Putative uncharacterized protein n=1 Tax=Ruminoc... 112 2e-23 UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 ... 110 8e-23 UniRef50_Q55566 Putative transposase for insertion sequence elem... 110 1e-22 UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia s... 110 1e-22 UniRef50_C6DY52 Transposase IS4 family protein n=1 Tax=Geobacter... 109 1e-22 UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 ... 109 2e-22 UniRef50_Q3M9Z5 Transposase, IS4 n=10 Tax=Cyanobacteria RepID=Q3... 107 7e-22 UniRef50_Q093Y3 Isrso13-transposase protein n=7 Tax=Stigmatella ... 107 8e-22 UniRef50_C3BTW8 Transposase for insertion sequence element IS231... 105 3e-21 UniRef50_A7GMF1 Transposase IS4 family protein n=15 Tax=Bacillus... 104 4e-21 UniRef50_A3ZQJ1 Probable transposase n=4 Tax=Blastopirellula mar... 104 4e-21 UniRef50_A3EIG1 FOG: Transposase and inactivated derivatives n=3... 104 7e-21 UniRef50_A5D1X0 Transposase n=1 Tax=Pelotomaculum thermopropioni... 103 8e-21 UniRef50_Q18EK5 Probable transposase (ISH8/ISH26) n=5 Tax=Haloqu... 103 1e-20 UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 1... 102 1e-20 UniRef50_UPI00016C37A0 transposase, IS4 n=2 Tax=Gemmata obscurig... 100 1e-19 UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 Rep... 100 1e-19 UniRef50_B0NZ84 Putative uncharacterized protein n=1 Tax=Clostri... 99 2e-19 UniRef50_C5EN31 Putative uncharacterized protein n=1 Tax=Clostri... 99 3e-19 UniRef50_A4BSI0 Putative uncharacterized protein n=1 Tax=Nitroco... 99 3e-19 UniRef50_B6FLV1 Putative uncharacterized protein (Fragment) n=1 ... 98 5e-19 UniRef50_UPI0001AF03EF IS4 family transposase n=1 Tax=Streptomyc... 95 3e-18 UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobact... 95 3e-18 UniRef50_Q8QNB6 EsV-1-170 n=2 Tax=Ectocarpus siliculosus virus 1... 95 4e-18 UniRef50_A7C2A8 Transposase of IS641 n=1 Tax=Beggiatoa sp. PS Re... 95 4e-18 UniRef50_UPI0000F70487 putative IS4 transposase n=1 Tax=Aeromona... 95 5e-18 UniRef50_A4A0C3 Probable transposase n=1 Tax=Blastopirellula mar... 95 6e-18 UniRef50_Q82R33 Putative IS4 family ISFsp6-like transposase n=1 ... 92 4e-17 UniRef50_A3ZMM8 Transposase insG for insertion sequence element-... 92 4e-17 UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula balt... 91 5e-17 UniRef50_C8XGG2 Transposase IS4 family protein n=2 Tax=Nakamurel... 91 5e-17 UniRef50_B2PVI2 Putative uncharacterized protein n=1 Tax=Provide... 91 7e-17 UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae Re... 91 7e-17 UniRef50_Q67PW6 Transposase-like protein n=14 Tax=Symbiobacteriu... 89 2e-16 UniRef50_UPI00016C3BAC transposase n=1 Tax=Gemmata obscuriglobus... 89 2e-16 UniRef50_C3FBK7 Transposase for insertion sequence element IS231... 89 2e-16 UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii ... 89 3e-16 UniRef50_D1VZM3 Transposase, IS4 family n=3 Tax=Prevotella RepID... 89 3e-16 UniRef50_Q64E61 Transposase n=1 Tax=uncultured archaeon GZfos14B... 88 6e-16 UniRef50_B2IXJ5 Putative uncharacterized protein n=1 Tax=Nostoc ... 87 8e-16 UniRef50_C6JAL6 Transposase (Fragment) n=1 Tax=Ruminococcus sp. ... 86 1e-15 UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE 85 3e-15 UniRef50_A7C4E9 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 85 3e-15 UniRef50_B5WFI6 Putative uncharacterized protein n=1 Tax=Burkhol... 85 4e-15 UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID... 84 7e-15 UniRef50_A9F243 Transposase, IS4 family n=4 Tax=Sorangium cellul... 83 2e-14 UniRef50_A1ZPG0 Transposase of, putative n=3 Tax=Microscilla mar... 83 2e-14 UniRef50_Q4A8Q4 ISMHp1 transposase n=19 Tax=Mycoplasma RepID=Q4A... 82 3e-14 UniRef50_B2AJ60 Transposase, IS4 family n=4 Tax=Proteobacteria R... 82 3e-14 UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID... 81 5e-14 UniRef50_Q46310 Transposase n=1 Tax=Carnobacterium maltaromaticu... 81 6e-14 UniRef50_A0LAZ1 Transposase, IS4 family protein n=4 Tax=Magnetoc... 81 7e-14 UniRef50_A6DG92 ISPg4, transposase n=1 Tax=Lentisphaera araneosa... 81 8e-14 UniRef50_A8YU85 Transposase n=21 Tax=Lactobacillus RepID=A8YU85_... 81 8e-14 UniRef50_Q1Q2K2 Putative uncharacterized protein n=5 Tax=Candida... 81 9e-14 UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2... 81 9e-14 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 81 9e-14 UniRef50_C8W0R5 Transposase-like protein n=12 Tax=Desulfotomacul... 80 1e-13 UniRef50_D0DW10 Transposase IS4 family protein n=5 Tax=Lactobaci... 79 3e-13 UniRef50_Q7UY96 Similar to transposase n=1 Tax=Rhodopirellula ba... 79 3e-13 UniRef50_A9DNS7 Transposase n=1 Tax=Shewanella benthica KT99 Rep... 78 5e-13 UniRef50_A5WBL3 Transposase, IS4 family n=2 Tax=Bacteria RepID=A... 78 6e-13 UniRef50_Q7NBK2 Predicted transposase n=10 Tax=Mycoplasma RepID=... 77 1e-12 UniRef50_Q9R3J0 Transposase, putative n=10 Tax=Deinococcus radio... 77 1e-12 UniRef50_C0WV66 Transposase IS4 family protein n=5 Tax=Lactobaci... 76 2e-12 UniRef50_Q9X6I5 Putative uncharacterized protein n=2 Tax=Bacillu... 76 3e-12 UniRef50_C3RGR4 Putative uncharacterized protein n=2 Tax=Bactero... 76 3e-12 UniRef50_C6I0E1 Transposase, IS4 family protein n=4 Tax=Leptospi... 74 5e-12 UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus a... 74 5e-12 UniRef50_Q6MS13 Transposase IS1634BQ n=39 Tax=Mycoplasma RepID=Q... 74 6e-12 UniRef50_Q737L2 IS231-related transposase n=3 Tax=Bacillus cereu... 74 1e-11 UniRef50_D1JFQ9 Putative uncharacterized protein n=1 Tax=uncultu... 73 2e-11 UniRef50_A3H523 Transposase (IS4 family) protein (Fragment) n=1 ... 72 4e-11 UniRef50_A6DM44 Putative uncharacterized protein n=2 Tax=Lentisp... 72 4e-11 UniRef50_A4A0C6 Probable transposase n=2 Tax=Blastopirellula mar... 71 5e-11 UniRef50_Q04QP0 Transposase, ISLbp11 n=2 Tax=Leptospira borgpete... 71 5e-11 UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfoto... 71 9e-11 UniRef50_UPI0000164DB3 hypothetical protein TVN0693 n=1 Tax=Ther... 70 1e-10 UniRef50_B2AKB8 Transposase, IS4 family n=40 Tax=cellular organi... 70 1e-10 UniRef50_B5CN98 Putative uncharacterized protein n=5 Tax=Clostri... 69 2e-10 UniRef50_C7TBQ5 Transposase n=4 Tax=Lactobacillus rhamnosus RepI... 68 4e-10 UniRef50_C0GNX3 Transposase IS4 family protein n=3 Tax=Desulfona... 68 4e-10 UniRef50_C8VXW5 Transposase IS4 family protein n=3 Tax=Desulfoto... 68 5e-10 UniRef50_Q8DM76 Tlr0247 protein n=2 Tax=Thermosynechococcus elon... 68 5e-10 UniRef50_C6PFH6 Transposase IS4 family protein n=2 Tax=Thermoana... 68 7e-10 Sequences not found previously or not previously below threshold: UniRef50_C8PSK2 ISGsu1, transposase n=1 Tax=Treponema vincentii ... 104 7e-21 UniRef50_Q2J8F5 Putative uncharacterized protein n=3 Tax=Frankia... 86 2e-15 UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID... 85 3e-15 UniRef50_D2CY12 Transposase n=12 Tax=Mycoplasma RepID=D2CY12_MYCSY 84 6e-15 UniRef50_B9YUA6 Transposase, IS4 family protein n=3 Tax='Nostoc ... 80 1e-13 UniRef50_B2J2I5 Transposase, IS4 family protein n=1 Tax=Nostoc p... 79 2e-13 UniRef50_B3JEV4 Putative uncharacterized protein n=1 Tax=Bactero... 79 2e-13 UniRef50_UPI00003C8608 transposase IS4 family protein n=4 Tax=Fe... 79 3e-13 UniRef50_A4XK23 Transposase, IS4 family protein n=8 Tax=Clostrid... 78 5e-13 UniRef50_UPI00016C560B transposase IS4 family protein n=1 Tax=Ge... 78 6e-13 UniRef50_UPI00016C3BAD transposase, IS4 n=2 Tax=Gemmata obscurig... 77 1e-12 UniRef50_D2ASB5 Transposase, IS4 family n=1 Tax=Streptosporangiu... 77 1e-12 UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromon... 76 2e-12 UniRef50_A1BDB0 Transposase, IS4 family protein n=3 Tax=Chlorobi... 74 7e-12 UniRef50_A7N7H3 Putative uncharacterized protein n=31 Tax=Vibrio... 73 2e-11 UniRef50_P30192 Putative uncharacterized protein ychG n=8 Tax=En... 72 4e-11 UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=... 70 1e-10 UniRef50_D1Y365 Transposase n=1 Tax=Pyramidobacter piscolens W54... 70 1e-10 UniRef50_C5JAH9 Transposase, IS4-like n=1 Tax=uncultured bacteri... 70 1e-10 UniRef50_A9KH40 Putative uncharacterized protein n=1 Tax=Coxiell... 70 1e-10 UniRef50_A8L6T7 Transposase IS4 family protein n=10 Tax=Actinomy... 70 2e-10 UniRef50_B8FNX6 Transposase IS1634 family protein n=9 Tax=Desulf... 69 2e-10 UniRef50_B0JP83 Transposase n=112 Tax=Cyanobacteria RepID=B0JP83... 69 3e-10 UniRef50_UPI0001BC2E1C TnpB family transposase n=1 Tax=Brevibact... 68 4e-10 UniRef50_UPI0000F5175B transposase-like protein n=2 Tax=Ferropla... 68 5e-10 UniRef50_A4BNE3 Transposase n=2 Tax=Gammaproteobacteria RepID=A4... 68 6e-10 UniRef50_UPI00016C5887 hypothetical protein GobsU_05723 n=3 Tax=... 68 6e-10 UniRef50_A6DG91 ISPg4, transposase n=1 Tax=Lentisphaera araneosa... 68 7e-10 UniRef50_C1XVD6 Transposase n=1 Tax=Meiothermus silvanus DSM 994... 68 8e-10 UniRef50_UPI000038E639 hypothetical protein Faci_04540 n=2 Tax=F... 68 8e-10 UniRef50_Q3B5Q7 Transposase-like n=5 Tax=Chlorobium/Pelodictyon ... 67 8e-10 >UniRef50_P11901 Transposase for insertion sequence element IS421 n=41 Tax=cellular organisms RepID=T421_ECOLX Length = 371 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 370/371 (99%), Positives = 370/371 (99%), Gaps = 1/371 (0%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LRE 59 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS LRE Sbjct: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSSLRE 60 Query: 60 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI 119 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI Sbjct: 61 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI 120 Query: 120 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC 179 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC Sbjct: 121 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC 180 Query: 180 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 239 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG Sbjct: 181 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 240 Query: 240 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE Sbjct: 241 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 300 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF Sbjct: 301 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 360 Query: 360 PPRSAGSEKKN 370 PPRSAGSEKKN Sbjct: 361 PPRSAGSEKKN 371 >UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium RepID=C6AUF2_RHILS Length = 372 Score = 284 bits (725), Expect = 5e-75, Method: Composition-based stats. Identities = 154/364 (42%), Positives = 205/364 (56%), Gaps = 4/364 (1%) Query: 5 HDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWA 64 D+W + + +L+ +AR GA TR REI++A TLLRL LAYG GMSLRE AWA Sbjct: 8 LDHWPEVRERLPAGFDLEATARLRGAFTRVREIKNAETLLRLALAYGGLGMSLRETCAWA 67 Query: 65 QLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVT-GCTSGKRLRLVDGTAISAPG 123 + +A LSD +LL+RL AA W G + A +A +A V G +G RLR++DGT+I PG Sbjct: 68 EAGGIARLSDPSLLERLCKAAPWLGDIVAALIAEQAKVPTGRFAGYRLRVLDGTSICHPG 127 Query: 124 GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSL 183 WRLH+GYD T Q ELTD AE L R +I +ADR + +RP +R + Sbjct: 128 ADRTTWRLHVGYDLATAQVDQLELTDIHGAENLQRLTYAPGDIVLADR-YYARPRDLRPV 186 Query: 184 AFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGN-KKAGAPF 242 AD+IVR W LR L G FD+ L + GE V + P Sbjct: 187 IDAGADFIVRTGWNSLRLLQTNGEPFDLFAALA-AQQEQEGEVQVRVHEGMTGTPPPPPL 245 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 RLI P++A + RLL + R++G+ +LEAA ++LLLTSLP + + Sbjct: 246 VLRLIVRRKDPQQAQAEQERLLKDARKRGKKPDPRSLEAAKYILLLTSLPTATFPPADIL 305 Query: 303 DCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPR 362 YR RWQIELAFKR KSL LD+L AK+PELA+AW++A L+ A + + I D PP Sbjct: 306 TLYRFRWQIELAFKRFKSLAGLDSLPAKKPELARAWLYARLIVAIIAEQIAGQVPDSPPS 365 Query: 363 SAGS 366 G+ Sbjct: 366 GCGN 369 >UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F976_DESAA Length = 371 Score = 262 bits (669), Expect = 1e-68, Method: Composition-based stats. Identities = 119/369 (32%), Positives = 184/369 (49%), Gaps = 18/369 (4%) Query: 1 MNYSH-----DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM 55 M D+W AIL + P + A+ GAL+RRR LLR+ L + G Sbjct: 1 MENQILLSEGDDWQAILTFL--PHGWEEKAKELGALSRRRNFDGPEALLRVLLIHLVQGC 58 Query: 56 SLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVR---AAVTGCTSGKRLR 112 SLR +A ++ +A+ SDVALLKRL+ + +W +A + + G+ +R Sbjct: 59 SLRVTSALSKAGGLASASDVALLKRLKASGEWMRWMAVELMKQWFGKQPEKILGMGRTVR 118 Query: 113 LVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRG 172 +VDG+ +S PG W++H + Q + +TD + E L F ++ +ADRG Sbjct: 119 VVDGSTVSEPGSTGTTWKIHYSIQLPSLQCDEVYVTDPKTGEDLKNFNVHPGDVFLADRG 178 Query: 173 FGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGN 232 + R + + G D IVR+ + + G F ++ LR L + G+ I + Sbjct: 179 YYHRTGMLHVV-KGGGDLIVRMIHQY-KLYDINGQEFGLIKNLRSLTVNQIGDWDAFIHH 236 Query: 233 SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLP 292 +G R+ A+ E A +K +L EN +KG + ETL AA +V + T+L Sbjct: 237 KKEVISG-----RVCAIKKSKEAAEKAKRAILRENSKKGHKTKPETLVAAEYVFVFTTLS 291 Query: 293 EDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 + + A QV + YR RWQ+ELAFKRLKSL+ L L+ + E AKAW+ + AAFL++ + Sbjct: 292 RE-WKASQVLEAYRGRWQVELAFKRLKSLIGLGHLKKTDFEGAKAWLHGKIFAAFLVEAM 350 Query: 353 IQPSLDFPP 361 I F P Sbjct: 351 IAACDSFSP 359 >UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017465B5 Length = 382 Score = 245 bits (626), Expect = 1e-63, Method: Composition-based stats. Identities = 108/361 (29%), Positives = 169/361 (46%), Gaps = 8/361 (2%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 +NW +L+ + PE ++ A+ GA+ R R ++LLR+ L + G SLR + + Sbjct: 7 ENWDYLLSLL--PENWESLAKTTGAVQRLRGAESLSSLLRVLLLHAGHGCSLRTASVVGK 64 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAV-RAAVTGCTSGKRLRLVDGTAISAPGG 124 ++SDVAL KR W L A A R + G +LRLVDGT I PG Sbjct: 65 AAGWISMSDVALHKRFALCEGWLQQLCAGLFAQSRLQLPAAYRGLKLRLVDGTTIKEPGA 124 Query: 125 GSAEWRLHMGYDPHTCQFTDFELTDSRDA---ERLDRFAQTADEIRIADRGFGSRPECIR 181 ++WR+H F L R + E L F + +ADRGF S I Sbjct: 125 TGSQWRIHYSLRVPDWHCDFFRLNPVRGSGNGESLKHFEVAPGDCFLADRGF-SHLLGIE 183 Query: 182 SLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD-CGKNGETTVMIGNSGNKKAGA 240 + G A I+R++ + +G ++ +LR L G + + Sbjct: 184 HVYRGGAHVIMRLNEQNTPLEDEQGRPVVLLPWLRKLKQPGAAAGLDLWVRPRKEDSLEK 243 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 P RL AV E A +++ ++ ++ ++A TLE +++LT++P D S + Sbjct: 244 RVPVRLCAVRKSVEAAALAQRKVQRRAQQDQTKLRAATLEHTAWIVVLTTVPRDTLSDVE 303 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 V YR+RWQIELAFKRLKSL + L + ++AW++A LL A L + + + + Sbjct: 304 VLQWYRVRWQIELAFKRLKSLGDVGHLPKSDERSSRAWVYAKLLIALLSEKMQRHAAALS 363 Query: 361 P 361 P Sbjct: 364 P 364 >UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0AF19_9BACT Length = 362 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. Identities = 109/361 (30%), Positives = 168/361 (46%), Gaps = 14/361 (3%) Query: 4 SHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAW 63 + W + + PE + +AR GA + + IR A LLRL L + G+SLR A Sbjct: 6 MTEEWGLVKGLL--PEGWEVAAREQGAFKQAKGIRTAEELLRLILMHAGSGLSLRHAVAR 63 Query: 64 AQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTG---CTSGKRLRLVDGTAIS 120 + +SDVALLKRLRNA W ++ + L +A G VD T I Sbjct: 64 GAAAGLPEVSDVALLKRLRNAEGWLRWMSVRLLEQQAGQPRWSRLPEGWTAVAVDSTTIE 123 Query: 121 APGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECI 180 G +WRLH + ELTD++ E L R+ ++ + DR F P+ I Sbjct: 124 ESGASGTDWRLHYAIGLPSLFCEQAELTDNKGGESLCRYKVRKGDLFLGDRNFCRAPQ-I 182 Query: 181 RSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA 240 R + + ++R H L +G D+ +L L + E V + + Sbjct: 183 RHVMDHQGAVLLRWHSTSLPLFDQQGHALDVPAWLAQLRSRQCSELPVFLKDGTA----- 237 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 RL A+ + P+ A + ++ ++ GR + L A +++++TSLP + Sbjct: 238 ---LRLCALRVSPQAAQRERAKIRLSAKKNGRKPSCQCLCMADYIVVVTSLPSSCLDSRG 294 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 + YRLRWQIELAFKRLKSLL+ + K+P +++W+ A LL LI+ + S F Sbjct: 295 ILQLYRLRWQIELAFKRLKSLLNTGHVPKKDPLSSRSWLQAKLLTCLLIEKSLLQSEVFS 354 Query: 361 P 361 P Sbjct: 355 P 355 >UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001905F7C Length = 367 Score = 235 bits (598), Expect = 3e-60, Method: Composition-based stats. Identities = 131/345 (37%), Positives = 192/345 (55%), Gaps = 9/345 (2%) Query: 1 MNYSHD--NWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR 58 M+ S +W ++ +G EEL+ SAR AGAL R+R++ AA LLRL AY GG SLR Sbjct: 1 MSDSLQALDWGELVERLGSAEELEASAREAGALLRKRQVGGAADLLRLCFAYVLGGFSLR 60 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAA--VTGCTSGKRLRLVDG 116 + AWA +A++SDVA+LKRL+ +ADW G L ++ LA R G S RL VD Sbjct: 61 TLAAWADQRGLASMSDVAMLKRLKASADWVGYLVSELLAERCPEAFAGVHSDLRLMAVDA 120 Query: 117 TAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSR 176 T ++ PG W +H +D + + E+TD R+AERL R A E+RIADR ++ Sbjct: 121 TVVAPPGPKRDYWMVHTVFDLSRLKLSSVEVTDRREAERLSRG-VKAGELRIADRAH-AK 178 Query: 177 PECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK 236 + ++ AD++VR R L +G + + R + +V I + +K Sbjct: 179 ATDLAAVVKAGADFLVRAPSNYPRLLDGDGQLLERLALCREAGDKGVLDRSVRIQDGKSK 238 Query: 237 KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY 296 AR++ + LPPE A ++ + +E AG+++LLTSL D++ Sbjct: 239 V---EVAARVVILPLPPEAAAKARRAARRLAAKARYKPSEAGIEMAGYLVLLTSLNADDW 295 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFA 341 E++A YRLRWQIELAFKR+KSL+ L+ LRAK+ +LA+ WI Sbjct: 296 PPERLASTYRLRWQIELAFKRMKSLIGLEGLRAKDADLARLWINI 340 >UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW Length = 417 Score = 232 bits (590), Expect = 2e-59, Method: Composition-based stats. Identities = 82/385 (21%), Positives = 147/385 (38%), Gaps = 26/385 (6%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPG--GMSLR 58 M W + + PE L A+ G + R+R + A L L SL+ Sbjct: 1 MENQMQAWMKTIRQLFSPETLTHLAQETGFIQRKRAL-TAEAFLTLCAWGDGSLAQQSLQ 59 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAADWF-GILAAQTLAVRAA------VTGCTSGKRL 111 + L +LS L +R A F + L + T T RL Sbjct: 60 RLCTSLTLRHDCSLSSEGLNQRFTERAVAFLREVFFLLLQRQPPLLWSTIQTYRTCFTRL 119 Query: 112 RLVDGTAISAP--------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAE----RLDRF 159 R++D T+ P G S+ ++ YD + + D++ + Sbjct: 120 RILDSTSFLVPADYGEDYRGSVSSGAKIQFEYDLLSGACLQLCAQSANDSDARFAYHAQH 179 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 +++ I D GF S + A YI R+ ++ + G Sbjct: 180 TILPNDLCIRDLGFFSVAALTE-IDARGAYYITRLRSDMKVYIKENSQWKEWDWESLGNQ 238 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 + + G+++ P RLI L E+ + +RKG+ + +TL Sbjct: 239 LKEGESVEMEHVYIGHERLYIP---RLIFRRLTEEEWQKRMAYVRKREKRKGKALTRQTL 295 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 E + +LLT+LP++ + +QV + Y LRWQIEL FK KS+ L+ ++ + E + + Sbjct: 296 EQKKYHILLTNLPQESFDGQQVYELYSLRWQIELLFKAWKSVFDLEKVKKMKKERFECHV 355 Query: 340 FANLLAAFLIDDIIQPSLDFPPRSA 364 + L+A + + + + ++A Sbjct: 356 YGTLIAILVTQTFLFQARTYWQQTA 380 >UniRef50_P12249 Transposase for insertion sequence element IS231A n=411 Tax=Bacillus RepID=T231A_BACTB Length = 478 Score = 229 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 70/399 (17%), Positives = 144/399 (36%), Gaps = 47/399 (11%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS--LR 58 + +S L P L+ A+ G + R+R+ + L + + S L Sbjct: 5 IQDELQLFSEELCRHLTPSFLEELAKKLGFVKRKRKFSGSE-LATICIWISQRTASDSLV 63 Query: 59 EVTAWAQLHDVATLSDVALLKRL-RNAADWFGILAAQTLAVR------AAVTGCTSGKRL 111 + + +S L KR + A ++ + + + + T T +R+ Sbjct: 64 RLCSQLHAATGTLMSPEGLNKRFDKKAVEFLKYIFSILWKGKLCKTSAISSTALTHFQRI 123 Query: 112 RLVDGTAISAPG------------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERL--- 156 R++D T P +A ++ + YD H+ QF +F++ ++ ++ Sbjct: 124 RILDATIFQIPKHLASIYPGSGGCAQTAGIKIQLEYDLHSGQFLNFQVGPGKNNDKTFGT 183 Query: 157 -DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL------------- 202 ++ I D G+ S E + + A YI R+ ++ Sbjct: 184 ECLDTLRPGDLCIRDLGYFS-LEDLDQMDQRGAYYISRLKLNHTVYIKNPSPEYFRNGTV 242 Query: 203 --TAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK 260 ++ ++ D+ + L G+ E R+I L ++ + Sbjct: 243 KKQSQYIQVDLEHIMNHLKPGQTYEIKE-----AYIGKNQKLFTRVIIYRLTEKQIQERR 297 Query: 261 TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKS 320 + +KG ++ G + +++ PE EQ+ D Y LRWQIE+ FK KS Sbjct: 298 KKQAYTESKKGITFSEKSKRLTGINIYVSNTPEGIVPMEQIHDFYSLRWQIEIIFKTWKS 357 Query: 321 LLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 L + + + E + ++ L+A F+ + Sbjct: 358 LFQIHHWQNIKQERLECHVYGRLIAIFICSSTMFKIRKL 396 >UniRef50_A6DTQ2 Putative transposase insL for insertion sequence IS186 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTQ2_9BACT Length = 375 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. Identities = 104/365 (28%), Positives = 161/365 (44%), Gaps = 18/365 (4%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREI---RDAATLLRLGLAYGPGGMSLREVTA 62 +W + P+ D G L R+ + LLR L + G +SLR A Sbjct: 9 SDWDYFKTFL--PDGWDGMMAETGMLKFGRKFSGEDGPSKLLRTLLIHLGGNLSLRSTCA 66 Query: 63 WAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGC----TSGKRLRLVDGTA 118 A+ ++ +SDVALLKRL+ +++WF Q L R VDG+ Sbjct: 67 LAKEGNIIDVSDVALLKRLQKSSEWFNWCTTQLLDKMKPKNPQGLPEQEEYNFRYVDGSI 126 Query: 119 ISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPE 178 + PG + W LH + T + +TD + E L ++ +++ I DR + R Sbjct: 127 VREPGATGSTWMLHYSMNAKTLAPDEITITDQKKGESLKNYSVKPNDVFIGDRVYPRRNG 186 Query: 179 CIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKA 238 I + + R G F ++ LR L G GE V+I K Sbjct: 187 IIH-VHSNGGYILCRFPPSLTPLHNDNGTPFKLLSKLRKLKLGDIGEYNVVI-----KHN 240 Query: 239 GAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV--VQAETLEAAGHVLLLTSLPEDEY 296 AR+ A+ E L ++ + + + R + ETLE AG++L+LT+L + Sbjct: 241 EGQINARVCAMKKDHESTLKAQKAIHRKASKNSRKGSTRPETLEYAGYILILTTL-AESV 299 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 S E++ + YR RWQIEL FKRLKS++ L K ++W+ +L A LI+ II+ Sbjct: 300 SPEKILNIYRSRWQIELLFKRLKSIIGAAPLYKKNDIGMRSWLAGKILVATLIEYIIRCG 359 Query: 357 LDFPP 361 DF P Sbjct: 360 EDFFP 364 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 70/393 (17%), Positives = 139/393 (35%), Gaps = 39/393 (9%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAG--ALTRRREIRDAATLLRLGLAYGPGGMSLR 58 MNYS + +L+ I K + +R+++ +++ L + Sbjct: 1 MNYSTEVKQKLLSIITKMDSYYWLFTKHPKTDFSRKKKW-SFEEVMKFMLTMEGKALRDE 59 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTA 118 + + + + S +R + + F L Q +G RL DG+ Sbjct: 60 LLEYFEFDNTTPSNSSFN-QRRAQILPEAFEFLF-QEFTKSFTDNVTYNGLRLIACDGSD 117 Query: 119 ISAPG---------------GGSAEWRLHMGYDPHTCQFTDFELTDSRD-------AERL 156 + G L+ YD + Q+TD + SR E + Sbjct: 118 LCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDAIIQPSRLANERRAMCEMI 177 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 DR+ T+ I IADRG+ + + Y++RV +T++ + L Sbjct: 178 DRYNDTS-AIFIADRGYENYN-IFAHVEHKGMYYLIRVKDITSNGITSK------LTMLP 229 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 + N+ P R+I P + + + K RV++ Sbjct: 230 ESGEFDEWVNVTLTKKQTNEVKANPKKYRVIDKKTPFDYLDLH---FNNFYEMKMRVIRF 286 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 + ++T+LP+D+++++++ Y RW IE +F+ LK L L +K+PE Sbjct: 287 PIP-QGSYECIITNLPQDKFNSDEIKRLYAKRWGIETSFRELKYALGLTRFHSKKPEYIM 345 Query: 337 AWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKK 369 I++ + + I + + + Sbjct: 346 QEIWSRMTLYNFCEIIATNVVINEKKGCKHTYQ 378 >UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipelotrichaceae RepID=B7C7E2_9FIRM Length = 446 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. Identities = 73/346 (21%), Positives = 127/346 (36%), Gaps = 39/346 (11%) Query: 29 GALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWF 88 TR+R+ +TL+ + L G G + + D T+S + R + D F Sbjct: 45 KDFTRKRKHLFGSTLMNVLLLEG-GSLKDELYKLFGYNLDTPTVSSF-IQARDKIKPDTF 102 Query: 89 GILAAQTLAVRAAVTGCTSGKRLRLVDGTAISA----------------PGGGSAEWRLH 132 IL R +G RL VDG+ + + + L+ Sbjct: 103 HILF-NLFNGRTRKPKLYNGYRLLAVDGSTLPITSEIKDKKTTIQKANNSDKPFSAFHLN 161 Query: 133 MGYDPHTCQFTDF-----ELTDSRDA-ERLDRFAQTADEIRIADRGFGSRPECIRSLAFG 186 YD + D + D RDA ++ + I IADRG+ S + Sbjct: 162 TSYDILEYTYDDVILQGQAVQDERDALNKMVERYKGDKAIFIADRGYESINS-FEKIHLS 220 Query: 187 EADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARL 246 Y+VRV G LR + E +++ + K A Sbjct: 221 GNKYLVRVKD------------IHSTGMLRSFGPFLDDEFDLIVKRTLTTKQTNEIKAHP 268 Query: 247 IAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYR 306 P+ + RVV+ + E + ++T+L ++E+S + + + Y Sbjct: 269 EIYKFVPQNQRFDYFEDAPFYDFECRVVRFKITE-DTYECIVTNLDKNEFSMQDIKELYH 327 Query: 307 LRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 LRW+IE +++ LK L L+ L +K+ L + I+A ++ I Sbjct: 328 LRWEIETSYRELKYDLDLNTLHSKKRNLIEQEIYAKMILYNFCSRI 373 >UniRef50_Q1Q5J6 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q5J6_9BACT Length = 367 Score = 216 bits (550), Expect = 9e-55, Method: Composition-based stats. Identities = 101/360 (28%), Positives = 162/360 (45%), Gaps = 10/360 (2%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 ++W + P + + A + AL R+ + LLR L + G SLRE A+ Sbjct: 4 EDWDLLRTFF--PNDWKSLAVDTNALKGLRKDKSEEKLLRTLLIHLGCGYSLRETVVRAK 61 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGG 125 ++A LSDVALLKRL+ + +W L R + LRL D T + PG Sbjct: 62 RANLADLSDVALLKRLKKSKEWLYKLCLSLFRERGLQINKRNNFHLRLFDATTVKEPGKT 121 Query: 126 SAEWRLHMGYDPHTCQFTDFEL---TDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS 182 + WR+H + + F+L E +F D+ IADRG+ + + I Sbjct: 122 GSLWRIHYSIEVPSLSCDFFKLTGTEGEGTGESFRQFPMKKDDYIIADRGYCT-GQGIHH 180 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE-TTVMIGNSGNKKAGAP 241 A VRV+ + LR E F ++ ++ L + V I N N + Sbjct: 181 ATRKGAYLSVRVNSQSLRIFGEEKKPFPLLKEIQYLKRPLAIKSWNVFIPNVDNTEY--- 237 Query: 242 FPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQV 301 L + E I+ +L +KG ++ ETL A +V++ T+ PE++++A + Sbjct: 238 VKGSLCIIRKTEEAIKIAHKKLKRHASKKGIELKPETLIYAKYVIVFTTFPENQFTAFDI 297 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 + YR+RWQIEL FKR K + L + + +KAW++ L A L + +I + F P Sbjct: 298 LEWYRVRWQIELVFKRFKQIAQFGHLPKYDDDSSKAWLYGKLFVALLTEKLIDFATSFSP 357 >UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geobacillus kaustophilus RepID=Q5L3A2_GEOKA Length = 453 Score = 208 bits (530), Expect = 2e-52, Method: Composition-based stats. Identities = 76/368 (20%), Positives = 143/368 (38%), Gaps = 29/368 (7%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATL-LRLGLAYGPGGMSLREVTAWAQLHDVA 70 L + EEL+ AR+ + R+ ++R + L L G G SL ++ + L Sbjct: 18 LRSVLSCEELEHMARDHQFIQRKGKLRAHDFVALCTFLQEGGGQKSLVQLCSALALKQNT 77 Query: 71 TLSDVALLKRLRNAADWF------GILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP-- 122 +LS L +R A F +L QT R + R+R++D T+ P Sbjct: 78 SLSAEGLNQRFHEKAVSFLKAVFEKLLIHQTQEARRLCPRHSLFLRIRILDSTSFQLPPE 137 Query: 123 ------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQ----TADEIRIADRG 172 G ++ + Y+ + ++ D+R + + ++ + D G Sbjct: 138 IQGIYEGCTGPGVKIQLEYEWLEGKVLHVDVEDARHHDAAYGASLLSTIQEGDLCLKDLG 197 Query: 173 FGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE--GMRFDMMGFLRGLDCGKNGETTVMI 230 + S E ++++ A YI R+ + +++ FL L G+ E Sbjct: 198 YFS-LEGLQAIHDAGAFYISRLKHNVGIYQKEGDRFRKWEPEDFLAVLQPGETMELE--H 254 Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 KK P RLI L E+ + + + ++KG T + +T+ Sbjct: 255 AYVSGKKVHQP---RLIVYRLTEEQERQKEGQWKQKAKQKGAA--YVTRRPHPIYVYITN 309 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 +P S ++ Y LRWQIE+ FK KSL H+ + + + ++ L+A + Sbjct: 310 IPAIYTSLHEIHTLYSLRWQIEVVFKTWKSLFHIHRFKPMKGARFQCHLYGTLIALLISS 369 Query: 351 DIIQPSLD 358 ++ + Sbjct: 370 TVMFKMRE 377 >UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID=Q64B41_9ARCH Length = 439 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 71/385 (18%), Positives = 127/385 (32%), Gaps = 37/385 (9%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LREV 60 S D + + + L +AR G + R R+I L +G +S +R + Sbjct: 10 ENSVDPTVQTIVEMFPEDFLRNTARETGVVKRERKIDVVILFWVTTLGFGVRFLSTIRGL 69 Query: 61 TAWAQLHDVATLSDVALLKRLRNA-ADWFGILAAQTLAVRAAVTGC------TSGKRLRL 113 + TLS + R A++ +A +A TG K L + Sbjct: 70 KRKYEEKAKTTLSISSFHDRFTPEMAEFLRKCVLHAIAFQAQQTGRVLDDKLKRFKDLVI 129 Query: 114 VDGTAISAP------------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAE--RLDRF 159 D T I +A ++ + R +E L Sbjct: 130 QDSTIIRLHESLAKIWPAARTKKIAAGVKVSCIVSAVADSPKSVRIYPERTSEAKILRLG 189 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 D I + D G+ + ++ R+ + Sbjct: 190 PWLRDRILLIDLGYFKY-LFFDRIDGYGGYFVSRLKGNANPLIVR--------------V 234 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 K +V + + ++ V + E S +R+ R+V A Sbjct: 235 NRKCRGNSVDVVGKKLRDVLPRLKREILDVEVEVEFKRRKYKGKQSTVKRRFRMVCAFNS 294 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 ++ + LT++ D SAE++A Y RW+IEL FK LKS +D + + P + K I Sbjct: 295 DSGKYHSYLTNIRVDILSAEEIALLYGARWEIELIFKELKSHYRMDQIPSANPNIVKCLI 354 Query: 340 FANLLAAFLIDDIIQPSLDFPPRSA 364 + +L I++ + P +A Sbjct: 355 WIAILTLMCSRRILRLIRNANPENA 379 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 196 bits (499), Expect = 7e-49, Method: Composition-based stats. Identities = 57/363 (15%), Positives = 119/363 (32%), Gaps = 39/363 (10%) Query: 30 ALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFG 89 +R R+I T + + + G ++ + + + T+S +R + + F Sbjct: 33 DFSRNRKIN-FKTCVGITMNSGGCTLNKELLDFFDFDVNAPTVSAYT-QQRAKILPEAFE 90 Query: 90 ILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP---------------GGGSAEWRLHMG 134 L A G +L DG+ ++ G L+ Sbjct: 91 YLFHAFTEENAQTKNLYEGYQLLACDGSNLTIAPNLNDPETLWKSNQLGATGNHLHLNAL 150 Query: 135 YDPHTCQFTDFELTDSRDAE-------RLDRFAQTADEIRIADRGFGSRPECIRSLAFGE 187 YD + D + + + ++R I IADRG+ + ++ Sbjct: 151 YDVLNRTYIDALVQTASTYQEHRACIQMIERVTLDK-VILIADRGYENYNIMSHAIE-KG 208 Query: 188 ADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 +++R+ G GL+ + + I + Sbjct: 209 WKFLIRIKD------------VHSNGIASGLELPQTAVFDMDINLILTRNQTKSKKQAGY 256 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 + + R+ + + + + ++T+L +SAE++ + Y L Sbjct: 257 KFMPTVQTFDYLPIGSKEDYPISFRIARFKIAD-DSYETVITNLDRFCFSAEKLKELYHL 315 Query: 308 RWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSE 367 RW IE +F+ LK + L + AK+ + K IFA L + I ++ + Sbjct: 316 RWGIETSFRELKYAIGLTSFHAKKVDYIKQEIFARLALYNYCELITTYVVEHTENISKKN 375 Query: 368 KKN 370 + N Sbjct: 376 QVN 378 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 193 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 64/343 (18%), Positives = 115/343 (33%), Gaps = 40/343 (11%) Query: 37 IRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTL 96 + D T +++ L G + + + S + R + + F L Sbjct: 1 MLDFETTMKIVLCTGASPIKDELLKFNDFSITTPSASAF-VQARSKIKPEAFRTLFDG-F 58 Query: 97 AVRAAVTGCTSGKRLRLVDGTAISAPGG----------------GSAEWRLHMGYDPHTC 140 + G RL +DG+ + + + L+ YD Sbjct: 59 NKKTFKKKLYHGYRLLAIDGSELPIDNTIFDDETTVLRHGTLAKTFSAYHLNASYDLMER 118 Query: 141 QFTDFELTDSRD-------AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVR 193 + D + + +DR+ I IADRG+ S + Y++R Sbjct: 119 TYDDIIIQGEAKRDEHGAFCQLVDRY-DGQKAIFIADRGYESYN-GFEHVVHSGHKYLIR 176 Query: 194 VHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPP 253 V + L +GE V + K A P Sbjct: 177 VRDIE-----------SQSSITKSLGPFPDGEFDVDVSRMLTLKQTKMIKACPDVYKFVP 225 Query: 254 EKALISK-TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 + + RVV+ + E + ++T+L +E+S E + + Y +RW E Sbjct: 226 KNMRFDFMNKQNPWYEFNCRVVRLKITE-NTYETVITNLSRNEFSMEDICEIYNMRWGEE 284 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 +F+ LK + L+AL AK+ EL + I+A +L I+Q Sbjct: 285 TSFRELKYAIGLNALHAKKRELIQQEIYARMLMYNFCQRIVQE 327 >UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TN04_ALKMQ Length = 454 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 73/392 (18%), Positives = 143/392 (36%), Gaps = 40/392 (10%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGP-GGMSLRE 59 M S + + ++ A G L R++ + L SL + Sbjct: 1 MKISSSLIIQFI-RLFDNNKIMEIAIGTGLLKRQKGMLPDTILKVFTFGLLNIANPSLNQ 59 Query: 60 VTAWAQL-HDVATLSDVALLKRLRNAADWFGILAAQTLAVR-------AAVTGCTSGKRL 111 + + Q T+S A+ KRL+ ++ + + K + Sbjct: 60 IASKCQAFQPGLTISKEAVYKRLKKSSLFLQETFKHMMQKSMNSVIPVKTAAILEQFKDV 119 Query: 112 RLVDGTAISAPGG------------GSAEWRLHMGYDPHTCQFTDFELT--DSRDAERLD 157 ++ D T I+ P + ++ Y +F+ E+T D D Sbjct: 120 KICDSTKITLPDKLVALYPGLGGRNAKSSLKVQGIYSLIPARFSSLEITKAPGADTTYND 179 Query: 158 RFA--QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 + E+ I D G+ S+ L+ + Y+ R+ + ++ G + L Sbjct: 180 KLLAMVNPGELLITDLGYFSKA-FFEKLSTKGSYYLTRIKKNSIVYVEKSGQLTKVD--L 236 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 L G +T V +G + K+ R +A+ LP + + + + + +G+ + Sbjct: 237 TDLLKGTVVDTEVFLGIAHKKQ----LKCRFVAIRLPEKVVNQRRRKANQQAKAQGKQLS 292 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 A+ E +++T++ +D+ S E D YR RWQIEL FK LKS L++D + + Sbjct: 293 AKETELLAWNIIVTNVTKDKLSPEAACDLYRARWQIELVFKSLKSYLNIDKIGSCGKYQL 352 Query: 336 KAWIFANLLA-------AFLIDDIIQPSLDFP 360 + I+ L+A ++ Sbjct: 353 ECLIYGRLIAVVAMFSLYNVLYIPANQHFTRS 384 >UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AZS8_HERA2 Length = 442 Score = 186 bits (473), Expect = 7e-46, Method: Composition-based stats. Identities = 88/357 (24%), Positives = 140/357 (39%), Gaps = 39/357 (10%) Query: 29 GALTRR-REIRDAATLLRLGL--AYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAA 85 G + R R +TL++ + SL ++ A AT+S A+ +R A Sbjct: 27 GYVKRADRATFTPSTLVQTLVYGWLANPTASLGQLAQMAARVG-ATVSPQAIDRRFTLAT 85 Query: 86 DWFGILAAQTLAVR--------AAVTGCTSGKRLRLVDGTAISAPGG------------- 124 L LA AV+ +R+ D T I P Sbjct: 86 VDL--LHHVLLASMEYAISADPVAVSILQRFTSVRIHDSTTIGLPDALATTYRGCGNASA 143 Query: 125 -GSAEWRLHMGYDPHTCQFTDFELTDSRDAE---RLDRFAQTADEIRIADRGFGSRPECI 180 G+A + + D T +LTD R ++ + R A +R+AD GF + Sbjct: 144 RGTAGLKCGVQLDLLTGTLCGIDLTDGRASDQVLSVQRAPLPAGSLRLADLGFYN-IRIF 202 Query: 181 RSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGA 240 R LA E ++ RV L + + ++ + GL + E TV++G+ Sbjct: 203 RELAAAEVYWLSRVQSHSRIRLPGQKEQ-SILEVVTGLGDADHWEGTVLVGSKER----- 256 Query: 241 PFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 ARL+ +P A + R+ E K R V ++ A +++T+ PED+ + Sbjct: 257 -LAARLLVQRVPDAVAAQRRQRVQDEAHDKCRPVSNAAMDLAAWTVVITNAPEDKLGLTE 315 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 ++RWQIEL FK KS H+D R K+P I+A LL I+ S Sbjct: 316 AMVLLKMRWQIELLFKLWKSHGHVDEWRTKKPARILCEIYAKLLGLVFQQWILVASA 372 >UniRef50_D1N0Z4 Transposase IS4 family protein n=3 Tax=Bacteria RepID=D1N0Z4_9BACT Length = 384 Score = 183 bits (465), Expect = 7e-45, Method: Composition-based stats. Identities = 60/379 (15%), Positives = 106/379 (27%), Gaps = 67/379 (17%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVT 61 N S + +L I K E + R++ A + + SLRE+ Sbjct: 3 NTSVSLFRQVLDLIPKREF-EEIVMKHNGDKRKQSFDSWAHFVSMIFCQLAQANSLREIC 61 Query: 62 AWAQLHD----------VATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG--- 108 + T S+++ + FG + L A+ Sbjct: 62 GGLKTCGGKLNHLGVESAPTKSNLSYAN-AHRSPKMFGDIFHMLLGHCHAIAPRHEFSFP 120 Query: 109 KRLRLVDGTAISAPGGGSAE---------WRLHMGYDPHTCQFTDFELTDSRDAE--RLD 157 K+L +D T I +L+M D + T+ E Sbjct: 121 KKLYSLDATLIELCVKVFPWATYRQTKGAIKLNMLLDHDGHLPVFVDFTNGDVHEVNSAR 180 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 R D + + DRG+ + D++ R+ + Sbjct: 181 RMELPRDSMVVCDRGYVDF-SMLYKWNLSGVDFVTRLKTNATYDIPE------------- 226 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 + G +I + + + R V Sbjct: 227 --------------YDVKQYPGTVLSDEVIFLR-----------GSQDKYPERLRKVVVC 261 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 +E + LLT+ E A+ + D Y+ RWQIE FK LK + + Sbjct: 262 DVENHRTLTLLTN--NFELDAQTIGDIYKARWQIESFFKMLKQNFKIKTFIGTSENAVRI 319 Query: 338 WIFANLLAAFLIDDIIQPS 356 ++ L+A L + S Sbjct: 320 QVWTALIAILLTKYLKFLS 338 >UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1E5_CLOB8 Length = 460 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 64/383 (16%), Positives = 143/383 (37%), Gaps = 37/383 (9%) Query: 4 SHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLA-YGPGGMSLREVTA 62 S D I+ + + +A G R ++ Y SLR + Sbjct: 10 SMDKIKKIIN-LFSKRLITKTAVTTGFTQRNSKLDGFTFFKAFTFGVYSLENPSLRNIAN 68 Query: 63 WAQLHD-VATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTS-------GKRLRLV 114 + + + +S A+ +L+ +++ + + + + + +++ Sbjct: 69 FCEDINPNLKVSRQAIENKLKAGSNFLKTILTNIIEDKIIKSIKHNHIEIFKAFNDIKIC 128 Query: 115 DGTAISAPGG------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAE--RLDRFA 160 D + I ++E ++ Y + Q FE D + + A Sbjct: 129 DSSLIKLNDSLRDSYKGFSEDKSASEMKIQTVYSFKSKQIETFEFEDGTTNDNSYMKTLA 188 Query: 161 QT--ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGL----RWLTAEGMRFDMMGF 214 +EI + D G+ + +C + L A ++ ++ + + + +M+ F Sbjct: 189 DKINTNEILLVDLGYFDK-KCFKMLEKKSAFFLSKIKYNTALYKENYKKGNFEKVEMIDF 247 Query: 215 LRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV 274 L+ +T + +G N + R+I LP E + R + + +GR Sbjct: 248 LK--KSSGVIDTYLYVGMKQNNREE----FRVIGKRLPEEIVNLRIRRAREKAKAQGRAP 301 Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 + E V+++T++ +++ + + D YRLRWQIEL FK KS +D +++ + Sbjct: 302 KKIDKELMSWVIMITNIEKEQADVDMLLDIYRLRWQIELLFKCWKSYGKIDHVKSAGIDY 361 Query: 335 AKAWIFANLLAAFLIDDIIQPSL 357 ++ L+ LI+ + Sbjct: 362 LNCLLYGRLIITLLINTVYSELY 384 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. Identities = 67/370 (18%), Positives = 113/370 (30%), Gaps = 47/370 (12%) Query: 14 HIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ-------L 66 I + R +R R++ ++R L M +++ Sbjct: 17 KILCRDFSSHVKRPGKDFSRNRKLP-FEEVIRFLLPLQGQCMDQELFRHFSKKPLFFSTD 75 Query: 67 HDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP---- 122 + S + R + + L G +L +DG+ S P Sbjct: 76 YSGIPHSSAMIQARQKLSDSAMPALFHS-FTETCKKGALFQGYQLLAIDGSQFSVPENLK 134 Query: 123 -----------GGGSAEWRLHMGYDPHTCQFTDFELTD-------SRDAERLDRFAQTAD 164 G L+ Y + F D A+ +DR + Sbjct: 135 EPLCWRKIPNISKGRNVIHLNAMYHLQSGIFEDVVFQPICECNEHKALAQMVDRRSSAFP 194 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 I +ADRG+ S + Y+VR G G GL+ Sbjct: 195 AIFMADRGYESYNT-FAHIEQKGDKYVVRGRESG-------------TGICSGLNLPDTE 240 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKT-RLLSENRRKGRVVQAETLEAAG 283 E + KK A E R R+V+ + E Sbjct: 241 EYDIEKELYICKKHSKKVKTNPRKYKRIRSDATFDFFTDDCEEYRLNLRIVKIKLSETTT 300 Query: 284 HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANL 343 VL T+L ++++SA+ + Y +RW IE AF +LK L ++ +K EL ++ L Sbjct: 301 EVLF-TNLSKEKFSADDLKRLYHMRWGIETAFDQLKYALGAASVHSKNSELIIQELYGKL 359 Query: 344 LAAFLIDDII 353 + I+ Sbjct: 360 IMFNFCKTIV 369 >UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobium RepID=B5ZZ25_RHILW Length = 381 Score = 180 bits (456), Expect = 9e-44, Method: Composition-based stats. Identities = 63/382 (16%), Positives = 114/382 (29%), Gaps = 60/382 (15%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M + + + +L I + A R + + L+ L G +SLRE+ Sbjct: 1 MRHDNSVFHDVLKRIPWAVF-ERLVDEHQADKHVRRLSTKSQLIALLYGQLAGAVSLREI 59 Query: 61 TAWAQLHDV---------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRL 111 + H + S A LR + F L AQ +A G+ + Sbjct: 60 VGSLESHSARLYHLGARPVSRSTFADANGLRPS-TVFAELFAQMVARAGRGLKRAIGEAV 118 Query: 112 RLVDGTAISAPGGG---------SAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFA 160 L+DG+++S G G + ++H+ YD + + +T + + Sbjct: 119 YLIDGSSLSLAGAGSQWARFSDQACGAKMHVVYDANAERPIYAAVTPANVNDITAAKEMP 178 Query: 161 QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDC 220 A + D G+ L + R+ ++AE D Sbjct: 179 IEAGATYVFDLGYYDF-GWWAKLNAAGCRIVSRLKSHTKLTVSAEQA--------ANADA 229 Query: 221 GKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLE 280 G + ++ K P R R + Sbjct: 230 GILFDRIGLLPQRQAKSRRNPMN-------------------------RPVREIGVRIET 264 Query: 281 AAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIF 340 + L AE++A Y+ RW IEL F+ +K L + + + Sbjct: 265 GKVLRIFSNDLTA---PAEEIAALYKRRWAIELFFRWVKQTLKIRHFLGNSENAVRIQVA 321 Query: 341 ANLLAAFLIDDIIQ-PSLDFPP 361 L+A L+ + P Sbjct: 322 VALIAYLLLQMAKADQATVTSP 343 >UniRef50_Q0F098 ISGsu1, transposase n=6 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F098_9PROT Length = 383 Score = 180 bits (455), Expect = 1e-43, Method: Composition-based stats. Identities = 61/374 (16%), Positives = 116/374 (31%), Gaps = 69/374 (18%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M +++ + +L + + R R R + + A SLR+V Sbjct: 1 MKHANTVFHQLL-RVIPRHRFEEVVRRYDGDRRIRSLSCWTQFCVMLYAQLCSRQSLRDV 59 Query: 61 TAWAQLHDV------------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG 108 + + H +TL+D + + + F L Q G Sbjct: 60 VSAWESHASRHYHLGAGSVRRSTLADANVKRSAGMYLELFYWLLHQF-----RGKGIHRK 114 Query: 109 KRLRLVDGTAISAPG---------GGSAEWRLHMGYDPHTCQFTDFELT--DSRDAERLD 157 +RL+D T I G + ++H YDP T F +T D + + Sbjct: 115 DAVRLIDSTTIDLCKHQFEWASFRTGKSGVKVHTVYDPDAQVPTFFSITAAKKHDKKAAE 174 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 + DR + L + ++ R+ F+++ L Sbjct: 175 HMPLLPGATYVFDRAYNDYA-WFHDLTQRDIRFVSRMKRNA---------EFEVVATLPV 224 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 D G + + + ++ +K R + Sbjct: 225 SDDGVLEDQHIRLSSAKGRKECPTI----------------------------LRRICFV 256 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 E ++ +T+ + + SA +A Y+ RWQIEL F+ +K L + K Sbjct: 257 HEEDGKKLVFITN--DLKRSAGAIAALYKQRWQIELFFRWIKQNLKIKRFIGTSENAVKI 314 Query: 338 WIFANLLAAFLIDD 351 I ++A L+ Sbjct: 315 QIIIAMIAYLLLHM 328 >UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipelotrichaceae RepID=B7CEB8_9FIRM Length = 431 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. Identities = 56/363 (15%), Positives = 123/363 (33%), Gaps = 39/363 (10%) Query: 30 ALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFG 89 TR+R++ TL+ + ++ + + + T S + +R + F Sbjct: 31 DFTRKRKLP-VETLIHFIIQMQSKSLNSELCEYFNDIDFLPTASALC-QQRDKLDISAFQ 88 Query: 90 ILAAQTLAVRAAVTGCTSGKRLRLVDGTAISA--------------PGGGSAEWRLHMGY 135 + G + DG+ ++ +++ ++ Y Sbjct: 89 RIMH-LFVNAFDDYKTWKGYHVLACDGSDVNIAYDEKDEDTKRQNGNNKPFSQFHINGLY 147 Query: 136 DPHTCQFTDFELTDSRDA-------ERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEA 188 D F D + + E + + + I ADRG+ + + Sbjct: 148 DCINHVFWDTSIDTANKTRECAALMEMIMKHDYPENSIITADRGYEKYNLIACCIENNQK 207 Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIA 248 ++ R+ D+ G + + E + + +K A Sbjct: 208 -FVFRIKD------------IDVFGSILSNLNLPDEEFDLDVTKILTRKQTNETKANKHK 254 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 + K+ + + RVV+ + + + L+T+L DE+ ++ Y +R Sbjct: 255 YTFISNKSEFNYFGTKEFYKMNLRVVRFKITD-DTYECLVTNLTRDEFDLNELKKMYHMR 313 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL-DFPPRSAGSE 367 W IE AFK LK ++ + A +K+ + I+A +L L + I + + + + Sbjct: 314 WDIETAFKVLKYIIGMMAFHSKKRNFIQQEIYAAILLHCLTNIITERIEVEQSDKRKHTY 373 Query: 368 KKN 370 K N Sbjct: 374 KVN 376 >UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales RepID=A2RJ55_LACLM Length = 439 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 63/357 (17%), Positives = 120/357 (33%), Gaps = 38/357 (10%) Query: 22 DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRL 81 + A++ +R R++ T +++ L++G +S ++ + T S + + R Sbjct: 33 EIYAQSPFDFSRNRKL-SFETTIKIILSFGGQSLSSELLSHFNFTLKTPTASAL-VQARS 90 Query: 82 RNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEW------------ 129 + F L +T+ G R+ DG+ ++ P Sbjct: 91 KIKLKAFEQLFYRTIPSAQP-NKLYKGYRIFAHDGSDLNIPYNEKESDTHYRVGKFGKHV 149 Query: 130 ---RLHMGYDPHTCQFTDFELTDSRD-------AERLDRFAQTADEIRIADRGFGSRPEC 179 L+ YDP + + + + +D F T+ I IADRG+ S Sbjct: 150 GSLHLNALYDPLNKHYVAVDFQKIKQLNERKSLCQIVDDFDFTSPTIIIADRGYESFNVY 209 Query: 180 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 239 + +++R L F + + T+ + K Sbjct: 210 -EHIKKSGQKFLIRAKDTKSNGLLNGLDLPSDGTF--------DKKITLQLTRRQTNKVK 260 Query: 240 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 + + I RVV+ + E + L+T+L +++E Sbjct: 261 KDKHYHFLHKRANFDYLPIRSK---ETYPISLRVVRIKLNE-DTYESLVTNLDPFLFTSE 316 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 + Y LRW IE +F+ LK L L +K+ + IFA L+ I Sbjct: 317 DLKVLYHLRWGIETSFRELKYALGLSHFHSKKLDFIIQEIFARLIMYNFSMTITLAV 373 >UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=Q73IB8_WOLPM Length = 442 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 74/385 (19%), Positives = 140/385 (36%), Gaps = 45/385 (11%) Query: 3 YSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLA-YGPGGMSLREVT 61 + S L E+ D + + R+R+++ ++ + + L G S+ + Sbjct: 2 NKITSLSKKLKEFFN-EKADKISITTRFIKRKRKLKGSSFVKAMVLGNIGVDNCSVETMC 60 Query: 62 AWAQLHDVATLSDVALLKRLRNAADWFG--------ILAAQTLAVRAAVTGCTSGKRLRL 113 D ++ L R A F +L L ++L Sbjct: 61 QLLN-EDSIDITKQGLDFRFTEEAVEFMKRMYNESVLLFKNILQ--VDCKILQQFNSVKL 117 Query: 114 VDGTAISAPGG------------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAER 155 +D + I+ P + +L + +D LT+ +++ Sbjct: 118 LDSSYITLPNSMEEMYKGYGTSYSGYESNTKSGIKLQLVFDYMNQIIDQLNLTEGVRSDQ 177 Query: 156 LDRFAQ---TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMM 212 R ++++ I+D G+ P + + A +I R + + M Sbjct: 178 GYRKHLSNILSNDLLISDLGYF-VPSSFKQINEIGAYFISRYKSDTNIY---DVETNQKM 233 Query: 213 GFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR 272 L L+ E V++G A R+I L E+++ + + R +G Sbjct: 234 ELLECLEDKLFLENEVLLG------KEAKIRVRIICQKLTEEQSMARRRKANRLARSQGY 287 Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 + + +T++PE++ SAEQV YR+RWQIEL FK KS + LD L+ K P Sbjct: 288 TSSKRNQKLLNWSIFITNVPENKISAEQVLTIYRVRWQIELLFKLYKSHIRLDKLKGK-P 346 Query: 333 ELAKAWIFANLLAAFLIDDIIQPSL 357 ++A L A + I+ + Sbjct: 347 CRVLCELYAKLCAILIFHGIVGCTE 371 >UniRef50_B8FDX7 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDX7_DESAA Length = 395 Score = 177 bits (449), Expect = 4e-43, Method: Composition-based stats. Identities = 55/371 (14%), Positives = 101/371 (27%), Gaps = 64/371 (17%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA--------- 62 L + + G+ + + + SLRE+ + Sbjct: 11 LTGLFNRNQFYALVLRHGSEKHAKGFSSWDHFVAMLFCQIAQAKSLREICSGMACCLGKL 70 Query: 63 -WAQLHDVATLSDVAL--LKR-LRNAADWFGILAAQTLAVRAAVTGCTSGK-RLRLVDGT 117 + S ++ KR + D F + + +L +D + Sbjct: 71 RHLGVKGAPKRSTLSYANQKRTWKLFQDVFYDTLHLCRQAPSPGKTKFRFRNKLMSLDSS 130 Query: 118 AISAPGGGSAE---------WRLHMGYDPHTCQFTDFELTDSRDAE--RLDRFAQTADEI 166 IS +LH+ D +TD + + + A + I Sbjct: 131 TISLCLSLFPWAEYRQTKGAVKLHLLLDHDGYLPVFACITDGKTHDVTMARQLALSKGSI 190 Query: 167 RIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGET 226 + DRG+ + E ++ R+ + A+ L Sbjct: 191 VVMDRGYNDY-KLYAEWVEDEVYFVTRLKDNAAFMVLADFPVPKNRNIL----------- 238 Query: 227 TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVL 286 + L + R V E + Sbjct: 239 -------------------------VDQTILFTGAVAAKNCPYALRRVVVWDKEQERKIE 273 Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 LLT+ ++ A +A Y+ RW+IEL FK LK L + + I+ L+A Sbjct: 274 LLTN--HLDFGATTIAAIYKDRWEIELFFKALKQNLKVKTFVGTSENALQIQIWTALIAM 331 Query: 347 FLIDDIIQPSL 357 LI + S Sbjct: 332 LLIKFLQFRSR 342 >UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC 25996 RepID=UPI0001BC4BB6 Length = 403 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. Identities = 64/385 (16%), Positives = 117/385 (30%), Gaps = 55/385 (14%) Query: 3 YSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA 62 YS + I+ I + A + + L+ + A+ SLR + Sbjct: 2 YSISRFQQIIKPIMHGRF-QKHVQQHQADKYSKGFNCHSLLISMVYAHLTHCNSLRTLEQ 60 Query: 63 WAQLHD------------VATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKR 110 H + AL KR F + + +A + + Sbjct: 61 SFNAHSHHHYHLNLCRRIRHSTLSEALAKRDTR---PFTDMLRELMATCSRTLRKHTQDT 117 Query: 111 ---LRLVDGTAISAPGGGSAEW----------RLHMGYDPHTCQFTDFELTDSRDAERLD 157 L L+D T I G G +W ++H+ + T +T++ + Sbjct: 118 ADLLYLLDSTPIILKGRGFNQWVSSNGRISGLKVHVLMNHANGCPTVQSITEASVNDIDQ 177 Query: 158 RFAQTA--DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 R + D+G+ L A ++ R+ + + + Sbjct: 178 RHIVQPEKGATYVFDKGYCDYN-WWAELDRAGAYFVTRLKANAAVEVIEQFSPSE-TQNA 235 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 N T ++ K + TR + + R + Sbjct: 236 HENSRNDNKNTPILTDEYIRFKHKSN------------------STRPNHYHNKTLRRIT 277 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 E VL+ +L SA+++A+ Y+ RWQIEL FK LK L L + Sbjct: 278 VEREGTEALVLVSNNLTA---SAQEIAENYKRRWQIELLFKWLKQHLKLKRFLGRSANAV 334 Query: 336 KAWIFANLLAAFLIDDIIQPSLDFP 360 K + ++A L+ + Q Sbjct: 335 KLQLLCAMMAYLLLK-LYQQCTTHS 358 >UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_METBF Length = 435 Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats. Identities = 66/378 (17%), Positives = 125/378 (33%), Gaps = 39/378 (10%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPG-GMSLREVTAWAQLHDVA 70 L + E L +A+ G + R R+I L L++G +L + + Sbjct: 13 LREMFPEEWLRQTAKETGLIVRERKIDPVIIFWVLTLSFGVRLQRTLASLKREYETESQK 72 Query: 71 TLSDVALLKRLR-NAADWFGILAAQTLAVRAAVTGCT------SGKRLRLVDGTAISAPG 123 T+SD + R ++ + A G + + + + D T + Sbjct: 73 TISDSSWYYRFTPELVEFLHQCVIHGMEELAKEPGRKLSKKLETFQDVVIQDSTIVRLHS 132 Query: 124 GGS------------AEWRLHMGYDPHTCQFTDFELTDSRDAE--RLDRFAQTADEIRIA 169 + A ++ + L + AE L D I + Sbjct: 133 SLADRFPAARSRTVAAGVKVGVMVSAIANGPRTIALYSEKTAEIKTLKIGPWIKDHILLV 192 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 D GF + + + ++ R+ L + L + V Sbjct: 193 DLGFY-KTQMFARVEENGGYFVSRIRKNMDPIL------VSIEEELSKTKSKEFAGKPV- 244 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + +G A + E ++ R+V E + + +T Sbjct: 245 -SECIKQLSGKDIDAVVKI-----EFKRREYKGKQKQDEMIVRLVAVYNDEDEKYHIYIT 298 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 ++ +D +A+ +A+ Y RW IEL FK LKS LD L K ++ +A I+ +L + Sbjct: 299 NIQKDILNAKDIANLYGARWDIELLFKELKSKYSLDVLETKNVQVIEALIWTAILTLIVS 358 Query: 350 DDI---IQPSLDFPPRSA 364 I ++ S P + A Sbjct: 359 RRIYSLVRKSTTHPEKMA 376 >UniRef50_B3PC11 ISCja2, transposase n=5 Tax=Proteobacteria RepID=B3PC11_CELJU Length = 383 Score = 176 bits (447), Expect = 9e-43, Method: Composition-based stats. Identities = 53/371 (14%), Positives = 116/371 (31%), Gaps = 62/371 (16%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M++S+ + +L + E + A+ + R + + ++ G SLR++ Sbjct: 1 MSHSNTAFHQLLKPL-SRHEFEAEAKKHHVGQKLRSATRWDQFVGMAMSQLSGRQSLRDI 59 Query: 61 TAWAQLHD------VATLSDVALLKRLRN--AADWFGILAAQTLAVRAAVTGCTSG---K 109 + + A + L R+ A+ + + A+ L ++ G Sbjct: 60 QSNLEAQQHKLYHLGAKPIARSTLARINEVQPAELYKHVFARLLHRCKSMQGKHKFQFKN 119 Query: 110 RLRLVDGTAISAPGGGSAE---------WRLHMGYDPHTCQFTDFELTDSRDAERLD--R 158 L +D +AI +L +G + T L+D ++ + ++ + Sbjct: 120 PLYSLDASAIDLSLSVFPWAAHRDDTANVKLSVGLNHGTQVPEFVALSDGQENDMIEGRK 179 Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 F I D+G+ + L ++ R+ + + + G + Sbjct: 180 FDFPKGSIVAFDKGYVDY-RWFKLLTDKGVFFVTRLRAKAVYRVEERRYADSSKGII--- 235 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 + +S + K R + Sbjct: 236 ---------------------------------SDQVIQLSSAHAIKRGAPKLRRIGYRD 262 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 LT+ + +A +A Y+ RWQ+EL FK +K L + A Sbjct: 263 ATTGKFYEFLTN--NFQLAAATIAAIYKDRWQVELFFKAIKQNLKIKAFVGTSRNAVLTQ 320 Query: 339 IFANLLAAFLI 349 I+ ++ L+ Sbjct: 321 IWIAMITYLLL 331 >UniRef50_B3E6V4 Transposase IS4 family protein n=8 Tax=Proteobacteria RepID=B3E6V4_GEOLS Length = 372 Score = 176 bits (447), Expect = 1e-42, Method: Composition-based stats. Identities = 58/378 (15%), Positives = 108/378 (28%), Gaps = 69/378 (18%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M +S+ + ++ K E +T AR + R + + + G SLR++ Sbjct: 1 MPHSNTVLNQVV-RFFKRHEFETLARKHHVGQQFRSFSRWSQFTAMLVGQLTGRKSLRDL 59 Query: 61 TAWAQLHD------VATLSDVALLKRLRNAAD--WFGILAAQTL---AVRAAVTGCTSGK 109 ++ + L R+ + L + L A Sbjct: 60 VDNLKVQGHKLYHLGTRDVPRSTLARVNEEQPHQLYKELFHKLLGRCQAIAPKNRFKLDA 119 Query: 110 RLRLVDGTAISAPGGGSAE---------WRLHMGYDPHTCQFTDFELTDSRDAER--LDR 158 +L L+D T I+ +LH+G F++T ++ E Sbjct: 120 KLYLLDATVINLCLKVFPWASYQKAKGAIKLHVGLSADGYLPEFFDVTTGKEHEINWARL 179 Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 + DRG+ ++L ++ R+ L + G L Sbjct: 180 LKLPTGSFVVFDRGYTDYD-WYQALMDSSIFFVARLKDNALVEYFKKRPGRRSQGVLTDQ 238 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 + NG + R+V Sbjct: 239 EISLNG------------------------------------------IKGSLRLVHFVA 256 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 + + + + A VA+ Y+ RWQIEL FK +K L + A Sbjct: 257 EDGNEYRFVTN---ANHIPAALVAELYKERWQIELFFKWIKQNLKIKAFYGTSENAVLTQ 313 Query: 339 IFANLLAAFLIDDIIQPS 356 I+ L ++ + S Sbjct: 314 IWIALCVYLVLAWLKFMS 331 >UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TD95_HELMI Length = 441 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 62/377 (16%), Positives = 130/377 (34%), Gaps = 37/377 (9%) Query: 10 AILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LREVTAWAQLHD 68 L+ + E + A G + R R+I L L L +G G L ++ Sbjct: 13 EALSKLFPKEWVSEVAAETGFVKRERKISPVVFLWALVLGFGVGVQRTLGDLRRSYMEQA 72 Query: 69 VATLSDVALLKRLR-NAADWFGILAAQTLAVRAAVTG------CTSGKRLRLVDGTAISA 121 ++ A R ++ + + G + ++D + + Sbjct: 73 GHSVVPSAFYDRFTPELVEFLKRCVEKAIGHLVVEPGQVMSERLKDILDIAVIDSSLVRL 132 Query: 122 PGGGSAEW------------RLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIR 167 + +W +++M + ++ + E L + D I Sbjct: 133 HDQLAKKWPGPRTNHSPAAAKVNMLVSVFGATRSQVQIVEGTRGESKLLSIGSWVKDRIL 192 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 + D G+ S + + + ++ R+ + + ++ E Sbjct: 193 LFDLGYFSF-KHFGKIMNEKGYFVSRLKSNSNPLI--------LRSLIQHRGRTIAVEGK 243 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 ++ G+ + +I + + S L+ + RVV E + Sbjct: 244 RLLDIKGSLRR------EIIDFEVLVSNSQSSNMDLVKRTALQLRVVGILNEETKDYHFY 297 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 +T+LP + + AE +A YR RW IEL FK LKS HL+++ + + + +A ++ LL Sbjct: 298 ITNLPAERFPAEDIATLYRARWTIELLFKELKSYYHLESISSGKDCIVEALLYTALLTLI 357 Query: 348 LIDDIIQPSLDFPPRSA 364 + I+ + P A Sbjct: 358 VSRRILGLLREQFPEHA 374 >UniRef50_Q877R2 Transposase n=51 Tax=Bacteroidales RepID=Q877R2_BACTN Length = 387 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 60/375 (16%), Positives = 113/375 (30%), Gaps = 64/375 (17%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M ++ + + + + + + LL L SLR++ Sbjct: 1 MFQDKYVFAQLASFL-NRSKFNRIVTKYDGDKYVKHFTCWNQLLALMFGQLSNRESLRDL 59 Query: 61 TAWAQLHD----------VATLSDVAL--LKRLRNAADWFGILAAQTLAVRAAVTGCTSG 108 + H + S +A R + + + + A G Sbjct: 60 IVALEAHHSKCYHLGMGKNVSKSSLARANQDRDYHIFEEYAYYLVSEARQKCANHIFKLG 119 Query: 109 KRLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDS--RDAERLD 157 + D T I ++H YD T F +T++ D++ + Sbjct: 120 GNVYAFDSTTIDLCLSVFWWAKFRKKKGGIKVHTLYDVETQIPAFFHITEASVHDSKVMI 179 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 I DRG+ + + + + EA ++VR + + R Sbjct: 180 EIPYEPSSYYIFDRGYNNF-KMLYKIHQIEAYFVVRAKKNLQY---------KSIQWKRR 229 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 L + +V++ K+ + R+V+ Sbjct: 230 LPKNVLSDASVLLTGFYPKQY----------------------------YPKPLRLVKYW 261 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 E +T+ SA QVA+ Y+ RWQ+EL FK LK L + + Sbjct: 262 DEEQEREFTFITN--AMHISALQVAELYKNRWQVELFFKWLKQHLKIKRFWGTTENAVRI 319 Query: 338 WIFANLLAAFLIDDI 352 I+A + A L+ I Sbjct: 320 QIYAAICAYCLVAII 334 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 176 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 79/373 (21%), Positives = 123/373 (32%), Gaps = 41/373 (10%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL----H 67 + +EL AR + R R+ A L + G S R + A+ + Sbjct: 16 IQRAFPSDELRERARATNLVERERKFDIVALFYTLSFGFAAG--SDRSLQAFLERYVEMA 73 Query: 68 DVATLSDVALLKRLRNAADWFGILA-----AQTLAVRAAVTGC-TSGKRLRLVDGTAISA 121 D LS A RA ++G + + + D T +S Sbjct: 74 DCDDLSYAAFHDWFEPGFVALLREILDDAIENLDTGRADLSGRLERFRDVLIADATIVSL 133 Query: 122 ----------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIRIA 169 G AE +LH+ T T F TD ER L AD + + Sbjct: 134 YQDAADVYAATGEDQAELKLHLIESLSTGLPTRFRTTDGTTHERSQLPTGEWVADALILL 193 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 D GF + ++ RV + E + RG GE+ Sbjct: 194 DLGFYDFW-LFDRIDQNGGWFVSRVKDNANFEIVEE------LRTWRGNSIPLEGES--- 243 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + + R+ + K + R R+V E + L LT Sbjct: 244 LQAVLDDLQRQEIDVRI-------TLSFERKRGSGASATRTFRLVGLRNEETEEYHLYLT 296 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 +L D+YSA +A YR RW++EL FK LKS LD + + + +A I ++ + Sbjct: 297 NLGNDDYSAPDIAQLYRARWEVELLFKELKSRFGLDEINTTDAYIIEALIIMAAISLMMS 356 Query: 350 DDIIQPSLDFPPR 362 I+ R Sbjct: 357 RVIVDELRSLEAR 369 >UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XCY0_9BACT Length = 481 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 69/373 (18%), Positives = 119/373 (31%), Gaps = 49/373 (13%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWA------Q 65 L + P + AGA +R R T G + REV Q Sbjct: 31 LEALFAPFIPEQLLSRAGANSRERFYTLRQTFWAFLWQALHPGTACREVVRQLLSDWQAQ 90 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGG 125 + A R R L A A G ++LVDGT S P Sbjct: 91 AGRTRAQAGTAAYCRARQRLP-LERLQAILQATLGPEPPRWRGHAVKLVDGTTFSLPDTA 149 Query: 126 SAEWR-----------------LHMGYDPHTCQFTDFELTDSRDAERLDRFAQTAD---- 164 + + + + + + ++ R E + Sbjct: 150 ANQKKFPQSGAQKPGCGFPTLKVVALFSLASGLALNWARGSLRVHEIPLFRKLWSGLRRR 209 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 ++ I DRGF S + L D + R+H + + L + Sbjct: 210 DLIIGDRGFSSYTN-LALLLGRGVDCLFRLHQGKKVRHPRR----SRLQRKQKLGPRQ-- 262 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 K P R + P + + + +V + Sbjct: 263 -----WLVQWKKPYQKPEYMRPKEWAAVPSEMQVRVFEV---------IVCTRGMRTRKL 308 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +L+ T L Y E++A+ Y RW+IEL+F+ LK+ L L+ LR + P + + ++ +L+ Sbjct: 309 MLVTTLLDPVRYPVEELAELYLRRWEIELSFRDLKTTLGLEVLRCQSPAMVEKEVWMHLI 368 Query: 345 AAFLIDDIIQPSL 357 A L+ ++ S Sbjct: 369 AFNLLRRVMLQSA 381 >UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1T817_9BURK Length = 448 Score = 173 bits (438), Expect = 9e-42, Method: Composition-based stats. Identities = 60/362 (16%), Positives = 111/362 (30%), Gaps = 27/362 (7%) Query: 7 NWSAILAHIGKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSLRE----VT 61 W + H+ E ++ + + +G A RRR + + + S+ E + Sbjct: 23 EWGRLGQHL-PYEWIEYAVQASGSASVRRRRLPAQQVVWLVIALALYRHQSISEVVDELD 81 Query: 62 AWAQLHDVATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAV---TGCTSGKRLRLVDGT 117 D + +S A+ R R A L ++ A A G L +DGT Sbjct: 82 LALPAADASFVSKSAIAQARQRIGAAPLAWLFHESAANWVAQDQAKHLFKGFSLFAMDGT 141 Query: 118 AISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRP 177 + + H + +L D + G Sbjct: 142 TLRTADSAANRRHFGASAAAHGRIGSYPQLRAVTLTALATHLV--RDAVF----GPYDIN 195 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKK 237 E I + I RV + + ++ L ++ Sbjct: 196 EMIWARE-----LIARVPANSITVFDKGFLSAQLLCNLVSGGENRHFIIPAKANTCWEVV 250 Query: 238 AGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYS 297 +G P + + + + P+ + R V A +LL + + Sbjct: 251 SGGPGD-QTVRMRVSPQA---RAKCPDLPEFWQARAVLALDARGRQRILLTSLTDRRRFK 306 Query: 298 AEQVADCYRLRWQIELAFKRLKSL-LHLD-ALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 A + CY RWQIE ++ LK L ++ LR++ E + L+A LI + Sbjct: 307 AVDIVSCYERRWQIETSYHELKQSMLGMELTLRSQTVEGVYQEFWGALIAYNLIRLEMAK 366 Query: 356 SL 357 + Sbjct: 367 AA 368 >UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A1U3_PELCD Length = 489 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. Identities = 63/387 (16%), Positives = 126/387 (32%), Gaps = 57/387 (14%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY----GPGGMSL 57 S + L +GA++RRR T G + Sbjct: 36 QKSFKQIGEVFEKFIPLALLKP--ELSGAMSRRRLFSKENTFWAFFSQVLDADGGCKEVI 93 Query: 58 REVTAWAQLHDV--ATLSDVAL-LKRLRNAADWFGILAAQTLAV--RAAVTGCTSGKRLR 112 R++ ++A + + + S + R + A + A T + TG + +R+ Sbjct: 94 RKLQSYASIKGIKVPSSSTASYCTARKKLAEPMLADILAHTAEQLEKMPATGMLNNRRVI 153 Query: 113 LVDGTAISAPGGGSAEW-----------------RLHMGYDPHTCQFTDFELTDSRDAE- 154 + DGT +S P + R+ + + + + + ++ E Sbjct: 154 VADGTGVSMPDTPENQAAWPQSSALKPGCGFPSARICACFSLDSGALLSYAIGNKKNNEL 213 Query: 155 ---RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDM 211 R +I + D+GF S + I L D +V + R + + Sbjct: 214 PLFRQQWETFNPGDIFLGDKGFCSYFD-IAKLQDRGVDSVVTLAKRAPVRAASSLKKLGP 272 Query: 212 MGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKG 271 L + K + ++ R I V +P Sbjct: 273 DDLLITWERPKYAQILSYSKDAWA-NLPKKLTLRQIKVKVPH------------------ 313 Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 G ++ T + Y AE +A+ Y RW +EL F+ +K+ + +D LR Sbjct: 314 -----PGFRTRGFYIVTTLIDAARYPAEDLAELYFKRWDVELFFRDIKTTMGMDVLRCLT 368 Query: 332 PELAKAWIFANLLAAFLIDDIIQPSLD 358 P++ + I + +A + +I + + Sbjct: 369 PDMIRKEILMHFIAYNCVRRLIYEAAE 395 >UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostridiales RepID=C7GFW6_9FIRM Length = 436 Score = 173 bits (437), Expect = 1e-41, Method: Composition-based stats. Identities = 63/352 (17%), Positives = 124/352 (35%), Gaps = 36/352 (10%) Query: 29 GALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWF 88 R+R++ ++ L ++ G ++ + + V T S +R + + F Sbjct: 31 KDFIRKRKLD-FKKMMHLIISMESGSLNHELLKFFEYDSSVPTGSAF-YQQRSKLSVSAF 88 Query: 89 GILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLH---------------- 132 L + ++ + L DG+ + H Sbjct: 89 RHLLKE-FNLKFPLEKFRGKYYLIACDGSEFNIARNLKDADTFHEPNGKSVSGFNMVHTI 147 Query: 133 MGYDPHTCQFTDFELTDSRD-------AERLDRFAQTADEIRIADRGFGSRPECIRSLAF 185 Y+ + ++ D E+ R +DR+A A I IADRGF S ++ Sbjct: 148 SLYEVCSKRYLDLEVQPGRLKNEFQAICNLMDRYAYGASPIFIADRGFSSYNVFAHAIEN 207 Query: 186 GEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPAR 245 D+++R ++ G D + L T +K R Sbjct: 208 N-VDFLIRAKDLNVQRFLGGGTLPDKLDTTIEL------ILTRTQSKKKHKHPEKESQYR 260 Query: 246 LIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCY 305 I ++ + + E K R+V+ E + ++T+L E++++ + + CY Sbjct: 261 YIGKNIAFDYLNP--ADISDEYLLKLRIVRVEVSD-GVFENIITTLSEEDFTPDDIKYCY 317 Query: 306 RLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 LRW IE +F+ LK + L +K+ E +++ L+ II Sbjct: 318 NLRWGIETSFRDLKHTIGATNLHSKKTEYVAFELWSKLILYNFCSIIILHVP 369 >UniRef50_D1K7L7 Transposase n=3 Tax=Bacteroidales RepID=D1K7L7_9BACE Length = 389 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 60/386 (15%), Positives = 111/386 (28%), Gaps = 65/386 (16%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 MN ++ + + D + R +L + S+R++ Sbjct: 1 MNQGKYIFAQLTDFL-PRRVFDRLVEKYSGNKKIRTFTCWNQMLCMIFGQLTARDSMRDL 59 Query: 61 TAWAQLHDV--------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGK--- 109 + H AT+S L K RN A TL A + Sbjct: 60 MLSLEAHKNKYYHLGFGATVSRTNLGKANRNRDYRIYEEFAYTLIAEARNNYNKNDFEVK 119 Query: 110 ---RLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAE--R 155 + D + I +LH YD T T +T+++ + Sbjct: 120 VDSNVYAFDSSTIDLCLNVFWWAEFRKHKGGIKLHTLYDVKTSIPTIVLVTNAKVHDVNM 179 Query: 156 LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 LD + I D+G+ + L A ++ R Sbjct: 180 LDELSYEKGSFYIMDKGYVDFTR-LHKLHTCGAYFVTRAKNNMRFR-------------- 224 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 + + K + ++ + L K R ++ Sbjct: 225 -------------RMYSCEVDKTTGIKC---------DQIGMLETYKSLKAYPNKLRRLK 262 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 E + +T+ E SAE++A Y+ RWQ+EL FK +K L + + Sbjct: 263 YYDEELDREFVFITN--NMELSAEEIALLYKNRWQVELFFKWIKQHLKVKSFWGTTMNAV 320 Query: 336 KAWIFANLLAAFLIDDIIQPSLDFPP 361 K ++ ++ L+ + P Sbjct: 321 KTQVYCAIITYCLVAIVAYKLKVNRP 346 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 171 bits (432), Expect = 4e-41, Method: Composition-based stats. Identities = 57/328 (17%), Positives = 102/328 (31%), Gaps = 38/328 (11%) Query: 50 YGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGK 109 G +S + AT S + +R + + +L + + + Sbjct: 1 MGGNSLSKELYDWLGYSSETATASAF-VQQRDKIRPEALKLLFHEFTRLTVSENSL-QDY 58 Query: 110 RLRLVDGTAISAPGGGS---------------AEWRLHMGYDPHTCQFTDFELTDSRDAE 154 RL VDG+ + P L YD + D + + Sbjct: 59 RLLAVDGSDLRLPSNSKDGFSSIRNSEDSKNYNLVHLDAMYDLMGKVYVDASVQSKKGMN 118 Query: 155 -------RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGM 207 +D+ + I I DRG+ S I YI+R Sbjct: 119 EHKALVSMVDQSEINGNVIAIMDRGYESFNN-IAHFQEKSWYYIIRAKESYGII-----S 172 Query: 208 RFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS--LPPEKALISKTRLLS 265 R + + + E + + K+ A P K + Sbjct: 173 RLSLPDY-----PEYDEEIMLTLTRRQTKETLPLLKAYPHRYRWIQPHTTFDFIKPKDSK 227 Query: 266 ENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD 325 R V+ + + + T+L +++ E++ Y LRW IE +FK LK + L Sbjct: 228 FYDLHFRAVRFAIAD-GVYETVYTNLNAEDFPPEKLKQLYNLRWGIETSFKELKYAVGLA 286 Query: 326 ALRAKEPELAKAWIFANLLAAFLIDDII 353 +L +K+ + IFA L+ I+ Sbjct: 287 SLHSKKKDFILQEIFARLILYNYSSIIM 314 >UniRef50_Q11ZL6 Transposase, IS4 family n=22 Tax=Bacteria RepID=Q11ZL6_POLSJ Length = 389 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 54/382 (14%), Positives = 108/382 (28%), Gaps = 67/382 (17%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 MN ++ ++ + + G R + A + A SLR++ Sbjct: 1 MNVGKTLFAQVMEFVPWTSF-SRIVQRHGGDAGVRRMNCAEQFRVMAFAQLTWRESLRDI 59 Query: 61 TAWAQLH-------------DVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTS 107 + +TL+D + R +D L + Sbjct: 60 EVTLGANAGKLYSMGLRHSVHRSTLADANDSRDWRIWSD-LAALLIRRARKLYREEDLGL 118 Query: 108 GKR--LRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDSR--DAE 154 + +D T I A ++H D ++D + D Sbjct: 119 DLTNTVYALDATTIDLCLSLFDWAPFRSTKAAVKMHTLLDLRGSIPAFIHISDGKMGDVN 178 Query: 155 RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGF 214 LD A + DRG+ + + A ++ R Sbjct: 179 VLDFLPVEAGAFYVMDRGYLDFAR-LYKMHQAGAFFVTRAKRGMNAR------------- 224 Query: 215 LRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV 274 + ++ +A + IA++ + + + R + Sbjct: 225 --------------RVYSAQTDRATGVICDQSIAMN---------GFYVCKDYPEQLRRI 261 Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 + + E ++ LT+ A +A Y+ RWQ+EL FK +K L + Sbjct: 262 RFKDPETGKTLVFLTN--NTTLPALTIAALYKSRWQVELFFKWIKQHLRIKKFLGTSENA 319 Query: 335 AKAWIFANLLAAFLIDDIIQPS 356 K I+ + LI + + Sbjct: 320 VKTQIWCAVCTYVLIAIVKKEL 341 >UniRef50_D1Q0M9 ISGsu1 transpoase n=7 Tax=Bacteroidales RepID=D1Q0M9_9BACT Length = 412 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 56/375 (14%), Positives = 100/375 (26%), Gaps = 63/375 (16%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 MN + S +++ I E + G R L + A SLR + Sbjct: 27 MNVGNTVLSQLMSLIPDYELRKCVDKYRGDFH-ARRFTCRDQFLVMSYAQFTSSASLRSI 85 Query: 61 TAWAQL------HDVATLSDVALLKRLRNAADW-FGILAAQTLAVRAAVTGCTSGKR--- 110 A H + + L + +W A RA + R Sbjct: 86 EAQLTAFNSKLYHAGLKIMPKSTLADMNEKKNWRIYQDYAMIFVDRAKALYKDNYYRLNI 145 Query: 111 ---LRLVDGTAISAPGGGSAE---------WRLHMGYDPHTCQFTDFELTDS--RDAERL 156 + D + I+ +++H D LT D++ + Sbjct: 146 DNMVYAFDSSTINLCLQLCPWAKFLHDKGAFKMHTLVDVKNSIPNFVLLTPGNVHDSQAM 205 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 D + D+G+ R L A ++ R + Sbjct: 206 DMLPIETGAYYLMDKGYVDFDRLFRILQQQHAYFVTRAKDNMKYNV-------------- 251 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 + + + E +S ++ R+V Sbjct: 252 ---------FETRVVDRQTGV-------------ISDETISLSGLLTAKKHPDVLRLVTY 289 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 E LT+ + A +A+ YR RW IE FK +K LH+ + Sbjct: 290 EDYAQNVVYRFLTN--DFILPAITIAELYRERWTIETFFKWIKQHLHIKSFYGTTQNAVF 347 Query: 337 AWIFANLLAAFLIDD 351 I+ + L+ Sbjct: 348 TQIWIAICDYLLLTI 362 >UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholderia RepID=B2JV26_BURP8 Length = 442 Score = 169 bits (427), Expect = 2e-40, Method: Composition-based stats. Identities = 58/357 (16%), Positives = 101/357 (28%), Gaps = 26/357 (7%) Query: 12 LAHIGKPEELDTSARNAGALT-RRREIRDAATLLRLGLAYGPGGMS----LREVTAWAQL 66 LA E ++ + + GA + RRR + + + S L + Sbjct: 21 LAEHLPYEWIERAVQATGAASIRRRRLPAEQVVWLVIALAMYRHWSISEVLDSLDLALPN 80 Query: 67 HDVATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAVTGCTS---GKRLRLVDGTAISAP 122 +S A+ R R L QT G L +DGT + P Sbjct: 81 EAAPFVSKSAVVQARQRIGEAPMAWLFEQTARAWTTQDAAHHAFKGLSLWAMDGTTLRTP 140 Query: 123 GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS 182 + A L +AD FG Sbjct: 141 DSAANREHFGAQ----GYASGKVASYPQVRAVTLTAIPTH----LVADINFGCYDTNEMV 192 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPF 242 A + ++ L + +++ L ++ AG Sbjct: 193 YAKS---LLPQIPDDSLTVFDKGFLAAEILCGLTMNGRNRHFLIPAKSNTCWEVIAGTAD 249 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 A ++ + + + K R ++A VLL + + + Sbjct: 250 DA-MVRMRVSQQA---RKKCPALPEFWNARAIRAIDARGRERVLLTSLGDRRRFKPADIV 305 Query: 303 DCYRLRWQIELAFKRLKSLLHLD--ALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 CY RW+IE ++ LK + LR++ E I+ L+A LI I + Sbjct: 306 ACYERRWRIETSYGELKQSMLGSELTLRSRTVEGVYQEIWGALIAYNLIRREIASAA 362 >UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacillus sp. SG-1 RepID=A6CHG0_9BACI Length = 381 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 66/377 (17%), Positives = 113/377 (29%), Gaps = 57/377 (15%) Query: 5 HDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWA 64 + S +L EE++ G R+ + ++ S R+ Sbjct: 4 FNKLSEVLQTFITDEEVENLCEKWGYRDTARKFSAKDLVRFFVISSAKDWKSFRDAETKI 63 Query: 65 QLHD-VATLSDVALLKRLRNAA-DWFGILAAQTLAVRAA--VTGCTSGKRLRLVDGTAIS 120 D + ++ L K+ +N L ++ + +L VD T I+ Sbjct: 64 PQEDSLPSVDHSTLAKKAQNVPYQILQELFSRLVNRLGRGMRRALFKPYKLFAVDSTTIT 123 Query: 121 APG---------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQ----TADEIR 167 RLH +D Q T T R + + I Sbjct: 124 FQHPDMSWAGYTRTRHAIRLHTKFDVEEGQPTQVIPTTGRHHDVMVAPKLYEDTEPLSII 183 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 ADRG+ +R L +++R+ + E + Sbjct: 184 TADRGY-ARTRDFEDLQEDNQFFVIRIAS--------------------SFSLSEEMEHS 222 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 V + GN K E + + + + RVV E + Sbjct: 223 VPLDEDGNVK----------------EDLTAFIGKNSRKTKNRFRVVTFTDNEGNRIKVA 266 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 + S E +A Y+LRWQIEL F+ +K L L L P ++ L++ F Sbjct: 267 TNLMM---MSPEDIAYIYKLRWQIELFFRWVKGNLDLSNLFGNSPNSVYIQVYGTLISYF 323 Query: 348 LIDDIIQPSLDFPPRSA 364 L+ I + D Sbjct: 324 LLRWIYNETKDEWDILH 340 >UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria RepID=B9BXQ1_9BURK Length = 446 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 59/370 (15%), Positives = 114/370 (30%), Gaps = 28/370 (7%) Query: 1 MNYSHDNWSAILAHI---GKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMS 56 + + D L+ + ++ + G A RRR + + + S Sbjct: 12 LTFMLDAEPTDLSRLAEHLPHAWIEQAIEATGTASIRRRRLPAEQVVWLVIALAIYRHWS 71 Query: 57 LREVTAWAQL---HDVATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAVTGCTS---GK 109 + EV +L ++ +S A+ R R L QT G G Sbjct: 72 VSEVVDSLELVLPNETTFVSKSAVTQARQRLGHAPIAWLFEQTAQAWCKQDGARHAFKGL 131 Query: 110 RLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIA 169 L +DGT + P + + A L +A Sbjct: 132 SLWAMDGTTLRTPDSAANREHFGSQ----SYASGKVASYPQMRAVTLTSIPTH----LVA 183 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 + FG A + ++ L + +++ L + ++ Sbjct: 184 NIAFGRYDTNEMIYAKN---LLAQIPDHSLTLFDKGFLAAEILCGLNSGERNRHFLIPAK 240 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 +G P A L+ + + P+ + R V+ + + VLL + Sbjct: 241 SNTRWEVLSGKPDDA-LVRMRVSPQA---RQKCPDLPEWWTARAVRIQDAQGRERVLLTS 296 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD--ALRAKEPELAKAWIFANLLAAF 347 + + CY RWQIE +++ LK + LR++ + I+ L+A Sbjct: 297 LTDRRRFKLADLVACYERRWQIEASYRELKQSMLGSELTLRSRTVDGIYQEIWGALIAYN 356 Query: 348 LIDDIIQPSL 357 LI + + Sbjct: 357 LIRREMACAA 366 >UniRef50_C4XGQ6 Putative transposase for insertion sequence element n=2 Tax=Desulfovibrio magneticus RS-1 RepID=C4XGQ6_DESMR Length = 376 Score = 167 bits (422), Expect = 6e-40, Method: Composition-based stats. Identities = 55/379 (14%), Positives = 106/379 (27%), Gaps = 68/379 (17%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M + ++ L+ + K ++N R + + L A G SLR+ Sbjct: 1 MQSAITVFAQFLSLVPKSVFFK-LSQNYRPERSPRTFSPWSHFVHLLHAQLAGCKSLRDG 59 Query: 61 TAWAQLHDV------------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG 108 +T +D + FG L + L+ A + Sbjct: 60 IMGMNAASNRLYHLGVKPVPRSTFADANAKRPYTMFEALFGELYTRCLSQ-APKKKFSFE 118 Query: 109 KRLRLVDGTAISAPGGGSAE---------WRLHMGYDPHTCQFTDFELTDSRDAE--RLD 157 +L +D + + ++H D +T+++ E Sbjct: 119 NKLFSLDASVVDLCLNLFPWAKFRTAKGGIKMHTVMDHDGYLPAVVTVTEAKCHEVNIAK 178 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 I + DRG+ R L + R+ + Sbjct: 179 LLKLPKGSIVVFDRGYNDYT-WFRHLCKSGVFLVTRLKSNARFRV--------------- 222 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 I +A +I V++ + R V Sbjct: 223 ------------IERHRTDQATGVTSDHIIQVAVGEK-------------TMTLRRVGYR 257 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 E + LT+ A +AD Y+ RWQ+E+ F+ +K L + + + Sbjct: 258 DQETGNRLDFLTN--HMTLPARTIADIYKERWQVEIFFRFIKQNLKIKSFLGNSKNAVLS 315 Query: 338 WIFANLLAAFLIDDIIQPS 356 ++ L+A L+ S Sbjct: 316 QVYVALIAYLLLAYQKFMS 334 >UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepID=Q74P20_BACC1 Length = 460 Score = 166 bits (421), Expect = 8e-40, Method: Composition-based stats. Identities = 65/384 (16%), Positives = 133/384 (34%), Gaps = 37/384 (9%) Query: 1 MNYSHDNWSAILAHI-------GKPEELDTSARNAGALTRRREIRDAATL-LRLGLAYGP 52 M+ +D L + L A G + R+R+ R + L + L+ Sbjct: 5 MSKINDVTQEELRLLGEEFKSKFSIHHLQLLAVKTGMIRRKRKCRAQDLVSLCVFLSQAI 64 Query: 53 GGMSLREVTAWAQLHDVATLSDVALLKRLR-NAADWFGILAAQTLAVRAA--VTGCTSGK 109 G SL + A LS L +R + L Q + + Sbjct: 65 GTESLVSLCAKLTRATGIQLSSQGLNERFNAQTVQFLKELFLQVFRKKFSPMTPLSNRFT 124 Query: 110 RLRLVDGTAISAPGGGS------------AEWRLHMGYDPHTCQFTDFELTDSRDAE--- 154 R+R++D TA P + A ++ + Y+ + +F + + D ++ Sbjct: 125 RIRILDSTAFQLPAQYASSYKGVGGGGSEAGVKIQLEYELISGEFLETAVRDGTSSDCRY 184 Query: 155 -RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 + E+ + D G+ S + + +A +A Y+ R+ W + +G ++ ++ Sbjct: 185 GQERTQTLEPGELSLRDLGYFSIYD-LEKIADRKAFYVSRIRWNTQVYQKEKGGKWTLLD 243 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 + G+ + RL+ L + + Sbjct: 244 LEKLTKDLSEGQILEL--PEIYIGLHQKHKTRLVIYRLTQTEWTKRLEH-------HKKA 294 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 + A+ LL+T++ +V + Y LRWQIE+ FK KS+ + ++ + E Sbjct: 295 KKKMPKYASRINLLITNVSSKHLPHNEVYELYSLRWQIEIIFKTWKSIFKIHEVKPVKLE 354 Query: 334 LAKAWIFANLLAAFLIDDIIQPSL 357 + ++ L+ L+ I Sbjct: 355 RFQCHLYGQLIGLCLVASITYRMR 378 >UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae RepID=B0R9A9_HALS3 Length = 424 Score = 166 bits (421), Expect = 1e-39, Method: Composition-based stats. Identities = 60/344 (17%), Positives = 119/344 (34%), Gaps = 33/344 (9%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPG-GMSLREVTAWAQLHDVA 70 L + + L+ A G + R +++ + L + G +L Sbjct: 4 LTTLFPSKFLEEHAEELGVVEREGKLQIPVLVWALVFGFAAGESRTLAGFRRCYNSTADE 63 Query: 71 TLSDVALLKRLRN-AADWFGILAAQTLAV----RAAVTGCTSGKRLRLVDGTAISAP--- 122 T+S RL A++ L L + + + DGT + Sbjct: 64 TISPGGFYHRLTPTLAEYLRDLVEHGLDEVAVPDTVDADIDRFRDVMIADGTVLRLHEFL 123 Query: 123 -------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERL--DRFAQTADEIRIADRGF 173 A +LH+ ++ ++TD + + + + + + DR + Sbjct: 124 SDEFQARHEEQAGAKLHLLHNATDETIERIDVTDEKTHDSTLFKTGSWLQERLVLFDRAY 183 Query: 174 GSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNS 233 + + ++ R+ +T E + RG G+ + + Sbjct: 184 FKYRR-FALIDENDGYFVSRLKENANPLITEE------LREWRGRAIPLEGKQIHDVVDD 236 Query: 234 GNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPE 293 ++K I V + E S + ++ RVV +A + L +T+LP Sbjct: 237 ISRKY--------IDVEVEAEFKRGQYEGTRSLDTKRFRVVGVRDSDADDYHLYITNLPR 288 Query: 294 DEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 DE+ E +A YR RW++E F+ LK+ LD +P++ K Sbjct: 289 DEFFPEDLATLYRCRWEVETLFRELKTQYELDEFNTSDPDVVKI 332 >UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=Q648P8_9ARCH Length = 464 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. Identities = 64/389 (16%), Positives = 119/389 (30%), Gaps = 73/389 (18%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVA- 70 + + E + R R TL S + A VA Sbjct: 31 FSDVLSAETIRNIMDEEVGSYRDRIYSPLITLSAFLSQVLSSDHSCKNAVAKVLAERVAQ 90 Query: 71 ------TLSDVALLKRLRNAADWFGILAAQ---TLAVRAAVTGCTSGKRLRLVDGTAISA 121 + + RLR + L + L +++ G+ ++LVDGT +S Sbjct: 91 GKLPCSSNTKSYCEARLRLPINLVRRLVRETGKLLHLKSEEAWKWKGRSVKLVDGTTVSM 150 Query: 122 PGGGSAEW-----------------RLHMGYDPHTCQFTDFELTDSRDAERLDRFAQT-- 162 P + RL D + + E + Sbjct: 151 PDTPENQKMYPQPEGQKEGVGFPIARLVAIISLSCGAVLDIAIGPYKGKETGEHALLRQI 210 Query: 163 -----ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 +I + DR + S I L AD + R+H + Sbjct: 211 LGSISTGDILLGDRYYCSYF-LIVMLQQLGADSVFRIHGSRKKDFRRGKHLGK------- 262 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 ++ P ++ + + + Sbjct: 263 -------------------------KDHIVIWKKPKQRPNWMTESMYLQMPD---TLTIR 294 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 ++ V+ T L E++ E++ + Y RW IE+ F+ +K++L +D LR K P++ Sbjct: 295 EIKINRKVITTTLLDPKEFTREEIDELYAKRWLIEVDFRFIKTVLQMDILRCKTPDMVCK 354 Query: 338 WIFANLLAAFLIDDIIQPSL---DFPPRS 363 I+ +LLA LI ++ + + PPR+ Sbjct: 355 EIWVHLLAYNLIRTVMAQAAHRYNLPPRT 383 >UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=Vibrio vulnificus RepID=Q7MGY3_VIBVY Length = 441 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 58/367 (15%), Positives = 118/367 (32%), Gaps = 29/367 (7%) Query: 12 LAHIGKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL---- 66 L+ I P+ ++ +G A R+R I + + L + + AQL Sbjct: 23 LSDILCPDFINQCLDASGVATIRKRRIPLDMAVWAVVAMSLYRQEPLWSIVSKAQLMLPG 82 Query: 67 -HDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGC--TSGKRLRLVDGTAISAPG 123 + S + + R R AD + Q+ ++ G +L VDG P Sbjct: 83 KRSLVAPSAI-VQARQRLGADAMKEVFHQSQSLWNETADHPTWCGLKLLAVDGVVWRTPD 141 Query: 124 GGSAEWRLH-MGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS 182 F + + T+ + F S Sbjct: 142 TKENRDAFQSASNQNGEGSFPQVR--------MVCQMELTSHMLV--ASAFASYKTNEMI 191 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPF 242 LA I L ++ ++ + + Sbjct: 192 LAEQ---LIETTPDYSLTMFDRGFYSLSLLHRWANTGNERHWLMPMRKNTQFTEVRKLGR 248 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 R++ + P+ K L + R+++ +T++ +L + Y ++A Sbjct: 249 NDRIVELKTTPQA---RKKSLSLPETIEVRLIK-KTIKGKEVSILTSMTDHRRYPPAEIA 304 Query: 303 DCYRLRWQIELAFKRLKSLLHLDAL--RAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 + Y RW+IE+ ++ +KS L + R+K+PE+ K ++ LL+ +I + Sbjct: 305 ELYSHRWEIEVGYREMKSSLLNNEFTLRSKKPEMVKQELWGLLLSYNIIRYQMVNMAKAV 364 Query: 361 PRSAGSE 367 P ++ Sbjct: 365 PGIYPNQ 371 >UniRef50_B3JNI1 Putative uncharacterized protein n=3 Tax=Bacteroides coprocola DSM 17136 RepID=B3JNI1_9BACE Length = 389 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. Identities = 61/380 (16%), Positives = 107/380 (28%), Gaps = 63/380 (16%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 MN +S + + + K + + T+ I LL L G SLRE+ Sbjct: 1 MNIKKYVFSQMTSFLPKRYF-ERLVEKSNDRTKSWSISFWNQLLVLIFGQLDGCNSLREL 59 Query: 61 TAWAQLH---------DVATLSDVALLK----RLRNAADWFGILAAQTLAVRAAVTGCTS 107 T H ++ L K R + F + Sbjct: 60 TDITIAHSSKSYHLGFGKTPITRSTLSKANMLRNYRVFESFAYHMVNLAQQKRIDKEFDL 119 Query: 108 GKRLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDS--RDAERL 156 D T I + ++H D T T F +TD+ D + Sbjct: 120 NGTFYAFDSTTIDLCLSLYDWARFRSTKSGIKVHTQLDIRTEISTSFTITDAVVHDVNAM 179 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 D A I DRG+ + + + +++R R +TA + + Sbjct: 180 DSIAYEPFACYIFDRGYFD-LRRLYHINEVSSFFVIREKRRPKYEITAGEDVLEGTDNV- 237 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 L + + R + + R + Sbjct: 238 ----------------------------------LQDQTIRFTGERNCTNYPSEIRRIVY 263 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 + E T+ A +A Y+ RW++EL FK LK L + + + Sbjct: 264 YSPEMNRTFTYYTN--NFYLKASDIALLYKNRWKVELFFKFLKQHLRVKSFWGNSENAVR 321 Query: 337 AWIFANLLAAFLIDDIIQPS 356 I+ ++ L+ I Sbjct: 322 IQIYVAIITYCLVAIIESEL 341 >UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Tax=Gammaproteobacteria RepID=A6UXI0_PSEA7 Length = 423 Score = 164 bits (415), Expect = 5e-39, Method: Composition-based stats. Identities = 71/383 (18%), Positives = 127/383 (33%), Gaps = 59/383 (15%) Query: 9 SAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHD 68 I + + P + +N TRRR++ L L L P E+ + ++ + Sbjct: 5 QKITSPLDCPAFIAAHRQNPQDFTRRRQLTFKN--LVLFLLNQPRTALQTELDQFYRVLN 62 Query: 69 VATLSDVALLK------RLRNAADWFGILAAQTLAV--RAAVTGCTSGKRLRLVDGTAIS 120 A+ + R + + F L + G R+ VDG+ + Sbjct: 63 QASTETQMVTAQAFCKARKKLNPEVFESLNRLLQQQIDCFGLRQKWRGLRVLAVDGSTVH 122 Query: 121 AP-----------GGGSAEWRLHMGYDPHTCQFTDFELTD----SRDAERLDRFAQTADE 165 P G RL Y+ Q + RD L AD Sbjct: 123 LPLESTMATFFGSHSGFPMARLSTLYEVADGQTLHSLIVPLTVGERDCAHLHLEHLPADS 182 Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 + + DRG+ A + +++R+ CG N + Sbjct: 183 LTLFDRGYPGHW-LFALFAQQQRHFLMRLP------------------------CGYNAQ 217 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 + + + +L + P + S+ + ++ + R+++ E V Sbjct: 218 VKAFLHSGQVE------DTQLFVANHPEARLFCSEAGVDPASQIELRLIRVELANGESEV 271 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL- 344 LL + L + + AE A+ Y RW IE F+RLK L LD + K A L Sbjct: 272 LLTSLLDREAFPAEVFAELYHRRWGIETDFRRLKQTLTLDNFSGRSVTAVKQDFHAAQLL 331 Query: 345 --AAFLIDDIIQPSLDFPPRSAG 365 A L+ ++QP ++ + Sbjct: 332 KNLALLMQHLLQPVIEQRHKGRK 354 >UniRef50_C5V7Z6 Transposase IS4 family protein n=3 Tax=root RepID=C5V7Z6_9PROT Length = 389 Score = 164 bits (414), Expect = 6e-39, Method: Composition-based stats. Identities = 62/384 (16%), Positives = 114/384 (29%), Gaps = 71/384 (18%) Query: 1 MNYSHDNWSAILAHIGKPEE---LDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSL 57 MN ++ +L + + A N G + + + + A L Sbjct: 1 MNRGKTVFAQLLDFVPFNHFEYLTERFAANHGI----KHFSAWSQFICMAYAQLTRRDGL 56 Query: 58 REVTAWAQLH-------DVATLSDVALLKRLRNAADW--FGILAAQTLAV-----RAAVT 103 R++ A + + + L DW F L + +++ R Sbjct: 57 RDLVACLNSQKSKLYHIGIRSKVSRSTLADANERRDWRLFEALGHRLISIALELYRDEDI 116 Query: 104 GCTSGKRLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAE 154 G + L +D T I A + H D +T + + Sbjct: 117 GLGLKEPLYAMDSTTIDLCLTLFPWAEFRSTKAAVKAHTIIDLRGSIPVFLSITTGKVHD 176 Query: 155 --RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMM 212 LD A I + DRG+ + +L + +++R Sbjct: 177 VNLLDVIPFPAGTIVVIDRGYLHFAR-LYALHQRQVTFVIRAKNNLRF------------ 223 Query: 213 GFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR 272 I + KA + I ++ P + + + R Sbjct: 224 ---------------TWIASREVDKATGLRCDQTILLATP---------KSKTAYPERLR 259 Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 V E H++ LT+ + A +A+ Y+ RWQIEL FK LK L + Sbjct: 260 RVSFRDPETGKHLVFLTN--RFDLPALTIANIYKNRWQIELFFKWLKQNLAIKHFYGNSL 317 Query: 333 ELAKAWIFANLLAAFLIDDIIQPS 356 K+ I+ + L+ + Sbjct: 318 NAVKSQIWIAICVYLLVSIAKKQL 341 >UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FU81_METHJ Length = 452 Score = 164 bits (414), Expect = 6e-39, Method: Composition-based stats. Identities = 57/364 (15%), Positives = 116/364 (31%), Gaps = 36/364 (9%) Query: 16 GKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM-SLREV-TAWAQLHDVATLS 73 + ++ AR G + R+R++ + L +L E+ + L D + Sbjct: 20 FTFDFIEKKARETGFMQRKRKLDPVLLIFSLIFGVSSHLKPTLEEIHRHYVDLDDNPKIE 79 Query: 74 DVALLKRLRNA-----ADWFGILAAQTLAVRAAVTGCTS------GKRLRLVDGTAISAP 122 L + R D+ L + + K + + D + I Sbjct: 80 TSILNQSFRKRFNYKLVDFLKSLMDHYIDQIVHQSPAHLKGIVEDFKDILVQDSSIIRIS 139 Query: 123 GG------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAE--RLDRFAQTADEIRI 168 SA ++H Y + +T R + L + + I Sbjct: 140 KKLYDLHPAARSRDDSAGLKIHAVYSVVYHSVKNAIITTERVHDYKMLKIGPDVENILLI 199 Query: 169 ADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTV 228 D G+ S + + + RV + + + + + +C K+ Sbjct: 200 NDLGYYS-LKTFSKIQEYGGFFASRVKSNAVFKVVSINSGPPEITSIVDHNCFKSINGDD 258 Query: 229 MIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLL 288 + K ++ + ++ RV+ + L + Sbjct: 259 FLDRMPKKGVYDLIC--------SFHIGDKHINKIKTPIFQEFRVICSWNPLTEKWHLYI 310 Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 T+L ++ +SA+ + + YR RW IEL FK LK L + +A I++ LL + Sbjct: 311 TNLGKEVFSADDIYELYRFRWVIELIFKELKGDYDLGKMLLNNEPMAFIHIYSMLLRFII 370 Query: 349 IDDI 352 D+ Sbjct: 371 SRDL 374 >UniRef50_C0VKK7 ISCja2 transposase n=8 Tax=Acinetobacter RepID=C0VKK7_9GAMM Length = 385 Score = 163 bits (413), Expect = 7e-39, Method: Composition-based stats. Identities = 51/372 (13%), Positives = 120/372 (32%), Gaps = 61/372 (16%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M++ + + +L I + + + A+ + R + + ++ SLR++ Sbjct: 1 MSHQNTVFHQLLKPISRCDF-ERLAKQHHCGQKLRSATRWDQFIAILMSQLSCRQSLRDI 59 Query: 61 TAWAQLHDV------ATLSDVALLKRLRNAADW--FGILAAQTLAVRAAVTGCTSGK--- 109 + + A + L R+ + L Q L + Sbjct: 60 QSNLESQQEKLYHLGAKTIARSTLARINQEQPASLYQQLFTQLLRHCENTKIAHKFRFKN 119 Query: 110 RLRLVDGTAISAPGGGSAEWRLH---------MGYDPHTCQFTDFELTDSRDAERLD--R 158 L +D + I ++H +G + L D + + + Sbjct: 120 PLYSLDASHIDLSLSLCEWAKVHESKASIKLTVGLNHSNTIPEFVALGDGIENDMVQGRL 179 Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 I + D+G+ + + + ++ R+ + + + ++ + G L Sbjct: 180 LKFPPGSIVVFDKGYVDY-QWFAEMTDRKVSFVTRLRPKTVYEVKSKREVYACKGILA-- 236 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 + + + + KK GAP ++ R ++ Sbjct: 237 ------DEYIELSSDYAKKRGAP---------------------------KRLRRIEFYD 263 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 +E L++ +A +A Y+ RW++EL FK +K L L + + + Sbjct: 264 VEKKRTFEFLSN--NFHLAASTIAAIYKDRWKVELFFKAIKQNLKLKSFLGRSRNAIQTQ 321 Query: 339 IFANLLAAFLID 350 I+ L+A L+ Sbjct: 322 IWIALIAYLLVS 333 >UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=A7B831_RUMGN Length = 366 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 52/345 (15%), Positives = 119/345 (34%), Gaps = 44/345 (12%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVT 61 + + S+++ + L + +R R++ +T ++ L+ G + + Sbjct: 14 EHVKNKLSSLIHKMATAPWLFSKNPEVDF-SRNRKLDFVST-IQFLLSMESGSLKKELLD 71 Query: 62 AWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISA 121 + D + S +R + + F L + + +L DG+ ++ Sbjct: 72 YFQFSVDTPSASAFC-QQRNKLLLEAFQFLFYE-FNSCFSFEKKYKDYQLLACDGSDLNI 129 Query: 122 ---------------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRD-------AERLDRF 159 G + L+ +D ++ D + +R + +DR+ Sbjct: 130 ARNPNDAGTYFQSQPTDRGFNQIHLNALFDLCEKRYIDLVIQPARLENESLAMTQMIDRY 189 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 I IADRG+ + + Y++RV G +T D + Sbjct: 190 KGEKKTIFIADRGYETYN-IFAHVQEKGMYYLIRVKDGGGGSMTGSFDLPD-------EN 241 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 + ++ P + IA S P + + + RVV+ Sbjct: 242 EFDHDMQLILTRKQTKDVKAKPKKFKFIAKSSPFDYLDLYDKK---FYTLNFRVVRFAIS 298 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRW------QIELAFKRL 318 E + ++T+LP++++ E++ Y +RW IE+ ++ + Sbjct: 299 E-DSYESIITNLPKEDFPVEEIKKVYAMRWHRNIVQGIEICYRIM 342 >UniRef50_A5II18 Transposase, IS4 n=1 Tax=Legionella pneumophila str. Corby RepID=A5II18_LEGPC Length = 379 Score = 163 bits (411), Expect = 1e-38, Method: Composition-based stats. Identities = 59/362 (16%), Positives = 109/362 (30%), Gaps = 56/362 (15%) Query: 6 DNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 + I+ I + + + L + + SLR + Sbjct: 4 TVFQEIIKPITTDLLKECVTIFKSDYDYE-KFKTYEHLQSMLYVHLNQISSLRTLETAIN 62 Query: 66 LHDV-----ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAIS 120 D+ S ++ R R AD F + Q L + K +R++D + I Sbjct: 63 SQDLGLSAKICRSTLSDANR-RRKADCFLWILEQLLEMLPKKQKKEFSKIVRVLDSSPIQ 121 Query: 121 APG-----------GGSAEWRLHMGYDPHTCQFTDFELT--DSRDAERLDRFAQTADEIR 167 G +LH+ YD T L+ + D+ ++ D I Sbjct: 122 LKGYGYEWAKHNATRRCEGLKLHVEYDLGLESPTRVALSFPNFNDSSMGKQWPIETDIIY 181 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 + D+G+ S+ +A ++ R+ + + + L Sbjct: 182 VFDKGYCDYDWWW-SIHQKKAFFVSRLKVNAAISIEQKFETNENSPILEDG--------- 231 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 K K L R + + + +L+ Sbjct: 232 --------------------LFRFSNPKPRGGKKNL---YTSLARRISVQREDKDPLILV 268 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 L E AE +A Y+ RW+IEL FK +K L + + K K + ++A Sbjct: 269 TNLLDE---PAEMIAQLYKSRWEIELFFKWIKQRLKIKKILGKSENAVKIQLITAIIAYL 325 Query: 348 LI 349 L+ Sbjct: 326 LV 327 >UniRef50_C3R0J9 Transposase n=4 Tax=Bacteroidales RepID=C3R0J9_9BACE Length = 424 Score = 163 bits (411), Expect = 1e-38, Method: Composition-based stats. Identities = 63/381 (16%), Positives = 119/381 (31%), Gaps = 67/381 (17%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 MN ++ ++ I + D + +++ + LL L G +S+R++ Sbjct: 1 MNIGKYIFAQVIDFI-PRYQFDKLVKKYKGDWHAKDLSCYSQLLHLLFGQITGCVSIRDI 59 Query: 61 TAWAQLHDVATLSDVALLK----------------RLRNAADWFGILAAQTLAVRAAVTG 104 + H +++ + + K R+ + I + + VT Sbjct: 60 CLCLEAHG-SSIYHLGIRKSVNQSNLCRANEKRDYRIYEGLGMYLISIVRPMYSNTKVTE 118 Query: 105 CTSGKRLRLVDGTAISAPGGGSAE---------WRLHMGYDPHTCQFTDFELTDS--RDA 153 T L +D T IS +A ++H D + +TD D+ Sbjct: 119 ITIDNVLYALDSTTISTSIVLAAWALGKYSKGAVKMHTLLDLRGSIPANIHITDGKWHDS 178 Query: 154 ERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 LD A + D+ + R A +I R + + FD Sbjct: 179 NELDEIVPEAFAFYMMDKAYVDFIALFR-FHKAGAYWISRPKDNMRYEVVNHRLDFDP-- 235 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 G G+ + + +KK R+ Sbjct: 236 -----STGICGDFIIKLTTHKSKKL----------------------------YPEPIRM 262 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 V V +T+ E SA +V + YR RW IE+ FK +K + + L Sbjct: 263 VTYHDSVTGNDVEFITN--NFEISAIEVTNLYRHRWDIEVFFKWIKQNIVVKNLWGYSEN 320 Query: 334 LAKAWIFANLLAAFLIDDIIQ 354 + ++ ++A +I I Sbjct: 321 AVRTHLWVAIIAYLIIAKIKA 341 >UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PXV1_9BACT Length = 449 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. Identities = 60/373 (16%), Positives = 126/373 (33%), Gaps = 39/373 (10%) Query: 16 GKPEELDTSARNAGALTRRREIRDAATLLRLGLA--YGPGGMSLREVTAWAQLHD-VATL 72 + LD AR + R + L A + G +SL ++ + + + Sbjct: 9 LDEDNLDRIARETCFVQRSTNKVSGRDFVELLSAGHFDSGIISLEGLSDVLREKSPESDI 68 Query: 73 SDVALLKRLR--NAADWFGILAAQTLAV-------RAAVTGCTSGKRLRLVDGTAISAP- 122 + AL K++ A + + + L D T I+ Sbjct: 69 TPQALSKKINSDKAVSFLERTFEAIYKEQVCPKLEKIPFVALEQFSNVYLQDSTQIALNE 128 Query: 123 -----------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTAD----EIR 167 + ++ + Y+ + +T ++ + ++ Sbjct: 129 HLAEEFKGTGGSASKSSVKIDLLYEAVHHILKEVSITKGTYPDQKNGAKVLKHIGERDLL 188 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL--TAEGMRFDMMGFLRG-LDCGKNG 224 + D G+ + + A Y+ R +L D++ +++ + Sbjct: 189 LRDLGYFD-LSVLGDIEGKGAYYLSRFFKSTKVYLSADPGAEAIDLVSYVKKHIGNKGLA 247 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 + V +G +RLIA P + + ++ G+ + E LE + Sbjct: 248 DMEVYLGE-------ERICSRLIAYRAPGHVINERRRKAKRAVQKSGKTLSREYLEWLDY 300 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +T++ + +S E V YR+RWQIEL FK+ K L +D +R E + ++ L+ Sbjct: 301 SFYITNVGAEIWSPEVVGTIYRIRWQIELVFKQWKQLFRMDVMRGTREERIRCLLYGRLI 360 Query: 345 AAFLIDDIIQPSL 357 ++ I S Sbjct: 361 MICIVTRIYALSA 373 >UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LI35_HALO1 Length = 449 Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats. Identities = 61/357 (17%), Positives = 106/357 (29%), Gaps = 23/357 (6%) Query: 12 LAHIGKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLH--- 67 LA PE ++ + G A RRR + + + + EV L Sbjct: 22 LARDVAPEWIEQALEATGTATLRRRRLPMEQLVWLVIGMALFRDRPITEVVTSLDLALPS 81 Query: 68 -DVATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAVT---GCTSGKRLRLVDGTAISAP 122 ++ A+ R R L A + A + G L VDGT + P Sbjct: 82 PGHPEVAPSAVAQARDRLGESPMAWLFAHSADRWAHQSAADDRWRGLALYGVDGTTLRVP 141 Query: 123 GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS 182 R H G + + A R A +G G Sbjct: 142 DSEEN--RDHFGLANGGARGSSGYPVVRLAALMALRSHLLAAVSFGPYQGHGEYWYAADL 199 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPF 242 + L + +++ L+ ++ G + Sbjct: 200 W--------PCLPDNSLVIVDRHYWAANVLIPLQQDGLNRHWLIRGRKGLNYRVVEQLGP 251 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 L V + P+ R++ + L + L Y A+++ Sbjct: 252 SDELAEVKVSPQA---RSKNPELPRTWTVRIIHYQRKGFRPQRLFTSLLDPVAYPADELV 308 Query: 303 DCYRLRWQIELAFKRLKSLLHLD-ALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 Y RW+IEL + +KS + + LR+K + A+ I+ L+A LI + Sbjct: 309 ALYHERWEIELGYDEVKSKMLANVPLRSKSVDRARQEIWGLLIAYNLIRLEMARVAH 365 >UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostridium sp. SS2/1 RepID=B0NXD2_9CLOT Length = 439 Score = 161 bits (406), Expect = 5e-38, Method: Composition-based stats. Identities = 56/356 (15%), Positives = 111/356 (31%), Gaps = 44/356 (12%) Query: 29 GALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWF 88 TR+R++ + + Y G + ++ T S L ++ + F Sbjct: 33 KDFTRKRKL-SFQDTINTIVTYDAGSIGRCIKRYIPKVEKTPTTSAF-LQQQKKLKLSAF 90 Query: 89 GILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWR-----------------L 131 L + + VDGT ++ P E + Sbjct: 91 QTLFYR-FNDPFPDKTLYH-LHILSVDGTGVTVPMDRINENKEYARVRTNKDCTRPAYQF 148 Query: 132 HMG--YDPHTCQFTDFELTDSRDAE-------RLDRFAQTADEIRIADRGFGSRPECIRS 182 H+ YD ++ D + R L+R + IADRG+ S Sbjct: 149 HVSCIYDLINERYCDAYIEPFRTHSETHVFSVMLERKNFPQKALFIADRGYESYLLM-AQ 207 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPF 242 + +++R F ++G ++G + K Sbjct: 208 IQHDGNYFLIRAR-----------EDFGQGSMIKGYPFPRDGTFDKTVTYIYTKTQNKRT 256 Query: 243 PARLIAV-SLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQV 301 A + + + + R V L+T+LP +++ +E + Sbjct: 257 KANPELYKRVATRNSPYFINKEHPYVKMTLRFVMIVLPNGQK-ECLITNLPANKFPSETL 315 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 Y +RW+IE +F+ +K +L +K+ E + I+A ++ I Q Sbjct: 316 KKLYCIRWKIETSFRLIKYSANLLEFHSKKIEFLQQEIWAKMIFYNFTTTITQHLR 371 >UniRef50_Q05309 Transposase for insertion sequence element IS1151 n=16 Tax=Clostridium perfringens RepID=T1151_CLOPE Length = 473 Score = 160 bits (405), Expect = 6e-38, Method: Composition-based stats. Identities = 62/393 (15%), Positives = 134/393 (34%), Gaps = 35/393 (8%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLR-LGLAYGPGGMSLRE 59 MN S + E++ A+++ + R+ I L+ + L + Sbjct: 1 MNKKLKYLSKSIKESFDINEINKIAKDSKFIQRKGSITAKDFLMFNVFYGSDICTAPLSQ 60 Query: 60 VTAWAQLHDVATLSDVALLKRLRN-AADWFGILAAQTLAVRA------AVTGCTSGKRLR 112 + A + L AL KR + ++ + + L + T T R+ Sbjct: 61 LAAKYDMIFSKQLPKQALDKRFNKYSVEFMKEIFIKFLYSQNNTLTNLERTLRTYFDRVI 120 Query: 113 LVDGTAISAPG------------GGSAEWRLHMGYDPHTCQFTDFELTDS--RDAERLDR 158 + D + + P + ++ + Y+ T F + ++ D E L Sbjct: 121 INDSISFTLPKEFKKKFPGSGGVASPSSIKVQLQYELLTGSFMNIDIFSGIKNDVEYLKT 180 Query: 159 FAQTADEIRI--ADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 + D + AD G+ + + ++ L +I +V ++ +G ++ Sbjct: 181 MKKYKDYKDLKLADLGYF-KIDYLKRLDKSGTAFISKVKSNTSLYIKNPSPEKYKVGTIK 239 Query: 217 GLDCGKNGETTVMIGN----------SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSE 266 + + + +RLI L E + Sbjct: 240 KSSEYIKIDIIKLAEPLAAGETIELTDIYIGSKKELKSRLIITKLTEENKSKRIFNHIEG 299 Query: 267 NRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA 326 ++K + L+ +T++ + + QV + Y LRWQIE+ FK KS+ ++ Sbjct: 300 IKKKRLTLNQRRLDFNSINAYITNVSSNIITMNQVHELYSLRWQIEIIFKVWKSIFKINQ 359 Query: 327 LRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 ++ + E +++ L+A L I+ S Sbjct: 360 VKKVKLERFMCFLYGRLIALLLSSTIVFTSKSI 392 >UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FI31_DESAA Length = 386 Score = 160 bits (405), Expect = 7e-38, Method: Composition-based stats. Identities = 51/375 (13%), Positives = 107/375 (28%), Gaps = 67/375 (17%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRR-REIRDAATLLRLGLAYGPGGMSLRE 59 M++ S +L I + + + R R++ + + ++ SLR+ Sbjct: 1 MSHHSTILSQLLQSI-DRHDFNRIEKQGFLPDRSYRKLTRWGQFVAMAFSHLTQRTSLRD 59 Query: 60 VTAWAQLHDV---------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGK- 109 + H S +A R A++F + A + + Sbjct: 60 LEGQFDAHSSKLYHAGAAPVKRSTLADANNQRP-AEFFEEVFYHMAAKCQSHAPKHKFRF 118 Query: 110 --RLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDA--ERL 156 L +D + + A ++H D +TD++ + E Sbjct: 119 KNPLYSMDSSVVDLCLNLFPWAKHRSTKAGIKIHTVLDHSGYIPAFVRITDAKTSDIEIA 178 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 + I + DR + ++ + ++ R+ + Sbjct: 179 RTLSLPKGSILVEDRAYVDFT-WFKNWHENKQFFVTRLKKNIKYKVLERRDVPQNK---- 233 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 G + + + R V Sbjct: 234 ----GVTSDQIIKLTGKKAADCPN------------------------------LRRVGY 259 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 H + LT+L + SA +AD Y+ RWQIEL FK +K L + + Sbjct: 260 WDKTTKKHYVYLTNLT--KLSARTIADIYKDRWQIELFFKWIKQNLRIKSFLGNSRNAVL 317 Query: 337 AWIFANLLAAFLIDD 351 I+ +++ ++ Sbjct: 318 TQIWTAMISMLILAY 332 >UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobacteria RepID=C5T3Q2_ACIDE Length = 436 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 71/376 (18%), Positives = 126/376 (33%), Gaps = 67/376 (17%) Query: 12 LAHIGKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSL----REVTAWAQL 66 L+ + P + + + G A RRR++ + + M L +E+ Sbjct: 23 LSALLDPAWIAQALQATGKASMRRRKLPAEHAVWLVIGLALFRHMPLWQVVQEMALTLDG 82 Query: 67 HDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG-KRLRLVDGTAISAPGGG 125 ++ S V++ R R A+ + +G R+ VDG A SAP Sbjct: 83 QELPAPS-VSVQVRQRLGAEPMEHMFGLLANAWGRAHAVHAGALRVLAVDGVAWSAPDSK 141 Query: 126 SAEWRLHMG-----------------YDPHTCQFTDFELTDSRDAER--LDRFAQTADEI 166 L G D + + D +L D E I Sbjct: 142 DNRQELGSGQTQYGPQPWPMVRAVCLLDTDSHELLDAQLGDYGCGELTLAADLHGLDHSI 201 Query: 167 RIADRGFGSRPECIR-SLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 + DR + S + S A + +++R + D + Sbjct: 202 TLFDRAYFSAAFLLAWSQAGQQRHWLMRAKDNLRYEVVQTLDEGDWL------------- 248 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 P A L P+ + RL+ R G+V + Sbjct: 249 --------------IRMPVSPRARKLHPQLPSHWQARLIE-VRAGGKVRRF--------- 284 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANL 343 + + L ++++A +A YR RW+IEL F+ +K L LR+K+PEL K ++ L Sbjct: 285 -ITSMLDPEQFAAAPLAQLYRQRWEIELGFREIKQSLQQGQAVLRSKQPELVKQEVWGVL 343 Query: 344 LAAFLIDDIIQPSLDF 359 +A L+ ++ + Sbjct: 344 IAYTLLRRWMRLMAEH 359 >UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Corynebacterineae RepID=A4T2G5_MYCGI Length = 401 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 69/389 (17%), Positives = 114/389 (29%), Gaps = 77/389 (19%) Query: 11 ILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYG------------PGGMSLR 58 +L + P +D G R A + + G L Sbjct: 23 VLTRVFPPAMVDEVIEATGRTQVRHRALPARVMAYFAIGMGLYSDGSYEDVLSQLTDGLA 82 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLA--VRAAVTGCT-SGKRLRLVD 115 + W + + + S + R R + L A+ A G +G+R+ +D Sbjct: 83 WASGWREQYQLPGKSAI-FQARERLGSQPLAALFARVARPLGAADTPGTWVAGRRVVAID 141 Query: 116 GTAISAPG------------------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLD 157 GT + + RL + T + RDAE Sbjct: 142 GTCLDVADNPVNEEFFGRPGVNKGEKSAFPQARLLAVAECGTHAIFAATIGAYRDAESTM 201 Query: 158 RFA----QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 T + + +ADRGF S R+ + AD + RV Sbjct: 202 VEHVLDALTPEMLVLADRGFFSYALW-RNASDTGADLLWRVSTGRNGPTPTHVEDLADGS 260 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 +L L + + G P AR+I ++ + Sbjct: 261 WLAHL-------------RAAKDRHGEPMLARVIDYTVDDGRDNP--------------- 292 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKE 331 + LL T D A ++A Y RW+IE F LK+ LR+K Sbjct: 293 --------VAYRLLTTLTDPDTAPAVELAAAYAQRWEIESVFDELKTHQRGSKVVLRSKS 344 Query: 332 PELAKAWIFANLLAAFLIDDIIQPSLDFP 360 P+L I+ L + I ++ + Sbjct: 345 PDLVLQEIWGYLCCHYAIRSLMSQAAHHS 373 >UniRef50_C3KKH4 Putative transposase Y4ZB n=2 Tax=Rhizobium sp. NGR234 RepID=C3KKH4_RHISN Length = 493 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 65/376 (17%), Positives = 108/376 (28%), Gaps = 64/376 (17%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 M +S + +L I G + L+ L A SLR + Sbjct: 107 MRFSPSIFGQLLKAI-DRRSFQAIVDRHGGDAYDKRFTSWDHLVALIYAQFSAATSLRGL 165 Query: 61 TAWAQLHDV------------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG 108 A + +TLSD R F A T Sbjct: 166 EAGWNANAQQHYHLGSARLLRSTLSDAN----ARRPVAVFAETFALVAGQLDRQTRRDGS 221 Query: 109 KRLRLVDGTAISA--------PGGGSAEWRLHMGYDPHTCQFTDFELTDSR--DAERLDR 158 K LRL+D T I G +LH+ YDP ++TD+ DA+ Sbjct: 222 KMLRLIDSTPIPLGKLCDWAKSNGRIRGMKLHVVYDPKADCPRLLDITDANVNDAQIGRT 281 Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 + D+G+ ++A +A ++ R + + + G Sbjct: 282 VTIEKGATYVFDKGYCHY-GWWTAIAAAKAVFVTRPKVNMALKVVRKRR----ITAAEGD 336 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 + V + + G+ K R + + Sbjct: 337 GFTVLEDARVRLASKGDSKLPIGL-----------------------------RRITVKR 367 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 + LL L A + Y+ RWQIEL F+ +K L + + Sbjct: 368 ADGDTITLLTNDLKR---PAVAIGQLYKGRWQIELLFRWIKQHLKIRKFLGNNDNAIRLQ 424 Query: 339 IFANLLAAFLIDDIIQ 354 I A ++A L+ + Sbjct: 425 ILAAMVAYALLRIATR 440 >UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=Streptococcus RepID=A4W4J4_STRS2 Length = 440 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 58/365 (15%), Positives = 112/365 (30%), Gaps = 52/365 (14%) Query: 29 GALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWF 88 +R+ ++ T+++ L G ++ + D+ + +R + F Sbjct: 40 KDFSRKSQL-TMETMIQAILTMGGNTLAKELLDL-----DLPVSQSAFVQRRYQLKHQAF 93 Query: 89 GILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP---------------GGGSAEWRLHM 133 L A + + VDG+ + P ++ Sbjct: 94 KALFANITSKIPTFKDLP----ILAVDGSDVVLPRNRSDKTTTFQTGPHHTPYTLIHINA 149 Query: 134 GYDPHTCQFTDFELTDSRD----AERLDRFAQTA--DEIRIADRGFGSRPECIRSLAFGE 187 Y+ + D + ++R+ A +D + I DRG+ S Sbjct: 150 LYNLEQEIYHDLRIQNNREVDERAAFIDMMESCPFEQALVIMDRGYESYNVMAHC-QERN 208 Query: 188 ADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 YI+R+ G + + D F + E + I + Sbjct: 209 WSYIIRIRD-GNHSMKSGFNLPDTPCF--------DEEFDLNICRKQTNVMKELYRDFPN 259 Query: 248 AVSLPPEKALI-------SKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ 300 P A K+ +S R+V+ E T + +YS E+ Sbjct: 260 QYHFLPHNASFDLLPNSSRKSDPISFYDLHFRMVRLEIKPG----FFETLVTNTDYSPEK 315 Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 + D Y RW IE +F+ LK + L AK+ E I+A+ + + + P Sbjct: 316 LKDLYAYRWGIETSFRDLKYSIGLTHFHAKKKEGILQEIYAHFINFNVCKWLTSHVAIKP 375 Query: 361 PRSAG 365 + Sbjct: 376 SKLKQ 380 >UniRef50_A6DKD2 ISPg4, transposase n=7 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DKD2_9BACT Length = 412 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 55/392 (14%), Positives = 114/392 (29%), Gaps = 77/392 (19%) Query: 1 MNYSHDNWSAI--LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR 58 M + N S + + + P ++ A+ G + R++ + ++ L +SL Sbjct: 1 MKPNKSNLSTLKQICQLIPPHIVNKLAKKHG--IKTRKLSSWSHVVSLLYTQLSHALSLN 58 Query: 59 EVTAWAQLHD----------VATLSDVALLKRLRNA--ADWFGILAAQTLAVRAAVTGCT 106 +V H + + R RNA A+ ++L + G Sbjct: 59 DVCDGLHYHSSSLFQIRGATAPKRNTFSNANRTRNAAMAEDLFWEVLKSLQSQLPSFGLD 118 Query: 107 SGKR---------LRLVDGTAISA---------PGGGSAEWRLHMGYDPHTCQFTDFELT 148 + VD T I A + HM + T + + Sbjct: 119 KQNSNFPQRFKRAVYAVDSTTIQLVAHCLDWAKHRRRKAAAKCHMQLNLQTFLPSYAIVK 178 Query: 149 DSRDAERLDR----FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTA 204 ++ + + EI + D+ + + L +++ R + + Sbjct: 179 EANTHDSTEAKEMCANIKDGEIVVFDKAYVDF-RHLYHLDSRGVNWVTRSKDNMVYDIIE 237 Query: 205 EGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLL 264 E P +I + ++ Sbjct: 238 ER----------------------------------PTKGNII----SDQIIKLNGINTE 259 Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 + R+V A +L+ +++ +A Y+ RW IE+ FK+LK L L Sbjct: 260 KHYSQNLRLVTANIEVDGKMKVLMFLTNNLQWAPSSIASIYQSRWGIEVFFKQLKQNLKL 319 Query: 325 DALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 + ++ LL L+ + S Sbjct: 320 ADFLGHNKNAIQWQVWTALLTYVLLRFLAFRS 351 >UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3VMZ1_KLEPN Length = 421 Score = 158 bits (400), Expect = 3e-37, Method: Composition-based stats. Identities = 79/387 (20%), Positives = 130/387 (33%), Gaps = 60/387 (15%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LREV 60 +S I + ++L++ + R R I A L A G G + ++ Sbjct: 3 QNQITAFSTIFEALFSEQQLNSLGVQTHMIERFRLITPAKLCLAFVCALGSGNARTIADI 62 Query: 61 TAWAQLHDVATLSDVALLKRLRNA--ADWFGILAAQTLAVRAA------VTGCTSGKRLR 112 + ++ +L ++ + Q LA+ K++ Sbjct: 63 HRYFNHLHSMSVRLKPFHNQLVKLGTPEFMRQVFEQALALHLPAMHTFSDAYRGHFKQVL 122 Query: 113 LVDGTAISAPGG------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDR 158 L DGT+ + G A LH+ YD Q L++ +ER L Sbjct: 123 LQDGTSFAVHDGLSLHFPGRFSTHSPAAVELHVTYDLEKAQPVRVSLSEDTASERDYLPV 182 Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE--GMRFDMMGFLR 216 + +AD G+ S+ I SL A +++R+ T G+ + +L Sbjct: 183 AQSLRGCLLMADAGYFSKA-YIESLQNEAASFVLRMPASVNPMATCNQTGLCQPLRSWLA 241 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 L K+GE + + A P Sbjct: 242 VL--PKHGELDLDVQWPDGPVYRCVLFASTDHKDKP------------------------ 275 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAK 336 V L T+L + A V + YRLRWQIEL FK KSL L+ + +A+ Sbjct: 276 --------VCLCTNLDRHTFPAATVGEWYRLRWQIELLFKEWKSLNSLNKFNTEYSTIAE 327 Query: 337 AWIFANLLAAFLIDDIIQPSLDFPPRS 363 I+ +LLAA L +I + R Sbjct: 328 TLIWGSLLAATLKRWLINGAQQKYRRV 354 >UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammaproteobacteria RepID=C6CF98_DICZE Length = 441 Score = 158 bits (398), Expect = 4e-37, Method: Composition-based stats. Identities = 59/374 (15%), Positives = 100/374 (26%), Gaps = 34/374 (9%) Query: 1 MNYSHDNWSA---ILAHIGKPEEL-DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS 56 M++ D +S+ + P + A RRR + + + + +S Sbjct: 6 MDFVVDAFSSERDAFSRSLDPAWIHQALNACHKASIRRRRLPAEQAVWLVLMMGLLRDLS 65 Query: 57 LREVTAWAQ------LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAV---TGCTS 107 +++V V R R L + Sbjct: 66 IKDVCHHLDIVLQPDEGYQPLAPSVLTAARQRLGEAPLRYLFHACNEGWLPTVLGSDTFH 125 Query: 108 GKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIR 167 G + VDGT P DP F + Sbjct: 126 GLHVLSVDGTLFRTPDSPDNAAAFGF-IDPVHGTFPQVRMV----------GLMATHSHM 174 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 + D FG E +LA + L + R + T Sbjct: 175 LLDAAFGGVAEGELTLAHR---LVSSAPDHSLTLFDRCYFSASFLLEWRQAGVETHWLTP 231 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV-- 285 V LI + + P+ K + R+V G + Sbjct: 232 VKRKLRYRVIERYSDYDMLIEMPVSPQA---RKAAPHLPAVWQARMVSYINGSGKGKITG 288 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL-LHLD-ALRAKEPELAKAWIFANL 343 L + Y E + Y RW+IEL + LK L + LR++ PE K ++ L Sbjct: 289 FLTSMTDPVAYPLEDLLRIYWTRWEIELGYGELKQRQLKGEVTLRSRFPEGVKQELWGIL 348 Query: 344 LAAFLIDDIIQPSL 357 ++ L+ + Sbjct: 349 VSYNLLRKEMADIA 362 >UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_ACIJO Length = 443 Score = 156 bits (395), Expect = 1e-36, Method: Composition-based stats. Identities = 58/383 (15%), Positives = 113/383 (29%), Gaps = 70/383 (18%) Query: 4 SHDNWSAILAHIGKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSLREVTA 62 S N+S ++ ++ + G A R+R++ + + + V Sbjct: 27 SLSNFSELIDL----NWIEDCLKRTGKASVRKRKLPAEHAVWLVIGLALFRDQPIWYVVQ 82 Query: 63 WAQLHDVATLS---DVALLKRLRNAADWFGILAAQTLAVR----AAVTGCTSGKRLRLVD 115 QL S ++ R R + +L G + VD Sbjct: 83 QLQLVFGTAESCAPSASVQARQRLGLEPLNVLFNTLSQTWFEDSQPQYSAFHGLSICAVD 142 Query: 116 GTAISAPGGGSAEWRLHM-----------------GYDPHTCQFTDFELTDSRDAER--L 156 G S P + +T + D + E Sbjct: 143 GAVWSMPHTDENFRHFGSSKGKTIAAPWPQARAVCLINTNTHEVIDAGIGSMDQGELTLA 202 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 + A+ + + DR + S + +++R + + D Sbjct: 203 KKLKVPANSLTLFDRAYFSADFLSGWQSRENCHWLMRAKDNLRYEIIRKNSAHD------ 256 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 P A L P+ + RL+ + GRV + Sbjct: 257 ---------------------FQIRMPVSPRAKKLNPDLGDYWEARLIETE-QSGRVRRY 294 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD-ALRAKEPELA 335 + + L Y E+V+ Y RW+IE+ ++ +KS L LR+K+PEL Sbjct: 295 ----------VTSLLDSKAYPLEEVSTLYAQRWEIEMCYREIKSDLQDGMHLRSKQPELV 344 Query: 336 KAWIFANLLAAFLIDDIIQPSLD 358 ++ L+A ++ ++ Sbjct: 345 YQELWGVLIAYNILRRQMKFMAQ 367 >UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RFU1_9FIRM Length = 443 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 57/341 (16%), Positives = 116/341 (34%), Gaps = 35/341 (10%) Query: 27 NAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL-HDVATLSDVALLKRLRNAA 85 TR R I TL++ L +S + D+ ++S V+ +R Sbjct: 38 QTSNFTRSR-ILTPKTLIKFILGLQAHSLSGEVSDYFTSSNIDIPSISAVS-QRRDLLYP 95 Query: 86 DWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP------------GGGSAEWRLHM 133 + F + + L+ ++ +G + DG+ I+ P ++ L+ Sbjct: 96 EIFKSINRRFLSSIDNLSTL-NGYYILAQDGSDINLPFWHDDTQISYGQDSIVCQYHLNA 154 Query: 134 GYDPHTCQFTDFELT-DSRDAER------LDRFAQTADEIRIADRGFGSRPECIRSLAFG 186 YD F + + ++ +E+ ++ + I ADRG+ S + Sbjct: 155 LYDCINHVFWESRIDLPTKKSEKSALIDFINHRNYPENSIITADRGYESYNLIAHCIENN 214 Query: 187 EADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARL 246 + ++ RV + M + T + N+ + Sbjct: 215 QK-FVFRVK-------DIDTRSGIMTSISLPDETFDITVTRTLTNLQTNEVKKNENNQFV 266 Query: 247 IAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYR 306 +P R+V+ + + + L+T+L E+E+ D Y Sbjct: 267 F---VPSTSVFDYLDACNRFYNLSFRIVRFKIAD-DKYETLVTNLDENEFGLSDFKDLYH 322 Query: 307 LRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 LRW E AF LK + + K+ + + I+A++L Sbjct: 323 LRWNEETAFYYLKHAVGMLYFHCKKRQHIQQEIYASILFYN 363 >UniRef50_Q45620 Probable transposase for insertion sequence element IS5377 n=12 Tax=Bacillaceae RepID=T5377_BACST Length = 377 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. Identities = 62/370 (16%), Positives = 115/370 (31%), Gaps = 53/370 (14%) Query: 3 YSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA 62 H ++ + EE+ A G R + LA S R Sbjct: 2 NKHTTLPNLMQKLVSDEEIQLIAEAVGYRDSSRTFTLRELIHFFLLAAMHQWKSFRHGAD 61 Query: 63 WAQLHDVATLSDVALLKRLRNAA-DWFGILAAQTLAVRAAVTGCTSG--KRLRLVDGTAI 119 L+ + + K+ + D L A ++ T + K LR+VD T + Sbjct: 62 VGPLYGLPRFHYSTVSKKAKEVPYDIMKRLLALIISKCNRQTRRSLRFPKPLRVVDSTTV 121 Query: 120 SAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAE-RLDRFAQTADEIRIA 169 + G A +LH+ Y P D T + + A ++ + Sbjct: 122 TVGKNRLPWAPYHGERAGVKLHVAYSPEFSLPADVVETTGLRHDGPVGEQLTNAQQVLVE 181 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 DR + R + + +++R+ + L L + T Sbjct: 182 DRAYFKIERLDRFVEQHQL-FVIRMKDN---------IELHQKKSLNRLSSTSSSVQTDF 231 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 G K+ + R++ GR ++ +T Sbjct: 232 TCQLGTKQCRSTKRHRVVIFR-----------------DANGRDIRV-----------VT 263 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 +L SAE +AD Y+ RW +E+ F+ +K L++ L +FA +A L+ Sbjct: 264 NL--FHASAETIADMYQQRWAVEVFFRWVKQYLNVPTLFGTTENAVYNQLFAAFIAYVLL 321 Query: 350 DDIIQPSLDF 359 + + Sbjct: 322 RWLYDQTKKQ 331 >UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IS08_9CHRO Length = 472 Score = 154 bits (389), Expect = 4e-36, Method: Composition-based stats. Identities = 56/380 (14%), Positives = 119/380 (31%), Gaps = 44/380 (11%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR----EVTAWAQLH 67 + K E ++ + G + R + S + + ++ Sbjct: 23 FQKLLKSEIIEDILKEMGVKYKSRIYNPIVIIWSFLSQVLDPDHSCQNAVSRIISYLASE 82 Query: 68 DVATLS---DVALLKRLRNAADWFGI---LAAQTLAVRAAVTGCTSGKRLRLVDGTAISA 121 + T S R + + ++A+ + G+ ++ +DG+ +S Sbjct: 83 GIETPSENTSAYCQARKKLPEELLKKLLEISAKGNEEKVDKKHLWHGRCVKSIDGSTVSM 142 Query: 122 PGGGSAE-----------------WRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQT-- 162 P + ++ + + T + + + T Sbjct: 143 PDSLKNQEAYPQHGSQKKGCGFPLAKIGVLFSYATGSVVGIVIDIFKTHDIKLARKLTDY 202 Query: 163 --ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDC 220 A +I + DR F S + I S D ++R+H L+ F + Sbjct: 203 LDAGDILLGDRAFCSYID-IYSWKKKGIDSVMRLHQGRLQKGKKRPKYTVSPPFKKKKKT 261 Query: 221 GKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLE 280 K +++ +K SLP + L + Sbjct: 262 RKCPHDRLILWEKPKRKPKDISKE--DFYSLPKDLVLREVH----------CYICIPGFR 309 Query: 281 AAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIF 340 +++ T + EY + + D Y RWQ E+ + +K+ L +D L + PE+ + I+ Sbjct: 310 TKEIIVVTTLIDAIEYPSSDILDLYDQRWQAEVNLRNIKTTLGMDILTCQTPEMVRKEIY 369 Query: 341 ANLLAAFLIDDIIQPSLDFP 360 LLA + I+ + D Sbjct: 370 VYLLAYNFLRSIMYDAGDIF 389 >UniRef50_B8FXQ3 Transposase IS4 family protein n=8 Tax=Desulfitobacterium hafniense RepID=B8FXQ3_DESHD Length = 414 Score = 153 bits (386), Expect = 9e-36, Method: Composition-based stats. Identities = 50/380 (13%), Positives = 110/380 (28%), Gaps = 68/380 (17%) Query: 6 DNWSAILAHIGKPEELDTSARNA-GALTRRREIRDAATLLRLGLAYGPGGMSLREVTA-- 62 ++ + + + R +++ L + A +LR++++ Sbjct: 9 STFTQVFQPFFSKDLWKKIDQEVPNLDQRNYKLKTNQLTLLISHAQLQEYKALRKISSNV 68 Query: 63 ----WAQLHDVATLSDVALLKRLRNAA-----DWFGILAAQTLAVRAAVTGCTSGKRLRL 113 +++ + ++S + +RLR F + + + +L + Sbjct: 69 QSNDFSEAIGLESISHSQISRRLRTLPIKVSEMLFKGVLNKVAQKKGDGKIQQRLGKLYM 128 Query: 114 VDGTAISAPGGGSAE-----------WRLHMGYDPHTCQFTDFELTDSRDAERL---DRF 159 +D + IS L + +D + +T ++ A+R + Sbjct: 129 IDASVISLCLSRFPWAVFRKIKAGVKMHLRLSFDEMA-IPDEVIITPAKTADRKKLDELI 187 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 D + I DRG+ E ++ R+ + T + G + Sbjct: 188 VVDKDALTIFDRGYIDY-LLFDEYCEKEIRFVTRLKNNAVIEFTGVERPVEEEGSIEE-- 244 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 + +++G K + R V + Sbjct: 245 -----DVDIILGTGTRK------------------------------MKHTLREVTIDDN 269 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 +L SAE++ + YR RWQIEL FK LK + I Sbjct: 270 VNEPFTILTNDFD---LSAEELGEVYRYRWQIELFFKWLKQHAQIKHFYGTSEAAVINQI 326 Query: 340 FANLLAAFLIDDIIQPSLDF 359 +L+ + + Sbjct: 327 RLDLMTYCTLILLKLEVEHQ 346 >UniRef50_A7C1C1 IS231-related transposase n=6 Tax=Beggiatoa sp. PS RepID=A7C1C1_9GAMM Length = 445 Score = 153 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 68/360 (18%), Positives = 117/360 (32%), Gaps = 36/360 (10%) Query: 22 DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRL 81 D G + R+R+ + + L + + E A + + +S L KR Sbjct: 20 DALGETTGFIKRKRKFTGSTFIKTLVFGWMQTPQATLEELVQAGVLNDIEISAQGLDKRF 79 Query: 82 R-NAADWFGILAAQTLAVRAAVTG------CTSGKRLRLVDGTAISAPG----------- 123 +AD + Q +A + L D T ++ P Sbjct: 80 TPKSADLARAVLEQAVAEAVRAPNAVPIELLNRFSSVTLFDTTILNLPDELYQVWAGTGG 139 Query: 124 ---GGSAEWRLHMGYDPHTCQFTDFELTDSRDAE---RLDRFAQTADEIRIADRGFGSRP 177 + + +GYD T Q L + + +L + ++IAD G+ S Sbjct: 140 NGPTSRSALKGEIGYDLKTGQLIGPLLLPGKTHDNAGKLPQMELEECSLQIADLGYFSIA 199 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMM--GFLRGLDCGKNGETTVMIGNSGN 235 + + + R+ + FD+ + E V++ Sbjct: 200 KMAENF-DANVFCLSRLRHDA-VLFDEQEEEFDLSLYTLFMKKNNRLRAELNVLL----- 252 Query: 236 KKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET--LEAAGHVLLLTSLPE 293 P RL +P + + + +K + A L LL+T+ P Sbjct: 253 -VRYEKLPVRLFIERVPEMISSKRRRQANKGASKKKKGKTASKKSLSLCDFTLLVTTAPS 311 Query: 294 DEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 + S ++ Y RWQIEL FK KS LD P +I+ LLA + II Sbjct: 312 VQLSFDEALVLYGARWQIELLFKLWKSHAKLDTSIRPNPWRICRYIYIKLLACLVQHWII 371 >UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanella RepID=A6WTA0_SHEB8 Length = 446 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. Identities = 58/370 (15%), Positives = 116/370 (31%), Gaps = 35/370 (9%) Query: 12 LAHIGKPEELDTSARNAGALT-RRREIRDAATLLRLGLAYGPGGMSLREVTAWA-----Q 65 LA + +PE + + + G T RRR++ A + + G S+R + Q Sbjct: 27 LADVLEPELIQSCLDSQGVATLRRRKLPMDAMIWAVIGMALFRGESVRSLINKLDIVLPQ 86 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVT--GCTSGKRLRLVDGTAISAPG 123 D S V R R ++ + +++ A G L VDG P Sbjct: 87 EIDYVARSAVT-QARKRLGSEVIREVFSRSANTWHARAEHPHWCGLNLYGVDGVVWRTPD 145 Query: 124 GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSL 183 + + + ++ + S+ Sbjct: 146 SVQNQAAFARTANASGEA-------AYPQIRMVCLMELSSHLLV---------NSAFDSV 189 Query: 184 AFGEADY----IVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 239 A E + I + L ++ + + + G Sbjct: 190 AENEMNLASQLIPSIPNHSLTLFDRGFYSLGLLHAWQQAQPDSHWLLPLKKGTQYEVVRT 249 Query: 240 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 + + ++ P+ K + + R++ +T++ +L + Y +E Sbjct: 250 LGKHDQWVKLTTTPQA---RKKWPQLPDTLEARLLT-KTVKGKSVAILTSLTDPMRYPSE 305 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDAL--RAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 + D Y RW+IEL ++ +K L R++ PEL ++ LLA LI + Sbjct: 306 DIVDLYAHRWEIELGYREMKQHLLESRFTLRSQLPELVTQELWGVLLAYNLIRYKMLLMA 365 Query: 358 DFPPRSAGSE 367 P ++ Sbjct: 366 KSLPSVHPNQ 375 >UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomycetales RepID=A8M893_SALAI Length = 451 Score = 152 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 70/386 (18%), Positives = 107/386 (27%), Gaps = 76/386 (19%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLA---YGPGGMSLREVTAWAQLHD 68 L + E +D RR + A ++ L LA + G A L Sbjct: 25 LTRLVPFEMIDDVLAATRRTQRRVRLLPARVVVYLLLAGCLFADCGYRQVWAKLVAGLRG 84 Query: 69 VA--TLSDVAL-LKRLRNAADWFGILA---AQTLAVRAAVTGCTSGKRLRLVDGTAISAP 122 + SD AL R R L A A G +VDGT I+ Sbjct: 85 LPVADPSDSALRQARQRLGPAPLRALFDLLRGPAATSAVAAVRWRGLLPVVVDGTMIAVA 144 Query: 123 GGGSA-----------------EWRLHMGYDPHTCQFTDFELTDSRDAERLDRF----AQ 161 + RL T D S E + Sbjct: 145 DSPANLGRYGKHRCNNGGSGYPTLRLSALLTCGTRSVIDAVFDPSTTGEITQAHRLTRSL 204 Query: 162 TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCG 221 A + +ADR + + + I + AD ++R +T + + G Sbjct: 205 RAGMLLLADRNY-AAADLIGAFTATGADLLIRCKSGRKLPMTRRCRDGSWLSVIDGQ--- 260 Query: 222 KNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEA 281 P R+I + + Sbjct: 261 ---------------------PVRIIEARIS--------------------ITTTAGSHT 279 Query: 282 AGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL-LHLDALRAKEPELAKAWIF 340 + L+ T L Y A + Y RW+IE A+ LKS L LRA+ P+ I Sbjct: 280 GDYRLITTLLDPRRYPAADLVRLYHQRWEIETAYLELKSTILGGRVLRARTPDGVDQEIH 339 Query: 341 ANLLAAFLIDDIIQPSLDFPPRSAGS 366 A L+ ++ + + D P Sbjct: 340 ALLIVYQVLRTAMVDATDSRPGLDPD 365 >UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=Gammaproteobacteria RepID=Q7MLW1_VIBVY Length = 445 Score = 152 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 71/374 (18%), Positives = 120/374 (32%), Gaps = 34/374 (9%) Query: 4 SHDNWSAILAHIGKPEELDTSARNA--GALTRRREIRDAATLLRLGLAYGPGGMSL---- 57 +S + P+E A A RRR + L + S+ Sbjct: 20 QLTTFSEHI-----PDEWVAKAATLSDKATIRRRRLPSDMVLWLIVGMAFFRNESIAEVA 74 Query: 58 REVTAWAQ-LHDVATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAV---TGCTSGKRLR 112 R + A+ L D L+ AL R R L Q G ++ Sbjct: 75 RRMNVCAEGLADEELLAKSALTQARQRLGKAAPEWLFRQCSHTWGLERYPEDTWQGLQVF 134 Query: 113 LVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRG 172 +DG +E R H G + + T + + I A Sbjct: 135 AIDGALFRTADT--SELREHFG----SGNTSSERQTPHPVLRVVTMMNVRSHVIVDAAIS 188 Query: 173 FGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGN 232 R E ++ +I + + L D++ L+ ++ G Sbjct: 189 PYRRGEIPLAMP-----FIDSLPDNSVTLLDKGFYGADLLLSLQNSGSNRHWLLPAKKGV 243 Query: 233 SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLP 292 L+ + + P+ K + + R V + H + TSLP Sbjct: 244 KFRLLDDEESDDMLVEMKVSPQA---RKKNPNLPEKWQVRAVTYQ--VQGKHKTVFTSLP 298 Query: 293 EDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD--ALRAKEPELAKAWIFANLLAAFLID 350 +EY AE VA+ Y RW+I+L ++ +KS + + LR+K EL ++ LL L+ Sbjct: 299 REEYDAESVAELYHERWEIKLGYRDIKSSMQHNALVLRSKTVELVYQELWGLLLGYNLVR 358 Query: 351 DIIQPSLDFPPRSA 364 + R A Sbjct: 359 REASQAAVAHGRMA 372 >UniRef50_A4J2U7 Transposase, IS4 family protein n=3 Tax=Desulfotomaculum reducens MI-1 RepID=A4J2U7_DESRM Length = 413 Score = 152 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 54/379 (14%), Positives = 122/379 (32%), Gaps = 66/379 (17%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVT 61 N + ++ + I + L + + +++ + + A LR ++ Sbjct: 4 NTNKSTYNQLFQTIYNEKFLSNV-KESEVDAYAKKLTVIKLIQMISYAQLEQLKGLRHIS 62 Query: 62 A------WAQLHDVATLSDVALLKRLR-----NAADWFGILAAQTLAVRAAVTGCTSGKR 110 ++ + ++S L ++LR F + Q + R Sbjct: 63 NSLNDDNFSSAVGLDSISASQLSRKLRDLSPELTQSLFSDIVHQFGTEIGFKSIRQELGR 122 Query: 111 LRLVDGTAISAP---------GGGSAEWRLHMGYDPHTC--QFTDFELTDSRDAER--LD 157 + L+D + IS + +LH+ + ++ A++ +D Sbjct: 123 IYLIDSSTISLCLSRYRWAEFRKTKSGVKLHLRIQLLEQGVLPDKAIIKPAKSADKTQMD 182 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 D + + DRG+ + + ++ R+ + + F Sbjct: 183 ALVVEKDALNVFDRGYLDYKR-FDNYSNNGTRFVSRLKSNAIVET--------LEEFPTN 233 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 D + V++G G K P R+++ E Sbjct: 234 QDSLIKKDHKVILGKDGTTKMQNPL-----------------------------RLIETE 264 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 E +++ + E SAE+++D YR RWQIEL FK +K + + + Sbjct: 265 DTEGKPVIIITN---DFELSAEEISDIYRYRWQIELFFKWIKQHFCVKHFYGLSQQAVEN 321 Query: 338 WIFANLLAAFLIDDIIQPS 356 + L+ L+ + + + Sbjct: 322 QLMIALITYCLMMLLKKKT 340 >UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZQ0_9PLAN Length = 457 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 65/392 (16%), Positives = 111/392 (28%), Gaps = 63/392 (16%) Query: 1 MNYSHDNWSAILAHIGKPEEL-DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLRE 59 + + S + + + P + G R R + + S ++ Sbjct: 12 VQSKFRDVSNLSSELLLPSGVVAAICHEIGFSFRERIYSPMIVVWMFVMQTLSADHSCQQ 71 Query: 60 ----VTAWAQLHDVATLS---DVALLKRLRNAADWFGILAA---QTLAVRAAVTGCTSGK 109 + AW ++ S R R F L A + G+ Sbjct: 72 VVTRLNAWRLAQGLSRCSGDTTSYCQARRRLPIALFQRLLAWTARKCDEAGLGDWRYQGR 131 Query: 110 RLRLVDGTAISA-----------------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRD 152 + +VDGT ++ PG G R+ + T T F + Sbjct: 132 EVIIVDGTTVTMADTRANQTAFPQIENQKPGCGFPLARIVQVFSLATGAATMFAMGRYAG 191 Query: 153 AERLDRFAQTA-------DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE 205 E + EI +ADR + S S D + R H R Sbjct: 192 KETGETSLLRTLLSQFHSGEIVLADRYYASFWLLALS-DLRGIDIVARAHHRRKIDFRRG 250 Query: 206 GMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLS 265 + D + + + + + R + Sbjct: 251 LRQGDCDQIVGYAKP----QRPTWMTTDEYDQYPSSILVRHLRY---------------- 290 Query: 266 ENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD 325 V L T L D Y AE +AD YR RWQ EL + LK + +D Sbjct: 291 -------EVTQRGFRTRRITLATTLLQGDVYRAEDLADLYRRRWQAELHIRSLKIQMQMD 343 Query: 326 ALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 LR K P + + + +++ L+ + + Sbjct: 344 HLRCKSPAMVRKELHCHMIGYNLVRAAMLATA 375 >UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_ALISL Length = 441 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 51/356 (14%), Positives = 107/356 (30%), Gaps = 29/356 (8%) Query: 10 AILAHIGKPEELDTSARNAGALT-RRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ--- 65 LA + +D + +T R+R++ + + L S++++ Sbjct: 25 ETLADLLPIHLIDEAYSLTDTVTMRKRKLTLESMVWLLVGMAIYNNKSMKDLVNQLDIVD 84 Query: 66 LHDVATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAVTGC--TSGKRLRLVDGTAISAP 122 A ++ AL +R + + + +G L VDG AP Sbjct: 85 RTGKAFVAPSALTQRRKNLGEAAMKAVFERMTSSWLKSANLPKWNGLTLLGVDGVVWRAP 144 Query: 123 GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRS 182 E F+ + T + + ++ I F + Sbjct: 145 DNQKNE-----------EAFSRQKGTQYPQVRMVCQMELSSH--LITASAFDNYNTNEMI 191 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPF 242 LA I + ++ + ++ + Sbjct: 192 LAEK---LIDSTPDHSVTMFDKGFYSLGLLHKWQMTGSERHWLIPLKKNTQYEIIRSLGR 248 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 +L+ + P K R+V ++ + +L + + Y + + Sbjct: 249 NDKLVILRSNP---RARKLFSNLPETMTARLVT-RKIKGKDYQVLTSMIDPLRYPLKDIV 304 Query: 303 DCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 Y RW+IEL ++ K + + LR++ PEL K ++ LL LI + Sbjct: 305 GLYEHRWEIELGYREQKQYMLGNRLTLRSRLPELVKQELWGILLTYNLIRYQMVEL 360 >UniRef50_C6N0W0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N0W0_9GAMM Length = 453 Score = 148 bits (372), Expect = 4e-34, Method: Composition-based stats. Identities = 60/392 (15%), Positives = 113/392 (28%), Gaps = 65/392 (16%) Query: 11 ILAHIGKP--EELDTSARNAGAL--TRRREIRDAATLLRLGLAYGPGGMS---------L 57 ++ P ++ + + TR+R I L+ L S L Sbjct: 4 VMEDFWSPLQAMIEEVCEDFDKVWQTRKRVINT-QFLVTFILKLVLSKNSQGYKILLNEL 62 Query: 58 REVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVT--GCTSGKRLRLVD 115 E + ++ L + + R + F ++ + LA+R R+ VD Sbjct: 63 WETSEFSALQEQPVSASSICEARQKMPETIFTLINQKVLAMREESDTLPLWRNHRVFGVD 122 Query: 116 GTAISAPG-----GGSAEWRLH--------MGYDPHTCQFTDFELTD---SRDAERLDRF 159 G+ I+ P G A + Y + D L R Sbjct: 123 GSRINVPHELLEAGYKAPIKQQYYPQGLMSTLYHLGSGLIYDGILEPVKGERICLLSHME 182 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 T ++ + DRG+ S ++++ I R+ + D + Sbjct: 183 KLTLGDVLVLDRGYFSYLILVKAIE-RGIHLICRMQSGPVNKAVQAFWDSDKEDEVISYI 241 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 + G P RLI T+ Sbjct: 242 PSSPVKYESK--KQGYDIELNPIELRLIKY----------------------------TI 271 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 + +V T L ++Y + Y RW IE +K K + ++ ++ + Sbjct: 272 DNETYVCCTTLL-GEQYPLNEFPAVYHGRWGIEELYKISKEFVDVEDFHSRSERGVRQEC 330 Query: 340 FANLLAAFLIDDIIQPSLDF-PPRSAGSEKKN 370 +A++L L + PP S + N Sbjct: 331 YAHMLLINLARIFEAEADKQLPPPSEPDNRDN 362 >UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LF21_FRASN Length = 420 Score = 148 bits (372), Expect = 4e-34, Method: Composition-based stats. Identities = 67/388 (17%), Positives = 123/388 (31%), Gaps = 63/388 (16%) Query: 11 ILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY----------GPGGM--SLR 58 +LA I + +D L +R+ + A ++ +A + +LR Sbjct: 26 VLARIVPRDLVDEVLAETRRLEQRKRLLPARVVVYFTMAMCLFFDDDYDEVMRRLVGTLR 85 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAV--TGCTSG-KRLRLVD 115 + +W V + ++ R R + +L + A + G G +RL VD Sbjct: 86 WLGSWKGDWKVPSTGAIS-QARTRLGPEPLKLLFERVAVPVAGLGTKGAWLGSRRLVAVD 144 Query: 116 GTAISAPGG-----------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDR 158 G + + + + T + ER Sbjct: 145 GVHLDTADTPENADAFGRFSHGPKTAAFPQVHVVALAECGTHAVFAAAIGAYTSDERSLA 204 Query: 159 FAQ----TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGF 214 + ADR F ++L AD + RV+ + + Sbjct: 205 ATLFDACEPGMLLTADRNFYGYGLWQQAL-ATGADLLWRVNANLTLPVIRALPDGSYLSL 263 Query: 215 L--RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR 272 L + + G+ + P R+I S+P + Sbjct: 264 LIDPKIPVARRGQLIADARAGHAPPTESALPVRVIEYSVPDHE----------------- 306 Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAK 330 E + L+ L + +A ++A Y RW+IE F +K+ + LR+K Sbjct: 307 ----ENGTSELICLITNILDPTDVAAIELATAYHERWEIESTFDEIKTHQRGEKRVLRSK 362 Query: 331 EPELAKAWIFANLLAAFLIDDIIQPSLD 358 PEL K I+A LL + I ++ + D Sbjct: 363 NPELVKQEIWALLLTHYAIRSLMIEAAD 390 >UniRef50_P03835 Transposase insG for insertion sequence element IS4 n=377 Tax=root RepID=INSG_ECOLI Length = 442 Score = 148 bits (372), Expect = 4e-34, Method: Composition-based stats. Identities = 49/378 (12%), Positives = 109/378 (28%), Gaps = 59/378 (15%) Query: 16 GKPEELDTSARNAGALT-RRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL---HDVAT 71 PE + +G +T R+R + + + L ++ + + Sbjct: 27 LDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPF 86 Query: 72 LSDVA-LLKRLRNAADWFGILAAQTLAVR--AAVTGCTSGKRLRLVDGTAISAPGGGSA- 127 ++ A + R R ++ + +T + A G L +DG P Sbjct: 87 VAPSAVIQARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPEND 146 Query: 128 ----------------EWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADR 171 + ++ + + T +++E Sbjct: 147 AAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSE----------------- 189 Query: 172 GFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIG 231 E L L + ++ ++ + G Sbjct: 190 -----NELAEQLIEQ-------TGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKG 237 Query: 232 NSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSL 291 + L+ + P+ A L +E + V + LL + Sbjct: 238 AQYEEIRKLGKGDHLVKLKTSPQ-ARKKWPGLGNEVTARLLTVTRK---GKVCHLLTSMT 293 Query: 292 PEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLI 349 + ++ D Y RW+IEL ++ +K + LR+K+PEL + ++ LLA L+ Sbjct: 294 DAMRFPGGEMGDLYSHRWEIELGYREIKQTMQRSRLTLRSKKPELVEQELWGVLLAYNLV 353 Query: 350 DDIIQPSLDFPPRSAGSE 367 + + ++ Sbjct: 354 RYQMIKMAEHLKGYWPNQ 371 >UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanobacteria RepID=B0CC46_ACAM1 Length = 482 Score = 148 bits (372), Expect = 4e-34, Method: Composition-based stats. Identities = 64/375 (17%), Positives = 121/375 (32%), Gaps = 59/375 (15%) Query: 15 IGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLRE----VTAWAQLHDV- 69 I L+ + R R TL + SLR +T W + Sbjct: 40 ILPASRLEELLKEEAFSYRNRIYSPIVTLWAMLYQVLSADKSLRNTVKCITTWLTAAGIQ 99 Query: 70 -ATLSDVALLK-RLRNAADWFGILA---AQTLAVRAAVTGCTSGKRLRLVDGTAISAPGG 124 + A K R R L A+ LA + G+ +++ DGT + Sbjct: 100 PPSSDTGAYSKARSRFPESLLQRLIPESAECLAQPLSPEHLWCGRPVKVYDGTTVLMADS 159 Query: 125 GSAEW-----------------RLHMGYDPHTCQFTDFELTDSRDAE----RLDRFAQTA 163 + + RL + + T + +E RL Sbjct: 160 AANQASYPQHGNQTAGCGFPIARLVVFFCLVTGAVASACIASWDTSEIVMSRLLYQDLEV 219 Query: 164 DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKN 223 ++ +AD+ +GS + + AD ++R H F +G G Sbjct: 220 GDVVMADQAYGSY-VDLAIIQQHRADGVLRKHH------------ARKTDFRKGNKHGIG 266 Query: 224 GETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAG 283 + + + LI +L + + + + Sbjct: 267 DHQVTWHKPAQRPEHMSEQDFALIPQTLVVREVCLR--------------LSLKGFRDQH 312 Query: 284 HVLLLTSLPEDEYSAEQVADCYRLRWQI-ELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 +++ T L YSA Q+ Y RW + E+ + LK+ L ++ L AK P++ + I+ + Sbjct: 313 IIVVTTLLDAQRYSAGQLTRLYGWRWPVAEVNLRHLKTTLKMEMLSAKTPDMVRKDIWVH 372 Query: 343 LLAAFLIDDIIQPSL 357 LL L+ +++ + Sbjct: 373 LLGYNLLRSLMELAA 387 >UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio RepID=B2LS82_9VIBR Length = 440 Score = 147 bits (371), Expect = 6e-34, Method: Composition-based stats. Identities = 49/364 (13%), Positives = 110/364 (30%), Gaps = 31/364 (8%) Query: 10 AILAHIGKPEELDTSARNAGALT-RRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLH- 67 + E ++ + + G ++ R+R + + + S+++V +L Sbjct: 21 DVFNKHIPWEWVEEAVQQTGRVSLRKRRLPAEQAVWLVLGIGLQRNRSIQDVCDKLELAF 80 Query: 68 -----DVATLSDVALLK-RLRNAADWFGILAAQTLAVRAAVTGCTS--GKRLRLVDGTAI 119 ++ ++ +++K + R L T + G +L VDGT Sbjct: 81 PDVDGELTPMATSSIIKGKERLGDKPMRYLFKTTAQQWEQQSDFDEVCGLKLLSVDGTYF 140 Query: 120 SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC 179 + H G+ F L + ++D FG Sbjct: 141 KTHNTEENQ---HFGFAQKGASFPSV----------LAVTLMSTRSHLVSDAAFGPVTNS 187 Query: 180 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAG 239 S A + L ++ +G + T + + Sbjct: 188 EISYAQQ---LVGSAPDDSLTLFDRGFTSAELFTSWQGASSNSHWLTPIKTKMRYDIIES 244 Query: 240 APFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAE 299 LI + + P+ K R + R++ T + + + L + Y + Sbjct: 245 YTDYDHLIEMPVSPQA---QKQTPYLGKRWQARLILIPTPKGEIKGFITSCLCPERYLFD 301 Query: 300 QVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 + Y RW+IE ++ LK + LR+K+ ++ L + ++ + Sbjct: 302 DLVKVYWERWEIERSYGELKQYQLQNKPTLRSKKKVGIYQELWGILTSYNIVRLEMAEMA 361 Query: 358 DFPP 361 Sbjct: 362 KQHE 365 >UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZMB0_PLALI Length = 497 Score = 147 bits (370), Expect = 6e-34, Method: Composition-based stats. Identities = 50/368 (13%), Positives = 95/368 (25%), Gaps = 60/368 (16%) Query: 22 DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR-EVTAWAQLHDVATLSDVAL--- 77 + ++ G L R + A L + + A +S Sbjct: 57 EQASIEDGGLVYTRGVTLWAMLSQALFTDVQRACRAAVQRVAVYYALSGIRISSTNTGAY 116 Query: 78 -LKRLRNAADWFGIL---AAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEW---- 129 R + L Q G R ++DGT S P + Sbjct: 117 CRARAKIPEGVVQRLAVGVGQRCEAAVPDKWRWHGFRTLVIDGTTCSMPDTQENQAEYPQ 176 Query: 130 -------------RLHMGYDPHTCQFTDFELTDSRDAERLDRFA-------QTADEIRIA 169 R T + A ++ ++ Sbjct: 177 PSSQGKGLGFPILRAVALTSLATGMILALVTGPCAGKATGETALFRTLFDQLKAGDLVLS 236 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 DR + + L +++ R+H + + + Sbjct: 237 DR-YYGGWFMLALLQELGVEFVTRLHQFRIADFHQGKRLGQRDHVVAWAKP----QKPAW 291 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + + + R I V +P A V++ + Sbjct: 292 LDQATYDRLPDQLEVREIEVQVP-----------------------VPGFRTASLVVVTS 328 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 + E++A YR RW +EL + +K+ + L LR +P + ++ LLA LI Sbjct: 329 LRDHRRFPREELALLYRRRWTVELELRDIKATMDLAVLRCTKPAWVRQELWTGLLAYNLI 388 Query: 350 DDIIQPSL 357 + S Sbjct: 389 RQSMLQSA 396 >UniRef50_A1HQH6 Transposase, IS4 family protein n=2 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HQH6_9FIRM Length = 400 Score = 147 bits (370), Expect = 7e-34, Method: Composition-based stats. Identities = 63/368 (17%), Positives = 121/368 (32%), Gaps = 67/368 (18%) Query: 16 GKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWA-------QLHD 68 E+ +AG ++++ L + +A SLR++ +L Sbjct: 7 FPLEKFLQIVASAGCDRYVKKLKTLKLLYLMLVAQFLRLDSLRDIANRLTCDKQLQKLLH 66 Query: 69 VATLSDVALLKRLRNAA----DWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP-- 122 + ++S L +RLRN + + + +A TG +L ++D + I+ Sbjct: 67 LTSISASTLSRRLRNIDHRVWEQVFAEVKRQIWQQANKTGAVRQYQLNVIDSSTITLCLR 126 Query: 123 -------GGGSAEWRLHMGYDPHTC--QFTDFELTDSRDAE---RLDRFAQTADEIRIAD 170 + +LH H LT +R A+ + + D + + D Sbjct: 127 KYLWADYRKTKSGIKLHQRITIHDGNSYPDSAVLTSARKADKTVMDELVVTSPDALNVFD 186 Query: 171 RGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMI 230 RG+ + ++ R+ + D++ E V + Sbjct: 187 RGYVDYAKW-DDYCRKGIRFVSRLKSNAV---------IDVLEEKSVETNQVLAEKIVRL 236 Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 GN+ + P RLI V+++T+ Sbjct: 237 GNAYTTQMTHP--VRLIETR----------------------------DNQGNAVIIVTN 266 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 E A +++D YR+RWQIEL FK +K L + I+ L+ L+ Sbjct: 267 --ELTLPAAEISDIYRMRWQIELFFKWIKQHLVVKEFFGTSQNAVYGQIWLALIGYCLLQ 324 Query: 351 DIIQPSLD 358 ++ Q Sbjct: 325 NLQQELPK 332 >UniRef50_C9KS84 Transposase domain protein n=5 Tax=Bacteroidales RepID=C9KS84_9BACE Length = 407 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 63/379 (16%), Positives = 117/379 (30%), Gaps = 72/379 (18%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA--WA 64 + ++ + +++ +R G + LL + A SLRE+ Sbjct: 12 VYGQLIKSLDH-DKIVEISRKHGGERYVKSFDGYTHLLTMLYAVIMRFDSLREIETTMIT 70 Query: 65 QLHDVATLSDVALLKR-------LRNAADWFGILAAQTLA---VRAAVTGCTSG-----K 109 ++ + + + KR R + +F + + +G K Sbjct: 71 EVRKLHHVGIERIPKRSTLSDANARRSEKFFEEVYHNLYEANKEKLTSDSRRNGTEEWIK 130 Query: 110 RLRLVDGTAISA--------------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAER 155 RLR++D T IS G ++H + D + T + + Sbjct: 131 RLRIIDSTTISLFSNAIFKGVGRHPKTGRKKGGIKVHSVIHANEGVHCDVKFTSAATNDS 190 Query: 156 LDRFA--QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 DEI DR + + + +L Y+ ++ + Sbjct: 191 FMLAPNHFRHDEIVALDRAYINYAK-FEALTERNVVYVTKMKKNLVY-----------DT 238 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 + + NGE +K G AR+I Sbjct: 239 LVDCMYQNNNGEMEYREQVVVFRKDGINHIARIITY------------------------ 274 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 V + + + LLT+ + E E + YR RWQIE FK++K L + Sbjct: 275 VDVKKGKQPKLISLLTNDFDMEL--ETIVAIYRRRWQIESLFKQIKQNFPLRYFYGESAN 332 Query: 334 LAKAWIFANLLAAFLIDDI 352 K I+ L+A L+ + Sbjct: 333 AIKIQIWVTLIANLLLSVL 351 >UniRef50_P55729 Putative transposase y4zB n=4 Tax=Rhizobiaceae RepID=Y4ZB_RHISN Length = 356 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 50/284 (17%), Positives = 91/284 (32%), Gaps = 47/284 (16%) Query: 81 LRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISA--------PGGGSAEWRLH 132 R F T LRL+D T I G ++H Sbjct: 57 ARRPVAVFAETFGLLAGQLDRQTRREGRAMLRLIDSTPIPLGKLCGWAKSNGRIRGMKMH 116 Query: 133 MGYDPHTCQFTDFELTDSR--DAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADY 190 + YDP + ++TD+ DA+ A + I D+G+ ++A +A + Sbjct: 117 VVYDPDSDCPRLLDITDANVNDAQIGRTIAIESGATYIFDKGYCHY-GWWTAIAEAKAFF 175 Query: 191 IVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS 250 + R + + + G + TV + + G+ K P Sbjct: 176 VTRPKSNMGLKVVRQRR----IKVAEGDGFTVIDDATVRLASKGDSKLPIPL-------- 223 Query: 251 LPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQ 310 R + + + LL + + A +A Y+ RWQ Sbjct: 224 ---------------------RRLTVKRADGDTITLLTN---DRKRPAVAIAALYKGRWQ 259 Query: 311 IELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 IEL F+ +K L + + + +FA ++A L+ + Sbjct: 260 IELLFRWIKQHLKIRSFLGNNDNAVRLQLFAAMIAYALLRIAAR 303 >UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitrococcus mobilis Nb-231 RepID=A4BL98_9GAMM Length = 426 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 55/364 (15%), Positives = 95/364 (26%), Gaps = 70/364 (19%) Query: 33 RRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT-LSDVALL------KRLRNAA 85 R R L S + A + + S ++ R R Sbjct: 13 RDRIFTPLVVLKAFLFQVLSQDGSCKHAVARVLSERLQSGQSANSINTGPYCKARQRLPR 72 Query: 86 DWFG---ILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGG-----------------G 125 + QTL RA G R+ L DGT P G Sbjct: 73 APLENAVRESGQTLHQRAPSAWGWRGHRVVLADGTTALMPDTLDNQREFPQQGNQQPGLG 132 Query: 126 SAEWRLHMGYDPHTCQFTDFELTDSRDAERLD-------RFAQTADEIRIADRGFGSRPE 178 R+ D+ L + + ++ +ADR + + Sbjct: 133 FPIVRIVALISLGAGAVLDYALGPYQGKGSGESSLFSTLLHTLQPGDLLLADRYYCTYAI 192 Query: 179 CIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKA 238 + + + H + ++ Sbjct: 193 MALLVH-HGVQGLFQKHAQRKPHWHRGERLGAKDHLIKWAKPP----------------- 234 Query: 239 GAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSA 298 P + +L + L G V + T Y Sbjct: 235 -----------RKPVWMSAQDYLKLPP-------TLTIRELAVNGIVYVTTLSNPKRYPR 276 Query: 299 EQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 +A+ YR RW IEL + +K+ + ++ LR K PE + I A+LLA L+ + + Sbjct: 277 RALAEHYRSRWTIELDLRSIKTDMAMERLRCKSPERVRKEIAAHLLAYNLVRANLNRAAQ 336 Query: 359 FPPR 362 + Sbjct: 337 CFEK 340 >UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria RepID=D2TH14_CITRO Length = 438 Score = 144 bits (362), Expect = 7e-33, Method: Composition-based stats. Identities = 54/379 (14%), Positives = 102/379 (26%), Gaps = 71/379 (18%) Query: 12 LAHIGKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSLRE----VTAWAQL 66 E + + A R+R++ + + S+ + + Sbjct: 21 FQRAIPLEWISQVLDSTNKASIRKRKLPAELVVWLIVGMGLYRDRSITDVVTKLDLVLSS 80 Query: 67 HDVATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAVT---GCTSGKRLRLVDGTAISAP 122 + TL+ ++ R R + + L T + G RL VDGT P Sbjct: 81 QEGETLAASSVARARQRLSDEPLRELFTLTASHWTQQEDKDDLWYGLRLFAVDGTLFRTP 140 Query: 123 GGGS------------------AEWRLHMGYDPHTCQFTDFELTDSRDAE--RLDRFAQT 162 RL + + + E + + Sbjct: 141 DTPELAEHFEYIKHRPDRHTEYPMVRLCAMMSLRSRLIHGVKFGPANTGEVSYAKQLSPQ 200 Query: 163 ADEIRIADRGFGSRPECIR-SLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCG 221 A + + DR + S I EA ++V + + D Sbjct: 201 AKSLTLFDRCYLSAELLINWQRRQQEAHWLVPLKGNTKYRIVETFAGGD----------- 249 Query: 222 KNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEA 281 L+ + + P+ K + R+++ E Sbjct: 250 -----------------------HLVEMQVSPQA---RKQDSSLPENWQARLIEYEDESG 283 Query: 282 AGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA---LRAKEPELAKAW 338 + + +Y AE + Y+ RW IE + LK L LR+++ Sbjct: 284 DYKGFITSLTEPGQYPAEALRYVYQERWSIENGYGELKQ-FQLSTATLLRSQKVSGIYQE 342 Query: 339 IFANLLAAFLIDDIIQPSL 357 I+ L A LI + Sbjct: 343 IWGLLTAYNLIRMEMSQIA 361 >UniRef50_C9LFX6 Transposase domain protein n=14 Tax=Bacteroidales RepID=C9LFX6_9BACT Length = 424 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 62/384 (16%), Positives = 119/384 (30%), Gaps = 74/384 (19%) Query: 8 WSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLH 67 +S I+ + K E L + GA + LL + A SLRE+TA Q Sbjct: 25 YSQIIKLLNKSEIL-RISSEQGAERYVKSFDAWTHLLVMLYAVIMRFDSLREITASLQAE 83 Query: 68 DV----------ATLSDVALLKRLRNAADW---FGILAA---QTLAVRAAVTGCTSG--- 108 + S +A + R+ A + + L A L+ + + + Sbjct: 84 ACKLRHLGIFMMTSRSTLADGNKRRSEAVFEAVYRDLYAKHRHLLSSDSRLCTRKNEPKW 143 Query: 109 -KRLRLVDGTAISAPGG--------------GSAEWRLHMGYDPHTCQFTDFELTDSRDA 153 KRL+++D T I+ ++H + +D T + Sbjct: 144 MKRLKIIDSTTITLFSNLLFKGVGRHPKTGKKKGGIKVHSIIQANEGVPSDIRFTSAATN 203 Query: 154 ERLDRFA--QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDM 211 + +I DR + + + + Y+ ++ + + M Sbjct: 204 DSFMLLPATLNRGDIIAMDRAYIDYAK-FQQMTERGVVYVTKMKKNLQYTIEEDVMC--- 259 Query: 212 MGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKG 271 ++ + K + S Sbjct: 260 --------------------------QTPEGVMQVRVQRVTF----RKKLKGGSSIVHHA 289 Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 R+V ++ + LLT+ + ++ D Y RW IEL FK++K L + Sbjct: 290 RIVTYVDVQKRKLISLLTN--DMTSDPLEIMDIYHKRWAIELLFKQIKQNFPLKYFYGES 347 Query: 332 PELAKAWIFANLLAAFLIDDIIQP 355 K I+ L+A L+ I++ Sbjct: 348 ANAIKIQIWVTLIA-NLLLMIMRR 370 >UniRef50_UPI0001C4271A transposase, IS4 family protein n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C4271A Length = 399 Score = 141 bits (356), Expect = 3e-32, Method: Composition-based stats. Identities = 55/342 (16%), Positives = 101/342 (29%), Gaps = 66/342 (19%) Query: 32 TRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVA-------TLSDVALLKRL--- 81 +++ L L ++ SL +++ + + T+S L ++L Sbjct: 16 KYVKKLTAYKFLQLLIISQLKETKSLTQMSKKLKDKEELQVQLAFDTISTSQLSRKLGDL 75 Query: 82 --RNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP---------GGGSAEWR 130 F L A RL ++D T +S A R Sbjct: 76 SPTLFEKIFHYLVLNIQAKMKQSPIIREIGRLHVIDSTTMSMSVSQYPWATFRKTKAGIR 135 Query: 131 LHMGYDPHTCQ--FTDFELTDSRDAE---RLDRFAQTADEIRIADRGFGSRPECIRSLAF 185 LH+ L ++ A+ D +D I + DRG+ + L Sbjct: 136 LHLRVVVTKELTLPDKGILLPAKHADRTQMGDLIEMDSDAIHLFDRGYIDYKQ-FDHLCL 194 Query: 186 GEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPAR 245 + +I R+ + +E + + V +GNS N Sbjct: 195 HDVRFITRLKKNAQVEVLSEQ--------IPQAGSPIVKDQEVFLGNSQNGTKM------ 240 Query: 246 LIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCY 305 R+++ + + +++ SAE++ D Y Sbjct: 241 ----------------------THPLRLIETQDSQGNVVMIVTNCFD---LSAEEIGDLY 275 Query: 306 RLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 R RW+IE FK +K L K I+ L+ Sbjct: 276 RYRWKIETFFKWMKQHLTFKTFYGKSENAVCNQIWVALITYC 317 >UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4D726 Length = 464 Score = 141 bits (354), Expect = 5e-32, Method: Composition-based stats. Identities = 68/385 (17%), Positives = 108/385 (28%), Gaps = 59/385 (15%) Query: 11 ILAHIGKPEELDTSARNAGAL-TRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL--- 66 +L P +D G R R + + + G R V A Sbjct: 3 VLTRWVPPVLVDEVLAATGRFEKRVRMLPARVVVYFVLAMTLFGDCGYRGVWAALTAGMP 62 Query: 67 -HDVATLSDVALLKRLRN-AADWFGILAAQTLAVRAAVTG---CTSGKRLRLVDGTAISA 121 H V S AL + R +L + G R+ DGT++ Sbjct: 63 GHLVPDPSAAALRQARRRLGTAPLALLFDRVCGPVGTKETPGVFWHGLRVVAWDGTSVEV 122 Query: 122 PGGGSA------------------EWRLHMGYDPHTCQFTDFELTDSRDAE----RLDRF 159 + + RL + T D E R Sbjct: 123 ADSAANVAHYGRHGKATSRPAGYPQVRLTALVECGTRALMGAVFGPMHDKELPQARRLLP 182 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 + +ADRG+ E IR A AD + RV L + L + Sbjct: 183 VLRPGILLLADRGYDGY-EAIRDAASTGADLLWRVQSGRLLPVIQ---PLPDGSHLSQIL 238 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPA---RLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 ++G+ A R+I + A + Sbjct: 239 DRRSGDRLAAWQRRKRPTPPPALTAMAVRVIRYQVTVTTADGRQH--------------- 283 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL--DALRAKEPEL 334 ++ L+ T L + A ++A+ Y RW+IE A+ LK L LR+ + Sbjct: 284 ----SSTVRLITTLLDPARHPAAELAELYHQRWEIETAYYGLKVTLRGSDRVLRSHTVQG 339 Query: 335 AKAWIFANLLAAFLIDDIIQPSLDF 359 + I+A L L I + Sbjct: 340 VEQEIYALLTVFQLTRTAIHNTAHI 364 >UniRef50_Q1VPP4 ISPg4, transposase n=7 Tax=Bacteria RepID=Q1VPP4_9FLAO Length = 411 Score = 140 bits (352), Expect = 9e-32, Method: Composition-based stats. Identities = 62/385 (16%), Positives = 125/385 (32%), Gaps = 68/385 (17%) Query: 2 NYSHDNWSAILAHI-GKPEELDTSARNA-GALTRRREIRDAATLLRLGLAYGPGGMSLRE 59 ++N I + P L S N + R + L SL + Sbjct: 6 RNKNNNKPVIRQILDLVPHWLFRSCTNTYKTDKGVHKYRTYDQFVALTFGQLNKCQSLND 65 Query: 60 VTAWAQLHDVATLSDVALLKR----------LRNAADWFGILAAQTLAVRAAVTGCTS-- 107 ++A + ++ +SD+ L + + F L + L+ +V Sbjct: 66 ISAGIGVSEI-FISDLGLTQSPARSTMSDGNKKRDWQVFESLYYRLLSHYKSVLKQHHNT 124 Query: 108 -------GKRLRLVDGTAISAP---------GGGSAEWRLHMGYDPHTCQFTDFELTDSR 151 GK ++L+D + IS +LH +D + +T+++ Sbjct: 125 HIIEEIKGKVVKLIDSSTISLCLAMFDWAEFRTAKGGIKLHTSWDYNLMIPDVVNITEAK 184 Query: 152 DAER--LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRF 209 +R L + D I + DR + + + ++ R+ L E Sbjct: 185 VHDRYGLKQLIFPKDTIIVEDRAYFDFELMLNRIKAENV-FVTRIKSNTLYETIEELELA 243 Query: 210 DMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 D + ++ +I + + ++ Sbjct: 244 DDVD--------QHILKDEII------------------------QLTSGRAIETGISKH 271 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 K R+V + + ++T+ + EY+ +A Y+ RW IEL FK LK L + Sbjct: 272 KLRLVHVYKEDENKVIAIITNQLDWEYN--TIAALYKKRWDIELFFKALKQNLQVKTFWG 329 Query: 330 KEPELAKAWIFANLLAAFLIDDIIQ 354 K+ I+ L+ L++ I + Sbjct: 330 TSENAVKSQIYVALINYLLLELIKR 354 >UniRef50_A6L0R8 Transposase n=13 Tax=Bacteroidales RepID=A6L0R8_BACV8 Length = 411 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 61/376 (16%), Positives = 116/376 (30%), Gaps = 68/376 (18%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA--WA 64 + ++ + K + L ++ G + LL + A SLRE+T + Sbjct: 12 AYGQLINLLDKSKILQ-ISQEKGGERYVKHFDAWQHLLIMLYAVIKRFDSLREITDSMFP 70 Query: 65 QLHDVA-----------TLSDVALLKRLRNAADWFGILAA----QTLAVRAAVTGCTSGK 109 + +A TLSD + + L + + + Sbjct: 71 EARKLAHLGISMMPRRSTLSDANARRSEGIFEAIYRDLYKTYRNELSSDSRNNPSSSWIN 130 Query: 110 RLRLVDGTAISAPGG--------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAE- 154 RL+++D T IS ++H + +D T + + Sbjct: 131 RLQIIDSTTISLFSNLIFTGVGRHPKTGKKKGGIKVHTNIHANEGVSSDIRFTSAATNDS 190 Query: 155 -RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 L T+ +I DR + + L+ Y+ ++ + ++A Sbjct: 191 FMLKPSNYTSGDIVALDRAYIDYAK-FEELSRAGVIYVTKMKKNLVYEVSA--------D 241 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 + + G + + K G + R+ Sbjct: 242 TIYMTESGLMALRERHVTFTKKVKDGDDI-------------------------KHHARI 276 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 V + G L+ + E SAE + YR RW+IEL FK++K L + Sbjct: 277 VTYVDQKKRGAKLISLLTNDMEMSAEDIVAIYRKRWEIELLFKQIKQNFPLRYFYGESAN 336 Query: 334 LAKAWIFANLLAAFLI 349 K I+ L+A L+ Sbjct: 337 AIKIQIWITLIANLLL 352 >UniRef50_C6J7R2 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J7R2_9BACL Length = 399 Score = 137 bits (344), Expect = 6e-31, Method: Composition-based stats. Identities = 58/350 (16%), Positives = 112/350 (32%), Gaps = 74/350 (21%) Query: 25 ARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL--------HDVATLSDVA 76 +R++ R R++ ++LL A S E++ + + ++S Sbjct: 22 SRDSVLDYRARKLFVGSSLLLFIEAQLQQRESYAEMSEHLEANEDFQAILGGLESISPSQ 81 Query: 77 LLKRLRNAA-DWFGILAAQTLAVRAAVTGCTSG-----KRLRLVDGTAISAP-------- 122 L ++++ + +L Q +T G +L ++D T I+ P Sbjct: 82 LSRKMKKLPLENLHLLFMQVTRQIQQLTENKPGITTKIGKLAIMDSTQITLPAILSKWAY 141 Query: 123 -GGGSAEWRLHMGY---DPHTCQFTDFELT--DSRDAERLDRFAQTADEIRIADRGFGSR 176 + ++H D T + D D E F + + DRG+ Sbjct: 142 CSASNHGVKMHTSLLVVDAKTMVPDKIIASTKDVADHEVAPNFTVDKEVTYVMDRGYQVH 201 Query: 177 PECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK 236 + ++ ++ RV + E F+R D Sbjct: 202 -KHFQAWVDQGMKFVARVKDNTRLTILKERALPKRGDFIRDADV---------------- 244 Query: 237 KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY 296 L + K R+++ + + + L+ + + Sbjct: 245 --------------------------TLPGQQMKLRLIEFQDQQGRLYRLVTS---RMDL 275 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 S Q+AD YR RWQIEL FK +K L L E ++ L++ Sbjct: 276 SVHQIADVYRHRWQIELFFKWIKQHLRLVKPHGYTAEAIWNQMYIALISY 325 >UniRef50_C7PAE4 Transposase IS4 family protein n=4 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PAE4_CHIPD Length = 412 Score = 137 bits (344), Expect = 8e-31, Method: Composition-based stats. Identities = 58/383 (15%), Positives = 114/383 (29%), Gaps = 70/383 (18%) Query: 8 WSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA----- 62 ++ +L+ I +D R A + + L+ + + SLRE+ Sbjct: 13 FNQLLSFIPTT-LIDKVCRETNADYYYKHFKAFDHLVTMLFSSFHQCTSLRELHTGLLAN 71 Query: 63 -----WAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAA------VTGCTSGKRL 111 + S ++ R R A +F L + + RL Sbjct: 72 QHRLHHLGIKHTPRRSTISDANRTRPVA-FFEKLYHRLYNHHYQAFSPDSRKRKSLVDRL 130 Query: 112 RLVDGTA-------------ISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAER--L 156 +VD T I G + H+ T + LT++ +R + Sbjct: 131 FIVDSTTVSLFSNVMKGAGVIRMDGRKKGGIKAHVLMTAKTELPSFTILTEAAKNDRIIM 190 Query: 157 DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRV-HWRGLRWLTAEGMRFDMMGFL 215 + I DR + + + E ++ RV ++ LT ++ + Sbjct: 191 PQLELLPGSIIAMDRAYVNYKLM-KEWTEKEITWVTRVTKSMKIKLLTRNRLK------I 243 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 G + + +GN ++ AR+I Sbjct: 244 LHKRKGILKDWVIQLGNPLTEEKSPVQTARVI---------------------------S 276 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 + LLT+ Y+ + Y+ RW IE+ FKR+K L+ + Sbjct: 277 IYDRNTKKKIHLLTN--NFTYTPTTIRKLYQKRWAIEMLFKRIKQNSQLNNFLGENKNAI 334 Query: 336 KAWIFANLLAAFLIDDIIQPSLD 358 ++ L+ L + + Sbjct: 335 SIQLWCTLIKDLLTKIVKDKLTE 357 >UniRef50_C9C7H0 Transposase n=5 Tax=Enterococcus faecium RepID=C9C7H0_ENTFC Length = 373 Score = 137 bits (344), Expect = 8e-31, Method: Composition-based stats. Identities = 63/377 (16%), Positives = 123/377 (32%), Gaps = 61/377 (16%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 + S W + + G E + + + +++ TL L A S RE+ Sbjct: 6 VKTSFQKWFSAINFSGLSENSQSLISDFDFYS--KKLDFQTTLKVLLHAVYEELPSYREI 63 Query: 61 TAWA------QLHDVATLSDVALLKRL-RNAADWFGILAAQTLAVRAAVTGCTSGKRLRL 113 + + +L +L +R + + Q +A +A + L+L Sbjct: 64 DRAFLDQRLCKELGIDSLCYSSLSRRAPEIKQEVLMEIFTQLVARISAQQPSSKTTSLQL 123 Query: 114 VDGTAISAPG---------GGSAEWRLHMG---YDPHTCQFTDFELTDSRDAER--LDRF 159 +D T I + +LH+ D F +T++ + +R L+ Sbjct: 124 IDSTTIPLNKAWFPWAKFRKTKSGIKLHLNLCYLDKTNQYPESFTMTNASEHDRNHLEVL 183 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 + DRG+ + + L ++ R + + D Sbjct: 184 VDKTQATYVVDRGYFDY-KLLDKLNRDGYFFVTRTKSNTKITILDQIEVAD--------- 233 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 + G + + + ++ + R+V T Sbjct: 234 --------------TTTRDGTIISDQQVIL-----------VGGVNHVTERFRLVTVLT- 267 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 + + ++T+L + S +VAD Y+ RWQIEL FK LK L + L + + A + Sbjct: 268 KGQKILRMVTNL--FDVSPNEVADMYQARWQIELLFKHLKQNLTIKRLYSHSEQGAINQV 325 Query: 340 FANLLAAFLIDDIIQPS 356 L+A L I Sbjct: 326 ILTLIATLLTYVIKIEL 342 >UniRef50_B8FEP3 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FEP3_DESAA Length = 422 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. Identities = 66/380 (17%), Positives = 106/380 (27%), Gaps = 67/380 (17%) Query: 12 LAHIGKPEELDTSAR-NAGALTRRREIRDAATLLRLGLAYGPGGMS---LREVTAWAQLH 67 + ++ + A + +R R + +L L L G + + Sbjct: 1 MERFLPADKSQSQAPFKSKDFSRNRILTLPV-VLALILNMVRPGKRVGYDEVLARFFAAA 59 Query: 68 DV--------ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGC-----TSGKRLRLV 114 + S R + + L + L + G+R+ + Sbjct: 60 SLMNGQNITPPDKSAFC-RARKKVPFEALTELYGKALEHAKDLAAKAPGTTWRGRRVLAI 118 Query: 115 DGTAISAPGGG-------------SAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFA- 160 DGT I P + + YD D + ER Sbjct: 119 DGTKIMLPRTKELLDAFGKCSHGWFPQTHACVLYDVLAGLPLDVAWGHYKSGERGLARDM 178 Query: 161 ---QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 +I + DRGF L D+IVR+ G + FL+ Sbjct: 179 FDGFLPGDILVLDRGFPGFA-FFLDLMEQGIDFIVRLRGDGQF--------AALRPFLQE 229 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 + E P R+ E A K R V+ Sbjct: 230 NRRDQIIEIP---------------PTRVAI----EEYARQGKPAPGPV---TLRFVKVS 267 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 + + T + Y +++ Y LRWQ E FK +K LL + +R K L Sbjct: 268 LGKGKSALYATTLVDRKRYKFKELKHLYHLRWQEEEFFKHMKDLLEAENIRGKSEALVDQ 327 Query: 338 WIFANLLAAFLIDDIIQPSL 357 I A L L +I S Sbjct: 328 EIVAVHLYHLLARILIMESA 347 >UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria RepID=Q12AI7_POLSJ Length = 458 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. Identities = 56/393 (14%), Positives = 103/393 (26%), Gaps = 78/393 (19%) Query: 2 NYSHDNWSAILAHIGKPEELD---TSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR 58 + + PE L+ R R L S + Sbjct: 19 QTKATHAVEFFNVLTSPELLETTEALLPEH----RERLYPPTVALSMFMRQVLEADGSCQ 74 Query: 59 EVTAWAQLHDVA------TLSDVAL-LKRLRNAADWFGILAA---QTLAVRAAVTGCTSG 108 + A ++ R R + G L + L +A G Sbjct: 75 KAVNGWAAQRAADGLRPCSVRTGGYCRARQRLPLEMVGTLTRETGRLLHEKALAQWLWRG 134 Query: 109 KRLRLVDGTAISAPGGGSAE-----------------WRLHMGYDPHTCQFTDFELTDSR 151 + ++LVDGT IS P + RL M T D + Sbjct: 135 RAVKLVDGTGISMPDTPENQERYPQPSTQAPGVGFPLARLVMVICLATGAALDMAVGPHS 194 Query: 152 DAERLDRFAQT-------ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTA 204 + ++ +AD + + I SL D + + + Sbjct: 195 GKGSGELGLVRRLLAGFCPGDVMLADALYCNYF-LIASLMAAGVDVLFEQNGSRITDFRR 253 Query: 205 EGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLL 264 +R + R + V+ Sbjct: 254 GQSLGPRDHIVRWPKPP----RPEWMTPEQYTGFPDELTVREVKVAH------------- 296 Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 VL+ T L + S ++ Y RW +EL + LK+ + Sbjct: 297 -------------------QVLVTTLLDYRKVSKNDLSALYARRWNVELDLRNLKTTTGM 337 Query: 325 DALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 D L + P++ + ++ +LLA +I ++ + Sbjct: 338 DVLSCQTPQMNEKQLWVHLLAYNVIRLLMAQAA 370 >UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4SUB1_AERS4 Length = 420 Score = 134 bits (338), Expect = 4e-30, Method: Composition-based stats. Identities = 67/381 (17%), Positives = 126/381 (33%), Gaps = 51/381 (13%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS----LREVTAWAQLH 67 + + P E + AR R R I + L A+G G + L + QL Sbjct: 10 INQLLTPAETECIARLCKFCLRLRAITPWMLVTSLLRAFGGGKVGAIACLHQHFNGLQLA 69 Query: 68 DVATLSDVALLKRLRNAA--DWFGILAAQTLA----VRAAVTGCTSGKRLRLVDGTAISA 121 +S +LR A + L + +A + + K++ L DGT+ + Sbjct: 70 HTHQVSYKPFHNQLRKPAFAQFMKALVERAIALRIGQQVTDVAQGAFKQVLLQDGTSFAV 129 Query: 122 PGG------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAER--LDRFAQTADEIR 167 A HM + +L+ +ER L + + Sbjct: 130 HKRLATVFPGRFKTISPAAIECHMTMSLLEQKPLCMQLSADTASERQFLPDAKKLTGSLL 189 Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 +AD G+ R + Y+VR + L ++ Sbjct: 190 LADAGYIDRA-YFAEVNKAGCFYLVRGRKGLNPKI---------------LRAWRDDGRA 233 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 V + K R + + + + + R+++ E + Sbjct: 234 VEKLTGMSLKEEGRRHCRAEVLDMD-----------VKSGKYEYRLIRRWFAEETRFCVW 282 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 +T+L + + AE+V YR RWQ+EL FK KS +L + + + ++ +LL+ Sbjct: 283 MTNLARETWPAERVMRLYRCRWQVELLFKERKSYNNLKGFVTGQKAITEGLVWDSLLSLV 342 Query: 348 LIDDIIQPSLDFPPRSAGSEK 368 L + Q + + + K Sbjct: 343 LKRRVAQTLVKEGLSTLKAAK 363 >UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocaceae RepID=B2J1G3_NOSP7 Length = 381 Score = 134 bits (336), Expect = 6e-30, Method: Composition-based stats. Identities = 56/364 (15%), Positives = 111/364 (30%), Gaps = 73/364 (20%) Query: 33 RRREIRDAATLLRLGLAYGPGGMSLREV---------TAWAQLHDVATLS--DVALLKRL 81 R+R + + + S+R+V AW ++ ++ R Sbjct: 14 RKRSLPAQLVVSLVIAMSLWSKDSMRDVLKNLIDGLSEAWLKVGKYWRVACKSAITQARQ 73 Query: 82 RNAADWFGILAAQTLAVRAAVTGCTSGK---RLRLVDGTAISAPGG-------------- 124 R A L Q + A + R+ ++DG+ P Sbjct: 74 RLGARVMCKLFHQLVKPMATQETLGAFLQELRIVVIDGSCFDVPDSDENARVFGRPGSRP 133 Query: 125 ----GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRF----AQTADEIRIADRGFGSR 176 + RL + + T D + R ER+ + T + + DRG S Sbjct: 134 GTKAAFPKVRLVILVEAGTHIIFDALMWPYRIGERVRALRLLRSVTPGMLLMWDRGLHSY 193 Query: 177 PECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK 236 +++ DY+ R+ + ++ K Sbjct: 194 A-MVQATVTKGCDYLGRIPANIKFIAEKPLEDGSYLSWI-------------YPSGKLRK 239 Query: 237 KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY 296 KA P R+I ++ E + L+ + L +++ Sbjct: 240 KASQPILVRIIEYTIEHPDNP---------------------TEQLTYRLITSLLNIEKF 278 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLD--ALRAKEPELAKAWIFANLLAAFLIDDIIQ 354 AE +A Y RW++E LK L +R+++P ++A LL + + ++ Sbjct: 279 PAELLAREYHQRWEVENTIDELKIHLLGRKTHVRSQKPREVVQEVYAWLLGHWTVRLLMF 338 Query: 355 PSLD 358 + Sbjct: 339 QAAT 342 >UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacterium Mst37 RepID=Q8VV93_9GAMM Length = 423 Score = 134 bits (336), Expect = 6e-30, Method: Composition-based stats. Identities = 60/377 (15%), Positives = 122/377 (32%), Gaps = 57/377 (15%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 MN + + + +++ + R+R I+ L + L G S+ V Sbjct: 1 MNKDNTELLQ-ITKVLNELKINEIGKKVNFCNRKRIIKPFE--LVMSLITALGDKSVDTV 57 Query: 61 TAWAQLHDVATLSDVALLKRLR--NAADWFGILAAQTL---------AVRAAVTGCTSGK 109 T + T +DV K + F L + + V ++ K Sbjct: 58 TDLHRYFVKLTETDVQY-KPFHNQLSKPEFVGLIKELIGVAVNDWQQQVLGTEVELSAFK 116 Query: 110 RLRLVDGTAISAPGG------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAER-- 155 + L DG++ + A +H+ +D ++ AE Sbjct: 117 GIVLQDGSSFAVHDSLKDIFTGRFTKISPAAIEVHVSWDVLKGYPEQVSISPDSQAEYDF 176 Query: 156 LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 L + +ADRG+ + + + Y+VR + A + Sbjct: 177 LPDADALEGRLLLADRGYF-KLSYLDEIDQAGGAYVVRAKTTVNPMVVAGFNKAG----- 230 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 ++ ++ + E + R++ Sbjct: 231 ----------------------KPLKRFQKIKQKAVKKHIRRSGIVDMDVEGKTNYRLIA 268 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 + T+L +++SAE+V Y+LRWQIEL FK KS +L ++ + Sbjct: 269 SWPEGKDEPTYWATNLDREQFSAEKVMKLYQLRWQIELLFKEWKSYCNLQKFNTRKATMM 328 Query: 336 KAWIFANLLAAFLIDDI 352 + ++++LL+ + I Sbjct: 329 EGLVWSSLLSLLVKRRI 345 >UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae RepID=Q7ULM3_RHOBA Length = 458 Score = 132 bits (331), Expect = 3e-29, Method: Composition-based stats. Identities = 64/346 (18%), Positives = 124/346 (35%), Gaps = 50/346 (14%) Query: 34 RREIRDAATLLRLGLAYGPGGMS-LREVTAWAQL-----------HDVATLSDVALLKRL 81 R + + + L +S LR ++ ++L + +LS+ L Sbjct: 80 NRTLHYDQYCMLVLLYVLNPTVSSLRAISQASELTKVRNKLSNEKASLGSLSEAGGLFSA 139 Query: 82 RNAADWFGILAAQTLAVRAAVTGCTS-GKRLRLVDGTAI-------------SAPGGGSA 127 + L+A+ A +S + + VDG+ + G Sbjct: 140 DHLKPVIEALSAEV-NDAAPDPRLSSIQQTITAVDGSLVNALPSLIAASILKQTTGSALV 198 Query: 128 EWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQ----TADEIRIADRGFGSRPECIRSL 183 WRLH ++ + ++T + +R D + + DRG+ ++ S+ Sbjct: 199 RWRLHTHFEVNNLLPARVDVTPDGGGQHDERAVLKRVLEEDRLYVMDRGY-AKFSLFNSI 257 Query: 184 AFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFP 243 + Y+ R+ + T E + G +T V +G S + P Sbjct: 258 VASSSSYVCRLRDNTVYETTQELELTEGDRA-----AGVLSDTIVKLGGSSSSSNSPDHP 312 Query: 244 ARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVAD 303 RLI + P + NR G+ ++ + G + + T+L AE +A Sbjct: 313 IRLIQIRCTPHQ-----------NRTGGKARGSKAPNSDGILRIATNLLN--VPAEIIAL 359 Query: 304 CYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 Y RW IE+ F+ K L+ D L + + ++ +++A LI Sbjct: 360 IYAYRWTIEIFFRFYKQLMGGDHLISHNANGIQIQVYCSVIACLLI 405 >UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EK95_ACIF5 Length = 369 Score = 131 bits (328), Expect = 5e-29, Method: Composition-based stats. Identities = 60/352 (17%), Positives = 102/352 (28%), Gaps = 69/352 (19%) Query: 39 DAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAV 98 +L+L + + + + R + A L + Sbjct: 35 AYEEVLQLVVDGLRPLLGDDRLAQTVVSKGAIS------QARAKVGAAPLKTLYQNQVQP 88 Query: 99 RAAVTGC---TSGKRLRLVDGTAISAPG-----------------GGSAEWRLHMGYDPH 138 + G RL +DG+ + P + R + Sbjct: 89 HGPLGMAGVGYKGLRLMAIDGSTLDMPDEAANAERFGYPASSRGSAAFPQLRFVAMAECG 148 Query: 139 TCQFTDFELTDSRDAERLDRFAQTADE----IRIADRGFGSRPECIRSLAFGEADYIVRV 194 T E+ +ER A + ADR F S +SL A + R+ Sbjct: 149 THTLCYAEMGSYEQSERTLAGPVMAHADATMLITADRNFYSYAFWQQSL-ATGARLLFRL 207 Query: 195 HWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPE 254 + L E + D +T+ KK R+I +L Sbjct: 208 --SSVLKLPREKILADGS-----------YLSTIYSSTQDRKKGRGGIRVRIIEYTLDG- 253 Query: 255 KALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELA 314 E + L+ + E A ++A Y RW IE + Sbjct: 254 ---------------------IPDAE-PSYRLITNWMDPTEAPALELAALYHRRWTIESS 291 Query: 315 FKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSA 364 LK+ L LR+K PEL + +A LLA + ++ + D ++A Sbjct: 292 LDELKTHLADRQVVLRSKRPELVEQEFYALLLAHAAVRHLMTEAADQTGQAA 343 >UniRef50_D1XZ52 Transposase, IS4 family n=1 Tax=Prevotella bivia JCVIHMP010 RepID=D1XZ52_9BACT Length = 241 Score = 131 bits (328), Expect = 5e-29, Method: Composition-based stats. Identities = 44/233 (18%), Positives = 80/233 (34%), Gaps = 41/233 (17%) Query: 129 WRLHMGYDPHTCQFTDFELTDS--RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFG 186 +LH YD T T +TD+ D++ ++ + I DR + + + + + Sbjct: 1 MKLHELYDVKTDIPTFSVITDASVHDSQVMELIPYEKESFYIFDRAYMATNK-LYIIEEA 59 Query: 187 EADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARL 246 EA ++VR + + + + Sbjct: 60 EAYFVVREKHKMSFEVIEDKEYNTPSSGIM------------------------------ 89 Query: 247 IAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYR 306 + + + K R V + + T+ E +AEQVA Y+ Sbjct: 90 -----ADQIIRFKGHKTKKQYPNKLRRVVFYDYDGNRTFVFYTN--NFEVTAEQVALLYK 142 Query: 307 LRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 RW++EL FK LK L + K I+A ++A L+ I+Q +D Sbjct: 143 YRWRVELFFKWLKQHLRIKEFYGTSENAVKIQIYAAIIAYCLV-VIVQECMDL 194 >UniRef50_Q648P7 Transposase n=2 Tax=environmental samples RepID=Q648P7_9ARCH Length = 281 Score = 131 bits (328), Expect = 6e-29, Method: Composition-based stats. Identities = 35/270 (12%), Positives = 89/270 (32%), Gaps = 46/270 (17%) Query: 105 CTSGKRLRLVDGTAISAPGG----------GSAEWRLHMGYDPHTCQFTDFELTDSRDAE 154 + K VDG+ I A +L++ + T ++T+ + + Sbjct: 41 LSRFKDCFAVDGSIIRLNKTLEKIFKSTCKSQAALKLNVKFSIVNLAVTKLQVTEGKRHD 100 Query: 155 -RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 R + + + + D G+ S + + + ++ ++ R+ + Sbjct: 101 NRFRFITKDPNILYLFDLGYWSF-KNFKKIVDAKSFFVSRLKKSCDPLIVTVSDPKWSHL 159 Query: 214 FLRGLDC-----GKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENR 268 + L E + + S ++K+ RL+ Sbjct: 160 AGKRLSQINGALKGMVELDMKVQLSKSEKSPLKDDLRLVG-------------------- 199 Query: 269 RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 L +T++ + ++ + + + Y RW IE+ F +K +L L+ + Sbjct: 200 ---------ILYEGKWRFYVTNIFDTLFTPQVIYELYSERWTIEIFFNDIKHVLKLEHIF 250 Query: 329 AKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 ++ I++ L+ L+ +I + Sbjct: 251 SQNKNGIMVEIYSALIFYLLVRIMIAIAAK 280 >UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkholderiaceae RepID=A4JGL4_BURVG Length = 402 Score = 131 bits (328), Expect = 6e-29, Method: Composition-based stats. Identities = 66/385 (17%), Positives = 115/385 (29%), Gaps = 76/385 (19%) Query: 11 ILAHIGKPEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSLREVT-------A 62 +LA + ++ G A R R + A + + L EV Sbjct: 25 VLASVCPRTLIEEVLAETGKASQRERLLPAPAVVYYVMALALWREAPLEEVLRVVCEGLQ 84 Query: 63 WAQLHDVATL--SDVAL-LKRLRNAADWFGILAAQTLAVRAA--VTGCT-SGKRLRLVDG 116 W + S A+ R R + LA + L AA G G R+ +DG Sbjct: 85 WLGGGHTEAVQASKSAISQARSRLGPEVMRQLADRVLRPLAAPGAPGAWYRGLRVMALDG 144 Query: 117 TAISAPG-----------------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRF 159 + + + R+ + T + +E++ Sbjct: 145 SCMDVADEAANAKFFGYPGASRGQSAFPQARVLGLVECGTHAVVAAGIAPYGHSEQVMAA 204 Query: 160 -----AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGF 214 T + + +ADR F + ++ A RV + + Sbjct: 205 QLLPAKLTPEMLVLADRNFYGF-KLWQTACATGAKLAWRVKSNLKLPVEQMLPDGSYLS- 262 Query: 215 LRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV 274 + +S ++ A R+I +L Sbjct: 263 --------------RVFDSDDRARRAGQTVRVIDYALEGSATPAQ--------------- 293 Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD--ALRAKEP 332 + LL L D A ++A Y RW+IE F K+ L + LR+K P Sbjct: 294 -------GSYRLLTNLLDPDAAPALELAALYHERWEIEGVFDEFKTHLRANSTVLRSKTP 346 Query: 333 ELAKAWIFANLLAAFLIDDIIQPSL 357 EL + ++ LLA F I ++ + Sbjct: 347 ELVQQELWGLLLAHFAIRQLMAQAA 371 >UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JHT2_9FIRM Length = 424 Score = 130 bits (326), Expect = 8e-29, Method: Composition-based stats. Identities = 70/384 (18%), Positives = 131/384 (34%), Gaps = 64/384 (16%) Query: 16 GKPEELDTSAR--NAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLS 73 +E R N TR R++ LL + ++L H ++S Sbjct: 1 MSSDEFKAFCRLGNKNHFTRIRKMPL-QDLLFTMINRKGLTLALELRNYMKLAHPGVSIS 59 Query: 74 DVALL-KRLRNAADWFGILAA---QTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGS--- 126 L +R++ D F L + + + + DG+ I+ P Sbjct: 60 KPGYLKQRMKLNPDAFLELYKYHNRNFYADSTFST-YKNHLILAADGSDINIPTTTETLK 118 Query: 127 -------------AEWRLHMGYDPHTCQ-----FTDFELTDSRDAER-LDRFAQTADEI- 166 A+ L YD + + R AE+ ++R +T I Sbjct: 119 LYGSASRKNTKPQAQIGLGCIYDVMNRMILESDCNKVKFDEMRLAEKQMERIPETIGNIP 178 Query: 167 --RIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 I DRG+ S P I + + +IVR+ + + D + ++ Sbjct: 179 YIIIMDRGYPSTPAFIH-MMDKDLKFIVRLKSSDYKKEQSSLTENDQLVKIK-------- 229 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 L + + R+ R+V+ LE Sbjct: 230 ---------------------LDKSRIRHYEGTPDGERMKELGEISLRMVKI-LLENGNL 267 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 +L T+L + E+ E++ + Y +RW IE A++ LK+ L L+ +P L I++ + Sbjct: 268 EVLATNLSQTEFHTEEIKELYHMRWGIETAYETLKNRLQLENFTGTKPILLLQDIYSTIY 327 Query: 345 AAFLIDDIIQPSLDFPPRSAGSEK 368 + L++DII + + + K Sbjct: 328 LSNLVEDIILDAERELDQKETNRK 351 >UniRef50_B6FVR6 Putative uncharacterized protein (Fragment) n=2 Tax=Clostridium nexile DSM 1787 RepID=B6FVR6_9CLOT Length = 286 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. Identities = 47/282 (16%), Positives = 88/282 (31%), Gaps = 34/282 (12%) Query: 48 LAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTS 107 L++G + + T+S + +R + + L + Sbjct: 1 LSFGSNSLGHEIGEFFEYRKGFPTVSAF-VQQRKKLSYTALEHLFYRFNECTFKKPVLYK 59 Query: 108 GKRLRLVDGTAISAPGGG----------SAEWRLHMGYDPHTCQFTDFELT-------DS 150 RL +DG+ S P + L+ +D + F D + Sbjct: 60 NYRLLAIDGSDFSLPYNSQEDNVMGDNHFSTLHLNALFDVCSKSFLDVIVQKGLHENETG 119 Query: 151 RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFD 210 E +DR ++ I +ADRG+ + + DY+VRV D Sbjct: 120 AACELVDRISEKHPVIIMADRGYENYN-LFAHIEERLFDYVVRVRDS------------D 166 Query: 211 MMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK--TRLLSENR 268 + GL+ K E + + P +K+ + Sbjct: 167 NSCMVSGLNLPKTVEYDITKRVVLTRHFSGPAAINTEKYKYLSKKSRFDYIENSKSPDYE 226 Query: 269 RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQ 310 R V+ L+ + ++ TSLPE+ +S E + + Y Sbjct: 227 ITIRFVRF-LLDDNTYEVIATSLPEEIFSMEDLKEIYHRHMG 267 >UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZNH0_9PLAN Length = 451 Score = 126 bits (317), Expect = 9e-28, Method: Composition-based stats. Identities = 51/263 (19%), Positives = 89/263 (33%), Gaps = 30/263 (11%) Query: 110 RLRLVDGTAI--------SAPGGGSAEWRLHMGYDPHTCQFTDFELTDS----------- 150 RL VDG+ + +WR H Q +LT+ Sbjct: 158 RLIAVDGSVLTALPQIVGRIAAKEKGQWRFHALVHVLDGQPVASKLTEEPSAKGRAERDV 217 Query: 151 ----RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEG 206 A+++D + + DRG+ S E + DYI R++ + L Sbjct: 218 LAEMIAADQIDIPQSDEGHLFLMDRGYRS-AELFNKIHTAGHDYICRLNRTDGKLLKPPK 276 Query: 207 MRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSE 266 L + I G A I P + + Sbjct: 277 KGEVREPI--QLPPLSAEAIAMGIVADELITMGGNCGASKIGSDHPMRRIKLIPPADRPS 334 Query: 267 NRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA 326 + R+GRV +T VL T + +AE++ Y RW++EL F+ LK +L Sbjct: 335 SARQGRVRTDQTGR-DELVLATTLMD---LTAEEIVRLYEHRWEVELFFRFLKQVLGCKK 390 Query: 327 LRAKEPELAKAWIFANLLAAFLI 349 L + + + ++ ++A+ L+ Sbjct: 391 LLSAKTAGVQIQLYCAIIASLLL 413 >UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminococcus torques ATCC 27756 RepID=A5KKC4_9FIRM Length = 422 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. Identities = 59/361 (16%), Positives = 114/361 (31%), Gaps = 85/361 (23%) Query: 29 GALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT-----LSDVALLK-RLR 82 A TR ++ + L+ L + + L + + +S A+ K R Sbjct: 35 NAFTRSGKL-SFSNLIYFVLQSVHKSIP---INYARFLENFPSDLPIFVSKQAISKARQG 90 Query: 83 NAADWFGILAAQTLAVRAAVT---GCTSGKRLRLVDGTAISAPGGGSA---------EWR 130 + F L ++ +G + VDG+ I P + + Sbjct: 91 ISHKAFLELFRLSVKQFYFQPVNLRTWNGFHIYAVDGSTIQIPESKENYEVFGGNPNKTK 150 Query: 131 L-------HMGYDPHTCQFTDFELTDSRDAER------LDRFAQTADEIRIADRGFGSRP 177 + + YD D L R ER +D + + I + DRG+ S Sbjct: 151 IISPLASASVLYDVINDILIDVSLHPYRYNERESAKAHVDFLPRFPNSIILFDRGYPS-E 209 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKK 237 + L +++RV + ++ + Sbjct: 210 DMFHYLNSKGILFLMRVPKTFKKAISEQ-------------------------------- 237 Query: 238 AGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYS 297 E AL + ++ R + LE L+T+L ++ + Sbjct: 238 ----------------EDALFTYPASCNKESLTLRSIHF-LLEDGSTEYLVTNLMPEQIA 280 Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 E D Y+ RW +E ++ LK+ L ++A + +P + FA + + L+ I S Sbjct: 281 KENFPDLYQFRWGVESKYRELKNRLEIEAFNSIKPASIQQEFFAAMYLSNLVAVIKSESD 340 Query: 358 D 358 Sbjct: 341 S 341 >UniRef50_C6J0N9 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J0N9_9BACL Length = 402 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 62/354 (17%), Positives = 115/354 (32%), Gaps = 64/354 (18%) Query: 30 ALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWA------QLHDVATLSDVALLKRLRN 83 A R R++ + + G S E++ Q + ++S L ++++ Sbjct: 27 ADHRTRKLTTGKAIQIFVESQLAGRTSYDEISEHLRIMPDLQDDHLKSISASQLSRKIKQ 86 Query: 84 AA-DWFGILA----AQTLAVRAAVTGCTSGKRLRLVDGTAISAP---------GGGSAEW 129 D + A+ + G + +LR++D T ++ P Sbjct: 87 LPTDLLQAIFLCNIARIQEITKQKQGIPNIGKLRILDSTVLTLPTLAGRWAYWSKEQNAV 146 Query: 130 RLHMGY---DPHTCQFTDFELTDSR--DAERLDRFAQTADEIRIADRGFGSRPECIRSLA 184 ++H D T + + D E D I + DRG+ E SL Sbjct: 147 KIHTQLVVADRETVFPGKIINSTAAVSDQEVALDLVVADDAIHVMDRGYIQY-ELYESLI 205 Query: 185 FGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPA 244 + ++ R+ + + + + + V I + +K A Sbjct: 206 HQQMRFVARLQTKNKVTILHQRAVPEGFPI--------TIDADVEIQWNDKQKQTHYLQA 257 Query: 245 RLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADC 304 RLI T E LLT++ + SA+++++ Sbjct: 258 RLIEF----------------------------TDEQKRTYRLLTNVQ--DRSAQEISEI 287 Query: 305 YRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 YR RW IEL FK +K L L + + I+ L+A L+ I Sbjct: 288 YRYRWLIELFFKWIKQHLRLVKIYSANQTAIWNQIYLALIAYSLVLLIKLEMQT 341 >UniRef50_Q3M8C5 Transposase, IS4 n=15 Tax=Cyanobacteria RepID=Q3M8C5_ANAVT Length = 340 Score = 126 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 57/324 (17%), Positives = 101/324 (31%), Gaps = 70/324 (21%) Query: 45 RLGLAYGPGGMSLREVTAWAQ----LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRA 100 L S+R + A D++T S L + + + + L L Sbjct: 34 WLSYVLDNSLTSMRGLFARLNNTGFELDISTFSKANLHRSQKPFQEIYQKL--NKLVQNK 91 Query: 101 AVTGCTSGKRLRLVDGTAISAPGG-----GSAEWRLHMGYDPHTCQFTDFELTDSRDAER 155 A + + +D T I+ G + +L + T D + D + Sbjct: 92 AENKLHNKYAICPIDSTVITLTSKLLWVLGHHQVKLFSSLNLATGSPEDNLINFGHDHDY 151 Query: 156 LD----RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDM 211 + + + DRGF + I+ L +++R+ Sbjct: 152 KFGSKMIANLPTNAVGVMDRGFAG-LKFIQELVQENKYFVLRIKNN-------------- 196 Query: 212 MGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKG 271 L E++ S++ + Sbjct: 197 -------------------------------------WKLEFEESSGLIKVGASDDAQAY 219 Query: 272 RVVQAETLEAAGHVLLLTSLPED---EYSAEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 RV+ LE L+T+LP D S + + D Y LRW +EL +K LK L LD L Sbjct: 220 RVINFCDLETKTEFRLVTNLPADGEATVSDDDIRDIYLLRWGVELLWKFLKMHLKLDKLI 279 Query: 329 AKEPELAKAWIFANLLAAFLIDDI 352 K I+ +L+A ++ + Sbjct: 280 TKNVNGITIQIYVSLIAYLILQLV 303 >UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coprococcus comes ATCC 27758 RepID=C0BDH6_9FIRM Length = 204 Score = 125 bits (313), Expect = 3e-27, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 70/185 (37%), Gaps = 13/185 (7%) Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 + I DRG+ S + ++ R + L D F + Sbjct: 31 SVYIGDRGYCSYNNMAHVVEQ-GQYFLFRTKDIHSKGLVGNFNFPDAESF--------DI 81 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK--TRLLSENRRKGRVVQAETLEAA 282 +V++ S +KK A + A R+++ + Sbjct: 82 NVSVILVRSHSKKILADIHTEGYI-RFVDQSAAFDYIEYGSYDTYELSFRILRFPIS-TS 139 Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 + ++T+LP DE+ E++ Y RW IE +F++LK + L A +PE K I+A Sbjct: 140 TYECIVTNLPRDEFPVERIKTLYNARWSIESSFRKLKYTIGLSNFHAYKPEYVKQEIWAR 199 Query: 343 LLAAF 347 LLA+ Sbjct: 200 LLASL 204 >UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B70E Length = 479 Score = 124 bits (311), Expect = 4e-27, Method: Composition-based stats. Identities = 61/389 (15%), Positives = 117/389 (30%), Gaps = 59/389 (15%) Query: 4 SHDNWSAILAHIGKPEELDTSARNA-GALTRRREIRDAATLLRLGLAYGPGGMSLREVTA 62 + ++ R TR R + + + + Sbjct: 11 ELSELFFDFNTLIHSTRVNELCRKKKCDFTRSRNMNFYSIIYYFIFRNRTTTNAELTHFY 70 Query: 63 WAQLHDVATLSDVALLKRLR-NAADWFGILAAQT--LAVRAAVTGCTSGKRLRLVDGTAI 119 + +S AL K +R + F L Q + ++ L DGT + Sbjct: 71 SSIDRFEKRISKQALNKAIRKLNPNVFTYLINQFASIYYSTSLPKKYRDHLLIAEDGTYM 130 Query: 120 SAPGGGSAEWRLHM---------------------GYDPHTCQFTDFELTDSRDAE---- 154 P YD F DF L + +E Sbjct: 131 EIPYNMLNINEFQFALGCHVRNMFDVKKVQSKAGGLYDVTNGLFIDFSLRQAPYSETPLA 190 Query: 155 -----RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRF 209 R + I +ADR +GS E I L Y++R Sbjct: 191 FAHLYRTREMLENQKVIYLADRYYGS-AEIISHLEDLRYSYVIRGKSN------------ 237 Query: 210 DMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 F + G + I ++K F A L E + + E Sbjct: 238 ----FYKKQVAGMESD-DEWIEVEVDEKWLKRFRFSPEAKKLRKENPTLKIRVIKREY-- 290 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 R + E +++ T+L + ++ +++ + Y RW IE+++K +K+ ++ + Sbjct: 291 --RYTDNKNKEHCENLIYFTNLSSESFTTDEIMEIYSRRWDIEVSYKTMKTTQEVERHIS 348 Query: 330 KEPELAKAWIFANLLAAFL---IDDIIQP 355 + ++A+ I+A +L + I + Sbjct: 349 SDGDVARNDIYAKVLFHNIAGVIRKEMNQ 377 >UniRef50_A1APW2 Transposase, IS4 family n=6 Tax=Deltaproteobacteria RepID=A1APW2_PELPD Length = 391 Score = 123 bits (309), Expect = 8e-27, Method: Composition-based stats. Identities = 67/367 (18%), Positives = 116/367 (31%), Gaps = 68/367 (18%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL 66 + +L + + + D + G R ++ L L + S E+ + Sbjct: 15 AFKLLLTPVFERFKSDKQLESRGY--RPLQMTFDDQLKALIFYHLEEFSSGSELLQALEQ 72 Query: 67 HDVAT----------LSDV--ALLKR-LRNAADWFGILAAQTLAVRAAVTGCTSGKRLRL 113 +D A S A+ R L ++ FG L Q A + L Sbjct: 73 NDFAKECVAPPKGIKKSAFFEAINNRGLEQLSEVFGHLVKQ--AGKVLPAEYAHLGNLVS 130 Query: 114 VDGTAISAP--------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQ-TAD 164 +DG+ I A GS + + H+G+D + L+D ++ ER Sbjct: 131 IDGSLIDAVLSMEWADYRSGSKKAKAHVGFDINRGIPRKIYLSDGKEGERPFVDKIIDKG 190 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 E + DRG+ S E ++ R+ ++ + E D Sbjct: 191 ETGVMDRGYQSHDH-FDKWQAAEKFFVCRIRENTIKIVIRENAVNP--------DSIIFY 241 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGH 284 + VM+G G RL+ + Sbjct: 242 DRIVMLGTKG--VNQTEKELRLVGYRV-----------------------------DGKD 270 Query: 285 VLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLL 344 + T+ + +AEQVA+ Y+LRW IE F K L + L A+ + L+ Sbjct: 271 YWIATN--RYDLTAEQVAEVYKLRWNIETFFGWWKRHLKVYHLIARSKYGLMVQLLGGLI 328 Query: 345 AAFLIDD 351 L+ Sbjct: 329 TYLLLAI 335 >UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultured bacterium BLR12 RepID=C0ING1_9BACT Length = 337 Score = 123 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 53/287 (18%), Positives = 89/287 (31%), Gaps = 56/287 (19%) Query: 106 TSGKRLRLVDGTAISAPG------------------GGSAEWRLHMGYDPHTCQFTDFEL 147 +G RL +DG+ PG + R + YD D ++ Sbjct: 15 WNGLRLLAIDGSTAVLPGHKSITEEFGITNFGPYANSPRSVARTSVLYDVLNLTVLDGQI 74 Query: 148 TDSRDAER-LDRFAQ----TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL 202 ER L R A ++ + DRG+ S + Y++R+ Sbjct: 75 DRYDSCERNLARQHFAQVKPATDLLLFDRGYPSLGLMFE-MQAQGIHYLIRMRED----- 128 Query: 203 TAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTR 262 +L NGET + LP + + Sbjct: 129 ----------WWLDVRKMLANGETDKEVT-----------------FKLPATERDLLNKY 161 Query: 263 LLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL 322 ++ K R+V + E VL + + ++ E AD Y RW IE A+K K + Sbjct: 162 ATKNDKFKCRLVAVQLPEGGTEVLCTSIINKEILPYECFADLYHCRWNIEEAYKLFKCRV 221 Query: 323 HLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKK 369 L+A K K FA + + P + + + + Sbjct: 222 QLEAFSGKTAIAVKQDFFAKIFMMTTTAVLAFPVEEQIKQECQNSTR 268 >UniRef50_A6CCZ3 Transposase, IS4 (Fragment) n=7 Tax=Planctomyces maris DSM 8797 RepID=A6CCZ3_9PLAN Length = 531 Score = 122 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 48/328 (14%), Positives = 82/328 (25%), Gaps = 75/328 (22%) Query: 78 LKRLRNAADWFGILAAQTLAVR------------------------AAVTGCTSGKRLRL 113 RL+ + + Q A A V ++G R+ L Sbjct: 146 RARLKLSFTAIREIVQQLAADAEAACDQNCVQSQEQSAARLSPSNVADVKSRSTGGRILL 205 Query: 114 VDGTAISAPGGGSAEW-----------------RLHMGYDPHTCQFTDFELTDSRDAERL 156 VDG I+A + R T D Sbjct: 206 VDGFTITAADTPENQRAYPQNPAQKPGLGFPVLRCVSLISMTTGLLVDLVSGPYSGKGSG 265 Query: 157 DRF-------AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRF 209 + + +AD + + + + +++ H Sbjct: 266 ETALLWQMLDVLRPGDTLVADSYYCTYW-LVSACHARGVQILMKNHHLRDDHPQTARRLN 324 Query: 210 DMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRR 269 + L + ++ RL+ V Sbjct: 325 KRERLVTWLRPPV---RPAWMARQEYRRQPLTLTLRLVDVQ------------------- 362 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRA 329 V + T A +A Y+ RW IEL + +K L +D LRA Sbjct: 363 ----VSQPGCRTKTFTIATTITDRKACPARWIAAVYQSRWLIELDIRSIKCSLGMDILRA 418 Query: 330 KEPELAKAWIFANLLAAFLIDDIIQPSL 357 K P + +++ LLA LI + S Sbjct: 419 KSPAMVLTELWSCLLAYNLIRLKMLQSS 446 >UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH7_9BACT Length = 382 Score = 121 bits (303), Expect = 4e-26, Method: Composition-based stats. Identities = 62/342 (18%), Positives = 115/342 (33%), Gaps = 58/342 (16%) Query: 31 LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGI 90 L R + A+ L ++ G +SL + + D +L L+++L + Sbjct: 63 LKSMRGLCAASELKKVQEHVTLGKISLGSFSEAQHVFDATSL--QHLVQKLSSKIP---- 116 Query: 91 LAAQTLAVRAAVTGCTSGKRLRLVDGTAIS-APGGGSAEW--------RLHMGYDPHTCQ 141 + + + K L VDG+ AEW +LH+G+ Sbjct: 117 -----INKIQDRSLLAAVKDLVAVDGSLFQTLTRVLWAEWLDENHKAAKLHLGFSLLKQS 171 Query: 142 FTDFELTDSRDAERLDRFA-QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLR 200 D +T ER + + DR +G L A + +R+ + Sbjct: 172 AVDAVITAGNSCERKALLKMVQPGVMYVCDRYYGLDYSYFEELQQRGALFTIRIRNKPKL 231 Query: 201 WLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISK 260 + E + + G + V +G++ + Sbjct: 232 TVIKEYEITE-----KDRKEGVISDQLVYLGDTDRELKPI-------------------- 266 Query: 261 TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKS 320 R+V+ LL+TS ++ +A ++ YR RWQIE+ FK LKS Sbjct: 267 -----------RLVRTGAFNDKEI-LLVTSEAPEKLNAAIISTIYRQRWQIEVFFKWLKS 314 Query: 321 LLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPR 362 +L L A+ +++ L+AA ++ D+ Sbjct: 315 ILGCRKLLAESSNGVAIQMYSALIAAIMLFDLFGKKPTLRQM 356 >UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostridium nexile DSM 1787 RepID=B6FTH4_9CLOT Length = 224 Score = 121 bits (303), Expect = 4e-26, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 67/176 (38%), Gaps = 11/176 (6%) Query: 188 ADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 Y++RV G +T D + + ++ P + I Sbjct: 1 MYYLIRVKDGGGGSMTGSFDLPD-------DNEFDHDMQLILTRKQTKDVKANPQKFKFI 53 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 A S P + + + RVV+ E + +LT+LP++++ E++ Y + Sbjct: 54 AKSSPFDYLDLYDKK---FYTLNFRVVRFAISE-DSYESILTNLPKEDFPVEEIKKVYAM 109 Query: 308 RWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRS 363 RW IE +F+ LK + L +K+ E I+A L+ + I + + Sbjct: 110 RWGIETSFRELKYAIGLCCFHSKKVEYIMQEIYARLILYNYCELITMHVIIQQKGT 165 >UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Raphidiopsis brookii D9 RepID=UPI0001C16028 Length = 465 Score = 121 bits (303), Expect = 4e-26, Method: Composition-based stats. Identities = 60/388 (15%), Positives = 109/388 (28%), Gaps = 79/388 (20%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHD--- 68 L+ + + + + + + RR I ++ + S V + L Sbjct: 22 LSQVIPSQTITKAIESTCSSQRRLRILP-TYIIVTLVIAMSFWSSDSIVDVFKNLIHGLS 80 Query: 69 ---------VATLSDVAL-LKRLRNAADWFGILAAQTLAVRAAV--TGCTSG-KRLRLVD 115 + T S ++ R R A L A + G G R+ VD Sbjct: 81 SLHIPSGLRLQTPSASSITEARQRTGAAVMRRLFELVAKPLATILTPGAFLGELRIMAVD 140 Query: 116 GTAISAPGG------------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLD 157 GT P G + RL + T D R ER Sbjct: 141 GTVFDVPDTSTNARVFGYPGSPKGTYPGFPKVRLVFLVEAGTHLIIDAFCYPYRMGERRG 200 Query: 158 RFAQ----TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 + + + DRG S + + ++ + +++ RV + + Sbjct: 201 ALKLLRSINSSMLLMWDRGLHSF-KMVHTVIKQQGNFLGRVPGNVKFQVVKTLADGSYLS 259 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 ++ KK R+I Sbjct: 260 WIAPDGQS-------------RKKGAKRMEVRIIEY------------------------ 282 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKE 331 V E + L+ + D++ A +A Y RW+ E LK L +R+K Sbjct: 283 VIEEDGTLKTYRLITNLMDVDKFPALLLAQEYHKRWEAENTLDELKVHLLARKIPIRSKN 342 Query: 332 PELAKAWIFANLLAAFLIDDIIQPSLDF 359 P ++ LLA + + ++ S Sbjct: 343 PREVVQELYGWLLAHYCLRCLMFQSATL 370 >UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomycetales RepID=A8KXP7_FRASN Length = 421 Score = 121 bits (303), Expect = 5e-26, Method: Composition-based stats. Identities = 51/310 (16%), Positives = 83/310 (26%), Gaps = 64/310 (20%) Query: 79 KRLRNAADWFGILAAQTL----AVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLH-M 133 R R + +L +T R V G R+ VDGT + P Sbjct: 102 ARDRLGVEPVKLLFERTAVPMALPRRTVGAFYRGWRVCTVDGTTLLVPDTDENAAAFGKP 161 Query: 134 GYDPHTCQFTDFEL-------------------TDSRDAERLDRFA-----QTADEIRIA 169 G D + S+ A F + +A Sbjct: 162 GNDQGEGALPQVRVLGLVECGTRALLGAGFGGTGGSKAASEQALFPDLLGALRPGMLVLA 221 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 DR F E A AD + R + E + L Sbjct: 222 DRNFLGF-ELFAKAAATGADLLWRAKSDRRLPIDTELADGSYLSHL-------------- 266 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + G R I V + L +++ + L+ T Sbjct: 267 ------VEPGTRDKGRKITVRVVEYTLDRDPDSPLPAGKKE------------TYRLVTT 308 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKAWIFANLLAAF 347 L D A +A Y RW++E +K LR++ P+ + ++ LL Sbjct: 309 ILDPDAAPATDLAALYSDRWEVETLLDEIKVHQQDGRLVLRSRAPDRVEQEVWGVLLLHR 368 Query: 348 LIDDIIQPSL 357 + +I + Sbjct: 369 ALRKLIHDTA 378 >UniRef50_C9R546 Transposase (IS4 family) protein n=1 Tax=Aggregatibacter actinomycetemcomitans D11S-1 RepID=C9R546_AGGAD Length = 382 Score = 121 bits (302), Expect = 5e-26, Method: Composition-based stats. Identities = 65/370 (17%), Positives = 120/370 (32%), Gaps = 65/370 (17%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYG-PGGMSLREVTAWAQ 65 IL++I K E+L + +++ L + SLR + + Sbjct: 2 KLREILSYIPK-EQLRMFTLEYQVDRQVKKLSGEVMFYLLLFSSLNVRHNSLRTLEQFYS 60 Query: 66 LHDVATLSDV--------ALLKRLR-NAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDG 116 L D ++ RLR AD+F + L R + + + D Sbjct: 61 SPTFTGLFDKPIKHAKYNSISDRLRTINADYFKAIFESVL-SRYSDSYLKPRDNIIAFDS 119 Query: 117 TAISAPGG------------GSAEWRLHMGYD---PHTCQFTD--FELTDSRDAERLDRF 159 T ++ G + + + + FT + D E + Sbjct: 120 TIVTLSSKLLKTGMKVGSYQGVNGIKFSVAFSSVPVKSKLFTQRVYSSEDVALKELIVEH 179 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 + D I + D G SR + ++ RV + + Sbjct: 180 PLSRDNILLFDMGIQSRNT-FDEFSDKHFTFVTRVREIARYRVMS--------------- 223 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 E V I + + + + RL + E R R++ A Sbjct: 224 -----ENPVEIRETASMVIQSDYNVRLF-------------NKENKETRHIFRLIIARLK 265 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 E + LT+ +YSA ++A+ Y+ RW+IE+ FK +K L L ++ K + Sbjct: 266 EKDEEIYFLTN--HADYSATEIAELYKRRWEIEVFFKFIKQHLDFSHLLSRNENGMKVEM 323 Query: 340 FANLLAAFLI 349 + L+ A L+ Sbjct: 324 YMTLITAILL 333 >UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuringiensis serovar pakistani str. T13001 RepID=C3EBZ9_BACTU Length = 221 Score = 120 bits (300), Expect = 9e-26, Method: Composition-based stats. Identities = 47/235 (20%), Positives = 82/235 (34%), Gaps = 24/235 (10%) Query: 133 MGYDPHTCQFTDFELTDS--RDAERLDRF--AQTADEIRIADRGFGSRPECIRSLAFGEA 188 M YD + F ++T+ DA+ ++ I D G+ + A Sbjct: 1 MEYDVISGDFLQLDITNGISHDAKYGQELIHTVEKRDLCIRDLGYF-YLPDFHEINQKGA 59 Query: 189 DYIVRVHWRGLRWLTAE--GMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARL 246 Y+ R+ + R + F++ + GK E + P RL Sbjct: 60 YYLSRLPINTQVYRKKGILYERLYLEDFIKKVSEGKTIEW-----FDVYIRKQHKVPTRL 114 Query: 247 IAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYR 306 I L A +S + R V +L+T++P D E++ Y Sbjct: 115 IIYKLTG--AGYDGKNNVSTATKYKRQVS----------ILMTNIPSDILQKEEIYPLYT 162 Query: 307 LRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 +R QIE+ FK KSL + + + E + ++ L A L ++ F Sbjct: 163 VRGQIEILFKTWKSLCGIHLCKHVKLERFQCHLYGQLTAILLHSMLMFRMRKFLH 217 >UniRef50_C5VJA1 Transposase domain protein n=15 Tax=Prevotella RepID=C5VJA1_9BACT Length = 405 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 58/379 (15%), Positives = 107/379 (28%), Gaps = 75/379 (19%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV-----T 61 ++ ++ + K + S + + + L+ + SLREV Sbjct: 12 VYNQLIKLLDKQQIKQISLETPRSEAYVKRLDGWTHLVIMLFGVLKHFDSLREVEIGMKA 71 Query: 62 AWAQLHDV-----ATLSDVALLKRLRNAADWFGILAAQTLAV--------RAAVTGCTSG 108 +LH + S +A + R ++F + A L R T Sbjct: 72 EVNKLHHLGIDYVVRRSTLADANK-RRPQEFFASVYAYLLERYGSFLSDSRPKGEQKTWE 130 Query: 109 KRLRLVDGTAISAPGG-------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAER 155 K L ++D T I+ ++H H +LT + + Sbjct: 131 KLLYMMDSTTITLFDNILKGVGRHPKSGKKKGGMKVHTVMKYHVGVPMVVQLTSAATHDH 190 Query: 156 --LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 L D DR + + + L Y+ ++ + G Sbjct: 191 YLLKEVHLPKDATLTMDRAYVDYAQ-FQRLTEEGVCYVTKMKKNLTYTELSSVTYVSPDG 249 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 + D E E R + R Sbjct: 250 LVTHTDKKIVFE--------------------------------------KGEIRHQARR 271 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 V+ + + V+LLT+ E + + + Y+ RW IE +K+LK L Sbjct: 272 VELWSDNSHKSVVLLTN--NLELDVKDLEEIYKRRWAIESLYKQLKQNFPLHFFYGDSVN 329 Query: 334 LAKAWIFANLLAAFLIDDI 352 + + L+A L I Sbjct: 330 AIQIQTWVVLIANLLCTVI 348 >UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q04V25_LEPBJ Length = 423 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 62/379 (16%), Positives = 108/379 (28%), Gaps = 65/379 (17%) Query: 23 TSAR-NAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQL-HDVATLSDVALLKR 80 AR A TR R++ L+ + +++ + L T + R Sbjct: 2 EIARMKPSAFTRNRQLTLPRLLIAMI-NLLNKSLAVELYRYFKNLGKKAVTKQAFSF-TR 59 Query: 81 LRNAADWFG---ILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGS----------- 126 F + + G + D T IS P Sbjct: 60 ENLNPQVFESLNEIFVNSYYKNVTNCKTHKGYIVAACDATGISLPKTKEFVKDFGCVKNQ 119 Query: 127 ------AEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFA-----------QTADEIRIA 169 + +D + + R +ER Q I + Sbjct: 120 LGESESPNANSSIIFDIYNDIILSSTVGSHRTSERSMALHHIEKLRSISALQNKKLILLF 179 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 D+G+ S E I L +I+R + R L+ G + T M Sbjct: 180 DKGYPS-MELIGKLMANGIHFIIRSNTRWLKEAKIAGE------YKEYDKVKNILITNNM 232 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + K A L ++ + + + +T Sbjct: 233 LKKKEWLKEYANTKGNLFSLRFVGSR-----------------------YKDGQVGIFVT 269 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 LP+ E+S E + Y RW IE F+ K L L+ + K +A +L L Sbjct: 270 DLPDSEFSREDIVFLYGKRWNIETHFRFEKYSLELENVAPKTSIRFLQEYYAKILTFNLA 329 Query: 350 DDIIQPSLDFPPRSAGSEK 368 +IQ + + +S ++K Sbjct: 330 SLLIQEAQEEYDQSIQNKK 348 >UniRef50_A1BCF6 Transposase, IS4 family protein n=1 Tax=Chlorobium phaeobacteroides DSM 266 RepID=A1BCF6_CHLPD Length = 252 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 35/259 (13%), Positives = 73/259 (28%), Gaps = 55/259 (21%) Query: 114 VDGTAISAPGGGSAE---------WRLHMGYDPHTCQFTDFELTDSRDAER-------LD 157 +D T I +LH YD + +TD + ++ Sbjct: 1 MDATVIDLCLRVFPWAEFRQRKGAIKLHYLYDHRSSLPAFMVMTDGKKSDIRVARSQEKL 60 Query: 158 RFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRG 217 F D I DR + E + +L + ++ R + + R Sbjct: 61 DFHLLPDSIVSFDRAYIDF-EWLYTLDQRKVWFVTRSKANIQYRIIGQHQPIKNKQVTR- 118 Query: 218 LDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE 277 E+ + + ++ + R+V Sbjct: 119 -----------------------------------DERIELIIEKSRAKYLKPLRLVCYT 143 Query: 278 TLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKA 337 E +T+ + +A +A Y+ RWQIE F+ +K L + + + + Sbjct: 144 DQETGKAYEFITN--NIKLAASTIAAIYKSRWQIETFFRWIKQNLKIKSFQGTSQNAVLS 201 Query: 338 WIFANLLAAFLIDDIIQPS 356 + + + I + Sbjct: 202 QTWIAMCYYLRLSYIKFKT 220 >UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC Length = 412 Score = 118 bits (294), Expect = 4e-25, Method: Composition-based stats. Identities = 61/396 (15%), Positives = 101/396 (25%), Gaps = 83/396 (20%) Query: 11 ILAHIGKPEELDT-SARNAGALTRRREIRDAATLLRLGLAYGPGGMSL-------REVTA 62 +L + PE +D A A RRR + + + + G + R + Sbjct: 23 VLTRVYPPELVDRVLAVTDTAEVRRRLLPSWLVVYFVLALWLFRGRNCGYVQVLARLTSG 82 Query: 63 WAQLHDVA--------------TLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCT-- 106 A S R R +D +L Sbjct: 83 LHFQRRAAVLAAGGAGGAGWSLPASPSLGEARARIGSDPVRMLFEHAAGPVGVEGQAGVF 142 Query: 107 -SGKRLRLVDGTAISAPGGGSA---------------EWRLHMGY--DPHTCQFTDFELT 148 G RL +DG+ P + ++ + T Sbjct: 143 LHGLRLVQIDGSTCDLPDTQANRAFFPGPSNAGGPAPFPKVRWVIAAEAATGALLGASFG 202 Query: 149 DSRDAE----RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTA 204 E R + +ADR F S + A + R + A Sbjct: 203 PWSTGEPALARDLLGQLGPGMLTLADRNFLSH-RLAGEVLATGAHLLWRAKA---TFTLA 258 Query: 205 EGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLL 264 D +L L + E P R+I ++ A Sbjct: 259 PVHVLDDGSYLAELTPPRGSEGP-------------PLTMRVIEYTVHSTTAGGD----- 300 Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 + L+ L +E+S +A Y RW E K+ L Sbjct: 301 -------------ESSSELFCLVTDLLDPEEWSMLDLARAYPTRWGCETVIGHHKTDLGE 347 Query: 325 DA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 LR+K+PE ++A + +I + D Sbjct: 348 GRPVLRSKDPEGVAQEMWALFAVHQALARLIGVAAD 383 >UniRef50_Q8ABH9 Putative transposase n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8ABH9_BACTN Length = 310 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 66/205 (32%), Gaps = 40/205 (19%) Query: 148 TDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGM 207 + + I DRG+ + + + + EA ++VR Sbjct: 93 HPYTITKVMIEIPYEPSSYYIFDRGYNNF-KMLYKIHQIEAYFVVRAKKNLQY------- 144 Query: 208 RFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSEN 267 + + R L + +V++ K+ Sbjct: 145 --KSIQWKRRLPKNVLSDASVLLTGFYPKQY----------------------------Y 174 Query: 268 RRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDAL 327 + R+V+ E +T+ SA QVA+ Y+ RWQ+EL FK LK L + Sbjct: 175 PKPLRLVKYWDEEQEREFTFITN--AMHISALQVAELYKNRWQVELFFKWLKQHLKIKRF 232 Query: 328 RAKEPELAKAWIFANLLAAFLIDDI 352 + I+A + A L+ I Sbjct: 233 WGTTENAVRIQIYAAICAYCLVAII 257 >UniRef50_C4Z764 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=C4Z764_EUBE2 Length = 236 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 73/202 (36%), Gaps = 15/202 (7%) Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 +AD G+ S L + +++R+ + + D T Sbjct: 1 MADSGYESFNT-FAHLIWKGMYFVIRMKDINSNGILSSYDLPDSE--------FDTHIRT 51 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 + + G P ++ S + + + R+V+ L+ ++ + Sbjct: 52 TLTRRHTKETLGNPNTYTILQPSTDFDFLDENCMY----YDIEFRIVRVH-LDNGTYICI 106 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 T+L +E+ E++ Y +RW E +F+ LK + L + + + I A+++ Sbjct: 107 ATNLS-EEFPLEEINKLYLMRWSEETSFRELKYTIGLINWHSSKYDGILQEINAHMILYN 165 Query: 348 LIDDIIQPSLDFPPRSAGSEKK 369 + + ++ ++A K Sbjct: 166 FCELVTSHAMVKKSKNAKHVYK 187 >UniRef50_C6JEA3 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JEA3_9FIRM Length = 329 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 55/342 (16%), Positives = 108/342 (31%), Gaps = 49/342 (14%) Query: 1 MNYSHDNWSAILAHIGK-PEELDTSARNAG-ALTRRREIRDAATLLRLGLAYGPGGMSLR 58 MNYS + +LA I + + A G R R++ +L + L + Sbjct: 1 MNYSDSIKAILLAAINDLSKTPEKYAVKPGVDFIRNRKLGFKDYML-MFLTMEADCIREE 59 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTA 118 + + D + + +R + D F L + DG++ Sbjct: 60 LYRFFGRTIDAPSKAAF-YRQRKKIREDAFRNLLLA-FNRKLPKKLYNGKYEFWACDGSS 117 Query: 119 ISA----------------PGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQT 162 G + ++ + +FTD + +R F Sbjct: 118 CDIFLNPEDKDTYFEPNGKSTRGFNQIHINAMFSLFDKRFTDILVQPARKRNEYSAFCSM 177 Query: 163 ADE---------IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 D I DRG+ S + +++R + MMG Sbjct: 178 VDSADIPEHYKVIFFGDRGYTSYNN-FAHVIEKGQYFLIRCND---------KRASGMMG 227 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFP----ARLIAVSLPPEKALISKTRLLSENRR 269 + + + ++++ S + R I + P + + +E Sbjct: 228 YPVDTLPAFDEDISLILTRSKAVSKYSRPELFSSYRYIYQNAPMDYLNDQR----TEYDL 283 Query: 270 KGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQI 311 R+++ + L+ + + T+LPE+E+ AE Y LRW I Sbjct: 284 ALRLLRIQ-LDDGSYENIATNLPEEEFKAEDFKALYHLRWGI 324 >UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 Tax=Streptomyces rishiriensis RepID=Q7BLZ8_9ACTO Length = 341 Score = 110 bits (275), Expect = 8e-23, Method: Composition-based stats. Identities = 51/282 (18%), Positives = 82/282 (29%), Gaps = 64/282 (22%) Query: 106 TSGKRLRLVDGTAISAPGGGSA-------------------EWRLHMGYDPHTCQFTDFE 146 G RL VDGT P + + RL + T E Sbjct: 1 YRGWRLVAVDGTTFDVPDTEANAAFFGRPGVSRGQEKSAYPQVRLAALAECGTHAVFAAE 60 Query: 147 LTD--SRDAERLDRF--AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL 202 + E R + T + +ADRGF R+ A AD + RV + + Sbjct: 61 AGPLAVHETELAQRLFGSLTPGMLLLADRGFRGFDLW-RAAAATGADLLWRVKNDAVLPV 119 Query: 203 TAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTR 262 + + + + P R+I +L Sbjct: 120 RTLLEDGSYLSEI--------------VAARDKNRRADPARVRVIEYTLG---------- 155 Query: 263 LLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL 322 + L+ T L A +A RW+IE +K+ L Sbjct: 156 --------------RDGSDTVYRLITTILDPKAAPAASLAALAAQRWEIESTLDEIKTHL 201 Query: 323 HLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPR 362 LR++ P A+ IFA LL + D++ + + Sbjct: 202 GGPRLVLRSQHPRGAEQEIFAFLLVHHALRDLMHQAAHQSEQ 243 >UniRef50_Q55566 Putative transposase for insertion sequence element IS4SA n=10 Tax=Synechocystis sp. PCC 6803 RepID=T4SA_SYNY3 Length = 338 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 51/328 (15%), Positives = 105/328 (32%), Gaps = 60/328 (18%) Query: 37 IRDAATL-LRLGLAYGPGGMSLREVTAWAQLHD-VATLSDVALLKRLRNAADWFGILAAQ 94 + + + LGL S+R + L +S + + R+ + I+ + Sbjct: 24 LDTFKFVSIWLGLVLDQSQTSMRSMFKRLNLRGETVDISTFSKASKKRDVGVFREIIFSL 83 Query: 95 TLAVRAAVTGCTSGKRLRLVDGTAISAPGG-----GSAEWRLHMGYDPHTCQFTDFEL-- 147 + + +D T +S G + ++ G + T + Sbjct: 84 KKELSKRKEIKQGELEIFPLDSTIVSITSKLMWNLGFHQVKVFSGINLSTGIPGGIVIHF 143 Query: 148 TDSRDAERLDR--FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE 205 D + + + + + DRGF R +++R+ Sbjct: 144 GQGHDNKYGNETIEETPENGVAVMDRGFCDLQRIKRLQKENNKYHVLRIKNN-------- 195 Query: 206 GMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLS 265 + K M+G NK S+ + + Sbjct: 196 ------------IKLEKLANDNYMVGTGKNKIE--------------------SRVVIFT 223 Query: 266 ENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD 325 + + R+V +E+ + S E++A+ Y+ RWQIEL +K LK L L+ Sbjct: 224 HDNSEFRLVTNLPIESKEI---------EGVSDEKIAEIYKKRWQIELLWKFLKMHLKLN 274 Query: 326 ALRAKEPELAKAWIFANLLAAFLIDDII 353 L AK I+ ++A ++ ++ Sbjct: 275 RLIAKNENAIGIQIYTCIIAYLILKLLV 302 >UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia sp. EAN1pec RepID=A8L1S1_FRASN Length = 425 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 68/374 (18%), Positives = 110/374 (29%), Gaps = 59/374 (15%) Query: 11 ILAHIGKPEELDTSARNAGALTRR--REIRDAATLLRLGLAYGPGGM-----------SL 57 +L + +D + G RR ++ T SL Sbjct: 29 VLVTAVPRDAVDEAVAACGVGARRAGGKLPPHVTAYLTLAMSLFPDDDYAEVAQKVTGSL 88 Query: 58 REVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCT---SGKRLRLV 114 W + S + R R + + A ++ G+ L + Sbjct: 89 DRFGCWDAAWAPPSASGIT-QARKRLGRMVMAEVFERVAGQVATLSTRGAWLRGRLLLAI 147 Query: 115 DGTAISAPGG-----------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAER-- 155 DG + P + R+ + T F E+ ER Sbjct: 148 DGFDVDVPDTEENAAEFGYAGTGEKRSAFPKIRVVALAECGTHAFRAAEVGGWAAGERTL 207 Query: 156 --LDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 DE+ ADRGF S + A AD I R + Sbjct: 208 ARGLLMRLNRDEVLTADRGFYSFDNWALA-AGTGADLIWRAPTGLNLPVVRV---LSDGT 263 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 FL L + G RL+A + ++ + L Sbjct: 264 FLTVLINP--------------EITGGRRRERLLAAAKAGDELDPDEAHLARVVEYD-IP 308 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH--LDALRAKE 331 +A V+L T L + A++VA Y RW+ E A +LK+ L LR++ Sbjct: 309 DRAGNGTGELVVVLTTILDPRQARADEVAAGYNERWEEETANDQLKTHLRGPGRVLRSRL 368 Query: 332 PELAKAWIFANLLA 345 P+LA ++A L+ Sbjct: 369 PDLAVQEMWAWLIV 382 >UniRef50_C6DY52 Transposase IS4 family protein n=1 Tax=Geobacter sp. M21 RepID=C6DY52_GEOSM Length = 394 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 62/354 (17%), Positives = 104/354 (29%), Gaps = 69/354 (19%) Query: 33 RRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT----------LSDV--ALLKR 80 R ++ L L + S RE+ + D A S A+ R Sbjct: 43 RPLQLSFEDQLKALIYFHLHEFSSGRELLQALEQDDFAKECVAPPKGIKKSAFFEAVNNR 102 Query: 81 -LRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP--------GGGSAEWRL 131 L A+ F +L A L +DG+ I A + + Sbjct: 103 GLEQLAELFKLLLK--DAKNVIPAEFADIGNLVAIDGSYIDAVMSMDWADYSSTHNKAKA 160 Query: 132 HMGYDPHTCQFTDFELTDSRDAERLDRFAQ-TADEIRIADRGFGSRPECIRSLAFGEADY 190 H+ +D + D LTD ER DE + DRG+ E + Sbjct: 161 HVAFDINRGIPKDLILTDGNQTERQFVERMIGPDETAVLDRGYQC-NANFDQWQENEKKF 219 Query: 191 IVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS 250 I R+ R + + E + V++G + A R++A Sbjct: 220 ICRIQARSNKKVIRENPIA--------RGSIIFYDAVVLLGAPSTR---AKKEVRVVAYR 268 Query: 251 LPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQ 310 + E + + +A Q+A+ Y+LRW Sbjct: 269 V----------------------------EGKDFWIATN---RHDLTALQIAEAYKLRWH 297 Query: 311 IELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSA 364 IE F K L + L A+ I + L+ L+ + + + Sbjct: 298 IESFFAWWKRHLSVYHLIARSQYGLTVQILSGLITYLLLA--MYCQREHNEPVS 349 >UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 Tax=Streptomyces avermitilis RepID=Q82R31_STRAW Length = 542 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 53/391 (13%), Positives = 103/391 (26%), Gaps = 75/391 (19%) Query: 7 NWSAILAHIGKPEELDTSARNAGALTRRREIRDAA---TLLRLGLAYGPGGMSLREVTAW 63 + + + D R GA R R + +L L L G + + + Sbjct: 25 HLGELTQQLPFELVDDVLERAGGAQHRLRLLPSRVGVYFVLALALFPQLGYVRVWDKLTA 84 Query: 64 AQLHDVATLSDVALLK--RLRNAADWFGILAAQTL---AVRAAVTGCTSGKRLRLVDG-T 117 + L+ R R +L A R DG + Sbjct: 85 GLRGILHRRPSEKALREVRRRLGVAPLRLLFETLAGPVAQPITPGVRYRCWRTVAFDGCS 144 Query: 118 AISAPGG-----------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFA 160 + AP G ++ + + T + + E Sbjct: 145 STKAPDRPRVCAWLGKHKHRYGTDGYPMLKIMVLCETGTRALLGAVFGPTPEKETGYAEQ 204 Query: 161 QTA----DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 + + DRGF S + + A A +VR+ Sbjct: 205 LLPLLDGGMLLLNDRGFDS-DDFLAKAAATGAQLLVRLKGTRTPA--------------- 248 Query: 217 GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA 276 A P + + + + + R+ Sbjct: 249 ---------------------RWALLPDGSFLTRINGTRLRVIDAHIAVTTAKGLRL--- 284 Query: 277 ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD-ALRAKEPELA 335 + L T Y A ++ + Y RW+IE AF L+ L LR+++ Sbjct: 285 ----EGHYRLATTLTDHRRYPAVELVELYHERWEIESAFYSLRHTLQCGLVLRSQDVAGI 340 Query: 336 KAWIFANLLAAFLIDDIIQPSLDFPPRSAGS 366 + ++A+L + + +++ P + Sbjct: 341 QQELWAHLTVYQALRRAMVEAVETLPGTDPD 371 >UniRef50_Q3M9Z5 Transposase, IS4 n=10 Tax=Cyanobacteria RepID=Q3M9Z5_ANAVT Length = 439 Score = 107 bits (267), Expect = 7e-22, Method: Composition-based stats. Identities = 54/391 (13%), Positives = 104/391 (26%), Gaps = 83/391 (21%) Query: 6 DNWSAILAHIGKPEEL--DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAW 63 + ++ L+ + P + R G R + + L G++ E+T Sbjct: 5 EVIASHLSALLTPAITSQENYYRQLGLRDRILNLPLMVAAVLTLLWRDVAGVT--ELTRM 62 Query: 64 AQLHDVA-----TLSDVALLKRL-----RNAADWFGILAAQTLAVRAAVTGCT------- 106 +S A+ +R + F L A Sbjct: 63 LAREGFLWCRPLEVSQQAISQRFLTFPAQLFEKVFKDLLPHLQASWQRRNQRKIPPSVQF 122 Query: 107 ---SGKRLRLVDGTAI--------SAPGGGSAEW--RLHMGYDPHTCQFTDFEL-TDSRD 152 +++ +VD + + S + ++ + + + R Sbjct: 123 TLTKFEKIWIVDCSILEALFQKLDSLKDAPQGQLAGKIGTVINLVNLLPVEIWFCENPRT 182 Query: 153 AERLDRF----AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMR 208 A+ T + + DRGF ++ +A ++I R+ + Sbjct: 183 ADTKFEADILNLVTPHTLLLLDRGFYHFNFWLQLIAQN-VNFITRLKKGAAIHVQQVFTD 241 Query: 209 FDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENR 268 ++ K RL+ + Sbjct: 242 -------------SFALRDRLVRLGSGTKKTTFITLRLVEIRS----------------- 271 Query: 269 RKGRVVQAETLEAAGHVLLLTS-LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDAL 327 LTS L + VAD YR RW+IE AF +K LL L L Sbjct: 272 ------------DKTWHSYLTSVLDPEVLPPYVVADLYRRRWRIEDAFNTVKRLLGLSYL 319 Query: 328 RAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 + ++ L ++ D+ D Sbjct: 320 WTGSVNGVQLQVWGTWLFYAVLVDLGDAVAD 350 >UniRef50_Q093Y3 Isrso13-transposase protein n=7 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093Y3_STIAU Length = 457 Score = 107 bits (266), Expect = 8e-22, Method: Composition-based stats. Identities = 67/375 (17%), Positives = 122/375 (32%), Gaps = 60/375 (16%) Query: 11 ILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVA 70 ++ E ++ + +E+ +A + + L SL A AQ + Sbjct: 23 LMQRALSAEWMEGLFQEHRQRQYTKELLFSAEVGLMELVALGLRPSLH---AAAQDSEEL 79 Query: 71 TLSDVALLKRLRN-AADWFGILAAQTLAVRAAVTGCTS--------GKRLRLVDGTAIS- 120 +S AL +++ + + L + + G R+R++DG ++ Sbjct: 80 KVSQQALYEKVNHTEPELVRALVQGSGERLTPIVKQLKLQQEPWAAGYRVRVLDGNKLAA 139 Query: 121 -------APGGGSAEWRLH--MGYDPHTCQFTDF-ELTDSRDAERLDRFA----QTADEI 166 G A + Y P D D+ ER E+ Sbjct: 140 SEKRLKPLRGFRGAAMPGQSLVVYAPEWDLVVDILPAEDAHAQERALMGPILERVQPGEL 199 Query: 167 RIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGET 226 +ADR F ++ + A ++VR H + + + G E Sbjct: 200 WLADRNFSTKNILFG-IEETGAAFLVREHAQT-----PHPKEVGTLKEVGRSKTGVVFEQ 253 Query: 227 TVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVL 286 V I G K+ R + V L T + Sbjct: 254 AVEIEAEGGKR----LALRRVEVHLDE-----------------------PTENGDTCIR 286 Query: 287 LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 LLT++P + A +VA+ YR RW IE F RL+S+LH + P A +++A Sbjct: 287 LLTNVPAERMGALEVAELYRRRWSIEGMFGRLESVLHSEVHALGHPRAALLAFGVSVMAY 346 Query: 347 FLIDDIIQPSLDFPP 361 ++ ++ + Sbjct: 347 NVLAVLLAAVEEEHH 361 >UniRef50_C3BTW8 Transposase for insertion sequence element IS231B n=13 Tax=Bacillus RepID=C3BTW8_9BACI Length = 387 Score = 105 bits (261), Expect = 3e-21, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 49/118 (41%) Query: 242 FPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQV 301 P R+I L E+ ++KG A + +G + +T+ P D Q+ Sbjct: 190 VPTRVIVHRLTKEQQQKRLQDQTVREKKKGMKYSARSKRLSGINVYMTNTPTDIVPMGQL 249 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 D Y LRWQIE+ FK KS ++ + + E + ++ L+A L + Sbjct: 250 HDWYSLRWQIEILFKTWKSFFYIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQL 307 Score = 95.7 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 65/188 (34%), Gaps = 20/188 (10%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATL-LRLGLAYGPGGMSLRE 59 ++ + L P L AR+ G + R + + + L + ++ SL + Sbjct: 9 VSDELQLFGQELQSFLSPHILRDLARDVGFVQRTSKYQAKDLVALCVWMSQNVATTSLTQ 68 Query: 60 VTAWAQLHDVATLSDVALLKRLRNAAD-WFGILAAQTLAVRAA------VTGCTSGKRLR 112 +++ + +S L +R +A + + A+ L + + KR+R Sbjct: 69 LSSCLEASTEVLISPEGLNQRFNKSAVQFLQHILAELLNQKLTSSMPISSPYTSVFKRIR 128 Query: 113 LVDGTAISAPG------------GGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFA 160 ++D TA P +A ++ + YD + QF + +R Sbjct: 129 ILDSTAFQLPDPFSFVYPGAGGCSHTAGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSL 188 Query: 161 QTADEIRI 168 + + Sbjct: 189 CVPTRVIV 196 >UniRef50_A7GMF1 Transposase IS4 family protein n=15 Tax=Bacillus RepID=A7GMF1_BACCN Length = 294 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 23/205 (11%), Positives = 59/205 (28%), Gaps = 42/205 (20%) Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 F + + + D G+ + ++ R+ + + + + Sbjct: 103 FVDDKECMYVFDSGYLDYER-FDHMTDKGYFFVSRLRKNAVTQVIEKFSFPKKAAVI--- 158 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 E +I + + R ++ Sbjct: 159 ---------------------------------SDEMIVIGIGTTQNRSENAFRFIKVLD 185 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 + LL + A+++A+ ++ RW IEL FK +K L++ + + Sbjct: 186 SKGNELHLLTN---RFDLGADEIAELHKSRWAIELFFKWMKQHLNIKKFYGQNEQAVHNQ 242 Query: 339 IFANLLAAFLIDDIIQPSLDFPPRS 363 I+ ++ L ++ R+ Sbjct: 243 IYIAMIVYCL--HVLAQLSSQSKRT 265 >UniRef50_A3ZQJ1 Probable transposase n=4 Tax=Blastopirellula marina DSM 3645 RepID=A3ZQJ1_9PLAN Length = 432 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 54/335 (16%), Positives = 105/335 (31%), Gaps = 49/335 (14%) Query: 35 REIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQ 94 R +R + +S++++ TLSD L + L Q Sbjct: 71 RTLRTIEDFSQT--QQVQRHLSIQKICR-------TTLSDFHRLVDPQRLEPILQALREQ 121 Query: 95 TLAVRAAVTGCTSGK-----RLRLVDGTAI-------------SAPGGGSAEWRLHMGYD 136 A + + R VDGT + + G ++ RL Sbjct: 122 LSRKEAGLGRAANDLSELLKRTVAVDGTFLEAAAEVAWAVRGSNQHGRENSYIRLDFQVG 181 Query: 137 PHTCQFTDFELTDSRDAERL-DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVH 195 + + + +E + + DRGF + R Sbjct: 182 VTSWAPEMIVVAEPGHSESASAAANVQDGRLYLYDRGFSGFDVINAHYHLQNESWTPRAQ 241 Query: 196 WRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEK 255 + +R+ A G + + + + + + + AR I + +P + Sbjct: 242 F-VIRYKPAGGNAPHLAD--ADENPLSEKDLAAGVVSDRRGRFRSSKAARHIVLDVPLRE 298 Query: 256 ALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAF 315 +++ + + V L+T+L + SAE +A Y+ RWQIEL F Sbjct: 299 V----------------IIEYQEQDETKTVRLITNL--LDVSAEVIAQLYQQRWQIELFF 340 Query: 316 KRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 + LK + + L + + ++AA L Sbjct: 341 RWLKCFANFNHLISHHRSGVLLSFYVAVIAALLTY 375 >UniRef50_A3EIG1 FOG: Transposase and inactivated derivatives n=3 Tax=Vibrio cholerae V51 RepID=A3EIG1_VIBCH Length = 264 Score = 104 bits (258), Expect = 7e-21, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 60/162 (37%), Gaps = 6/162 (3%) Query: 198 GLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKAL 257 L ++ + ++ + G LI +SL A Sbjct: 26 ALTLFDKGFYALGLLHRWQSQGKERHWLIPLRKGAQYKTLRKLGRGDGLIELSLT---AQ 82 Query: 258 ISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKR 317 K + + + R++ + ++ LL + Y +A+ Y RW+IEL ++ Sbjct: 83 AKKKWADAPDTLEARLITTK-VKGKEVQLLTSMTDPKRYIGADIAELYSHRWEIELGYRE 141 Query: 318 LKSLLHLD--ALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 +K + + LR+K P L K ++ LLA L+ ++ Sbjct: 142 MKQYMLQNSLTLRSKTPALVKQELWGMLLAYNLLRFMMCQMA 183 >UniRef50_C8PSK2 ISGsu1, transposase n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PSK2_9SPIO Length = 263 Score = 104 bits (258), Expect = 7e-21, Method: Composition-based stats. Identities = 34/302 (11%), Positives = 81/302 (26%), Gaps = 62/302 (20%) Query: 28 AGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT----------LSDVAL 77 + + L+ + A LR + S ++ Sbjct: 3 TQSDKYCKGFSTWQQLVTMCYAQIANPHGLRSLIDSINASGSCRYHLGIYKDLVRSTLSY 62 Query: 78 LKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAE--------- 128 + + F L K +D T IS Sbjct: 63 AN-NHRSPEVFKKLFYSLRDTLDRSARKKLRKDFYAIDATEISLNINDFPWATFRSAIGG 121 Query: 129 WRLHMGYDPHTCQFTDFELTDSRDAE--RLDRFAQTADEIRIADRGFGSRPECIRSLAFG 186 +++M YD + +T++ + E L+ + + D+G+ + + Sbjct: 122 IKINMKYDINNSVPDYLFMTNANEHENHTLNDMHLSKGDTATFDKGYCNY-STFGAFCEK 180 Query: 187 EADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARL 246 + ++ R+ + A + + Sbjct: 181 DIFFVTRLKENAKYTVIASRLTDSPLV--------------------------------- 207 Query: 247 IAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYR 306 + E + S + ++ + R V + + +++LT++ + SA +A YR Sbjct: 208 ----VSDETIIFSGKQTKTKCPYQLRKVVSIDEKTNKSIIILTNI--FDLSAADIAKLYR 261 Query: 307 LR 308 R Sbjct: 262 ER 263 >UniRef50_A5D1X0 Transposase n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D1X0_PELTS Length = 547 Score = 103 bits (257), Expect = 8e-21, Method: Composition-based stats. Identities = 63/422 (14%), Positives = 123/422 (29%), Gaps = 63/422 (14%) Query: 2 NYSHDNWSAIL-AHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREV 60 NW ++ I +L R A R+ + + + + S ++ Sbjct: 70 EAQIKNWGYVVYQKIWNQFDLPNLLRKISA-QRKVQFDLNNAAFLMAVQHLLEPRS--KL 126 Query: 61 TAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGK---RLRLVDGT 117 + H A+L +V+L L + D + L V + D T Sbjct: 127 GTYTHQHRYASLPNVSLNH-LYRSLDLL-WEHKELLEVEIFKKNHHLFNMQVDVVFYDVT 184 Query: 118 AISAPG------------GGSAEWRLHMGYDP---HTCQFTDFELTDSR--DAERLDRFA 160 S + + + +EL D + L++ Sbjct: 185 TFSFASVEADSLRNFGFSKDGKFNEVQVVLGLLIDCEGRPIGYELFPGNTFDGKTLEKAL 244 Query: 161 QTADE-------IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLT---------- 203 +E I +ADRG S+ R + G + + +T Sbjct: 245 VKLEERFGLRRVIIVADRGINSKLNLKRIVDRGYSYIFAARIKNMKKEITDEILSENGYQ 304 Query: 204 -----AEGMRFDMMGFLRGLD-CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKAL 257 E +R+ ++ +L G+ + + + + + A + + L Sbjct: 305 EINDGEEVIRYKVIEYLNEFTAEGQKYQLPEKLIVTYSSRRAEKDRADRERLIAKAQNLL 364 Query: 258 ISKTRLLSENRRKGRVVQAETLEAAGHVL--------------LLTSLPEDEYSAEQVAD 303 SK ++ + N+R G+ E +L E E SA + Sbjct: 365 ESKAKIQASNKRGGKKYLKEIDCTGTWILDEEAIAREEQFDGYYGIQTSEKEMSARDILA 424 Query: 304 CYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRS 363 Y W+IE +F+ +KS L + + K LA L + + Sbjct: 425 AYHNLWRIEESFRVMKSTLEVRPVFHWTERRIKGHFVICFLAFLLERTLEFKLRQAGENA 484 Query: 364 AG 365 + Sbjct: 485 SP 486 >UniRef50_Q18EK5 Probable transposase (ISH8/ISH26) n=5 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18EK5_HALWD Length = 417 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 53/390 (13%), Positives = 98/390 (25%), Gaps = 67/390 (17%) Query: 3 YSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTA 62 S D + L + + ++ A + + SL E+ Sbjct: 26 NSTDLVADELYSLLDEVDSESIADEFKIGLHSDKHDFTTHVKTAVREGLDPSSSLAELED 85 Query: 63 WAQLHDVAT---LSDVALLKRLRNAADWFGILAA-----QTLAVRAAVTGC--TSGKRLR 112 S + L R+ +L Q R + + Sbjct: 86 KTIADGSLEHMPKSRFSELTNDRDYCAVVQLLFEVLHTPQLYHQRGVQRKRLEWMTRDVV 145 Query: 113 LVDGTAISAPGG---------------------GSAEWRLHMGYDPHTCQFTDFELTDSR 151 VD T + G E D D +T+ Sbjct: 146 AVDATNLELTRSVVVSDEFVGDDDKVYKIDTDDGGLELHCAARVDGENKHPLDATVTEGD 205 Query: 152 DAERLDRFAQTADE---------IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWL 202 E D I + DR + + + D++ ++ L Sbjct: 206 THESPQFDLLKEDVEVFADLDSVIWVCDRAYTRYLR-FCEIKHSDNDFVTLMYSDARFEL 264 Query: 203 TAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTR 262 T F++ + + + E+ ++ Sbjct: 265 TETLEEFEVTVSGNNAAQPTHSDEES-------------------TRRVRDERIELA--- 302 Query: 263 LLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL 322 E + R + ET + L T L EY V + Y LR IE+ F+ K L Sbjct: 303 ---ETGEEFRRIVLETPDGEEIEYLTT-LASSEYDPIDVINIYTLRTVIEILFREWKQYL 358 Query: 323 HLDALRAKEPELAKAWIFANLLAAFLIDDI 352 +++ +K +F L+ L+ Sbjct: 359 NIENFHSKSLNGVLFELFCALIGYMLVVWF 388 >UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0I6N0_VIBHO Length = 345 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 58/343 (16%), Positives = 101/343 (29%), Gaps = 42/343 (12%) Query: 36 EIRDAATLLRLGLAYGPGGMSL----REVTAWAQLHDVATLSDVAL-LKRLRNAADWFGI 90 + R + LL A G ++L R + + D L RL + Sbjct: 20 KKRLQSLLLATESALGGADLTLTKLGRSLNTFTAAKHAIKRVDRLLGNTRLHREKEDIYK 79 Query: 91 LAAQTLAVRAAVT-------GCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFT 143 A+ +A R + + I+ G + Y Q+ Sbjct: 80 WNARLIAGANPCPVILLDWSDVREQLRFMTLRAS-IALDGRAVTLYEQAFEY----AQYN 134 Query: 144 DFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLT 203 + + + +A I I+D GF R R + ++ RV +T Sbjct: 135 SPKTHQYFLGKLQEILPPSATPIIISDAGF--RNTWFRQVQSKGWFWLGRVRGDVSIKMT 192 Query: 204 AEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS---LPPEKALISK 260 + ++ + K + +L S K Sbjct: 193 Q-----------------SDWQSNKTLYPDATSKPHSLGQCQLARRSPLTCNGYVVKQQK 235 Query: 261 TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKS 320 + S +K + A LL+T++P + +A Q+ Y R QIE AF+ LKS Sbjct: 236 AQRHSRTGQKHTASRLFAKNANEPWLLVTNIPTETLNAVQICRLYAKRMQIEEAFRDLKS 295 Query: 321 L---LHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 L L R + N + L + +I Sbjct: 296 TAYGLALRHNRTHHNRRLLSESANNFCLSGLSEILITQLSSHS 338 >UniRef50_UPI00016C37A0 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C37A0 Length = 334 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 47/335 (14%), Positives = 84/335 (25%), Gaps = 62/335 (18%) Query: 16 GKPEELDTSARNAGALTRRREIRDAATLLRLG----LAYGPGGMSLREVTAWAQLHDVAT 71 ++ + + G R T G +++ V AW + Sbjct: 20 LPESSIEPAIQEHGGGWRDEVFTPVVTPWAFLTQVICPVGCCRLAVARVLAWLVVRGEPP 79 Query: 72 --LSDVALLKRLRNAADWFGILAAQT---LAVRAAVTGCTSGKRLRLVDGTAISAPGG-- 124 K LA T L RA +G+R+ + DGT ++ P Sbjct: 80 CGPGTGGYCKPAPGCPRAIPQLARHTGRGLHDRAPGNWRWNGRRVLIADGTTVTMPDTPK 139 Query: 125 ---------------GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRF-------AQT 162 G + RL + D L SR + + + Sbjct: 140 NQNEYPHPGSQADGIGFPQIRLVALFCLACGAVLDAALGPSRGKQSGETALRRQIAGSVG 199 Query: 163 ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGK 222 + + +ADR + + D + R+H + +R Sbjct: 200 SGTVLLADR-YFGGWFDLVLWRERGIDVVTRIHQKRATDFRRGRRLGRDDHVVRW----P 254 Query: 223 NGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA 282 G+ + + R + V V Sbjct: 255 KGQRPEWMDRDTYVRLPDELDIREVRVR-----------------------VAQRGFRTR 291 Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKR 317 V+ T A + + YR RW IE+ + Sbjct: 292 VLVVATTLTDP-SIRATDLGERYRQRWSIEVDLRH 325 >UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 RepID=A9DPK2_9GAMM Length = 269 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 85/236 (36%), Gaps = 29/236 (12%) Query: 135 YDPHTCQFTDFELTDSRDAERL--DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIV 192 D + + ++ ++ER + D + + D G+ + C ++ I+ Sbjct: 1 MDLMSGHYNYLGISPDSESERHYNPFAYEIQDTLLLMDAGYFNIDYCYQA-DKHGGHVIM 59 Query: 193 RVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLP 252 R + + + A + +IG + Sbjct: 60 RTNGKINPDIKAAFDSQGLA-------------IEGLIGKKLKQLKWHR----------- 95 Query: 253 PEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIE 312 + + R++ + L+T+L ++SA++V+ Y LRWQIE Sbjct: 96 EQIIDLDVQWKSKPGTH--RLIAFWDRNKSAIGYLITNLKRAQFSADKVSKLYGLRWQIE 153 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEK 368 L FK LKS L ++ +A++ ++A++L L I + S + ++K Sbjct: 154 LFFKELKSYSGLKTFNTRDKSIAESLVWASMLTLLLKRFIARASGLIHQVTISTQK 209 >UniRef50_B0NZ84 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0NZ84_9CLOT Length = 244 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 66/202 (32%), Gaps = 24/202 (11%) Query: 179 CIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKA 238 + +++R+ G GL+ + E + + +K Sbjct: 2 IWLIYRKKDGFFLIRIKDGRN-------------GIKMGLELPRRNEFDLDVSLKLTRKQ 48 Query: 239 GAPFP--------ARLIAVSLPPEKALISKTR--LLSENRRKGRVVQAETLEAAGHVLLL 288 R IA S + + R+V+ E + +L Sbjct: 49 TNDVKKLLKDKNHYRYIASSATFDFLPSHSRKSEQTRFYEINFRIVRFEITPGN-YETVL 107 Query: 289 TSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 TSL ++Y +++ Y LRW E +F+ LK + + +K+ I+A+L+ Sbjct: 108 TSLDVNKYPPKELKRLYALRWGTETSFRDLKYTVGMLNFHSKKVMCIHQEIYAHLIIYNF 167 Query: 349 IDDIIQPSLDFPPRSAGSEKKN 370 + I + + K N Sbjct: 168 SEMITSHVAISKKKRLYTYKAN 189 >UniRef50_C5EN31 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EN31_9FIRM Length = 148 Score = 98.7 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 50/126 (39%), Gaps = 5/126 (3%) Query: 232 NSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSL 291 + R I ++P + S+ E R + RVV+ + E + ++T+L Sbjct: 23 PGKRLHPESEPLYRYICKAVPFDLITDSR----PEYRMQLRVVRFQIAEGG-YENIITNL 77 Query: 292 PEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDD 351 P DE+S EQ+ Y L W E +F+ LK + + + P+ + I + Sbjct: 78 PADEFSLEQIKHIYHLLWGQETSFRDLKHTIGTENFHSGSPKYIEFEILCRMTLYNFCTI 137 Query: 352 IIQPSL 357 I Sbjct: 138 ITMEVP 143 >UniRef50_A4BSI0 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BSI0_9GAMM Length = 406 Score = 98.7 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 54/362 (14%), Positives = 102/362 (28%), Gaps = 73/362 (20%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT 71 + LD + R+R L S + + A Sbjct: 28 FKLLTSEALLDRVEQGLPRGHRKRLYPPTRALSLFLAQALTADRSCQNIVNQAA------ 81 Query: 72 LSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRL 131 + A + T G L G +PG G RL Sbjct: 82 --------------------VERLAGGLATGSTHTGGYCLARQRG---QSPGLGFPIGRL 118 Query: 132 HMGYDPHTCQFTDFELT-------DSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLA 184 + + + + + R + + +I I D F + I ++ Sbjct: 119 VGITYLASGALLNAAIGRFQGKGGNEQTLLRSMQESFAPGDILIGD-AFFATYFFIAAMQ 177 Query: 185 FGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPA 244 D ++ H + ++G+ Sbjct: 178 AKGVDILMEQHGSR-----------------KRSTDFRHGQ-------------HLGPRD 207 Query: 245 RLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADC 304 +I + P ++ + + L+A G +L+ T + + Sbjct: 208 HVIVIHKPKKRPQWMSETEYAAAPA---TLTLRELKAGGKLLVTTLRCPNTAPKGALKAL 264 Query: 305 YRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS---LDFPP 361 Y+ RW +EL + +K L +D L K P++ + I+ LLA LI ++ S D P Sbjct: 265 YQSRWHVELDIRHIKETLGMDVLSCKTPDMTRKEIWVYLLAYNLIRLMMVQSARLADIAP 324 Query: 362 RS 363 R+ Sbjct: 325 RT 326 >UniRef50_B6FLV1 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FLV1_9CLOT Length = 135 Score = 98.0 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 25/100 (25%), Positives = 45/100 (45%), Gaps = 1/100 (1%) Query: 264 LSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH 323 RV++ E+ ++T+L E+++ +++ Y RW IE +F+ LK + Sbjct: 8 NPFYTLHFRVLRFPITES-TMECIITNLEEEDFPMKEIKKLYEWRWGIERSFRELKYTIG 66 Query: 324 LDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRS 363 L AK+ E IFA L+ + II + +S Sbjct: 67 LTNFHAKKVEYILQEIFARLIIYNFCERIITKIVIQQKKS 106 >UniRef50_UPI0001AF03EF IS4 family transposase n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF03EF Length = 374 Score = 95.3 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 69/228 (30%), Gaps = 49/228 (21%) Query: 139 TCQFTDFELTDSRDAE----RLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRV 194 T + AE R T D + +ADR F E + ++A A ++VR Sbjct: 10 TRALIAAAFGPAVKAESDYARELTGHLTPDMLLLADRAF-DGNELLAAIARQGAQFLVRC 68 Query: 195 HWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPE 254 + ++ + R+I + Sbjct: 69 TSTRRPPVL------------------------ALLPDGSYLTRIGNLSLRVIEAKV--- 101 Query: 255 KALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELA 314 + R V T + LL T A ++ Y RW+IE A Sbjct: 102 ---------------EARTVDGSTF-GDAYRLLTTLTDHRTDPAARLMRLYHERWEIETA 145 Query: 315 FKRLKSL-LHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 + L+ L LR+K+P ++ L + I+ +++ P Sbjct: 146 YLALRHTLLQGRVLRSKDPVGLCQEVWGLLTLYQALRSIMVTAVETEP 193 >UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobacteria RepID=Q15UH5_PSEA6 Length = 420 Score = 95.3 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 51/336 (15%), Positives = 104/336 (30%), Gaps = 42/336 (12%) Query: 38 RDAATLLRLGLAYGPGGMSLREVT-----AWAQLHDVATLSDVALLKRLRNAADWFGILA 92 R +A ++ +SL E+ + A H++ + + L N Sbjct: 40 RLSALMVATQSLLDGQQLSLTELGRNISGSVAPKHNIKRIDRLLGNNNLHNERLDIYRWH 99 Query: 93 AQTLAVRAAVTGCTSGKRLRLVDGTAIS-------APGGGSAEWRLHMGYD-PHT-CQFT 143 A+ L + + LVD + + S + R Y+ + ++ Sbjct: 100 ARLLCGANPMP-------VVLVDWSDVREQLRHLTLRASVSVQGRSVTLYERVFSFGEYN 152 Query: 144 DFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLT 203 + E + + D G+ R R + ++ RV Sbjct: 153 SPVSHNPFLRELASILPLGCCPLIVTDAGY--RNPWFREVEKHGWFWLGRVRGDVGFKRD 210 Query: 204 AEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRL 263 + F + +G +P A L +K R Sbjct: 211 GQASWQSNKSFYPSANSRAKYLGCGQLGRK------SPLHAHLHLYKA------KAKHRK 258 Query: 264 LSENRRKGRVVQAETLEAAG----HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK 319 + + + GR A+ AG +L D+ +++Q+ Y R QIE F+ +K Sbjct: 259 DNRSSKAGRNHTAQQSYRAGSKEPWLLATNLPENDKLNSKQLVSLYARRMQIEETFRDIK 318 Query: 320 S---LLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 S + L ++ + + +LA +L+ + Sbjct: 319 SPQYGMGLRHSNSRCTKRFDILLLIAMLAEWLLRLL 354 >UniRef50_Q8QNB6 EsV-1-170 n=2 Tax=Ectocarpus siliculosus virus 1 RepID=Q8QNB6_ESV1 Length = 383 Score = 95.3 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 61/368 (16%), Positives = 106/368 (28%), Gaps = 88/368 (23%) Query: 16 GKPEELDTSAR-NAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSD 74 PE D + + + RRR++ ++ L G R V ++ D A S Sbjct: 5 MSPEIYDAFKKCDQQWMQRRRKMDTSSLFYTLTRCCVQG----RGVNHVLKMEDEAYSSQ 60 Query: 75 VALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGG---------- 124 R + F + + R+ VDG+ + Sbjct: 61 AVHSARKKLPMGAFKEV-------NRFLHRGPHEPRVFAVDGSKVHVHPSFINAGYKTRT 113 Query: 125 ---------GSAEWRLHMGYDPHTCQFTDFELT---DSRDAERLDRFAQTADEIRIADRG 172 L D T DFELT + R A + + + DRG Sbjct: 114 NDQPVSRPAKRPLVMLSSMVDVKTKACIDFELTKHFNERRAATSMLRSVQKGDTLLFDRG 173 Query: 173 FGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGN 232 + S+ + S+ A + R+ R + Sbjct: 174 YYSKD-LLHSVHGSHAFGVWRLKIDAFRGTRSFFNS----------------------CR 210 Query: 233 SGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLP 292 + ARL+ ++ +V L T Sbjct: 211 TEATCLILGVKARLLKY----------------------------FIDGKTYVCLTTDPS 242 Query: 293 EDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 S ++ Y RW++E +FKRLKS L L+ A+ P+L + A +L + + Sbjct: 243 ---LSRLKIKTMYASRWRVEESFKRLKSNLRLEKAHARTPDLYIQEVEARVLLDTITLRM 299 Query: 353 IQPSLDFP 360 + + Sbjct: 300 QGSTKESS 307 >UniRef50_A7C2A8 Transposase of IS641 n=1 Tax=Beggiatoa sp. PS RepID=A7C2A8_9GAMM Length = 304 Score = 94.9 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 74/230 (32%), Gaps = 40/230 (17%) Query: 130 RLHMGYDPHTCQFTDFELTDSRDAERLDRFA-QTADEIRIADRGFGSRPECIRSLAFGEA 188 +LH+ ++ + +F +T + +ER A IADRG+ S L A Sbjct: 14 KLHLCFELNRMLAVEFLVTAANFSERAALIKMLKAGVTYIADRGYMSFKVGDEVLKAK-A 72 Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIA 248 ++ RV + + + P I Sbjct: 73 HFVFRVKTG------------------------------LRLTVTKTLLVQLPKTVAAIF 102 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 ++ E E + R+V G + + QV Y LR Sbjct: 103 NNVTDELI----RYTNDEFKHIYRLVCFSI----GFDQFHILTDRHDLTTFQVIMLYALR 154 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 WQIEL F+ LK ++ L ++ + L+AA L + Q + D Sbjct: 155 WQIELLFRFLKRTINGIHLIKQDERGVTIQFYTMLIAALLELRLKQMTAD 204 >UniRef50_UPI0000F70487 putative IS4 transposase n=1 Tax=Aeromonas salmonicida subsp. salmonicida RepID=UPI0000F70487 Length = 168 Score = 94.9 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 44/112 (39%), Gaps = 3/112 (2%) Query: 258 ISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKR 317 + + + + R++ T+ +L + + + + + Y RW+IEL ++ Sbjct: 5 QRRCSTIRISYLEARLLT-RTINGKERQVLTSMVDPMRFPGADIVELYGHRWEIELGYRE 63 Query: 318 LKSLLHLDA--LRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSE 367 +K L LR+K+ + ++ LLA L+ + S+ Sbjct: 64 MKHCLQQHRLTLRSKKAAGIRQELWGVLLAYNLLRSQMVKMAASLKGYTASQ 115 >UniRef50_A4A0C3 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A0C3_9PLAN Length = 445 Score = 94.5 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 55/289 (19%), Positives = 84/289 (29%), Gaps = 62/289 (21%) Query: 86 DWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLH------------- 132 D ++A G + VDG+ AP + E L Sbjct: 110 DEVRRQLRSSVAGWLEQYRVF-GWVVMAVDGSRFEAPRTRANEAGLGCAGREKTTPQIYQ 168 Query: 133 -MGYDPHTCQFTDFELTDSRDAERLDRFAQTAD----EIRIADRGFGSRPECIRSLAFGE 187 T DF + +ER D + IAD GF S C R L G Sbjct: 169 TTLQHVGTSLPWDFRIGPGTASERRQLDEMLPDLPGKSLLIADAGFISYDLC-RVLLMGR 227 Query: 188 ADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLI 247 D+++RV L L P I Sbjct: 228 HDFLLRVGGNT--------------HLLEKLGFACE--------TRERTVYLWPLRFHAI 265 Query: 248 AVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRL 307 + V++ E V L+T++ + E + YRL Sbjct: 266 -----------------PPVVLRLIVLRDANKE---PVYLVTNIDSESLPEEIASQIYRL 305 Query: 308 RWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 RW +E F+ LK L D + ++ P A A + A+L+ + + Sbjct: 306 RWGLETHFRGLKQTLGRDRVLSRTPATALAEQGWLNIGAWLLQLMTTAA 354 >UniRef50_Q82R33 Putative IS4 family ISFsp6-like transposase n=1 Tax=Streptomyces avermitilis RepID=Q82R33_STRAW Length = 333 Score = 91.8 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 58/202 (28%), Gaps = 46/202 (22%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 + + D+GF + ++ A ++ R+ + + + + Sbjct: 28 LVLWDKGF-DANAFLAAVHDTGARFLGRLRANRRTPVLSRLTDGSYLSVI---------- 76 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 P R++ + S + Sbjct: 77 --------------GTVPVRVVEAQITVIYDDCSF--------------------TNSYR 102 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL-HLDALRAKEPELAKAWIFANLL 344 L+ T Y A + Y RW+ E A+ L+ + LR+ +P + ++A L Sbjct: 103 LVTTLTDARRYPAPALVALYHQRWEHESAYFALRHTITDGRVLRSGDPVGVEQEMWALLA 162 Query: 345 AAFLIDDIIQPSLDFPPRSAGS 366 + ++ + + P + Sbjct: 163 LYQALRTVMVEAAESRPGTDPD 184 >UniRef50_A3ZMM8 Transposase insG for insertion sequence element-like protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZMM8_9PLAN Length = 464 Score = 91.8 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 55/364 (15%), Positives = 106/364 (29%), Gaps = 75/364 (20%) Query: 37 IRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT---------------LSDVALLKRL 81 A T+ L L GG SL+ V + A H S + R Sbjct: 47 YTQAVTIWLLILQRLRGGASLQTVVSEAVEHQADLFPDNKRVHEGTLGENTSSFS-AARK 105 Query: 82 RNAADWFGILAAQTLAVRAA-VTGCTSGKRLRLVDGTAISAPGGG--------------- 125 R D + V +R+ ++DGT I+ P Sbjct: 106 RLPLDAIERFSRCVCDHLGRTVEPVFDDRRVFIIDGTTITLPPTPVLKKAFPPATNQLGE 165 Query: 126 --SAEWRLHMGYDPHTCQFTDFEL----TDSRDAERLDRFA----QTADEIRIADRGFGS 175 L + + T ++ + +E + I +AD F Sbjct: 166 TVWPVAMLMVAAEMQTGCILVPKIDPMYGPNNSSEAKQAREIVGDLPSRSIVLADSCF-G 224 Query: 176 RPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGN 235 D++ R+ + L A + +++ G ++ Sbjct: 225 IFSVAHHTRAAGHDFLFRL---SMLRLKAHRKKAELID----QGEGYKSCRLTWRPSAKE 277 Query: 236 KKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDE 295 + PA + + + E + A L+ + Sbjct: 278 RNTNPDLPA---------------------DASLDVFLHEVELEDGATLALVTS----MS 312 Query: 296 YSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 + A +A+ YR R+ IE + LK + + LRAK E+ + +++A L+ + + Sbjct: 313 FDALCLAELYRRRYDIEFDIRDLKVTMDTENLRAKSVEMVMKELMGSVIAYNLVSQLRRG 372 Query: 356 SLDF 359 + Sbjct: 373 AAKL 376 >UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula baltica RepID=Q7UPU9_RHOBA Length = 656 Score = 91.4 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 52/377 (13%), Positives = 98/377 (25%), Gaps = 96/377 (25%) Query: 36 EIRDAATLLRLGLAYGPGGMSLREVTAWAQLH------DVATLSDVALLKRLRNAADWFG 89 + +++ L L++ +A S L+K L W Sbjct: 119 GWKTMTLMIQALLWIFSDKDKLKDAFDSGTRQCKKVHGRIAFSSYSGLIKALVRWTPWLS 178 Query: 90 ILA----AQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAE----------------- 128 + + + A T+G + VDG+ + P S E Sbjct: 179 EVLLTRIHKQIETTAGKLWRTTGWVVMAVDGSRDTTPRTLSNEKAFCAPNHGHGKTARYR 238 Query: 129 -----------------------------WRLHMGYDPHTCQFTDFELTDSRDAERLDRF 159 W + + E L+ Sbjct: 239 KKKTKGMRRQAIEKNPPAPPVPQIWITMIWHVATQLTWCWKLGPSNASERAHVQEMLENG 298 Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 + D GF E +S+ G ++VRV ++ L Sbjct: 299 EFPEKTLFTGDAGFVGY-EFWKSIIDGGHHFLVRVGANVN-----------LLHSLGYDV 346 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 ++ P R+I + + + + Sbjct: 347 EPDEDNLVYCWPKDKRREGMRPLKLRMI-------QIQLGRKKA---------------- 383 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 VLL + L E + + +Q Y+ RW IEL F+ LK LR ++ A + Sbjct: 384 -----VLLTSVLDEKKLTDKQALVIYKSRWGIELEFRNLKQTYGRRQLRCRQSVRALVEL 438 Query: 340 FANLLAAFLIDDIIQPS 356 ++L+ ++ Sbjct: 439 HWSILSILIVKLYALKV 455 >UniRef50_C8XGG2 Transposase IS4 family protein n=2 Tax=Nakamurella multipartita DSM 44233 RepID=C8XGG2_NAKMY Length = 457 Score = 91.4 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 72/241 (29%), Gaps = 22/241 (9%) Query: 137 PHTCQFTDFELTDSRDAERLDRFAQT--ADEIRI-ADRGFGSRPECIRSLAFGEADYIVR 193 + D E + R Q +I + AD GF AD Sbjct: 186 LRRGRSNDATAAPLFINETISRLRQAGATGQIVLRADSGFYLHDVVAAC---RAADVRFS 242 Query: 194 VHWRGLRWLTAEGMRFDMMGF--LRGLDCGK---NGETTVMIGNSGNKKAGAPFPARLIA 248 + R + L + + + G T ++ + P RL+ Sbjct: 243 IGARMIGHLRGQIEAIPDEQWQPIEYFLPGAGVAEIPYTPFAQDTHGRDRTDTVPLRLMV 302 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 PP +A + ++ QA + ++T D E R Sbjct: 303 RRTPPTQAQVRNRGQDTD--------QAALFPVYDYHPIITDRDGDLRDLEADH---RRH 351 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEK 368 ++EL + LK + L L K AW+ N +A L I + D R + + Sbjct: 352 AEVELTIRDLKHGMGLAHLPTKSFGGNAAWLILNTIAHNLTRWITRLGFDQGHRMTKNIR 411 Query: 369 K 369 + Sbjct: 412 R 412 >UniRef50_B2PVI2 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PVI2_PROST Length = 144 Score = 90.7 bits (223), Expect = 7e-17, Method: Composition-based stats. Identities = 22/100 (22%), Positives = 44/100 (44%), Gaps = 3/100 (3%) Query: 268 RRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL--HLD 325 + R++ + + G +L + Y +AD YR RW+IE F+ +K + + Sbjct: 2 TLRARLIT-KNINGKGVQILTSMSEPLRYPKADIADLYRHRWEIEHGFREMKQHMLNNEL 60 Query: 326 ALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAG 365 LR+K+P L ++ +LA L+ ++ + Sbjct: 61 TLRSKKPALVNQELWGIVLAYNLLRFMMAQMAYSLKDTEP 100 >UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae RepID=Q6LJK0_PHOPR Length = 394 Score = 90.7 bits (223), Expect = 7e-17, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 57/190 (30%), Gaps = 17/190 (8%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 I + D GF R R + + Y+ RV + + + G Sbjct: 156 IIVTDAGF--RNTWFRQVDDMDWCYLGRVRGDVNVLIKNQWQHIKQLFIKANSKPKYVGF 213 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 T + P L K + + R A Sbjct: 214 TQL--------AKRKPLQCHLHLYKK----QTPKKRKDRPKGREHFSAQAVHKKSALEPW 261 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKS---LLHLDALRAKEPELAKAWIFAN 342 +L T+LP D +S+ + Y R QIE F+ LKS L R +P+ + Sbjct: 262 VLATNLPTDIFSSRCIVRLYTKRMQIEETFRDLKSPQYGFGLRQSRTHDPKRFDILLLIG 321 Query: 343 LLAAFLIDDI 352 LLA + Sbjct: 322 LLAFMVYWWF 331 >UniRef50_Q67PW6 Transposase-like protein n=14 Tax=Symbiobacterium thermophilum RepID=Q67PW6_SYMTH Length = 552 Score = 89.5 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 48/289 (16%), Positives = 88/289 (30%), Gaps = 38/289 (13%) Query: 77 LLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYD 136 + R+ + A +A V + +S G + +L + Sbjct: 237 IEARIGRPLRRVEWVEACLAQQKARVRQLYQQLQ-------TVSGKGSARRKQKLQREFQ 289 Query: 137 PHTCQFTDFELTDSRDAERLDRFAQTADEIRI-ADRGFGSRPECIRSLAFGEADYIVRVH 195 + R + +R I + AD F PE I+ L ++ ++ + Sbjct: 290 EEVQHLREVN-QRLRQYRQENRTNLAPLRILLRADSAF-GTPEVIQRLLELGYEFTIKSY 347 Query: 196 WRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEK 255 ++ E + G + AP+P RL+A+ Sbjct: 348 SGSNVAYKHLFDAVPAENWVEVEKNRFASEAVTVPGPT----LLAPYPVRLVAMR----- 398 Query: 256 ALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAF 315 GR V+ ++LT+L +E + +V Y R IE F Sbjct: 399 ----------RWDADGREVR---------SVILTTLQPEELTTTEVVKLYHGRQTIEAGF 439 Query: 316 KRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSA 364 + K H R ++ E A+ L A L+ + P+ A Sbjct: 440 QEWKGTFHFGTPRLRKYEANAAFTQLVLFAFNLVRWAWRFLSTNSPKLA 488 >UniRef50_UPI00016C3BAC transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3BAC Length = 218 Score = 89.5 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 40/85 (47%) Query: 273 VVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEP 332 V+ V+ T Y+ E +A Y RW++EL + +K L +D L K P Sbjct: 21 RVKRPGYRTREIVVATTLTDATAYTREDLAQLYHHRWRVELWIRDIKQTLAMDVLGGKTP 80 Query: 333 ELAKAWIFANLLAAFLIDDIIQPSL 357 E+ + I+ +LLA ++ +I + Sbjct: 81 EMLRREIWCHLLAYNVVRHVIAQAA 105 >UniRef50_C3FBK7 Transposase for insertion sequence element IS231B n=3 Tax=Bacillus thuringiensis RepID=C3FBK7_BACTU Length = 180 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 24/99 (24%), Positives = 43/99 (43%) Query: 258 ISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKR 317 + + +KG ++ G + +T+ P + EQ+ D Y LRWQIE+ FK Sbjct: 2 ERRKKQSYTESKKGITFSEKSKRLTGINIYVTNAPWEVVPMEQIHDFYSLRWQIEIIFKT 61 Query: 318 LKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 KSL + + + E + ++ L+A L + Sbjct: 62 WKSLFQMHHWQTIKQERLECHVYEKLIAILLCFSTMFQM 100 >UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii DJ RepID=C1DIQ1_AZOVD Length = 400 Score = 88.7 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 57/303 (18%), Positives = 90/303 (29%), Gaps = 38/303 (12%) Query: 58 REVTAWAQLHDVATLSDVALLKR-LRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDG 116 R + A D L R L+ F + + L L LVD Sbjct: 46 RSLPGSAWPRHAIKRIDRLLGNRQLQAERGLFYWVMLRALLGSFRHP-------LILVDW 98 Query: 117 TAISAPGG----------GSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEI 166 + I A G + P + Sbjct: 99 SPIDAAGKLFLLRAALPLAGRSLPVCEVVHPREGCPRC---QKRLLEALAAMLPADCRPV 155 Query: 167 RIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE-GMRFDMMGFLRGLDCGKNGE 225 + D GF +++ Y+ RV R L L + + L + G Sbjct: 156 LVTDAGFQR--PWFQAVEIRGWHYVGRVRNRDLCRLGEQPWGPVKSLYALASASPKRLGC 213 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 + AP+ +L V P + ++ R E Sbjct: 214 VEM--------TRSAPWSTQLCVVKHAPRGRQHRRITGTLARDKRSRQSAQRESE---PW 262 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELAKAWIFAN 342 LL ++LPE +++A QV YR R QIE F+ LKS + L R++ P + + Sbjct: 263 LLASNLPEAQWNAAQVVAIYRRRTQIEEGFRDLKSHRLGIGLGLHRSRCPRRIEILLLIA 322 Query: 343 LLA 345 +LA Sbjct: 323 VLA 325 >UniRef50_D1VZM3 Transposase, IS4 family n=3 Tax=Prevotella RepID=D1VZM3_9BACT Length = 511 Score = 88.7 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 68/228 (29%), Gaps = 18/228 (7%) Query: 133 MGYDPHTC-QFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYI 191 + Y Q+ F + D + RF D + +AD G ++ G + Sbjct: 223 LSYSLFNGSQYEGFTMIPMID-DFKQRFTLGDDFVIVADSGLMNKNNVALLQNAGYKYIL 281 Query: 192 VRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAV-- 249 + + D +NGE ++ + K A R IA Sbjct: 282 GARIRNERNNIRQWILSLDKKDNASYEMYRQNGERLIVCYSERRAKKDAYNRTRGIARLR 341 Query: 250 ------SLPPEKALISKTRLLSENRRKGRVVQ-----AETLEAAGHVLLLTSLPEDEYSA 298 + ++ E + V E + G +T+ E A Sbjct: 342 KAYKSGRITKQQVNKRGYNKFLEISKDIEVTISQEKIEEDCKWDGWKGYITNT---ELDA 398 Query: 299 EQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA 346 E+V Y W +E +F+ K L + + +A + +A Sbjct: 399 ERVIAQYHGLWVVERSFRISKGTLEMRPMFHFTERRIEAHVCICFIAY 446 >UniRef50_Q64E61 Transposase n=1 Tax=uncultured archaeon GZfos14B8 RepID=Q64E61_9ARCH Length = 622 Score = 87.6 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 47/351 (13%), Positives = 97/351 (27%), Gaps = 65/351 (18%) Query: 46 LGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGC 105 L + ++RE TL L + L + + + Sbjct: 244 FSLGFKIKNKNVREFLGIEAAAHCTTL----YLNFDNFDPEKLERLNEELV--KRFWEHR 297 Query: 106 TSGKRLRLVDGTAISAPGGGSAE--------------WRLHMGYDPHTCQFTDFELTDSR 151 + +D + G ++L+ ++ + F++ Sbjct: 298 RRKTGIIGIDSMLLEIFGDYEGAEVGWDHVNNKSVHCYKLYAAFELKSNYPVCFKIEPGN 357 Query: 152 DA------ERLDRFAQTAD----EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRW 201 + E +R + EI + D+GF + + + L + Sbjct: 358 TSDSTMLVEMCERAKKVVGKENIEIVMFDKGFYN-AKSFNKI------------KGDLTF 404 Query: 202 LTAEGMRFDMMGFLRGLDCGKNGET--TVMIGNSGNKKAGAPFPARLIAVSLPPEKALIS 259 T +M + G++ K +T I + G RLI V +A Sbjct: 405 NTPAKKYKTIMDAIAGIEPEKFKQTGYNRWISETRVALEGYDGKLRLIVVKKVEPRAKKD 464 Query: 260 KTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK 319 K + + LT+ Y RW+IE FK L+ Sbjct: 465 KETGEKSWTME-----------DVYYSYLTN--NKTLGTIDAPKLYSKRWRIENFFKELR 511 Query: 320 SLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKKN 370 + ++ + + ++ +A I ++ G E +N Sbjct: 512 NHWNIRNFPSTSLDAVRSH-----IALLFIQFMVLSLFKHY--VLGGEYRN 555 >UniRef50_B2IXJ5 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2IXJ5_NOSP7 Length = 238 Score = 87.2 bits (214), Expect = 8e-16, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 36/80 (45%) Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 + L+ T L YS + Y RW +E+ + LK+ L +D LR K P Sbjct: 113 IVVPGFRTEQVSLITTLLDITTYSTLDIVGLYGKRWDVEIDLRHLKTTLGMDVLRCKTPS 172 Query: 334 LAKAWIFANLLAAFLIDDII 353 + + I+ LLA L+ ++ Sbjct: 173 MVRKEIYVYLLAYNLLRGLM 192 >UniRef50_C6JAL6 Transposase (Fragment) n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JAL6_9FIRM Length = 237 Score = 86.4 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 67/244 (27%), Gaps = 57/244 (23%) Query: 135 YDPHTCQFTDFELTDSRDAERLDRFAQTA---------DEIRIADRGFGSRPECIRSLAF 185 YD + +ER + I I DRG+ S + R Sbjct: 20 YDVLDDYILHASIHKFLSSERAAALEHLKVLEDMGLYNNSIIIFDRGYYS-EDMFRYCVE 78 Query: 186 GEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPAR 245 ++R+ G DM+ L+G + + Sbjct: 79 HGHLCVMRLKEGINLSKKCNG---DMISILQGTSKEGTSDVPIR---------------- 119 Query: 246 LIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCY 305 V L+ L T+L + + + + Y Sbjct: 120 ----------------------------VLEIPLDDGTKEYLATNLFDPAVTKDMFRELY 151 Query: 306 RLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAG 365 RW +EL +K LKS ++ + + N+L + L I + + SA Sbjct: 152 FYRWPVELKYKELKSRFAMEEFSGATAVSIQQEFYINMLLSNLASLIKNEADEEIQISAK 211 Query: 366 SEKK 369 S K Sbjct: 212 STNK 215 >UniRef50_Q2J8F5 Putative uncharacterized protein n=3 Tax=Frankia sp. CcI3 RepID=Q2J8F5_FRASC Length = 451 Score = 86.0 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 63/371 (16%), Positives = 100/371 (26%), Gaps = 66/371 (17%) Query: 4 SHDNWSAI--LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSL---- 57 +W ++ L + +D + GA RR + ++ + Sbjct: 11 QLTDWISLGVLTSFVPRDAVDEAIEATGAGARRSDTTIPPQVVAYFVMALALFADDDYET 70 Query: 58 --REVTAWAQLHDVA-----TLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSG-- 108 R + A DV S R R A L Q A + + Sbjct: 71 VARRLAATLTDLDVVGPRWEPTSSGLTKARQRLGAAPLAELFGQVAGPVADLDTVGAFLS 130 Query: 109 -KRLRLVDGTAISAPGGGSA------------------EWRLHMGYDPHTCQFTDFELTD 149 RL +DG AP + R + + Sbjct: 131 RWRLMSIDGLEWDAPASKENIAAFGLPAGRVDAPGVLPKVRAVTVSECASHAPVLAAFGP 190 Query: 150 SRDAERLDRFAQTADE--------IRIADRGFGSRPECIRSLAFGEADYIVRVHWR-GLR 200 + A+ A + +ADR F S + A A + RV L Sbjct: 191 AGGAKPASEQALARTVYPRLASDWLLLADRNFYS-WADWCTAADTGAALLWRVKATLRLP 249 Query: 201 WLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNK-KAGAPFPARLIAVSLPPEKALIS 259 L A + + GK ET V +G RL+ +P + Sbjct: 250 PLRALSDGSYLTVLVNPKVTGKARETLVTAARAGAPLDPTKARYTRLVEYDVPDREGDGK 309 Query: 260 KTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK 319 LL T E +A +A YR RW+ E+A + K Sbjct: 310 HEITG---------------------LLTTICDPREATATALAGAYRQRWEHEVAIEDAK 348 Query: 320 SLLHLDALRAK 330 L+ + R + Sbjct: 349 QLVGVGQARNR 359 >UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE Length = 402 Score = 85.3 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 43/317 (13%), Positives = 86/317 (27%), Gaps = 35/317 (11%) Query: 58 REVTAWAQLHDVATLSDVALLKR-LRNAADWFGILAAQTLAVRAAVT-------GCTSGK 109 R + A+ D L R L A + + K Sbjct: 46 RNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQK 105 Query: 110 RLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIA 169 RL ++ + ++ G + P + Q + + D A+ + ++ Sbjct: 106 RLMVLRAS-VALHGRSVTLYEKAF---PLSEQCSK-KAHDQFLADLASILPSNTTPLIVS 160 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLR--WLTAEGMRFDMMGFLRGLDCGKNGETT 227 D GF + +S+ ++ RV + + + G Sbjct: 161 DAGF--KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG--- 215 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 P +++ + + R + A+ Sbjct: 216 -----YKRLTKSNPISCQILLYK-----SRSKGRKNQRSTRTHCHHPSPKIYSASAKEPW 265 Query: 288 L--TSLPEDEYSAEQVADCYRLRWQIELAFKRLKS---LLHLDALRAKEPELAKAWIFAN 342 + T+LP + + +Q+ + Y R QIE F+ LKS L L R E + Sbjct: 266 VLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 Query: 343 LLAAFLIDDIIQPSLDF 359 L+ + Sbjct: 326 LMLQLTCWLAGVHAQKQ 342 >UniRef50_A7C4E9 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C4E9_9GAMM Length = 216 Score = 85.3 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 61/187 (32%), Gaps = 27/187 (14%) Query: 179 CIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKA 238 + + ++ R+ + + ++T ++ ++K Sbjct: 1 MLWEIDNIGGFFLSRIKSKTVIYITE------------------------IVQGKISQKY 36 Query: 239 GAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSA 298 + + + + +L + R + + +T+L +A Sbjct: 37 IGTKLLSVPIKNKRSDILEVIVEKLCDKGTLCCRAIGFWNPVDKCYHWYITNLS---VAA 93 Query: 299 EQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 + YRLRWQIEL FK K L+ + L + + + + A++ A ++ L Sbjct: 94 HLIYPLYRLRWQIELIFKACKQSLNANRLTSNNKHIIENLLLASIAAQLASHTVLDIVLP 153 Query: 359 FPPRSAG 365 + Sbjct: 154 QLTKMKQ 160 >UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID=Q877V8_PSEPK Length = 433 Score = 85.3 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 61/369 (16%), Positives = 120/369 (32%), Gaps = 63/369 (17%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT 71 L PE +D RE+ + + + L SL A A+ D Sbjct: 6 LEQAIAPEWVDQVFEEHRQRQYSRELLFSTIIKLMSLVSLGLKPSLH---AAARQLDDLP 62 Query: 72 LSDVALLKRL-RNAADWFGILAAQTLAVRAA------VTGCTSGKRLRLVDGTAISAPGG 124 +S AL ++ R L A + ++R+VDG+ +++ Sbjct: 63 VSLAALYDKISRTEPALLRALVTGCAQRLAPTIHELGCSAMLPDWQVRVVDGSHLASTEK 122 Query: 125 GSAEWRL----------HMGYDPHTCQFTDFEL-TDSRDAERLDRFAQ----TADEIRIA 169 R + YDP Q D + D+ +ER+ +++ IA Sbjct: 123 RLGALRQERGAARPGFSVVVYDPDLDQVIDLQPCEDAYASERVCVLPLLAEAKTNQVWIA 182 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 DR + + P + + +++R + R + + + Sbjct: 183 DRLYCTLPVM-EACEQVKTSFVIRQQAKHPRLIQEG-----------------EWQAPMP 224 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + ++ + RR + + ++ + Sbjct: 225 VATGTVREQSIEV-------------------KGGHRWRRVELTLHSPNDSGDNSLMFWS 265 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 +LP + SA+Q+AD YR RW IE F+RL+++L + P A +LA ++ Sbjct: 266 NLP-ESISAQQIADFYRRRWSIEGMFQRLEAILESEIETLGSPRAALLGFTTAVLAYNVL 324 Query: 350 DDIIQPSLD 358 + + Sbjct: 325 ALLKRSVEQ 333 >UniRef50_B5WFI6 Putative uncharacterized protein n=1 Tax=Burkholderia sp. H160 RepID=B5WFI6_9BURK Length = 256 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 47/140 (33%), Gaps = 6/140 (4%) Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 ++ +G A + + + P+ K R ++ Sbjct: 4 PNRHFLIPAKTNTCWEVISGTADDA-TVRMRVSPQA---RKKCPALSEFWNARAIRTIDA 59 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLD--ALRAKEPELAKA 337 VLL + + + CY RWQ+E +++ LK + LR++ E Sbjct: 60 RGRERVLLTSLEDRRRFKPADIVACYERRWQLETSYRGLKQSMLGSELTLRSRTVEGVYQ 119 Query: 338 WIFANLLAAFLIDDIIQPSL 357 I L+A+ LI I + Sbjct: 120 EISGALIASNLIRREIANAT 139 >UniRef50_D2CY12 Transposase n=12 Tax=Mycoplasma RepID=D2CY12_MYCSY Length = 552 Score = 84.5 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 51/404 (12%), Positives = 111/404 (27%), Gaps = 66/404 (16%) Query: 3 YSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLRE--- 59 Y + + I+ ++ P L + R + A+ L L S R Sbjct: 93 YGNLVYEEIMNYLELPSFLTDLQKK----NSRSKYDLASITKILILTRILEPSSKRSSVE 148 Query: 60 --VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGT 117 W + +D +L + L Q + D T Sbjct: 149 KIKKYWYEFNDSLKDVYRSLEFLQDKKVEILSYLNEQFVEKINR------NLTFCFYDVT 202 Query: 118 AISAPGGGSAEWR----------------LHMGYDPHTCQFTDFELTDSRDAERLDRFAQ 161 + E+R L + D ++L ++ L Sbjct: 203 TVYFESFLPDEFRKFGFSKDNKVNQTQVVLGLLIDDM-GIPIYYDLFPGNTSDFLTLKPV 261 Query: 162 TADE---------IRIADRGFGSRPECIRSLAFGEADYIVRVHWRG-------------- 198 + +ADRG S+ + ++ DYI+ +G Sbjct: 262 LENIKKDLGIEKITIVADRGLNSKSNLL-AIKEAGYDYIMAYKIKGKENKIEGIYDLETY 320 Query: 199 -LRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIG-NSGNKKAGAPFPARLIAVSLPPEKA 256 + + + D F + + + +++ + ++ RLI + Sbjct: 321 KMIYEEFGVKKQDHKEFFKSNNTFYEIDNKLILTFSRKRQRKDKKDRERLIKKAEKLLNL 380 Query: 257 LISKTRLLSENRRKGRV--------VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 K+ + ++ + QA + A ++ +++ Y Sbjct: 381 SAIKSEMKKGGKKYLKFADNEVELDHQAILKDEAADCFYGILTSHEDMDEKEIIKQYSKL 440 Query: 309 WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 W+IE +F+ +K+ L + K LA + + Sbjct: 441 WKIEESFRVMKTNFELRPIFLSRENTIKGHFLICFLALTIQRYL 484 >UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID=Q4V248_BACCZ Length = 140 Score = 84.1 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 38/91 (41%) Query: 238 AGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYS 297 R+I L ++ + + +KG ++ G + +T+ P + Sbjct: 47 KDQKLFTRVIIYRLTEKQIQERRKKQNYTESKKGITYSEKSKRLTGINIYVTNTPWEIVP 106 Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 EQ+ D Y LRWQIE+ FK KSL + Sbjct: 107 MEQIHDFYSLRWQIEITFKTWKSLFQIHHWH 137 >UniRef50_A9F243 Transposase, IS4 family n=4 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F243_SORC5 Length = 461 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 67/342 (19%), Positives = 110/342 (32%), Gaps = 40/342 (11%) Query: 55 MSLREVTAWAQLHDVATLSDVALLKR------LRNAADWFGILAAQTLAVRAAVTGCTSG 108 +S E+ A + ++ A+L R A+ + T + G G Sbjct: 49 LSDSELEAAYRFFGNDAVTPAAILAPHVRATLARMEAEPVVLAIHDTTTLSFRSDGQRQG 108 Query: 109 K-RLRLVDGTAI---SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRD--AERLDRFAQ- 161 RLR T + G R D T D + D E+++R A Sbjct: 109 LGRLRSSGQTFFAHFTLAVSGDGTRRPLGVLDLSTHVRDDGTTDNEHDRWGEQVERVAVL 168 Query: 162 ---TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 D + + DR L +++R+ L A G + L + Sbjct: 169 GAAPHDVVHVMDREGDDY-GLFAQLLSAGHRFVIRLAHNRLVEADALGAEAKLEQALAHV 227 Query: 219 DCGKNGETTVMIGNSGN-----KKAGAPFPARLIAVSLPPEKALISKTRLLSE---NRRK 270 E + +GN K+ P RL ++L + + + R Sbjct: 228 QAVAVREVELSPRPAGNRSPQQKRLHPPRAGRLAKLALGSTRVTLRRPRSQPRELPATLS 287 Query: 271 GRVVQAETLEAA------GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL-- 322 RVV+ +E VLL + E Q+ D YR RW +E FK LK+ Sbjct: 288 LRVVRVWEIEPPPGEAPVEWVLLTSEPVESVEQLTQLVDWYRARWMVEELFKALKTGCAY 347 Query: 323 ---HLDALRA-KEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 ++ L + A I LL L+ + + + P Sbjct: 348 EKRQIEDLHGLRNVLALFAPIAWQLL---LLRSEARRAPEQP 386 >UniRef50_A1ZPG0 Transposase of, putative n=3 Tax=Microscilla marina ATCC 23134 RepID=A1ZPG0_9SPHI Length = 395 Score = 82.6 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 53/362 (14%), Positives = 110/362 (30%), Gaps = 44/362 (12%) Query: 25 ARNAGALTRRREIRDAATLLRLGLAYGPG-GMSLREVTAWAQLHDVATLSDV-------- 75 +++ ++ + L + +SLRE+++ + S Sbjct: 22 SQSTDVDKWVSKLPGKLFIKLLLYSVLNNERLSLREISSEMSNPIFQSFSSEMVEQMAGW 81 Query: 76 -ALLKRLRN-AADWFGILAAQTLAVRAAVTGCTS--GKRLRLVDGTAISAPGGGSAEWRL 131 + +RLR+ + + A A+ G ++ D T I G ++ Sbjct: 82 TGIRERLRHIKLPFIEQVYEHFFAEAHALYGEKKLLDYHIKRYDSTLIKVFGHLLQGMKV 141 Query: 132 HMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYI 191 + +LT R + D+ S ++ Sbjct: 142 G----NTSKNKFQVKLTTEHTDGFGLRVSFHQ------DQAHLSEETALQEQINLGKH-- 189 Query: 192 VRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSL 251 + + R + + P +L+ Sbjct: 190 ---SSQDIIVFDNGLKGR------RKFKDFDEASIQFVTNIGKKPRYQVNRPHQLLDRHH 240 Query: 252 PP-----EKALISKTRLLSENRRK--GRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADC 304 P + + R N + R+++ E H+ +L++L + AE VA Sbjct: 241 PDLDFIQDSVVQLFERGQPTNSMEHEFRLIEFRVKETGKHLFILSNL--WDLPAEVVAQV 298 Query: 305 YRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP-SLDFPPRS 363 Y +RW IE+ F+ LK ++L + K I+ L+AA +I Q ++ R+ Sbjct: 299 YLMRWDIEVIFRFLKQEMNLTHFVCNDLNAIKVMIYVKLIAAMMILIFKQKNAIKTYKRT 358 Query: 364 AG 365 Sbjct: 359 KK 360 >UniRef50_Q4A8Q4 ISMHp1 transposase n=19 Tax=Mycoplasma RepID=Q4A8Q4_MYCH7 Length = 552 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 44/336 (13%), Positives = 89/336 (26%), Gaps = 62/336 (18%) Query: 81 LRNAADWFGILAAQTLAVRAAVTGCTSGKR--LRLVDGTAISAPGGGSAEWRL------- 131 D Q L + GKR D + + R+ Sbjct: 157 FYRLLDLVYESQNQLLDSLNKMVISELGKRDNEFYFDSSTVYFETFERNGLRIPGYSKDA 216 Query: 132 -----HMGYDP---HTCQFTDFELTDSRDAERLDRFAQT---------ADEIRIADRGFG 174 + ++ A+ + IADRG Sbjct: 217 KFKEDQIVIALACDKNGIPFHIKVFKGNTADSSTLIPFVLDIESKYNIKNMTIIADRG-M 275 Query: 175 SRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSG 234 S IR L E ++I+ + + D ++ K + + Sbjct: 276 STAANIRFLESKEYNFIISYRAKIGSQKFKNYL-LDPSDYVDLNTDFK-YKKEEFYSSYK 333 Query: 235 NKKAGAPFPARLIAV---------SLPPEKALISKTRLLS-----------ENRRKGRVV 274 NK+ R+I E+ + + + R + Sbjct: 334 NKRYTENIRRRIITYSKKRAIKDSKAREEQIQSFIKKQNKDGFIEVNKLFGKKPKYFREI 393 Query: 275 QA-----------ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH 323 + + G+ + T++ + + + Y+ +W IE F+ LK LL+ Sbjct: 394 SNMKFELDQSKIDKDKQFDGYYVYETNILN--LNVLDIVEKYQKQWNIEANFRSLKGLLN 451 Query: 324 LDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 + + + E A F ++ ++ II Sbjct: 452 IRPVFLRIDEHILAHTFLCFISLVILKTIIFKINKH 487 >UniRef50_B2AJ60 Transposase, IS4 family n=4 Tax=Proteobacteria RepID=B2AJ60_CUPTR Length = 412 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 59/345 (17%), Positives = 106/345 (30%), Gaps = 50/345 (14%) Query: 19 EELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LREVTAWAQLHDVAT--LSDV 75 +L A TR+R++ + + G + L + A ++ +S+ Sbjct: 20 HDLARHPERPSAFTRQRKLTLPTLIAFMLGNLRMGVQAELDQFFAALARQNILRRCVSEQ 79 Query: 76 AL-LKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGT----AISAPGGGSAEWR 130 A R + + D F L L + G RL D + AI A R Sbjct: 80 AFAQARSKLSGDVFAHLNDWLLRQVSDHLPRWHGFRLVAADASHLRFAIRHSHLPRAATR 139 Query: 131 LHMGYDP---HTCQFTDFELTDSRDAERLDRFA----QTADEIRIADRGFGSRPECIRSL 183 + + L + ER F +D++ + DRG+ +R + L Sbjct: 140 DQLAFGLYLPGAEIMLAASLHSVHENERQILFEHLDRLQSDDLLLLDRGYPARW-LVAVL 198 Query: 184 AFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFP 243 + + +R +R + L + Sbjct: 199 NQRKIPFCMRA-DGSGFAAVRHFVRSGRDEAIVTLPAPARND------------------ 239 Query: 244 ARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVAD 303 ++ + + R+++ + VL+ L Y A D Sbjct: 240 ---------------ARDYECAATPQTVRLIRQISPAGKVRVLITNLLDMHHYPAATFRD 284 Query: 304 CYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL 348 Y RW++E AFKRLK + L+ L A+ A +L L Sbjct: 285 LYHQRWRLEEAFKRLKHRMALEHLSGLSQLAARQDFGAKILCDNL 329 >UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID=A3D336_SHEB5 Length = 460 Score = 81.4 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 51/188 (27%), Gaps = 18/188 (9%) Query: 153 AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMM 212 E + + D GF R R + ++ RV + G +F + Sbjct: 145 NELRKVLPDNITPLIVTDAGF--RNPWFRKVEQLGWYWLGRVRGLSVYRPHPFGRQFSLK 202 Query: 213 GFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR 272 V + P ++ P + + Sbjct: 203 ALYPQARRRAKHVGRVALSVK------KPLLCEMVLFRAPSKG-----RKGQRSTTTDCH 251 Query: 273 VVQAETLEAAGHVLL--LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKS---LLHLDAL 327 T E +T+L S +++ + Y+ R Q+E F+ LKS L Sbjct: 252 HTAQWTYELTAKEPWALVTNLTMKAMSPQKLVNIYQKRMQMEETFRDLKSPAYGFGLRHS 311 Query: 328 RAKEPELA 335 R + Sbjct: 312 RTRYAARM 319 >UniRef50_Q46310 Transposase n=1 Tax=Carnobacterium maltaromaticum RepID=Q46310_CARML Length = 152 Score = 81.0 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 10/136 (7%) Query: 3 YSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY--GPGGMSLREV 60 ++ ++ P + +R + + R+R LL L + G SL ++ Sbjct: 2 TQFKKFAEHISSCFSPSAIQEFSRKSKIVKRKRMF-TIDHLLWLCVWQEKNMGDSSLIDM 60 Query: 61 TAWAQLHDVATLSDVALLKRLRNAADWF-GILAAQTLAVRAAVTGC------TSGKRLRL 113 A +S L +R + F +L L + T R+R+ Sbjct: 61 CASLWQQFGIKISPEGLNQRFNEKSTAFLKLLFHSILEKQTPDLAAIQHAYSTHFNRIRI 120 Query: 114 VDGTAISAPGGGSAEW 129 +D T+ P S ++ Sbjct: 121 LDSTSFQLPNTFSDKY 136 >UniRef50_A0LAZ1 Transposase, IS4 family protein n=4 Tax=Magnetococcus sp. MC-1 RepID=A0LAZ1_MAGSM Length = 563 Score = 81.0 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 74/277 (26%), Gaps = 47/277 (16%) Query: 130 RLHMGYDPH---TCQFTDFELTDSRDAE---------RLDRFAQTADEIRIADRGFGSRP 177 +L + + E+ + +L I + DRG ++ Sbjct: 198 KLQIEIGLLCDKEGRPLAVEVFPGHTGDPTTLTAQVNKLKARFGLKRVIVVGDRGMITQA 257 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG-----------------------F 214 + +I + +R L + F Sbjct: 258 RIDEDIQPAGFSWITALRHSTIRSLAKADNWPSLFDARNFAEITSPDFPGERLMVCRNPF 317 Query: 215 LRGLDCGKNGETTVMIGNSGNKKAGAPFPARL-----IAVSLPPEKALISKTRLLSENRR 269 L E + A L I + R + Sbjct: 318 LAEDQGRSRRELLAIAEQGLETILQAVQEGSLRDCGQIGKRADRVLRKLKIARYFNLEIA 377 Query: 270 KGRVVQAETLE-------AAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLL 322 GR + + G ++ T+LPE E S+ Y+ Q+E AF+ LKS L Sbjct: 378 PGRFIWSRKQAVIDQETALDGIYVVRTNLPEKEISSADTIRQYKSLAQVESAFRDLKSSL 437 Query: 323 HLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDF 359 + + + KA +F +LA + ++ Sbjct: 438 DIRPIFHFRADRIKAHVFLCMLAYMVEREMRIKLKTL 474 >UniRef50_A6DG92 ISPg4, transposase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG92_9BACT Length = 189 Score = 80.6 bits (197), Expect = 8e-14, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 39/106 (36%) Query: 251 LPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQ 310 + + ++ + R+V A +L +++ +A Y+ RW Sbjct: 23 ISDQLIKLNGVNTEKHYPKILRLVAANVEIDGKMKVLKFLSNNLQWAPSSIASIYQSRWG 82 Query: 311 IELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 IE+ FK+LK L L + ++ LL L+ + S Sbjct: 83 IEVFFKQLKQNLKLADFLGHNKNAIQWQVWTALLTYVLLRFLAFRS 128 >UniRef50_A8YU85 Transposase n=21 Tax=Lactobacillus RepID=A8YU85_LACH4 Length = 194 Score = 80.6 bits (197), Expect = 8e-14, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 43/120 (35%), Gaps = 3/120 (2%) Query: 242 FPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLE---AAGHVLLLTSLPEDEYSA 298 + ++ + + K RV + +L+T+L +E+ Sbjct: 3 LHKKHHYKAVRSKNTQDCRWDFEDLCNVKFRVCKFRINPPGSDDEWEVLITNLDRNEFPL 62 Query: 299 EQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 ++ + Y L W IE +F+ LK L +K+ + I+A+ + + S Sbjct: 63 ARMKEIYHLSWGIETSFRELKYDLSGIQFHSKKDQFVYMEIYAHFAMYNAVSLSVATSSK 122 >UniRef50_Q1Q2K2 Putative uncharacterized protein n=5 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q2K2_9BACT Length = 457 Score = 80.6 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 56/188 (29%), Gaps = 14/188 (7%) Query: 173 FGSRPECIRSLAFGEADYIVRVHWRGL---RWLTAEGMRFDMMGFLRGLDCGKNGETTVM 229 + + + + + + + + L L Sbjct: 226 WYASQRFLEHIHAKKKHFFSEIKSNRNISMYHPEKQKYCIIKPDELVTLIKKHYAGKIKY 285 Query: 230 IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLT 289 + + + L + +L + ++ + + +L+T Sbjct: 286 VTLKSADGSEVSYKTYTFDAKLNGCNVPLKFVVILGKWNKE---------DDKKYHVLIT 336 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 + + + S + V Y LRW IE FK LK + D + + + + + L++ + Sbjct: 337 N--QLDASVKTVITNYLLRWGIEHCFKELKDTFYFDHYQVRHIDKIERYWNICLISWTFV 394 Query: 350 DDIIQPSL 357 I Q + Sbjct: 395 YWIKQNAY 402 >UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2NZH2_XANOM Length = 407 Score = 80.6 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 79/227 (34%), Gaps = 16/227 (7%) Query: 105 CTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELT------DSRDAERLDR 158 + + ++D + + P R + T D ++ + L + Sbjct: 89 LRGDQPVIVIDWSDLK-PDKSWCLLRAAVPVGGRTLTLLDMVVSRKQQGSPGAEKRFLQQ 147 Query: 159 F-AQTADE---IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGF 214 A D+ I + D GF R R+++ D++ R+ R + + + D + + Sbjct: 148 LRALIPDDVRPILVTDAGF--RTPWFRAVSAMGWDWVGRLRGRTQ--VKPQDVPDDAVQW 203 Query: 215 LRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV 274 + P RL+ + + R ++ R + Sbjct: 204 IDSRRLHALASNRARALPPMQANRSDPLDCRLVLYAKTRQGRQQRNRRSSAKVSRASSSL 263 Query: 275 QAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL 321 +A E L++ S SA+Q+ + Y R QIELAF+ LKS Sbjct: 264 KAAAREREPW-LIVASPQLHAPSAKQLVNLYARRMQIELAFRDLKSH 309 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 80.6 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 46/257 (17%), Positives = 71/257 (27%), Gaps = 25/257 (9%) Query: 102 VTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQ 161 T+G+R+ VDG + G + L D HT D + E Sbjct: 128 QPAATTGRRVYSVDGKTLRGSGPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPL 187 Query: 162 -----TADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR 216 + AD R + +A Y+ V R + L Sbjct: 188 LGPLDLTAVVVTADALHTQREHARWLVDTKKAAYVFTVKKNQPRLYRQ-------LKTLP 240 Query: 217 GLDCGKNGETTVM-IGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 ET+ G ++ A +A+ P + R N GR Sbjct: 241 WTKIPIQDETSTRGHGRYDIRRLQAVTCTGPLALDFP-HAVQALRIRRRRLNLATGRWST 299 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELA--FKRLKSLLHLDALRAKEPE 333 V +T+L + ++AD R W IE + LR Sbjct: 300 VT-------VYAITNLSAAQAGPAELADWLRGHWAIETLHHIRDTTYAEDASRLRTGNAP 352 Query: 334 LAKAWIFANLLAAFLID 350 A A + A L+ Sbjct: 353 RAMATL--RNTAINLLR 367 >UniRef50_C8W0R5 Transposase-like protein n=12 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W0R5_DESAS Length = 604 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 67/174 (38%), Gaps = 1/174 (0%) Query: 194 VHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPP 253 ++ + + L L+ GK + ++ + K F + L Sbjct: 361 LNKKPKEQERRQKRITSTEDALVELN-GKLNKRNLITKEACEKVVDNIFKGQPDMRRLFN 419 Query: 254 EKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIEL 313 +++ + + K + E + G +LLT+ +++ A ++ YR R IE+ Sbjct: 420 VTIKLNQHNAIVMSWSKDEAIIPELEKTDGIFVLLTNHDKEKVDANELLTRYRGRNDIEI 479 Query: 314 AFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSE 367 +F+ LK L L + + PE A+ F +LA +++ + + + Sbjct: 480 SFRFLKGSLDLQQIFLRNPERVDAYCFLKVLAMLVLNLAAWLLAKNGKKMSPQK 533 >UniRef50_B9YUA6 Transposase, IS4 family protein n=3 Tax='Nostoc azollae' 0708 RepID=B9YUA6_ANAAZ Length = 256 Score = 79.9 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 55/201 (27%), Gaps = 40/201 (19%) Query: 160 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLD 219 + + + DRG S + + + + + RV L + ++ Sbjct: 22 SVGEGMLLMWDRGLHSF-KMVHAAIKQKCHILGRVPANVKFELVKTLGNGSYLSWVAPD- 79 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 + KK P +I V + Sbjct: 80 ------------SKSKKKGAKRIPICVIEY------------------------VIEDNG 103 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA--LRAKEPELAKA 337 + L + + A +A Y RW+ E LK L +R+K P Sbjct: 104 SDKVYRLTTDLMDISTFPALILAQEYHTRWEAENTLDELKVHLLGRKTLIRSKSPREVVQ 163 Query: 338 WIFANLLAAFLIDDIIQPSLD 358 I+ LL F I ++ S Sbjct: 164 EIYGWLLGHFCIRCLMFQSAS 184 >UniRef50_B2J2I5 Transposase, IS4 family protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J2I5_NOSP7 Length = 439 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 51/378 (13%), Positives = 107/378 (28%), Gaps = 74/378 (19%) Query: 6 DNWSAILAHI-GKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS--LREVTA 62 ++L+ + KP +L + R + L+ + G+ R + Sbjct: 30 QKIYSLLSPLNFKPLKLYETEEKKPFRDRILTLSVMMALVVSLVYRQIPGLREVQRVLCE 89 Query: 63 WAQL-HDVATLSDVALLKRLRNAA-DWFGILAAQTLAVRAAVTGCT-----------SGK 109 L +S A+ KRLR + F + Q + Sbjct: 90 EGLLWAGRIEVSAQAVSKRLRTLPIELFAQIFEQVMERMNVQPQNQAVPENWQPVCAKFT 149 Query: 110 RLRLVDGTAISAPGGGSAEW---------RLHMGYDPHTCQFTDF-ELTDSRDAERL--- 156 + + DG+ + A ++ M + + +S+ ++ Sbjct: 150 AIWIADGSTLEALRRKLKVLQEQEKTLAGKIMMVVEAFSHHPVTTWYTQNSKANDKTWCE 209 Query: 157 -DRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 + I D GF P + + ++ R+ + + + Sbjct: 210 QLLERLPIGGLLIFDLGFFKFP-WFDAFTEADKFFLTRLREKTSYKV------------I 256 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 R L ++ + + + R+V Sbjct: 257 RCLTNASFYRNEII----------------------------SMGEYRSNPCQHQVRLVS 288 Query: 276 AETLEAAGHVLLLTS-LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPEL 334 L + LT+ L SA V + RW++E AF K LL L + + Sbjct: 289 V--LWGSTWYYYLTNVLDPQMLSAHLVCELGSRRWRVEDAFLLTKRLLGLAYIWVGDNNG 346 Query: 335 AKAWIFANLLAAFLIDDI 352 + IFA + +++ + Sbjct: 347 VQIQIFATWIFYAVLNQL 364 >UniRef50_B3JEV4 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JEV4_9BACE Length = 169 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 53/186 (28%), Gaps = 39/186 (20%) Query: 131 LHMGYDPHTCQFTDFELTDSRDAE-RLDR-FAQTADEIRIADRGFGSRPECIRSLAFGEA 188 +H D + +T+ + + R D + +ADRG+ + Sbjct: 1 MHTLLDYDSLLPEFVNITEGKCGDNRGDLDIPVPPHSVVVADRGYCDF-SLLDYWDSRNV 59 Query: 189 DYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIA 248 ++VR L E + + E + G KK P Sbjct: 60 FFVVRHRDNLLYSQIEERLLPE-----TRAQNVLIDEIIELTGEQTKKKYTRPL------ 108 Query: 249 VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLR 308 R + E V LLT+ + +A +A Y+ R Sbjct: 109 -----------------------RRIAVWNDEHGYVVQLLTN--NFKLTASTIAQLYKAR 143 Query: 309 WQIELA 314 W IE+ Sbjct: 144 WMIEIL 149 >UniRef50_D0DW10 Transposase IS4 family protein n=5 Tax=Lactobacillus RepID=D0DW10_LACFE Length = 452 Score = 79.1 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 56/167 (33%), Gaps = 10/167 (5%) Query: 200 RWLTAEGMRFDMMGFLRGLDCGKNGETT----VMIGNSGNKKAGAPFPARLIAVSLPPEK 255 + M L L G V G + + RL A P++ Sbjct: 220 VLFDSWYSSPKMFYELTKLGLNGVGMLKRSSKVYYQYRGRQYSVKALYKRLQASKYQPKQ 279 Query: 256 ALISKTRLLSEN---RRKGRVVQAET-LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQI 311 A + + + K R+V +++L T + +++ Y RWQI Sbjct: 280 AYQYSCFVEAHVGNQKFKLRLVFVANRARQDDYLVLAT--TQLSLQPQEIIQLYARRWQI 337 Query: 312 ELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 E FK K L LD + + + + ++A L+ + + D Sbjct: 338 ENYFKVAKQYLRLDKSQVQSYDGLCGHLAIVMIAYNLLAWQERQNED 384 >UniRef50_UPI00003C8608 transposase IS4 family protein n=4 Tax=Ferroplasma acidarmanus fer1 RepID=UPI00003C8608 Length = 349 Score = 78.7 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 43/366 (11%), Positives = 94/366 (25%), Gaps = 89/366 (24%) Query: 22 DTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRL 81 ++ G + R+ +A L L G + ++T A + +S L K Sbjct: 19 ESKVEGIGIDSYSRKFNLSAHLSIL----ATGIIKKNDLTDIAYNNG---ISKSQLSKLN 71 Query: 82 RNAA-DWFGILAAQTLAV--RAAVTGCTSGK-----RLRLVDGTAISA---------PGG 124 F + L +A + +D T I G Sbjct: 72 NKRPYSIFEKVFYSILRPFIKAHRYDIYHDYIDRLYSVLAIDSTFIETMVKGSGIYQRGE 131 Query: 125 GSAEWRLHMG-YDPHTCQFTDFELTDSRDAERLDRFAQTA--------DEIRIADRGFGS 175 ++H +T + + + + D G+ + Sbjct: 132 RRNGIKIHTAAIASPYPLPLKAIITPANVHDSKVFDDLLEYINEYISGNTVLTFDLGYYN 191 Query: 176 RPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGN 235 L +++ R+ + E Sbjct: 192 LGR-FMELKEKGINFVSRIKKNADYTVIKE------------------------------ 220 Query: 236 KKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDE 295 + + N + R+V + + + Sbjct: 221 --------------------ETFNSKIVRFRNGLELRLVSLDINNRKREYI----TDILD 256 Query: 296 YSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 + Y RW IE F+ +K +L + L +++ +FA L++ ++ I+Q Sbjct: 257 LPEIYIYYIYMRRWVIEKIFENMKRILKITHLISRDLNGIINQVFATLISYIVL-LILQS 315 Query: 356 SLDFPP 361 S++ Sbjct: 316 SMNIYH 321 >UniRef50_Q7UY96 Similar to transposase n=1 Tax=Rhodopirellula baltica RepID=Q7UY96_RHOBA Length = 403 Score = 78.7 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 77/204 (37%), Gaps = 10/204 (4%) Query: 154 ERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMG 213 + + ++ + + DR S +RS ++VR R +R + ++ Sbjct: 133 DEVAQWELPRRVVHVIDREADSLGR-LRSWHAKGHLFLVRCDDRRVRCEGRSVLLSELND 191 Query: 214 FLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRV 273 L + + G K + P + + + + ++ + R Sbjct: 192 ELDSQCEYADAGKALYHG---KKVQRQVAEKTVTLYR-PHSEVIDGEKKAVTGEPIEVRT 247 Query: 274 VQAETLEAAGHVL----LLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLH-LDALR 328 V ++A G +L LLT++P D+ +A V Y RW+IE FK LKS L+ + Sbjct: 248 VFVRLVDADGWILAEWTLLTNVPADQANASDVGRWYYFRWRIESFFKLLKSHGQELEYWQ 307 Query: 329 AKEPELAKAWIFANLLAAFLIDDI 352 + E + +A L+ + Sbjct: 308 QESGEAITKRLLMASMACVLVKQL 331 >UniRef50_A9DNS7 Transposase n=1 Tax=Shewanella benthica KT99 RepID=A9DNS7_9GAMM Length = 190 Score = 78.3 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 28/97 (28%), Positives = 52/97 (53%) Query: 272 RVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKE 331 R++ + L+T+L ++SA++V++ Y LRWQIEL FK LKS L ++ Sbjct: 34 RLIAFWDRNKSAIGYLITNLKRAQFSADKVSELYGLRWQIELFFKELKSYSGLKTFNTRD 93 Query: 332 PELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEK 368 +A++ ++A++L L I + S + ++K Sbjct: 94 KSIAESLVWASMLTLLLKRFIARASGLIHQVTISTQK 130 >UniRef50_A4XK23 Transposase, IS4 family protein n=8 Tax=Clostridia RepID=A4XK23_CALS8 Length = 567 Score = 77.9 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 53/415 (12%), Positives = 104/415 (25%), Gaps = 69/415 (16%) Query: 5 HDNWSAI-LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LREVTA 62 NW I + + E+D + R+ + + + MS LR Sbjct: 81 IKNWGYIVFRKLWEELEIDKFLKERATKGRKIKFDVDKVSFLMTIQRLIEPMSKLRTYHQ 140 Query: 63 WAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP 122 + D+ L R + D L + + D T I Sbjct: 141 RNKYFGFEEDIDLNQLYRCLDFLDSIKEDLETYLYQKNRDL-FKMVVDVVFYDVTTIYFE 199 Query: 123 GGGSAEWR------------LHMGYDPH---TCQFTDFELTDSRDAERLDRFAQTAD--- 164 + E + + + + +EL + Sbjct: 200 SCRADELKNFGFSKDNKINEVQVVLGLLVDKEGRPIGYELFPGNTIDSKTMVKILRKLKE 259 Query: 165 ------EIRIADRGFGSRPECIRSLAFGEADYI--VRVHWRGLRWLTAEGMR--FDMMGF 214 + +AD+G SR ++ + DYI R+ L + + + Sbjct: 260 KFSIDKIVIVADKGLNSR-LNLKMIKEAGYDYIVASRLKNASKEVLDEVFEQEGYKRLDG 318 Query: 215 LRGLDCGKNG--ETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLS------- 265 L+ + E + N I L + + Sbjct: 319 KSCLNAEEIYGDEFKYKVLERTNVIKDEEGKEFKIEERLIITYSSKRAKKDKEDRERLVS 378 Query: 266 --------------ENRRKGRVVQAETLEAAGHVL--------------LLTSLPEDEYS 297 ++ R + ++ +VL + + Sbjct: 379 KAKELLENKGSITALEKKGARKYLKKKSKSEEYVLDEEAIKRDEKFDGYYAIQTSKKDMD 438 Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDI 352 E+V Y W+IE +F+ +KS L + + K LA L + Sbjct: 439 VEEVLGAYHDLWKIEQSFRVMKSCLEVRPIYHFTESRIKGHFVICFLAFLLQRTL 493 >UniRef50_A5WBL3 Transposase, IS4 family n=2 Tax=Bacteria RepID=A5WBL3_PSYWF Length = 427 Score = 77.9 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 49/247 (19%), Positives = 79/247 (31%), Gaps = 17/247 (6%) Query: 127 AEWRLHMGY-DPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAF 185 +H Y D Q ELT E L++ I I DR S + R Sbjct: 134 TANGVHSSYQDTPAKQTHLNELT--TRIEYLEQQGFDKPLIHIIDREADSAYQM-RQWDE 190 Query: 186 GEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPAR 245 + +I RV + R + + + V K+ A Sbjct: 191 HDYKFITRVKAGSYLSYEGKSQRCSQIA----GQLNFSYQRQVNYKGKAAKQYIATAKVV 246 Query: 246 LIAVSLPPEKALISKTRLLSENRRKG-------RVVQAETLEAAGHVLLLTSLPEDEYSA 298 L + P + R+ + R+ + A LL++L E + Sbjct: 247 LTRSAKPQAIDPATGKRIAPIKGKPLSLLLTVSRIYDDQDKRLATWY-LLSNLQEPSVNG 305 Query: 299 EQVADCYRLRWQIELAFKRLKSL-LHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 ++ Y RWQIE FK LKS L L++ + + + A L+ I+Q + Sbjct: 306 ADISQWYYWRWQIESYFKLLKSAGLQLESWLQQSGDAYFKRLLIASQACTLVWRIMQKTD 365 Query: 358 DFPPRSA 364 A Sbjct: 366 KQSKEFA 372 >UniRef50_UPI00016C560B transposase IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C560B Length = 280 Score = 77.6 bits (189), Expect = 6e-13, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 58/201 (28%), Gaps = 36/201 (17%) Query: 164 DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKN 223 D + + DRGF S ++ + A + R+ + Sbjct: 28 DMLLLWDRGFLSYD-LVQQVRQRCAHLLARIKSNLVFRPLHRLPDGS------------- 73 Query: 224 GETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAG 283 + + R+I +L Sbjct: 74 YRAKLYPSPRHRHRDEGGVMVRIIEYALNDPG---------------------RVGSGQK 112 Query: 284 HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA-LRAKEPELAKAWIFAN 342 H LL T + + A QV Y RW+ EL LK+ LR++ P I Sbjct: 113 HRLLNTLVDARRHPAPQVIVQYHERWEEELTIDELKTHQRERPVLRSETPGGVVQEIQGL 172 Query: 343 LLAAFLIDDIIQPSLDFPPRS 363 +LA +++ ++ + RS Sbjct: 173 VLAHYVVRVLMCEAAKQNKRS 193 >UniRef50_UPI00016C3BAD transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3BAD Length = 258 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 57/204 (27%), Gaps = 33/204 (16%) Query: 37 IRDAATLLRLGLAYGPGGMSLRE-----VTAWAQLHDVATLSDVALLKRLRNAADWFGIL 91 A TL S + LH + + R + L Sbjct: 54 WTPARTLWTFLTQCLSTSTSCAAAAAVALRVTLGLHPCSEATGAYCKARAKLPVALLSRL 113 Query: 92 AAQT---LAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAE-----------------WRL 131 A Q+ L A G+R+ L DGT +S P + + R+ Sbjct: 114 ATQSGDELERHAPKEWQWKGRRVLLGDGTTLSGPDTPANQAAYPQHTNQKRGLGFPLIRV 173 Query: 132 HMGYDPHTCQFTDFELTDSRDAERLDRFAQT-------ADEIRIADRGFGSRPECIRSLA 184 + T + ++ E + A ++ +ADR + S + +L Sbjct: 174 VVLLGFATGALVGAAIGPAKGKEAGEMALLRELLDRFQAGDVFVADRAYCSYW-LVSALQ 232 Query: 185 FGEADYIVRVHWRGLRWLTAEGMR 208 D +R+H A Sbjct: 233 ARGVDVAIRLHQSRHYDFGAGPPP 256 >UniRef50_D2ASB5 Transposase, IS4 family n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2ASB5_STRRD Length = 356 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 53/300 (17%), Positives = 95/300 (31%), Gaps = 42/300 (14%) Query: 70 ATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTS---GKRLRLVDGTAISAPGGGS 126 + S ++ R R A+ +L + A S G R +DGT + P Sbjct: 89 PSASAIS-RARARLGAEPLRVLFCRVTGPVAEPQASRSWLAGLRPVTMDGTTLVVPETRD 147 Query: 127 AEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFG 186 GY +F + + + D FGS R+LA Sbjct: 148 NSA---FGYPDGAARFPCVRVVAVAEN----------GTHALIDATFGSSAVEERTLARR 194 Query: 187 EADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARL 246 + + L + F++ T ++ G +G Sbjct: 195 ---LLRCLESDMLLLARSGRWGFEL------WRQAAETGTHLLWGVTGAD---------- 235 Query: 247 IAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYR 306 +LP ++ + L G ++ L A L+ T + + SA ++A Y Sbjct: 236 ---ALPIGRSFEDGSYLSRPAGLGGAPLRVIPLPGAEW-LITTLVDPGQASASELAARYA 291 Query: 307 LRWQIELAFKRLKSLLHLD--ALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSA 364 RW ++ A L S LR++ PE+ I+A L + ++ + S Sbjct: 292 ERWVMDSALAWLHSDRRGPAITLRSRSPEMVAQEIWALLCVYQAMRELTCQAASHEGASC 351 >UniRef50_Q7NBK2 Predicted transposase n=10 Tax=Mycoplasma RepID=Q7NBK2_MYCGA Length = 348 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 41/262 (15%), Positives = 89/262 (33%), Gaps = 38/262 (14%) Query: 133 MGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIV 192 Y D ++ E + + I +AD+G S+ +R L YIV Sbjct: 30 FHYKVLEGNIVDSKVLVKFLIEMQKIYKI-KNTIIVADKG-ISQNANLRYLEQKGYKYIV 87 Query: 193 RVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGET--TVMIGNSGNKKAGAPFPARLIAVS 250 + L + + GF++ + +V N K+ F + + S Sbjct: 88 QKRIDILGKEDKSFIVNEQ-GFVQENEYFTKSRFVQSVWAKNKNKKRYSNTFRKQFVYFS 146 Query: 251 LPPEKALISKT-------------------RLLSENRRKGRVVQAET------------L 279 + K L+ E ++K V +T Sbjct: 147 PSKQTLDKIKRQNLINKLEKKSINGELPLSALVPEYKKKYMDVDGKTVGRLNIEKIKKVA 206 Query: 280 EAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI 339 G ++ T++ ++++ + Y+ +W++E +F+ LKS + + + + E ++ + Sbjct: 207 NEDGFYMIETNITN--INSKEANEIYKRQWKVEESFRTLKSAIEVRPMYVYKDEHIQSHV 264 Query: 340 FANLLAAFLIDDIIQPSLDFPP 361 F L+ ++ I F Sbjct: 265 FLCFLSLIVLKYCIYKLKKFYK 286 >UniRef50_Q9R3J0 Transposase, putative n=10 Tax=Deinococcus radiodurans RepID=Q9R3J0_DEIRA Length = 416 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 62/195 (31%), Gaps = 21/195 (10%) Query: 167 RIADRGFGSRPECIRSLAFGEADYIVRVHWRGLR---WLTAEGMRFDMMGFLRGLDCGKN 223 +AD G ++ + ++ +I R + R G + Sbjct: 187 VVAD-GNYAKESMVETVTGHGLPFISRFPRNANLKYLYTGEHPRRRGRPKKFDGKVDFSD 245 Query: 224 GETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAG 283 + ++ + ++ + + A + ++ + +KG+V Sbjct: 246 LQRFDLVSETSTERVWTQVVWSV-------QWAREVRAVVIQQVGKKGQVTGYA------ 292 Query: 284 HVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANL 343 VL T A +V YR R++IEL F+ K L ++ + +A L Sbjct: 293 -VLFST---AVTMPAHEVIALYRSRFEIELIFRDAKQFLGGQDVQLRSQPGIEAHWNVVL 348 Query: 344 LAAFLIDDIIQPSLD 358 L L + + Sbjct: 349 LTLNLCRLEALRAAE 363 >UniRef50_C0WV66 Transposase IS4 family protein n=5 Tax=Lactobacillus RepID=C0WV66_LACFE Length = 450 Score = 76.4 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 45/390 (11%), Positives = 101/390 (25%), Gaps = 51/390 (13%) Query: 4 SHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAW 63 S+D ++ K L + + L+ + A Sbjct: 9 SNDEAQFLIRTFFKQIGLGKIIHQINF----KRHTPISPLMMIKWLMTTIFARKSLYRAQ 64 Query: 64 AQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPG 123 + + + L N +L +Q + L ++D + + P Sbjct: 65 SDANFTTRTARNFLNDGRTNWQKLTCLLTSQVIKALRPFIDSRRRLAL-IIDDSLFARPY 123 Query: 124 GGSAEWRLHMGYDPHTCQFT--------------------DFELTDSRDAERLDRFAQTA 163 AE L YD + + ++ + +L + Sbjct: 124 AKKAEL-LARVYDHNKGVYVRGYRALTLGWSDANTFLPVNFALMSSGKAENQLGPCLKND 182 Query: 164 DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGL----RWLTAEGMRFDMMGFLRGLD 219 + R R + + D + + G+ + M + L Sbjct: 183 QRTLASQR----RNQARTKMNEAAVDLVSQALQNGVPAQYVLFDSWYSSPKMFDLINQLG 238 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETL 279 G +KK + RL +V ++ + + V Sbjct: 239 LDGVGML------KRSKKVYYRYRKRLYSVKTLYQRLQTEGRKSKEGATYQYSCVVESLS 292 Query: 280 EAAGHVLLLTS-----------LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 ++ +++ + ++ Y RWQIE FK K L D + Sbjct: 293 GVELKLVFVSNRHHANNYLVLATTKTSLRPNEIIQLYGRRWQIETYFKAAKQYLRFDQTQ 352 Query: 329 AKEPELAKAWIFANLLAAFLIDDIIQPSLD 358 ++ + + ++ L+ + D Sbjct: 353 VQKYDGLCGHLAMVMMTYDLLAWQERQERD 382 >UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5E2_9GAMM Length = 397 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 61/204 (29%), Gaps = 14/204 (6%) Query: 159 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 I + D GF R + + D++ RV + D +L Sbjct: 150 LPSDCKPIIVTDAGF--RNPWFKLVLKFGWDFVGRVRHQTQYQ-----KPEDDTSWLPVK 202 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 278 T + L K K + L + V Sbjct: 203 TLYSKA--TAKPVYLFETQLAKANSLSGHFY-LFKSKPKQRKKKNLRGKTIRCSVSLKHA 259 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELA 335 A LL TSL YSA+ + Y R QIE +F+ LK+ L+L R+ E Sbjct: 260 KGATEPWLLFTSLCNINYSAQDMVKIYSQRMQIEESFRDLKNTSNGLNLRHCRSYEKGRL 319 Query: 336 KAWIFANLLAAFLIDDIIQPSLDF 359 + L+A I + + Sbjct: 320 NVALLIALIA-NFILWLAGLTAKI 342 >UniRef50_Q9X6I5 Putative uncharacterized protein n=2 Tax=Bacillus thuringiensis RepID=Q9X6I5_BACTU Length = 118 Score = 75.6 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 22/70 (31%), Positives = 33/70 (47%) Query: 298 AEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSL 357 +QV + Y LRWQIE+ FK KSL +D R + E + ++ L+A FL + Sbjct: 1 MKQVHELYSLRWQIEIVFKTWKSLFDIDHCRTVKQERIECHLYGKLIAIFLCSSTMFKMR 60 Query: 358 DFPPRSAGSE 367 + E Sbjct: 61 QLLLQKKQKE 70 >UniRef50_C3RGR4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C3RGR4_9BACE Length = 413 Score = 75.6 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 61/185 (32%), Gaps = 17/185 (9%) Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKK 237 E +R + + + D++ F+ + + +G + + Sbjct: 165 EMVRHAMKKGIRF-------DYLLVDSWFTCADLIRFITSRHLECHLIGMLKMGKTRYRT 217 Query: 238 AGAPFPARLIAVSLPPEKALISKTRLL--------SENRRKGRVVQAETLEAAGHVLLLT 289 A +I L EK++ +L RK R+ + L+ Sbjct: 218 EAGNLNAPVIIDRLKKEKSVRYSRKLNCYYAHMDAEYANRKIRIFFCKRGRKGAWNAFLS 277 Query: 290 SLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLI 349 + ++ + Y +RW IE+ F +K LL L + + A I L+ ++ Sbjct: 278 TDTRLDF--FEAYRIYSMRWAIEVCFSEMKGLLRLGKCQCRNFSSQIASISLTLMQYNIL 335 Query: 350 DDIIQ 354 I + Sbjct: 336 SHIKR 340 >UniRef50_C6I0E1 Transposase, IS4 family protein n=4 Tax=Leptospirillum ferrodiazotrophum RepID=C6I0E1_9BACT Length = 650 Score = 74.5 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 47/122 (38%), Gaps = 3/122 (2%) Query: 231 GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTS 290 + P R+ ++ R++ K + Q +V+ T Sbjct: 454 HYTVAPVYETTAPPRISRGKKKKASPSLASPRIVDLAWEKSPLRQVRKTLTGAYVIETTH 513 Query: 291 LPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID 350 E SA + Y Q+E AF+ LKS L + + + + +A +F ++LA FL+ Sbjct: 514 T---ELSASGIWSLYTTLTQVEGAFRALKSDLGVRPVFHQTADRTRAHLFVSVLAYFLLS 570 Query: 351 DI 352 I Sbjct: 571 HI 572 >UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7AA71_THEAQ Length = 393 Score = 74.5 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 67/205 (32%), Gaps = 35/205 (17%) Query: 146 ELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAE 205 E+ + +A R + +ADRGF R + + +++VRV+ R L Sbjct: 149 EVEGAIEAARERLGGVGRRLVYVADRGFDDR-KVFGQVLALGEEFVVRVYRD--RKLGEG 205 Query: 206 GMRFDMMGFLRGLDCGKNGE-------TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALI 258 G + L L CG+ E V + + L+ +P Sbjct: 206 GSLAKVASSLA-LPCGEEVELRVGGRYQRVRLHFGWREVEVEGRRLHLVVCRVP------ 258 Query: 259 SKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRL 318 LL + A QV + YR RW++E F+ L Sbjct: 259 ------------------ALGRRGEWWLLTSLPVRGREEAAQVVEAYRRRWEVERFFRLL 300 Query: 319 KSLLHLDALRAKEPELAKAWIFANL 343 K+ L L+ + + + + L Sbjct: 301 KTGLGLETFQVRGLARIRKVVAVLL 325 >UniRef50_Q6MS13 Transposase IS1634BQ n=39 Tax=Mycoplasma RepID=Q6MS13_MYCMS Length = 557 Score = 74.5 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 37/255 (14%), Positives = 69/255 (27%), Gaps = 42/255 (16%) Query: 138 HTCQFTDFELTDSRDAERLDRFAQTAD---------EIRIADRGFGSRPECIRSLAFGEA 188 +++ A+ + IAD+G S IR L Sbjct: 237 ENGIPLHYKIFPGNVADPNTLIPFMLEIADIYEVNSVTIIADKG-MSVNRNIRFLESKNW 295 Query: 189 DYIV--RVHWRGL----RWLTAE---------GMRFDMMGFLRGLDCGKNGETTVMIGNS 233 YI+ R+ L + D+ + ++ + Sbjct: 296 KYIISYRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQIISFSQ 355 Query: 234 GNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA---------------ET 278 LI + +K R + E Sbjct: 356 KRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPINKGAFYELDIEKIQED 415 Query: 279 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAW 338 + G+ + T+ + S ++V + Y +WQIE FK LK L L + + Sbjct: 416 QKYDGYYVYETN--RTDLSVKEVINLYSKQWQIESNFKTLKGKLSLRPMYLSTWNHIVGY 473 Query: 339 IFANLLAAFLIDDII 353 I ++ ++ II Sbjct: 474 ICLCFISLVFLNYII 488 >UniRef50_A1BDB0 Transposase, IS4 family protein n=3 Tax=Chlorobium phaeobacteroides DSM 266 RepID=A1BDB0_CHLPD Length = 554 Score = 74.1 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 54/383 (14%), Positives = 112/383 (29%), Gaps = 54/383 (14%) Query: 35 REIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQ 94 R ++ A T+ L LA S R+ + L A+ + D+ + + Sbjct: 131 RRVQAAKTIKHLVLARIANPASKRKSVIDLERDFGVKLDLSAVYR----TLDYVDEQSIE 186 Query: 95 TLAVRAAVTGCTSGK---RLRLVDGTAI--------SAPGGGSAEWRLHMGYDPH----- 138 + +A + D T + G ++ H Sbjct: 187 LIQKKAWEAATGLFGEKIDVVFYDCTTLYFESFTDDELRDKGLSKENKHSEVQVLLAMMV 246 Query: 139 --TCQFTDFELTDSRD---------AERLDRFAQTADEIRIADRGFGSRPECIRSLAFGE 187 + L + E++ A+ + +AD G S+ E + + E Sbjct: 247 SKHGFPLGYRLYNGATWEGHTLKDAIEQIKGMAEVDRVVFVADSGLLSK-ENLALIEQSE 305 Query: 188 ADYIV--RVHWRGLRWLTA--EGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFP 243 YIV R+ + + + +R + +++ S + F Sbjct: 306 KQYIVAARLKNQPETIKEQILDKAGYQQGEQIRFKEIPLPENRRLLVSYSEKRARKDAFD 365 Query: 244 ARLIAVSLPPEKALISKTRLLSENRRKGRVVQAE----------------TLEAAGHVLL 287 + L + +N + V+ E T + G + Sbjct: 366 RQKALDKLQKKLNKSKNPESFLKNTSYRKYVKIEQLSQQTVIIDEEKISYTAQWDGLHGV 425 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 +T++ + +A + YR WQIE F+ K L + + P +A + +A Sbjct: 426 ITNIT--DITASDAFEHYRGLWQIEETFRLTKHDLKVRPIYHWTPRRIEAHVAMCFMALV 483 Query: 348 LIDDIIQPSLDFPPRSAGSEKKN 370 I + + +N Sbjct: 484 CIRHLSYRVQLQYQALSPEIIRN 506 >UniRef50_Q737L2 IS231-related transposase n=3 Tax=Bacillus cereus group RepID=Q737L2_BACC1 Length = 167 Score = 73.7 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 50/140 (35%), Gaps = 20/140 (14%) Query: 127 AEWRLHMGYDPHTCQFTDFELTDSRDAERL----DRFAQTADEIRIADRGFGSRPECIRS 182 A ++ + YD H+ +F +F++ ++ ++ +++ I D G+ S E + Sbjct: 19 AGIKIQLEYDLHSGEFLNFQVGPGKNNDKTFGTECLDTLRPEDLCIRDLGYFS-LEDLDQ 77 Query: 183 LAFGEADYIVRVHWRGLRWL---------------TAEGMRFDMMGFLRGLDCGKNGETT 227 + YI R+ ++ +E + DM L+ L T Sbjct: 78 MDQRGTYYISRLKLNTNVYMKNSNPEYFKNSAIKKQSEYIHIDMKQILKQLQLYLEVCTP 137 Query: 228 VMIGNSGNKKAGAPFPARLI 247 M + N P LI Sbjct: 138 EMNASFINIVYFHPLFFVLI 157 >UniRef50_D1JFQ9 Putative uncharacterized protein n=1 Tax=uncultured archaeon RepID=D1JFQ9_9ARCH Length = 483 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 81/245 (33%), Gaps = 20/245 (8%) Query: 127 AEWRLHMGYDPHTCQFTDFELTDS--RDAERLDRFAQTAD---EIRIADRGFGSRPECIR 181 + + + + F + RD L + A + I D+GF S+ + Sbjct: 201 PQIHMIFLFSLDHHMPSYFRIVAGSIRDVSSLVLTVKEAGIKNAVLITDKGFYSKTNILA 260 Query: 182 SLAFG--EADYIVRVHWRGL----RWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGN 235 + E YI+ + + + FL + + G+ Sbjct: 261 LVKEKKDELHYIIPLKRDSSLIDYTKIRQGNRKSFDGYFLFEKRAIWYYKYELEDGDLKG 320 Query: 236 KKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDE 295 KK RL EK +S+ + G + ++T L + Sbjct: 321 KKVIVFLDERL---RAEEEKDYLSRLEKNDTATLDN---FFKIQHRMGTIAVITDLDK-- 372 Query: 296 YSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQP 355 S E++ + + R +IE+ F K++L+ D ++ + W+F N +A + + Sbjct: 373 -SGERIYNLLKSRVEIEIMFDAFKNVLNADRTYMRDDYQMEGWMFINFIALVFYYRLYKI 431 Query: 356 SLDFP 360 D Sbjct: 432 LADNS 436 >UniRef50_A7N7H3 Putative uncharacterized protein n=31 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N7H3_VIBHB Length = 397 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 60/183 (32%), Gaps = 16/183 (8%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 I D GF + + + ++ RV + D + G Sbjct: 159 IVTTDAGF--KVPWFKPIEQQGWYWLGRVRGNSKLRVNDRWCSADEVFVQAQYKPQHLGT 216 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 + + P ++ K+ K + S + ++ V + Sbjct: 217 AELTKQHQY--------PCQVCLYRK---KSKGRKAKNWSGSLQRNTVSLSHAKGEREPW 265 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLK---SLLHLDALRAKEPELAKAWIFAN 342 LL+++LP + + AE+V Y R IE F+ K L L+ + P+ + + Sbjct: 266 LLVSNLPGETWFAERVVALYTQRMSIEEGFRDTKNERYGLALNFSGSASPKRIEILLMIG 325 Query: 343 LLA 345 +L Sbjct: 326 MLT 328 >UniRef50_A3H523 Transposase (IS4 family) protein (Fragment) n=1 Tax=Vibrio cholerae B33 RepID=A3H523_VIBCH Length = 371 Score = 71.8 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 64/210 (30%), Gaps = 13/210 (6%) Query: 164 DEIRIADRGFGSRPECIRSLAFGEADYI---VRVHWRGLRWLTAEGMRFDMMGFLRGLDC 220 D I + D + + L +++ R+ + + + G + Sbjct: 128 DIIIVCDS-WFGNNGLFKPLRTKLGNFVHLLSRLRSNTVLYSIPKIGSSKKPGRPKKYGS 186 Query: 221 GKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLE 280 + + E S+ +L + RVV Sbjct: 187 RLGSCAEMAAAF-----MAYASTYHVFLYGKYREVNAYSQIVMLKTLKCPVRVVWVF--- 238 Query: 281 AAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIF 340 + + + S EQ+ + Y RW+IE FK +K + + + + I Sbjct: 239 -RKTQWIAIFSTDLKLSVEQIIEYYGARWKIESGFKEIKQDIGSSKSQTRNAQAVINHIN 297 Query: 341 ANLLAAFLIDDIIQPSLDFPPRSAGSEKKN 370 +++AA +I + P R + +N Sbjct: 298 FSIMAATIIWIYGSRLENIPERRHKVKGRN 327 >UniRef50_P30192 Putative uncharacterized protein ychG n=8 Tax=Enterobacteriaceae RepID=YCHG_ECOLI Length = 299 Score = 71.8 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 39/281 (13%), Positives = 72/281 (25%), Gaps = 34/281 (12%) Query: 10 AILAHIGKPEELDT-SARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLH- 67 + A E + +A A RRR + ++ + + P +R + A Sbjct: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPG-DMVIWMVVQNEPITDVVRRLNLSADGEA 80 Query: 68 --DVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGC---TSGKRLRLVDGTAISAP 122 ++ S V R R A L QT R A G +L +DG P Sbjct: 81 GMNLLARSAVT-QARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTP 139 Query: 123 GGGS------------------AEWRLHMGYDPHTCQFTDFELTDSRDAERL----DRFA 160 RL + + + R +E + Sbjct: 140 DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 Query: 161 QTADEIRIADRGFGSRPECIRSLAFGEA--DYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 218 + I + D+ F S + +L +++ + G + L Sbjct: 200 IPDNSITLFDKLFYSEDLLL-TLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRL 258 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALIS 259 + + V I + A + Sbjct: 259 EHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 >UniRef50_A6DM44 Putative uncharacterized protein n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM44_9BACT Length = 272 Score = 71.8 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 73/207 (35%), Gaps = 35/207 (16%) Query: 163 ADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGK 222 + +AD G+ + I + ++ + L + + ++ ++ Sbjct: 6 EKSLYMADAGY-NGMAFIALAKELGHEVLMPLKMSHLAQKMNDSKKRSLVHEIKLTRSH- 63 Query: 223 NGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA 282 + G RLI T + Sbjct: 64 -----LKNYPDHQHLLGTTLKIRLIR--------------------------TLGTSKLK 92 Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 VL+ T L + ++S ++++ YR R+ +E+A++ LK L+L+++R ++ K +++A Sbjct: 93 SQVLITTLLDDAKFSWKELSGLYRQRYLVEVAYRHLKVNLNLESIRKRKFSRIKKFMYAA 152 Query: 343 LLAAFLIDDIIQ--PSLDFPPRSAGSE 367 + L + + P G++ Sbjct: 153 IALYNLAAVLRNRIKLPEILPEDHGTK 179 >UniRef50_A4A0C6 Probable transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A4A0C6_9PLAN Length = 442 Score = 71.4 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 54/361 (14%), Positives = 101/361 (27%), Gaps = 85/361 (23%) Query: 42 TLLRLGLAYGPGGMSLRE------VTAWAQLHDVATLSDVALLKRL-RNAADWFGILAAQ 94 + + G + R QL AT LLK L R+ + A Sbjct: 46 AMAITCWGWLQGTLEERTATAQAMTCEALQLDFTATR--QGLLKALARHGEALIPQVVAH 103 Query: 95 TLAVRAAVTGCT--SGKRLRLVDGTAISAPGGGSAEWRLH-------------------- 132 + G GK VDG AP + + + Sbjct: 104 IADQLRELKGDWTQRGKVNFAVDGAKFLAPRTAANQQQFASKKEKQYASKSNQSKAESAQ 163 Query: 133 ----MGYDPHTCQFTDFELTDSRDAERLDR----FAQTADEIRIADRGFGSRPECIRSLA 184 + + + + S+ +ER ++ IAD + P ++ Sbjct: 164 LLATVVWHLTAGLPYRWRIAGSKGSERHALTDMLDELPSNARIIADAEYVGYPLW-SAIL 222 Query: 185 FGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPA 244 + ++VRV + L+ L + V + + P Sbjct: 223 DSKRSFLVRVGSN--------------VSLLKNLGSLRIRNGFVYFWPTTAMRKLQP--- 265 Query: 245 RLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADC 304 K R+++ +T + +++ E + S + + Sbjct: 266 -----------------------PLKLRLIKVDTGKETIYLV----SSELDMSDQAACEL 298 Query: 305 YRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS-LDFPPRS 363 YR RW +E+ F+ +K LR P I L+ + + D P + Sbjct: 299 YRQRWGVEVFFRTVKQSCQRSKLRCCTPRNLLTEIHWTLIGVWAAFYYAKQVQRDQPGKR 358 Query: 364 A 364 A Sbjct: 359 A 359 >UniRef50_Q04QP0 Transposase, ISLbp11 n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QP0_LEPBJ Length = 243 Score = 71.4 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 57/167 (34%), Gaps = 6/167 (3%) Query: 194 VHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPP 253 + R + G + + L T + K +L L Sbjct: 3 IRANHERKIEGGGCSWSYLETLEP------AHTYTITVPRKKGKEAREAIIQLRFEKLTI 56 Query: 254 EKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIEL 313 + K + V E+ L T + +A++V Y+ RW IE+ Sbjct: 57 KSPQYKKLENIDMYALTATEVDGPKEESIDWKFLTTIPIHNSENAKRVISYYKSRWGIEV 116 Query: 314 AFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 FK LKS ++++ + K + KA I + + A+ + + + P Sbjct: 117 FFKVLKSGCNIESTQFKFGDRFKACIAVSAIVAWRVTMLTFLGRNIP 163 >UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W6S4_DESAS Length = 465 Score = 70.6 bits (171), Expect = 9e-11, Method: Composition-based stats. Identities = 28/77 (36%), Positives = 42/77 (54%) Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 A + G LLT+ D SA ++ YR R QIE+ FK LK LL L+ + + PE Sbjct: 296 AHEEKLDGIFALLTNYDADRVSANKLIKKYRERNQIEVNFKDLKGLLDLERIFLQLPERI 355 Query: 336 KAWIFANLLAAFLIDDI 352 +A++F LA F++ + Sbjct: 356 EAYVFPKTLAYFVLAFL 372 >UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=A7BZU6_9GAMM Length = 270 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 34/267 (12%), Positives = 79/267 (29%), Gaps = 22/267 (8%) Query: 2 NYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR--- 58 ++ +L I +P D ++ + + L + G S+ Sbjct: 8 EVQSSVFNKLLEPI-EPFIQDQESKLPKH--HNQIFNYYDFFILLMYYFVAGKQSVGLFV 64 Query: 59 --EVTAWAQLHDVATLSDVALLKRL-RNAADWFGILAAQTLAVR--AAVTGCTSGKRLRL 113 E+ + ++ R + + F + L+ ++ ++ L Sbjct: 65 KTELKLLPITLGLRQVAYSTFNDAFERFSPNLFQEVFKYILSTIPFKQISELSTLGVLYC 124 Query: 114 VDGTAI--------SAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFA-QTAD 164 +DG+ + +LH+ ++ + +F +T + +ER A Sbjct: 125 IDGSLFPVINSMLWAEYTSKHCALKLHLCFELNRMIVVEFLVTAANGSERKALQEMLKAG 184 Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 I DRG+ S C + EA ++ R+ L T + + + + + Sbjct: 185 VTYIGDRGYMSFELC-HLMMQKEAYFVFRLKRN-LLCFTIKKLPVLLPESVESVFKQVTD 242 Query: 225 ETTVMIGNSGNKKAGAPFPARLIAVSL 251 E + S Sbjct: 243 EMIDFTNDKFKHTYRLVCFTVPFHHSK 269 >UniRef50_UPI0000164DB3 hypothetical protein TVN0693 n=1 Tax=Thermoplasma volcanium GSS1 RepID=UPI0000164DB3 Length = 83 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 18/72 (25%), Positives = 35/72 (48%), Gaps = 2/72 (2%) Query: 301 VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAA--FLIDDIIQPSLD 358 + Y RW I++ F+ +K+ L +D L +++ IF ++A +I D+I S+ Sbjct: 3 IHWIYSQRWNIDIFFRTMKTYLKIDHLISRKINSIMVQIFTAMIAYIVLMIQDMISCSMS 62 Query: 359 FPPRSAGSEKKN 370 P + + N Sbjct: 63 IPKMISLPLRIN 74 >UniRef50_D1Y365 Transposase n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y365_9BACT Length = 536 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 60/390 (15%), Positives = 104/390 (26%), Gaps = 66/390 (16%) Query: 21 LDTSARNAGALTRRRE------IRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSD 74 + + RR AA L P + D ++L+ Sbjct: 46 WERLGLDVWFKQYRRNHRLKFDFDLAAFFLAALRILAPCSKKRTHEYRGNFVFDFSSLTQ 105 Query: 75 VALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWR---- 130 L + L + +L A + T + L D T S E R Sbjct: 106 ADLYETLGLLSGSKDVLIRNVNKGIADIYERTM--TVALYDCTTFYFESFDSDELRARGM 163 Query: 131 --------LHMGYDPH---TCQFTDFELTDSRDAERLDRFAQTADE---------IRIAD 170 + + D+EL +E +AD Sbjct: 164 SKENRANEVQVVMGLLIDADGIPLDYELFRGNTSEIKTLLQVVRKHKVNSGLGKVTVVAD 223 Query: 171 RGFGSRPECIRSLAFGEADYI-----VRVHWRGLRWLTAEGMRFDMMGF------LRGLD 219 RG + ++ LA DYI R+ + +E F ++ LD Sbjct: 224 RG-LNCKLNLQHLAEEGFDYIVPQSISRLKKDVKERVLSEENWEHSERFHEDVFKMKRLD 282 Query: 220 CGKNGETTVMIGNSGNKKAGAPFPARLIA-------------------VSLPPEKALISK 260 + E I + + K L + + L SK Sbjct: 283 ARADPERDAGIIVTWSLKRHHHDLDVLEELWTKGKELIAKGASAVETSMKHGSRQFLKSK 342 Query: 261 TRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKS 320 E + + +A +V+ ++ S +++ R W++E F+ KS Sbjct: 343 KGKKGEYEVNTSLYEKRKKQAGFYVIATSNKDA---SPQEIFANLRQLWRVEECFRVFKS 399 Query: 321 LLHLDALRAKEPELAKAWIFANLLAAFLID 350 L + PE + LA L Sbjct: 400 NLDARPVFVWTPEHIRGHFLVCYLALVLER 429 >UniRef50_C5JAH9 Transposase, IS4-like n=1 Tax=uncultured bacterium RepID=C5JAH9_9BACT Length = 597 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 73/421 (17%), Positives = 116/421 (27%), Gaps = 79/421 (18%) Query: 14 HIGKPEELDTSARNAGA--------LTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQ 65 IG P + R GA R E + L S R W + Sbjct: 78 RIGPPLIFERLWRETGAKAVIDDLLAGRGFEFSVERAVFLTVLHRLIDPGSDRAAERWRE 137 Query: 66 LHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVT--------------GCTSGKRL 111 + + ++D+ L R A LA QT A T G S L Sbjct: 138 AYAINGMADLDLHHLYRAMAWLGEDLADQTGAGWVPRTTKDVIEERLFARRHGLFSEMSL 197 Query: 112 RLVDGTAISAPGGGS----------------AEWRLHMGYDPHTCQFTDFELTDSRDAER 155 + D T++ G G + L + D + E+ A+ Sbjct: 198 AIFDTTSLYFEGRGGETLGRHGHSKDHRPDLHQMVLGVVID-EAGRPVCSEMWPGNTADV 256 Query: 156 LDRFAQTA---------DEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEG 206 IADRG S +L +YI+ V R A Sbjct: 257 TTLLPVVTRLRERFLIGQVCIIADRGMISSATV-ATLERQGIEYILGVRERRTLEAAAVL 315 Query: 207 MRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAP----FPARLIAVSLPPEKALIS--- 259 L G + + K G ++ + K + Sbjct: 316 ADATPFTPLAIPKANGRGTLDLQVKEVVRKVKGPTSRIAKHRYIVCYNGAEAKNDAAARE 375 Query: 260 -----------------------KTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY 296 + L + + R+ + + AA + Sbjct: 376 AILANLTKALGQGDKSLVGNKGFRRHLKTTDGRRFAIDPDQVAAAAKFDGIYILRTNSRA 435 Query: 297 SAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPS 356 SA QVA YR R+++E F+ K++L + K E +F + LA L D++ Sbjct: 436 SALQVALRYRERYRVEDIFRTSKTILETRPIFHKCDETICGHVFCSFLALVLRKDLMDRL 495 Query: 357 L 357 Sbjct: 496 T 496 >UniRef50_A9KH40 Putative uncharacterized protein n=1 Tax=Coxiella burnetii Dugway 5J108-111 RepID=A9KH40_COXBN Length = 242 Score = 69.9 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 24/219 (10%), Positives = 61/219 (27%), Gaps = 23/219 (10%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVAT 71 L+ + + R+I L L ++ ++ + + Sbjct: 13 FKQAFSETALNELGKQVKFAQKLRKITPFRLALSLISSFAGKIQTIADTHRTFNELNEEH 72 Query: 72 LSDVALLKRL--RNAADWFGILAAQTLAV-------RAAVTGCTSGKRLRLVDGTAISAP 122 + K+L R ++ + + ++ + + + + DG++ + Sbjct: 73 VQYKPYHKQLAKRAFPNFMRRVVCRLMSEFACRTLTINDHNPFSMFEHIFIHDGSSYAIK 132 Query: 123 GGGS-----------AEWRLHMGYDPHTCQFTDFELTD--SRDAERLDRFAQTADEIRIA 169 A +LH D + T L D L + + + + Sbjct: 133 SNLKSVFPGKYKQGLATVQLHTTMDLLADEITAVILAPFSHADVNYLPSAKEIKNSLLLL 192 Query: 170 DRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMR 208 DR + +R + + +IVR + Sbjct: 193 DRAYLDF-SYLREVDTHQGFFIVRGKRHMNPMIEQGYDS 230 >UniRef50_B2AKB8 Transposase, IS4 family n=40 Tax=cellular organisms RepID=B2AKB8_CUPTR Length = 442 Score = 69.9 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 81/219 (36%), Gaps = 28/219 (12%) Query: 150 SRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGE--ADYIVRVHWRGLRWLTAEGM 207 R AE+ QT + + DR G E + AD+++R L G Sbjct: 165 ERVAEQAALLPQTR-LVYMTDRE-GDIAELMARAQELGQPADWLIRSQHNRN--LAEGGK 220 Query: 208 RFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSEN 267 +D + GE T ++ +KA ++ + +L Sbjct: 221 LWDSVDA-----SPVLGEITFILPGRAGQKA-----------REVKQELRAQRMKLPGLV 264 Query: 268 RRKGRVVQAETLEAAG-----HVLLLTSLPEDEYSA-EQVADCYRLRWQIELAFKRLKSL 321 + V A +EA L+T+ + A ++ + YR RW+IE+ F LK+ Sbjct: 265 GAEFTCVAAREIEAPAGVKPVVWRLVTNREAQDADAVNKLVEWYRARWEIEMFFHVLKTG 324 Query: 322 LHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFP 360 ++AL+ + + + ++ A+ I +++ P Sbjct: 325 CKVEALQLSHMDRVERALALYMVVAWRIARLMRLGRTCP 363 >UniRef50_A8L6T7 Transposase IS4 family protein n=10 Tax=Actinomycetales RepID=A8L6T7_FRASN Length = 522 Score = 69.9 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 52/371 (14%), Positives = 92/371 (24%), Gaps = 67/371 (18%) Query: 41 ATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAA--------------- 85 L LA S + + S + +RLR A Sbjct: 106 EVFQALVLARVVEPTSKLDSLRVLEEVGAPAPSYRTVQRRLRRYAGVDEVDAETGQPVPV 165 Query: 86 -----DWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRL--------- 131 W L+ L L D T + +R Sbjct: 166 DPAGGPWRARLSRACADHVKLGPAI-----LLLYDVTTLYFETDQGDGFREPGFSNERRL 220 Query: 132 --HMGYDPHT---CQFTDFELTDSRDAERLDRFAQTA---------DEIRIADRGFGSRP 177 + T + A+ D +AD G S Sbjct: 221 EPQVTVGLLTDGAGFPLTVHAFEGNRADTTTMLPVLTAFLKAHDLRDVTVVADAGMVSEA 280 Query: 178 ECIRSLAFGEADYIVRVHWRGLRWLTAEGMR-------FDMMGFLRGLDCGKNGETTVMI 230 R++ +++ + +L D F++ G + Sbjct: 281 NK-RAIEAAGLSFVLGARVPEVPYLVKAWRERHPDTEIPDGHVFVQPWPAGPSDNRRDHT 339 Query: 231 GNSGNKKAGAPFPARLIAVSLPP-------EKALISKTRLLSENRRKG--RVVQAETLEA 281 K A R I + + A+ + +K R ++ + Sbjct: 340 VFYQYKADRARRTLRGIDQQVAKAENAVAGKTAVKRNRYVRLTGAKKSVNRALEEKNRAL 399 Query: 282 AGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFA 341 AG +T+LP + +QV Y +E +F+ KS L + E +A + Sbjct: 400 AGIKGYVTNLPNPD--PDQVISTYSQLLNVEKSFRMSKSDLAARPIYHHTRESIEAHLTV 457 Query: 342 NLLAAFLIDDI 352 A + I Sbjct: 458 VFAALAVSRWI 468 >UniRef50_B8FNX6 Transposase IS1634 family protein n=9 Tax=Desulfitobacterium hafniense RepID=B8FNX6_DESHD Length = 532 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 61/361 (16%), Positives = 92/361 (25%), Gaps = 47/361 (13%) Query: 30 ALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFG 89 R I + + L S V+ W + L A D+ Sbjct: 104 FNERDLSIDVQEAIFCMVLNRLTEPTSKLGVSDWKDSVYRPEFESLKLHH-FYKAIDFLD 162 Query: 90 I----LAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEW-------------RLH 132 L Q T L D T+ G A RL Sbjct: 163 ENKDTLEEQLFFHHT--NLFTQQLDLVFFDTTSTYVEGDAGAFDLLEYGHSKDHRPDRLQ 220 Query: 133 MGYDPH---TCQFTDFELTDSRDAERLDRFAQTAD---------EIRIADRGFGSRPECI 180 + + ++ +D I + DRG + Sbjct: 221 VMIGLLMSRDGIPIAHHVFPGNTSDTDAFIEAVSDLKKRFTIQRVIVVGDRGMMGKRTLE 280 Query: 181 RSLAFGEADYI-VRVHW-RGLRWLTAEGMRF----DMMGFLRGLDCGKNGETTVMIGNSG 234 + VR+ + L + D + L GK V + Sbjct: 281 LLEELQLHYILGVRMRNVKAGPELATSPEPYVFTKDNLKVKEVLHQGKRY--IVCLNEEE 338 Query: 235 NKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVV-------QAETLEAAGHVLL 287 K+ + + E I SE ++ V + +AA L Sbjct: 339 AKRDQWVREQIEVKLRSKLEHGSIKDLIGHSEYKKYLNVSAEAATINTDKLKQAAVFDGL 398 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 E E+VA YR WQIE AF+ LKS L L + + I LA Sbjct: 399 YILQTNTELPTEEVATAYRDLWQIERAFRNLKSTLDLRPVYHWKERRISGHIMLCFLALV 458 Query: 348 L 348 + Sbjct: 459 V 459 >UniRef50_B5CN98 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=B5CN98_9FIRM Length = 582 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 51/158 (32%), Gaps = 8/158 (5%) Query: 207 MRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSE 266 + R + + +I G K+ G + V Sbjct: 365 YSPKYKAYQRRIRDRQIEHAEKIINTPGRKRKGKNQNDPMRFVKKTSVTPDGEIANKQVY 424 Query: 267 NRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDA 326 N + ++ + G ++T+L D ++ + RW+IE F+ +K+ Sbjct: 425 NIDEEQI--QKEEMYDGFYAVITNLEGD---VSEIIRINKQRWEIEENFRIMKTEFEARP 479 Query: 327 LRAKEPELAKAWI---FANLLAAFLIDDIIQPSLDFPP 361 + + E KA + +LL L++ + + Sbjct: 480 VYVRREERIKAHFMTCYISLLLYRLLEKKLGDAYTVSQ 517 >UniRef50_B0JP83 Transposase n=112 Tax=Cyanobacteria RepID=B0JP83_MICAN Length = 422 Score = 69.1 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 54/186 (29%), Gaps = 11/186 (5%) Query: 166 IRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGE 225 I + D G+G+ +++L + Y+ + + EG + + + + Sbjct: 191 IVLIDAGYGNNTNFLKALEERKLKYLGGLAKNRKVIIEKEGGVEETIQLEQLAKSLSEKD 250 Query: 226 TTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHV 285 + N +K R L E+ L S + Sbjct: 251 WEKITLNLDKEKTVWVAVFRAKISQLEGERNLAIVMNASSMEKA-----------TEVDY 299 Query: 286 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLA 345 + + D +A + Y R +E+ ++ K L L + ++ A Sbjct: 300 FITNVVEADTVTASWIVRTYTERNWVEVFYREAKGWLGLREYQVRDKRSLLRHFILVFCA 359 Query: 346 AFLIDD 351 I Sbjct: 360 YTFILW 365 >UniRef50_UPI0001BC2E1C TnpB family transposase n=1 Tax=Brevibacterium linens BL2 RepID=UPI0001BC2E1C Length = 515 Score = 68.3 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 59/427 (13%), Positives = 113/427 (26%), Gaps = 63/427 (14%) Query: 1 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDA--ATLLRLGLAYGPGGMSLR 58 +++ H + A + K A +R +L LA S Sbjct: 56 LDFDHTDKPATAPAVVKSSRSQVIVDTIRAAYQRLGFDTVDDEAFFQLVLARLIEPTSKS 115 Query: 59 EVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTA 118 + V + L L+ A D Q A + T+G L L D T Sbjct: 116 DSVRVLSELGVDVVHRNTFLTCLKRARDHNYR--DQIAAKCFDYSVATTGISLLLYDVTT 173 Query: 119 ISAPGGGSAEWR---------------LHMGYDPHTCQFTDFELTDSRDAERLDRFAQTA 163 + R + + D T + + AE Sbjct: 174 LYFEAEKEDSLRKVGYSKERRVDPQIVVGLLVD-RTGFPLEIGCFEGNKAETHTIIPVIK 232 Query: 164 ---------DEIRIADRGFGSRPECIRSLAFGEADYIV-----RVHWRGLRWLTAEGMRF 209 D + AD G S + + L +IV + + G Sbjct: 233 RFQDRHQVSDMVVAADAGMLS-SKNLTELDDAGLRFIVGSRQVKAPHDLATFFAWNGEWA 291 Query: 210 DMMGFLRGLDCGKNGET----------TVMIGNSGNKKAGAPFPARLIAVSLPPEKALIS 259 D + + V S A + R + + Sbjct: 292 DDQTVIDTITPRGQKRLAPHRTQTRSEPVWDAESFPDAWRAVWQYRRKRAMRDEQTLNLQ 351 Query: 260 KTRLL-----SENRRKGRVVQAETLE-------------AAGHVLLLTSLPEDEYSAEQV 301 + R + + R V+ + E +G +T++P+ A +V Sbjct: 352 RNRAISIIEGDSQPKSARFVKTKGAEKVFDEKAYDRAMKLSGFKGYVTNIPKTIMPAREV 411 Query: 302 ADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPP 361 Y W +E +F+ K+ L+ + + + +A + A + + S Sbjct: 412 IGSYHDLWHVEQSFRMSKTDLNARPMFHRTRDAIEAHLTVVFTALGVARFMQDASGVSLK 471 Query: 362 RSAGSEK 368 + + + Sbjct: 472 KIITTLR 478 >UniRef50_C7TBQ5 Transposase n=4 Tax=Lactobacillus rhamnosus RepID=C7TBQ5_LACRG Length = 374 Score = 68.3 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 56/199 (28%), Gaps = 29/199 (14%) Query: 168 IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT 227 + D + + P+ L I + + G D+ L K Sbjct: 143 LFDS-WFAYPKMFHELLKRGITGIGMIKQTEKVYFRYRGREMDVKRLYATLKQSKRLIHQ 201 Query: 228 VMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLL 287 + +A+ L V + + VL Sbjct: 202 ---HYLYSPIVQYDMDGTKMAMKLVF--------------------VTKKGAKGRFLVLA 238 Query: 288 LTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAF 347 T E++ Y RWQIE FK K L DA + + + A + +++ Sbjct: 239 TTKTN---LRPERIIQMYGRRWQIEGYFKVAKQYLRFDATQVRGYDGLCAHMAMVMMSYD 295 Query: 348 LIDDIIQPSLDFPPRSAGS 366 L+ + S R+ G Sbjct: 296 LLA--LCQSGQTEERTLGD 312 >UniRef50_C0GNX3 Transposase IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GNX3_9DELT Length = 851 Score = 68.3 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 50/143 (34%), Gaps = 3/143 (2%) Query: 215 LRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS-LPPEKALISKTRLLSENRRKGRV 273 L L + T+ + + K + + + SK +L E ++ R Sbjct: 624 LIYLTVRSESDITMCLKQNPKIKRWKEATIQQGVWQDYREQYRIASKDFILPETKKPFRF 683 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 V + E + + +YS ++ D Y +RW +E K L L+ PE Sbjct: 684 VVKQNKETSEIRCFGS--THTDYSPTKILDAYHIRWPVETGIKDLIENYFLNKPTGTSPE 741 Query: 334 LAKAWIFANLLAAFLIDDIIQPS 356 +A + +LA +D Sbjct: 742 KVEAHYYCIMLARLAVDYFRSQL 764 >UniRef50_UPI0000F5175B transposase-like protein n=2 Tax=Ferroplasma acidarmanus fer1 RepID=UPI0000F5175B Length = 497 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 48/362 (13%), Positives = 95/362 (26%), Gaps = 51/362 (14%) Query: 36 EIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQT 95 + LL + S + W + + + + A + N D F + Sbjct: 88 GMSPGDYLLLFIMNRLSEPCSKNSIGKWMKRNYASIIFPEASSQDFWNIMDRFADKNIKN 147 Query: 96 LAVRAAVTGCTSGKRL--RLVDGTAI-------SAPGGGSAEWRLHMGYDPH-------- 138 + R G D + + S G + YD + Sbjct: 148 IMDRVRDRIMEMGYNTGDIFFDASNMYTFMEENSIAKKGHNKKH---RYDLNQVSYYIAS 204 Query: 139 --TCQFTDFELTDSRDAERLDRFAQ----TADEIRIADRGFGSRPECIRSLAFGEADYIV 192 E + D I DRG+ S+ I ++ YI Sbjct: 205 TYDYIPLYGEAYPGNIHDSRTFENIVKNIPEDSTLIFDRGYNSKDN-IDMISNR--RYIG 261 Query: 193 RVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMI-----GNSGNKKAGAPFPARLI 247 + + ++ D L GK+ + NK + Sbjct: 262 ALKQSDNHSIMEINVKEDSYIELIRNVYGKDHRIILYHSKSLETRKKNKFMKHLAKVMVK 321 Query: 248 AVSLPPEKALISKTRLL--------------SENRRKGRVVQAETLEAAGHVLLLTSLPE 293 A + S + G ++ + + + + Sbjct: 322 AKKIIDSGDSDSMEKARIYLESEHLNETILLPSLEIDGERMEYRISMMGKNAIFTNIIDK 381 Query: 294 DEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII 353 D AE + + Y+ R ++E F+ + + L P+ K +F +L+A + I Sbjct: 382 D---AESIVELYKKRARVEHCFRTINAKGIAFPLNHWTPQKIKVHMFFSLMAYLFLALIY 438 Query: 354 QP 355 Sbjct: 439 NE 440 >UniRef50_C8VXW5 Transposase IS4 family protein n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VXW5_DESAS Length = 587 Score = 67.9 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 62/190 (32%), Gaps = 8/190 (4%) Query: 165 EIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 E I DR + + R A ++ ++ + + C Sbjct: 328 EFIIDDRRYILTFDVARFFDEHHAQL-----NNVAYFVQWLTVKNQSLREAKKKRCQSLL 382 Query: 225 ETTV--MIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAA 282 E V M+ KK + + ++ + + V Q Sbjct: 383 EREVAAMLKRKHLKKWVS-VNIEPYDFEVINKRGNSRTIQSFQLSYTINTVAQKNEQRIH 441 Query: 283 GHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFAN 342 G +T+L ++A + YR + ++E AF +K L L + + A + Sbjct: 442 GITCFITNLDVTSHTAIDIIQWYRRKNKVEEAFHEIKDHLDLRPIYLTREQRVMAHVIIC 501 Query: 343 LLAAFLIDDI 352 +LA F+ +DI Sbjct: 502 VLAYFIFNDI 511 >UniRef50_Q8DM76 Tlr0247 protein n=2 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DM76_THEEB Length = 166 Score = 67.9 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 3/88 (3%) Query: 265 SENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 N + RVV+ E L T+L S E+V+ YR RW IE +K LK L L Sbjct: 41 KFNDHRYRVVEFYD-ENQREFCLATNL--KHLSDEEVSQLYRHRWAIENLWKFLKMHLSL 97 Query: 325 DALRAKEPELAKAWIFANLLAAFLIDDI 352 D L AK + I+ L+ +++ + Sbjct: 98 DRLIAKSLKGMVNQIYMFLIVYLILELV 125 >UniRef50_A4BNE3 Transposase n=2 Tax=Gammaproteobacteria RepID=A4BNE3_9GAMM Length = 546 Score = 67.5 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 57/416 (13%), Positives = 104/416 (25%), Gaps = 79/416 (18%) Query: 25 ARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVA----TLSDVALLKR 80 A + RR A + + S W + + ++S LL+ Sbjct: 93 AIKRALRSSRRSFDAEALVRAMVFNRLCAPDSKLGCLQWLETVSIPGMPESISHDQLLRT 152 Query: 81 LRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGS-------------- 126 + D + A+ A+ + D T + G G Sbjct: 153 MDALMDRTEAVEARAAEQLRAM--LDQQLSVVFYDLTTVRIHGEGQLPEDVRAFGMNKET 210 Query: 127 -------------AEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGF 173 + L + + H + + + L+RF I +ADRG Sbjct: 211 GAIARQFVLGVVQSADELPLMHTVHAGNIAETKTLQGMLCQVLERFPVER-VILVADRGL 269 Query: 174 GSRPEC-----IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLR------------ 216 S + + + +I+ V R L D L Sbjct: 270 LSLDNVSELVALAEASARKLQFILAVPARRYAELGGTLEGMDFAQGLAEGRFAEQRLVVA 329 Query: 217 ----------GLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSE 266 + E K A R ++E Sbjct: 330 HDPQRAAEQSSRRRERIAELEAFAEALVRKLDVQDAGQSTRGRRASDRGAYSRFQRAVAE 389 Query: 267 NRRKGRVVQAETLEAAGHVL--------------LLTSLPEDEYSAEQVADCYRLRWQIE 312 + + + + L+ ++SA +V Y+ IE Sbjct: 390 AELTRFIQADYQADRFSYSVDEEAIARAELFDGKLVVLTNVVDFSAAEVVARYKALADIE 449 Query: 313 LAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID----DIIQPSLDFPPRSA 364 F+ LKS L + + + P+ +A LA L + P +A Sbjct: 450 RGFRVLKSDLEIAPVYHRLPDRIRAHALICFLALVLYRIMRMRLKAHGSTVSPETA 505 >UniRef50_UPI00016C5887 hypothetical protein GobsU_05723 n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5887 Length = 321 Score = 67.5 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 38/364 (10%), Positives = 85/364 (23%), Gaps = 79/364 (21%) Query: 11 ILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLRE-----VTAWAQ 65 + + + +D A L L G SL + +T + Sbjct: 1 MFQQLLARDVIDALAPPPARAVY----TPWVVLWLLVYQRLHGNGSLGDAVSHFLTQFPS 56 Query: 66 LHDVATLSDVALLKRLRNAADWF-----GILAAQTLAVRAAVTGCTSGKRLRLVDGTAIS 120 + + + + + +A G+R+ ++DGT + Sbjct: 57 AAEQPSGATGGYRHARTRLPNAVVATAGRRVFDTLVAA---YPPSWRGRRVFMMDGTTLR 113 Query: 121 AP-----------------GGGSAEWRLHMGYDPHTCQF----TDFELTDSRDAER---L 156 L + ++ + L Sbjct: 114 LAPTDALRGAFTPASNQHGRSHWPVMHLVVAHELASGLAAPPQHGAMYGPGAVGAVQLGL 173 Query: 157 DRFA-QTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFL 215 + + DR F G D ++R+ + Sbjct: 174 RLMPDLPPGSVILGDRNF-GVFGLAHGAVAGGHDAVLRLTQSR------------FQALV 220 Query: 216 RGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQ 275 + +G + S + G P L ++ G + + Sbjct: 221 KKAQPAGDGRWALTWHPSVADRKGNP--------------------DLRADAVLTGWLHE 260 Query: 276 AETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELA 335 L T + + + YR R +E + ++ L LD R + + Sbjct: 261 VPIGGDQPLWLFATV----DGTGAEWGGLYRRRLDVETDIRDVRRTLALDQTRGRTVPMV 316 Query: 336 KAWI 339 + + Sbjct: 317 EKEL 320 >UniRef50_C6PFH6 Transposase IS4 family protein n=2 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PFH6_CLOTS Length = 398 Score = 67.5 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 60/203 (29%), Gaps = 25/203 (12%) Query: 150 SRDAERLDRFAQTADEIR-IADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMR 208 R E + + D + ++ I + + + +G+R Sbjct: 173 ERICEMVSMLPIPKGPAYGLCDSWYINKKVIEAHFE-RGYHLIGALKTNRIIY--PQGIR 229 Query: 209 FDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENR 268 + F + ++ + TV N I ++ Sbjct: 230 IQIKDFAQYIEKNEVHLVTVNGSNYW-------------VYRYEGALNGIDNAVVVLCWP 276 Query: 269 RKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALR 328 K A E A H + T+ E E + + Y RW IE+ F++ K+ L L+ + Sbjct: 277 EK-----AFKNENALHAFICTNT---ELDTETILNYYSQRWPIEIFFRQTKNNLGLNTYQ 328 Query: 329 AKEPELAKAWIFANLLAAFLIDD 351 + + ++ L Sbjct: 329 VRSTKSIDRLLWLISLTYMYCTT 351 >UniRef50_A6DG91 ISPg4, transposase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG91_9BACT Length = 223 Score = 67.5 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 54/189 (28%), Gaps = 34/189 (17%) Query: 1 MNYSHDNWSAI--LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR 58 M + N S + + + P ++ A+ G + R+ + ++ L +SL Sbjct: 1 MKPNKSNLSTLKQICQLIPPHIVNKLAKKHG--IKTRKFSSWSHVVSLLYTQLSHALSLN 58 Query: 59 EVTAWAQLHD----------VATLSDVALLKRLRNA--ADWFGILAAQTLAVRAAVTGCT 106 +V H + + R R+A A+ +L + G Sbjct: 59 DVCDGLHYHSSALFQIRGATAPKRNTFSNANRTRDAAMAEDLFWEVLNSLQSQLPSFGLD 118 Query: 107 SGKR---------LRLVDGTAISA---------PGGGSAEWRLHMGYDPHTCQFTDFELT 148 + VD T I A + HM + T + + Sbjct: 119 KQNSNFPKRFKRAVYAVDSTTIQLVAHCLNWAKHRRRKAAAKCHMQLNLQTFLPSYAIVK 178 Query: 149 DSRDAERLD 157 ++ + Sbjct: 179 EANTHDSQK 187 >UniRef50_C1XVD6 Transposase n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XVD6_9DEIN Length = 538 Score = 67.5 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 66/379 (17%), Positives = 113/379 (29%), Gaps = 55/379 (14%) Query: 11 ILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLR-EVTAWAQLHDV 69 I + + ELD + R+ A + + L S R V W Q Sbjct: 88 IFERLWREAELDKAFEAL-LEDRQLAFDVAEAVFTMVLNRLTDPCSKRGLVRQWLQGVYR 146 Query: 70 ATLSDV---ALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGS 126 + + L A+ + + A + + + D T+ G G Sbjct: 147 PQAEQLELHHYYRALDVLAEHKEAIEDRLFARARDL--FWTEVDVVFWDTTSSYFEGRGP 204 Query: 127 AE------------WRLHMGYDPH---TCQFTDFELTDSRDAERLDRFAQT--------- 162 R + E+ A++ Sbjct: 205 EGLAAYGYSRDKRPDRPQLVVGVLMTRDGYPIAHEVFPGDTADKATVETVLDALKRRFHL 264 Query: 163 ADEIRIADRGFGSRPECIRSLAFGEADYIV----RVHWRGLRWLTAEGMRFDMMGFLRGL 218 I +ADRG SR + +R++ +YIV R H L+ G + L+ Sbjct: 265 RRVIFVADRGMVSR-QILRAIEEAGMEYIVGMPLRRHRAAEAVLSQPGRYRKVNDQLQIK 323 Query: 219 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQA-- 276 G+ V+ N +A AR A++ ++ + + L NR R ++A Sbjct: 324 QVTHQGQRYVLCYNPL--QAEHDRQAREAALAHLKQRIERGQAKELLRNRLLARYLKALP 381 Query: 277 ------------ETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHL 324 G LL T+ E V Y+ W++E AF+ LKS L L Sbjct: 382 QGALVVDTDAVKRAARYDGKYLLRTNTD---LDPEAVVRAYKDLWRVERAFRTLKSALDL 438 Query: 325 DALRAKEPELAKAWIFANL 343 + + + Sbjct: 439 RPMFHWTERRVRGHVMVCF 457 >UniRef50_UPI000038E639 hypothetical protein Faci_04540 n=2 Tax=Ferroplasma acidarmanus fer1 RepID=UPI000038E639 Length = 453 Score = 67.5 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 69/227 (30%), Gaps = 30/227 (13%) Query: 127 AEWRLHMGYDPHTCQFTDFELTDSRDAE----RLDRFAQTADEIRIADRGFGSRPECIRS 182 R+ MG+ + L A+ R + + DRGF Sbjct: 184 PMIRIIMGFSRLRNEPCYIRLVPGSVADIDTLRKTEQEVHTGTLFVMDRGFID-DNNFGK 242 Query: 183 LAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPF 242 + +I + + + D + +S K Sbjct: 243 MDTNGLYFITPLKRDS-----------KLPDYSINGDNFFMFRKRAIRYSSTTIKNYDIH 291 Query: 243 PARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVA 302 I + E S + + E AG + L+T++ E + + Sbjct: 292 VFEDIMLRAMEENEYYSLVDSGKKP--------LYSPEKAGKIALITNVREK---PQSIF 340 Query: 303 DCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWI---FANLLAA 346 + Y+ R IE +F K+LL +D + + K ++ F +L+A Sbjct: 341 ELYKFRNDIEESFDVFKNLLQVDTPYLRGDDTLKGYVFVSFISLIAY 387 >UniRef50_Q3B5Q7 Transposase-like n=5 Tax=Chlorobium/Pelodictyon group RepID=Q3B5Q7_PELLD Length = 513 Score = 67.2 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 58/379 (15%), Positives = 109/379 (28%), Gaps = 48/379 (12%) Query: 12 LAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMS-LREVTAWAQLHDVA 70 L L AR G R L + + + L + + + + Sbjct: 89 LTRQFSRNTLFACARKCGLG---RLPELYLDLALMRIIEPTSKLRTLELLQRYFNVSYLK 145 Query: 71 TLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISA--------- 121 + L K L + + +T A++ A L L D T + Sbjct: 146 RTAYRTLSKILEHQEEI------ETAAIQTACNDLKENFCLVLYDVTTLYFESFKEHDFQ 199 Query: 122 -----PGGGSAEWRLH-------MGYDPHTCQFTDFELTDSRDAERLDRFAQTADE---I 166 + + G+ F + + + RF + E + Sbjct: 200 KPGFSKDNKPRQPHIVIGLITTRSGFPVMHEVFEGNTFEGNTMLDAVHRFQERVGETRPL 259 Query: 167 RIADRGFGSRPECIRSLAFGEADYIV--RVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNG 224 +AD S + L E YI R+ + ++ M Sbjct: 260 IVADASMLSTARM-QQLENKEYRYIAGARLADTSIGFIEQIHNELPRMDTASRRFSYAYA 318 Query: 225 ETTVMI-----GNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGR------V 273 E V + K A+ L + + L +++ KG+ Sbjct: 319 EQKVSVICEFSEARYKKDKREFDKQVERALKLLERNETGRRAKFLKKSKEKGKPFVFDTA 378 Query: 274 VQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPE 333 +QA+ + G +T++PE E +V YR W +E AF+ L + E Sbjct: 379 LQAKAEKLLGIKGYVTNIPEQEMPDAEVVANYRDLWHVEQAFRMNNLELKARPIFHHTKE 438 Query: 334 LAKAWIFANLLAAFLIDDI 352 ++ I +A + + Sbjct: 439 AIRSHILVCFMAMMMGKYL 457 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.121 0.289 Lambda K H 0.267 0.0373 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,579,881,406 Number of Sequences: 3077464 Number of extensions: 52031742 Number of successful extensions: 158548 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 741 Number of HSP's successfully gapped in prelim test: 399 Number of HSP's that attempted gapping in prelim test: 156833 Number of HSP's gapped (non-prelim): 1301 length of query: 370 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 240 effective length of database: 640,326,036 effective search space: 153678248640 effective search space used: 153678248640 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 93 (40.6 bits)