BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (299 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P30192 Putative uncharacterized protein ychG n=8 Tax=En... 617 e-175 UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=... 263 4e-69 UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholde... 177 5e-43 UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria R... 172 2e-41 UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholde... 167 3e-40 UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangiu... 164 2e-39 UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=V... 149 2e-34 UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria R... 145 1e-33 UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammapro... 145 2e-33 UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanel... 137 3e-31 UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio ... 128 2e-28 UniRef50_P03835 Transposase insG for insertion sequence element ... 126 7e-28 UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_A... 124 3e-27 UniRef50_UPI00016A835E hypothetical protein BoklC_27358 n=1 Tax=... 120 7e-26 UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobac... 118 3e-25 UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_A... 116 1e-24 UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocac... 91 3e-17 UniRef50_UPI000190F8A2 hypothetical protein SentesTyp_33971 n=1 ... 90 1e-16 UniRef50_D0SX83 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 87 7e-16 UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Coryneb... 82 2e-14 UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithio... 81 5e-14 UniRef50_Q8X2N0 Putative uncharacterized protein ECs5267 n=1 Tax... 80 6e-14 UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkhold... 77 8e-13 UniRef50_A1JS05 Transposase for insertion sequence element IS166... 73 2e-11 UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Ra... 70 1e-10 UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC 68 3e-10 UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomyc... 67 7e-10 UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia s... 66 2e-09 UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminoc... 65 2e-09 UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia R... 64 8e-09 UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=St... 62 3e-08 UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=... 58 5e-07 UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 ... 57 8e-07 UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 ... 57 1e-06 UniRef50_A3YGY3 Transposase and inactivated derivative n=1 Tax=M... 55 4e-06 UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Ta... 54 5e-06 UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomyc... 52 3e-05 UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitroco... 49 3e-04 UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces ... 48 4e-04 UniRef50_UPI00016C385B hypothetical protein GobsU_16554 n=3 Tax=... 47 8e-04 UniRef50_Q2J8F5 Putative uncharacterized protein n=3 Tax=Frankia... 46 0.001 UniRef50_Q93UU3 Orf n=3 Tax=Escherichia coli RepID=Q93UU3_ECO57 46 0.002 UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria R... 44 0.005 UniRef50_UPI00016C3BAD transposase, IS4 n=2 Tax=Gemmata obscurig... 44 0.005 UniRef50_UPI00016C48B0 transposase, IS4 family protein n=1 Tax=G... 44 0.007 UniRef50_B2Q345 Putative uncharacterized protein n=1 Tax=Provide... 43 0.014 UniRef50_B8CMP8 Transposase OrfA, putative n=1 Tax=Shewanella pi... 42 0.019 UniRef50_D2ASB5 Transposase, IS4 family n=1 Tax=Streptosporangiu... 42 0.019 UniRef50_Q3SHG4 Putative uncharacterized protein n=1 Tax=Thiobac... 42 0.025 UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultu... 42 0.029 >UniRef50_P30192 Putative uncharacterized protein ychG n=8 Tax=Enterobacteriaceae RepID=YCHG_ECOLI Length = 299 Score = 617 bits (1592), Expect = e-175, Method: Compositional matrix adjust. Identities = 299/299 (100%), Positives = 299/299 (100%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV Sbjct: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 Query: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK Sbjct: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 Query: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA Sbjct: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 Query: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS Sbjct: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 Query: 241 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK Sbjct: 241 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 >UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=Gammaproteobacteria RepID=Q7MLW1_VIBVY Length = 445 Score = 263 bits (673), Expect = 4e-69, Method: Compositional matrix adjust. Identities = 129/251 (51%), Positives = 173/251 (68%), Gaps = 9/251 (3%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQL--FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWM 58 M + N DF + P AQL F+EH+P EW+ TLS AT+RRRRLP DMV+W+ Sbjct: 1 MSIQNYFADFLEES--PVDVAQLTTFSEHIPDEWVAKAATLSDKATIRRRRLPSDMVLWL 58 Query: 59 VV-----QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDR 113 +V +NE I +V RR+N+ A+G A LLA+SA+TQARQR+G A EWLFRQ + Sbjct: 59 IVGMAFFRNESIAEVARRMNVCAEGLADEELLAKSALTQARQRLGKAAPEWLFRQCSHTW 118 Query: 114 GAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLG 173 G ERY +D W GLQ+FAIDGA FRT D ELRE++GS NTS++RQ +PV+R+V +MN+ Sbjct: 119 GLERYPEDTWQGLQVFAIDGALFRTADTSELREHFGSGNTSSERQTPHPVLRVVTMMNVR 178 Query: 174 SHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLP 233 SH++++A +PYR+ E LA + ++PDNS+TL DK FY DLLL+L G NRHWLLP Sbjct: 179 SHVIVDAAISPYRRGEIPLAMPFIDSLPDNSVTLLDKGFYGADLLLSLQNSGSNRHWLLP 238 Query: 234 AWKNIASEMIE 244 A K + +++ Sbjct: 239 AKKGVKFRLLD 249 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 39/66 (59%), Positives = 49/66 (74%) Query: 234 AWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKH 293 A + IAS++ + SPG PKRL+ LRG L ++FI KRP+P+RPR+VKISKTRYPV Sbjct: 380 ACQFIASQLKVMSKAVSPGNTPKRLKSLRGDLSILFIDKRPKPNRPRAVKISKTRYPVNR 439 Query: 294 SAAPLK 299 AAPLK Sbjct: 440 KAAPLK 445 >UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1T817_9BURK Length = 448 Score = 177 bits (448), Expect = 5e-43, Method: Compositional matrix adjust. Identities = 91/235 (38%), Positives = 141/235 (60%), Gaps = 6/235 (2%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR 71 PP +HLP EWI++ + S A+VRRRRLP V+W+V+ +++ I++VV Sbjct: 20 PPLEWGRLGQHLPYEWIEYAVQASGSASVRRRRLPAQQVVWLVIALALYRHQSISEVVDE 79 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 L+L+ A + +++SA+ QARQR+GAAP+ WLF ++A + A+ K + G LFA+ Sbjct: 80 LDLALPA-ADASFVSKSAIAQARQRIGAAPLAWLFHESAANWVAQDQAKHLFKGFSLFAM 138 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 DG RT D R ++G++ + R +YP +R V L L +H++ +AV PY +E + Sbjct: 139 DGTTLRTADSAANRRHFGASAAAHGRIGSYPQLRAVTLTALATHLVRDAVFGPYDINEMI 198 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELG 246 A ++A +P NSIT+FDK F S LL L G NRH+++PA N E++ G Sbjct: 199 WARELIARVPANSITVFDKGFLSAQLLCNLVSGGENRHFIIPAKANTCWEVVSGG 253 >UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria RepID=B9BXQ1_9BURK Length = 446 Score = 172 bits (435), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 90/231 (38%), Positives = 139/231 (60%), Gaps = 7/231 (3%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P AEHLP WI+ + + A++RRRRLP + V+W+V+ ++ +++VV L Sbjct: 20 PTDLSRLAEHLPHAWIEQAIEATGTASIRRRRLPAEQVVWLVIALAIYRHWSVSEVVDSL 79 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 L E +++SAVTQARQR+G AP+ WLF QTAQ + + + GL L+A+D Sbjct: 80 ELVLPNET--TFVSKSAVTQARQRLGHAPIAWLFEQTAQAWCKQDGARHAFKGLSLWAMD 137 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G RTPD RE++GS + ++ + +YP MR V L ++ +H++ N Y +E + Sbjct: 138 GTTLRTPDSAANREHFGSQSYASGKVASYPQMRAVTLTSIPTHLVANIAFGRYDTNEMIY 197 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 A ++LA IPD+S+TLFDK F + ++L LN NRH+L+PA N E++ Sbjct: 198 AKNLLAQIPDHSLTLFDKGFLAAEILCGLNSGERNRHFLIPAKSNTRWEVL 248 >UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholderia RepID=B2JV26_BURP8 Length = 442 Score = 167 bits (424), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 91/248 (36%), Positives = 143/248 (57%), Gaps = 8/248 (3%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P AEHLP EWI+ + + A++RRRRLP + V+W+V+ ++ I++V+ L Sbjct: 15 PADLSRLAEHLPYEWIERAVQATGAASIRRRRLPAEQVVWLVIALAMYRHWSISEVLDSL 74 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 +L+ EA +++SAV QARQR+G AP+ WLF QTA+ + + GL L+A+D Sbjct: 75 DLALPNEAA-PFVSKSAVVQARQRIGEAPMAWLFEQTARAWTTQDAAHHAFKGLSLWAMD 133 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G RTPD RE++G+ ++ + +YP +R V L + +H++ + Y +E V Sbjct: 134 GTTLRTPDSAANREHFGAQGYASGKVASYPQVRAVTLTAIPTHLVADINFGCYDTNEMVY 193 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 A S+L IPD+S+T+FDK F + ++L L G NRH+L+PA N E+I TA Sbjct: 194 AKSLLPQIPDDSLTVFDKGFLAAEILCGLTMNGRNRHFLIPAKSNTCWEVI--AGTADDA 251 Query: 253 TIPKRLEH 260 + R+ Sbjct: 252 MVRMRVSQ 259 >UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LI35_HALO1 Length = 449 Score = 164 bits (416), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 91/235 (38%), Positives = 135/235 (57%), Gaps = 9/235 (3%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR 71 PP A + EWI+ L + AT+RRRRLP + ++W+V+ ++ PIT+VV Sbjct: 15 PPEEFSRLARDVAPEWIEQALEATGTATLRRRRLPMEQLVWLVIGMALFRDRPITEVVTS 74 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDD-WHGLQLFA 130 L+L A G +A SAV QAR R+G +P+ WLF +A DR A + DD W GL L+ Sbjct: 75 LDL-ALPSPGHPEVAPSAVAQARDRLGESPMAWLFAHSA-DRWAHQSAADDRWRGLALYG 132 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR-QSE 189 +DG R PD E R+++G AN + + YPV+RL ALM L SH+L PY+ E Sbjct: 133 VDGTTLRVPDSEENRDHFGLANGGARGSSGYPVVRLAALMALRSHLLAAVSFGPYQGHGE 192 Query: 190 TVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 A + +PDNS+ + D+ +++ ++L+ L Q G NRHWL+ K + ++E Sbjct: 193 YWYAADLWPCLPDNSLVIVDRHYWAANVLIPLQQDGLNRHWLIRGRKGLNYRVVE 247 >UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=Vibrio vulnificus RepID=Q7MGY3_VIBVY Length = 441 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 90/237 (37%), Positives = 138/237 (58%), Gaps = 12/237 (5%) Query: 18 PPSAQL--FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVR 70 P + QL ++ L ++I CL S AT+R+RR+P DM +W VV + EP+ +V Sbjct: 15 PNTEQLGKLSDILCPDFINQCLDASGVATIRKRRIPLDMAVWAVVAMSLYRQEPLWSIVS 74 Query: 71 RLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 + L G+ +L+A SA+ QARQR+GA ++ +F Q+ Q E W GL+L A Sbjct: 75 KAQLMLPGKR--SLVAPSAIVQARQRLGADAMKEVFHQS-QSLWNETADHPTWCGLKLLA 131 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +DG +RTPD E R+ + SA+ + ++P +R+V M L SH+L+ + A Y+ +E Sbjct: 132 VDGVVWRTPDTKENRDAFQSASNQNG-EGSFPQVRMVCQMELTSHMLVASAFASYKTNEM 190 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA-SEMIELG 246 +LA ++ T PD S+T+FD+ FYS LL G RHWL+P KN +E+ +LG Sbjct: 191 ILAEQLIETTPDYSLTMFDRGFYSLSLLHRWANTGNERHWLMPMRKNTQFTEVRKLG 247 >UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria RepID=D2TH14_CITRO Length = 438 Score = 145 bits (367), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 81/232 (34%), Positives = 125/232 (53%), Gaps = 8/232 (3%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P S F +P EWI L + A++R+R+LP ++V+W++V ++ ITDVV +L Sbjct: 15 PASLSCFQRAIPLEWISQVLDSTNKASIRKRKLPAELVVWLIVGMGLYRDRSITDVVTKL 74 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 +L + G LA S+V +ARQR+ P+ LF TA + D W+GL+LFA+D Sbjct: 75 DLVLSSQEG-ETLAASSVARARQRLSDEPLRELFTLTASHWTQQEDKDDLWYGLRLFAVD 133 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G FRTPD PEL E++ R YP++RL A+M+L S ++ P E Sbjct: 134 GTLFRTPDTPELAEHFEYIKHRPDRHTEYPMVRLCAMMSLRSRLIHGVKFGPANTGEVSY 193 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 A + + S+TLFD+ + S +LL+ ++ HWL+P N ++E Sbjct: 194 AKQL--SPQAKSLTLFDRCYLSAELLINWQRRQQEAHWLVPLKGNTKYRIVE 243 >UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammaproteobacteria RepID=C6CF98_DICZE Length = 441 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 77/226 (34%), Positives = 119/226 (52%), Gaps = 8/226 (3%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADG 78 F+ L WI L A++RRRRLP + +W+V+ ++ I DV L++ Sbjct: 20 FSRSLDPAWIHQALNACHKASIRRRRLPAEQAVWLVLMMGLLRDLSIKDVCHHLDIVLQP 79 Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRT 138 + G LA S +T ARQR+G AP+ +LF + D +HGL + ++DG FRT Sbjct: 80 DEGYQPLAPSVLTAARQRLGEAPLRYLFHACNEGWLPTVLGSDTFHGLHVLSVDGTLFRT 139 Query: 139 PDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLA 198 PD P+ +G + +P +R+V LM SH+LL+A + E LAH +++ Sbjct: 140 PDSPDNAAAFGFIDPV---HGTFPQVRMVGLMATHSHMLLDAAFGGVAEGELTLAHRLVS 196 Query: 199 TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 + PD+S+TLFD+ ++S LL Q G HWL P + + +IE Sbjct: 197 SAPDHSLTLFDRCYFSASFLLEWRQAGVETHWLTPVKRKLRYRVIE 242 >UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanella RepID=A6WTA0_SHEB8 Length = 446 Score = 137 bits (346), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 84/226 (37%), Positives = 131/226 (57%), Gaps = 11/226 (4%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADG 78 A+ L E IQ CL AT+RRR+LP D +IW V+ + E + ++ +L++ Sbjct: 27 LADVLEPELIQSCLDSQGVATLRRRKLPMDAMIWAVIGMALFRGESVRSLINKLDIVLPQ 86 Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRT 138 E ++ +ARSAVTQAR+R+G+ + +F ++A A R W GL L+ +DG +RT Sbjct: 87 E--IDYVARSAVTQARKRLGSEVIREVFSRSANTWHA-RAEHPHWCGLNLYGVDGVVWRT 143 Query: 139 PDKPELREYYG-SANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 PD + + + +AN S + AYP +R+V LM L SH+L+N+ ++E LA ++ Sbjct: 144 PDSVQNQAAFARTANASG--EAAYPQIRMVCLMELSSHLLVNSAFDSVAENEMNLASQLI 201 Query: 198 ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 +IP++S+TLFD+ FYS LL Q + HWLLP K E++ Sbjct: 202 PSIPNHSLTLFDRGFYSLGLLHAWQQAQPDSHWLLPLKKGTQYEVV 247 >UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio RepID=B2LS82_9VIBR Length = 440 Score = 128 bits (322), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 70/230 (30%), Positives = 128/230 (55%), Gaps = 16/230 (6%) Query: 23 LFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLS-- 75 +F +H+P EW++ + + ++R+RRLP + +W+V+ +N I DV +L L+ Sbjct: 22 VFNKHIPWEWVEEAVQQTGRVSLRKRRLPAEQAVWLVLGIGLQRNRSIQDVCDKLELAFP 81 Query: 76 -ADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGA 134 DGE + +A S++ + ++R+G P+ +LF+ TAQ + D+ GL+L ++DG Sbjct: 82 DVDGE--LTPMATSSIIKGKERLGDKPMRYLFKTTAQQWEQQSDF-DEVCGLKLLSVDGT 138 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 F+T + E +++G A ++ ++P + V LM+ SH++ +A P SE A Sbjct: 139 YFKTHNTEE-NQHFGFA----QKGASFPSVLAVTLMSTRSHLVSDAAFGPVTNSEISYAQ 193 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 ++ + PD+S+TLFD+ F S +L + N HWL P + ++IE Sbjct: 194 QLVGSAPDDSLTLFDRGFTSAELFTSWQGASSNSHWLTPIKTKMRYDIIE 243 Score = 42.4 bits (98), Expect = 0.021, Method: Compositional matrix adjust. Identities = 25/58 (43%), Positives = 31/58 (53%), Gaps = 1/58 (1%) Query: 238 IASEMIELGNTASPGTIPKRLEHLR-GALEVVFITKRPRPSRPRSVKISKTRYPVKHS 294 I EMI +T SPG IPK L+ LR ++ KR R PR+V RYP KH+ Sbjct: 380 IMDEMIWASDTRSPGAIPKNLKALRDNGKRLILPKKRKRKPYPRAVLKKPARYPNKHA 437 >UniRef50_P03835 Transposase insG for insertion sequence element IS4 n=377 Tax=root RepID=INSG_ECOLI Length = 442 Score = 126 bits (317), Expect = 7e-28, Method: Compositional matrix adjust. Identities = 82/225 (36%), Positives = 116/225 (51%), Gaps = 9/225 (4%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADG 78 ++L E I CL S T+R+RRLP +M++W +V + EP+ +V RL++ G Sbjct: 23 LGDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPG 82 Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRT 138 +A SAV QARQR+G+ V +F +TAQ W GL L AIDG +RT Sbjct: 83 NR--PFVAPSAVIQARQRLGSEAVRRVFTKTAQ-LWHNATPHPHWCGLTLLAIDGVFWRT 139 Query: 139 PDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLA 198 PD PE + T YP +++V M L SH+L A + SE LA ++ Sbjct: 140 PDTPENDAAF-PRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIE 198 Query: 199 TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 DN++TL DK +YS LL + G +RHW++P K E I Sbjct: 199 QTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEI 243 >UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_ALISL Length = 441 Score = 124 bits (312), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 74/235 (31%), Positives = 124/235 (52%), Gaps = 21/235 (8%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P + + A+ LP I +L+ T+R+R+L + ++W++V N+ + D+V +L Sbjct: 21 PSNVETLADLLPIHLIDEAYSLTDTVTMRKRKLTLESMVWLLVGMAIYNNKSMKDLVNQL 80 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKD----DWHGLQL 128 ++ G +A SA+TQ R+ +G A ++ +F +R +LK W+GL L Sbjct: 81 DIV--DRTGKAFVAPSALTQRRKNLGEAAMKAVF-----ERMTSSWLKSANLPKWNGLTL 133 Query: 129 FAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQS 188 +DG +R PD + E + S ++ YP +R+V M L SH++ + Y + Sbjct: 134 LGVDGVVWRAPDNQKNEEAF-----SRQKGTQYPQVRMVCQMELSSHLITASAFDNYNTN 188 Query: 189 ETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 E +LA ++ + PD+S+T+FDK FYS LL G RHWL+P KN E+I Sbjct: 189 EMILAEKLIDSTPDHSVTMFDKGFYSLGLLHKWQMTGSERHWLIPLKKNTQYEII 243 >UniRef50_UPI00016A835E hypothetical protein BoklC_27358 n=1 Tax=Burkholderia oklahomensis C6786 RepID=UPI00016A835E Length = 231 Score = 120 bits (300), Expect = 7e-26, Method: Compositional matrix adjust. Identities = 70/231 (30%), Positives = 116/231 (50%), Gaps = 46/231 (19%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR 71 PP + +HLP EWI+H + S A+VRRRRLP V+W+V+ +++ I++VV Sbjct: 20 PPLELERLGQHLPYEWIEHAVQASGSASVRRRRLPAQQVVWLVIALALYRHQSISEVVDE 79 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 L+L+ + +++SA+ QA+QR GA+P+ WLF ++A+ +W G + Sbjct: 80 LDLALPAP-DTSFVSKSAIAQAKQRTGASPLAWLFHESAR----------NWVGQDI--- 125 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 +YP + V L + + ++ +A PY +E + Sbjct: 126 ---------------------------GSYPQLHAVTLTAIATRLVRDAGFGPYDINEMI 158 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEM 242 A ++ +P+N IT+FDK F S LL L G NRH+++PA N E+ Sbjct: 159 WARELIPRVPENPITVFDKGFLSAQLLCNLVAGGQNRHFIIPARSNPRGEI 209 >UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobacteria RepID=C5T3Q2_ACIDE Length = 436 Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 74/218 (33%), Positives = 116/218 (53%), Gaps = 13/218 (5%) Query: 32 WIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADGEAGMNLLA 86 WI L + A++RRR+LP + +W+V+ ++ P+ VV+ + L+ DG+ L A Sbjct: 31 WIAQALQATGKASMRRRKLPAEHAVWLVIGLALFRHMPLWQVVQEMALTLDGQ---ELPA 87 Query: 87 RSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELRE 146 S Q RQR+GA P+E +F A G + L++ A+DG + PD + R+ Sbjct: 88 PSVSVQVRQRLGAEPMEHMFGLLANAWGRAHAVHAG--ALRVLAVDGVAWSAPDSKDNRQ 145 Query: 147 YYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSIT 206 GS T Q +P++R V L++ SH LL+A Y E LA + D+SIT Sbjct: 146 ELGSGQTQYGPQ-PWPMVRAVCLLDTDSHELLDAQLGDYGCGELTLAADLHGL--DHSIT 202 Query: 207 LFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 LFD+ ++S LL +Q G RHWL+ A N+ E+++ Sbjct: 203 LFDRAYFSAAFLLAWSQAGQQRHWLMRAKDNLRYEVVQ 240 >UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_ACIJO Length = 443 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 80/233 (34%), Positives = 123/233 (52%), Gaps = 17/233 (7%) Query: 19 PSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLN 73 PS F+E + WI+ CL + A+VR+R+LP + +W+V+ +++PI VV++L Sbjct: 26 PSLSNFSELIDLNWIEDCLKRTGKASVRKRKLPAEHAVWLVIGLALFRDQPIWYVVQQLQ 85 Query: 74 LSADGEAGMNLLARSAVTQARQRVGAAPVEWLFR---QTAQDRGAERYLKDDWHGLQLFA 130 L G A A SA QARQR+G P+ LF QT + +Y +HGL + A Sbjct: 86 L-VFGTA--ESCAPSASVQARQRLGLEPLNVLFNTLSQTWFEDSQPQY--SAFHGLSICA 140 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +DGA + P E ++GS+ T +P R V L+N +H +++A Q E Sbjct: 141 VDGAVWSMPHTDENFRHFGSSKGKTI-AAPWPQARAVCLINTNTHEVIDAGIGSMDQGEL 199 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 LA + +P NS+TLFD+ ++S D L + N HWL+ A N+ E+I Sbjct: 200 TLAKKL--KVPANSLTLFDRAYFSADFLSGWQSRE-NCHWLMRAKDNLRYEII 249 >UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocaceae RepID=B2J1G3_NOSP7 Length = 381 Score = 91.3 bits (225), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 64/217 (29%), Positives = 110/217 (50%), Gaps = 13/217 (5%) Query: 46 RRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADGEAGMNL------LARSAVTQAR 94 R+R LP +V+ +V+ + + DV++ L + EA + + +SA+TQAR Sbjct: 14 RKRSLPAQLVVSLVIAMSLWSKDSMRDVLKNL-IDGLSEAWLKVGKYWRVACKSAITQAR 72 Query: 95 QRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTS 154 QR+GA + LF Q + + L L++ IDG+ F PD E +G + Sbjct: 73 QRLGARVMCKLFHQLVKPMATQETLGAFLQELRIVVIDGSCFDVPDSDENARVFGRPGSR 132 Query: 155 TKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYS 214 + A+P +RLV L+ G+HI+ +A+ PYR E V A +L ++ + ++D+ +S Sbjct: 133 PGTKAAFPKVRLVILVEAGTHIIFDALMWPYRIGERVRALRLLRSVTPGMLLMWDRGLHS 192 Query: 215 EDLLLTLNQKGCNRHWLLPA-WKNIASEMIELGNTAS 250 ++ KGC+ +PA K IA + +E G+ S Sbjct: 193 YAMVQATVTKGCDYLGRIPANIKFIAEKPLEDGSYLS 229 >UniRef50_UPI000190F8A2 hypothetical protein SentesTyp_33971 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190F8A2 Length = 85 Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 41/46 (89%), Positives = 42/46 (91%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVR 46 M LLNDLLDFSDHPLMPPPSAQ+FAEHLP E IQHCLTLS HATVR Sbjct: 1 MSLLNDLLDFSDHPLMPPPSAQMFAEHLPAECIQHCLTLSKHATVR 46 >UniRef50_D0SX83 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0SX83_ACILW Length = 140 Score = 87.0 bits (214), Expect = 7e-16, Method: Compositional matrix adjust. Identities = 48/123 (39%), Positives = 68/123 (55%), Gaps = 7/123 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 M D+LD ++ L + F ++P EW++ L LS+ AT+RRR LP D V+W+V+ Sbjct: 10 MIFQQDILDLNN--LFKLSNLSTFIHNIPVEWVKSTLRLSSPATIRRRCLPADQVLWLVL 67 Query: 61 QNEPITDVV-----RRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA 115 DV+ RRLN+ A +LL R ++T R+ +GA VEWLF QT Q G Sbjct: 68 GMAIFRDVLIHEAARRLNICTQWLASYDLLTRISLTNTRKHLGADSVEWLFHQTDQHWGQ 127 Query: 116 ERY 118 E Y Sbjct: 128 EHY 130 >UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Corynebacterineae RepID=A4T2G5_MYCGI Length = 401 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 57/236 (24%), Positives = 101/236 (42%), Gaps = 9/236 (3%) Query: 11 SDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPI 65 SD L S + P + + + VR R LP ++ + + + Sbjct: 11 SDRRLSDLVSVGVLTRVFPPAMVDEVIEATGRTQVRHRALPARVMAYFAIGMGLYSDGSY 70 Query: 66 TDVVRRLNLSADGEAG----MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKD 121 DV+ +L +G L +SA+ QAR+R+G+ P+ LF + A+ GA Sbjct: 71 EDVLSQLTDGLAWASGWREQYQLPGKSAIFQARERLGSQPLAALFARVARPLGAADTPGT 130 Query: 122 DWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV 181 G ++ AIDG D P E++G + ++A+P RL+A+ G+H + A Sbjct: 131 WVAGRRVVAIDGTCLDVADNPVNEEFFGRPGVNKGEKSAFPQARLLAVAECGTHAIFAAT 190 Query: 182 TAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN 237 YR +E+ + +L + + L D+ F+S L + G + W + +N Sbjct: 191 IGAYRDAESTMVEHVLDALTPEMLVLADRGFFSYALWRNASDTGADLLWRVSTGRN 246 >UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EK95_ACIF5 Length = 369 Score = 80.9 bits (198), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 53/163 (32%), Positives = 82/163 (50%), Gaps = 3/163 (1%) Query: 54 MVIWMVVQNEPITD-VVRRLN-LSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQ 111 MV++ V E + VV L L D ++++ A++QAR +VGAAP++ L++ Q Sbjct: 28 MVLYASVAYEEVLQLVVDGLRPLLGDDRLAQTVVSKGAISQARAKVGAAPLKTLYQNQVQ 87 Query: 112 DRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMN 171 G + GL+L AIDG+ PD+ E +G S++ A+P +R VA+ Sbjct: 88 PHGPLGMAGVGYKGLRLMAIDGSTLDMPDEAANAERFGYP-ASSRGSAAFPQLRFVAMAE 146 Query: 172 LGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYS 214 G+H L A Y QSE LA ++A + D+ FYS Sbjct: 147 CGTHTLCYAEMGSYEQSERTLAGPVMAHADATMLITADRNFYS 189 >UniRef50_Q8X2N0 Putative uncharacterized protein ECs5267 n=1 Tax=Escherichia coli O157:H7 RepID=Q8X2N0_ECO57 Length = 80 Score = 80.5 bits (197), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 46/79 (58%), Positives = 49/79 (62%), Gaps = 24/79 (30%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLS 75 MPPPS QLFAEHLPTEWIQH LTLSAHAT VRRLNLS Sbjct: 1 MPPPSTQLFAEHLPTEWIQHFLTLSAHAT------------------------VRRLNLS 36 Query: 76 ADGEAGMNLLARSAVTQAR 94 DGEAGMNLLA +A++ R Sbjct: 37 VDGEAGMNLLAPAALSPRR 55 >UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkholderiaceae RepID=A4JGL4_BURVG Length = 402 Score = 77.0 bits (188), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 60/221 (27%), Positives = 102/221 (46%), Gaps = 11/221 (4%) Query: 20 SAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVR---- 70 SA + A P I+ L + A+ R R LP V++ V+ + P+ +V+R Sbjct: 22 SAGVLASVCPRTLIEEVLAETGKASQRERLLPAPAVVYYVMALALWREAPLEEVLRVVCE 81 Query: 71 RLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 L G ++SA++QAR R+G + L + + A + GL++ A Sbjct: 82 GLQWLGGGHTEAVQASKSAISQARSRLGPEVMRQLADRVLRPLAAPGAPGAWYRGLRVMA 141 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +DG+ D+ +++G S + Q+A+P R++ L+ G+H ++ A APY SE Sbjct: 142 LDGSCMDVADEAANAKFFGYPGAS-RGQSAFPQARVLGLVECGTHAVVAAGIAPYGHSEQ 200 Query: 191 VLAHSML-ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW 230 V+A +L A + + L D+ FY L T G W Sbjct: 201 VMAAQLLPAKLTPEMLVLADRNFYGFKLWQTACATGAKLAW 241 >UniRef50_A1JS05 Transposase for insertion sequence element IS1665 n=4 Tax=Yersinia enterocolitica subsp. enterocolitica 8081 RepID=A1JS05_YERE8 Length = 261 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 46/126 (36%), Positives = 71/126 (56%), Gaps = 10/126 (7%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADG 78 ++L + I CL S T+R+RRLP +M++W +V + EP+ +V RL++ G Sbjct: 23 LGDYLYPQLISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPG 82 Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQD-RGAERYLKDDWHGLQLFAIDGAQFR 137 + +A SAV QARQR+G+ V +F QTAQ G+ + W GL L A+DG ++ Sbjct: 83 DR--PFVAPSAVIQARQRLGSEAVRRVFSQTAQLWHGSVTH--PHWCGLTLLAVDGVVWQ 138 Query: 138 TPDKPE 143 T + E Sbjct: 139 TDNATE 144 >UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Raphidiopsis brookii D9 RepID=UPI0001C16028 Length = 465 Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 55/213 (25%), Positives = 105/213 (49%), Gaps = 11/213 (5%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL--NLSA 76 ++ +P++ I + + + R R LP +++ +V+ ++ I DV + L LS+ Sbjct: 22 LSQVIPSQTITKAIESTCSSQRRLRILPTYIIVTLVIAMSFWSSDSIVDVFKNLIHGLSS 81 Query: 77 -DGEAGMNLL--ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDG 133 +G+ L + S++T+ARQR GAA + LF A+ L++ A+DG Sbjct: 82 LHIPSGLRLQTPSASSITEARQRTGAAVMRRLFELVAKPLATILTPGAFLGELRIMAVDG 141 Query: 134 AQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLA 193 F PD +G + +P +RLV L+ G+H++++A PYR E A Sbjct: 142 TVFDVPDTSTNARVFGYPGSPKGTYPGFPKVRLVFLVEAGTHLIIDAFCYPYRMGERRGA 201 Query: 194 HSMLATIPDNSITLFDKLFYSEDLLLT-LNQKG 225 +L +I + + ++D+ +S ++ T + Q+G Sbjct: 202 LKLLRSINSSMLLMWDRGLHSFKMVHTVIKQQG 234 >UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC Length = 412 Score = 68.2 bits (165), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 63/223 (28%), Positives = 93/223 (41%), Gaps = 23/223 (10%) Query: 29 PTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-------QNEPITDVVRRLN-------- 73 P E + L ++ A VRRR LP +V++ V+ +N V+ RL Sbjct: 29 PPELVDRVLAVTDTAEVRRRLLPSWLVVYFVLALWLFRGRNCGYVQVLARLTSGLHFQRR 88 Query: 74 -----LSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQL 128 G AG +L A ++ +AR R+G+ PV LF A G E HGL+L Sbjct: 89 AAVLAAGGAGGAGWSLPASPSLGEARARIGSDPVRMLFEHAAGPVGVEGQAGVFLHGLRL 148 Query: 129 FAIDGAQFRTPDKPELREYY-GSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ 187 IDG+ PD R ++ G +N +P +R V + LL A P+ Sbjct: 149 VQIDGSTCDLPDTQANRAFFPGPSNAGGP--APFPKVRWVIAAEAATGALLGASFGPWST 206 Query: 188 SETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW 230 E LA +L + +TL D+ F S L + G + W Sbjct: 207 GEPALARDLLGQLGPGMLTLADRNFLSHRLAGEVLATGAHLLW 249 >UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomycetales RepID=A8M893_SALAI Length = 451 Score = 67.0 bits (162), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 54/205 (26%), Positives = 86/205 (41%), Gaps = 7/205 (3%) Query: 28 LPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLL-- 85 +P E I L + R R LP +V+++++ D R + A AG+ L Sbjct: 29 VPFEMIDDVLAATRRTQRRVRLLPARVVVYLLLAGCLFADCGYR-QVWAKLVAGLRGLPV 87 Query: 86 ---ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 + SA+ QARQR+G AP+ LF W GL +DG D P Sbjct: 88 ADPSDSALRQARQRLGPAPLRALFDLLRGPAATSAVAAVRWRGLLPVVVDGTMIAVADSP 147 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD 202 YG + + YP +RL AL+ G+ +++AV P E AH + ++ Sbjct: 148 ANLGRYGKHRCNNG-GSGYPTLRLSALLTCGTRSVIDAVFDPSTTGEITQAHRLTRSLRA 206 Query: 203 NSITLFDKLFYSEDLLLTLNQKGCN 227 + L D+ + + DL+ G + Sbjct: 207 GMLLLADRNYAAADLIGAFTATGAD 231 >UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia sp. EAN1pec RepID=A8L1S1_FRASN Length = 425 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 47/152 (30%), Positives = 69/152 (45%), Gaps = 3/152 (1%) Query: 88 SAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDW-HGLQLFAIDGAQFRTPDKPELRE 146 S +TQAR+R+G + +F + A + A + W G L AIDG PD E Sbjct: 104 SGITQARKRLGRMVMAEVFERVAG-QVATLSTRGAWLRGRLLLAIDGFDVDVPDTEENAA 162 Query: 147 YYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSIT 206 +G A T KR +A+P +R+VAL G+H A + E LA +L + + + Sbjct: 163 EFGYAGTGEKR-SAFPKIRVVALAECGTHAFRAAEVGGWAAGERTLARGLLMRLNRDEVL 221 Query: 207 LFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 D+ FYS D G + W P N+ Sbjct: 222 TADRGFYSFDNWALAAGTGADLIWRAPTGLNL 253 >UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminococcus torques ATCC 27756 RepID=A5KKC4_9FIRM Length = 422 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 41/144 (28%), Positives = 73/144 (50%), Gaps = 4/144 (2%) Query: 84 LLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPE 143 +++ A+++ARQ + LFR + + + W+G ++A+DG+ + P+ E Sbjct: 78 FVSKQAISKARQGISHKAFLELFRLSVKQFYFQPVNLRTWNGFHIYAVDGSTIQIPESKE 137 Query: 144 LREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD- 202 E +G TK + P+ L ++ + IL++ PYR +E A + + +P Sbjct: 138 NYEVFGGNPNKTKIIS--PLASASVLYDVINDILIDVSLHPYRYNERESAKAHVDFLPRF 195 Query: 203 -NSITLFDKLFYSEDLLLTLNQKG 225 NSI LFD+ + SED+ LN KG Sbjct: 196 PNSIILFDRGYPSEDMFHYLNSKG 219 >UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LF21_FRASN Length = 420 Score = 63.5 bits (153), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 55/221 (24%), Positives = 94/221 (42%), Gaps = 12/221 (5%) Query: 20 SAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQ-----NEPITDVVRRLNL 74 S + A +P + + L + R+R LP +V++ + ++ +V+RRL Sbjct: 23 SLGVLARIVPRDLVDEVLAETRRLEQRKRLLPARVVVYFTMAMCLFFDDDYDEVMRRLVG 82 Query: 75 S----ADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQ-LF 129 + + + + A++QAR R+G P++ LF + A A K W G + L Sbjct: 83 TLRWLGSWKGDWKVPSTGAISQARTRLGPEPLKLLFERVAVPV-AGLGTKGAWLGSRRLV 141 Query: 130 AIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSE 189 A+DG T D PE + +G + K A+P + +VAL G+H + A Y E Sbjct: 142 AVDGVHLDTADTPENADAFGRFSHGPK-TAAFPQVHVVALAECGTHAVFAAAIGAYTSDE 200 Query: 190 TVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW 230 LA ++ + D+ FY L G + W Sbjct: 201 RSLAATLFDACEPGMLLTADRNFYGYGLWQQALATGADLLW 241 >UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4D726 Length = 464 Score = 61.6 bits (148), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 56/208 (26%), Positives = 90/208 (43%), Gaps = 6/208 (2%) Query: 28 LPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGM--NLL 85 +P + L + R R LP +V++ V+ D R + A AGM +L+ Sbjct: 8 VPPVLVDEVLAATGRFEKRVRMLPARVVVYFVLAMTLFGDCGYR-GVWAALTAGMPGHLV 66 Query: 86 ---ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 + +A+ QAR+R+G AP+ LF + G + WHGL++ A DG D Sbjct: 67 PDPSAAALRQARRRLGTAPLALLFDRVCGPVGTKETPGVFWHGLRVVAWDGTSVEVADSA 126 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD 202 +YG +T R YP +RL AL+ G+ L+ AV P E A +L + Sbjct: 127 ANVAHYGRHGKATSRPAGYPQVRLTALVECGTRALMGAVFGPMHDKELPQARRLLPVLRP 186 Query: 203 NSITLFDKLFYSEDLLLTLNQKGCNRHW 230 + L D+ + + + G + W Sbjct: 187 GILLLADRGYDGYEAIRDAASTGADLLW 214 >UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=Q648P8_9ARCH Length = 464 Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 37/130 (28%), Positives = 61/130 (46%), Gaps = 8/130 (6%) Query: 105 LFRQTAQDRGAERYLKDD----WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNA 160 L R+ ++ G +LK + W G + +DG PD PE ++ Y K Sbjct: 113 LVRRLVRETGKLLHLKSEEAWKWKGRSVKLVDGTTVSMPDTPENQKMYPQPE-GQKEGVG 171 Query: 161 YPVMRLVALMNLGSHILLNAVTAPYRQSET---VLAHSMLATIPDNSITLFDKLFYSEDL 217 +P+ RLVA+++L +L+ PY+ ET L +L +I I L D+ + S L Sbjct: 172 FPIARLVAIISLSCGAVLDIAIGPYKGKETGEHALLRQILGSISTGDILLGDRYYCSYFL 231 Query: 218 LLTLNQKGCN 227 ++ L Q G + Sbjct: 232 IVMLQQLGAD 241 >UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 Tax=Streptomyces rishiriensis RepID=Q7BLZ8_9ACTO Length = 341 Score = 57.0 bits (136), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 32/109 (29%), Positives = 49/109 (44%), Gaps = 1/109 (0%) Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTS-TKRQNAYPVMRLVALMNLGSHILLNAV 181 + G +L A+DG F PD ++G S + ++AYP +RL AL G+H + A Sbjct: 1 YRGWRLVAVDGTTFDVPDTEANAAFFGRPGVSRGQEKSAYPQVRLAALAECGTHAVFAAE 60 Query: 182 TAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW 230 P ET LA + ++ + L D+ F DL G + W Sbjct: 61 AGPLAVHETELAQRLFGSLTPGMLLLADRGFRGFDLWRAAAATGADLLW 109 >UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 Tax=Streptomyces avermitilis RepID=Q82R31_STRAW Length = 542 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 51/209 (24%), Positives = 87/209 (41%), Gaps = 6/209 (2%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNE--PITDVVRRLNLSADGEAG 81 + LP E + L + A R R LP + ++ V+ P VR + G G Sbjct: 29 LTQQLPFELVDDVLERAGGAQHRLRLLPSRVGVYFVLALALFPQLGYVRVWDKLTAGLRG 88 Query: 82 M--NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDG-AQFRT 138 + + A+ + R+R+G AP+ LF A + + A DG + + Sbjct: 89 ILHRRPSEKALREVRRRLGVAPLRLLFETLAGPVAQPITPGVRYRCWRTVAFDGCSSTKA 148 Query: 139 PDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLA 198 PD+P + + G + + YP+++++ L G+ LL AV P + ET A +L Sbjct: 149 PDRPRVCAWLGK-HKHRYGTDGYPMLKIMVLCETGTRALLGAVFGPTPEKETGYAEQLLP 207 Query: 199 TIPDNSITLFDKLFYSEDLLLTLNQKGCN 227 + + L D+ F S+D L G Sbjct: 208 LLDGGMLLLNDRGFDSDDFLAKAAATGAQ 236 >UniRef50_A3YGY3 Transposase and inactivated derivative n=1 Tax=Marinomonas sp. MED121 RepID=A3YGY3_9GAMM Length = 66 Score = 54.7 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 29/58 (50%), Positives = 38/58 (65%) Query: 166 LVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQ 223 LVALMN SHI+++ YR+ + LA S A PDNSITLFDK F+S +L L++ Sbjct: 2 LVALMNTQSHIMMDPQIIHYRRGKIPLAPSTQAKTPDNSITLFDKGFWSTKFMLGLSR 59 >UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Tax=Gammaproteobacteria RepID=A6UXI0_PSEA7 Length = 423 Score = 54.3 bits (129), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 54/201 (26%), Positives = 91/201 (45%), Gaps = 23/201 (11%) Query: 46 RRRRLP-GDMVIWMVVQNEPIT------DVVRRLNLSADGEAGMNLLARSAVTQARQRVG 98 RRR+L ++V++++ N+P T D R+ A E M + A +AR+++ Sbjct: 29 RRRQLTFKNLVLFLL--NQPRTALQTELDQFYRVLNQASTETQM--VTAQAFCKARKKLN 84 Query: 99 AAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQ 158 E L R Q L+ W GL++ A+DG+ P + + ++GS Sbjct: 85 PEVFESLNRLLQQQIDCFG-LRQKWRGLRVLAVDGSTVHLPLESTMATFFGS-------H 136 Query: 159 NAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLL 218 + +P+ RL L + L+++ P E AH L +P +S+TLFD+ + L Sbjct: 137 SGFPMARLSTLYEVADGQTLHSLIVPLTVGERDCAHLHLEHLPADSLTLFDRGYPGHWLF 196 Query: 219 LTLNQKGCNRHWL--LPAWKN 237 Q+ RH+L LP N Sbjct: 197 ALFAQQ--QRHFLMRLPCGYN 215 >UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomycetales RepID=A8KXP7_FRASN Length = 421 Score = 51.6 bits (122), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 52/222 (23%), Positives = 94/222 (42%), Gaps = 15/222 (6%) Query: 23 LFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQ-----NEPITDVVRRLNLSAD 77 + A P + + + R R LP + + VV ++ +V+R++ ++ D Sbjct: 23 VLAAQFPDALVDRVVAETGRRERRTRDLPAALTLRYVVALALFPSDGYDEVMRQVKVADD 82 Query: 78 ---GEAG-MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA-ERYLKDDWHGLQLFAID 132 +AG + + A +A+T+AR R+G PV+ LF +TA R + + G ++ +D Sbjct: 83 WLSDKAGPVKVPATTAITKARDRLGVEPVKLLFERTAVPMALPRRTVGAFYRGWRVCTVD 142 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ----S 188 G PD E +G + + A P +R++ L+ G+ LL A S Sbjct: 143 GTTLLVPDTDENAAAFGKPGND-QGEGALPQVRVLGLVECGTRALLGAGFGGTGGSKAAS 201 Query: 189 ETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW 230 E L +L + + L D+ F +L G + W Sbjct: 202 EQALFPDLLGALRPGMLVLADRNFLGFELFAKAAATGADLLW 243 >UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitrococcus mobilis Nb-231 RepID=A4BL98_9GAMM Length = 426 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 40/162 (24%), Positives = 71/162 (43%), Gaps = 6/162 (3%) Query: 68 VVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQ 127 V R L+ N + +ARQR+ AP+E R++ Q W G + Sbjct: 41 VARVLSERLQSGQSANSINTGPYCKARQRLPRAPLENAVRESGQTLHQRAPSAWGWRGHR 100 Query: 128 LFAIDGAQFRTPDKPE-LREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR 186 + DG PD + RE+ N +P++R+VAL++LG+ +L+ PY+ Sbjct: 101 VVLADGTTALMPDTLDNQREFPQQGNQQPGL--GFPIVRIVALISLGAGAVLDYALGPYQ 158 Query: 187 ---QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKG 225 E+ L ++L T+ + L D+ + + ++ L G Sbjct: 159 GKGSGESSLFSTLLHTLQPGDLLLADRYYCTYAIMALLVHHG 200 >UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZMB0_PLALI Length = 497 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 44/183 (24%), Positives = 73/183 (39%), Gaps = 12/183 (6%) Query: 54 MVIWMVVQNEPITDVVRRLNLSADGEAGMNLLA--------RSAVTQARQRVGAAPVEWL 105 + +W ++ TDV R + A L+ A +AR ++ V+ L Sbjct: 72 VTLWAMLSQALFTDVQRACRAAVQRVAVYYALSGIRISSTNTGAYCRARAKIPEGVVQRL 131 Query: 106 FRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMR 165 Q A K WHG + IDG PD E + Y ++ K +P++R Sbjct: 132 AVGVGQRCEAAVPDKWRWHGFRTLVIDGTTCSMPDTQENQAEYPQPSSQGK-GLGFPILR 190 Query: 166 LVALMNLGSHILLNAVTAPY---RQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLN 222 VAL +L + ++L VT P ET L ++ + + L D+ + +L L Sbjct: 191 AVALTSLATGMILALVTGPCAGKATGETALFRTLFDQLKAGDLVLSDRYYGGWFMLALLQ 250 Query: 223 QKG 225 + G Sbjct: 251 ELG 253 >UniRef50_UPI00016C385B hypothetical protein GobsU_16554 n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C385B Length = 454 Score = 47.0 bits (110), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 33/109 (30%), Positives = 49/109 (44%), Gaps = 5/109 (4%) Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHIL----L 178 W G ++F IDG +PELRE Y A T+ + +PV LV L S + Sbjct: 124 WSGRRVFLIDGTTRTLAPEPELREKYPPA-TNPHGRGVWPVALLVVAHELSSGAAVVPEV 182 Query: 179 NAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCN 227 A P+ SET LA +++ +P N + + D F + L +G Sbjct: 183 GATFGPHAVSETALAGAVMDRLPANGVVMADAGFGIFAVALGARARGLG 231 >UniRef50_Q2J8F5 Putative uncharacterized protein n=3 Tax=Frankia sp. CcI3 RepID=Q2J8F5_FRASC Length = 451 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 43/154 (27%), Positives = 62/154 (40%), Gaps = 10/154 (6%) Query: 88 SAVTQARQRVGAAPVEWLFRQTA---QDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPEL 144 S +T+ARQR+GAAP+ LF Q A D W +L +IDG ++ P E Sbjct: 94 SGLTKARQRLGAAPLAELFGQVAGPVADLDTVGAFLSRW---RLMSIDGLEWDAPASKEN 150 Query: 145 REYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR----QSETVLAHSMLATI 200 +G P +R V + SH + A P SE LA ++ + Sbjct: 151 IAAFGLPAGRVDAPGVLPKVRAVTVSECASHAPVLAAFGPAGGAKPASEQALARTVYPRL 210 Query: 201 PDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPA 234 + + L D+ FYS T G W + A Sbjct: 211 ASDWLLLADRNFYSWADWCTAADTGAALLWRVKA 244 >UniRef50_Q93UU3 Orf n=3 Tax=Escherichia coli RepID=Q93UU3_ECO57 Length = 28 Score = 45.8 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 21/22 (95%), Positives = 21/22 (95%) Query: 242 MIELGNTASPGTIPKRLEHLRG 263 MI LGNTASPGTIPKRLEHLRG Sbjct: 1 MIVLGNTASPGTIPKRLEHLRG 22 >UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria RepID=Q12AI7_POLSJ Length = 458 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 63/146 (43%), Gaps = 10/146 (6%) Query: 88 SAVTQARQRVGAAPVEWLFRQTAQ---DRGAERYLKDDWHGLQLFAIDGAQFRTPDKPEL 144 +ARQR+ V L R+T + ++ ++L W G + +DG PD PE Sbjct: 97 GGYCRARQRLPLEMVGTLTRETGRLLHEKALAQWL---WRGRAVKLVDGTGISMPDTPEN 153 Query: 145 REYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSMLATIP 201 +E Y +T +P+ RLV ++ L + L+ P+ E L +LA Sbjct: 154 QERYPQPSTQAP-GVGFPLARLVMVICLATGAALDMAVGPHSGKGSGELGLVRRLLAGFC 212 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCN 227 + L D L+ + L+ +L G + Sbjct: 213 PGDVMLADALYCNYFLIASLMAAGVD 238 >UniRef50_UPI00016C3BAD transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3BAD Length = 258 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 35/147 (23%), Positives = 64/147 (43%), Gaps = 8/147 (5%) Query: 86 ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWH--GLQLFAIDGAQFRTPDKPE 143 A A +AR ++ A + L Q+ + ER+ +W G ++ DG PD P Sbjct: 94 ATGAYCKARAKLPVALLSRLATQSGDE--LERHAPKEWQWKGRRVLLGDGTTLSGPDTPA 151 Query: 144 LREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSMLATI 200 + Y +T+ KR +P++R+V L+ + L+ A P + E L +L Sbjct: 152 NQAAY-PQHTNQKRGLGFPLIRVVVLLGFATGALVGAAIGPAKGKEAGEMALLRELLDRF 210 Query: 201 PDNSITLFDKLFYSEDLLLTLNQKGCN 227 + + D+ + S L+ L +G + Sbjct: 211 QAGDVFVADRAYCSYWLVSALQARGVD 237 >UniRef50_UPI00016C48B0 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C48B0 Length = 202 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 30/111 (27%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 87 RSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELRE 146 RS + AR R+G APV L + H +L +DG D Sbjct: 87 RSTLCMARVRLGVAPVRRLQERVTALLATRATPGAFHHQYRLMGLDGFAADLADSAANTR 146 Query: 147 YYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 +G S + A+P R+++L LG+H+L ++ P R+ E +A ++L Sbjct: 147 AFGHPG-SGRATGAFPQARVLSLCELGTHVLWRSLIKPCRRGEVTMAPALL 196 >UniRef50_B2Q345 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q345_PROST Length = 130 Score = 42.7 bits (99), Expect = 0.014, Method: Compositional matrix adjust. Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%) Query: 85 LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPD 140 +A SAV QARQR+G V +F +T+Q ++ W+GL L A+DG +R PD Sbjct: 19 VAPSAVVQARQRLGEDAVRKVFEKTSQ-LWLDKLPLSHWNGLTLMAVDGTLWRIPD 73 >UniRef50_B8CMP8 Transposase OrfA, putative n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CMP8_SHEPW Length = 156 Score = 42.4 bits (98), Expect = 0.019, Method: Compositional matrix adjust. Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 59 VVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQ--DRGAE 116 ++ E + ++ +L++ E G +ARS VTQ R+++ + VE +FRQT Q + AE Sbjct: 83 LISGESVRQLIYKLDIILLNEVGY--VARSTVTQTRKKLTSDVVEDIFRQTPQRWNMLAE 140 Query: 117 RYLKDDWHGLQLFAIDGA 134 W GL L+ +DG Sbjct: 141 H---PQWCGLNLYGVDGV 155 >UniRef50_D2ASB5 Transposase, IS4 family n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2ASB5_STRRD Length = 356 Score = 42.4 bits (98), Expect = 0.019, Method: Compositional matrix adjust. Identities = 40/177 (22%), Positives = 76/177 (42%), Gaps = 10/177 (5%) Query: 88 SAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREY 147 SA+++AR R+GA P+ LF + + + GL+ +DG P+ + Sbjct: 92 SAISRARARLGAEPLRVLFCRVTGPVAEPQASRSWLAGLRPVTMDGTTLVVPETRD-NSA 150 Query: 148 YGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITL 207 +G + + + +P +R+VA+ G+H L++A E LA +L + + + L Sbjct: 151 FGYPDGAAR----FPCVRVVAVAENGTHALIDATFGSSAVEERTLARRLLRCLESDMLLL 206 Query: 208 FDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGA 264 + +L + G H L W ++ + +G + G+ R L GA Sbjct: 207 ARSGRWGFELWRQAAETG--THLL---WGVTGADALPIGRSFEDGSYLSRPAGLGGA 258 >UniRef50_Q3SHG4 Putative uncharacterized protein n=1 Tax=Thiobacillus denitrificans ATCC 25259 RepID=Q3SHG4_THIDA Length = 255 Score = 42.0 bits (97), Expect = 0.025, Method: Compositional matrix adjust. Identities = 51/170 (30%), Positives = 75/170 (44%), Gaps = 13/170 (7%) Query: 23 LFAEHLPTEWIQHCLTLSAHATVRRRRLP-----GDMVIWMVVQNEPITDVV-RRLNLSA 76 +F + LPTE I + SA R R P + ++ +++ DVV RRL+ Sbjct: 54 VFEQVLPTEEIMGTIEESA-PVFRHRHYPPLTTLRHFIEQVLSEDQACQDVVGRRLSERV 112 Query: 77 DGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKD-DWHGLQLFAIDGAQ 135 L SA QARQR+ V+ L+R T +R R K W G +L DG Sbjct: 113 GQRQSTCSLNTSAYCQARQRLPQEMVDRLYRTTG-ERLETRLPKSWRWRGRRLVLFDGTT 171 Query: 136 FRTPDKPELREYYGSANTSTKRQN-AYPVMRLVALMNLGSH-ILLNAVTA 183 PD L ++ ++ +PV RL L+ L S +L +AV+A Sbjct: 172 VSMPDT--LASQCAFPQSAEQQPGLGFPVARLSGLIGLASGAVLGHAVSA 219 >UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultured bacterium BLR12 RepID=C0ING1_9BACT Length = 337 Score = 42.0 bits (97), Expect = 0.029, Method: Compositional matrix adjust. Identities = 30/111 (27%), Positives = 50/111 (45%), Gaps = 3/111 (2%) Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 W+GL+L AIDG+ P + E +G N + V R L ++ + +L+ Sbjct: 15 WNGLRLLAIDGSTAVLPGHKSITEEFGITNFGPYANSPRSVARTSVLYDVLNLTVLDGQI 74 Query: 183 APYRQSETVLAHSMLATI-PDNSITLFDKLFYSEDLLLTLNQKGCNRHWLL 232 Y E LA A + P + LFD+ + S L+ + +G H+L+ Sbjct: 75 DRYDSCERNLARQHFAQVKPATDLLLFDRGYPSLGLMFEMQAQGI--HYLI 123 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P30192 Putative uncharacterized protein ychG n=8 Tax=En... 421 e-116 UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=... 293 5e-78 UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholde... 289 8e-77 UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria R... 278 2e-73 UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholde... 277 3e-73 UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=V... 264 2e-69 UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammapro... 258 2e-67 UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria R... 256 6e-67 UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangiu... 254 3e-66 UniRef50_P03835 Transposase insG for insertion sequence element ... 247 4e-64 UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanel... 245 1e-63 UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_A... 244 2e-63 UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio ... 244 3e-63 UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Coryneb... 241 3e-62 UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_A... 238 2e-61 UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobac... 228 2e-58 UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia R... 226 6e-58 UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkhold... 215 2e-54 UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Ra... 211 2e-53 UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC 211 2e-53 UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocac... 209 8e-53 UniRef50_UPI00016A835E hypothetical protein BoklC_27358 n=1 Tax=... 203 7e-51 UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=St... 201 2e-50 UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomyc... 200 6e-50 UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia s... 190 5e-47 UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 ... 190 6e-47 UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithio... 180 6e-44 UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomyc... 177 3e-43 UniRef50_Q2J8F5 Putative uncharacterized protein n=3 Tax=Frankia... 176 8e-43 UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Ta... 148 2e-34 UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitroco... 147 4e-34 UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces ... 147 4e-34 UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=... 140 6e-32 UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 ... 140 7e-32 UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminoc... 137 4e-31 UniRef50_D0SX83 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 131 3e-29 UniRef50_A1JS05 Transposase for insertion sequence element IS166... 130 7e-29 UniRef50_UPI00016C385B hypothetical protein GobsU_16554 n=3 Tax=... 108 2e-22 UniRef50_UPI000190F8A2 hypothetical protein SentesTyp_33971 n=1 ... 73 9e-12 UniRef50_A3YGY3 Transposase and inactivated derivative n=1 Tax=M... 71 4e-11 UniRef50_Q8X2N0 Putative uncharacterized protein ECs5267 n=1 Tax... 54 8e-06 Sequences not found previously or not previously below threshold: UniRef50_D2ASB5 Transposase, IS4 family n=1 Tax=Streptosporangiu... 161 4e-38 UniRef50_Q82R32 Putative IS4 family ISFsp5-like transposase n=1 ... 147 6e-34 UniRef50_UPI00016C3BAD transposase, IS4 n=2 Tax=Gemmata obscurig... 119 9e-26 UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium... 117 5e-25 UniRef50_UPI00016C37A0 transposase, IS4 n=2 Tax=Gemmata obscurig... 115 1e-24 UniRef50_B8FEP3 Transposase IS4 family protein n=1 Tax=Desulfati... 115 2e-24 UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria R... 114 6e-24 UniRef50_UPI00016C48B0 transposase, IS4 family protein n=1 Tax=G... 111 2e-23 UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastop... 110 8e-23 UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM ... 105 2e-21 UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanob... 102 1e-20 UniRef50_Q3SHG4 Putative uncharacterized protein n=1 Tax=Thiobac... 100 5e-20 UniRef50_UPI00016C5887 hypothetical protein GobsU_05723 n=3 Tax=... 99 2e-19 UniRef50_Q82QT3 Putative uncharacterized protein n=1 Tax=Strepto... 97 5e-19 UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanoth... 97 6e-19 UniRef50_Q7TTE4 Putative uncharacterized protein n=9 Tax=Plancto... 96 1e-18 UniRef50_A6CCZ3 Transposase, IS4 (Fragment) n=7 Tax=Planctomyces... 90 8e-17 UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultu... 89 1e-16 UniRef50_C6N0W0 Putative uncharacterized protein n=1 Tax=Legione... 87 9e-16 UniRef50_A3ZMM8 Transposase insG for insertion sequence element-... 84 8e-15 UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q0... 76 1e-12 UniRef50_Q82UV9 Putative uncharacterized protein n=1 Tax=Nitroso... 74 6e-12 UniRef50_A5N5R2 Transposase n=6 Tax=Clostridium RepID=A5N5R2_CLOK5 73 1e-11 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 71 5e-11 UniRef50_B8CMP8 Transposase OrfA, putative n=1 Tax=Shewanella pi... 70 7e-11 UniRef50_B2Q345 Putative uncharacterized protein n=1 Tax=Provide... 70 9e-11 UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_... 68 5e-10 UniRef50_UPI0001AF03EF IS4 family transposase n=1 Tax=Streptomyc... 67 9e-10 UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipe... 67 1e-09 UniRef50_Q8QNB6 EsV-1-170 n=2 Tax=Ectocarpus siliculosus virus 1... 66 1e-09 UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales Rep... 65 2e-09 UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostri... 65 3e-09 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 64 6e-09 UniRef50_A4A0C3 Probable transposase n=1 Tax=Blastopirellula mar... 64 7e-09 UniRef50_A1SV49 ISSod7, transposase n=1 Tax=Psychromonas ingraha... 59 2e-07 UniRef50_B9YUA6 Transposase, IS4 family protein n=3 Tax='Nostoc ... 59 2e-07 UniRef50_B6FVR6 Putative uncharacterized protein (Fragment) n=2 ... 59 2e-07 UniRef50_A3EIG1 FOG: Transposase and inactivated derivatives n=3... 58 4e-07 UniRef50_C6JHV1 Putative uncharacterized protein (Fragment) n=1 ... 58 4e-07 UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostrid... 57 5e-07 UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula balt... 57 7e-07 UniRef50_C8VYK7 Putative uncharacterized protein n=1 Tax=Desulfo... 56 1e-06 UniRef50_A5N172 Transposase n=4 Tax=Clostridium RepID=A5N172_CLOK5 56 1e-06 UniRef50_P12249 Transposase for insertion sequence element IS231... 55 3e-06 UniRef50_B2J2I5 Transposase, IS4 family protein n=1 Tax=Nostoc p... 54 5e-06 UniRef50_UPI00016C560B transposase IS4 family protein n=1 Tax=Ge... 54 6e-06 UniRef50_C1DL03 Transposase inactivated derivative n=1 Tax=Azoto... 54 7e-06 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 53 1e-05 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 53 1e-05 UniRef50_B8CMP9 Transposase OrfB, putative n=1 Tax=Shewanella pi... 53 1e-05 UniRef50_Q09BD0 Isrso13-transposase protein n=1 Tax=Stigmatella ... 53 2e-05 UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostri... 52 2e-05 UniRef50_UPI00019668E9 transposase ISLbp1 n=1 Tax=Methanobreviba... 52 2e-05 UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=... 52 2e-05 UniRef50_A4BSI0 Putative uncharacterized protein n=1 Tax=Nitroco... 52 3e-05 UniRef50_A4A0C6 Probable transposase n=2 Tax=Blastopirellula mar... 52 3e-05 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 51 4e-05 UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax... 50 7e-05 UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacil... 50 7e-05 UniRef50_Q2JF90 Putative uncharacterized protein n=1 Tax=Frankia... 50 7e-05 UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacte... 49 2e-04 UniRef50_C4XGQ6 Putative transposase for insertion sequence elem... 48 4e-04 UniRef50_C0R4I1 Putative uncharacterized protein n=5 Tax=Wolbach... 47 7e-04 UniRef50_B2AJ60 Transposase, IS4 family n=4 Tax=Proteobacteria R... 47 0.001 UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobiu... 47 0.001 UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfati... 46 0.002 UniRef50_C6JAL6 Transposase (Fragment) n=1 Tax=Ruminococcus sp. ... 45 0.004 UniRef50_A1HQH6 Transposase, IS4 family protein n=2 Tax=Thermosi... 45 0.004 UniRef50_C0QMU6 Transposase repeat family IS4 n=1 Tax=Thermosiph... 43 0.012 UniRef50_D2PLH1 Putative uncharacterized protein n=1 Tax=Kribbel... 43 0.014 UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3... 43 0.014 UniRef50_Q73GX2 Conserved domain protein n=5 Tax=Wolbachia RepID... 42 0.021 UniRef50_B5WFI6 Putative uncharacterized protein n=1 Tax=Burkhol... 42 0.032 >UniRef50_P30192 Putative uncharacterized protein ychG n=8 Tax=Enterobacteriaceae RepID=YCHG_ECOLI Length = 299 Score = 421 bits (1081), Expect = e-116, Method: Composition-based stats. Identities = 299/299 (100%), Positives = 299/299 (100%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV Sbjct: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 Query: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK Sbjct: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 Query: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA Sbjct: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 Query: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS Sbjct: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 Query: 241 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK Sbjct: 241 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 >UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=Gammaproteobacteria RepID=Q7MLW1_VIBVY Length = 445 Score = 293 bits (750), Expect = 5e-78, Method: Composition-based stats. Identities = 129/299 (43%), Positives = 184/299 (61%), Gaps = 15/299 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 M + N DF + + F+EH+P EW+ TLS AT+RRRRLP DMV+W++V Sbjct: 1 MSIQNYFADFLEESPVDVAQLTTFSEHIPDEWVAKAATLSDKATIRRRRLPSDMVLWLIV 60 Query: 61 -----QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA 115 +NE I +V RR+N+ A+G A LLA+SA+TQARQR+G A EWLFRQ + G Sbjct: 61 GMAFFRNESIAEVARRMNVCAEGLADEELLAKSALTQARQRLGKAAPEWLFRQCSHTWGL 120 Query: 116 ERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSH 175 ERY +D W GLQ+FAIDGA FRT D ELRE++GS NTS++RQ +PV+R+V +MN+ SH Sbjct: 121 ERYPEDTWQGLQVFAIDGALFRTADTSELREHFGSGNTSSERQTPHPVLRVVTMMNVRSH 180 Query: 176 ILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 ++++A +PYR+ E LA + ++PDNS+TL DK FY DLLL+L G NRHWLLPA Sbjct: 181 VIVDAAISPYRRGEIPLAMPFIDSLPDNSVTLLDKGFYGADLLLSLQNSGSNRHWLLPAK 240 Query: 236 KNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHS 294 K + +++ + + ++ ++ P+ P ++ Y V+ Sbjct: 241 KGVKFRLLDDEES-DDMLVEMKVSPQ---------ARKKNPNLPEKWQVRAVTYQVQGK 289 Score = 103 bits (258), Expect = 6e-21, Method: Composition-based stats. Identities = 41/94 (43%), Positives = 55/94 (58%), Gaps = 7/94 (7%) Query: 213 YSEDLLLTLNQKGCNRHWLLP-------AWKNIASEMIELGNTASPGTIPKRLEHLRGAL 265 +L+ + H + A + IAS++ + SPG PKRL+ LRG L Sbjct: 352 LGYNLVRREASQAAVAHGRMANEISFKYACQFIASQLKVMSKAVSPGNTPKRLKSLRGDL 411 Query: 266 EVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 ++FI KRP+P+RPR+VKISKTRYPV AAPLK Sbjct: 412 SILFIDKRPKPNRPRAVKISKTRYPVNRKAAPLK 445 >UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholderia RepID=B2JV26_BURP8 Length = 442 Score = 289 bits (740), Expect = 8e-77, Method: Composition-based stats. Identities = 91/248 (36%), Positives = 143/248 (57%), Gaps = 8/248 (3%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P AEHLP EWI+ + + A++RRRRLP + V+W+V+ ++ I++V+ L Sbjct: 15 PADLSRLAEHLPYEWIERAVQATGAASIRRRRLPAEQVVWLVIALAMYRHWSISEVLDSL 74 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 +L+ EA +++SAV QARQR+G AP+ WLF QTA+ + + GL L+A+D Sbjct: 75 DLALPNEA-APFVSKSAVVQARQRIGEAPMAWLFEQTARAWTTQDAAHHAFKGLSLWAMD 133 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G RTPD RE++G+ ++ + +YP +R V L + +H++ + Y +E V Sbjct: 134 GTTLRTPDSAANREHFGAQGYASGKVASYPQVRAVTLTAIPTHLVADINFGCYDTNEMVY 193 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 A S+L IPD+S+T+FDK F + ++L L G NRH+L+PA N E+I TA Sbjct: 194 AKSLLPQIPDDSLTVFDKGFLAAEILCGLTMNGRNRHFLIPAKSNTCWEVI--AGTADDA 251 Query: 253 TIPKRLEH 260 + R+ Sbjct: 252 MVRMRVSQ 259 >UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria RepID=B9BXQ1_9BURK Length = 446 Score = 278 bits (710), Expect = 2e-73, Method: Composition-based stats. Identities = 91/247 (36%), Positives = 142/247 (57%), Gaps = 9/247 (3%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P AEHLP WI+ + + A++RRRRLP + V+W+V+ ++ +++VV L Sbjct: 20 PTDLSRLAEHLPHAWIEQAIEATGTASIRRRRLPAEQVVWLVIALAIYRHWSVSEVVDSL 79 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 L E +++SAVTQARQR+G AP+ WLF QTAQ + + + GL L+A+D Sbjct: 80 ELVLPNET--TFVSKSAVTQARQRLGHAPIAWLFEQTAQAWCKQDGARHAFKGLSLWAMD 137 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G RTPD RE++GS + ++ + +YP MR V L ++ +H++ N Y +E + Sbjct: 138 GTTLRTPDSAANREHFGSQSYASGKVASYPQMRAVTLTSIPTHLVANIAFGRYDTNEMIY 197 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 A ++LA IPD+S+TLFDK F + ++L LN NRH+L+PA N E++ Sbjct: 198 AKNLLAQIPDHSLTLFDKGFLAAEILCGLNSGERNRHFLIPAKSNTRWEVL--SGKPDDA 255 Query: 253 TIPKRLE 259 + R+ Sbjct: 256 LVRMRVS 262 >UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1T817_9BURK Length = 448 Score = 277 bits (709), Expect = 3e-73, Method: Composition-based stats. Identities = 93/248 (37%), Positives = 145/248 (58%), Gaps = 8/248 (3%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR 71 PP +HLP EWI++ + S A+VRRRRLP V+W+V+ +++ I++VV Sbjct: 20 PPLEWGRLGQHLPYEWIEYAVQASGSASVRRRRLPAQQVVWLVIALALYRHQSISEVVDE 79 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 L+L+ A + +++SA+ QARQR+GAAP+ WLF ++A + A+ K + G LFA+ Sbjct: 80 LDLALPA-ADASFVSKSAIAQARQRIGAAPLAWLFHESAANWVAQDQAKHLFKGFSLFAM 138 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 DG RT D R ++G++ + R +YP +R V L L +H++ +AV PY +E + Sbjct: 139 DGTTLRTADSAANRRHFGASAAAHGRIGSYPQLRAVTLTALATHLVRDAVFGPYDINEMI 198 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASP 251 A ++A +P NSIT+FDK F S LL L G NRH+++PA N E++ G Sbjct: 199 WARELIARVPANSITVFDKGFLSAQLLCNLVSGGENRHFIIPAKANTCWEVVSGGP--GD 256 Query: 252 GTIPKRLE 259 T+ R+ Sbjct: 257 QTVRMRVS 264 >UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=Vibrio vulnificus RepID=Q7MGY3_VIBVY Length = 441 Score = 264 bits (675), Expect = 2e-69, Method: Composition-based stats. Identities = 100/298 (33%), Positives = 152/298 (51%), Gaps = 20/298 (6%) Query: 8 LDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QN 62 L ++ ++ L ++I CL S AT+R+RR+P DM +W VV + Sbjct: 7 LTLANRYAPNTEQLGKLSDILCPDFINQCLDASGVATIRKRRIPLDMAVWAVVAMSLYRQ 66 Query: 63 EPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDD 122 EP+ +V + L G+ L+A SA+ QARQR+GA ++ +F Q+ Q E Sbjct: 67 EPLWSIVSKAQLMLPGKRS--LVAPSAIVQARQRLGADAMKEVFHQS-QSLWNETADHPT 123 Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 W GL+L A+DG +RTPD E R+ + SA+ ++P +R+V M L SH+L+ + Sbjct: 124 WCGLKLLAVDGVVWRTPDTKENRDAFQSASNQNGE-GSFPQVRMVCQMELTSHMLVASAF 182 Query: 183 APYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS-E 241 A Y+ +E +LA ++ T PD S+T+FD+ FYS LL G RHWL+P KN E Sbjct: 183 ASYKTNEMILAEQLIETTPDYSLTMFDRGFYSLSLLHRWANTGNERHWLMPMRKNTQFTE 242 Query: 242 MIELGNT------ASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKT---RYP 290 + +LG + K+ L +EV I K + + S+ S T RYP Sbjct: 243 VRKLGRNDRIVELKTTPQARKKSLSLPETIEVRLIKKTIK-GKEVSILTSMTDHRRYP 299 >UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammaproteobacteria RepID=C6CF98_DICZE Length = 441 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. Identities = 77/226 (34%), Positives = 119/226 (52%), Gaps = 8/226 (3%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADG 78 F+ L WI L A++RRRRLP + +W+V+ ++ I DV L++ Sbjct: 20 FSRSLDPAWIHQALNACHKASIRRRRLPAEQAVWLVLMMGLLRDLSIKDVCHHLDIVLQP 79 Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRT 138 + G LA S +T ARQR+G AP+ +LF + D +HGL + ++DG FRT Sbjct: 80 DEGYQPLAPSVLTAARQRLGEAPLRYLFHACNEGWLPTVLGSDTFHGLHVLSVDGTLFRT 139 Query: 139 PDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLA 198 PD P+ +G + +P +R+V LM SH+LL+A + E LAH +++ Sbjct: 140 PDSPDNAAAFGFIDP---VHGTFPQVRMVGLMATHSHMLLDAAFGGVAEGELTLAHRLVS 196 Query: 199 TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 + PD+S+TLFD+ ++S LL Q G HWL P + + +IE Sbjct: 197 SAPDHSLTLFDRCYFSASFLLEWRQAGVETHWLTPVKRKLRYRVIE 242 >UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria RepID=D2TH14_CITRO Length = 438 Score = 256 bits (654), Expect = 6e-67, Method: Composition-based stats. Identities = 81/232 (34%), Positives = 124/232 (53%), Gaps = 8/232 (3%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P S F +P EWI L + A++R+R+LP ++V+W++V ++ ITDVV +L Sbjct: 15 PASLSCFQRAIPLEWISQVLDSTNKASIRKRKLPAELVVWLIVGMGLYRDRSITDVVTKL 74 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 +L + G LA S+V +ARQR+ P+ LF TA + D W+GL+LFA+D Sbjct: 75 DLVLSSQEG-ETLAASSVARARQRLSDEPLRELFTLTASHWTQQEDKDDLWYGLRLFAVD 133 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G FRTPD PEL E++ R YP++RL A+M+L S ++ P E Sbjct: 134 GTLFRTPDTPELAEHFEYIKHRPDRHTEYPMVRLCAMMSLRSRLIHGVKFGPANTGEVSY 193 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 A + S+TLFD+ + S +LL+ ++ HWL+P N ++E Sbjct: 194 AKQLSPQ--AKSLTLFDRCYLSAELLINWQRRQQEAHWLVPLKGNTKYRIVE 243 >UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LI35_HALO1 Length = 449 Score = 254 bits (648), Expect = 3e-66, Method: Composition-based stats. Identities = 86/234 (36%), Positives = 131/234 (55%), Gaps = 7/234 (2%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR 71 PP A + EWI+ L + AT+RRRRLP + ++W+V+ ++ PIT+VV Sbjct: 15 PPEEFSRLARDVAPEWIEQALEATGTATLRRRRLPMEQLVWLVIGMALFRDRPITEVVTS 74 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 L+L+ G +A SAV QAR R+G +P+ WLF +A + D W GL L+ + Sbjct: 75 LDLALPSP-GHPEVAPSAVAQARDRLGESPMAWLFAHSADRWAHQSAADDRWRGLALYGV 133 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR-QSET 190 DG R PD E R+++G AN + + YPV+RL ALM L SH+L PY+ E Sbjct: 134 DGTTLRVPDSEENRDHFGLANGGARGSSGYPVVRLAALMALRSHLLAAVSFGPYQGHGEY 193 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 A + +PDNS+ + D+ +++ ++L+ L Q G NRHWL+ K + ++E Sbjct: 194 WYAADLWPCLPDNSLVIVDRHYWAANVLIPLQQDGLNRHWLIRGRKGLNYRVVE 247 >UniRef50_P03835 Transposase insG for insertion sequence element IS4 n=377 Tax=root RepID=INSG_ECOLI Length = 442 Score = 247 bits (630), Expect = 4e-64, Method: Composition-based stats. Identities = 84/241 (34%), Positives = 118/241 (48%), Gaps = 9/241 (3%) Query: 8 LDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QN 62 LD ++L E I CL S T+R+RRLP +M++W +V + Sbjct: 7 LDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERK 66 Query: 63 EPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDD 122 EP+ +V RL++ G +A SAV QARQR+G+ V +F +TAQ Sbjct: 67 EPLHQIVNRLDIMLPGNR--PFVAPSAVIQARQRLGSEAVRRVFTKTAQLWH-NATPHPH 123 Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 W GL L AIDG +RTPD PE + T YP +++V M L SH+L A Sbjct: 124 WCGLTLLAIDGVFWRTPDTPENDAAFPR-QTHAGNPALYPQVKMVCQMELTSHLLTAAAF 182 Query: 183 APYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEM 242 + SE LA ++ DN++TL DK +YS LL + G +RHW++P K E Sbjct: 183 GTMKNSENELAEQLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEE 242 Query: 243 I 243 I Sbjct: 243 I 243 >UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanella RepID=A6WTA0_SHEB8 Length = 446 Score = 245 bits (626), Expect = 1e-63, Method: Composition-based stats. Identities = 94/289 (32%), Positives = 146/289 (50%), Gaps = 22/289 (7%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 A+ L E IQ CL AT+RRR+LP D +IW V+ + E + ++ +L Sbjct: 21 LTEFTCLADVLEPELIQSCLDSQGVATLRRRKLPMDAMIWAVIGMALFRGESVRSLINKL 80 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 ++ E +ARSAVTQAR+R+G+ + +F ++A A R W GL L+ +D Sbjct: 81 DIVLPQEIDY--VARSAVTQARKRLGSEVIREVFSRSANTWHA-RAEHPHWCGLNLYGVD 137 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G +RTPD + + + ++ AYP +R+V LM L SH+L+N+ ++E L Sbjct: 138 GVVWRTPDSVQNQAAFARTANASGE-AAYPQIRMVCLMELSSHLLVNSAFDSVAENEMNL 196 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGN----- 247 A ++ +IP++S+TLFD+ FYS LL Q + HWLLP K E++ Sbjct: 197 ASQLIPSIPNHSLTLFDRGFYSLGLLHAWQQAQPDSHWLLPLKKGTQYEVVRTLGKHDQW 256 Query: 248 ---TASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKT---RYP 290 T +P K+ L LE +TK + + ++ S T RYP Sbjct: 257 VKLTTTP-QARKKWPQLPDTLEARLLTKTVK-GKSVAILTSLTDPMRYP 303 >UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_ALISL Length = 441 Score = 244 bits (624), Expect = 2e-63, Method: Composition-based stats. Identities = 80/290 (27%), Positives = 138/290 (47%), Gaps = 22/290 (7%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR 71 P + + A+ LP I +L+ T+R+R+L + ++W++V N+ + D+V + Sbjct: 20 KPSNVETLADLLPIHLIDEAYSLTDTVTMRKRKLTLESMVWLLVGMAIYNNKSMKDLVNQ 79 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 L++ G +A SA+TQ R+ +G A ++ +F + W+GL L + Sbjct: 80 LDIV--DRTGKAFVAPSALTQRRKNLGEAAMKAVFERMTSSWLKS-ANLPKWNGLTLLGV 136 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 DG +R PD + E + S ++ YP +R+V M L SH++ + Y +E + Sbjct: 137 DGVVWRAPDNQKNEEAF-----SRQKGTQYPQVRMVCQMELSSHLITASAFDNYNTNEMI 191 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASP 251 LA ++ + PD+S+T+FDK FYS LL G RHWL+P KN E+I Sbjct: 192 LAEKLIDSTPDHSVTMFDKGFYSLGLLHKWQMTGSERHWLIPLKKNTQYEIIRSLGRNDK 251 Query: 252 GTI-------PKRLEHLRGALEVVFITKRPRPSRPRSVK--ISKTRYPVK 292 I K +L + +T++ + + + I RYP+K Sbjct: 252 LVILRSNPRARKLFSNLPETMTARLVTRKIKGKDYQVLTSMIDPLRYPLK 301 >UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio RepID=B2LS82_9VIBR Length = 440 Score = 244 bits (622), Expect = 3e-63, Method: Composition-based stats. Identities = 68/228 (29%), Positives = 126/228 (55%), Gaps = 12/228 (5%) Query: 23 LFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSAD 77 +F +H+P EW++ + + ++R+RRLP + +W+V+ +N I DV +L L+ Sbjct: 22 VFNKHIPWEWVEEAVQQTGRVSLRKRRLPAEQAVWLVLGIGLQRNRSIQDVCDKLELAFP 81 Query: 78 GEAG-MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQF 136 G + +A S++ + ++R+G P+ +LF+ TAQ + D+ GL+L ++DG F Sbjct: 82 DVDGELTPMATSSIIKGKERLGDKPMRYLFKTTAQQWEQQSDF-DEVCGLKLLSVDGTYF 140 Query: 137 RTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSM 196 +T + E + ++G A ++ ++P + V LM+ SH++ +A P SE A + Sbjct: 141 KTHNTEENQ-HFGFA----QKGASFPSVLAVTLMSTRSHLVSDAAFGPVTNSEISYAQQL 195 Query: 197 LATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 + + PD+S+TLFD+ F S +L + N HWL P + ++IE Sbjct: 196 VGSAPDDSLTLFDRGFTSAELFTSWQGASSNSHWLTPIKTKMRYDIIE 243 >UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Corynebacterineae RepID=A4T2G5_MYCGI Length = 401 Score = 241 bits (614), Expect = 3e-62, Method: Composition-based stats. Identities = 57/236 (24%), Positives = 101/236 (42%), Gaps = 9/236 (3%) Query: 11 SDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPI 65 SD L S + P + + + VR R LP ++ + + + Sbjct: 11 SDRRLSDLVSVGVLTRVFPPAMVDEVIEATGRTQVRHRALPARVMAYFAIGMGLYSDGSY 70 Query: 66 TDVVRRLNLSADGEAGMN----LLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKD 121 DV+ +L +G L +SA+ QAR+R+G+ P+ LF + A+ GA Sbjct: 71 EDVLSQLTDGLAWASGWREQYQLPGKSAIFQARERLGSQPLAALFARVARPLGAADTPGT 130 Query: 122 DWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV 181 G ++ AIDG D P E++G + ++A+P RL+A+ G+H + A Sbjct: 131 WVAGRRVVAIDGTCLDVADNPVNEEFFGRPGVNKGEKSAFPQARLLAVAECGTHAIFAAT 190 Query: 182 TAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN 237 YR +E+ + +L + + L D+ F+S L + G + W + +N Sbjct: 191 IGAYRDAESTMVEHVLDALTPEMLVLADRGFFSYALWRNASDTGADLLWRVSTGRN 246 >UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_ACIJO Length = 443 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 82/257 (31%), Positives = 128/257 (49%), Gaps = 18/257 (7%) Query: 19 PSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLN 73 PS F+E + WI+ CL + A+VR+R+LP + +W+V+ +++PI VV++L Sbjct: 26 PSLSNFSELIDLNWIEDCLKRTGKASVRKRKLPAEHAVWLVIGLALFRDQPIWYVVQQLQ 85 Query: 74 LSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK-DDWHGLQLFAID 132 L A SA QARQR+G P+ LF +Q + + +HGL + A+D Sbjct: 86 LVFGT---AESCAPSASVQARQRLGLEPLNVLFNTLSQTWFEDSQPQYSAFHGLSICAVD 142 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 GA + P E ++GS+ T +P R V L+N +H +++A Q E L Sbjct: 143 GAVWSMPHTDENFRHFGSSKGKT-IAAPWPQARAVCLINTNTHEVIDAGIGSMDQGELTL 201 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 A + +P NS+TLFD+ ++S D L + N HWL+ A N+ E+I N+A Sbjct: 202 AKKL--KVPANSLTLFDRAYFSADFLSGWQSR-ENCHWLMRAKDNLRYEIIR-KNSAHDF 257 Query: 253 TIPK----RLEHLRGAL 265 I R + L L Sbjct: 258 QIRMPVSPRAKKLNPDL 274 >UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobacteria RepID=C5T3Q2_ACIDE Length = 436 Score = 228 bits (580), Expect = 2e-58, Method: Composition-based stats. Identities = 81/249 (32%), Positives = 127/249 (51%), Gaps = 14/249 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 M LL L+ + L P + + L WI L + A++RRR+LP + +W+V+ Sbjct: 1 MSLLQTTLNETLETL-PANAIAELSALLDPAWIAQALQATGKASMRRRKLPAEHAVWLVI 59 Query: 61 -----QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA 115 ++ P+ VV+ + L+ DG+ L A S Q RQR+GA P+E +F A G Sbjct: 60 GLALFRHMPLWQVVQEMALTLDGQ---ELPAPSVSVQVRQRLGAEPMEHMFGLLANAWGR 116 Query: 116 ERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSH 175 + L++ A+DG + PD + R+ GS T Q +P++R V L++ SH Sbjct: 117 AHAVHAG--ALRVLAVDGVAWSAPDSKDNRQELGSGQTQYGPQ-PWPMVRAVCLLDTDSH 173 Query: 176 ILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 LL+A Y E LA + D+SITLFD+ ++S LL +Q G RHWL+ A Sbjct: 174 ELLDAQLGDYGCGELTLAADLHGL--DHSITLFDRAYFSAAFLLAWSQAGQQRHWLMRAK 231 Query: 236 KNIASEMIE 244 N+ E+++ Sbjct: 232 DNLRYEVVQ 240 >UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LF21_FRASN Length = 420 Score = 226 bits (577), Expect = 6e-58, Method: Composition-based stats. Identities = 60/265 (22%), Positives = 103/265 (38%), Gaps = 11/265 (4%) Query: 11 SDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQ-----NEPI 65 + L S + A +P + + L + R+R LP +V++ + ++ Sbjct: 14 TSDRLTDRISLGVLARIVPRDLVDEVLAETRRLEQRKRLLPARVVVYFTMAMCLFFDDDY 73 Query: 66 TDVVRRL----NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKD 121 +V+RRL + + + A++QAR R+G P++ LF + A Sbjct: 74 DEVMRRLVGTLRWLGSWKGDWKVPSTGAISQARTRLGPEPLKLLFERVAVPVAGLGTKGA 133 Query: 122 DWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV 181 +L A+DG T D PE + +G + K A+P + +VAL G+H + A Sbjct: 134 WLGSRRLVAVDGVHLDTADTPENADAFGRFSHGPK-TAAFPQVHVVALAECGTHAVFAAA 192 Query: 182 TAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS- 240 Y E LA ++ + D+ FY L G + W + A + Sbjct: 193 IGAYTSDERSLAATLFDACEPGMLLTADRNFYGYGLWQQALATGADLLWRVNANLTLPVI 252 Query: 241 EMIELGNTASPGTIPKRLEHLRGAL 265 + G+ S PK RG L Sbjct: 253 RALPDGSYLSLLIDPKIPVARRGQL 277 >UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkholderiaceae RepID=A4JGL4_BURVG Length = 402 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. Identities = 63/244 (25%), Positives = 109/244 (44%), Gaps = 16/244 (6%) Query: 20 SAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR--- 71 SA + A P I+ L + A+ R R LP V++ V+ + P+ +V+R Sbjct: 22 SAGVLASVCPRTLIEEVLAETGKASQRERLLPAPAVVYYVMALALWREAPLEEVLRVVCE 81 Query: 72 -LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 L G ++SA++QAR R+G + L + + A + GL++ A Sbjct: 82 GLQWLGGGHTEAVQASKSAISQARSRLGPEVMRQLADRVLRPLAAPGAPGAWYRGLRVMA 141 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +DG+ D+ +++G S Q+A+P R++ L+ G+H ++ A APY SE Sbjct: 142 LDGSCMDVADEAANAKFFGYPGASRG-QSAFPQARVLGLVECGTHAVVAAGIAPYGHSEQ 200 Query: 191 VLAHSMLA-TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA---SEMIELG 246 V+A +L + + L D+ FY L T G W + N+ +M+ G Sbjct: 201 VMAAQLLPAKLTPEMLVLADRNFYGFKLWQTACATGAKLAWRV--KSNLKLPVEQMLPDG 258 Query: 247 NTAS 250 + S Sbjct: 259 SYLS 262 >UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Raphidiopsis brookii D9 RepID=UPI0001C16028 Length = 465 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 59/260 (22%), Positives = 116/260 (44%), Gaps = 19/260 (7%) Query: 15 LMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQN-----EPITDVV 69 L P ++ +P++ I + + + R R LP +++ +V+ + I DV Sbjct: 13 LNPQQIFLALSQVIPSQTITKAIESTCSSQRRLRILPTYIIVTLVIAMSFWSSDSIVDVF 72 Query: 70 RRL-----NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWH 124 + L +L + + S++T+ARQR GAA + LF A+ Sbjct: 73 KNLIHGLSSLHIPSGLRLQTPSASSITEARQRTGAAVMRRLFELVAKPLATILTPGAFLG 132 Query: 125 GLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAP 184 L++ A+DG F PD +G + +P +RLV L+ G+H++++A P Sbjct: 133 ELRIMAVDGTVFDVPDTSTNARVFGYPGSPKGTYPGFPKVRLVFLVEAGTHLIIDAFCYP 192 Query: 185 YRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 YR E A +L +I + + ++D+ +S ++ T+ ++ N +P N+ ++++ Sbjct: 193 YRMGERRGALKLLRSINSSMLLMWDRGLHSFKMVHTVIKQQGNFLGRVPG--NVKFQVVK 250 Query: 245 ---LGNTAS----PGTIPKR 257 G+ S G K+ Sbjct: 251 TLADGSYLSWIAPDGQSRKK 270 >UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC Length = 412 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 65/263 (24%), Positives = 106/263 (40%), Gaps = 22/263 (8%) Query: 15 LMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-------QNEPITD 67 L + + P E + L ++ A VRRR LP +V++ V+ +N Sbjct: 15 LPDRVTVGVLTRVYPPELVDRVLAVTDTAEVRRRLLPSWLVVYFVLALWLFRGRNCGYVQ 74 Query: 68 VVRRLNLSADGEA-------------GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRG 114 V+ RL + G +L A ++ +AR R+G+ PV LF A G Sbjct: 75 VLARLTSGLHFQRRAAVLAAGGAGGAGWSLPASPSLGEARARIGSDPVRMLFEHAAGPVG 134 Query: 115 AERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGS 174 E HGL+L IDG+ PD R ++ ++ +P +R V + Sbjct: 135 VEGQAGVFLHGLRLVQIDGSTCDLPDTQANRAFFPGP-SNAGGPAPFPKVRWVIAAEAAT 193 Query: 175 HILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPA 234 LL A P+ E LA +L + +TL D+ F S L + G + W A Sbjct: 194 GALLGASFGPWSTGEPALARDLLGQLGPGMLTLADRNFLSHRLAGEVLATGAHLLWRAKA 253 Query: 235 WKNIA-SEMIELGNTASPGTIPK 256 +A +++ G+ + T P+ Sbjct: 254 TFTLAPVHVLDDGSYLAELTPPR 276 >UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocaceae RepID=B2J1G3_NOSP7 Length = 381 Score = 209 bits (533), Expect = 8e-53, Method: Composition-based stats. Identities = 64/232 (27%), Positives = 110/232 (47%), Gaps = 15/232 (6%) Query: 44 TVRRRRLPGDMVIWMVVQN-----EPITDVVRRL-----NLSADGEAGMNLLARSAVTQA 93 R+R LP +V+ +V+ + + DV++ L + +SA+TQA Sbjct: 12 EERKRSLPAQLVVSLVIAMSLWSKDSMRDVLKNLIDGLSEAWLKVGKYWRVACKSAITQA 71 Query: 94 RQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANT 153 RQR+GA + LF Q + + L L++ IDG+ F PD E +G + Sbjct: 72 RQRLGARVMCKLFHQLVKPMATQETLGAFLQELRIVVIDGSCFDVPDSDENARVFGRPGS 131 Query: 154 STKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFY 213 + A+P +RLV L+ G+HI+ +A+ PYR E V A +L ++ + ++D+ + Sbjct: 132 RPGTKAAFPKVRLVILVEAGTHIIFDALMWPYRIGERVRALRLLRSVTPGMLLMWDRGLH 191 Query: 214 SEDLLLTLNQKGCNRHWLLPA-WKNIASEMIELGNTAS----PGTIPKRLEH 260 S ++ KGC+ +PA K IA + +E G+ S G + K+ Sbjct: 192 SYAMVQATVTKGCDYLGRIPANIKFIAEKPLEDGSYLSWIYPSGKLRKKASQ 243 >UniRef50_UPI00016A835E hypothetical protein BoklC_27358 n=1 Tax=Burkholderia oklahomensis C6786 RepID=UPI00016A835E Length = 231 Score = 203 bits (516), Expect = 7e-51, Method: Composition-based stats. Identities = 71/246 (28%), Positives = 121/246 (49%), Gaps = 46/246 (18%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR 71 PP + +HLP EWI+H + S A+VRRRRLP V+W+V+ +++ I++VV Sbjct: 20 PPLELERLGQHLPYEWIEHAVQASGSASVRRRRLPAQQVVWLVIALALYRHQSISEVVDE 79 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 L+L+ + +++SA+ QA+QR GA+P+ WLF ++A+ +W G + Sbjct: 80 LDLALPAP-DTSFVSKSAIAQAKQRTGASPLAWLFHESAR----------NWVGQDI--- 125 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 +YP + V L + + ++ +A PY +E + Sbjct: 126 ---------------------------GSYPQLHAVTLTAIATRLVRDAGFGPYDINEMI 158 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASP 251 A ++ +P+N IT+FDK F S LL L G NRH+++PA N E+ +++ Sbjct: 159 WARELIPRVPENPITVFDKGFLSAQLLCNLVAGGQNRHFIIPARSNPRGEISRRPTSSTA 218 Query: 252 GTIPKR 257 + R Sbjct: 219 TNVAGR 224 >UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4D726 Length = 464 Score = 201 bits (512), Expect = 2e-50, Method: Composition-based stats. Identities = 62/273 (22%), Positives = 108/273 (39%), Gaps = 11/273 (4%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEA 80 + +P + L + R R LP +V++ V+ D R + A A Sbjct: 1 MGVLTRWVPPVLVDEVLAATGRFEKRVRMLPARVVVYFVLAMTLFGDCGYR-GVWAALTA 59 Query: 81 GMNL-----LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQ 135 GM + +A+ QAR+R+G AP+ LF + G + WHGL++ A DG Sbjct: 60 GMPGHLVPDPSAAALRQARRRLGTAPLALLFDRVCGPVGTKETPGVFWHGLRVVAWDGTS 119 Query: 136 FRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHS 195 D +YG +T R YP +RL AL+ G+ L+ AV P E A Sbjct: 120 VEVADSAANVAHYGRHGKATSRPAGYPQVRLTALVECGTRALMGAVFGPMHDKELPQARR 179 Query: 196 MLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIP 255 +L + + L D+ + + + G + W + + ++ + G+ Sbjct: 180 LLPVLRPGILLLADRGYDGYEAIRDAASTGADLLWRVQSG-----RLLPVIQPLPDGSHL 234 Query: 256 KRLEHLRGALEVVFITKRPRPSRPRSVKISKTR 288 ++ R + +R RP+ P ++ R Sbjct: 235 SQILDRRSGDRLAAWQRRKRPTPPPALTAMAVR 267 >UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomycetales RepID=A8M893_SALAI Length = 451 Score = 200 bits (508), Expect = 6e-50, Method: Composition-based stats. Identities = 52/222 (23%), Positives = 87/222 (39%), Gaps = 5/222 (2%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRR--LNLSADG 78 +P E I L + R R LP +V+++++ D R G Sbjct: 22 LGELTRLVPFEMIDDVLAATRRTQRRVRLLPARVVVYLLLAGCLFADCGYRQVWAKLVAG 81 Query: 79 EAGMNL--LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQF 136 G+ + + SA+ QARQR+G AP+ LF W GL +DG Sbjct: 82 LRGLPVADPSDSALRQARQRLGPAPLRALFDLLRGPAATSAVAAVRWRGLLPVVVDGTMI 141 Query: 137 RTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSM 196 D P YG + + YP +RL AL+ G+ +++AV P E AH + Sbjct: 142 AVADSPANLGRYGKHRCNNGG-SGYPTLRLSALLTCGTRSVIDAVFDPSTTGEITQAHRL 200 Query: 197 LATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 ++ + L D+ + + DL+ G + + + + Sbjct: 201 TRSLRAGMLLLADRNYAAADLIGAFTATGADLLIRCKSGRKL 242 >UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia sp. EAN1pec RepID=A8L1S1_FRASN Length = 425 Score = 190 bits (483), Expect = 5e-47, Method: Composition-based stats. Identities = 54/232 (23%), Positives = 92/232 (39%), Gaps = 11/232 (4%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRR-RLPGDMVIWMVVQ-----NEPITDVVR 70 S + +P + + + R +LP + ++ + ++ +V + Sbjct: 23 DQVSVGVLVTAVPRDAVDEAVAACGVGARRAGGKLPPHVTAYLTLAMSLFPDDDYAEVAQ 82 Query: 71 RLNLSAD----GEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGL 126 ++ S D +A + S +TQAR+R+G + +F + A G Sbjct: 83 KVTGSLDRFGCWDAAWAPPSASGITQARKRLGRMVMAEVFERVAGQVATLSTRGAWLRGR 142 Query: 127 QLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR 186 L AIDG PD E +G A T KR +A+P +R+VAL G+H A + Sbjct: 143 LLLAIDGFDVDVPDTEENAAEFGYAGTGEKR-SAFPKIRVVALAECGTHAFRAAEVGGWA 201 Query: 187 QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 E LA +L + + + D+ FYS D G + W P N+ Sbjct: 202 AGERTLARGLLMRLNRDEVLTADRGFYSFDNWALAAGTGADLIWRAPTGLNL 253 >UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 Tax=Streptomyces avermitilis RepID=Q82R31_STRAW Length = 542 Score = 190 bits (482), Expect = 6e-47, Method: Composition-based stats. Identities = 54/242 (22%), Positives = 94/242 (38%), Gaps = 7/242 (2%) Query: 15 LMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNE--PITDVVRRL 72 + P + LP E + L + A R R LP + ++ V+ P VR Sbjct: 20 IFAPGHLGELTQQLPFELVDDVLERAGGAQHRLRLLPSRVGVYFVLALALFPQLGYVRVW 79 Query: 73 NLSADGEAGM--NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 + G G+ + A+ + R+R+G AP+ LF A + + A Sbjct: 80 DKLTAGLRGILHRRPSEKALREVRRRLGVAPLRLLFETLAGPVAQPITPGVRYRCWRTVA 139 Query: 131 IDG-AQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSE 189 DG + + PD+P + + G + YP+++++ L G+ LL AV P + E Sbjct: 140 FDGCSSTKAPDRPRVCAWLGKHKHRYG-TDGYPMLKIMVLCETGTRALLGAVFGPTPEKE 198 Query: 190 TVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLP-AWKNIASEMIELGNT 248 T A +L + + L D+ F S+D L G L ++ G+ Sbjct: 199 TGYAEQLLPLLDGGMLLLNDRGFDSDDFLAKAAATGAQLLVRLKGTRTPARWALLPDGSF 258 Query: 249 AS 250 + Sbjct: 259 LT 260 >UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EK95_ACIF5 Length = 369 Score = 180 bits (456), Expect = 6e-44, Method: Composition-based stats. Identities = 54/209 (25%), Positives = 92/209 (44%), Gaps = 12/209 (5%) Query: 53 DMVIWMVVQNEPITDVVRR--LNLSADGEA--------GMNLLARSAVTQARQRVGAAPV 102 +++++ V+ V L L DG ++++ A++QAR +VGAAP+ Sbjct: 19 EVLVYFVLAMVLYASVAYEEVLQLVVDGLRPLLGDDRLAQTVVSKGAISQARAKVGAAPL 78 Query: 103 EWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYP 162 + L++ Q G + GL+L AIDG+ PD+ E +G +S A+P Sbjct: 79 KTLYQNQVQPHGPLGMAGVGYKGLRLMAIDGSTLDMPDEAANAERFGYPASSRG-SAAFP 137 Query: 163 VMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLN 222 +R VA+ G+H L A Y QSE LA ++A + D+ FYS Sbjct: 138 QLRFVAMAECGTHTLCYAEMGSYEQSERTLAGPVMAHADATMLITADRNFYSYAFWQQSL 197 Query: 223 QKGCNRHWLLPAWKNI-ASEMIELGNTAS 250 G + L + + +++ G+ S Sbjct: 198 ATGARLLFRLSSVLKLPREKILADGSYLS 226 >UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomycetales RepID=A8KXP7_FRASN Length = 421 Score = 177 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 55/265 (20%), Positives = 105/265 (39%), Gaps = 16/265 (6%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 MP + + + L+ + A P + + + R R LP + + VV Sbjct: 1 MPRRGQVKEKPEDRLVDRVGLGVLAAQFPDALVDRVVAETGRRERRTRDLPAALTLRYVV 60 Query: 61 Q-----NEPITDVVRRLNLSADGEAG----MNLLARSAVTQARQRVGAAPVEWLFRQTAQ 111 ++ +V+R++ ++ D + + + A +A+T+AR R+G PV+ LF +TA Sbjct: 61 ALALFPSDGYDEVMRQVKVADDWLSDKAGPVKVPATTAITKARDRLGVEPVKLLFERTAV 120 Query: 112 DRG-AERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALM 170 R + + G ++ +DG PD E +G A P +R++ L+ Sbjct: 121 PMALPRRTVGAFYRGWRVCTVDGTTLLVPDTDENAAAFGKPGNDQGE-GALPQVRVLGLV 179 Query: 171 NLGSHILLNAVTA----PYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGC 226 G+ LL A SE L +L + + L D+ F +L G Sbjct: 180 ECGTRALLGAGFGGTGGSKAASEQALFPDLLGALRPGMLVLADRNFLGFELFAKAAATGA 239 Query: 227 NRHWLLPAWKNIASE-MIELGNTAS 250 + W + + + + + G+ S Sbjct: 240 DLLWRAKSDRRLPIDTELADGSYLS 264 >UniRef50_Q2J8F5 Putative uncharacterized protein n=3 Tax=Frankia sp. CcI3 RepID=Q2J8F5_FRASC Length = 451 Score = 176 bits (446), Expect = 8e-43, Method: Composition-based stats. Identities = 62/291 (21%), Positives = 103/291 (35%), Gaps = 15/291 (5%) Query: 15 LMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRR-RRLPGDMVIWMVVQNEPITD-----V 68 L S + +P + + + + R +P +V + V+ D V Sbjct: 12 LTDWISLGVLTSFVPRDAVDEAIEATGAGARRSDTTIPPQVVAYFVMALALFADDDYETV 71 Query: 69 VRRLNLSADGEAGMNL---LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHG 125 RRL + + S +T+ARQR+GAAP+ LF Q A + Sbjct: 72 ARRLAATLTDLDVVGPRWEPTSSGLTKARQRLGAAPLAELFGQVAGPVADLDTVGAFLSR 131 Query: 126 LQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY 185 +L +IDG ++ P E +G P +R V + SH + A P Sbjct: 132 WRLMSIDGLEWDAPASKENIAAFGLPAGRVDAPGVLPKVRAVTVSECASHAPVLAAFGPA 191 Query: 186 R----QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI-AS 240 SE LA ++ + + + L D+ FYS T G W + A + Sbjct: 192 GGAKPASEQALARTVYPRLASDWLLLADRNFYSWADWCTAADTGAALLWRVKATLRLPPL 251 Query: 241 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRP-RPSRPRSVKISKTRYP 290 + G+ + PK R L P P++ R ++ + P Sbjct: 252 RALSDGSYLTVLVNPKVTGKARETLVTAARAGAPLDPTKARYTRLVEYDVP 302 >UniRef50_D2ASB5 Transposase, IS4 family n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2ASB5_STRRD Length = 356 Score = 161 bits (406), Expect = 4e-38, Method: Composition-based stats. Identities = 58/267 (21%), Positives = 103/267 (38%), Gaps = 19/267 (7%) Query: 12 DHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD---- 67 D L S P + + +S A RRR LP + I+ V+ +D Sbjct: 7 DGRLADQLSIGFLTSVFPISLLDEVIGVSGCAERRRRALPARLTIYYVLALCLFSDKNYD 66 Query: 68 -VVRRLNLSADGEA----GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDD 122 V+R L + + SA+++AR R+GA P+ LF + + + Sbjct: 67 QVMRLLLNGLAWRSRWVYTWEPPSASAISRARARLGAEPLRVLFCRVTGPVAEPQASRSW 126 Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 GL+ +DG P+ + +G + + + +P +R+VA+ G+H L++A Sbjct: 127 LAGLRPVTMDGTTLVVPETRDNSA-FGYPDGAAR----FPCVRVVAVAENGTHALIDATF 181 Query: 183 APYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA-SE 241 E LA +L + + + L + +L + G + W + + Sbjct: 182 GSSAVEERTLARRLLRCLESDMLLLARSGRWGFELWRQAAETGTHLLWGVTGADALPIGR 241 Query: 242 MIELGNTASP----GTIPKRLEHLRGA 264 E G+ S G P R+ L GA Sbjct: 242 SFEDGSYLSRPAGLGGAPLRVIPLPGA 268 >UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Tax=Gammaproteobacteria RepID=A6UXI0_PSEA7 Length = 423 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 46/210 (21%), Positives = 85/210 (40%), Gaps = 13/210 (6%) Query: 46 RRRRLPGDMVIWMVVQNEPITDVVRRLN----LSADGEAGMNLLARSAVTQARQRVGAAP 101 RRR+L ++ ++ N+P T + L+ + ++ A +AR+++ Sbjct: 29 RRRQLTFKNLVLFLL-NQPRTALQTELDQFYRVLNQASTETQMVTAQAFCKARKKLNPEV 87 Query: 102 VEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAY 161 E L R Q L+ W GL++ A+DG+ P + + ++GS + + Sbjct: 88 FESLNRLLQQQIDCFG-LRQKWRGLRVLAVDGSTVHLPLESTMATFFGS-------HSGF 139 Query: 162 PVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTL 221 P+ RL L + L+++ P E AH L +P +S+TLFD+ + L Sbjct: 140 PMARLSTLYEVADGQTLHSLIVPLTVGERDCAHLHLEHLPADSLTLFDRGYPGHWLFALF 199 Query: 222 NQKGCNRHWLLPAWKNIASEMIELGNTASP 251 Q+ + LP N + Sbjct: 200 AQQQRHFLMRLPCGYNAQVKAFLHSGQVED 229 >UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitrococcus mobilis Nb-231 RepID=A4BL98_9GAMM Length = 426 Score = 147 bits (372), Expect = 4e-34, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 73/176 (41%), Gaps = 4/176 (2%) Query: 68 VVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQ 127 V R L+ N + +ARQR+ AP+E R++ Q W G + Sbjct: 41 VARVLSERLQSGQSANSINTGPYCKARQRLPRAPLENAVRESGQTLHQRAPSAWGWRGHR 100 Query: 128 LFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR- 186 + DG PD + + + + +P++R+VAL++LG+ +L+ PY+ Sbjct: 101 VVLADGTTALMPDTLDNQREFPQQGNQ-QPGLGFPIVRIVALISLGAGAVLDYALGPYQG 159 Query: 187 --QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 E+ L ++L T+ + L D+ + + ++ L G + A + Sbjct: 160 KGSGESSLFSTLLHTLQPGDLLLADRYYCTYAIMALLVHHGVQGLFQKHAQRKPHW 215 >UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZMB0_PLALI Length = 497 Score = 147 bits (371), Expect = 4e-34, Method: Composition-based stats. Identities = 48/229 (20%), Positives = 83/229 (36%), Gaps = 16/229 (6%) Query: 14 PLMPPPSAQLFAEHLPTEWI----QHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVV 69 P + + E E + C++ A + +W ++ TDV Sbjct: 28 PFSDALTTRQLEEVFEAEEVSFGRDPCVSEQASIEDGGLVYTRGVTLWAMLSQALFTDVQ 87 Query: 70 RRLNLSADGEAGMNLLA--------RSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKD 121 R + A L+ A +AR ++ V+ L Q A K Sbjct: 88 RACRAAVQRVAVYYALSGIRISSTNTGAYCRARAKIPEGVVQRLAVGVGQRCEAAVPDKW 147 Query: 122 DWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV 181 WHG + IDG PD E + Y ++ + +P++R VAL +L + ++L V Sbjct: 148 RWHGFRTLVIDGTTCSMPDTQENQAEYPQPSSQ-GKGLGFPILRAVALTSLATGMILALV 206 Query: 182 TAPY---RQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCN 227 T P ET L ++ + + L D+ + +L L + G Sbjct: 207 TGPCAGKATGETALFRTLFDQLKAGDLVLSDRYYGGWFMLALLQELGVE 255 >UniRef50_Q82R32 Putative IS4 family ISFsp5-like transposase n=1 Tax=Streptomyces avermitilis RepID=Q82R32_STRAW Length = 262 Score = 147 bits (370), Expect = 6e-34, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 73/197 (37%), Gaps = 12/197 (6%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNL- 74 P + +P + + L + R R LP + ++ ++ +V RL Sbjct: 6 FAPGHLGELTQVIPFDLVDAVLDETRCVQRRLRDLPSRVGVYFLLAMCLFPEVGYRLVWH 65 Query: 75 -SADGEAGMNL----LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLF 129 G+ A+ R+R+GA P++ +F A + ++ Sbjct: 66 KLTAALTGVGFEVAEPTAKALRDLRRRLGAEPMKRVFETLAGPLAQPVTPGVRFGPFRMA 125 Query: 130 AIDG-AQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQS 188 + DG + + PD E++G + YP++ L+ L+ G+ L+ AV Sbjct: 126 SFDGCSSIKLPDTERNVEWFG-----PGSRGGYPMLELMTLVETGTRALIGAVFGTPSDG 180 Query: 189 ETVLAHSMLATIPDNSI 205 ET A +L + + Sbjct: 181 ETSYARRLLHHLGPGML 197 >UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=Q648P8_9ARCH Length = 464 Score = 140 bits (353), Expect = 6e-32, Method: Composition-based stats. Identities = 56/265 (21%), Positives = 100/265 (37%), Gaps = 28/265 (10%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD------VVRRLNLSAD 77 F++ L E I++ + + R R + + + +D V + L Sbjct: 31 FSDVLSAETIRNIMDEE-VGSYRDRIYSPLITLSAFLSQVLSSDHSCKNAVAKVLAERVA 89 Query: 78 GEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFR 137 + +AR R+ V L R+T + + W G + +DG Sbjct: 90 QGKLPCSSNTKSYCEARLRLPINLVRRLVRETGKLLHLKSEEAWKWKGRSVKLVDGTTVS 149 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAH 194 PD PE ++ Y K +P+ RLVA+++L +L+ PY+ E L Sbjct: 150 MPDTPENQKMYPQPEGQ-KEGVGFPIARLVAIISLSCGAVLDIAIGPYKGKETGEHALLR 208 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTI 254 +L +I I L D+ + S L++ L Q G + + + + Sbjct: 209 QILGSISTGDILLGDRYYCSYFLIVMLQQLGADSVFRIHGSRKKDFR------------- 255 Query: 255 PKRLEHLRGALEVVFITKRPRPSRP 279 R +HL G + + I K+P+ RP Sbjct: 256 --RGKHL-GKKDHIVIWKKPK-QRP 276 >UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 Tax=Streptomyces rishiriensis RepID=Q7BLZ8_9ACTO Length = 341 Score = 140 bits (352), Expect = 7e-32, Method: Composition-based stats. Identities = 37/144 (25%), Positives = 61/144 (42%), Gaps = 3/144 (2%) Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTK-RQNAYPVMRLVALMNLGSHILLNAV 181 + G +L A+DG F PD ++G S ++AYP +RL AL G+H + A Sbjct: 1 YRGWRLVAVDGTTFDVPDTEANAAFFGRPGVSRGQEKSAYPQVRLAALAECGTHAVFAAE 60 Query: 182 TAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 P ET LA + ++ + L D+ F DL G + W + + Sbjct: 61 AGPLAVHETELAQRLFGSLTPGMLLLADRGFRGFDLWRAAAATGADLLWRVKNDAVLPVR 120 Query: 242 -MIELGNTASPGTIPKRLEHLRGA 264 ++E G+ S + R ++ R Sbjct: 121 TLLEDGSYLSE-IVAARDKNRRAD 143 >UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminococcus torques ATCC 27756 RepID=A5KKC4_9FIRM Length = 422 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 48/181 (26%), Positives = 82/181 (45%), Gaps = 4/181 (2%) Query: 84 LLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPE 143 +++ A+++ARQ + LFR + + + W+G ++A+DG+ + P+ E Sbjct: 78 FVSKQAISKARQGISHKAFLELFRLSVKQFYFQPVNLRTWNGFHIYAVDGSTIQIPESKE 137 Query: 144 LREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP-- 201 E +G TK + P+ L ++ + IL++ PYR +E A + + +P Sbjct: 138 NYEVFGGNPNKTKIIS--PLASASVLYDVINDILIDVSLHPYRYNERESAKAHVDFLPRF 195 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHL 261 NSI LFD+ + SED+ LN KG +P A E P + K L Sbjct: 196 PNSIILFDRGYPSEDMFHYLNSKGILFLMRVPKTFKKAISEQEDALFTYPASCNKESLTL 255 Query: 262 R 262 R Sbjct: 256 R 256 >UniRef50_D0SX83 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0SX83_ACILW Length = 140 Score = 131 bits (329), Expect = 3e-29, Method: Composition-based stats. Identities = 48/125 (38%), Positives = 68/125 (54%), Gaps = 7/125 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 M D+LD ++ L + F ++P EW++ L LS+ AT+RRR LP D V+W+V+ Sbjct: 10 MIFQQDILDLNN--LFKLSNLSTFIHNIPVEWVKSTLRLSSPATIRRRCLPADQVLWLVL 67 Query: 61 QNEPITDVV-----RRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA 115 DV+ RRLN+ A +LL R ++T R+ +GA VEWLF QT Q G Sbjct: 68 GMAIFRDVLIHEAARRLNICTQWLASYDLLTRISLTNTRKHLGADSVEWLFHQTDQHWGQ 127 Query: 116 ERYLK 120 E Y Sbjct: 128 EHYPA 132 >UniRef50_A1JS05 Transposase for insertion sequence element IS1665 n=4 Tax=Yersinia enterocolitica subsp. enterocolitica 8081 RepID=A1JS05_YERE8 Length = 261 Score = 130 bits (326), Expect = 7e-29, Method: Composition-based stats. Identities = 47/145 (32%), Positives = 71/145 (48%), Gaps = 8/145 (5%) Query: 8 LDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QN 62 LD ++L + I CL S T+R+RRLP +M++W +V + Sbjct: 7 LDLVSRYDSLRNPLTTLGDYLYPQLISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERK 66 Query: 63 EPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDD 122 EP+ +V RL++ G+ +A SAV QARQR+G+ V +F QTAQ Sbjct: 67 EPLHQIVNRLDIMLPGDR--PFVAPSAVIQARQRLGSEAVRRVFSQTAQLWHGSVT-HPH 123 Query: 123 WHGLQLFAIDGAQFRTPDKPELREY 147 W GL L A+DG ++T + E + Sbjct: 124 WCGLTLLAVDGVVWQTDNATEQADA 148 >UniRef50_UPI00016C3BAD transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3BAD Length = 258 Score = 119 bits (299), Expect = 9e-26, Method: Composition-based stats. Identities = 33/145 (22%), Positives = 60/145 (41%), Gaps = 4/145 (2%) Query: 86 ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELR 145 A A +AR ++ A + L Q+ + + W G ++ DG PD P + Sbjct: 94 ATGAYCKARAKLPVALLSRLATQSGDELERHAPKEWQWKGRRVLLGDGTTLSGPDTPANQ 153 Query: 146 EYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSMLATIPD 202 Y T+ KR +P++R+V L+ + L+ A P + E L +L Sbjct: 154 AAYPQH-TNQKRGLGFPLIRVVVLLGFATGALVGAAIGPAKGKEAGEMALLRELLDRFQA 212 Query: 203 NSITLFDKLFYSEDLLLTLNQKGCN 227 + + D+ + S L+ L +G + Sbjct: 213 GDVFVADRAYCSYWLVSALQARGVD 237 >UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XCY0_9BACT Length = 481 Score = 117 bits (293), Expect = 5e-25, Method: Composition-based stats. Identities = 52/238 (21%), Positives = 88/238 (36%), Gaps = 21/238 (8%) Query: 10 FSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPI---- 65 ++ PL P LFA +P + + A R R W + Sbjct: 23 LAEQPL--PQLEALFAPFIPEQLLSRA-----GANSRERFYTLRQTFWAFLWQALHPGTA 75 Query: 66 -TDVVRRL--NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDD 122 +VVR+L + A +A +ARQR+ P+E L A + Sbjct: 76 CREVVRQLLSDWQAQAGRTRAQAGTAAYCRARQRL---PLERL---QAILQATLGPEPPR 129 Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 W G + +DG F PD ++ + + K +P +++VAL +L S + LN Sbjct: 130 WRGHAVKLVDGTTFSLPDTAANQKKFPQSGAQ-KPGCGFPTLKVVALFSLASGLALNWAR 188 Query: 183 APYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 R E L + + + + + D+ F S L L +G + + L K + Sbjct: 189 GSLRVHEIPLFRKLWSGLRRRDLIIGDRGFSSYTNLALLLGRGVDCLFRLHQGKKVRH 246 >UniRef50_UPI00016C37A0 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C37A0 Length = 334 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 67/196 (34%), Gaps = 22/196 (11%) Query: 48 RRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFR 107 R ++ W+VV+ EP G G A + L R Sbjct: 62 RLAVARVLAWLVVRGEP---------PCGPGTGGYCKPAPGC---------PRAIPQLAR 103 Query: 108 QTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLV 167 T + W+G ++ DG PD P+ + Y + +P +RLV Sbjct: 104 HTGRGLHDRAPGNWRWNGRRVLIADGTTVTMPDTPKNQNEYPHPGSQADGI-GFPQIRLV 162 Query: 168 ALMNLGSHILLNAVTAPYR---QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQK 224 AL L +L+A P R ET L + ++ ++ L D+ F L+ ++ Sbjct: 163 ALFCLACGAVLDAALGPSRGKQSGETALRRQIAGSVGSGTVLLADRYFGGWFDLVLWRER 222 Query: 225 GCNRHWLLPAWKNIAS 240 G + + + Sbjct: 223 GIDVVTRIHQKRATDF 238 >UniRef50_B8FEP3 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FEP3_DESAA Length = 422 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 79/205 (38%), Gaps = 17/205 (8%) Query: 46 RRRRLPGDMVIWMVVQNE------PITDVVRRLNLSADGEAG--MNLLARSAVTQARQRV 97 R R L +V+ +++ +V+ R +A G + +SA +AR++V Sbjct: 23 RNRILTLPVVLALILNMVRPGKRVGYDEVLARFFAAASLMNGQNITPPDKSAFCRARKKV 82 Query: 98 GAAPVEWLFRQTAQD--RGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTST 155 + L+ + + A + W G ++ AIDG + P EL + +G + Sbjct: 83 PFEALTELYGKALEHAKDLAAKAPGTTWRGRRVLAIDGTKIMLPRTKELLDAFGKCS--- 139 Query: 156 KRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSE 215 +P L ++ + + L+ Y+ E LA M I + D+ F Sbjct: 140 --HGWFPQTHACVLYDVLAGLPLDVAWGHYKSGERGLARDMFDGFLPGDILVLDRGFPGF 197 Query: 216 DLLLTLNQKGCNRHWLLPAWKNIAS 240 L L ++G + +++ + Sbjct: 198 AFFLDLMEQGID--FIVRLRGDGQF 220 >UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria RepID=Q12AI7_POLSJ Length = 458 Score = 114 bits (284), Expect = 6e-24, Method: Composition-based stats. Identities = 45/196 (22%), Positives = 78/196 (39%), Gaps = 10/196 (5%) Query: 44 TVRRRRLPGDMVIWMVVQNEPITD--VVRRLNLSADGEA--GMNLLA--RSAVTQARQRV 97 R R P + + M ++ D + +N A A G+ + +ARQR+ Sbjct: 47 EHRERLYPPTVALSMFMRQVLEADGSCQKAVNGWAAQRAADGLRPCSVRTGGYCRARQRL 106 Query: 98 GAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKR 157 V L R+T + + + W G + +DG PD PE +E Y +T Sbjct: 107 PLEMVGTLTRETGRLLHEKALAQWLWRGRAVKLVDGTGISMPDTPENQERYPQPSTQA-P 165 Query: 158 QNAYPVMRLVALMNLGSHILLNAVTAPY---RQSETVLAHSMLATIPDNSITLFDKLFYS 214 +P+ RLV ++ L + L+ P+ E L +LA + L D L+ + Sbjct: 166 GVGFPLARLVMVICLATGAALDMAVGPHSGKGSGELGLVRRLLAGFCPGDVMLADALYCN 225 Query: 215 EDLLLTLNQKGCNRHW 230 L+ +L G + + Sbjct: 226 YFLIASLMAAGVDVLF 241 >UniRef50_UPI00016C48B0 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C48B0 Length = 202 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 41/181 (22%), Positives = 72/181 (39%), Gaps = 2/181 (1%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPI-TDVVRRLNLSADGEAGM 82 + LP E + +T S+ + RL ++W VV D R++ + Sbjct: 23 LKQLLPRELMAEVVTESSLPSNFCCRLLNWFMLWFVVGIGLFSRDSYRQVFKWLNPFRPK 82 Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 RS + AR R+G APV L + H +L +DG D Sbjct: 83 GTPERSTLCMARVRLGVAPVRRLQERVTALLATRATPGAFHHQYRLMGLDGFAADLADSA 142 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD 202 +G + + A+P R+++L LG+H+L ++ P R+ E +A ++L + Sbjct: 143 ANTRAFGHPGSG-RATGAFPQARVLSLCELGTHVLWRSLIKPCRRGEVTMAPALLRHLTS 201 Query: 203 N 203 Sbjct: 202 E 202 >UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZQ0_9PLAN Length = 457 Score = 110 bits (274), Expect = 8e-23, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 76/208 (36%), Gaps = 12/208 (5%) Query: 44 TVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADGEAGMNLLA--RSAVTQARQR 96 + R R +V+WM V + VV RLN G++ + ++ QAR+R Sbjct: 43 SFRERIYSPMIVVWMFVMQTLSADHSCQQVVTRLNAW-RLAQGLSRCSGDTTSYCQARRR 101 Query: 97 VGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTK 156 + A + L TA+ + G ++ +DG D + + K Sbjct: 102 LPIALFQRLLAWTARKCDEAGLGDWRYQGREVIIVDGTTVTMADTRANQTAFPQIENQ-K 160 Query: 157 RQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSMLATIPDNSITLFDKLFY 213 +P+ R+V + +L + Y ET L ++L+ I L D+ + Sbjct: 161 PGCGFPLARIVQVFSLATGAATMFAMGRYAGKETGETSLLRTLLSQFHSGEIVLADRYYA 220 Query: 214 SEDLLLTLNQKGCNRHWLLPAWKNIASE 241 S LL + +G + + I Sbjct: 221 SFWLLALSDLRGIDIVARAHHRRKIDFR 248 >UniRef50_UPI00016C385B hypothetical protein GobsU_16554 n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C385B Length = 454 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 41/160 (25%), Positives = 64/160 (40%), Gaps = 7/160 (4%) Query: 86 ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELR 145 S+ + ARQ + W ++ R W G ++F IDG +PELR Sbjct: 89 NTSSYSDARQWLPLEAARWFADHVSRAR--IDGAPPTWSGRRVFLIDGTTRTLAPEPELR 146 Query: 146 EYYGSANTSTKRQNAYPVMRLVALMNLGSHILL----NAVTAPYRQSETVLAHSMLATIP 201 E Y T+ + +PV LV L S + A P+ SET LA +++ +P Sbjct: 147 EKYP-PATNPHGRGVWPVALLVVAHELSSGAAVVPEVGATFGPHAVSETALAGAVMDRLP 205 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 N + + D F + L +G + L A + A Sbjct: 206 ANGVVMADAGFGIFAVALGARARGLGFVFRLTAARFTAYR 245 >UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A1U3_PELCD Length = 489 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 84/213 (39%), Gaps = 13/213 (6%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWM-----VVQNEPITDVVRRLNLSA 76 ++F + +P ++ L+ A RRR + W + + +V+R+L A Sbjct: 44 EVFEKFIPLALLKPELS---GAMSRRRLFSKENTFWAFFSQVLDADGGCKEVIRKLQSYA 100 Query: 77 DGEAGMNLLARS--AVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGA 134 + G+ + + S + AR+++ + + TA+ + ++ DG Sbjct: 101 SIK-GIKVPSSSTASYCTARKKLAEPMLADILAHTAEQLEKMPATGM-LNNRRVIVADGT 158 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 PD PE + + +++ K +P R+ A +L S LL+ + +E L Sbjct: 159 GVSMPDTPENQAAWPQ-SSALKPGCGFPSARICACFSLDSGALLSYAIGNKKNNELPLFR 217 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCN 227 T I L DK F S + L +G + Sbjct: 218 QQWETFNPGDIFLGDKGFCSYFDIAKLQDRGVD 250 >UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanobacteria RepID=B0CC46_ACAM1 Length = 482 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 80/210 (38%), Gaps = 10/210 (4%) Query: 25 AEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADGE 79 + LP ++ L A + R R + +W ++ ++ + + V+ + Sbjct: 38 TDILPASRLEELLKEEA-FSYRNRIYSPIVTLWAMLYQVLSADKSLRNTVKCITTWL-TA 95 Query: 80 AGMNLLA--RSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFR 137 AG+ + A ++AR R + ++ L ++A+ + W G + DG Sbjct: 96 AGIQPPSSDTGAYSKARSRFPESLLQRLIPESAECLAQPLSPEHLWCGRPVKVYDGTTVL 155 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 D + Y T +P+ RLV L + + +A A + SE V++ + Sbjct: 156 MADSAANQASYPQHGNQT-AGCGFPIARLVVFFCLVTGAVASACIASWDTSEIVMSRLLY 214 Query: 198 ATIPDNSITLFDKLFYSEDLLLTLNQKGCN 227 + + + D+ + S L + Q + Sbjct: 215 QDLEVGDVVMADQAYGSYVDLAIIQQHRAD 244 >UniRef50_Q3SHG4 Putative uncharacterized protein n=1 Tax=Thiobacillus denitrificans ATCC 25259 RepID=Q3SHG4_THIDA Length = 255 Score = 100 bits (250), Expect = 5e-20, Method: Composition-based stats. Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 8/164 (4%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD------VVRRLNLS 75 +F + LPTE I + SA R R P + ++ D V RRL+ Sbjct: 53 GVFEQVLPTEEIMGTIEESAPV-FRHRHYPPLTTLRHFIEQVLSEDQACQDVVGRRLSER 111 Query: 76 ADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQ 135 L SA QARQR+ V+ L+R T + W G +L DG Sbjct: 112 VGQRQSTCSLNTSAYCQARQRLPQEMVDRLYRTTGERLETRLPKSWRWRGRRLVLFDGTT 171 Query: 136 FRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLN 179 PD + + + + +PV RL L+ L S +L Sbjct: 172 VSMPDTLASQCAFPQ-SAEQQPGLGFPVARLSGLIGLASGAVLG 214 >UniRef50_UPI00016C5887 hypothetical protein GobsU_05723 n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5887 Length = 321 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 65/184 (35%), Gaps = 12/184 (6%) Query: 53 DMVIWMVV-----QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFR 107 +V+W++V N + D V A A AR R+ A V R Sbjct: 26 WVVLWLLVYQRLHGNGSLGDAVSHFLTQFPSAAEQPSGATGGYRHARTRLPNAVVATAGR 85 Query: 108 QTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLV 167 + A W G ++F +DG R LR + + ++ ++ +PVM LV Sbjct: 86 RVFDTLVAAYPPS--WRGRRVFMMDGTTLRLAPTDALRGAF-TPASNQHGRSHWPVMHLV 142 Query: 168 ALMNLGSHIL----LNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQ 223 L S + A+ P L ++ +P S+ L D+ F L Sbjct: 143 VAHELASGLAAPPQHGAMYGPGAVGAVQLGLRLMPDLPPGSVILGDRNFGVFGLAHGAVA 202 Query: 224 KGCN 227 G + Sbjct: 203 GGHD 206 >UniRef50_Q82QT3 Putative uncharacterized protein n=1 Tax=Streptomyces avermitilis RepID=Q82QT3_STRAW Length = 182 Score = 97.4 bits (241), Expect = 5e-19, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 48/111 (43%), Gaps = 2/111 (1%) Query: 141 KPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATI 200 E++G + ++A+ R+VAL G+H + AV P L+ + + Sbjct: 21 TWANEEFFGR-QAGGRGESAFAQARVVALAECGTHAVFGAVIGPAVGGRAELSRQLFPQL 79 Query: 201 PDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS-EMIELGNTAS 250 + + L D+ FY +L T G + W L + + ++++ G+ S Sbjct: 80 GEGKLLLADQGFYGFELWQTARATGADLLWRLRSSAAVPGLQVLDDGSYLS 130 >UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IS08_9CHRO Length = 472 Score = 97.1 bits (240), Expect = 6e-19, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 85/210 (40%), Gaps = 8/210 (3%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD------VVRRLNLSAD 77 F + L +E I+ L + R ++IW + D V R ++ A Sbjct: 23 FQKLLKSEIIEDILKEMG-VKYKSRIYNPIVIIWSFLSQVLDPDHSCQNAVSRIISYLAS 81 Query: 78 GEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFR 137 SA QAR+++ ++ L +A+ + K WHG + +IDG+ Sbjct: 82 EGIETPSENTSAYCQARKKLPEELLKKLLEISAKGNEEKVDKKHLWHGRCVKSIDGSTVS 141 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 PD + +E Y + K+ +P+ ++ L + + ++ V ++ + LA + Sbjct: 142 MPDSLKNQEAYPQHGSQ-KKGCGFPLAKIGVLFSYATGSVVGIVIDIFKTHDIKLARKLT 200 Query: 198 ATIPDNSITLFDKLFYSEDLLLTLNQKGCN 227 + I L D+ F S + + +KG + Sbjct: 201 DYLDAGDILLGDRAFCSYIDIYSWKKKGID 230 >UniRef50_Q7TTE4 Putative uncharacterized protein n=9 Tax=Planctomycetaceae RepID=Q7TTE4_RHOBA Length = 457 Score = 95.9 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 79/227 (34%), Gaps = 17/227 (7%) Query: 20 SAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQ-NEPITDVVRRLNLSADG 78 S +L + + E AH TV + M+++ ++ + + + V+ L + Sbjct: 17 SFELLQQLVNFEDANKLFEQQAH-TVYTACVVLWMLVYQRLKPDASLENAVKHLLDTRPT 75 Query: 79 --------EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 E +A ++AR R+ V W + + + D ++F Sbjct: 76 YLPENKRLEDNTLSVATGGYSRARSRLPLEVVRWFAEEVSSGILSATEPAVD--EQRVFL 133 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY----R 186 IDG + EL++ + A+ +P + L L S + P Sbjct: 134 IDGTTLALAPEKELQQAFPPASNQLGE-GVWPCVLLTVFHELASGAAMLPQVGPMYGPEA 192 Query: 187 QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLP 233 SET LA +P+NSI + D F + G + + Sbjct: 193 ISETQLARQGFEQLPENSIIMSDAGFGIFGIAHGAIDAGHDILLRMK 239 >UniRef50_A6CCZ3 Transposase, IS4 (Fragment) n=7 Tax=Planctomyces maris DSM 8797 RepID=A6CCZ3_9PLAN Length = 531 Score = 90.1 bits (222), Expect = 8e-17, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 74/208 (35%), Gaps = 33/208 (15%) Query: 54 MVIWMVVQNEPITD--------VVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWL 105 + +W ++ + V+R + A + A +AR ++ + + Sbjct: 100 ITLWALISQVFFSGEQRSCKAAVIRVASFWAALGRRVCSTNTGAYCRARLKLSFTAIREI 159 Query: 106 FRQTAQDRGAE---------------------RYLKDDWHGLQLFAIDGAQFRTPDKPEL 144 +Q A D A +K G ++ +DG D PE Sbjct: 160 VQQLAADAEAACDQNCVQSQEQSAARLSPSNVADVKSRSTGGRILLVDGFTITAADTPEN 219 Query: 145 REYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY---RQSETVLAHSMLATIP 201 + Y K +PV+R V+L+++ + +L++ V+ PY ET L ML + Sbjct: 220 QRAYPQNPAQ-KPGLGFPVLRCVSLISMTTGLLVDLVSGPYSGKGSGETALLWQMLDVLR 278 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRH 229 + D + + L+ + +G Sbjct: 279 PGDTLVADSYYCTYWLVSACHARGVQIL 306 >UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultured bacterium BLR12 RepID=C0ING1_9BACT Length = 337 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 36/162 (22%), Positives = 67/162 (41%), Gaps = 4/162 (2%) Query: 116 ERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSH 175 E W+GL+L AIDG+ P + E +G N + V R L ++ + Sbjct: 8 ESAPYLTWNGLRLLAIDGSTAVLPGHKSITEEFGITNFGPYANSPRSVARTSVLYDVLNL 67 Query: 176 ILLNAVTAPYRQSETVLAHSMLATI-PDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPA 234 +L+ Y E LA A + P + LFD+ + S L+ + +G H+L+ Sbjct: 68 TVLDGQIDRYDSCERNLARQHFAQVKPATDLLLFDRGYPSLGLMFEMQAQG--IHYLIRM 125 Query: 235 WKNIASEMIEL-GNTASPGTIPKRLEHLRGALEVVFITKRPR 275 ++ ++ ++ N + + +L L + TK + Sbjct: 126 REDWWLDVRKMLANGETDKEVTFKLPATERDLLNKYATKNDK 167 >UniRef50_C6N0W0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N0W0_9GAMM Length = 453 Score = 86.7 bits (213), Expect = 9e-16, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 83/213 (38%), Gaps = 15/213 (7%) Query: 31 EWIQHCLTLSAHA-TVRRRRLPGDMVIWMVV------QNEPITDVVRRLNLSADGEA-GM 82 I+ R+R + ++ ++ ++ ++ L +++ A Sbjct: 14 AMIEEVCEDFDKVWQTRKRVINTQFLVTFILKLVLSKNSQGYKILLNELWETSEFSALQE 73 Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 ++ S++ +ARQ++ L Q E W ++F +DG++ P Sbjct: 74 QPVSASSICEARQKMPETIF-TLINQKVLAMREESDTLPLWRNHRVFGVDGSRINVPH-- 130 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD 202 E + + +Q YP + L +LGS ++ + + P + E + S + + Sbjct: 131 ---ELLEAGYKAPIKQQYYPQGLMSTLYHLGSGLIYDGILEPVK-GERICLLSHMEKLTL 186 Query: 203 NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 + + D+ ++S +L+ ++G + + + Sbjct: 187 GDVLVLDRGYFSYLILVKAIERGIHLICRMQSG 219 >UniRef50_A3ZMM8 Transposase insG for insertion sequence element-like protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZMM8_9PLAN Length = 464 Score = 83.6 bits (205), Expect = 8e-15, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 61/153 (39%), Gaps = 7/153 (4%) Query: 86 ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELR 145 S+ + AR+R+ +E F + D R ++ + ++F IDG P P L+ Sbjct: 96 NTSSFSAARKRLPLDAIER-FSRCVCDHL-GRTVEPVFDDRRVFIIDGTTITLPPTPVLK 153 Query: 146 EYYGSANTSTKRQNAYPVMRLVALMNLGSHILL----NAVTAPYRQSETVLAHSMLATIP 201 + + T+ + +PV L+ + + +L + + P SE A ++ +P Sbjct: 154 KAFP-PATNQLGETVWPVAMLMVAAEMQTGCILVPKIDPMYGPNNSSEAKQAREIVGDLP 212 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPA 234 SI L D F + G + + L Sbjct: 213 SRSIVLADSCFGIFSVAHHTRAAGHDFLFRLSM 245 >UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q04V25_LEPBJ Length = 423 Score = 76.3 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 56/167 (33%), Gaps = 12/167 (7%) Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPD 140 G + + A + R+ + E L G + A D P Sbjct: 46 GKKAVTKQAFSFTRENLNPQVFESLNEIFVNSYYKNVTNCKTHKGYIVAACDATGISLPK 105 Query: 141 KPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATI 200 E + +G + P + ++ + I+L++ +R SE +A + + Sbjct: 106 TKEFVKDFGCVKNQLGESES-PNANSSIIFDIYNDIILSSTVGSHRTSERSMALHHIEKL 164 Query: 201 PDNS-------ITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 S I LFDK + S +L+ L G H+++ N Sbjct: 165 RSISALQNKKLILLFDKGYPSMELIGKLMANG--IHFII--RSNTRW 207 >UniRef50_Q82UV9 Putative uncharacterized protein n=1 Tax=Nitrosomonas europaea RepID=Q82UV9_NITEU Length = 91 Score = 73.9 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 32/73 (43%) Query: 152 NTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKL 211 + + + YP +R V L+ G+H+L Y+ +E LAH +A + + L D+ Sbjct: 2 PGTQQGRTGYPQLRFVGLLENGTHVLFGVALGGYQDAEVRLAHQTIAHLKPGMLCLADRG 61 Query: 212 FYSEDLLLTLNQK 224 L ++ Sbjct: 62 LSGYPLWAAASRT 74 >UniRef50_UPI000190F8A2 hypothetical protein SentesTyp_33971 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190F8A2 Length = 85 Score = 73.2 bits (178), Expect = 9e-12, Method: Composition-based stats. Identities = 41/46 (89%), Positives = 42/46 (91%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVR 46 M LLNDLLDFSDHPLMPPPSAQ+FAEHLP E IQHCLTLS HATVR Sbjct: 1 MSLLNDLLDFSDHPLMPPPSAQMFAEHLPAECIQHCLTLSKHATVR 46 >UniRef50_A5N5R2 Transposase n=6 Tax=Clostridium RepID=A5N5R2_CLOK5 Length = 205 Score = 73.2 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 25/122 (20%), Positives = 53/122 (43%), Gaps = 11/122 (9%) Query: 128 LFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVA--LMNLGSHILLNAVTAPY 185 +FA+DG++ + P+ E R ++G + + +R + + ++ +H L+ Sbjct: 1 MFAVDGSKAKVPNSDENRAFFGECGNNHSKG----QVRALVSSIFDVFNHFFLDLQIDSI 56 Query: 186 RQSETVLAHSMLATI-----PDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 + SE+ LA + I N I +FD+ + S +L+ L + G + L + Sbjct: 57 KTSESELAKKNINAIRKILPNTNFIVVFDRGYLSIELIHFLEENGVQYLFRLSSNDYKKE 116 Query: 241 EM 242 Sbjct: 117 RE 118 >UniRef50_A3YGY3 Transposase and inactivated derivative n=1 Tax=Marinomonas sp. MED121 RepID=A3YGY3_9GAMM Length = 66 Score = 71.3 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 29/59 (49%), Positives = 38/59 (64%) Query: 166 LVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQK 224 LVALMN SHI+++ YR+ + LA S A PDNSITLFDK F+S +L L++ Sbjct: 2 LVALMNTQSHIMMDPQIIHYRRGKIPLAPSTQAKTPDNSITLFDKGFWSTKFMLGLSRA 60 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 70.9 bits (172), Expect = 5e-11, Method: Composition-based stats. Identities = 42/188 (22%), Positives = 73/188 (38%), Gaps = 15/188 (7%) Query: 46 RRRRLPGDMVIWMVVQNEPI---TDVVRRLNLS-ADGEAGMN-LLARSAVTQARQRVGAA 100 R R+LP + VI ++ + ++ R + + + SA+ QARQ++ + Sbjct: 36 RNRKLPFEEVIRFLLPLQGQCMDQELFRHFSKKPLFFSTDYSGIPHSSAMIQARQKLSDS 95 Query: 101 PVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNA 160 + LF + + G QL AIDG+QF P E + Sbjct: 96 AMPALFHSFTETCKK----GALFQGYQLLAIDGSQFSVP---ENLKEPLCWRKIPNISKG 148 Query: 161 YPVMRLVALMNLGSHILLNAVTAP-YRQSETVLAHSMLATIPDN--SITLFDKLFYSEDL 217 V+ L A+ +L S I + V P +E M+ +I + D+ + S + Sbjct: 149 RNVIHLNAMYHLQSGIFEDVVFQPICECNEHKALAQMVDRRSSAFPAIFMADRGYESYNT 208 Query: 218 LLTLNQKG 225 + QKG Sbjct: 209 FAHIEQKG 216 >UniRef50_B8CMP8 Transposase OrfA, putative n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CMP8_SHEPW Length = 156 Score = 70.5 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 36/151 (23%), Positives = 64/151 (42%), Gaps = 26/151 (17%) Query: 1 MPLLNDLLDFSDH----PLMPPPSAQLFAEHLPTEWIQH--CLTLSAHATVRRRRLPGDM 54 + +L+ L++ S+ + P E L E IQ + +HA+ + Sbjct: 16 LFVLDFLMELSEALTRISINRPTEFANLGELLCPELIQKLFTIQWCSHASHTK------- 68 Query: 55 VIWMV----------VQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEW 104 + + V + E + ++ +L++ E G +ARS VTQ R+++ + VE Sbjct: 69 ITYGVNDFGCYRHGLISGESVRQLIYKLDIILLNEVGY--VARSTVTQTRKKLTSDVVED 126 Query: 105 LFRQTAQDRGAERYLKDDWHGLQLFAIDGAQ 135 +FRQT Q W GL L+ +DG Sbjct: 127 IFRQTPQRW-NMLAEHPQWCGLNLYGVDGVV 156 >UniRef50_B2Q345 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q345_PROST Length = 130 Score = 70.1 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 3/76 (3%) Query: 65 ITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWH 124 + +V L++ +A SAV QARQR+G V +F +T+Q + W+ Sbjct: 1 MAQLVFHLDIVLPSNR--PYVAPSAVVQARQRLGEDAVRKVFEKTSQLWLDKL-PLSHWN 57 Query: 125 GLQLFAIDGAQFRTPD 140 GL L A+DG +R PD Sbjct: 58 GLTLMAVDGTLWRIPD 73 >UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JHT2_9FIRM Length = 424 Score = 67.8 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 91/225 (40%), Gaps = 18/225 (8%) Query: 46 RRRRLPGDMVIWMVVQNEPIT---DVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPV 102 R R++P +++ ++ + +T ++ + L+ G + +++ + R ++ Sbjct: 20 RIRKMPLQDLLFTMINRKGLTLALELRNYMKLAHPGVS----ISKPGYLKQRMKLNPDAF 75 Query: 103 EWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYP 162 L++ ++ A+ + + A DG+ P E + YGSA+ + A Sbjct: 76 LELYKYHNRNFYADSTF-STYKNHLILAADGSDINIPTTTETLKLYGSASRKNTKPQA-- 132 Query: 163 VMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNS-----ITLFDKLFYSEDL 217 + L + ++ + ++L + + E LA + IP+ I + D+ + S Sbjct: 133 QIGLGCIYDVMNRMILESDCNKVKFDEMRLAEKQMERIPETIGNIPYIIIMDRGYPSTPA 192 Query: 218 LLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLR 262 + + K + +++ + + + T + + +L+ R Sbjct: 193 FIHMMDK--DLKFIVRLKSS-DYKKEQSSLTENDQLVKIKLDKSR 234 >UniRef50_UPI0001AF03EF IS4 family transposase n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF03EF Length = 374 Score = 66.6 bits (161), Expect = 9e-10, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 39/95 (41%), Gaps = 2/95 (2%) Query: 166 LVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKG 225 L+ L G+ L+ A P ++E+ A + + + + L D+ F +LL + ++G Sbjct: 2 LMTLCETGTRALIAAAFGPAVKAESDYARELTGHLTPDMLLLADRAFDGNELLAAIARQG 61 Query: 226 CNRHWL-LPAWKNIASEMIELGNTASP-GTIPKRL 258 + ++ G+ + G + R+ Sbjct: 62 AQFLVRCTSTRRPPVLALLPDGSYLTRIGNLSLRV 96 >UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipelotrichaceae RepID=B7CEB8_9FIRM Length = 431 Score = 66.6 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 83/249 (33%), Gaps = 41/249 (16%) Query: 46 RRRRLPGDMVIWMVVQNEP------ITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGA 99 R+R+LP + +I ++Q + + + ++ L SA+ Q R ++ Sbjct: 34 RKRKLPVETLIHFIIQMQSKSLNSELCEYFNDIDF---------LPTASALCQQRDKLDI 84 Query: 100 APVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQN 159 + + + W G + A DG+ + + Sbjct: 85 SAFQRIMHLFVNAFD----DYKTWKGYHVLACDGSDVNIAYDEKDED----TKRQNGNNK 136 Query: 160 AYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATI-----PDNSITLFDKLFYS 214 + + L + +H+ + ++T +++ I P+NSI D+ + Sbjct: 137 PFSQFHINGLYDCINHVFWDTSID--TANKTRECAALMEMIMKHDYPENSIITADRGYEK 194 Query: 215 EDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRP 274 +L+ + + + ++ G+ S +P L+V I R Sbjct: 195 YNLIACCIENNQKFVFRIK-------DIDVFGSILSNLNLPDE----EFDLDVTKILTRK 243 Query: 275 RPSRPRSVK 283 + + ++ K Sbjct: 244 QTNETKANK 252 >UniRef50_Q8QNB6 EsV-1-170 n=2 Tax=Ectocarpus siliculosus virus 1 RepID=Q8QNB6_ESV1 Length = 383 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 74/191 (38%), Gaps = 17/191 (8%) Query: 45 VRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEW 104 RRR++ + + + + V + D + AV AR+++ + Sbjct: 22 QRRRKMDTSSLFYTLTRCCVQGRGVNHVLKMED-----EAYSSQAVHSARKKLPMGAFKE 76 Query: 105 LFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTST-KRQNAYPV 163 + R RG ++FA+DG++ Y N R P+ Sbjct: 77 VNRFL--HRGPHEP--------RVFAVDGSKVHVHPSFINAGYKTRTNDQPVSRPAKRPL 126 Query: 164 MRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQ 223 + L +++++ + ++ + +E A SML ++ LFD+ +YS+DLL +++ Sbjct: 127 VMLSSMVDVKTKACIDFELTKH-FNERRAATSMLRSVQKGDTLLFDRGYYSKDLLHSVHG 185 Query: 224 KGCNRHWLLPA 234 W L Sbjct: 186 SHAFGVWRLKI 196 >UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales RepID=A2RJ55_LACLM Length = 439 Score = 65.5 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 50/308 (16%), Positives = 105/308 (34%), Gaps = 46/308 (14%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATV---------RRRRLPGDMVIWMVVQNEPITDV 68 P + Q+ + + ++ H + R R+L + I +++ Sbjct: 8 PSTLQV-SHQIKKNLEDQIHEITNHPEIYAQSPFDFSRNRKLSFETTIKIIL---SFGGQ 63 Query: 69 VRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQL 128 L + + SA+ QAR ++ E LF +T + G ++ Sbjct: 64 SLSSELLSHFNFTLKTPTASALVQARSKIKLKAFEQLFYRTI----PSAQPNKLYKGYRI 119 Query: 129 FAIDGAQFRTP-DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ 187 FA DG+ P ++ E +Y + + L AL + + + +Q Sbjct: 120 FAHDGSDLNIPYNEKESDTHYRVGKFGKHVGS----LHLNALYDPLNKHYVAVDFQKIKQ 175 Query: 188 -SETVLAHSMLATIPDNS--ITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 +E ++ S I + D+ + S ++ + + G +L+ A ++ ++ Sbjct: 176 LNERKSLCQIVDDFDFTSPTIIIADRGYESFNVYEHIKKSGQK--FLIRAKDTKSNGLLN 233 Query: 245 LGNTASPGT---------IPKRLEHLRGALEVVFITKRPR----PSRPRSVKISKTRYPV 291 + S GT ++ ++ F+ KR P R SK YP+ Sbjct: 234 GLDLPSDGTFDKKITLQLTRRQTNKVKKDKHYHFLHKRANFDYLPIR------SKETYPI 287 Query: 292 KHSAAPLK 299 +K Sbjct: 288 SLRVVRIK 295 >UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostridium sp. SS2/1 RepID=B0NXD2_9CLOT Length = 439 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 79/211 (37%), Gaps = 17/211 (8%) Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTP-DK 141 SA Q ++++ + + LF + K +H L + ++DG P D+ Sbjct: 71 KTPTTSAFLQQQKKLKLSAFQTLFYRFNDPF----PDKTLYH-LHILSVDGTGVTVPMDR 125 Query: 142 PELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ-SETVLAHSMLAT- 199 + Y T+ + + +L + +A P+R SET + ML Sbjct: 126 INENKEYARVRTNKDCTRPAYQFHVSCIYDLINERYCDAYIEPFRTHSETHVFSVMLERK 185 Query: 200 -IPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW-KNIASEMIELGNTASPGTIPKR 257 P ++ + D+ + S L+ + G ++L+ A MI+ GT K Sbjct: 186 NFPQKALFIADRGYESYLLMAQIQHDGN--YFLIRAREDFGQGSMIKGYPFPRDGTFDKT 243 Query: 258 LEHLRGALEVVFITKRPRPSRPRSVKISKTR 288 + ++ + KR + + P K TR Sbjct: 244 VTYIYTKTQ----NKRTKAN-PELYKRVATR 269 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 63.9 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 61/156 (39%), Gaps = 11/156 (7%) Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 + S+ Q R ++ E+LF Q+ ++GL+L A DG+ P Sbjct: 69 TTPSNSSFNQRRAQILPEAFEFLF----QEFTKSFTDNVTYNGLRLIACDGSDLCIAHNP 124 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR-QSETVLAHSMLATIP 201 + Y K Y ++ L A +L S +A+ P R +E M+ Sbjct: 125 QDETTYFQTLPDRK---GYNLLHLNAFYDLCSRQYTDAIIQPSRLANERRAMCEMIDRYN 181 Query: 202 DNS-ITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWK 236 D S I + D+ + + ++ + KG ++L+ Sbjct: 182 DTSAIFIADRGYENYNIFAHVEHKG--MYYLIRVKD 215 >UniRef50_A4A0C3 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A0C3_9PLAN Length = 445 Score = 63.9 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 30/115 (26%), Positives = 45/115 (39%), Gaps = 8/115 (6%) Query: 125 GLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAP 184 G + A+DG++F P G A R+ P + L ++G+ + + P Sbjct: 131 GWVVMAVDGSRFEAPRTRANEAGLGCA----GREKTTPQIYQTTLQHVGTSLPWDFRIGP 186 Query: 185 YRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRH-WLLPAWKNI 238 SE ML +P S+ + D F S DL L RH +LL N Sbjct: 187 GTASERRQLDEMLPDLPGKSLLIADAGFISYDLCRVLLMG---RHDFLLRVGGNT 238 >UniRef50_A1SV49 ISSod7, transposase n=1 Tax=Psychromonas ingrahamii 37 RepID=A1SV49_PSYIN Length = 95 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 41/86 (47%), Gaps = 8/86 (9%) Query: 52 GDMVIWMVV-----QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLF 106 + ++W VV + + +V +L++ G +A SAVTQAR+++G +E + Sbjct: 1 MEKMVWAVVGMALFRKYSMRQLVNQLDIILPN--GEPYVASSAVTQARKKLGYQAIESIS 58 Query: 107 RQTAQDRGAERYLKDDWHGLQLFAID 132 QT Q E+ W GL L D Sbjct: 59 NQT-QSLWHEKSEHPMWCGLSLLGGD 83 >UniRef50_B9YUA6 Transposase, IS4 family protein n=3 Tax='Nostoc azollae' 0708 RepID=B9YUA6_ANAAZ Length = 256 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 14/74 (18%), Positives = 37/74 (50%), Gaps = 5/74 (6%) Query: 180 AVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA 239 ++ +PY E A +L ++ + + ++D+ +S ++ ++ C+ +PA N+ Sbjct: 3 SLLSPYGIGERKRAIQILPSVGEGMLLMWDRGLHSFKMVHAAIKQKCHILGRVPA--NVK 60 Query: 240 SEMIEL---GNTAS 250 E+++ G+ S Sbjct: 61 FELVKTLGNGSYLS 74 >UniRef50_B6FVR6 Putative uncharacterized protein (Fragment) n=2 Tax=Clostridium nexile DSM 1787 RepID=B6FVR6_9CLOT Length = 286 Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 82/218 (37%), Gaps = 34/218 (15%) Query: 84 LLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPE 143 SA Q R+++ +E LF + + + +L AIDG+ F P Sbjct: 22 FPTVSAFVQQRKKLSYTALEHLFYRFNE---CTFKKPVLYKNYRLLAIDGSDFSLP---- 74 Query: 144 LREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT-APYRQSETVLAHSMLATIPD 202 Y S + N + + L AL ++ S L+ + ++ET A ++ I + Sbjct: 75 ----YNSQEDNVMGDNHFSTLHLNALFDVCSKSFLDVIVQKGLHENETGAACELVDRISE 130 Query: 203 N--SITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEH 260 I + D+ + + +L + ++ + + N S +PK +E+ Sbjct: 131 KHPVIIMADRGYENYNLFAHIEERLFDYVVRVRDSDNSC--------MVSGLNLPKTVEY 182 Query: 261 LRGALEVVFITKRPRPSR----PRSVKISKTRYPVKHS 294 ITKR +R P ++ K +Y K S Sbjct: 183 --------DITKRVVLTRHFSGPAAINTEKYKYLSKKS 212 >UniRef50_A3EIG1 FOG: Transposase and inactivated derivatives n=3 Tax=Vibrio cholerae V51 RepID=A3EIG1_VIBCH Length = 264 Score = 58.2 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 16/39 (41%), Positives = 22/39 (56%) Query: 205 ITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 +TLFDK FY+ LL +G RHWL+P K + + Sbjct: 27 LTLFDKGFYALGLLHRWQSQGKERHWLIPLRKGAQYKTL 65 >UniRef50_C6JHV1 Putative uncharacterized protein (Fragment) n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JHV1_9FIRM Length = 138 Score = 58.2 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 48/106 (45%), Gaps = 3/106 (2%) Query: 45 VRRRRLPGDMVIWMVV--QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPV 102 VRRR+L VI + + + ++ G +++ A+++ARQ + + Sbjct: 34 VRRRKLSLLQVIIYLFFSSKASMFQNLSQIREEL-GTLSFPDVSKQALSKARQFINPSLF 92 Query: 103 EWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYY 148 + L+ + ++ + W G LFA+DG++ P+ +++ Sbjct: 93 KELYYLSVDLFYSQIPSRKLWQGYHLFAVDGSRIELPNSKSTFDFF 138 >UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostridiales RepID=C7GFW6_9FIRM Length = 436 Score = 57.4 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 87/219 (39%), Gaps = 22/219 (10%) Query: 45 VRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEW 104 +R+R+L ++ +++ E + L E ++ SA Q R ++ + Sbjct: 34 IRKRKLDFKKMMHLIISMESGS---LNHELLKFFEYDSSVPTGSAFYQQRSKLSVSAFRH 90 Query: 105 LFRQTAQDRGAERYLKDDWHG-LQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPV 163 L ++ ++ + + G L A DG++F + + + N + + + + Sbjct: 91 LLKEFN-----LKFPLEKFRGKYYLIACDGSEFNIARNLKDADTFHEPNGKS--VSGFNM 143 Query: 164 MRLVALMNLGSHILLNAVTAPYR-QSETVLAHSMLATI--PDNSITLFDKLFYSEDLLLT 220 + ++L + S L+ P R ++E +++ + I + D+ F S ++ Sbjct: 144 VHTISLYEVCSKRYLDLEVQPGRLKNEFQAICNLMDRYAYGASPIFIADRGFSSYNVFAH 203 Query: 221 LNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLE 259 + + +L+ A + + GT+P +L+ Sbjct: 204 AIENNVD--FLIRAKD------LNVQRFLGGGTLPDKLD 234 >UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula baltica RepID=Q7UPU9_RHOBA Length = 656 Score = 57.0 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 56/182 (30%), Gaps = 26/182 (14%) Query: 82 MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDK 141 + + S + +A R E L + + G + A+DG++ TP Sbjct: 158 IAFSSYSGLIKALVRWTPWLSEVLLTRIHKQIETTAGKLWRTTGWVVMAVDGSRDTTPRT 217 Query: 142 PELREYYGSANTSTKRQNAY----------------------PVMRLVALMNLGSHILLN 179 + + + N + Y P + + + ++ + + Sbjct: 218 LSNEKAFCAPNHGHGKTARYRKKKTKGMRRQAIEKNPPAPPVPQIWITMIWHVATQLTWC 277 Query: 180 AVTAPYRQSETVLAHSML--ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN 237 P SE ML P+ ++ D F + ++ G H+L+ N Sbjct: 278 WKLGPSNASERAHVQEMLENGEFPEKTLFTGDAGFVGYEFWKSIIDGGH--HFLVRVGAN 335 Query: 238 IA 239 + Sbjct: 336 VN 337 >UniRef50_C8VYK7 Putative uncharacterized protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VYK7_DESAS Length = 161 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 16/113 (14%), Positives = 41/113 (36%), Gaps = 7/113 (6%) Query: 112 DRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMN 171 + G +L AID + + LR+ +G + T + + + + Sbjct: 44 TWYYGDDNFKTFKGYRLSAIDASILEITNSERLRDAFGYSEGKTVKLA---RAKASDIYD 100 Query: 172 LGSHILLNAVTAPYRQSETVLAHSMLATIP----DNSITLFDKLFYSEDLLLT 220 + + +++ + Y E +A ++ + N + LFD+ + + Sbjct: 101 IENDMMITSKITRYTTGERDIAIELIEKLKKLVLKNDLILFDRRYALAKIWKG 153 >UniRef50_A5N172 Transposase n=4 Tax=Clostridium RepID=A5N172_CLOK5 Length = 147 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 47/110 (42%), Gaps = 1/110 (0%) Query: 45 VRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEW 104 +R+R++P +I + + +T + A +++ Q R+++ + Sbjct: 34 IRKRKMPLSDIILCTLSKKGLTIAIELHQYFTQKGACHMSISKQGYLQQRKKLNYKVFSF 93 Query: 105 LFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTS 154 L ++ +D W+ +FA+DG++ P+ E R ++G + Sbjct: 94 LNKEYLEDFYHSTEP-ILWNNHLVFAVDGSKAEVPNSDENRAFFGECGNN 142 >UniRef50_P12249 Transposase for insertion sequence element IS231A n=411 Tax=Bacillus RepID=T231A_BACTB Length = 478 Score = 55.1 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 97/276 (35%), Gaps = 24/276 (8%) Query: 19 PSAQLFAEHLPTEWIQHCLTLSAH---ATVRRRRLP----GDMVIWMVVQNEPITDVVRR 71 QLF+E L L A R+R+ + IW + +D + R Sbjct: 7 DELQLFSEELCRHLTPSFLEELAKKLGFVKRKRKFSGSELATICIW--ISQRTASDSLVR 64 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYL---KDDWHGLQL 128 L G L++ + + + ++++F + + + H ++ Sbjct: 65 LCSQLHAATG-TLMSPEGLNKRFDKKAVEFLKYIFSILWKGKLCKTSAISSTALTHFQRI 123 Query: 129 FAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQS 188 +D F+ P L Y + + +++ +L S LN P + + Sbjct: 124 RILDATIFQIP--KHLASIYPGSGGCAQTAG----IKIQLEYDLHSGQFLNFQVGPGKNN 177 Query: 189 ETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN-IASEMIELGN 247 + L T+ + + D ++S + L ++Q+G +++ N Sbjct: 178 DKTFGTECLDTLRPGDLCIRDLGYFSLEDLDQMDQRGA--YYISRLKLNHTVYIKNPSPE 235 Query: 248 TASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVK 283 GT+ K+ ++++ LE I +P + +K Sbjct: 236 YFRNGTVKKQSQYIQVDLE--HIMNHLKPGQTYEIK 269 >UniRef50_B2J2I5 Transposase, IS4 family protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J2I5_NOSP7 Length = 439 Score = 54.3 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 71/219 (32%), Gaps = 25/219 (11%) Query: 46 RRRRLP----GDMVIWMVVQNEP-ITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAA 100 R R L +V+ +V + P + +V R L AG ++ AV++ + + Sbjct: 56 RDRILTLSVMMALVVSLVYRQIPGLREVQRVLCEEGLLWAGRIEVSAQAVSKRLRTLPIE 115 Query: 101 PVEWLFRQT-----AQDRGAERYLKDDWHGLQLFAI---DGAQFRTPDKPELREYYGSAN 152 +F Q Q + + AI DG+ LR Sbjct: 116 LFAQIFEQVMERMNVQPQNQAVPENWQPVCAKFTAIWIADGSTL-----EALRRKLKVLQ 170 Query: 153 TSTKRQNAYPVMRLVALMNLGSHILLNAVTA-PYRQSETVLAHSMLATIPDNSITLFDKL 211 K +++ ++ SH + + ++ +L +P + +FD Sbjct: 171 EQEKTLAG----KIMMVVEAFSHHPVTTWYTQNSKANDKTWCEQLLERLPIGGLLIFDLG 226 Query: 212 FYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTAS 250 F+ ++ +L + + ++I AS Sbjct: 227 FFKFPWFDAF--TEADKFFLTRLREKTSYKVIRCLTNAS 263 >UniRef50_UPI00016C560B transposase IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C560B Length = 280 Score = 53.9 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 2/60 (3%) Query: 184 PYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 P +SE +A +L + ++ + L+D+ F S DL+ + Q+ H L N+ + Sbjct: 9 PCHRSEVTMAPYLLRCLQNDMLLLWDRGFLSYDLVQQVRQRCA--HLLARIKSNLVFRPL 66 >UniRef50_C1DL03 Transposase inactivated derivative n=1 Tax=Azotobacter vinelandii DJ RepID=C1DL03_AZOVD Length = 54 Score = 53.9 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 16/54 (29%), Positives = 24/54 (44%), Gaps = 1/54 (1%) Query: 102 VEWLFRQTAQDRGAERYLKD-DWHGLQLFAIDGAQFRTPDKPELREYYGSANTS 154 + LF Q A+ A + + GL++FA DG + PD E RE + Sbjct: 1 MAALFEQLARAWLAVKPPASARFRGLRIFAADGVVWSMPDTAENREAFSGGRNQ 54 >UniRef50_Q8X2N0 Putative uncharacterized protein ECs5267 n=1 Tax=Escherichia coli O157:H7 RepID=Q8X2N0_ECO57 Length = 80 Score = 53.5 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 46/79 (58%), Positives = 49/79 (62%), Gaps = 24/79 (30%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLS 75 MPPPS QLFAEHLPTEWIQH LTLSAHAT VRRLNLS Sbjct: 1 MPPPSTQLFAEHLPTEWIQHFLTLSAHAT------------------------VRRLNLS 36 Query: 76 ADGEAGMNLLARSAVTQAR 94 DGEAGMNLLA +A++ R Sbjct: 37 VDGEAGMNLLAPAALSPRR 55 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 54/156 (34%), Gaps = 10/156 (6%) Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPD 140 + + SA QAR ++ LF + + K +HG +L AIDG++ + Sbjct: 30 SITTPSASAFVQARSKIKPEAFRTLFD----GFNKKTFKKKLYHGYRLLAIDGSELPIDN 85 Query: 141 KP-ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT-APYRQSETVLAHSMLA 198 + T K +AY L A +L + + ++ E ++ Sbjct: 86 TIFDDETTVLRHGTLAKTFSAY---HLNASYDLMERTYDDIIIQGEAKRDEHGAFCQLVD 142 Query: 199 TIPD-NSITLFDKLFYSEDLLLTLNQKGCNRHWLLP 233 +I + D+ + S + + G + Sbjct: 143 RYDGQKAIFIADRGYESYNGFEHVVHSGHKYLIRVR 178 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 52.8 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 60/158 (37%), Gaps = 10/158 (6%) Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPD 140 +N SA TQ R ++ E+LF ++ K+ + G QL A DG+ Sbjct: 68 DVNAPTVSAYTQQRAKILPEAFEYLFHAFTEENAQT---KNLYEGYQLLACDGSNLTIAP 124 Query: 141 KPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ-SETVLAHSMLAT 199 E +N N + L AL ++ + ++A+ E M+ Sbjct: 125 NLNDPETLWKSNQLGATGN---HLHLNALYDVLNRTYIDALVQTASTYQEHRACIQMIER 181 Query: 200 IP-DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWK 236 + D I + D+ + + +++ +KG +L+ Sbjct: 182 VTLDKVILIADRGYENYNIMSHAIEKGWK--FLIRIKD 217 >UniRef50_B8CMP9 Transposase OrfB, putative n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CMP9_SHEPW Length = 75 Score = 52.8 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 14/42 (33%), Positives = 26/42 (61%) Query: 166 LVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITL 207 +V LM L SH+L+++ ++E LA +++ PDN+I + Sbjct: 1 MVCLMELSSHLLVDSSFGSVAENEMALAANLINNTPDNNIVI 42 >UniRef50_Q09BD0 Isrso13-transposase protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09BD0_STIAU Length = 387 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 48/253 (18%), Positives = 86/253 (33%), Gaps = 31/253 (12%) Query: 23 LFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGM 82 + + EW+ R+R+ +++ VV+ + + R +L A +A Sbjct: 24 VLQRAVSAEWMDSLFEA-----HRKRQYTRELLFSTVVELMSVVAMGLRPSLHAGAKATE 78 Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYL-----KDDWHGLQLFAIDGAQFR 137 + +A+ + R+ V L R +AQ K G ++ +DG Sbjct: 79 GGTSIAALYEKVNRMEPDLVRALVRGSAQRLEPVVQPLRTGEKPWAEGYRVRVMDGNHLP 138 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV-TAPYRQSETVLAHSM 196 +K R A P LV ++++ V E L ++ Sbjct: 139 ASEKR-------LKPLREFRGAALPGHSLVV-YAPEQGLVVDVVPCEDAHAQERTLVAAV 190 Query: 197 LATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPK 256 L + + D+ F + ++ L +RH A + E G T SP + K Sbjct: 191 LEHAQQGDLWIADRNFSTTRIVFGL----EDRH--------AAFIIREHGRTPSPTEVGK 238 Query: 257 RLEHLRGALEVVF 269 R R VVF Sbjct: 239 RKRVGRVETGVVF 251 >UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=A7B831_RUMGN Length = 366 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 72/198 (36%), Gaps = 21/198 (10%) Query: 46 RRRRLPGDMVIWMVVQNEP---ITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPV 102 R R+L I ++ E +++ S D + SA Q R ++ Sbjct: 44 RNRKLDFVSTIQFLLSMESGSLKKELLDYFQFSVDT------PSASAFCQQRNKLLLEAF 97 Query: 103 EWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYP 162 ++LF + E+ + QL A DG+ P Y + + + + Sbjct: 98 QFLFYEFNSCFSFEK----KYKDYQLLACDGSDLNIARNPNDAGTYFQSQPTDR---GFN 150 Query: 163 VMRLVALMNLGSHILLNAVTAPYRQSETVLAH-SMLATIPDN--SITLFDKLFYSEDLLL 219 + L AL +L ++ V P R LA M+ +I + D+ + + ++ Sbjct: 151 QIHLNALFDLCEKRYIDLVIQPARLENESLAMTQMIDRYKGEKKTIFIADRGYETYNIFA 210 Query: 220 TLNQKGCNRHWLLPAWKN 237 + +KG ++L+ Sbjct: 211 HVQEKG--MYYLIRVKDG 226 >UniRef50_UPI00019668E9 transposase ISLbp1 n=1 Tax=Methanobrevibacter smithii DSM 2374 RepID=UPI00019668E9 Length = 319 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 60/159 (37%), Gaps = 8/159 (5%) Query: 85 LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPEL 144 + + AV++ RQ + + A + + G ++AIDG+ P+ Sbjct: 71 ITKQAVSEKRQFIDPQVYIDMNGSLISKIYAHKDEMTTFKGFNVYAIDGSIVEIPNTKLT 130 Query: 145 REYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD-- 202 RE + + ++ R+ + + ++++ SE A L + + Sbjct: 131 REEFEIPEKTELMKDT-STARISCMADTKWDFIISSNITNKSTSEIEHALMHLDDVKNKI 189 Query: 203 ---NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 +IT +D+ + S +++ + ++++ + Sbjct: 190 DLTKTITTYDRFYNSIEIM--FKTMLLDSYFIIRGKTHT 226 >UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=Streptococcus RepID=A4W4J4_STRS2 Length = 440 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 70/196 (35%), Gaps = 25/196 (12%) Query: 46 RRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWL 105 R+ +L + +I ++ T L+L +++SA Q R ++ + L Sbjct: 44 RKSQLTMETMIQAILTMGGNTLAKELLDLDLP-------VSQSAFVQRRYQLKHQAFKAL 96 Query: 106 FRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMR 165 F + L + A+DG+ P R + + Y ++ Sbjct: 97 F-------ANITSKIPTFKDLPILAVDGSDVVLP---RNRSDKTTTFQTGPHHTPYTLIH 146 Query: 166 LVALMNLGSHILLNAVTAPYRQ-SETVLAHSMLATIP-DNSITLFDKLFYSEDLLLTLNQ 223 + AL NL I + R+ E M+ + P + ++ + D+ + S +++ Sbjct: 147 INALYNLEQEIYHDLRIQNNREVDERAAFIDMMESCPFEQALVIMDRGYESYNVMAHCQ- 205 Query: 224 KGCNRHW--LLPAWKN 237 R+W ++ Sbjct: 206 ---ERNWSYIIRIRDG 218 >UniRef50_A4BSI0 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BSI0_9GAMM Length = 406 Score = 52.0 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 31/79 (39%), Gaps = 3/79 (3%) Query: 154 STKRQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSMLATIPDNSITLFDK 210 +P+ RLV + L S LLNA ++ +E L SM + I + D Sbjct: 106 GQSPGLGFPIGRLVGITYLASGALLNAAIGRFQGKGGNEQTLLRSMQESFAPGDILIGDA 165 Query: 211 LFYSEDLLLTLNQKGCNRH 229 F + + + KG + Sbjct: 166 FFATYFFIAAMQAKGVDIL 184 >UniRef50_A4A0C6 Probable transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A4A0C6_9PLAN Length = 442 Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 30/159 (18%), Positives = 56/159 (35%), Gaps = 9/159 (5%) Query: 86 ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELR 145 R + +A R G A + + A E G FA+DGA+F P + Sbjct: 81 TRQGLLKALARHGEALIPQVVAHIADQL-RELKGDWTQRGKVNFAVDGAKFLAPRTAANQ 139 Query: 146 EYY------GSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 + + A+ S + + + + +L + + A + SE ML Sbjct: 140 QQFASKKEKQYASKSNQSKAESAQLLATVVWHLTAGLPYRWRIAGSKGSERHALTDMLDE 199 Query: 200 IPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 +P N+ + D + L + R +L+ N+ Sbjct: 200 LPSNARIIADAEYVGYPLWSAILD--SKRSFLVRVGSNV 236 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 51.2 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 32/145 (22%), Positives = 63/145 (43%), Gaps = 10/145 (6%) Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 SA Q R ++ ++ LF + + +E L+D +L A+DG+ R P Sbjct: 19 ETATASAFVQQRDKIRPEALKLLFHEFTRLTVSENSLQD----YRLLAVDGSDLRLPSNS 74 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR-QSETVLAHSMLAT-- 199 + + + S S +N Y ++ L A+ +L + ++A + +E SM+ Sbjct: 75 K--DGFSSIRNSEDSKN-YNLVHLDAMYDLMGKVYVDASVQSKKGMNEHKALVSMVDQSE 131 Query: 200 IPDNSITLFDKLFYSEDLLLTLNQK 224 I N I + D+ + S + + +K Sbjct: 132 INGNVIAIMDRGYESFNNIAHFQEK 156 >UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B70E Length = 479 Score = 50.5 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 55/163 (33%), Gaps = 11/163 (6%) Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 +++ A+ +A +++ +L Q A + K + L A DG P Sbjct: 78 KRISKQALNKAIRKLNPNVFTYLINQFASIYYSTSLPK-KYRDHLLIAEDGTYMEIPYNM 136 Query: 143 ELREYYGSA---NTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 + A + + L ++ + + ++ SET LA + L Sbjct: 137 LNINEFQFALGCHVRNMFDVKKVQSKAGGLYDVTNGLFIDFSLRQAPYSETPLAFAHLYR 196 Query: 200 IPD-----NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN 237 + I L D+ + S +++ L +++ N Sbjct: 197 TREMLENQKVIYLADRYYGSAEIISHLED--LRYSYVIRGKSN 237 >UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacillus sp. SG-1 RepID=A6CHG0_9BACI Length = 381 Score = 50.5 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 39/266 (14%), Positives = 84/266 (31%), Gaps = 20/266 (7%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAG 81 ++ + E +++ + R+ D+V + V+ + R E Sbjct: 9 EVLQTFITDEEVENLCEKWGYRDTARKFSAKDLVRFFVISSAKDWKSFRDAETKIPQEDS 68 Query: 82 MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQ--FRTP 139 + + S + + Q V ++ LF + G + + +LFA+D F+ P Sbjct: 69 LPSVDHSTLAKKAQNVPYQILQELFSRLVNRLG-RGMRRALFKPYKLFAVDSTTITFQHP 127 Query: 140 DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 D + G T +RL ++ + R + ++A + Sbjct: 128 DMS----WAGYTRTRHA-------IRLHTKFDVEEGQPTQVIPTTGRHHDVMVAPKLYED 176 Query: 200 IPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLE 259 SI D+ + L + + + +++ EM G + + L Sbjct: 177 TEPLSIITADRGYARTRDFEDLQEDNQFFVIRIASSFSLSEEMEHSVPLDEDGNVKEDLT 236 Query: 260 HLRGALEVVFITKRPRPSRPRSVKIS 285 G R +R R V + Sbjct: 237 AFIGK------NSRKTKNRFRVVTFT 256 >UniRef50_Q2JF90 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JF90_FRASC Length = 87 Score = 50.5 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 8/54 (14%), Positives = 19/54 (35%) Query: 14 PLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD 67 L S + + + + L + R R+LP ++++ + D Sbjct: 14 RLTDHISLGVLTGLVHHDLVDDVLVETGRVEKRSRKLPARVMVYFTLAMWLFFD 67 >UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RFU1_9FIRM Length = 443 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 72/190 (37%), Gaps = 22/190 (11%) Query: 36 CLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADG---EAGMNLLARSAVTQ 92 C+ +++ T R R L +I ++ + L+ + +++ + SAV+Q Sbjct: 35 CVDQTSNFT-RSRILTPKTLIKFILGLQ-----AHSLSGEVSDYFTSSNIDIPSISAVSQ 88 Query: 93 ARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSAN 152 R + +F+ + + +G + A DG+ P + + Sbjct: 89 RRDLLYPE----IFKSINRRFLSSIDNLSTLNGYYILAQDGSDINLPFWHDDTQI----- 139 Query: 153 TSTKRQNAYPVMRLVALMNLGSHILLNAVTA-PYRQSETVLAHSMLAT--IPDNSITLFD 209 S + + L AL + +H+ + P ++SE + P+NSI D Sbjct: 140 -SYGQDSIVCQYHLNALYDCINHVFWESRIDLPTKKSEKSALIDFINHRNYPENSIITAD 198 Query: 210 KLFYSEDLLL 219 + + S +L+ Sbjct: 199 RGYESYNLIA 208 >UniRef50_C4XGQ6 Putative transposase for insertion sequence element n=2 Tax=Desulfovibrio magneticus RS-1 RepID=C4XGQ6_DESMR Length = 376 Score = 48.1 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 51/170 (30%), Gaps = 25/170 (14%) Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPD 140 G+ + RS A + E LF + ++ K +LF++D + Sbjct: 74 GVKPVPRSTFADANAKRPYTMFEALFGELYTRCLSQAPKKKFSFENKLFSLDASVVDLCL 133 Query: 141 KPELREYYGSANTSTKRQNA------YPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 + +A K P + V + H E +A Sbjct: 134 NLFPWAKFRTAKGGIKMHTVMDHDGYLPAV--VTVTEAKCH-------------EVNIAK 178 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 + +P SI +FD+ + L + G + N +IE Sbjct: 179 LL--KLPKGSIVVFDRGYNDYTWFRHLCKSGVFL--VTRLKSNARFRVIE 224 >UniRef50_C0R4I1 Putative uncharacterized protein n=5 Tax=Wolbachia RepID=C0R4I1_WOLWR Length = 94 Score = 47.4 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 8/85 (9%) Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 W G +L A DG+ R P E+ + T+ N + ++L + ++ +A Sbjct: 5 WRGYRLIAADGSGMRLPSSGEIVSEFEPNGTTGTIGNLF--------VDLCTSLICSARL 56 Query: 183 APYRQSETVLAHSMLATIPDNSITL 207 A + E LA L + L Sbjct: 57 AAWNIGEQTLAAEQLPEVITQMRLL 81 >UniRef50_B2AJ60 Transposase, IS4 family n=4 Tax=Proteobacteria RepID=B2AJ60_CUPTR Length = 412 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 69/187 (36%), Gaps = 19/187 (10%) Query: 42 HATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNL----LARSAVTQARQRV 97 A R+R+L +I ++ N V L+ A N+ ++ A QAR ++ Sbjct: 30 SAFTRQRKLTLPTLIAFMLGN-LRMGVQAELDQFFAALARQNILRRCVSEQAFAQARSKL 88 Query: 98 GAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKR 157 L + WHG +L A D + R + +T+ Sbjct: 89 SGDVFAHLNDWLLRQV---SDHLPRWHGFRLVAADASHLRFA-----IRHSHLPRAATRD 140 Query: 158 QNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDL 217 Q A+ L G+ I+L A ++E + L + + + L D+ + + L Sbjct: 141 QLAF------GLYLPGAEIMLAASLHSVHENERQILFEHLDRLQSDDLLLLDRGYPARWL 194 Query: 218 LLTLNQK 224 + LNQ+ Sbjct: 195 VAVLNQR 201 >UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobium RepID=B5ZZ25_RHILW Length = 381 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 57/205 (27%), Gaps = 20/205 (9%) Query: 28 LPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNE----PITDVVRRLNLSADG--EAG 81 +P + + RR +I ++ + ++V L + G Sbjct: 15 IPWAVFERLVDEHQADKHVRRLSTKSQLIALLYGQLAGAVSLREIVGSLESHSARLYHLG 74 Query: 82 MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDK 141 ++RS A + LF AQ G ++ IDG+ Sbjct: 75 ARPVSRSTFADANGLRPSTVFAELF---AQMVARAGRGLKRAIGEAVYLIDGSSLSLAGA 131 Query: 142 PELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP 201 + K Y + + + A P ++ A M I Sbjct: 132 GSQWARFSDQACGAKMHVVY---------DANAERPIYAAVTPANVNDITAAKEM--PIE 180 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGC 226 + +FD +Y LN GC Sbjct: 181 AGATYVFDLGYYDFGWWAKLNAAGC 205 >UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FI31_DESAA Length = 386 Score = 45.8 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 59/167 (35%), Gaps = 15/167 (8%) Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQ-LFAIDGAQFR 137 AG + RS + A + A E +F A K + L+++D + Sbjct: 73 HAGAAPVKRSTLADANNQRPAEFFEEVFYHMAAKC-QSHAPKHKFRFKNPLYSMDSSVVD 131 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 + A + + +++ +++ +I + S+ +A ++ Sbjct: 132 LCLNL-----FPWAKHRSTKAG----IKIHTVLDHSGYIPAFVRITDAKTSDIEIARTL- 181 Query: 198 ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 ++P SI + D+ + + ++ KNI +++E Sbjct: 182 -SLPKGSILVEDRAYVDFTWFKNW--HENKQFFVTRLKKNIKYKVLE 225 >UniRef50_C6JAL6 Transposase (Fragment) n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JAL6_9FIRM Length = 237 Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 22/110 (20%), Positives = 44/110 (40%), Gaps = 12/110 (10%) Query: 161 YPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD-----NSITLFDKLFYSE 215 Y + + ++ +L+A + SE A L + D NSI +FD+ +YSE Sbjct: 11 YTMGLASIIYDVLDDYILHASIHKFLSSERAAALEHLKVLEDMGLYNNSIIIFDRGYYSE 70 Query: 216 DLLLTLNQKGCNRHWLLPAWKNIASE-------MIELGNTASPGTIPKRL 258 D+ + G L N++ + +++ + +P R+ Sbjct: 71 DMFRYCVEHGHLCVMRLKEGINLSKKCNGDMISILQGTSKEGTSDVPIRV 120 >UniRef50_A1HQH6 Transposase, IS4 family protein n=2 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HQH6_9FIRM Length = 400 Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 28/229 (12%), Positives = 75/229 (32%), Gaps = 32/229 (13%) Query: 60 VQNEPITDVVRRL--NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAER 117 ++ + + D+ RL + + ++ S +++ + + E +F + + + Sbjct: 43 LRLDSLRDIANRLTCDKQLQKLLHLTSISASTLSRRLRNIDHRVWEQVFAEVKRQIWQQA 102 Query: 118 YLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQ--------NAYPVMRLVAL 169 QL ID + + L Y + K N+YP ++ Sbjct: 103 NKTGAVRQYQLNVIDSSTITLCLRKYLWADYRKTKSGIKLHQRITIHDGNSYPDSAVLTS 162 Query: 170 MNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRH 229 +++ L +++ +FD+ + +KG Sbjct: 163 ARKADKTVMDE----------------LVVTSPDALNVFDRGYVDYAKWDDYCRKGIR-- 204 Query: 230 WLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSR 278 ++ N +++E + + + +++ L A + T+ P R Sbjct: 205 FVSRLKSNAVIDVLEEKSVETNQVLAEKIVRLGNA----YTTQMTHPVR 249 >UniRef50_C0QMU6 Transposase repeat family IS4 n=1 Tax=Thermosipho africanus TCF52B RepID=C0QMU6_THEAB Length = 254 Score = 43.1 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 3/54 (5%) Query: 165 RLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLL 218 RL L +L++ Y SET A +L + +NSI L D+ ++ L Sbjct: 114 RLHVLYEAKEKVLIDFKIGEY--SETEQAELLLEEV-ENSILLADRGYWVWRFL 164 >UniRef50_D2PLH1 Putative uncharacterized protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PLH1_9ACTO Length = 98 Score = 42.7 bits (99), Expect = 0.014, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 27/73 (36%), Gaps = 11/73 (15%) Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYL-----------K 120 L+ A ++ SA+TQAR+ +G LF + +E L Sbjct: 17 LDRWNCWNAAWSVPTVSAITQARKWLGRCVFPELFERACGPVVSEAGLTAEAVALGTARG 76 Query: 121 DDWHGLQLFAIDG 133 +L AIDG Sbjct: 77 SFLRRWRLLAIDG 89 >UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3VMZ1_KLEPN Length = 421 Score = 42.7 bits (99), Expect = 0.014, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 49/147 (33%), Gaps = 12/147 (8%) Query: 96 RVG-AAPVEWLFRQ-TAQDRGAERYLKDDWHGL--QLFAIDGAQFRTPDKPELREYYGSA 151 ++G + +F Q A A D + G Q+ DG F D L ++ Sbjct: 85 KLGTPEFMRQVFEQALALHLPAMHTFSDAYRGHFKQVLLQDGTSFAVHDGLSL--HFPGR 142 Query: 152 NTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKL 211 ++ + L +L + + SE +A + + D Sbjct: 143 FSTHSPAA----VELHVTYDLEKAQPVRVSLSEDTASERDYLP--VAQSLRGCLLMADAG 196 Query: 212 FYSEDLLLTLNQKGCNRHWLLPAWKNI 238 ++S+ + +L + + +PA N Sbjct: 197 YFSKAYIESLQNEAASFVLRMPASVNP 223 >UniRef50_Q73GX2 Conserved domain protein n=5 Tax=Wolbachia RepID=Q73GX2_WOLPM Length = 143 Score = 42.4 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 52/121 (42%), Gaps = 2/121 (1%) Query: 38 TLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRV 97 S+ +R+R+L + I++++ + + + LN + SA +QAR+++ Sbjct: 11 KSSSKDFMRKRKL-SFIDIFILILRKSVKSLQVILNEFILYRKKDYTVTASAFSQARKKM 69 Query: 98 GAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKR 157 + + ++ K + G ++ A+D ++ P E++ +GS ++ Sbjct: 70 KHSAFSEINEGVVSLYYQDQKFKTCF-GFRVLALDASKIILPTSVEIKNEFGSRKIRNQK 128 Query: 158 Q 158 Sbjct: 129 P 129 >UniRef50_B5WFI6 Putative uncharacterized protein n=1 Tax=Burkholderia sp. H160 RepID=B5WFI6_9BURK Length = 256 Score = 41.6 bits (96), Expect = 0.032, Method: Composition-based stats. Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 2/33 (6%) Query: 227 NRHWLLPAWKNIASEMIELGNTASPGTIPKRLE 259 NRH+L+PA N E+I TA T+ R+ Sbjct: 5 NRHFLIPAKTNTCWEVI--SGTADDATVRMRVS 35 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P30192 Putative uncharacterized protein ychG n=8 Tax=En... 306 5e-82 UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=... 242 1e-62 UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=V... 228 2e-58 UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholde... 226 1e-57 UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholde... 225 1e-57 UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia R... 225 2e-57 UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanel... 219 1e-55 UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria R... 218 2e-55 UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_A... 217 3e-55 UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangiu... 216 9e-55 UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Coryneb... 214 3e-54 UniRef50_P03835 Transposase insG for insertion sequence element ... 213 8e-54 UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria R... 207 4e-52 UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammapro... 205 2e-51 UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC 196 7e-49 UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=St... 196 9e-49 UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_A... 195 2e-48 UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkhold... 193 4e-48 UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Ra... 186 5e-46 UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia s... 186 5e-46 UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio ... 186 8e-46 UniRef50_Q2J8F5 Putative uncharacterized protein n=3 Tax=Frankia... 184 4e-45 UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobac... 183 6e-45 UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 ... 179 9e-44 UniRef50_D2ASB5 Transposase, IS4 family n=1 Tax=Streptosporangiu... 178 2e-43 UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomyc... 178 3e-43 UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocac... 175 1e-42 UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomyc... 174 2e-42 UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=... 173 8e-42 UniRef50_UPI00016A835E hypothetical protein BoklC_27358 n=1 Tax=... 164 3e-39 UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces ... 157 4e-37 UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithio... 156 1e-36 UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminoc... 152 1e-35 UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanoth... 151 2e-35 UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastop... 149 1e-34 UniRef50_B8FEP3 Transposase IS4 family protein n=1 Tax=Desulfati... 147 4e-34 UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium... 147 4e-34 UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales Rep... 146 7e-34 UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitroco... 146 1e-33 UniRef50_UPI00016C3BAD transposase, IS4 n=2 Tax=Gemmata obscurig... 145 2e-33 UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanob... 143 8e-33 UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Ta... 143 9e-33 UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM ... 143 1e-32 UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria R... 143 1e-32 UniRef50_UPI00016C37A0 transposase, IS4 n=2 Tax=Gemmata obscurig... 141 3e-32 UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipe... 141 3e-32 UniRef50_Q82R32 Putative IS4 family ISFsp5-like transposase n=1 ... 141 3e-32 UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostri... 137 5e-31 UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacil... 136 1e-30 UniRef50_P12249 Transposase for insertion sequence element IS231... 134 3e-30 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 132 2e-29 UniRef50_UPI00016C48B0 transposase, IS4 family protein n=1 Tax=G... 131 4e-29 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 130 5e-29 UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostri... 130 5e-29 UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostrid... 130 6e-29 UniRef50_B6FVR6 Putative uncharacterized protein (Fragment) n=2 ... 129 2e-28 UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 ... 127 5e-28 UniRef50_A6CCZ3 Transposase, IS4 (Fragment) n=7 Tax=Planctomyces... 127 5e-28 UniRef50_Q3SHG4 Putative uncharacterized protein n=1 Tax=Thiobac... 125 2e-27 UniRef50_Q7TTE4 Putative uncharacterized protein n=9 Tax=Plancto... 123 6e-27 UniRef50_UPI00016C5887 hypothetical protein GobsU_05723 n=3 Tax=... 123 8e-27 UniRef50_C6N0W0 Putative uncharacterized protein n=1 Tax=Legione... 122 2e-26 UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacte... 121 2e-26 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 121 3e-26 UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_... 116 1e-24 UniRef50_A1JS05 Transposase for insertion sequence element IS166... 114 3e-24 UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q0... 114 3e-24 UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultu... 113 6e-24 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 112 2e-23 UniRef50_UPI00016C385B hypothetical protein GobsU_16554 n=3 Tax=... 112 2e-23 UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=... 111 3e-23 UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobiu... 106 1e-21 UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax... 106 1e-21 UniRef50_Q8QNB6 EsV-1-170 n=2 Tax=Ectocarpus siliculosus virus 1... 105 2e-21 UniRef50_A3ZMM8 Transposase insG for insertion sequence element-... 105 2e-21 UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula balt... 103 8e-21 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 100 1e-19 UniRef50_Q82QT3 Putative uncharacterized protein n=1 Tax=Strepto... 99 1e-19 UniRef50_Q09BD0 Isrso13-transposase protein n=1 Tax=Stigmatella ... 99 2e-19 UniRef50_C4XGQ6 Putative transposase for insertion sequence elem... 97 5e-19 UniRef50_D0SX83 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 97 7e-19 UniRef50_UPI00019668E9 transposase ISLbp1 n=1 Tax=Methanobreviba... 96 1e-18 UniRef50_B2J2I5 Transposase, IS4 family protein n=1 Tax=Nostoc p... 95 2e-18 UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfati... 95 3e-18 UniRef50_A4A0C6 Probable transposase n=2 Tax=Blastopirellula mar... 90 8e-17 UniRef50_A4A0C3 Probable transposase n=1 Tax=Blastopirellula mar... 89 2e-16 UniRef50_A5N5R2 Transposase n=6 Tax=Clostridium RepID=A5N5R2_CLOK5 88 5e-16 UniRef50_B8CMP8 Transposase OrfA, putative n=1 Tax=Shewanella pi... 84 7e-15 UniRef50_A5N172 Transposase n=4 Tax=Clostridium RepID=A5N172_CLOK5 78 3e-13 UniRef50_C8VYK7 Putative uncharacterized protein n=1 Tax=Desulfo... 77 9e-13 UniRef50_C6JHV1 Putative uncharacterized protein (Fragment) n=1 ... 74 5e-12 UniRef50_B2AJ60 Transposase, IS4 family n=4 Tax=Proteobacteria R... 73 2e-11 UniRef50_UPI0001AF03EF IS4 family transposase n=1 Tax=Streptomyc... 71 7e-11 UniRef50_Q82UV9 Putative uncharacterized protein n=1 Tax=Nitroso... 71 7e-11 UniRef50_A4BSI0 Putative uncharacterized protein n=1 Tax=Nitroco... 64 7e-09 UniRef50_B2Q345 Putative uncharacterized protein n=1 Tax=Provide... 63 1e-08 UniRef50_A1SV49 ISSod7, transposase n=1 Tax=Psychromonas ingraha... 61 4e-08 UniRef50_B9YUA6 Transposase, IS4 family protein n=3 Tax='Nostoc ... 59 2e-07 UniRef50_C0R4I1 Putative uncharacterized protein n=5 Tax=Wolbach... 57 7e-07 UniRef50_UPI00016C560B transposase IS4 family protein n=1 Tax=Ge... 56 1e-06 UniRef50_UPI000190F8A2 hypothetical protein SentesTyp_33971 n=1 ... 55 4e-06 UniRef50_Q2JF90 Putative uncharacterized protein n=1 Tax=Frankia... 54 4e-06 UniRef50_A3YGY3 Transposase and inactivated derivative n=1 Tax=M... 54 6e-06 UniRef50_A3EIG1 FOG: Transposase and inactivated derivatives n=3... 53 1e-05 UniRef50_C1DL03 Transposase inactivated derivative n=1 Tax=Azoto... 50 1e-04 UniRef50_B8CMP9 Transposase OrfB, putative n=1 Tax=Shewanella pi... 46 0.001 Sequences not found previously or not previously below threshold: UniRef50_C6JEA3 Putative uncharacterized protein n=1 Tax=Ruminoc... 101 3e-20 UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipe... 94 6e-18 UniRef50_D1N0Z4 Transposase IS4 family protein n=3 Tax=Bacteria ... 88 5e-16 UniRef50_B8FDX7 Transposase IS4 family protein n=2 Tax=Desulfati... 86 1e-15 UniRef50_Q877R2 Transposase n=51 Tax=Bacteroidales RepID=Q877R2_... 81 3e-14 UniRef50_C7PAE4 Transposase IS4 family protein n=4 Tax=Chitinoph... 77 7e-13 UniRef50_D1K7L7 Transposase n=3 Tax=Bacteroidales RepID=D1K7L7_9... 77 8e-13 UniRef50_Q0F098 ISGsu1, transposase n=6 Tax=Mariprofundus ferroo... 75 3e-12 UniRef50_C5V7Z6 Transposase IS4 family protein n=3 Tax=root RepI... 75 3e-12 UniRef50_A1HQH6 Transposase, IS4 family protein n=2 Tax=Thermosi... 74 8e-12 UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=... 73 1e-11 UniRef50_C8PSK2 ISGsu1, transposase n=1 Tax=Treponema vincentii ... 72 2e-11 UniRef50_Q093Y3 Isrso13-transposase protein n=7 Tax=Stigmatella ... 71 4e-11 UniRef50_B3E6V4 Transposase IS4 family protein n=8 Tax=Proteobac... 70 1e-10 UniRef50_Q1VPP4 ISPg4, transposase n=7 Tax=Bacteria RepID=Q1VPP4... 69 1e-10 UniRef50_C3KKH4 Putative transposase Y4ZB n=2 Tax=Rhizobium sp. ... 68 4e-10 UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID... 68 4e-10 UniRef50_C0VKK7 ISCja2 transposase n=8 Tax=Acinetobacter RepID=C... 68 5e-10 UniRef50_B3JNI1 Putative uncharacterized protein n=3 Tax=Bactero... 66 2e-09 UniRef50_Q737L2 IS231-related transposase n=3 Tax=Bacillus cereu... 65 3e-09 UniRef50_D1Q0M9 ISGsu1 transpoase n=7 Tax=Bacteroidales RepID=D1... 64 8e-09 UniRef50_B3PC11 ISCja2, transposase n=5 Tax=Proteobacteria RepID... 63 9e-09 UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC... 63 1e-08 UniRef50_C5VJA1 Transposase domain protein n=15 Tax=Prevotella R... 63 2e-08 UniRef50_Q11ZL6 Transposase, IS4 family n=22 Tax=Bacteria RepID=... 62 2e-08 UniRef50_B8FXQ3 Transposase IS4 family protein n=8 Tax=Desulfito... 62 2e-08 UniRef50_A4J2U7 Transposase, IS4 family protein n=3 Tax=Desulfot... 62 2e-08 UniRef50_C9C7H0 Transposase n=5 Tax=Enterococcus faecium RepID=C... 61 4e-08 UniRef50_Q73GX2 Conserved domain protein n=5 Tax=Wolbachia RepID... 61 5e-08 UniRef50_C6JAL6 Transposase (Fragment) n=1 Tax=Ruminococcus sp. ... 61 7e-08 UniRef50_A5II18 Transposase, IS4 n=1 Tax=Legionella pneumophila ... 61 7e-08 UniRef50_A1APW2 Transposase, IS4 family n=6 Tax=Deltaproteobacte... 60 1e-07 UniRef50_UPI0001C4271A transposase, IS4 family protein n=1 Tax=B... 59 1e-07 UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliph... 59 2e-07 UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepI... 59 2e-07 UniRef50_UPI00003C8608 transposase IS4 family protein n=4 Tax=Fe... 59 3e-07 UniRef50_C6DY52 Transposase IS4 family protein n=1 Tax=Geobacter... 58 3e-07 UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geoba... 57 1e-06 UniRef50_Q3M8C5 Transposase, IS4 n=15 Tax=Cyanobacteria RepID=Q3... 56 1e-06 UniRef50_C3AUM2 Transposase for insertion sequence element IS231... 56 2e-06 UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium m... 54 5e-06 UniRef50_Q3M9Z5 Transposase, IS4 n=10 Tax=Cyanobacteria RepID=Q3... 54 6e-06 UniRef50_C6J7R2 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 54 6e-06 UniRef50_Q55566 Putative transposase for insertion sequence elem... 54 6e-06 UniRef50_C3R0J9 Transposase n=4 Tax=Bacteroidales RepID=C3R0J9_9... 54 8e-06 UniRef50_Q4C0I4 Putative uncharacterized protein n=2 Tax=Cyanoba... 54 8e-06 UniRef50_A1BCF6 Transposase, IS4 family protein n=1 Tax=Chlorobi... 53 1e-05 UniRef50_C5EN32 Putative uncharacterized protein n=1 Tax=Clostri... 53 1e-05 UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungat... 53 2e-05 UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW 53 2e-05 UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula mar... 52 2e-05 UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3... 52 3e-05 UniRef50_C3BTW8 Transposase for insertion sequence element IS231... 52 3e-05 UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneo... 51 4e-05 UniRef50_Q07SJ1 Transposase, mutator type n=22 Tax=Bacteria RepI... 51 6e-05 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 50 8e-05 UniRef50_C0QMU6 Transposase repeat family IS4 n=1 Tax=Thermosiph... 50 1e-04 UniRef50_P55729 Putative transposase y4zB n=4 Tax=Rhizobiaceae R... 50 1e-04 UniRef50_Q05309 Transposase for insertion sequence element IS115... 50 1e-04 UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_... 49 2e-04 UniRef50_A3YV11 Transposase (Class II) n=2 Tax=Synechococcus sp.... 49 2e-04 UniRef50_D1XZ52 Transposase, IS4 family n=1 Tax=Prevotella bivia... 49 3e-04 UniRef50_C6J0N9 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 49 3e-04 UniRef50_B5ID46 Transposase, IS4 family protein n=9 Tax=Acidulip... 48 3e-04 UniRef50_B9MHR2 Transposase IS4 family protein n=1 Tax=Diaphorob... 48 5e-04 UniRef50_D2PLH1 Putative uncharacterized protein n=1 Tax=Kribbel... 47 0.001 UniRef50_Q3EKJ9 Transposase n=1 Tax=Bacillus thuringiensis serov... 47 0.001 UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium... 46 0.001 UniRef50_B3E2B5 Transposase IS4 family protein n=7 Tax=Proteobac... 46 0.001 UniRef50_B3JEV4 Putative uncharacterized protein n=1 Tax=Bactero... 46 0.001 UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae R... 46 0.002 UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosi... 46 0.002 UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuri... 46 0.002 UniRef50_Q55646 Transposase n=1 Tax=Synechocystis sp. PCC 6803 R... 45 0.003 UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4... 45 0.003 UniRef50_B7IHA7 Putative uncharacterized protein n=5 Tax=Thermos... 45 0.003 UniRef50_A8YN96 Similar to the central part of tr|Q3M9Z5|Q3M9Z5_... 45 0.003 UniRef50_B9LW44 Transposase IS4 family protein n=12 Tax=Halobact... 45 0.003 UniRef50_A7C2A8 Transposase of IS641 n=1 Tax=Beggiatoa sp. PS Re... 44 0.005 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 44 0.006 UniRef50_A1RJM4 Transposase, IS4 family n=17 Tax=Gammaproteobact... 44 0.006 UniRef50_Q9F9K7 Transposase n=2 Tax=Gammaproteobacteria RepID=Q9... 44 0.006 UniRef50_B0VT13 Transposase of ISAba6, IS982 family n=32 Tax=Aci... 43 0.011 UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae Rep... 43 0.011 UniRef50_D0J4K5 Transposase, IS4 n=16 Tax=Proteobacteria RepID=D... 43 0.011 UniRef50_A0LL64 ISGsu1, transposase n=1 Tax=Syntrophobacter fuma... 43 0.012 UniRef50_P11901 Transposase for insertion sequence element IS421... 43 0.013 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 42 0.018 UniRef50_A6UM74 Transposase and inactivated derivatives-like pro... 42 0.018 UniRef50_Q5P589 Transposase, is4 family n=3 Tax=Rhodocyclaceae R... 42 0.024 UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0... 41 0.043 UniRef50_A8KXB4 Transposase n=1 Tax=Frankia sp. EAN1pec RepID=A8... 41 0.045 UniRef50_A8UR40 Transposase n=2 Tax=Hydrogenivirga sp. 128-5-R1-... 41 0.057 UniRef50_Q00840 Transposase for insertion sequence element IS110... 41 0.065 UniRef50_D2SEZ5 Transposase IS4 family protein n=5 Tax=Actinomyc... 41 0.067 UniRef50_C9YUP4 Putative transposase (Fragment) n=4 Tax=Streptom... 41 0.072 UniRef50_B9XA94 ISPg4, transposase n=1 Tax=bacterium Ellin514 Re... 41 0.072 UniRef50_Q3STN4 Transposase, IS4 family n=13 Tax=Rhizobiales Rep... 40 0.090 UniRef50_B1Y0W6 Transposase IS4 family protein n=3 Tax=Burkholde... 40 0.100 >UniRef50_P30192 Putative uncharacterized protein ychG n=8 Tax=Enterobacteriaceae RepID=YCHG_ECOLI Length = 299 Score = 306 bits (784), Expect = 5e-82, Method: Composition-based stats. Identities = 299/299 (100%), Positives = 299/299 (100%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV Sbjct: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 Query: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK Sbjct: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 Query: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA Sbjct: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 Query: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS Sbjct: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 Query: 241 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK Sbjct: 241 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 >UniRef50_Q7MLW1 Transposase and inactivated derivative n=29 Tax=Gammaproteobacteria RepID=Q7MLW1_VIBVY Length = 445 Score = 242 bits (617), Expect = 1e-62, Method: Composition-based stats. Identities = 129/299 (43%), Positives = 184/299 (61%), Gaps = 15/299 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 M + N DF + + F+EH+P EW+ TLS AT+RRRRLP DMV+W++V Sbjct: 1 MSIQNYFADFLEESPVDVAQLTTFSEHIPDEWVAKAATLSDKATIRRRRLPSDMVLWLIV 60 Query: 61 -----QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA 115 +NE I +V RR+N+ A+G A LLA+SA+TQARQR+G A EWLFRQ + G Sbjct: 61 GMAFFRNESIAEVARRMNVCAEGLADEELLAKSALTQARQRLGKAAPEWLFRQCSHTWGL 120 Query: 116 ERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSH 175 ERY +D W GLQ+FAIDGA FRT D ELRE++GS NTS++RQ +PV+R+V +MN+ SH Sbjct: 121 ERYPEDTWQGLQVFAIDGALFRTADTSELREHFGSGNTSSERQTPHPVLRVVTMMNVRSH 180 Query: 176 ILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 ++++A +PYR+ E LA + ++PDNS+TL DK FY DLLL+L G NRHWLLPA Sbjct: 181 VIVDAAISPYRRGEIPLAMPFIDSLPDNSVTLLDKGFYGADLLLSLQNSGSNRHWLLPAK 240 Query: 236 KNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHS 294 K + +++ + + ++ ++ P+ P ++ Y V+ Sbjct: 241 KGVKFRLLDDEES-DDMLVEMKVSPQ---------ARKKNPNLPEKWQVRAVTYQVQGK 289 Score = 61.4 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 35/51 (68%), Positives = 41/51 (80%) Query: 249 ASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 SPG PKRL+ LRG L ++FI KRP+P+RPR+VKISKTRYPV AAPLK Sbjct: 395 VSPGNTPKRLKSLRGDLSILFIDKRPKPNRPRAVKISKTRYPVNRKAAPLK 445 >UniRef50_Q7MGY3 Transposase and inactivated derivative n=4 Tax=Vibrio vulnificus RepID=Q7MGY3_VIBVY Length = 441 Score = 228 bits (580), Expect = 2e-58, Method: Composition-based stats. Identities = 101/302 (33%), Positives = 153/302 (50%), Gaps = 20/302 (6%) Query: 4 LNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV--- 60 L L ++ ++ L ++I CL S AT+R+RR+P DM +W VV Sbjct: 3 LTTALTLANRYAPNTEQLGKLSDILCPDFINQCLDASGVATIRKRRIPLDMAVWAVVAMS 62 Query: 61 --QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERY 118 + EP+ +V + L G+ L+A SA+ QARQR+GA ++ +F Q+ Q E Sbjct: 63 LYRQEPLWSIVSKAQLMLPGKRS--LVAPSAIVQARQRLGADAMKEVFHQS-QSLWNETA 119 Query: 119 LKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILL 178 W GL+L A+DG +RTPD E R+ + SA+ + +P +R+V M L SH+L+ Sbjct: 120 DHPTWCGLKLLAVDGVVWRTPDTKENRDAFQSASNQNGEGS-FPQVRMVCQMELTSHMLV 178 Query: 179 NAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 + A Y+ +E +LA ++ T PD S+T+FD+ FYS LL G RHWL+P KN Sbjct: 179 ASAFASYKTNEMILAEQLIETTPDYSLTMFDRGFYSLSLLHRWANTGNERHWLMPMRKNT 238 Query: 239 AS-EMIELGNTA------SPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKT---R 288 E+ +LG + K+ L +EV I K + + S+ S T R Sbjct: 239 QFTEVRKLGRNDRIVELKTTPQARKKSLSLPETIEVRLIKKTIK-GKEVSILTSMTDHRR 297 Query: 289 YP 290 YP Sbjct: 298 YP 299 >UniRef50_D1T817 Transposase IS4 family protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1T817_9BURK Length = 448 Score = 226 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 95/276 (34%), Positives = 149/276 (53%), Gaps = 17/276 (6%) Query: 17 PPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRR 71 PP +HLP EWI++ + S A+VRRRRLP V+W+V+ +++ I++VV Sbjct: 20 PPLEWGRLGQHLPYEWIEYAVQASGSASVRRRRLPAQQVVWLVIALALYRHQSISEVVDE 79 Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 L+L+ A + +++SA+ QARQR+GAAP+ WLF ++A + A+ K + G LFA+ Sbjct: 80 LDLALPA-ADASFVSKSAIAQARQRIGAAPLAWLFHESAANWVAQDQAKHLFKGFSLFAM 138 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 DG RT D R ++G++ + R +YP +R V L L +H++ +AV PY +E + Sbjct: 139 DGTTLRTADSAANRRHFGASAAAHGRIGSYPQLRAVTLTALATHLVRDAVFGPYDINEMI 198 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASP 251 A ++A +P NSIT+FDK F S LL L G NRH+++PA N E++ G Sbjct: 199 WARELIARVPANSITVFDKGFLSAQLLCNLVSGGENRHFIIPAKANTCWEVVSGG--PGD 256 Query: 252 GTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKT 287 T+ R+ + P P + Sbjct: 257 QTVRMRVSPQ---------ARAKCPDLPEFWQARAV 283 >UniRef50_B2JV26 Transposase IS4 family protein n=9 Tax=Burkholderia RepID=B2JV26_BURP8 Length = 442 Score = 225 bits (574), Expect = 1e-57, Method: Composition-based stats. Identities = 94/276 (34%), Positives = 149/276 (53%), Gaps = 17/276 (6%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P AEHLP EWI+ + + A++RRRRLP + V+W+V+ ++ I++V+ L Sbjct: 15 PADLSRLAEHLPYEWIERAVQATGAASIRRRRLPAEQVVWLVIALAMYRHWSISEVLDSL 74 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 +L+ EA +++SAV QARQR+G AP+ WLF QTA+ + + GL L+A+D Sbjct: 75 DLALPNEA-APFVSKSAVVQARQRIGEAPMAWLFEQTARAWTTQDAAHHAFKGLSLWAMD 133 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G RTPD RE++G+ ++ + +YP +R V L + +H++ + Y +E V Sbjct: 134 GTTLRTPDSAANREHFGAQGYASGKVASYPQVRAVTLTAIPTHLVADINFGCYDTNEMVY 193 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 A S+L IPD+S+T+FDK F + ++L L G NRH+L+PA N E+I TA Sbjct: 194 AKSLLPQIPDDSLTVFDKGFLAAEILCGLTMNGRNRHFLIPAKSNTCWEVI--AGTADDA 251 Query: 253 TIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTR 288 + R+ ++ P+ P R Sbjct: 252 MVRMRVSQQ---------ARKKCPALPEFWNARAIR 278 >UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LF21_FRASN Length = 420 Score = 225 bits (573), Expect = 2e-57, Method: Composition-based stats. Identities = 67/304 (22%), Positives = 112/304 (36%), Gaps = 15/304 (4%) Query: 1 MPLLNDLLDF---SDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIW 57 M L L+ + L S + A +P + + L + R+R LP +V++ Sbjct: 1 MARLPGALELPAVTSDRLTDRISLGVLARIVPRDLVDEVLAETRRLEQRKRLLPARVVVY 60 Query: 58 MVVQNEPITD-----VVRRL----NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQ 108 + D V+RRL + + + A++QAR R+G P++ LF + Sbjct: 61 FTMAMCLFFDDDYDEVMRRLVGTLRWLGSWKGDWKVPSTGAISQARTRLGPEPLKLLFER 120 Query: 109 TAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVA 168 A +L A+DG T D PE + +G + K A+P + +VA Sbjct: 121 VAVPVAGLGTKGAWLGSRRLVAVDGVHLDTADTPENADAFGRFSHGPKT-AAFPQVHVVA 179 Query: 169 LMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNR 228 L G+H + A Y E LA ++ + D+ FY L G + Sbjct: 180 LAECGTHAVFAAAIGAYTSDERSLAATLFDACEPGMLLTADRNFYGYGLWQQALATGADL 239 Query: 229 HWLLPAWKNIAS-EMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRS-VKISK 286 W + A + + G+ S PK RG L P+ V++ + Sbjct: 240 LWRVNANLTLPVIRALPDGSYLSLLIDPKIPVARRGQLIADARAGHAPPTESALPVRVIE 299 Query: 287 TRYP 290 P Sbjct: 300 YSVP 303 >UniRef50_A6WTA0 Transposase IS4 family protein n=14 Tax=Shewanella RepID=A6WTA0_SHEB8 Length = 446 Score = 219 bits (557), Expect = 1e-55, Method: Composition-based stats. Identities = 89/306 (29%), Positives = 142/306 (46%), Gaps = 22/306 (7%) Query: 5 NDLLDFSDH----PLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 + + S+ + A+ L E IQ CL AT+RRR+LP D +IW V+ Sbjct: 4 DFFMQLSEALARTHITRLTEFTCLADVLEPELIQSCLDSQGVATLRRRKLPMDAMIWAVI 63 Query: 61 QNEPITD-----VVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA 115 ++ +L++ E +ARSAVTQAR+R+G+ + +F ++A A Sbjct: 64 GMALFRGESVRSLINKLDIVLPQEIDY--VARSAVTQARKRLGSEVIREVFSRSANTWHA 121 Query: 116 ERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSH 175 W GL L+ +DG +RTPD + + + ++ AYP +R+V LM L SH Sbjct: 122 R-AEHPHWCGLNLYGVDGVVWRTPDSVQNQAAFARTANASGE-AAYPQIRMVCLMELSSH 179 Query: 176 ILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 +L+N+ ++E LA ++ +IP++S+TLFD+ FYS LL Q + HWLLP Sbjct: 180 LLVNSAFDSVAENEMNLASQLIPSIPNHSLTLFDRGFYSLGLLHAWQQAQPDSHWLLPLK 239 Query: 236 KNIASEMIELGN-------TASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVK--ISK 286 K E++ + K+ L LE +TK + + Sbjct: 240 KGTQYEVVRTLGKHDQWVKLTTTPQARKKWPQLPDTLEARLLTKTVKGKSVAILTSLTDP 299 Query: 287 TRYPVK 292 RYP + Sbjct: 300 MRYPSE 305 >UniRef50_B9BXQ1 Transposase, IS4 family n=8 Tax=Proteobacteria RepID=B9BXQ1_9BURK Length = 446 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. Identities = 94/278 (33%), Positives = 147/278 (52%), Gaps = 18/278 (6%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P AEHLP WI+ + + A++RRRRLP + V+W+V+ ++ +++VV L Sbjct: 20 PTDLSRLAEHLPHAWIEQAIEATGTASIRRRRLPAEQVVWLVIALAIYRHWSVSEVVDSL 79 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 L E +++SAVTQARQR+G AP+ WLF QTAQ + + + GL L+A+D Sbjct: 80 ELVLPNET--TFVSKSAVTQARQRLGHAPIAWLFEQTAQAWCKQDGARHAFKGLSLWAMD 137 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G RTPD RE++GS + ++ + +YP MR V L ++ +H++ N Y +E + Sbjct: 138 GTTLRTPDSAANREHFGSQSYASGKVASYPQMRAVTLTSIPTHLVANIAFGRYDTNEMIY 197 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 A ++LA IPD+S+TLFDK F + ++L LN NRH+L+PA N E++ Sbjct: 198 AKNLLAQIPDHSLTLFDKGFLAAEILCGLNSGERNRHFLIPAKSNTRWEVL--SGKPDDA 255 Query: 253 TIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYP 290 + R+ ++ P P R Sbjct: 256 LVRMRVSPQ---------ARQKCPDLPEWWTARAVRIQ 284 >UniRef50_B6EGT0 Transposase n=20 Tax=Vibrionaceae RepID=B6EGT0_ALISL Length = 441 Score = 217 bits (553), Expect = 3e-55, Method: Composition-based stats. Identities = 81/307 (26%), Positives = 140/307 (45%), Gaps = 26/307 (8%) Query: 4 LNDLLDFSDH----PLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMV 59 L+ +D S P + + A+ LP I +L+ T+R+R+L + ++W++ Sbjct: 3 LDFFMDVSQALNIINDWKPSNVETLADLLPIHLIDEAYSLTDTVTMRKRKLTLESMVWLL 62 Query: 60 VQNE-----PITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRG 114 V + D+V +L++ G +A SA+TQ R+ +G A ++ +F + Sbjct: 63 VGMAIYNNKSMKDLVNQLDIV--DRTGKAFVAPSALTQRRKNLGEAAMKAVFERMTSSWL 120 Query: 115 AERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGS 174 W+GL L +DG +R PD + E + ++ YP +R+V M L S Sbjct: 121 K-SANLPKWNGLTLLGVDGVVWRAPDNQKNEEAFSR-----QKGTQYPQVRMVCQMELSS 174 Query: 175 HILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPA 234 H++ + Y +E +LA ++ + PD+S+T+FDK FYS LL G RHWL+P Sbjct: 175 HLITASAFDNYNTNEMILAEKLIDSTPDHSVTMFDKGFYSLGLLHKWQMTGSERHWLIPL 234 Query: 235 WKNIASEMIELGN-------TASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVK--IS 285 KN E+I S K +L + +T++ + + + I Sbjct: 235 KKNTQYEIIRSLGRNDKLVILRSNPRARKLFSNLPETMTARLVTRKIKGKDYQVLTSMID 294 Query: 286 KTRYPVK 292 RYP+K Sbjct: 295 PLRYPLK 301 >UniRef50_D0LI35 Transposase IS4 family protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LI35_HALO1 Length = 449 Score = 216 bits (549), Expect = 9e-55, Method: Composition-based stats. Identities = 92/284 (32%), Positives = 142/284 (50%), Gaps = 17/284 (5%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVR 70 PP A + EWI+ L + AT+RRRRLP + ++W+V+ ++ PIT+VV Sbjct: 14 PPPEEFSRLARDVAPEWIEQALEATGTATLRRRRLPMEQLVWLVIGMALFRDRPITEVVT 73 Query: 71 RLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 L+L+ G +A SAV QAR R+G +P+ WLF +A + D W GL L+ Sbjct: 74 SLDLALPSP-GHPEVAPSAVAQARDRLGESPMAWLFAHSADRWAHQSAADDRWRGLALYG 132 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR-QSE 189 +DG R PD E R+++G AN + + YPV+RL ALM L SH+L PY+ E Sbjct: 133 VDGTTLRVPDSEENRDHFGLANGGARGSSGYPVVRLAALMALRSHLLAAVSFGPYQGHGE 192 Query: 190 TVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTA 249 A + +PDNS+ + D+ +++ ++L+ L Q G NRHWL+ K + ++E Sbjct: 193 YWYAADLWPCLPDNSLVIVDRHYWAANVLIPLQQDGLNRHWLIRGRKGLNYRVVEQLG-P 251 Query: 250 SPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKH 293 S ++ + P PR+ + Y K Sbjct: 252 SDELAEVKVSPQ---------ARSKNPELPRTWTVRIIHYQRKG 286 >UniRef50_A4T2G5 Transposase, IS4 family protein n=10 Tax=Corynebacterineae RepID=A4T2G5_MYCGI Length = 401 Score = 214 bits (544), Expect = 3e-54, Method: Composition-based stats. Identities = 66/300 (22%), Positives = 116/300 (38%), Gaps = 16/300 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 MP SD L S + P + + + VR R LP ++ + + Sbjct: 1 MPRAGWRKPESDRRLSDLVSVGVLTRVFPPAMVDEVIEATGRTQVRHRALPARVMAYFAI 60 Query: 61 -----QNEPITDVVRRLNLSADGEAGMN----LLARSAVTQARQRVGAAPVEWLFRQTAQ 111 + DV+ +L +G L +SA+ QAR+R+G+ P+ LF + A+ Sbjct: 61 GMGLYSDGSYEDVLSQLTDGLAWASGWREQYQLPGKSAIFQARERLGSQPLAALFARVAR 120 Query: 112 DRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMN 171 GA G ++ AIDG D P E++G + ++A+P RL+A+ Sbjct: 121 PLGAADTPGTWVAGRRVVAIDGTCLDVADNPVNEEFFGRPGVNKGEKSAFPQARLLAVAE 180 Query: 172 LGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWL 231 G+H + A YR +E+ + +L + + L D+ F+S L + G + W Sbjct: 181 CGTHAIFAATIGAYRDAESTMVEHVLDALTPEMLVLADRGFFSYALWRNASDTGADLLWR 240 Query: 232 LPAWKN----IASEMIELGNTASPGTIPKR---LEHLRGALEVVFITKRPRPSRPRSVKI 284 + +N E + G+ + K L ++ R P R + Sbjct: 241 VSTGRNGPTPTHVEDLADGSWLAHLRAAKDRHGEPMLARVIDYTVDDGRDNPVAYRLLTT 300 >UniRef50_P03835 Transposase insG for insertion sequence element IS4 n=377 Tax=root RepID=INSG_ECOLI Length = 442 Score = 213 bits (541), Expect = 8e-54, Method: Composition-based stats. Identities = 89/300 (29%), Positives = 129/300 (43%), Gaps = 18/300 (6%) Query: 5 NDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQN-- 62 LD ++L E I CL S T+R+RRLP +M++W +V Sbjct: 4 GQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMAL 63 Query: 63 ---EPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYL 119 EP+ +V RL++ G +A SAV QARQR+G+ V +F +TAQ Sbjct: 64 ERKEPLHQIVNRLDIMLPGNR--PFVAPSAVIQARQRLGSEAVRRVFTKTAQLWHN-ATP 120 Query: 120 KDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLN 179 W GL L AIDG +RTPD PE + T YP +++V M L SH+L Sbjct: 121 HPHWCGLTLLAIDGVFWRTPDTPENDAAFPR-QTHAGNPALYPQVKMVCQMELTSHLLTA 179 Query: 180 AVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA 239 A + SE LA ++ DN++TL DK +YS LL + G +RHW++P K Sbjct: 180 AAFGTMKNSENELAEQLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQ 239 Query: 240 SEMIELGN-------TASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVK--ISKTRYP 290 E I + K+ L + +T + + R+P Sbjct: 240 YEEIRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFP 299 >UniRef50_D2TH14 ISCro6 transposase n=8 Tax=Gammaproteobacteria RepID=D2TH14_CITRO Length = 438 Score = 207 bits (527), Expect = 4e-52, Method: Composition-based stats. Identities = 89/290 (30%), Positives = 139/290 (47%), Gaps = 18/290 (6%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRL 72 P S F +P EWI L + A++R+R+LP ++V+W++V ++ ITDVV +L Sbjct: 15 PASLSCFQRAIPLEWISQVLDSTNKASIRKRKLPAELVVWLIVGMGLYRDRSITDVVTKL 74 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 +L + G LA S+V +ARQR+ P+ LF TA + D W+GL+LFA+D Sbjct: 75 DLVLSSQEG-ETLAASSVARARQRLSDEPLRELFTLTASHWTQQEDKDDLWYGLRLFAVD 133 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G FRTPD PEL E++ R YP++RL A+M+L S ++ P E Sbjct: 134 GTLFRTPDTPELAEHFEYIKHRPDRHTEYPMVRLCAMMSLRSRLIHGVKFGPANTGEVSY 193 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE---LGNTA 249 A + S+TLFD+ + S +LL+ ++ HWL+P N ++E G+ Sbjct: 194 AKQLSPQ--AKSLTLFDRCYLSAELLINWQRRQQEAHWLVPLKGNTKYRIVETFAGGDHL 251 Query: 250 SPGTI----PKRLEHLRGALEVVFITKRPRPSRPRSVKISKT---RYPVK 292 + K+ L + I + S T +YP + Sbjct: 252 VEMQVSPQARKQDSSLPENWQARLIEYEDESGDYKGFITSLTEPGQYPAE 301 >UniRef50_C6CF98 Transposase IS4 family protein n=20 Tax=Gammaproteobacteria RepID=C6CF98_DICZE Length = 441 Score = 205 bits (520), Expect = 2e-51, Method: Composition-based stats. Identities = 81/271 (29%), Positives = 128/271 (47%), Gaps = 18/271 (6%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADG 78 F+ L WI L A++RRRRLP + +W+V+ ++ I DV L++ Sbjct: 20 FSRSLDPAWIHQALNACHKASIRRRRLPAEQAVWLVLMMGLLRDLSIKDVCHHLDIVLQP 79 Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRT 138 + G LA S +T ARQR+G AP+ +LF + D +HGL + ++DG FRT Sbjct: 80 DEGYQPLAPSVLTAARQRLGEAPLRYLFHACNEGWLPTVLGSDTFHGLHVLSVDGTLFRT 139 Query: 139 PDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLA 198 PD P+ +G + +P +R+V LM SH+LL+A + E LAH +++ Sbjct: 140 PDSPDNAAAFGFIDPVHGT---FPQVRMVGLMATHSHMLLDAAFGGVAEGELTLAHRLVS 196 Query: 199 TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRL 258 + PD+S+TLFD+ ++S LL Q G HWL P + + +IE + I + Sbjct: 197 SAPDHSLTLFDRCYFSASFLLEWRQAGVETHWLTPVKRKLRYRVIERYS-DYDMLIEMPV 255 Query: 259 EHLRGALEVVFITKRPRPSRPRSVKISKTRY 289 ++ P P + Y Sbjct: 256 SPQ---------ARKAAPHLPAVWQARMVSY 277 >UniRef50_Q2JAY9 Transposase, IS4 n=2 Tax=Frankia RepID=Q2JAY9_FRASC Length = 412 Score = 196 bits (498), Expect = 7e-49, Method: Composition-based stats. Identities = 67/282 (23%), Positives = 110/282 (39%), Gaps = 22/282 (7%) Query: 11 SDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-------QNE 63 + L + + P E + L ++ A VRRR LP +V++ V+ +N Sbjct: 11 PEGWLPDRVTVGVLTRVYPPELVDRVLAVTDTAEVRRRLLPSWLVVYFVLALWLFRGRNC 70 Query: 64 PITDVVRRLNLSADGEA-------------GMNLLARSAVTQARQRVGAAPVEWLFRQTA 110 V+ RL + G +L A ++ +AR R+G+ PV LF A Sbjct: 71 GYVQVLARLTSGLHFQRRAAVLAAGGAGGAGWSLPASPSLGEARARIGSDPVRMLFEHAA 130 Query: 111 QDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALM 170 G E HGL+L IDG+ PD R ++ + +P +R V Sbjct: 131 GPVGVEGQAGVFLHGLRLVQIDGSTCDLPDTQANRAFFPGPSN-AGGPAPFPKVRWVIAA 189 Query: 171 NLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW 230 + LL A P+ E LA +L + +TL D+ F S L + G + W Sbjct: 190 EAATGALLGASFGPWSTGEPALARDLLGQLGPGMLTLADRNFLSHRLAGEVLATGAHLLW 249 Query: 231 LLPAWKNI-ASEMIELGNTASPGTIPKRLEHLRGALEVVFIT 271 A + +++ G+ + T P+ E + V+ T Sbjct: 250 RAKATFTLAPVHVLDDGSYLAELTPPRGSEGPPLTMRVIEYT 291 >UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4D726 Length = 464 Score = 196 bits (497), Expect = 9e-49, Method: Composition-based stats. Identities = 62/274 (22%), Positives = 108/274 (39%), Gaps = 11/274 (4%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEA 80 + +P + L + R R LP +V++ V+ D R + A A Sbjct: 1 MGVLTRWVPPVLVDEVLAATGRFEKRVRMLPARVVVYFVLAMTLFGDCGYR-GVWAALTA 59 Query: 81 GMNL-----LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQ 135 GM + +A+ QAR+R+G AP+ LF + G + WHGL++ A DG Sbjct: 60 GMPGHLVPDPSAAALRQARRRLGTAPLALLFDRVCGPVGTKETPGVFWHGLRVVAWDGTS 119 Query: 136 FRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHS 195 D +YG +T R YP +RL AL+ G+ L+ AV P E A Sbjct: 120 VEVADSAANVAHYGRHGKATSRPAGYPQVRLTALVECGTRALMGAVFGPMHDKELPQARR 179 Query: 196 MLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIP 255 +L + + L D+ + + + G + W + + ++ + G+ Sbjct: 180 LLPVLRPGILLLADRGYDGYEAIRDAASTGADLLWRVQSG-----RLLPVIQPLPDGSHL 234 Query: 256 KRLEHLRGALEVVFITKRPRPSRPRSVKISKTRY 289 ++ R + +R RP+ P ++ R Sbjct: 235 SQILDRRSGDRLAAWQRRKRPTPPPALTAMAVRV 268 >UniRef50_D0SHM1 Transposase n=3 Tax=Acinetobacter RepID=D0SHM1_ACIJO Length = 443 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 81/270 (30%), Positives = 129/270 (47%), Gaps = 14/270 (5%) Query: 2 PLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV- 60 + + S PS F+E + WI+ CL + A+VR+R+LP + +W+V+ Sbjct: 9 FFMTFSQNLSSAFQQTAPSLSNFSELIDLNWIEDCLKRTGKASVRKRKLPAEHAVWLVIG 68 Query: 61 ----QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAE 116 +++PI VV++L L A SA QARQR+G P+ LF +Q + Sbjct: 69 LALFRDQPIWYVVQQLQLVF---GTAESCAPSASVQARQRLGLEPLNVLFNTLSQTWFED 125 Query: 117 RYLK-DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSH 175 + +HGL + A+DGA + P E ++GS+ T +P R V L+N +H Sbjct: 126 SQPQYSAFHGLSICAVDGAVWSMPHTDENFRHFGSSKGKT-IAAPWPQARAVCLINTNTH 184 Query: 176 ILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 +++A Q E LA + +P NS+TLFD+ ++S D L + N HWL+ A Sbjct: 185 EVIDAGIGSMDQGELTLAKKL--KVPANSLTLFDRAYFSADFLSGWQSR-ENCHWLMRAK 241 Query: 236 KNIASEMIELGNTASPGTIPKRLEHLRGAL 265 N+ E+I N+A I + L Sbjct: 242 DNLRYEIIR-KNSAHDFQIRMPVSPRAKKL 270 >UniRef50_A4JGL4 Transposase, IS4 family protein n=3 Tax=Burkholderiaceae RepID=A4JGL4_BURVG Length = 402 Score = 193 bits (491), Expect = 4e-48, Method: Composition-based stats. Identities = 62/244 (25%), Positives = 109/244 (44%), Gaps = 12/244 (4%) Query: 19 PSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLN 73 SA + A P I+ L + A+ R R LP V++ V+ + P+ +V+R + Sbjct: 21 ISAGVLASVCPRTLIEEVLAETGKASQRERLLPAPAVVYYVMALALWREAPLEEVLRVVC 80 Query: 74 LSADGEAG----MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLF 129 G ++SA++QAR R+G + L + + A + GL++ Sbjct: 81 EGLQWLGGGHTEAVQASKSAISQARSRLGPEVMRQLADRVLRPLAAPGAPGAWYRGLRVM 140 Query: 130 AIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSE 189 A+DG+ D+ +++G S Q+A+P R++ L+ G+H ++ A APY SE Sbjct: 141 ALDGSCMDVADEAANAKFFGYPGASRG-QSAFPQARVLGLVECGTHAVVAAGIAPYGHSE 199 Query: 190 TVLAHSMLA-TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE-MIELGN 247 V+A +L + + L D+ FY L T G W + + + E M+ G+ Sbjct: 200 QVMAAQLLPAKLTPEMLVLADRNFYGFKLWQTACATGAKLAWRVKSNLKLPVEQMLPDGS 259 Query: 248 TASP 251 S Sbjct: 260 YLSR 263 >UniRef50_UPI0001C16028 hypothetical protein CRD_01775 n=2 Tax=Raphidiopsis brookii D9 RepID=UPI0001C16028 Length = 465 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 59/264 (22%), Positives = 116/264 (43%), Gaps = 19/264 (7%) Query: 15 LMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQN-----EPITDVV 69 L P ++ +P++ I + + + R R LP +++ +V+ + I DV Sbjct: 13 LNPQQIFLALSQVIPSQTITKAIESTCSSQRRLRILPTYIIVTLVIAMSFWSSDSIVDVF 72 Query: 70 RRL-----NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWH 124 + L +L + + S++T+ARQR GAA + LF A+ Sbjct: 73 KNLIHGLSSLHIPSGLRLQTPSASSITEARQRTGAAVMRRLFELVAKPLATILTPGAFLG 132 Query: 125 GLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAP 184 L++ A+DG F PD +G + +P +RLV L+ G+H++++A P Sbjct: 133 ELRIMAVDGTVFDVPDTSTNARVFGYPGSPKGTYPGFPKVRLVFLVEAGTHLIIDAFCYP 192 Query: 185 YRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 YR E A +L +I + + ++D+ +S ++ T+ ++ N +P N+ ++++ Sbjct: 193 YRMGERRGALKLLRSINSSMLLMWDRGLHSFKMVHTVIKQQGNFLGRVPG--NVKFQVVK 250 Query: 245 ---LGNTAS----PGTIPKRLEHL 261 G+ S G K+ Sbjct: 251 TLADGSYLSWIAPDGQSRKKGAKR 274 >UniRef50_A8L1S1 Transposase IS4 family protein n=2 Tax=Frankia sp. EAN1pec RepID=A8L1S1_FRASN Length = 425 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 59/291 (20%), Positives = 99/291 (34%), Gaps = 12/291 (4%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSA-HATVRRRRLPGDMVIWMVVQNEPITD------- 67 S + +P + + + A +LP + ++ + D Sbjct: 22 PDQVSVGVLVTAVPRDAVDEAVAACGVGARRAGGKLPPHVTAYLTLAMSLFPDDDYAEVA 81 Query: 68 --VVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHG 125 V L+ +A + S +TQAR+R+G + +F + A G Sbjct: 82 QKVTGSLDRFGCWDAAWAPPSASGITQARKRLGRMVMAEVFERVAGQVATLSTRGAWLRG 141 Query: 126 LQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY 185 L AIDG PD E +G A T KR +A+P +R+VAL G+H A + Sbjct: 142 RLLLAIDGFDVDVPDTEENAAEFGYAGTGEKR-SAFPKIRVVALAECGTHAFRAAEVGGW 200 Query: 186 RQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA-SEMIE 244 E LA +L + + + D+ FYS D G + W P N+ ++ Sbjct: 201 AAGERTLARGLLMRLNRDEVLTADRGFYSFDNWALAAGTGADLIWRAPTGLNLPVVRVLS 260 Query: 245 LGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSA 295 G + P+ R + + Y + A Sbjct: 261 DGTFLTVLINPEITGGRRRERLLAAAKAGDELDPDEAHLARVVEYDIPDRA 311 >UniRef50_B2LS82 Putative uncharacterized protein n=3 Tax=Vibrio RepID=B2LS82_9VIBR Length = 440 Score = 186 bits (472), Expect = 8e-46, Method: Composition-based stats. Identities = 70/250 (28%), Positives = 129/250 (51%), Gaps = 17/250 (6%) Query: 4 LNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV--- 60 L +F + +F +H+P EW++ + + ++R+RRLP + +W+V+ Sbjct: 8 LQVTAEFCHERPID-----VFNKHIPWEWVEEAVQQTGRVSLRKRRLPAEQAVWLVLGIG 62 Query: 61 --QNEPITDVVRRLNLSADGEAG-MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAER 117 +N I DV +L L+ G + +A S++ + ++R+G P+ +LF+ TAQ + Sbjct: 63 LQRNRSIQDVCDKLELAFPDVDGELTPMATSSIIKGKERLGDKPMRYLFKTTAQQWEQQS 122 Query: 118 YLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHIL 177 D+ GL+L ++DG F+T + E + ++G A + ++P + V LM+ SH++ Sbjct: 123 D-FDEVCGLKLLSVDGTYFKTHNTEENQ-HFGFAQ----KGASFPSVLAVTLMSTRSHLV 176 Query: 178 LNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN 237 +A P SE A ++ + PD+S+TLFD+ F S +L + N HWL P Sbjct: 177 SDAAFGPVTNSEISYAQQLVGSAPDDSLTLFDRGFTSAELFTSWQGASSNSHWLTPIKTK 236 Query: 238 IASEMIELGN 247 + ++IE Sbjct: 237 MRYDIIESYT 246 >UniRef50_Q2J8F5 Putative uncharacterized protein n=3 Tax=Frankia sp. CcI3 RepID=Q2J8F5_FRASC Length = 451 Score = 184 bits (466), Expect = 4e-45, Method: Composition-based stats. Identities = 62/291 (21%), Positives = 103/291 (35%), Gaps = 15/291 (5%) Query: 15 LMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRR-RRLPGDMVIWMVVQNEPITD-----V 68 L S + +P + + + + R +P +V + V+ D V Sbjct: 12 LTDWISLGVLTSFVPRDAVDEAIEATGAGARRSDTTIPPQVVAYFVMALALFADDDYETV 71 Query: 69 VRRLNLSADGEAGMNL---LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHG 125 RRL + + S +T+ARQR+GAAP+ LF Q A + Sbjct: 72 ARRLAATLTDLDVVGPRWEPTSSGLTKARQRLGAAPLAELFGQVAGPVADLDTVGAFLSR 131 Query: 126 LQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY 185 +L +IDG ++ P E +G P +R V + SH + A P Sbjct: 132 WRLMSIDGLEWDAPASKENIAAFGLPAGRVDAPGVLPKVRAVTVSECASHAPVLAAFGPA 191 Query: 186 R----QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI-AS 240 SE LA ++ + + + L D+ FYS T G W + A + Sbjct: 192 GGAKPASEQALARTVYPRLASDWLLLADRNFYSWADWCTAADTGAALLWRVKATLRLPPL 251 Query: 241 EMIELGNTASPGTIPKRLEHLRGALEVVFITKRP-RPSRPRSVKISKTRYP 290 + G+ + PK R L P P++ R ++ + P Sbjct: 252 RALSDGSYLTVLVNPKVTGKARETLVTAARAGAPLDPTKARYTRLVEYDVP 302 >UniRef50_C5T3Q2 Transposase IS4 family protein n=4 Tax=Proteobacteria RepID=C5T3Q2_ACIDE Length = 436 Score = 183 bits (464), Expect = 6e-45, Method: Composition-based stats. Identities = 83/271 (30%), Positives = 130/271 (47%), Gaps = 15/271 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 M LL L+ + L P + + L WI L + A++RRR+LP + +W+V+ Sbjct: 1 MSLLQTTLNETLETL-PANAIAELSALLDPAWIAQALQATGKASMRRRKLPAEHAVWLVI 59 Query: 61 -----QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA 115 ++ P+ VV+ + L+ D G L A S Q RQR+GA P+E +F A G Sbjct: 60 GLALFRHMPLWQVVQEMALTLD---GQELPAPSVSVQVRQRLGAEPMEHMFGLLANAWGR 116 Query: 116 ERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSH 175 + L++ A+DG + PD + R+ GS T Q +P++R V L++ SH Sbjct: 117 AHAVHA--GALRVLAVDGVAWSAPDSKDNRQELGSGQTQYGPQ-PWPMVRAVCLLDTDSH 173 Query: 176 ILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 LL+A Y E LA + D+SITLFD+ ++S LL +Q G RHWL+ A Sbjct: 174 ELLDAQLGDYGCGELTLAADLHGL--DHSITLFDRAYFSAAFLLAWSQAGQQRHWLMRAK 231 Query: 236 KNIASEMIELGNTASPGTIPKRLEHLRGALE 266 N+ E+++ + I + L Sbjct: 232 DNLRYEVVQTLD-EGDWLIRMPVSPRARKLH 261 >UniRef50_Q82R31 Putative IS4 family ISFsp6-like transposase n=2 Tax=Streptomyces avermitilis RepID=Q82R31_STRAW Length = 542 Score = 179 bits (454), Expect = 9e-44, Method: Composition-based stats. Identities = 61/302 (20%), Positives = 108/302 (35%), Gaps = 13/302 (4%) Query: 1 MPLLNDLLDFSD-----HPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMV 55 MP+ + + + P + LP E + L + A R R LP + Sbjct: 1 MPVQCSTVTLTSSITVADGIFAPGHLGELTQQLPFELVDDVLERAGGAQHRLRLLPSRVG 60 Query: 56 IWMVVQNEPITDV--VRRLNLSADGEAGM--NLLARSAVTQARQRVGAAPVEWLFRQTAQ 111 ++ V+ + VR + G G+ + A+ + R+R+G AP+ LF A Sbjct: 61 VYFVLALALFPQLGYVRVWDKLTAGLRGILHRRPSEKALREVRRRLGVAPLRLLFETLAG 120 Query: 112 DRGAERYLKDDWHGLQLFAIDG-AQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALM 170 + + A DG + + PD+P + + G YP+++++ L Sbjct: 121 PVAQPITPGVRYRCWRTVAFDGCSSTKAPDRPRVCAWLGKHKHRYGTD-GYPMLKIMVLC 179 Query: 171 NLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW 230 G+ LL AV P + ET A +L + + L D+ F S+D L G Sbjct: 180 ETGTRALLGAVFGPTPEKETGYAEQLLPLLDGGMLLLNDRGFDSDDFLAKAAATGAQLLV 239 Query: 231 LLPA-WKNIASEMIELGNTASPGT-IPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTR 288 L ++ G+ + R+ A+ + R + R Sbjct: 240 RLKGTRTPARWALLPDGSFLTRINGTRLRVIDAHIAVTTAKGLRLEGHYRLATTLTDHRR 299 Query: 289 YP 290 YP Sbjct: 300 YP 301 >UniRef50_D2ASB5 Transposase, IS4 family n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2ASB5_STRRD Length = 356 Score = 178 bits (451), Expect = 2e-43, Method: Composition-based stats. Identities = 61/292 (20%), Positives = 112/292 (38%), Gaps = 22/292 (7%) Query: 12 DHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD---- 67 D L S P + + +S A RRR LP + I+ V+ +D Sbjct: 7 DGRLADQLSIGFLTSVFPISLLDEVIGVSGCAERRRRALPARLTIYYVLALCLFSDKNYD 66 Query: 68 -----VVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDD 122 ++ L + + SA+++AR R+GA P+ LF + + + Sbjct: 67 QVMRLLLNGLAWRSRWVYTWEPPSASAISRARARLGAEPLRVLFCRVTGPVAEPQASRSW 126 Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 GL+ +DG P+ + +G + + + +P +R+VA+ G+H L++A Sbjct: 127 LAGLRPVTMDGTTLVVPETRDN-SAFGYPDGAAR----FPCVRVVAVAENGTHALIDATF 181 Query: 183 APYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA-SE 241 E LA +L + + + L + +L + G + W + + Sbjct: 182 GSSAVEERTLARRLLRCLESDMLLLARSGRWGFELWRQAAETGTHLLWGVTGADALPIGR 241 Query: 242 MIELGNTASP----GTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRY 289 E G+ S G P R+ L GA IT P + + +++ RY Sbjct: 242 SFEDGSYLSRPAGLGGAPLRVIPLPGA--EWLITTLVDPGQASASELAA-RY 290 >UniRef50_A8M893 Transposase IS4 family protein n=3 Tax=Actinomycetales RepID=A8M893_SALAI Length = 451 Score = 178 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 61/282 (21%), Positives = 101/282 (35%), Gaps = 10/282 (3%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRR--LN 73 +P E I L + R R LP +V+++++ D R Sbjct: 17 FAAGHLGELTRLVPFEMIDDVLAATRRTQRRVRLLPARVVVYLLLAGCLFADCGYRQVWA 76 Query: 74 LSADGEAGMNL--LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 G G+ + + SA+ QARQR+G AP+ LF W GL + Sbjct: 77 KLVAGLRGLPVADPSDSALRQARQRLGPAPLRALFDLLRGPAATSAVAAVRWRGLLPVVV 136 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 DG D P YG + + YP +RL AL+ G+ +++AV P E Sbjct: 137 DGTMIAVADSPANLGRYGKHRCNNG-GSGYPTLRLSALLTCGTRSVIDAVFDPSTTGEIT 195 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS-EMIELGNTAS 250 AH + ++ + L D+ + + DL+ G + + + + G+ S Sbjct: 196 QAHRLTRSLRAGMLLLADRNYAAADLIGAFTATGADLLIRCKSGRKLPMTRRCRDGSWLS 255 Query: 251 PGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKI--SKTRYP 290 I + + A + T R + RYP Sbjct: 256 --VIDGQPVRIIEARISITTTAGSHTGDYRLITTLLDPRRYP 295 >UniRef50_B2J1G3 Transposase, IS4 family protein n=6 Tax=Nostocaceae RepID=B2J1G3_NOSP7 Length = 381 Score = 175 bits (444), Expect = 1e-42, Method: Composition-based stats. Identities = 64/232 (27%), Positives = 110/232 (47%), Gaps = 15/232 (6%) Query: 44 TVRRRRLPGDMVIWMVVQN-----EPITDVVRRL-----NLSADGEAGMNLLARSAVTQA 93 R+R LP +V+ +V+ + + DV++ L + +SA+TQA Sbjct: 12 EERKRSLPAQLVVSLVIAMSLWSKDSMRDVLKNLIDGLSEAWLKVGKYWRVACKSAITQA 71 Query: 94 RQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANT 153 RQR+GA + LF Q + + L L++ IDG+ F PD E +G + Sbjct: 72 RQRLGARVMCKLFHQLVKPMATQETLGAFLQELRIVVIDGSCFDVPDSDENARVFGRPGS 131 Query: 154 STKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFY 213 + A+P +RLV L+ G+HI+ +A+ PYR E V A +L ++ + ++D+ + Sbjct: 132 RPGTKAAFPKVRLVILVEAGTHIIFDALMWPYRIGERVRALRLLRSVTPGMLLMWDRGLH 191 Query: 214 SEDLLLTLNQKGCNRHWLLPAW-KNIASEMIELGNTAS----PGTIPKRLEH 260 S ++ KGC+ +PA K IA + +E G+ S G + K+ Sbjct: 192 SYAMVQATVTKGCDYLGRIPANIKFIAEKPLEDGSYLSWIYPSGKLRKKASQ 243 >UniRef50_A8KXP7 Transposase IS4 family protein n=2 Tax=Actinomycetales RepID=A8KXP7_FRASN Length = 421 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 56/278 (20%), Positives = 104/278 (37%), Gaps = 16/278 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 MP + + + L+ + A P + + + R R LP + + VV Sbjct: 1 MPRRGQVKEKPEDRLVDRVGLGVLAAQFPDALVDRVVAETGRRERRTRDLPAALTLRYVV 60 Query: 61 QNE-----PITDVVRRL----NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQ 111 +V+R++ + +D + + A +A+T+AR R+G PV+ LF +TA Sbjct: 61 ALALFPSDGYDEVMRQVKVADDWLSDKAGPVKVPATTAITKARDRLGVEPVKLLFERTAV 120 Query: 112 DRG-AERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALM 170 R + + G ++ +DG PD E +G P +R++ L+ Sbjct: 121 PMALPRRTVGAFYRGWRVCTVDGTTLLVPDTDENAAAFGKPGNDQGEGA-LPQVRVLGLV 179 Query: 171 NLGSHILLNAVTA----PYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGC 226 G+ LL A SE L +L + + L D+ F +L G Sbjct: 180 ECGTRALLGAGFGGTGGSKAASEQALFPDLLGALRPGMLVLADRNFLGFELFAKAAATGA 239 Query: 227 NRHWLLPAWKNIASE-MIELGNTASPGTIPKRLEHLRG 263 + W + + + + + G+ S P + R Sbjct: 240 DLLWRAKSDRRLPIDTELADGSYLSHLVEPGTRDKGRK 277 >UniRef50_Q648P8 Transposase n=2 Tax=environmental samples RepID=Q648P8_9ARCH Length = 464 Score = 173 bits (437), Expect = 8e-42, Method: Composition-based stats. Identities = 56/285 (19%), Positives = 104/285 (36%), Gaps = 27/285 (9%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD------VVRRLNLSAD 77 F++ L E I++ + + R R + + + +D V + L Sbjct: 31 FSDVLSAETIRNIMDEE-VGSYRDRIYSPLITLSAFLSQVLSSDHSCKNAVAKVLAERVA 89 Query: 78 GEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFR 137 + +AR R+ V L R+T + + W G + +DG Sbjct: 90 QGKLPCSSNTKSYCEARLRLPINLVRRLVRETGKLLHLKSEEAWKWKGRSVKLVDGTTVS 149 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAH 194 PD PE ++ Y K +P+ RLVA+++L +L+ PY+ E L Sbjct: 150 MPDTPENQKMYPQPEG-QKEGVGFPIARLVAIISLSCGAVLDIAIGPYKGKETGEHALLR 208 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTI 254 +L +I I L D+ + S L++ L Q G + + + + Sbjct: 209 QILGSISTGDILLGDRYYCSYFLIVMLQQLGADSVFRIHGSRKKDFR------------- 255 Query: 255 PKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 R +HL G + + I K+P+ + + P + +K Sbjct: 256 --RGKHL-GKKDHIVIWKKPKQRPNWMTESMYLQMPDTLTIREIK 297 >UniRef50_UPI00016A835E hypothetical protein BoklC_27358 n=1 Tax=Burkholderia oklahomensis C6786 RepID=UPI00016A835E Length = 231 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 71/250 (28%), Positives = 121/250 (48%), Gaps = 46/250 (18%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVR 70 PP + +HLP EWI+H + S A+VRRRRLP V+W+V+ +++ I++VV Sbjct: 19 NPPLELERLGQHLPYEWIEHAVQASGSASVRRRRLPAQQVVWLVIALALYRHQSISEVVD 78 Query: 71 RLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 L+L+ + +++SA+ QA+QR GA+P+ WLF ++A+ +W G + Sbjct: 79 ELDLALPAP-DTSFVSKSAIAQAKQRTGASPLAWLFHESAR----------NWVGQDI-- 125 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +YP + V L + + ++ +A PY +E Sbjct: 126 ----------------------------GSYPQLHAVTLTAIATRLVRDAGFGPYDINEM 157 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTAS 250 + A ++ +P+N IT+FDK F S LL L G NRH+++PA N E+ +++ Sbjct: 158 IWARELIPRVPENPITVFDKGFLSAQLLCNLVAGGQNRHFIIPARSNPRGEISRRPTSST 217 Query: 251 PGTIPKRLEH 260 + R Sbjct: 218 ATNVAGRSRP 227 >UniRef50_C1ZMB0 Transposase family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZMB0_PLALI Length = 497 Score = 157 bits (397), Expect = 4e-37, Method: Composition-based stats. Identities = 48/243 (19%), Positives = 85/243 (34%), Gaps = 16/243 (6%) Query: 14 PLMPPPSAQLFAEHLPTEWI----QHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVV 69 P + + E E + C++ A + +W ++ TDV Sbjct: 28 PFSDALTTRQLEEVFEAEEVSFGRDPCVSEQASIEDGGLVYTRGVTLWAMLSQALFTDVQ 87 Query: 70 RRLNLSADGEAGM--------NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKD 121 R + A + A +AR ++ V+ L Q A K Sbjct: 88 RACRAAVQRVAVYYALSGIRISSTNTGAYCRARAKIPEGVVQRLAVGVGQRCEAAVPDKW 147 Query: 122 DWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV 181 WHG + IDG PD E + Y ++ + +P++R VAL +L + ++L V Sbjct: 148 RWHGFRTLVIDGTTCSMPDTQENQAEYPQPSS-QGKGLGFPILRAVALTSLATGMILALV 206 Query: 182 TAPY---RQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 T P ET L ++ + + L D+ + +L L + G L ++ Sbjct: 207 TGPCAGKATGETALFRTLFDQLKAGDLVLSDRYYGGWFMLALLQELGVEFVTRLHQFRIA 266 Query: 239 ASE 241 Sbjct: 267 DFH 269 >UniRef50_B5EK95 Transposase IS4 family protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EK95_ACIF5 Length = 369 Score = 156 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 54/209 (25%), Positives = 93/209 (44%), Gaps = 12/209 (5%) Query: 53 DMVIWMVVQNEPITDVVRR--LNLSADGEAGM--------NLLARSAVTQARQRVGAAPV 102 +++++ V+ V L L DG + ++++ A++QAR +VGAAP+ Sbjct: 19 EVLVYFVLAMVLYASVAYEEVLQLVVDGLRPLLGDDRLAQTVVSKGAISQARAKVGAAPL 78 Query: 103 EWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYP 162 + L++ Q G + GL+L AIDG+ PD+ E +G +S A+P Sbjct: 79 KTLYQNQVQPHGPLGMAGVGYKGLRLMAIDGSTLDMPDEAANAERFGYPASSRG-SAAFP 137 Query: 163 VMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLN 222 +R VA+ G+H L A Y QSE LA ++A + D+ FYS Sbjct: 138 QLRFVAMAECGTHTLCYAEMGSYEQSERTLAGPVMAHADATMLITADRNFYSYAFWQQSL 197 Query: 223 QKGCNRHWLLPAWKNI-ASEMIELGNTAS 250 G + L + + +++ G+ S Sbjct: 198 ATGARLLFRLSSVLKLPREKILADGSYLS 226 >UniRef50_A5KKC4 Putative uncharacterized protein n=1 Tax=Ruminococcus torques ATCC 27756 RepID=A5KKC4_9FIRM Length = 422 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 56/242 (23%), Positives = 102/242 (42%), Gaps = 12/242 (4%) Query: 25 AEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMN- 83 + L I + T S +L +I+ V+Q+ + + + + + Sbjct: 25 EDFLTRHRIGNAFTRSG-------KLSFSNLIYFVLQSVHKSIPINYARFLENFPSDLPI 77 Query: 84 LLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPE 143 +++ A+++ARQ + LFR + + + W+G ++A+DG+ + P+ E Sbjct: 78 FVSKQAISKARQGISHKAFLELFRLSVKQFYFQPVNLRTWNGFHIYAVDGSTIQIPESKE 137 Query: 144 LREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP-- 201 E +G TK + P+ L ++ + IL++ PYR +E A + + +P Sbjct: 138 NYEVFGGNPNKTKIIS--PLASASVLYDVINDILIDVSLHPYRYNERESAKAHVDFLPRF 195 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHL 261 NSI LFD+ + SED+ LN KG +P A E P + K L Sbjct: 196 PNSIILFDRGYPSEDMFHYLNSKGILFLMRVPKTFKKAISEQEDALFTYPASCNKESLTL 255 Query: 262 RG 263 R Sbjct: 256 RS 257 >UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IS08_9CHRO Length = 472 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 49/273 (17%), Positives = 100/273 (36%), Gaps = 10/273 (3%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD------VVRRLNLSAD 77 F + L +E I+ L + R ++IW + D V R ++ A Sbjct: 23 FQKLLKSEIIEDILKEMG-VKYKSRIYNPIVIIWSFLSQVLDPDHSCQNAVSRIISYLAS 81 Query: 78 GEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFR 137 SA QAR+++ ++ L +A+ + K WHG + +IDG+ Sbjct: 82 EGIETPSENTSAYCQARKKLPEELLKKLLEISAKGNEEKVDKKHLWHGRCVKSIDGSTVS 141 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 PD + +E Y + K+ +P+ ++ L + + ++ V ++ + LA + Sbjct: 142 MPDSLKNQEAYPQHGS-QKKGCGFPLAKIGVLFSYATGSVVGIVIDIFKTHDIKLARKLT 200 Query: 198 ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKR 257 + I L D+ F S + + +KG + L + + + P K+ Sbjct: 201 DYLDAGDILLGDRAFCSYIDIYSWKKKGIDSVMRLHQGRLQKGKKRPKYTVSPPFKKKKK 260 Query: 258 LEHLRGALEVVFITKRPRPSRPRSVKISKTRYP 290 + + ++P+ K P Sbjct: 261 TRKCPHDRLI--LWEKPKRKPKDISKEDFYSLP 291 >UniRef50_A3ZZQ0 Putative uncharacterized protein n=3 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZQ0_9PLAN Length = 457 Score = 149 bits (375), Expect = 1e-34, Method: Composition-based stats. Identities = 50/282 (17%), Positives = 92/282 (32%), Gaps = 17/282 (6%) Query: 27 HLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLS-ADGEA 80 LP+ + + R R +V+WM V + VV RLN Sbjct: 27 LLPSGVVAAICHEIG-FSFRERIYSPMIVVWMFVMQTLSADHSCQQVVTRLNAWRLAQGL 85 Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPD 140 ++ QAR+R+ A + L TA+ + G ++ +DG D Sbjct: 86 SRCSGDTTSYCQARRRLPIALFQRLLAWTARKCDEAGLGDWRYQGREVIIVDGTTVTMAD 145 Query: 141 KPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSML 197 + + K +P+ R+V + +L + Y ET L ++L Sbjct: 146 TRANQTAFPQIEN-QKPGCGFPLARIVQVFSLATGAATMFAMGRYAGKETGETSLLRTLL 204 Query: 198 ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKR 257 + I L D+ + S LL + +G + + I G + Sbjct: 205 SQFHSGEIVLADRYYASFWLLALSDLRGIDIVARAHHRRKIDFR-----RGLRQGDCDQI 259 Query: 258 LEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 + + + ++T P S+ + RY V + Sbjct: 260 VGYAKPQ-RPTWMTTDEYDQYPSSILVRHLRYEVTQRGFRTR 300 >UniRef50_B8FEP3 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FEP3_DESAA Length = 422 Score = 147 bits (371), Expect = 4e-34, Method: Composition-based stats. Identities = 49/273 (17%), Positives = 99/273 (36%), Gaps = 18/273 (6%) Query: 25 AEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNE------PITDVVRRLNLSADG 78 LP + Q + R R L +V+ +++ +V+ R +A Sbjct: 2 ERFLPADKSQSQAPFKSKDFSRNRILTLPVVLALILNMVRPGKRVGYDEVLARFFAAASL 61 Query: 79 EAG--MNLLARSAVTQARQRVGAAPVEWLFRQTAQDR--GAERYLKDDWHGLQLFAIDGA 134 G + +SA +AR++V + L+ + + A + W G ++ AIDG Sbjct: 62 MNGQNITPPDKSAFCRARKKVPFEALTELYGKALEHAKDLAAKAPGTTWRGRRVLAIDGT 121 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 + P EL + +G + +P L ++ + + L+ Y+ E LA Sbjct: 122 KIMLPRTKELLDAFGKCS-----HGWFPQTHACVLYDVLAGLPLDVAWGHYKSGERGLAR 176 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTI 254 M I + D+ F L L ++G + +++ + + + Sbjct: 177 DMFDGFLPGDILVLDRGFPGFAFFLDLMEQGID--FIVRLRGDGQFAALRPFLQENRRDQ 234 Query: 255 PKRLEHLRGALEVVFITKRPRPSRPRSVKISKT 287 + R A+E +P P P +++ K Sbjct: 235 IIEIPPTRVAIEEYARQGKPAPG-PVTLRFVKV 266 >UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XCY0_9BACT Length = 481 Score = 147 bits (371), Expect = 4e-34, Method: Composition-based stats. Identities = 54/290 (18%), Positives = 102/290 (35%), Gaps = 27/290 (9%) Query: 10 FSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEP----- 64 ++ PL P LFA +P + + A R R W + Sbjct: 23 LAEQPL--PQLEALFAPFIPEQLLSRA-----GANSRERFYTLRQTFWAFLWQALHPGTA 75 Query: 65 ITDVVRRL--NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDD 122 +VVR+L + A +A +ARQR+ ++ + + Sbjct: 76 CREVVRQLLSDWQAQAGRTRAQAGTAAYCRARQRLPLERLQAILQ------ATLGPEPPR 129 Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 W G + +DG F PD ++ + + K +P +++VAL +L S + LN Sbjct: 130 WRGHAVKLVDGTTFSLPDTAANQKKFPQSGA-QKPGCGFPTLKVVALFSLASGLALNWAR 188 Query: 183 APYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEM 242 R E L + + + + + D+ F S L L +G + + L K + Sbjct: 189 GSLRVHEIPLFRKLWSGLRRRDLIIGDRGFSSYTNLALLLGRGVDCLFRLHQGKKVRH-- 246 Query: 243 IELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVK 292 S ++L + ++ ++P RP+ + V+ Sbjct: 247 ----PRRSRLQRKQKLGPRQWLVQWKKPYQKPEYMRPKEWAAVPSEMQVR 292 >UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales RepID=A2RJ55_LACLM Length = 439 Score = 146 bits (369), Expect = 7e-34, Method: Composition-based stats. Identities = 48/304 (15%), Positives = 105/304 (34%), Gaps = 38/304 (12%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVR---------RRRLPGDMVIWMVVQNEPITDV 68 P + Q+ + + ++ H + R+L + I +++ + Sbjct: 8 PSTLQV-SHQIKKNLEDQIHEITNHPEIYAQSPFDFSRNRKLSFETTIKIILSFGGQSLS 66 Query: 69 VRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQL 128 L+ + SA+ QAR ++ E LF +T + G ++ Sbjct: 67 SELLS---HFNFTLKTPTASALVQARSKIKLKAFEQLFYRTI----PSAQPNKLYKGYRI 119 Query: 129 FAIDGAQFRTP-DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ 187 FA DG+ P ++ E +Y + + L AL + + + +Q Sbjct: 120 FAHDGSDLNIPYNEKESDTHYRVGKFGKHVGS----LHLNALYDPLNKHYVAVDFQKIKQ 175 Query: 188 -SETVLAHSMLATIPDNS--ITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 +E ++ S I + D+ + S ++ + + G +L+ A ++ ++ Sbjct: 176 LNERKSLCQIVDDFDFTSPTIIIADRGYESFNVYEHIKKSGQK--FLIRAKDTKSNGLLN 233 Query: 245 LGNTASPGTIPKRLE---------HLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSA 295 + S GT K++ ++ F+ KR + SK YP+ Sbjct: 234 GLDLPSDGTFDKKITLQLTRRQTNKVKKDKHYHFLHKR--ANFDYLPIRSKETYPISLRV 291 Query: 296 APLK 299 +K Sbjct: 292 VRIK 295 >UniRef50_A4BL98 Putative uncharacterized protein n=5 Tax=Nitrococcus mobilis Nb-231 RepID=A4BL98_9GAMM Length = 426 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 80/205 (39%), Gaps = 10/205 (4%) Query: 46 RRRRLPGDMVIWMVVQNEPITD------VVRRLNLSADGEAGMNLLARSAVTQARQRVGA 99 R R +V+ + D V R L+ N + +ARQR+ Sbjct: 13 RDRIFTPLVVLKAFLFQVLSQDGSCKHAVARVLSERLQSGQSANSINTGPYCKARQRLPR 72 Query: 100 APVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQN 159 AP+E R++ Q W G ++ DG PD + + + + Sbjct: 73 APLENAVRESGQTLHQRAPSAWGWRGHRVVLADGTTALMPDTLDNQREFPQQGN-QQPGL 131 Query: 160 AYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSMLATIPDNSITLFDKLFYSED 216 +P++R+VAL++LG+ +L+ PY+ E+ L ++L T+ + L D+ + + Sbjct: 132 GFPIVRIVALISLGAGAVLDYALGPYQGKGSGESSLFSTLLHTLQPGDLLLADRYYCTYA 191 Query: 217 LLLTLNQKGCNRHWLLPAWKNIASE 241 ++ L G + A + Sbjct: 192 IMALLVHHGVQGLFQKHAQRKPHWH 216 >UniRef50_UPI00016C3BAD transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3BAD Length = 258 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 71/205 (34%), Gaps = 8/205 (3%) Query: 43 ATVRRRRLPGDMVIWMVVQNEPITD----VVRRLNLSADGEAGMNLLARSAVTQARQRVG 98 AT +W + T + L A A +AR ++ Sbjct: 47 ATEGHHVWTPARTLWTFLTQCLSTSTSCAAAAAVALRVTLGLHPCSEATGAYCKARAKLP 106 Query: 99 AAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQ 158 A + L Q+ + + W G ++ DG PD P + Y T+ KR Sbjct: 107 VALLSRLATQSGDELERHAPKEWQWKGRRVLLGDGTTLSGPDTPANQAAYPQH-TNQKRG 165 Query: 159 NAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSMLATIPDNSITLFDKLFYSE 215 +P++R+V L+ + L+ A P + E L +L + + D+ + S Sbjct: 166 LGFPLIRVVVLLGFATGALVGAAIGPAKGKEAGEMALLRELLDRFQAGDVFVADRAYCSY 225 Query: 216 DLLLTLNQKGCNRHWLLPAWKNIAS 240 L+ L +G + L ++ Sbjct: 226 WLVSALQARGVDVAIRLHQSRHYDF 250 >UniRef50_B0CC46 Transposase, IS4 family, putative n=9 Tax=Cyanobacteria RepID=B0CC46_ACAM1 Length = 482 Score = 143 bits (360), Expect = 8e-33, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 81/224 (36%), Gaps = 10/224 (4%) Query: 25 AEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSADGE 79 + LP ++ L A + R R + +W ++ ++ + + V+ + Sbjct: 38 TDILPASRLEELLKEEA-FSYRNRIYSPIVTLWAMLYQVLSADKSLRNTVKCITTWL-TA 95 Query: 80 AGMNLLA--RSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFR 137 AG+ + A ++AR R + ++ L ++A+ + W G + DG Sbjct: 96 AGIQPPSSDTGAYSKARSRFPESLLQRLIPESAECLAQPLSPEHLWCGRPVKVYDGTTVL 155 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 D + Y T +P+ RLV L + + +A A + SE V++ + Sbjct: 156 MADSAANQASYPQHGNQT-AGCGFPIARLVVFFCLVTGAVASACIASWDTSEIVMSRLLY 214 Query: 198 ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 + + + D+ + S L + Q + + Sbjct: 215 QDLEVGDVVMADQAYGSYVDLAIIQQHRADGVLRKHHARKTDFR 258 >UniRef50_A6UXI0 Protein containing transposase DDE domain n=4 Tax=Gammaproteobacteria RepID=A6UXI0_PSEA7 Length = 423 Score = 143 bits (359), Expect = 9e-33, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 82/213 (38%), Gaps = 11/213 (5%) Query: 45 VRRRRLPGDMVIWMVVQN---EPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAP 101 RRR+L ++ ++ T++ + + ++ A +AR+++ Sbjct: 28 TRRRQLTFKNLVLFLLNQPRTALQTELDQFYRVLNQASTETQMVTAQAFCKARKKLNPEV 87 Query: 102 VEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAY 161 E L R Q + W GL++ A+DG+ P + + ++GS + + Sbjct: 88 FESLNRLLQQQIDCFGLRQK-WRGLRVLAVDGSTVHLPLESTMATFFGS-------HSGF 139 Query: 162 PVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTL 221 P+ RL L + L+++ P E AH L +P +S+TLFD+ + L Sbjct: 140 PMARLSTLYEVADGQTLHSLIVPLTVGERDCAHLHLEHLPADSLTLFDRGYPGHWLFALF 199 Query: 222 NQKGCNRHWLLPAWKNIASEMIELGNTASPGTI 254 Q+ + LP N + + Sbjct: 200 AQQQRHFLMRLPCGYNAQVKAFLHSGQVEDTQL 232 >UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A1U3_PELCD Length = 489 Score = 143 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 52/299 (17%), Positives = 103/299 (34%), Gaps = 34/299 (11%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQN-----EPITDVVRRLNLSA 76 ++F + +P ++ + A RRR + W +V+R+L Sbjct: 44 EVFEKFIPLALLKP---ELSGAMSRRRLFSKENTFWAFFSQVLDADGGCKEVIRKLQSY- 99 Query: 77 DGEAGMNLLARS--AVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGA 134 G+ + + S + AR+++ + + TA+ + ++ DG Sbjct: 100 ASIKGIKVPSSSTASYCTARKKLAEPMLADILAHTAEQLEKMPATGML-NNRRVIVADGT 158 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 PD PE + + ++ K +P R+ A +L S LL+ + +E L Sbjct: 159 GVSMPDTPENQAAWPQSSAL-KPGCGFPSARICACFSLDSGALLSYAIGNKKNNELPLFR 217 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW----LLPAWKNIASEMI------- 243 T I L DK F S + L +G + P + + + Sbjct: 218 QQWETFNPGDIFLGDKGFCSYFDIAKLQDRGVDSVVTLAKRAPVRAASSLKKLGPDDLLI 277 Query: 244 --------ELGNTASP--GTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVK 292 ++ + + +PK+L + ++V R R + I RYP + Sbjct: 278 TWERPKYAQILSYSKDAWANLPKKLTLRQIKVKVPHPGFRTRGFYIVTTLIDAARYPAE 336 >UniRef50_Q12AI7 Transposase, IS4 family n=3 Tax=Proteobacteria RepID=Q12AI7_POLSJ Length = 458 Score = 143 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 47/230 (20%), Positives = 84/230 (36%), Gaps = 12/230 (5%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD--VVRRLNLSADG 78 + F E ++ T + R R P + + M ++ D + +N A Sbjct: 26 VEFFNVLTSPELLET--TEALLPEHRERLYPPTVALSMFMRQVLEADGSCQKAVNGWAAQ 83 Query: 79 EA----GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGA 134 A + +ARQR+ V L R+T + + + W G + +DG Sbjct: 84 RAADGLRPCSVRTGGYCRARQRLPLEMVGTLTRETGRLLHEKALAQWLWRGRAVKLVDGT 143 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY---RQSETV 191 PD PE +E Y +T +P+ RLV ++ L + L+ P+ E Sbjct: 144 GISMPDTPENQERYPQPSTQA-PGVGFPLARLVMVICLATGAALDMAVGPHSGKGSGELG 202 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 L +LA + L D L+ + L+ +L G + + + Sbjct: 203 LVRRLLAGFCPGDVMLADALYCNYFLIASLMAAGVDVLFEQNGSRITDFR 252 >UniRef50_UPI00016C37A0 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C37A0 Length = 334 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 72/234 (30%), Gaps = 28/234 (11%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNE--------------PITDVV 69 FA+ LP I+ + R + W + VV Sbjct: 16 FADALPESSIEPAIQEHGGG-WRDEVFTPVVTPWAFLTQVICPVGCCRLAVARVLAWLVV 74 Query: 70 RRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLF 129 R G G A + L R T + W+G ++ Sbjct: 75 RGEPPCGPGTGGYCKPAPGC---------PRAIPQLARHTGRGLHDRAPGNWRWNGRRVL 125 Query: 130 AIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR--- 186 DG PD P+ + Y + +P +RLVAL L +L+A P R Sbjct: 126 IADGTTVTMPDTPKNQNEYPHPGSQAD-GIGFPQIRLVALFCLACGAVLDAALGPSRGKQ 184 Query: 187 QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 ET L + ++ ++ L D+ F L+ ++G + + + Sbjct: 185 SGETALRRQIAGSVGSGTVLLADRYFGGWFDLVLWRERGIDVVTRIHQKRATDF 238 >UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipelotrichaceae RepID=B7CEB8_9FIRM Length = 431 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 37/256 (14%), Positives = 81/256 (31%), Gaps = 26/256 (10%) Query: 42 HATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAP 101 R+R+LP + +I ++Q + + L L SA+ Q R ++ + Sbjct: 30 SDFTRKRKLPVETLIHFIIQMQSKS---LNSELCEYFNDIDFLPTASALCQQRDKLDISA 86 Query: 102 VEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAY 161 + + W G + A DG+ + + + Sbjct: 87 FQRIMHLFVNAF----DDYKTWKGYHVLACDGSDVNIAYDEKDED----TKRQNGNNKPF 138 Query: 162 PVMRLVALMNLGSHILLNAVTAPY-RQSETVLAHSML--ATIPDNSITLFDKLFYSEDLL 218 + L + +H+ + + E M+ P+NSI D+ + +L+ Sbjct: 139 SQFHINGLYDCINHVFWDTSIDTANKTRECAALMEMIMKHDYPENSIITADRGYEKYNLI 198 Query: 219 LTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSR 278 + + + ++ G+ S +P L+V I R + + Sbjct: 199 ACCIENNQKFVFRIK-------DIDVFGSILSNLNLPDE----EFDLDVTKILTRKQTNE 247 Query: 279 PRSVKISKTRYPVKHS 294 ++ K K + S Sbjct: 248 TKANK-HKYTFISNKS 262 >UniRef50_Q82R32 Putative IS4 family ISFsp5-like transposase n=1 Tax=Streptomyces avermitilis RepID=Q82R32_STRAW Length = 262 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 73/197 (37%), Gaps = 12/197 (6%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLS 75 P + +P + + L + R R LP + ++ ++ +V RL Sbjct: 6 FAPGHLGELTQVIPFDLVDAVLDETRCVQRRLRDLPSRVGVYFLLAMCLFPEVGYRLVWH 65 Query: 76 --ADGEAGMNL----LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLF 129 G+ A+ R+R+GA P++ +F A + ++ Sbjct: 66 KLTAALTGVGFEVAEPTAKALRDLRRRLGAEPMKRVFETLAGPLAQPVTPGVRFGPFRMA 125 Query: 130 AIDG-AQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQS 188 + DG + + PD E++G + YP++ L+ L+ G+ L+ AV Sbjct: 126 SFDGCSSIKLPDTERNVEWFG-----PGSRGGYPMLELMTLVETGTRALIGAVFGTPSDG 180 Query: 189 ETVLAHSMLATIPDNSI 205 ET A +L + + Sbjct: 181 ETSYARRLLHHLGPGML 197 >UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=A7B831_RUMGN Length = 366 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 39/248 (15%), Positives = 90/248 (36%), Gaps = 16/248 (6%) Query: 46 RRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWL 105 R R+L I ++ E + L+ + ++ + SA Q R ++ ++L Sbjct: 44 RNRKLDFVSTIQFLLSMESGSLKKELLDY---FQFSVDTPSASAFCQQRNKLLLEAFQFL 100 Query: 106 FRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMR 165 F + E + QL A DG+ P Y + + + + + Sbjct: 101 FYEFNSCFSFE----KKYKDYQLLACDGSDLNIARNPNDAGTYFQSQPTDR---GFNQIH 153 Query: 166 LVALMNLGSHILLNAVTAPYR-QSETVLAHSMLATIPD--NSITLFDKLFYSEDLLLTLN 222 L AL +L ++ V P R ++E++ M+ +I + D+ + + ++ + Sbjct: 154 LNALFDLCEKRYIDLVIQPARLENESLAMTQMIDRYKGEKKTIFIADRGYETYNIFAHVQ 213 Query: 223 QKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSV 282 +KG ++L+ M + ++ + + + +P+ + + Sbjct: 214 EKG--MYYLIRVKDGGGGSMTGSFDLPDENEFDHDMQLILTRKQTKDVKAKPKKFKFIA- 270 Query: 283 KISKTRYP 290 K S Y Sbjct: 271 KSSPFDYL 278 >UniRef50_A6CHG0 Transposase of IS5377-like element n=2 Tax=Bacillus sp. SG-1 RepID=A6CHG0_9BACI Length = 381 Score = 136 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 35/264 (13%), Positives = 79/264 (29%), Gaps = 16/264 (6%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAG 81 ++ + E +++ + R+ D+V + V+ + R E Sbjct: 9 EVLQTFITDEEVENLCEKWGYRDTARKFSAKDLVRFFVISSAKDWKSFRDAETKIPQEDS 68 Query: 82 MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDK 141 + + S + + Q V ++ LF + G + + +LFA+D Sbjct: 69 LPSVDHSTLAKKAQNVPYQILQELFSRLVNRLG-RGMRRALFKPYKLFAVDSTTITFQHP 127 Query: 142 PELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP 201 Y + +RL ++ + R + ++A + Sbjct: 128 DMSWAGYTRTRHA---------IRLHTKFDVEEGQPTQVIPTTGRHHDVMVAPKLYEDTE 178 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHL 261 SI D+ + L + + + +++ EM G + + L Sbjct: 179 PLSIITADRGYARTRDFEDLQEDNQFFVIRIASSFSLSEEMEHSVPLDEDGNVKEDLTAF 238 Query: 262 RGALEVVFITKRPRPSRPRSVKIS 285 G R +R R V + Sbjct: 239 IGK------NSRKTKNRFRVVTFT 256 >UniRef50_P12249 Transposase for insertion sequence element IS231A n=411 Tax=Bacillus RepID=T231A_BACTB Length = 478 Score = 134 bits (338), Expect = 3e-30, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 98/280 (35%), Gaps = 24/280 (8%) Query: 15 LMPPPSAQLFAEHLPTEWIQHCLTLSAH---ATVRRRRLP----GDMVIWMVVQNEPITD 67 L QLF+E L L A R+R+ + IW + +D Sbjct: 3 LSIQDELQLFSEELCRHLTPSFLEELAKKLGFVKRKRKFSGSELATICIW--ISQRTASD 60 Query: 68 VVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYL---KDDWH 124 + RL G L++ + + + ++++F + + + H Sbjct: 61 SLVRLCSQLHAATG-TLMSPEGLNKRFDKKAVEFLKYIFSILWKGKLCKTSAISSTALTH 119 Query: 125 GLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAP 184 ++ +D F+ P L Y + + +++ +L S LN P Sbjct: 120 FQRIRILDATIFQIP--KHLASIYPGSGGCAQTAG----IKIQLEYDLHSGQFLNFQVGP 173 Query: 185 YRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN-IASEMI 243 + ++ L T+ + + D ++S + L ++Q+G +++ N Sbjct: 174 GKNNDKTFGTECLDTLRPGDLCIRDLGYFSLEDLDQMDQRGA--YYISRLKLNHTVYIKN 231 Query: 244 ELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVK 283 GT+ K+ ++++ LE I +P + +K Sbjct: 232 PSPEYFRNGTVKKQSQYIQVDLE--HIMNHLKPGQTYEIK 269 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 51/284 (17%), Positives = 91/284 (32%), Gaps = 37/284 (13%) Query: 37 LTLSAHATVRRRRLPGDMVIWMVV---QNEPITDVVRRLNL--SADGEAGMNLLARSAVT 91 + R R+LP + VI ++ ++ R + + SA+ Sbjct: 27 VKRPGKDFSRNRKLPFEEVIRFLLPLQGQCMDQELFRHFSKKPLFFSTDYSGIPHSSAMI 86 Query: 92 QARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSA 151 QARQ++ + + LF + + G QL AIDG+QF P E + Sbjct: 87 QARQKLSDSAMPALFHSFTETC----KKGALFQGYQLLAIDGSQFSVP---ENLKEPLCW 139 Query: 152 NTSTKRQNAYPVMRLVALMNLGSHILLNAVTAP-YRQSETVLAHSMLATIPDN--SITLF 208 V+ L A+ +L S I + V P +E M+ +I + Sbjct: 140 RKIPNISKGRNVIHLNAMYHLQSGIFEDVVFQPICECNEHKALAQMVDRRSSAFPAIFMA 199 Query: 209 DKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEH-LRGALEV 267 D+ + S + + QKG S +P E+ + L + Sbjct: 200 DRGYESYNTFAHIEQKGDKYVVRGR---------ESGTGICSGLNLPDTEEYDIEKELYI 250 Query: 268 VFITKRPRPSRPRSVKISKTR------------YPVKHSAAPLK 299 + + PR K ++ Y + +K Sbjct: 251 CKKHSKKVKTNPRKYKRIRSDATFDFFTDDCEEYRLNLRIVKIK 294 >UniRef50_UPI00016C48B0 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C48B0 Length = 202 Score = 131 bits (328), Expect = 4e-29, Method: Composition-based stats. Identities = 40/183 (21%), Positives = 70/183 (38%), Gaps = 2/183 (1%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPI-TDVVRRLNLSADGE 79 + LP E + +T S+ + RL ++W VV D R++ + Sbjct: 20 TAALKQLLPRELMAEVVTESSLPSNFCCRLLNWFMLWFVVGIGLFSRDSYRQVFKWLNPF 79 Query: 80 AGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTP 139 RS + AR R+G APV L + H +L +DG Sbjct: 80 RPKGTPERSTLCMARVRLGVAPVRRLQERVTALLATRATPGAFHHQYRLMGLDGFAADLA 139 Query: 140 DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 D +G + +P R+++L LG+H+L ++ P R+ E +A ++L Sbjct: 140 DSAANTRAFGHPGSGRATGA-FPQARVLSLCELGTHVLWRSLIKPCRRGEVTMAPALLRH 198 Query: 200 IPD 202 + Sbjct: 199 LTS 201 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 130 bits (327), Expect = 5e-29, Method: Composition-based stats. Identities = 43/249 (17%), Positives = 91/249 (36%), Gaps = 18/249 (7%) Query: 46 RRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWL 105 R+++ + V+ ++ E R L E + S+ Q R ++ E+L Sbjct: 35 RKKKWSFEEVMKFMLTMEGK---ALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFL 91 Query: 106 FRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMR 165 F++ + ++GL+L A DG+ P+ Y K Y ++ Sbjct: 92 FQEFTKSF----TDNVTYNGLRLIACDGSDLCIAHNPQDETTYFQTLPDRK---GYNLLH 144 Query: 166 LVALMNLGSHILLNAVTAPYR-QSETVLAHSMLATIPDNS-ITLFDKLFYSEDLLLTLNQ 223 L A +L S +A+ P R +E M+ D S I + D+ + + ++ + Sbjct: 145 LNAFYDLCSRQYTDAIIQPSRLANERRAMCEMIDRYNDTSAIFIADRGYENYNIFAHVEH 204 Query: 224 KGCNRHWLLPAWKNIASEMIELGNTASP-GTIPKRLEHLRGALEVVFITKRPRPSRPRSV 282 KG ++L+ ++ + G + + + + + P + R + Sbjct: 205 KG--MYYLIRVKDITSNGITSKLTMLPESGEFDEWVNVTLTKKQTNEV--KANPKKYRVI 260 Query: 283 -KISKTRYP 290 K + Y Sbjct: 261 DKKTPFDYL 269 >UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostridium sp. SS2/1 RepID=B0NXD2_9CLOT Length = 439 Score = 130 bits (327), Expect = 5e-29, Method: Composition-based stats. Identities = 46/262 (17%), Positives = 90/262 (34%), Gaps = 24/262 (9%) Query: 34 QHCLTLSAHATVRRRRLPGDMVIWMVVQ--NEPITDVVRRLNLSADGEAGMNLLARSAVT 91 + R+R+L I +V I ++R SA Sbjct: 25 DSYVKNPEKDFTRKRKLSFQDTINTIVTYDAGSIGRCIKRYIPKV-----EKTPTTSAFL 79 Query: 92 QARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTP-DKPELREYYGS 150 Q ++++ + + LF + + L + ++DG P D+ + Y Sbjct: 80 QQQKKLKLSAFQTLFYRFNDPF-----PDKTLYHLHILSVDGTGVTVPMDRINENKEYAR 134 Query: 151 ANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ-SETVLAHSMLAT--IPDNSITL 207 T+ + + +L + +A P+R SET + ML P ++ + Sbjct: 135 VRTNKDCTRPAYQFHVSCIYDLINERYCDAYIEPFRTHSETHVFSVMLERKNFPQKALFI 194 Query: 208 FDKLFYSEDLLLTLNQKGCNRHWLLPAW-KNIASEMIELGNTASPGTIPKRLEHLRGALE 266 D+ + S L+ + G ++L+ A MI+ GT K + ++ + Sbjct: 195 ADRGYESYLLMAQIQHDGN--YFLIRAREDFGQGSMIKGYPFPRDGTFDKTVTYIYTKTQ 252 Query: 267 VVFITKRPRPSRPRSVKISKTR 288 KR + + P K TR Sbjct: 253 ----NKRTKAN-PELYKRVATR 269 >UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostridiales RepID=C7GFW6_9FIRM Length = 436 Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats. Identities = 40/261 (15%), Positives = 98/261 (37%), Gaps = 25/261 (9%) Query: 33 IQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQ 92 + + +R+R+L ++ +++ E + L E ++ SA Q Sbjct: 22 LSSFVKNPDKDFIRKRKLDFKKMMHLIISMESGS---LNHELLKFFEYDSSVPTGSAFYQ 78 Query: 93 ARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHG-LQLFAIDGAQFRTPDKPELREYYGSA 151 R ++ + L ++ + + G L A DG++F + + + Sbjct: 79 QRSKLSVSAFRHLLKEFNLKF-----PLEKFRGKYYLIACDGSEFNIARNLKDADTFHEP 133 Query: 152 NTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR-QSETVLAHSMLATI--PDNSITLF 208 N K + + ++ ++L + S L+ P R ++E +++ + I + Sbjct: 134 NG--KSVSGFNMVHTISLYEVCSKRYLDLEVQPGRLKNEFQAICNLMDRYAYGASPIFIA 191 Query: 209 DKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVV 268 D+ F S ++ + + +L+ A + + GT+P +L+ +E++ Sbjct: 192 DRGFSSYNVFAHAIENNVD--FLIRAKD------LNVQRFLGGGTLPDKLD---TTIELI 240 Query: 269 FITKRPRPSRPRSVKISKTRY 289 + + K S+ RY Sbjct: 241 LTRTQSKKKHKHPEKESQYRY 261 >UniRef50_B6FVR6 Putative uncharacterized protein (Fragment) n=2 Tax=Clostridium nexile DSM 1787 RepID=B6FVR6_9CLOT Length = 286 Score = 129 bits (323), Expect = 2e-28, Method: Composition-based stats. Identities = 45/230 (19%), Positives = 84/230 (36%), Gaps = 34/230 (14%) Query: 72 LNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 + E SA Q R+++ +E LF + + + +L AI Sbjct: 10 HEIGEFFEYRKGFPTVSAFVQQRKKLSYTALEHLFYRFNE---CTFKKPVLYKNYRLLAI 66 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY-RQSET 190 DG+ F P Y S + N + + L AL ++ S L+ + ++ET Sbjct: 67 DGSDFSLP--------YNSQEDNVMGDNHFSTLHLNALFDVCSKSFLDVIVQKGLHENET 118 Query: 191 VLAHSMLATIPDN--SITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNT 248 A ++ I + I + D+ + + +L + ++ + + N Sbjct: 119 GAACELVDRISEKHPVIIMADRGYENYNLFAHIEERLFDYVVRVRDSDNSC--------M 170 Query: 249 ASPGTIPKRLEHLRGALEVVFITKRPRPSR----PRSVKISKTRYPVKHS 294 S +PK +E+ ITKR +R P ++ K +Y K S Sbjct: 171 VSGLNLPKTVEY--------DITKRVVLTRHFSGPAAINTEKYKYLSKKS 212 >UniRef50_Q7BLZ8 Putative uncharacterized protein (Fragment) n=1 Tax=Streptomyces rishiriensis RepID=Q7BLZ8_9ACTO Length = 341 Score = 127 bits (319), Expect = 5e-28, Method: Composition-based stats. Identities = 37/150 (24%), Positives = 59/150 (39%), Gaps = 2/150 (1%) Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTK-RQNAYPVMRLVALMNLGSHILLNAV 181 + G +L A+DG F PD ++G S ++AYP +RL AL G+H + A Sbjct: 1 YRGWRLVAVDGTTFDVPDTEANAAFFGRPGVSRGQEKSAYPQVRLAALAECGTHAVFAAE 60 Query: 182 TAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 P ET LA + ++ + L D+ F DL G + W + + Sbjct: 61 AGPLAVHETELAQRLFGSLTPGMLLLADRGFRGFDLWRAAAATGADLLWRVKNDAVLPVR 120 Query: 242 -MIELGNTASPGTIPKRLEHLRGALEVVFI 270 ++E G+ S + V I Sbjct: 121 TLLEDGSYLSEIVAARDKNRRADPARVRVI 150 >UniRef50_A6CCZ3 Transposase, IS4 (Fragment) n=7 Tax=Planctomyces maris DSM 8797 RepID=A6CCZ3_9PLAN Length = 531 Score = 127 bits (318), Expect = 5e-28, Method: Composition-based stats. Identities = 51/330 (15%), Positives = 107/330 (32%), Gaps = 47/330 (14%) Query: 3 LLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQN 62 ++ S PL Q + + I + +W ++ Sbjct: 56 FKRSMMQESSLPLADVLDDQRWQQVFDEHEID-------FGNDPDAIYTPAITLWALISQ 108 Query: 63 EPITD--------VVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRG 114 + V+R + A + A +AR ++ + + +Q A D Sbjct: 109 VFFSGEQRSCKAAVIRVASFWAALGRRVCSTNTGAYCRARLKLSFTAIREIVQQLAADAE 168 Query: 115 AE---------------------RYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANT 153 A +K G ++ +DG D PE + Y N Sbjct: 169 AACDQNCVQSQEQSAARLSPSNVADVKSRSTGGRILLVDGFTITAADTPENQRAYPQ-NP 227 Query: 154 STKRQNAYPVMRLVALMNLGSHILLNAVTAPYR---QSETVLAHSMLATIPDNSITLFDK 210 + K +PV+R V+L+++ + +L++ V+ PY ET L ML + + D Sbjct: 228 AQKPGLGFPVLRCVSLISMTTGLLVDLVSGPYSGKGSGETALLWQMLDVLRPGDTLVADS 287 Query: 211 LFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKR-LEHLRGALEVVF 269 + + L+ + +G ++ + TA +R + LR + + Sbjct: 288 YYCTYWLVSACHARGVQILMKNHHLRD------DHPQTARRLNKRERLVTWLRPPVRPAW 341 Query: 270 ITKRPRPSRPRSVKISKTRYPVKHSAAPLK 299 + ++ +P ++ + V K Sbjct: 342 MARQEYRRQPLTLTLRLVDVQVSQPGCRTK 371 >UniRef50_Q3SHG4 Putative uncharacterized protein n=1 Tax=Thiobacillus denitrificans ATCC 25259 RepID=Q3SHG4_THIDA Length = 255 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 8/164 (4%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD------VVRRLNLS 75 +F + LPTE I + SA R R P + ++ D V RRL+ Sbjct: 53 GVFEQVLPTEEIMGTIEESAPV-FRHRHYPPLTTLRHFIEQVLSEDQACQDVVGRRLSER 111 Query: 76 ADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQ 135 L SA QARQR+ V+ L+R T + W G +L DG Sbjct: 112 VGQRQSTCSLNTSAYCQARQRLPQEMVDRLYRTTGERLETRLPKSWRWRGRRLVLFDGTT 171 Query: 136 FRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLN 179 PD + + + + +PV RL L+ L S +L Sbjct: 172 VSMPDTLASQCAFPQ-SAEQQPGLGFPVARLSGLIGLASGAVLG 214 >UniRef50_Q7TTE4 Putative uncharacterized protein n=9 Tax=Planctomycetaceae RepID=Q7TTE4_RHOBA Length = 457 Score = 123 bits (309), Expect = 6e-27, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 95/273 (34%), Gaps = 25/273 (9%) Query: 20 SAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQ-NEPITDVVRRLNLSADG 78 S +L + + E AH TV + M+++ ++ + + + V+ L + Sbjct: 17 SFELLQQLVNFEDANKLFEQQAH-TVYTACVVLWMLVYQRLKPDASLENAVKHLLDTRPT 75 Query: 79 --------EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 E +A ++AR R+ V W + + + D ++F Sbjct: 76 YLPENKRLEDNTLSVATGGYSRARSRLPLEVVRWFAEEVSSGILSATEPAVD--EQRVFL 133 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY----R 186 IDG + EL++ + A+ +P + L L S + P Sbjct: 134 IDGTTLALAPEKELQQAFPPASNQLGEG-VWPCVLLTVFHELASGAAMLPQVGPMYGPEA 192 Query: 187 QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELG 246 SET LA +P+NSI + D F + G + + + + Sbjct: 193 ISETQLARQGFEQLPENSIIMSDAGFGIFGIAHGAIDAGHDILLRM--------KKVNFQ 244 Query: 247 NTASPGTIPKRLEHLRGALEVVFITKRPRPSRP 279 + + ++ EH + TK+ R ++P Sbjct: 245 SLQKDAELIEQSEHHKTYRHTWKPTKKNRQTQP 277 >UniRef50_UPI00016C5887 hypothetical protein GobsU_05723 n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5887 Length = 321 Score = 123 bits (308), Expect = 8e-27, Method: Composition-based stats. Identities = 47/224 (20%), Positives = 73/224 (32%), Gaps = 17/224 (7%) Query: 23 LFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLSAD 77 +F + L + I R +V+W++V N + D V Sbjct: 1 MFQQLLARDVIDAL-----APPPARAVYTPWVVLWLLVYQRLHGNGSLGDAVSHFLTQFP 55 Query: 78 GEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFR 137 A A AR R+ A V R+ A W G ++F +DG R Sbjct: 56 SAAEQPSGATGGYRHARTRLPNAVVATAGRRVFDTLVAAYPPS--WRGRRVFMMDGTTLR 113 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHIL----LNAVTAPYRQSETVLA 193 LR + A+ R + +PVM LV L S + A+ P L Sbjct: 114 LAPTDALRGAFTPASNQHGR-SHWPVMHLVVAHELASGLAAPPQHGAMYGPGAVGAVQLG 172 Query: 194 HSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN 237 ++ +P S+ L D+ F L G + L + Sbjct: 173 LRLMPDLPPGSVILGDRNFGVFGLAHGAVAGGHDAVLRLTQSRF 216 >UniRef50_C6N0W0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N0W0_9GAMM Length = 453 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 83/213 (38%), Gaps = 15/213 (7%) Query: 31 EWIQHCLTLSAHA-TVRRRRLPGDMVIWMVV------QNEPITDVVRRLNLSADGEA-GM 82 I+ R+R + ++ ++ ++ ++ L +++ A Sbjct: 14 AMIEEVCEDFDKVWQTRKRVINTQFLVTFILKLVLSKNSQGYKILLNELWETSEFSALQE 73 Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 ++ S++ +ARQ++ L Q E W ++F +DG++ P + Sbjct: 74 QPVSASSICEARQKMPETIF-TLINQKVLAMREESDTLPLWRNHRVFGVDGSRINVPHEL 132 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD 202 Y + +Q YP + L +LGS ++ + + P + E + S + + Sbjct: 133 LEAGY-----KAPIKQQYYPQGLMSTLYHLGSGLIYDGILEPVK-GERICLLSHMEKLTL 186 Query: 203 NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 + + D+ ++S +L+ ++G + + + Sbjct: 187 GDVLVLDRGYFSYLILVKAIERGIHLICRMQSG 219 >UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RFU1_9FIRM Length = 443 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 38/258 (14%), Positives = 86/258 (33%), Gaps = 18/258 (6%) Query: 36 CLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQ 95 C+ +++ T R R L +I ++ + + + + +++ + SAV+Q R Sbjct: 35 CVDQTSNFT-RSRILTPKTLIKFILGLQAHSLSGEVSDYFTS--SNIDIPSISAVSQRRD 91 Query: 96 RVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTST 155 + + + R+ + +G + A DG+ P + + S Sbjct: 92 LLYPEIFKSINRR----FLSSIDNLSTLNGYYILAQDGSDINLPFWHDDTQI------SY 141 Query: 156 KRQNAYPVMRLVALMNLGSHILLNAVTA-PYRQSETVLAHSMLAT--IPDNSITLFDKLF 212 + + L AL + +H+ + P ++SE + P+NSI D+ + Sbjct: 142 GQDSIVCQYHLNALYDCINHVFWESRIDLPTKKSEKSALIDFINHRNYPENSIITADRGY 201 Query: 213 YSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITK 272 S +L+ + + + + M + T + L+ + K Sbjct: 202 ESYNLIAHCIENNQKFVFRVKDIDTRSGIM--TSISLPDETFDITVTRTLTNLQTNEVKK 259 Query: 273 RPRPSRPRSVKISKTRYP 290 S Y Sbjct: 260 NENNQFVFVPSTSVFDYL 277 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 121 bits (304), Expect = 3e-26, Method: Composition-based stats. Identities = 44/279 (15%), Positives = 90/279 (32%), Gaps = 33/279 (11%) Query: 36 CLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQ 95 L R R++ + + + + T L+ + +N SA TQ R Sbjct: 26 FLKNPDTDFSRNRKINFKTCVGITMNSGGCTLNKELLDF---FDFDVNAPTVSAYTQQRA 82 Query: 96 RVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTST 155 ++ E+LF ++ K+ + G QL A DG+ E +N Sbjct: 83 KILPEAFEYLFHAFTEENAQT---KNLYEGYQLLACDGSNLTIAPNLNDPETLWKSNQLG 139 Query: 156 KRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ-SETVLAHSMLATIP-DNSITLFDKLFY 213 N + L AL ++ + ++A+ E M+ + D I + D+ + Sbjct: 140 ATGNH---LHLNALYDVLNRTYIDALVQTASTYQEHRACIQMIERVTLDKVILIADRGYE 196 Query: 214 SEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKR 273 + +++ +KG + + AS +P + + ++ + Sbjct: 197 NYNIMSHAIEKGWKFLIRIKD--------VHSNGIASGLELP-QTAVFDMDINLILTRNQ 247 Query: 274 PRPSRPRSVKISKT-------------RYPVKHSAAPLK 299 + + K T YP+ A K Sbjct: 248 TKSKKQAGYKFMPTVQTFDYLPIGSKEDYPISFRIARFK 286 >UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JHT2_9FIRM Length = 424 Score = 116 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 88/223 (39%), Gaps = 12/223 (5%) Query: 45 VRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEW 104 R R++P +++ ++ + +T + N G+ +++ + R ++ Sbjct: 19 TRIRKMPLQDLLFTMINRKGLTLALELRNYMKLAHPGV-SISKPGYLKQRMKLNPDAFLE 77 Query: 105 LFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVM 164 L++ ++ A+ + + A DG+ P E + YGSA+ + A + Sbjct: 78 LYKYHNRNFYADST-FSTYKNHLILAADGSDINIPTTTETLKLYGSASRKNTKPQA--QI 134 Query: 165 RLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNS-----ITLFDKLFYSEDLLL 219 L + ++ + ++L + + E LA + IP+ I + D+ + S + Sbjct: 135 GLGCIYDVMNRMILESDCNKVKFDEMRLAEKQMERIPETIGNIPYIIIMDRGYPSTPAFI 194 Query: 220 TLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLR 262 + K + +++ + + + T + + +L+ R Sbjct: 195 HMMDK--DLKFIVRLKSS-DYKKEQSSLTENDQLVKIKLDKSR 234 >UniRef50_A1JS05 Transposase for insertion sequence element IS1665 n=4 Tax=Yersinia enterocolitica subsp. enterocolitica 8081 RepID=A1JS05_YERE8 Length = 261 Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 47/147 (31%), Positives = 70/147 (47%), Gaps = 8/147 (5%) Query: 6 DLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQN--- 62 LD ++L + I CL S T+R+RRLP +M++W +V Sbjct: 5 QALDLVSRYDSLRNPLTTLGDYLYPQLISRCLAESGTVTLRKRRLPLEMMVWCIVGMALE 64 Query: 63 --EPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 EP+ +V RL++ G+ +A SAV QARQR+G+ V +F QTAQ Sbjct: 65 RKEPLHQIVNRLDIMLPGDR--PFVAPSAVIQARQRLGSEAVRRVFSQTAQLWHGSVT-H 121 Query: 121 DDWHGLQLFAIDGAQFRTPDKPELREY 147 W GL L A+DG ++T + E + Sbjct: 122 PHWCGLTLLAVDGVVWQTDNATEQADA 148 >UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q04V25_LEPBJ Length = 423 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 67/206 (32%), Gaps = 14/206 (6%) Query: 42 HATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAP 101 A R R+L ++ ++ + V G + + A + R+ + Sbjct: 9 SAFTRNRQLTLPRLLIAMINLLNKSLAVELYRYF--KNLGKKAVTKQAFSFTRENLNPQV 66 Query: 102 VEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAY 161 E L G + A D P E + +G + Sbjct: 67 FESLNEIFVNSYYKNVTNCKTHKGYIVAACDATGISLPKTKEFVKDFGCVKNQLGESES- 125 Query: 162 PVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNS-------ITLFDKLFYS 214 P + ++ + I+L++ +R SE +A + + S I LFDK + S Sbjct: 126 PNANSSIIFDIYNDIILSSTVGSHRTSERSMALHHIEKLRSISALQNKKLILLFDKGYPS 185 Query: 215 EDLLLTLNQKGCNRHWLLPAWKNIAS 240 +L+ L G H+++ N Sbjct: 186 MELIGKLMANG--IHFIIR--SNTRW 207 >UniRef50_C0ING1 Putative uncharacterized protein n=1 Tax=uncultured bacterium BLR12 RepID=C0ING1_9BACT Length = 337 Score = 113 bits (283), Expect = 6e-24, Method: Composition-based stats. Identities = 36/162 (22%), Positives = 67/162 (41%), Gaps = 4/162 (2%) Query: 116 ERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSH 175 E W+GL+L AIDG+ P + E +G N + V R L ++ + Sbjct: 8 ESAPYLTWNGLRLLAIDGSTAVLPGHKSITEEFGITNFGPYANSPRSVARTSVLYDVLNL 67 Query: 176 ILLNAVTAPYRQSETVLAHSMLATI-PDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPA 234 +L+ Y E LA A + P + LFD+ + S L+ + +G H+L+ Sbjct: 68 TVLDGQIDRYDSCERNLARQHFAQVKPATDLLLFDRGYPSLGLMFEMQAQG--IHYLIRM 125 Query: 235 WKNIASEMIEL-GNTASPGTIPKRLEHLRGALEVVFITKRPR 275 ++ ++ ++ N + + +L L + TK + Sbjct: 126 REDWWLDVRKMLANGETDKEVTFKLPATERDLLNKYATKNDK 167 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 70/228 (30%), Gaps = 16/228 (7%) Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPD 140 + + SA QAR ++ LF + + K +HG +L AIDG++ + Sbjct: 30 SITTPSASAFVQARSKIKPEAFRTLFD----GFNKKTFKKKLYHGYRLLAIDGSELPIDN 85 Query: 141 KP-ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT-APYRQSETVLAHSMLA 198 + T K +AY L A +L + + ++ E ++ Sbjct: 86 TIFDDETTVLRHGTLAKTFSAY---HLNASYDLMERTYDDIIIQGEAKRDEHGAFCQLVD 142 Query: 199 TIPD-NSITLFDKLFYSEDLLLTLNQKGCNRHWLLP-----AWKNIASEMIELGNTASPG 252 +I + D+ + S + + G + + + G Sbjct: 143 RYDGQKAIFIADRGYESYNGFEHVVHSGHKYLIRVRDIESQSSITKSLGPFPDGEFDVDV 202 Query: 253 TIPKRLEHLRGALEVVFITK-RPRPSRPRSVKISKTRYPVKHSAAPLK 299 + L+ + + K P+ R + Y LK Sbjct: 203 SRMLTLKQTKMIKACPDVYKFVPKNMRFDFMNKQNPWYEFNCRVVRLK 250 >UniRef50_UPI00016C385B hypothetical protein GobsU_16554 n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C385B Length = 454 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 48/238 (20%), Positives = 81/238 (34%), Gaps = 25/238 (10%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV-----QNEPITDVVRRLNLS 75 L A H + +++WM+V ++ + V+ L + Sbjct: 16 FDLLAAHFDPSEAD-----ARFPRRANAVYTASVILWMLVYQRTHPDKSLEATVKHLLDA 70 Query: 76 ADGE--------AGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQ 127 S+ + ARQ + W ++ W G + Sbjct: 71 RPDLLPNTKRVRENALSSNTSSYSDARQWLPLEAARWFADHVSR--ARIDGAPPTWSGRR 128 Query: 128 LFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILL----NAVTA 183 +F IDG +PELRE Y T+ + +PV LV L S + A Sbjct: 129 VFLIDGTTRTLAPEPELREKYP-PATNPHGRGVWPVALLVVAHELSSGAAVVPEVGATFG 187 Query: 184 PYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 P+ SET LA +++ +P N + + D F + L +G + L A + A Sbjct: 188 PHAVSETALAGAVMDRLPANGVVMADAGFGIFAVALGARARGLGFVFRLTAARFTAYR 245 >UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=Streptococcus RepID=A4W4J4_STRS2 Length = 440 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 83/272 (30%), Gaps = 42/272 (15%) Query: 46 RRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWL 105 R+ +L + +I ++ T L+L +++SA Q R ++ + L Sbjct: 44 RKSQLTMETMIQAILTMGGNTLAKELLDLDLP-------VSQSAFVQRRYQLKHQAFKAL 96 Query: 106 FRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMR 165 F + L + A+DG+ P + Y ++ Sbjct: 97 F-------ANITSKIPTFKDLPILAVDGSDVVLPRNRSDKTTTFQTGPH---HTPYTLIH 146 Query: 166 LVALMNLGSHILLNAVTAPYRQ-SETVLAHSMLATIP-DNSITLFDKLFYSEDLLLTLNQ 223 + AL NL I + R+ E M+ + P + ++ + D+ + S +++ + Sbjct: 147 INALYNLEQEIYHDLRIQNNREVDERAAFIDMMESCPFEQALVIMDRGYESYNVMAHCQE 206 Query: 224 KGCNRHWLLPAW-----------KNIASEMIELGNTASPGTIPKRLEHLRGALEVVFI-- 270 + + + + N T + + + F+ Sbjct: 207 RNWSYIIRIRDGNHSMKSGFNLPDTPCFDEEFDLNICRKQTNVMKELYRDFPNQYHFLPH 266 Query: 271 ----------TKRPRPSRPRSVKISKTRYPVK 292 +++ P + R +K Sbjct: 267 NASFDLLPNSSRKSDPISFYDLHFRMVRLEIK 298 >UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobium RepID=B5ZZ25_RHILW Length = 381 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 74/264 (28%), Gaps = 21/264 (7%) Query: 28 LPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQN----EPITDVVRRLNLSADG--EAG 81 +P + + RR +I ++ + ++V L + G Sbjct: 15 IPWAVFERLVDEHQADKHVRRLSTKSQLIALLYGQLAGAVSLREIVGSLESHSARLYHLG 74 Query: 82 MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDK 141 ++RS A + LF AQ G ++ IDG+ Sbjct: 75 ARPVSRSTFADANGLRPSTVFAELF---AQMVARAGRGLKRAIGEAVYLIDGSSLSLAGA 131 Query: 142 PELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP 201 + ++ + + + + A P ++ A M I Sbjct: 132 GSQWARFSDQACG---------AKMHVVYDANAERPIYAAVTPANVNDITAAKEM--PIE 180 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHL 261 + +FD +Y LN GC L + + E A G + R+ L Sbjct: 181 AGATYVFDLGYYDFGWWAKLNAAGCRIVSRLKSHTKLTVSA-EQAANADAGILFDRIGLL 239 Query: 262 RGALEVVFITKRPRPSRPRSVKIS 285 RP R V+I Sbjct: 240 PQRQAKSRRNPMNRPVREIGVRIE 263 >UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B70E Length = 479 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 36/279 (12%), Positives = 84/279 (30%), Gaps = 18/279 (6%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMN 83 F + + + R R + +I+ + T + + + Sbjct: 19 FNTLIHSTRVNELCRKKKCDFTRSRNMNFYSIIYYFIFRNRTTTNAELTHFYSSIDRFEK 78 Query: 84 LLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPE 143 +++ A+ +A +++ +L Q A + K + L A DG P Sbjct: 79 RISKQALNKAIRKLNPNVFTYLINQFASIYYSTSLPKK-YRDHLLIAEDGTYMEIPYNML 137 Query: 144 LREYYGSANTSTKR---QNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATI 200 + A R + L ++ + + ++ SET LA + L Sbjct: 138 NINEFQFALGCHVRNMFDVKKVQSKAGGLYDVTNGLFIDFSLRQAPYSETPLAFAHLYRT 197 Query: 201 PD-----NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASP---- 251 + I L D+ + S +++ L +++ N ++ S Sbjct: 198 REMLENQKVIYLADRYYGSAEIISHLED--LRYSYVIRGKSN--FYKKQVAGMESDDEWI 253 Query: 252 -GTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRY 289 + ++ ++ P+ V + RY Sbjct: 254 EVEVDEKWLKRFRFSPEAKKLRKENPTLKIRVIKREYRY 292 >UniRef50_Q8QNB6 EsV-1-170 n=2 Tax=Ectocarpus siliculosus virus 1 RepID=Q8QNB6_ESV1 Length = 383 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 46/262 (17%), Positives = 96/262 (36%), Gaps = 22/262 (8%) Query: 44 TVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVE 103 RRR++ + + + + V + D + AV AR+++ + Sbjct: 21 MQRRRKMDTSSLFYTLTRCCVQGRGVNHVLKMED-----EAYSSQAVHSARKKLPMGAFK 75 Query: 104 WLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTST-KRQNAYP 162 + R + ++FA+DG++ Y N R P Sbjct: 76 EVNRFLHRGPHEP----------RVFAVDGSKVHVHPSFINAGYKTRTNDQPVSRPAKRP 125 Query: 163 VMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLN 222 ++ L +++++ + ++ +E A SML ++ LFD+ +YS+DLL +++ Sbjct: 126 LVMLSSMVDVKTKACIDFELTK-HFNERRAATSMLRSVQKGDTLLFDRGYYSKDLLHSVH 184 Query: 223 QKGCNRHWLL-----PAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPS 277 W L ++ + G + L++ V +T P S Sbjct: 185 GSHAFGVWRLKIDAFRGTRSFFNSCRTEATCLILGVKARLLKYFIDGKTYVCLTTDPSLS 244 Query: 278 RPRSVKISKTRYPVKHSAAPLK 299 R + + +R+ V+ S LK Sbjct: 245 RLKIKTMYASRWRVEESFKRLK 266 >UniRef50_A3ZMM8 Transposase insG for insertion sequence element-like protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZMM8_9PLAN Length = 464 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 41/237 (17%), Positives = 81/237 (34%), Gaps = 21/237 (8%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPI---------TDVVRRL 72 L +E L I+ + + + IW+++ ++ V Sbjct: 20 TLLSEILRVPQIEAIVDFDDRP-NTKMVYTQAVTIWLLILQRLRGGASLQTVVSEAVEHQ 78 Query: 73 NLSADG----EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQL 128 G S+ + AR+R+ +E F + D R ++ + ++ Sbjct: 79 ADLFPDNKRVHEGTLGENTSSFSAARKRLPLDAIER-FSRCVCDHL-GRTVEPVFDDRRV 136 Query: 129 FAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILL----NAVTAP 184 F IDG P P L++ + T+ + +PV L+ + + +L + + P Sbjct: 137 FIIDGTTITLPPTPVLKKAFP-PATNQLGETVWPVAMLMVAAEMQTGCILVPKIDPMYGP 195 Query: 185 YRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 SE A ++ +P SI L D F + G + + L + A Sbjct: 196 NNSSEAKQAREIVGDLPSRSIVLADSCFGIFSVAHHTRAAGHDFLFRLSMLRLKAHR 252 >UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula baltica RepID=Q7UPU9_RHOBA Length = 656 Score = 103 bits (256), Expect = 8e-21, Method: Composition-based stats. Identities = 29/248 (11%), Positives = 71/248 (28%), Gaps = 27/248 (10%) Query: 34 QHCLTLSAHATVRRRRLPGDM--VIWMVVQNEPITDVVRR-LNLSADGEAGMNLLARSAV 90 + + AH + + + ++W+ + + D + + S + Sbjct: 107 EQAVPTQAHGNTGWKTMTLMIQALLWIFSDKDKLKDAFDSGTRQCKKVHGRIAFSSYSGL 166 Query: 91 TQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGS 150 +A R E L + + G + A+DG++ TP + + + Sbjct: 167 IKALVRWTPWLSEVLLTRIHKQIETTAGKLWRTTGWVVMAVDGSRDTTPRTLSNEKAFCA 226 Query: 151 ANTSTKRQNAY----------------------PVMRLVALMNLGSHILLNAVTAPYRQS 188 N + Y P + + + ++ + + P S Sbjct: 227 PNHGHGKTARYRKKKTKGMRRQAIEKNPPAPPVPQIWITMIWHVATQLTWCWKLGPSNAS 286 Query: 189 ETVLAHSMLA--TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELG 246 E ML P+ ++ D F + ++ G + + A N+ + Sbjct: 287 ERAHVQEMLENGEFPEKTLFTGDAGFVGYEFWKSIIDGGHHFLVRVGANVNLLHSLGYDV 346 Query: 247 NTASPGTI 254 + Sbjct: 347 EPDEDNLV 354 >UniRef50_C6JEA3 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JEA3_9FIRM Length = 329 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 44/255 (17%), Positives = 94/255 (36%), Gaps = 26/255 (10%) Query: 41 AHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAA 100 +R R+L + M + E D +R L ++ +++A + R+++ Sbjct: 30 GVDFIRNRKLGFKDYMLMFLTME--ADCIRE-ELYRFFGRTIDAPSKAAFYRQRKKIRED 86 Query: 101 PVEWLFRQTAQDRGAERYLKDDWHG-LQLFAIDGAQFRTPDKPELREYYGSANTSTKRQN 159 L + K ++G + +A DG+ PE ++ Y N + R Sbjct: 87 AFRNLLLAFNRKL-----PKKLYNGKYEFWACDGSSCDIFLNPEDKDTYFEPNGKSTR-- 139 Query: 160 AYPVMRLVALMNLGSHILLNAVTAPYR-QSETVLAHSMLAT--IPDNSITLF--DKLFYS 214 + + + A+ +L + + P R ++E SM+ + IP++ +F D+ + S Sbjct: 140 GFNQIHINAMFSLFDKRFTDILVQPARKRNEYSAFCSMVDSADIPEHYKVIFFGDRGYTS 199 Query: 215 EDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRP 274 + + +KG ++L+ AS M+ L + ++ + Sbjct: 200 YNNFAHVIEKGQ--YFLIRCNDKRASGMM---GYPVD-----TLPAFDEDISLILTRSKA 249 Query: 275 RPSRPRSVKISKTRY 289 R S RY Sbjct: 250 VSKYSRPELFSSYRY 264 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 99.9 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 84/226 (37%), Gaps = 24/226 (10%) Query: 69 VRRLNLSADGEAGMNL--LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGL 126 L+ G + SA Q R ++ ++ LF + + +E Sbjct: 3 GNSLSKELYDWLGYSSETATASAFVQQRDKIRPEALKLLFHEFTRLTVSENS----LQDY 58 Query: 127 QLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR 186 +L A+DG+ R P + + S S +N Y ++ L A+ +L + ++A + Sbjct: 59 RLLAVDGSDLRLPSNSKD--GFSSIRNSEDSKN-YNLVHLDAMYDLMGKVYVDASVQSKK 115 Query: 187 -QSETVLAHSMLAT--IPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 +E SM+ I N I + D+ + S + + +K ++++ A Sbjct: 116 GMNEHKALVSMVDQSEINGNVIAIMDRGYESFNNIAHFQEKSW--YYIIRAK-------- 165 Query: 244 ELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRY 289 E S ++P E+ + +T+R +K RY Sbjct: 166 ESYGIISRLSLPDYPEYDEEIMLT--LTRRQTKETLPLLKAYPHRY 209 >UniRef50_Q82QT3 Putative uncharacterized protein n=1 Tax=Streptomyces avermitilis RepID=Q82QT3_STRAW Length = 182 Score = 99.1 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 46/112 (41%), Gaps = 2/112 (1%) Query: 140 DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 E++G +A+ R+VAL G+H + AV P L+ + Sbjct: 20 RTWANEEFFGRQAGGRGE-SAFAQARVVALAECGTHAVFGAVIGPAVGGRAELSRQLFPQ 78 Query: 200 IPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS-EMIELGNTAS 250 + + + L D+ FY +L T G + W L + + ++++ G+ S Sbjct: 79 LGEGKLLLADQGFYGFELWQTARATGADLLWRLRSSAAVPGLQVLDDGSYLS 130 >UniRef50_Q09BD0 Isrso13-transposase protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09BD0_STIAU Length = 387 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 82/253 (32%), Gaps = 31/253 (12%) Query: 23 LFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGM 82 + + EW+ R+R+ +++ VV+ + + R +L A +A Sbjct: 24 VLQRAVSAEWMDSLFEA-----HRKRQYTRELLFSTVVELMSVVAMGLRPSLHAGAKATE 78 Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYL-----KDDWHGLQLFAIDGAQFR 137 + +A+ + R+ V L R +AQ K G ++ +DG Sbjct: 79 GGTSIAALYEKVNRMEPDLVRALVRGSAQRLEPVVQPLRTGEKPWAEGYRVRVMDGNHLP 138 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV-TAPYRQSETVLAHSM 196 R A P + + ++++ V E L ++ Sbjct: 139 -------ASEKRLKPLREFRGAALP-GHSLVVYAPEQGLVVDVVPCEDAHAQERTLVAAV 190 Query: 197 LATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPK 256 L + + D+ F + ++ L + A + E G T SP + K Sbjct: 191 LEHAQQGDLWIADRNFSTTRIVFGLEDRHA------------AFIIREHGRTPSPTEVGK 238 Query: 257 RLEHLRGALEVVF 269 R R VVF Sbjct: 239 RKRVGRVETGVVF 251 >UniRef50_C4XGQ6 Putative transposase for insertion sequence element n=2 Tax=Desulfovibrio magneticus RS-1 RepID=C4XGQ6_DESMR Length = 376 Score = 97.2 bits (240), Expect = 5e-19, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 75/233 (32%), Gaps = 23/233 (9%) Query: 14 PLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLN 73 L+P +++ E + +H ++ + + + D + +N Sbjct: 13 SLVPKSVFFKLSQNYRPERSPRTFSPWSHFVH--------LLHAQLAGCKSLRDGIMGMN 64 Query: 74 LSADG--EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 +++ G+ + RS A + E LF + ++ K +LF++ Sbjct: 65 AASNRLYHLGVKPVPRSTFADANAKRPYTMFEALFGELYTRCLSQAPKKKFSFENKLFSL 124 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 D + + +A +++ +M+ ++ + E Sbjct: 125 DASVVDLCLNLFPWAKFRTAKGG---------IKMHTVMDHDGYLPAVVTVTEAKCHEVN 175 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 +A + +P SI +FD+ + L + G N +IE Sbjct: 176 IAKLL--KLPKGSIVVFDRGYNDYTWFRHLCKSGVFLV--TRLKSNARFRVIE 224 >UniRef50_D0SX83 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0SX83_ACILW Length = 140 Score = 97.2 bits (240), Expect = 7e-19, Method: Composition-based stats. Identities = 48/124 (38%), Positives = 68/124 (54%), Gaps = 7/124 (5%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 M D+LD ++ L + F ++P EW++ L LS+ AT+RRR LP D V+W+V+ Sbjct: 10 MIFQQDILDLNN--LFKLSNLSTFIHNIPVEWVKSTLRLSSPATIRRRCLPADQVLWLVL 67 Query: 61 QNEPITDVV-----RRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA 115 DV+ RRLN+ A +LL R ++T R+ +GA VEWLF QT Q G Sbjct: 68 GMAIFRDVLIHEAARRLNICTQWLASYDLLTRISLTNTRKHLGADSVEWLFHQTDQHWGQ 127 Query: 116 ERYL 119 E Y Sbjct: 128 EHYP 131 >UniRef50_UPI00019668E9 transposase ISLbp1 n=1 Tax=Methanobrevibacter smithii DSM 2374 RepID=UPI00019668E9 Length = 319 Score = 96.4 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 39/288 (13%), Positives = 101/288 (35%), Gaps = 27/288 (9%) Query: 24 FAEHLPTEW------IQHCLTLSAHATVRRRRLPGDMVIWMVV--QNEPITDVVRRLNLS 75 + + E+ ++ LS R+ +L + +I + + + + R + Sbjct: 3 LSRIVSDEFGNFNNFVERKYVLSDVDFTRKGKLSLESMIKYPLCNNKKTTSIEINRFLRN 62 Query: 76 ADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQ 135 + G+ + + AV++ RQ + + A + + G ++AIDG+ Sbjct: 63 ELNDRGVR-ITKQAVSEKRQFIDPQVYIDMNGSLISKIYAHKDEMTTFKGFNVYAIDGSI 121 Query: 136 FRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHS 195 P+ RE + + ++ R+ + + ++++ SE A Sbjct: 122 VEIPNTKLTREEFEIPEKTELMKDT-STARISCMADTKWDFIISSNITNKSTSEIEHALM 180 Query: 196 MLATIPD-----NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI----ASEMIELG 246 L + + +IT +D+ + S +++ + ++++ + ++M + G Sbjct: 181 HLDDVKNKIDLTKTITTYDRFYNSIEIM--FKTMLLDSYFIIRGKTHTFKKQQNKMKKEG 238 Query: 247 NTASPGTIPKRLE------HLRGALEVVFITKRPRPSRPRSVKISKTR 288 I + H R T+ +R R + + R Sbjct: 239 KPDETHEINIKKSTNQQFFHRRSEKIRKKNTRNKSQNRIRKTQNRRNR 286 >UniRef50_B2J2I5 Transposase, IS4 family protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J2I5_NOSP7 Length = 439 Score = 95.3 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 33/230 (14%), Positives = 70/230 (30%), Gaps = 25/230 (10%) Query: 35 HCLTLSAHATVRRRRLPGDMVIWMVVQNE-----PITDVVRRLNLSADGEAGMNLLARSA 89 R R L +++ +VV + +V R L AG ++ A Sbjct: 45 KLYETEEKKPFRDRILTLSVMMALVVSLVYRQIPGLREVQRVLCEEGLLWAGRIEVSAQA 104 Query: 90 VTQARQRVGAAPVEWLFRQT-----AQDRGAERYLKDDWHGLQLFAI---DGAQFRTPDK 141 V++ + + +F Q Q + + AI DG+ Sbjct: 105 VSKRLRTLPIELFAQIFEQVMERMNVQPQNQAVPENWQPVCAKFTAIWIADGSTLE---- 160 Query: 142 PELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT-APYRQSETVLAHSMLATI 200 LR K +++ ++ SH + + ++ +L + Sbjct: 161 -ALRRKLKVLQEQEKTLAG----KIMMVVEAFSHHPVTTWYTQNSKANDKTWCEQLLERL 215 Query: 201 PDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTAS 250 P + +FD F+ ++ +L + + ++I AS Sbjct: 216 PIGGLLIFDLGFFKFPWFDAF--TEADKFFLTRLREKTSYKVIRCLTNAS 263 >UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FI31_DESAA Length = 386 Score = 95.3 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 29/231 (12%), Positives = 72/231 (31%), Gaps = 20/231 (8%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGD-MVIWMVVQNEPITDVVRRLNLSADGE 79 + + R+L + M + +R L D Sbjct: 8 LSQLLQSIDRHDFNRIEKQGFLPDRSYRKLTRWGQFVAMAFSHLTQRTSLRDLEGQFDAH 67 Query: 80 ------AGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDG 133 AG + RS + A + A E +F A + L+++D Sbjct: 68 SSKLYHAGAAPVKRSTLADANNQRPAEFFEEVFYHMAAKCQSHAPKHKFRFKNPLYSMDS 127 Query: 134 AQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLA 193 + + A + + +++ +++ +I + S+ +A Sbjct: 128 SVVDLCLNL-----FPWAKHRSTKAG----IKIHTVLDHSGYIPAFVRITDAKTSDIEIA 178 Query: 194 HSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 ++ ++P SI + D+ + ++ + ++ KNI +++E Sbjct: 179 RTL--SLPKGSILVEDRAYVDFTWFKNWHEN--KQFFVTRLKKNIKYKVLE 225 >UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipelotrichaceae RepID=B7C7E2_9FIRM Length = 446 Score = 93.7 bits (231), Expect = 6e-18, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 65/196 (33%), Gaps = 14/196 (7%) Query: 58 MVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAER 117 ++++ + D + +L ++ S+ QAR ++ LF Sbjct: 63 LLLEGGSLKDELYKL-----FGYNLDTPTVSSFIQARDKIKPDTFHILFNLFNGR----T 113 Query: 118 YLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHIL 177 ++G +L A+DG+ P E+++ + + + L ++ + Sbjct: 114 RKPKLYNGYRLLAVDGSTL--PITSEIKDKKTTIQKANNSDKPFSAFHLNTSYDILEYTY 171 Query: 178 LNAVT-APYRQSETVLAHSMLATI-PDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 + + Q E + M+ D +I + D+ + S + ++ G + Sbjct: 172 DDVILQGQAVQDERDALNKMVERYKGDKAIFIADRGYESINSFEKIHLSGNKYLVRVKDI 231 Query: 236 KNIASEMIELGNTASP 251 + + G Sbjct: 232 HST-GMLRSFGPFLDD 246 >UniRef50_A4A0C6 Probable transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A4A0C6_9PLAN Length = 442 Score = 90.3 bits (222), Expect = 8e-17, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 57/160 (35%), Gaps = 9/160 (5%) Query: 86 ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELR 145 R + +A R G A + + A E G FA+DGA+F P + Sbjct: 81 TRQGLLKALARHGEALIPQVVAHIADQL-RELKGDWTQRGKVNFAVDGAKFLAPRTAANQ 139 Query: 146 EYY------GSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 + + A+ S + + + + +L + + A + SE ML Sbjct: 140 QQFASKKEKQYASKSNQSKAESAQLLATVVWHLTAGLPYRWRIAGSKGSERHALTDMLDE 199 Query: 200 IPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA 239 +P N+ + D + L + R +L+ N++ Sbjct: 200 LPSNARIIADAEYVGYPLWSAILDS--KRSFLVRVGSNVS 237 >UniRef50_A4A0C3 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A0C3_9PLAN Length = 445 Score = 88.7 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 71/232 (30%), Gaps = 18/232 (7%) Query: 18 PPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIW--MVVQNEPITDVVRRLNLS 75 ++ LP ++ + W ++ + + RL Sbjct: 24 RELTAAISQFLPPQFFAR------WRFHGNANWTPQRLAWAACLMSWSTDSTLTDRLESV 77 Query: 76 ADGEAGMNL-----LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 + LA +A R ++ + RQ + G + A Sbjct: 78 GALLGELFPRWRVGLALGGFGRACIRETPRMLDEVRRQLRSSVA-GWLEQYRVFGWVVMA 136 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +DG++F P G A R+ P + L ++G+ + + P SE Sbjct: 137 VDGSRFEAPRTRANEAGLGCA----GREKTTPQIYQTTLQHVGTSLPWDFRIGPGTASER 192 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEM 242 ML +P S+ + D F S DL L + + ++ ++ Sbjct: 193 RQLDEMLPDLPGKSLLIADAGFISYDLCRVLLMGRHDFLLRVGGNTHLLEKL 244 >UniRef50_A5N5R2 Transposase n=6 Tax=Clostridium RepID=A5N5R2_CLOK5 Length = 205 Score = 87.6 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 52/121 (42%), Gaps = 7/121 (5%) Query: 128 LFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ 187 +FA+DG++ + P+ E R ++G + + + ++ ++ +H L+ + Sbjct: 1 MFAVDGSKAKVPNSDENRAFFGECGNNHSKGQVR--ALVSSIFDVFNHFFLDLQIDSIKT 58 Query: 188 SETVLAHSMLATIPD-----NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEM 242 SE+ LA + I N I +FD+ + S +L+ L + G + L + Sbjct: 59 SESELAKKNINAIRKILPNTNFIVVFDRGYLSIELIHFLEENGVQYLFRLSSNDYKKERE 118 Query: 243 I 243 Sbjct: 119 F 119 >UniRef50_D1N0Z4 Transposase IS4 family protein n=3 Tax=Bacteria RepID=D1N0Z4_9BACT Length = 384 Score = 87.6 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 29/236 (12%), Positives = 70/236 (29%), Gaps = 20/236 (8%) Query: 26 EHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAG---- 81 + +P + + ++ + M+ + +R + G Sbjct: 14 DLIPKREFEEIVMKHNGDKRKQSFDSWAHFVSMIFCQLAQANSLREICGGLKTCGGKLNH 73 Query: 82 ---MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRT 138 + +S ++ A +F A + +L+++D Sbjct: 74 LGVESAPTKSNLSYANAHRSPKMFGDIFHMLLGHCHAIAPRHEFSFPKKLYSLDATLIEL 133 Query: 139 PDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLA 198 K Y + ++L L++ H+ + E A M Sbjct: 134 CVKVFPWATYRQTKGA---------IKLNMLLDHDGHLPVFVDFTNGDVHEVNSARRM-- 182 Query: 199 TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTI 254 +P +S+ + D+ + +L N G + ++ N ++ E PGT+ Sbjct: 183 ELPRDSMVVCDRGYVDFSMLYKWNLSGVD--FVTRLKTNATYDIPEYDVKQYPGTV 236 >UniRef50_B8FDX7 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FDX7_DESAA Length = 395 Score = 86.4 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 24/243 (9%), Positives = 61/243 (25%), Gaps = 22/243 (9%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEA 80 + + D + M+ +R + Sbjct: 8 FSQLTGLFNRNQFYALVLRHGSEKHAKGFSSWDHFVAMLFCQIAQAKSLREICSGMACCL 67 Query: 81 GM-------NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQ--LFAI 131 G RS ++ A Q+ + +F T + L ++ Sbjct: 68 GKLRHLGVKGAPKRSTLSYANQKRTWKLFQDVFYDTLHLCRQAPSPGKTKFRFRNKLMSL 127 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 D + Y + ++L L++ ++ + A + + Sbjct: 128 DSSTISLCLSLFPWAEYRQTKGA---------VKLHLLLDHDGYLPVFACITDGKTHDVT 178 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASP 251 +A + + SI + D+ + L + +++ N A ++ Sbjct: 179 MARQL--ALSKGSIVVMDRGYNDYKLYAEWVED--EVYFVTRLKDNAAFMVLADFPVPKN 234 Query: 252 GTI 254 I Sbjct: 235 RNI 237 >UniRef50_B8CMP8 Transposase OrfA, putative n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CMP8_SHEPW Length = 156 Score = 83.7 bits (205), Expect = 7e-15, Method: Composition-based stats. Identities = 35/144 (24%), Positives = 62/144 (43%), Gaps = 12/144 (8%) Query: 1 MPLLNDLLDFSDH----PLMPPPSAQLFAEHLPTEWIQHC--LTLSAHATVRRRRL---P 51 + +L+ L++ S+ + P E L E IQ + +HA+ + Sbjct: 16 LFVLDFLMELSEALTRISINRPTEFANLGELLCPELIQKLFTIQWCSHASHTKITYGVND 75 Query: 52 GDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQ 111 ++ E + ++ +L++ E G +ARS VTQ R+++ + VE +FRQT Q Sbjct: 76 FGCYRHGLISGESVRQLIYKLDIILLNEVGY--VARSTVTQTRKKLTSDVVEDIFRQTPQ 133 Query: 112 DRGAERYLKDDWHGLQLFAIDGAQ 135 W GL L+ +DG Sbjct: 134 RW-NMLAEHPQWCGLNLYGVDGVV 156 >UniRef50_Q877R2 Transposase n=51 Tax=Bacteroidales RepID=Q877R2_BACTN Length = 387 Score = 81.4 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 23/231 (9%), Positives = 71/231 (30%), Gaps = 20/231 (8%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEA 80 A L +T + + ++ ++ + +R L ++ + Sbjct: 8 FAQLASFLNRSKFNRIVTKYDGDKYVKHFTCWNQLLALMFGQLSNRESLRDLIVALEAHH 67 Query: 81 GMNL-------LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDG 133 +++S++ +A Q E + + G ++A D Sbjct: 68 SKCYHLGMGKNVSKSSLARANQDRDYHIFEEYAYYLVSEARQKCANHIFKLGGNVYAFDS 127 Query: 134 AQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLA 193 + +++ L ++ + I ++ + Sbjct: 128 TTIDLCLSVFWWAKFRKKKGG---------IKVHTLYDVETQIPAFFHITEASVHDSKVM 178 Query: 194 HSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 + +S +FD+ + + +L ++Q ++++ A KN+ + I+ Sbjct: 179 IEI--PYEPSSYYIFDRGYNNFKMLYKIHQ--IEAYFVVRAKKNLQYKSIQ 225 >UniRef50_A5N172 Transposase n=4 Tax=Clostridium RepID=A5N172_CLOK5 Length = 147 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 46/109 (42%), Gaps = 1/109 (0%) Query: 46 RRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWL 105 R+R++P +I + + +T + A +++ Q R+++ +L Sbjct: 35 RKRKMPLSDIILCTLSKKGLTIAIELHQYFTQKGACHMSISKQGYLQQRKKLNYKVFSFL 94 Query: 106 FRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTS 154 ++ +D W+ +FA+DG++ P+ E R ++G + Sbjct: 95 NKEYLEDFYHSTEPI-LWNNHLVFAVDGSKAEVPNSDENRAFFGECGNN 142 >UniRef50_C7PAE4 Transposase IS4 family protein n=4 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PAE4_CHIPD Length = 412 Score = 77.2 bits (188), Expect = 7e-13, Method: Composition-based stats. Identities = 27/242 (11%), Positives = 71/242 (29%), Gaps = 19/242 (7%) Query: 12 DHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRR 71 P +PT I + + D ++ M+ + +R Sbjct: 4 SKFFSGQPIFNQLLSFIPTTLIDKVCRETNADYYYKHFKAFDHLVTMLFSSFHQCTSLRE 63 Query: 72 L-------NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWH 124 L + RS ++ A + A E L+ + Sbjct: 64 LHTGLLANQHRLHHLGIKHTPRRSTISDANRTRPVAFFEKLYHRLYNHHYQAFSPDSRKR 123 Query: 125 GL---QLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV 181 +LF +D + + G +++ ++ LM + + + Sbjct: 124 KSLVDRLFIVDSTTVSLFSN--VMKGAGVIRMDGRKKGG---IKAHVLMTAKTELPSFTI 178 Query: 182 TAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 +++ ++ + + SI D+ + + L+ +K W+ K++ + Sbjct: 179 LTEAAKNDRIIMPQL--ELLPGSIIAMDRAYVNYKLMKEWTEK--EITWVTRVTKSMKIK 234 Query: 242 MI 243 ++ Sbjct: 235 LL 236 >UniRef50_D1K7L7 Transposase n=3 Tax=Bacteroidales RepID=D1K7L7_9BACE Length = 389 Score = 76.8 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 26/231 (11%), Positives = 67/231 (29%), Gaps = 24/231 (10%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEA 80 + LP + + R + ++ M+ D +R L LS + Sbjct: 8 FAQLTDFLPRRVFDRLVEKYSGNKKIRTFTCWNQMLCMIFGQLTARDSMRDLMLSLEAHK 67 Query: 81 GM-------NLLARSAVTQARQRVGAAPVEWLFRQT---AQDRGAERYLKDDWHGLQLFA 130 ++R+ + +A + E A++ + + ++A Sbjct: 68 NKYYHLGFGATVSRTNLGKANRNRDYRIYEEFAYTLIAEARNNYNKNDFEVK-VDSNVYA 126 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 D + + ++L L ++ + I + + + Sbjct: 127 FDSSTIDLCLNVFWWAEFRKHKGG---------IKLHTLYDVKTSIPTIVLVTNAKVHDV 177 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 + + + S + DK + L L+ G +++ A N+ Sbjct: 178 NMLDEL--SYEKGSFYIMDKGYVDFTRLHKLHTCGA--YFVTRAKNNMRFR 224 >UniRef50_C8VYK7 Putative uncharacterized protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VYK7_DESAS Length = 161 Score = 76.8 bits (187), Expect = 9e-13, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 42/130 (32%), Gaps = 8/130 (6%) Query: 95 QRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTS 154 + + + G +L AID + + LR+ +G + Sbjct: 28 SKRSPNAFIKMAEAII-TWYYGDDNFKTFKGYRLSAIDASILEITNSERLRDAFGYS--- 83 Query: 155 TKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP----DNSITLFDK 210 + + + ++ + +++ + Y E +A ++ + N + LFD+ Sbjct: 84 EGKTVKLARAKASDIYDIENDMMITSKITRYTTGERDIAIELIEKLKKLVLKNDLILFDR 143 Query: 211 LFYSEDLLLT 220 + + Sbjct: 144 RYALAKIWKG 153 >UniRef50_Q0F098 ISGsu1, transposase n=6 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F098_9PROT Length = 383 Score = 75.2 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 33/264 (12%), Positives = 77/264 (29%), Gaps = 28/264 (10%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGD----MVIWMVV-QNEPITDVVRRLNLS 75 +P + + R R L ++++ + + + DVV Sbjct: 8 FHQLLRVIPRHRFEEVVRRYDG-DRRIRSLSCWTQFCVMLYAQLCSRQSLRDVVSAWESH 66 Query: 76 ADGEA--GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDG 133 A G + RS + A + A LF + + D + ID Sbjct: 67 ASRHYHLGAGSVRRSTLADANVKRSAGMYLELFYWLLHQFRGKGIHRKD----AVRLIDS 122 Query: 134 AQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLA 193 + + +++ + + + + ++ A Sbjct: 123 TTIDLCKHQFEWASF---------RTGKSGVKVHTVYDPDAQVPTFFSITAAKKH-DKKA 172 Query: 194 HSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGT 253 + +P + +FD+ + L Q+ + ++ +N E++ + G Sbjct: 173 AEHMPLLP-GATYVFDRAYNDYAWFHDLTQR--DIRFVSRMKRNAEFEVVATLPVSDDGV 229 Query: 254 IPK---RLEHLRGALEVVFITKRP 274 + RL +G E I +R Sbjct: 230 LEDQHIRLSSAKGRKECPTILRRI 253 >UniRef50_C5V7Z6 Transposase IS4 family protein n=3 Tax=root RepID=C5V7Z6_9PROT Length = 389 Score = 74.8 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 37/284 (13%), Positives = 86/284 (30%), Gaps = 31/284 (10%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMV-VQNEPITDVVRRLNLSADGE 79 + +P ++ A A + + D +R L + + Sbjct: 8 FAQLLDFVPFNHFEYLTERFA-ANHGIKHFSAWSQFICMAYAQLTRRDGLRDLVACLNSQ 66 Query: 80 AGM-------NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQ--LFA 130 + ++RS + A +R E L + +D GL+ L+A Sbjct: 67 KSKLYHIGIRSKVSRSTLADANERRDWRLFEALGHRLISIALELYRDEDIGLGLKEPLYA 126 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +D + S + ++ +++L I + + + Sbjct: 127 MDSTTIDLCLTLFPWAEFRSTKAA---------VKAHTIIDLRGSIPVFLSITTGKVHDV 177 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTAS 250 L ++ P +I + D+ + L L+Q+ +++ A N+ I Sbjct: 178 NLL-DVIP-FPAGTIVVIDRGYLHFARLYALHQRQVT--FVIRAKNNLRFTWIASREV-- 231 Query: 251 PGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHS 294 + LR ++ T + + + P ++ R P Sbjct: 232 -----DKATGLRCDQTILLATPKSKTAYPERLRRVSFRDPETGK 270 >UniRef50_C6JHV1 Putative uncharacterized protein (Fragment) n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JHV1_9FIRM Length = 138 Score = 74.1 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 48/106 (45%), Gaps = 3/106 (2%) Query: 45 VRRRRLPGDMVIWMVV--QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPV 102 VRRR+L VI + + + ++ G +++ A+++ARQ + + Sbjct: 34 VRRRKLSLLQVIIYLFFSSKASMFQNLSQIREEL-GTLSFPDVSKQALSKARQFINPSLF 92 Query: 103 EWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYY 148 + L+ + ++ + W G LFA+DG++ P+ +++ Sbjct: 93 KELYYLSVDLFYSQIPSRKLWQGYHLFAVDGSRIELPNSKSTFDFF 138 >UniRef50_A1HQH6 Transposase, IS4 family protein n=2 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HQH6_9FIRM Length = 400 Score = 73.7 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 28/243 (11%), Positives = 83/243 (34%), Gaps = 18/243 (7%) Query: 56 IWMVVQNEPITDVVRRL--NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDR 113 + ++ + + D+ RL + + ++ S +++ + + E +F + + Sbjct: 39 VAQFLRLDSLRDIANRLTCDKQLQKLLHLTSISASTLSRRLRNIDHRVWEQVFAEVKRQI 98 Query: 114 GAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLG 173 + QL ID + + L Y + K + ++ G Sbjct: 99 WQQANKTGAVRQYQLNVIDSSTITLCLRKYLWADYRKTKSGIK-------LHQRITIHDG 151 Query: 174 SHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLP 233 + +AV R+++ + ++ +++ +FD+ + +KG ++ Sbjct: 152 NSYPDSAVLTSARKADKTVMDELV-VTSPDALNVFDRGYVDYAKWDDYCRKGIR--FVSR 208 Query: 234 AWKNIASEMIELGNTASPGTIPKRLEHL------RGALEVVFITKRPRPSRPRSVKISKT 287 N +++E + + + +++ L + V I R + ++ Sbjct: 209 LKSNAVIDVLEEKSVETNQVLAEKIVRLGNAYTTQMTHPVRLIETRDNQGNAVIIVTNEL 268 Query: 288 RYP 290 P Sbjct: 269 TLP 271 >UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=A7BZU6_9GAMM Length = 270 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 69/228 (30%), Gaps = 25/228 (10%) Query: 14 PLMPPPSAQLFAEHLPTEWIQHCLT--LSAHATVRRRRLP-----GDMVIWMVVQNEPIT 66 P + +F + L I+ + S + ++ + V + + Sbjct: 4 PSPREVQSSVFNKLLEP--IEPFIQDQESKLPKHHNQIFNYYDFFILLMYYFVAGKQSVG 61 Query: 67 DVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDR-GAERYLKDDWHG 125 V+ G+ +A S A +R + +F+ + Sbjct: 62 LFVKTELKLLPITLGLRQVAYSTFNDAFERFSPNLFQEVFKYILSTIPFKQISELSTLG- 120 Query: 126 LQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY 185 L+ IDG+ F + L Y S + + ++L L I++ + Sbjct: 121 -VLYCIDGSLFPVINSM-LWAEYTSKHCA---------LKLHLCFELNRMIVVEFLVTAA 169 Query: 186 RQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLP 233 SE M + + D+ + S +L + QK + L Sbjct: 170 NGSERKALQEM---LKAGVTYIGDRGYMSFELCHLMMQKEAYFVFRLK 214 >UniRef50_B2AJ60 Transposase, IS4 family n=4 Tax=Proteobacteria RepID=B2AJ60_CUPTR Length = 412 Score = 72.5 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 75/227 (33%), Gaps = 21/227 (9%) Query: 21 AQLFAEHLPTEWIQHCLT--LSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADG 78 E+L T+ A R+R+L +I ++ N V L+ Sbjct: 7 FASLTEYLHTKVFHDLARHPERPSAFTRQRKLTLPTLIAFMLGN-LRMGVQAELDQFFAA 65 Query: 79 EAGMNL----LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGA 134 A N+ ++ A QAR ++ L + WHG +L A D + Sbjct: 66 LARQNILRRCVSEQAFAQARSKLSGDVFAHLNDWLLRQV---SDHLPRWHGFRLVAADAS 122 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 R + +T+ Q A+ L G+ I+L A ++E + Sbjct: 123 HLRFA-----IRHSHLPRAATRDQLAF------GLYLPGAEIMLAASLHSVHENERQILF 171 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE 241 L + + + L D+ + + L+ LNQ+ A Sbjct: 172 EHLDRLQSDDLLLLDRGYPARWLVAVLNQRKIPFCMRADGSGFAAVR 218 >UniRef50_C8PSK2 ISGsu1, transposase n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PSK2_9SPIO Length = 263 Score = 72.1 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 49/183 (26%), Gaps = 16/183 (8%) Query: 82 MNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDK 141 L RS ++ A + LF K +AID + Sbjct: 52 YKDLVRSTLSYANNHRSPEVFKKLFYSLRDTLDRSARKKLR---KDFYAIDATEISLNIN 108 Query: 142 PELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP 201 + SA +++ ++ + + + E + M + Sbjct: 109 DFPWATFRSAIGG---------IKINMKYDINNSVPDYLFMTNANEHENHTLNDM--HLS 157 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHL 261 FDK + + +K + ++ +N +I T SP + Sbjct: 158 KGDTATFDKGYCNYSTFGAFCEK--DIFFVTRLKENAKYTVIASRLTDSPLVVSDETIIF 215 Query: 262 RGA 264 G Sbjct: 216 SGK 218 >UniRef50_Q093Y3 Isrso13-transposase protein n=7 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093Y3_STIAU Length = 457 Score = 71.4 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 37/281 (13%), Positives = 89/281 (31%), Gaps = 27/281 (9%) Query: 8 LDFSDHPLMPPPSAQLFAEHL-----PTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQN 62 LD + + A L EW++ R+R+ +++ V Sbjct: 3 LDDVFERFVRKSPLSVMARLLMQRALSAEWMEGLFQE-----HRQRQYTKELLFSAEVGL 57 Query: 63 EPITDVVRRLNLSADGEAGMNL-LARSAVTQARQRVGAAPVEWLFRQTAQDRGA-----E 116 + + R +L A + L +++ A+ + V L + + + + Sbjct: 58 MELVALGLRPSLHAAAQDSEELKVSQQALYEKVNHTEPELVRALVQGSGERLTPIVKQLK 117 Query: 117 RYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHI 176 + G ++ +DG + R A P LV + + Sbjct: 118 LQQEPWAAGYRVRVLDGNKLA-------ASEKRLKPLRGFRGAAMPGQSLV-VYAPEWDL 169 Query: 177 LLNAVTA-PYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 +++ + A E L +L + + L D+ F ++++L + + G A Sbjct: 170 VVDILPAEDAHAQERALMGPILERVQPGELWLADRNFSTKNILFGIEETGAAFLVREHAQ 229 Query: 236 KNIASEM--IELGNTASPGTIPKRLEHLRGALEVVFITKRP 274 E+ ++ + G + ++ + +R Sbjct: 230 TPHPKEVGTLKEVGRSKTGVVFEQAVEIEAEGGKRLALRRV 270 >UniRef50_UPI0001AF03EF IS4 family transposase n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF03EF Length = 374 Score = 70.6 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 41/107 (38%), Gaps = 2/107 (1%) Query: 165 RLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQK 224 L+ L G+ L+ A P ++E+ A + + + + L D+ F +LL + ++ Sbjct: 1 MLMTLCETGTRALIAAAFGPAVKAESDYARELTGHLTPDMLLLADRAFDGNELLAAIARQ 60 Query: 225 GCNRHWL-LPAWKNIASEMIELGNTASP-GTIPKRLEHLRGALEVVF 269 G + ++ G+ + G + R+ + V Sbjct: 61 GAQFLVRCTSTRRPPVLALLPDGSYLTRIGNLSLRVIEAKVEARTVD 107 >UniRef50_Q82UV9 Putative uncharacterized protein n=1 Tax=Nitrosomonas europaea RepID=Q82UV9_NITEU Length = 91 Score = 70.6 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 32/73 (43%) Query: 152 NTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKL 211 + + + YP +R V L+ G+H+L Y+ +E LAH +A + + L D+ Sbjct: 2 PGTQQGRTGYPQLRFVGLLENGTHVLFGVALGGYQDAEVRLAHQTIAHLKPGMLCLADRG 61 Query: 212 FYSEDLLLTLNQK 224 L ++ Sbjct: 62 LSGYPLWAAASRT 74 >UniRef50_B3E6V4 Transposase IS4 family protein n=8 Tax=Proteobacteria RepID=B3E6V4_GEOLS Length = 372 Score = 69.8 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 32/246 (13%), Positives = 69/246 (28%), Gaps = 24/246 (9%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV 60 MP N +L+ + A + + + L G + Sbjct: 1 MPHSNTVLNQV-VRFFKRHEFETLAR---KHHVGQQFRSFSRWSQFTAMLVGQLT----- 51 Query: 61 QNEPITDVVRRLNLSADG--EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERY 118 + + D+V L + G + RS + + + + LF + A Sbjct: 52 GRKSLRDLVDNLKVQGHKLYHLGTRDVPRSTLARVNEEQPHQLYKELFHKLLGRCQAIAP 111 Query: 119 LKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILL 178 +L+ +D K Y A + ++L ++ ++ Sbjct: 112 KNRFKLDAKLYLLDATVINLCLKVFPWASYQKAKGA---------IKLHVGLSADGYLPE 162 Query: 179 NAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 ++ E A + +P S +FD+ + D L ++ N Sbjct: 163 FFDVTTGKEHEINWARLL--KLPTGSFVVFDRGYTDYDWYQALMDSS--IFFVARLKDNA 218 Query: 239 ASEMIE 244 E + Sbjct: 219 LVEYFK 224 >UniRef50_Q1VPP4 ISPg4, transposase n=7 Tax=Bacteria RepID=Q1VPP4_9FLAO Length = 411 Score = 69.5 bits (168), Expect = 1e-10, Method: Composition-based stats. Identities = 28/249 (11%), Positives = 68/249 (27%), Gaps = 27/249 (10%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRL--- 72 P Q+ + +P + C + D + + + + Sbjct: 11 NKPVIRQIL-DLVPHWLFRSCTNTYKTDKGVHKYRTYDQFVALTFGQLNKCQSLNDISAG 69 Query: 73 ----NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWH---- 124 + ARS ++ ++ E L+ + + + H Sbjct: 70 IGVSEIFISDLGLTQSPARSTMSDGNKKRDWQVFESLYYRLLSHYKSVLKQHHNTHIIEE 129 Query: 125 --GLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 G + ID + + +A ++L + I Sbjct: 130 IKGKVVKLIDSSTISLCLAMFDWAEFRTAKGG---------IKLHTSWDYNLMIPDVVNI 180 Query: 183 APYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEM 242 + + ++ P ++I + D+ ++ +L+ LN+ ++ N E Sbjct: 181 TEAKVHDRYGLKQLI--FPKDTIIVEDRAYFDFELM--LNRIKAENVFVTRIKSNTLYET 236 Query: 243 IELGNTASP 251 IE A Sbjct: 237 IEELELADD 245 >UniRef50_C3KKH4 Putative transposase Y4ZB n=2 Tax=Rhizobium sp. NGR234 RepID=C3KKH4_RHISN Length = 493 Score = 67.9 bits (164), Expect = 4e-10, Method: Composition-based stats. Identities = 36/298 (12%), Positives = 79/298 (26%), Gaps = 34/298 (11%) Query: 16 MPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNL- 74 P + + Q + +R D ++ ++ +R L Sbjct: 109 FSPSIFGQLLKAIDRRSFQAIVDRHGGDAYDKRFTSWDHLVALIYAQFSAATSLRGLEAG 168 Query: 75 -----SADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLF 129 G L RS ++ A R A F A + Sbjct: 169 WNANAQQHYHLGSARLLRSTLSDANARRPVAVFAETFALVAGQLDRQTRR---------- 218 Query: 130 AIDGAQFR--TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ 187 DG++ P S R M+L + + + Sbjct: 219 --DGSKMLRLIDSTPIPLGKLCDWAKSNGRIRG---MKLHVVYDPKADCPRLLDITDANV 273 Query: 188 SETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE--- 244 ++ + ++ TI + +FDK + + ++ N+A +++ Sbjct: 274 NDAQIGRTV--TIEKGATYVFDKGYCHYGWWTAIAA--AKAVFVTRPKVNMALKVVRKRR 329 Query: 245 ----LGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPL 298 G+ + + +G ++ +R R I+ +K A + Sbjct: 330 ITAAEGDGFTVLEDARVRLASKGDSKLPIGLRRITVKRADGDTITLLTNDLKRPAVAI 387 >UniRef50_Q877V8 ISPpu8, transposase n=3 Tax=Proteobacteria RepID=Q877V8_PSEPK Length = 433 Score = 67.9 bits (164), Expect = 4e-10, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 83/261 (31%), Gaps = 24/261 (9%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEA-GM 82 + + EW+ R+R+ +++ +++ + + + +L A Sbjct: 6 LEQAIAPEWVDQVFEE-----HRQRQYSRELLFSTIIKLMSLVSLGLKPSLHAAARQLDD 60 Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK---DDWHGLQLFAIDGAQFRTP 139 ++ +A+ R A + L AQ + Q+ +DG+ Sbjct: 61 LPVSLAALYDKISRTEPALLRALVTGCAQRLAPTIHELGCSAMLPDWQVRVVDGSHLA-- 118 Query: 140 DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV-TAPYRQSETVLAHSMLA 198 +R A P V + + +++ SE V +LA Sbjct: 119 -----STEKRLGALRQERGAARPGFS-VVVYDPDLDQVIDLQPCEDAYASERVCVLPLLA 172 Query: 199 TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW--KNIASEMIELGNTASPGTIPK 256 N + + D+L+ + ++ Q + A + I + + GT+ + Sbjct: 173 EAKTNQVWIADRLYCTLPVMEACEQVKTSFVIRQQAKHPRLIQEGEWQAPMPVATGTVRE 232 Query: 257 RLEHLRGALEVVFITKRPRPS 277 + ++G +R + Sbjct: 233 QSIEVKGG----HRWRRVELT 249 >UniRef50_C0VKK7 ISCja2 transposase n=8 Tax=Acinetobacter RepID=C0VKK7_9GAMM Length = 385 Score = 67.5 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 81/274 (29%), Gaps = 29/274 (10%) Query: 22 QLFAEHLPTEWIQHC----LTLSAHATVRRRRLPGD-MVIWMVVQNEPITDVVRRLNLSA 76 +F + L I C L H + R I +++ +R + + Sbjct: 6 TVFHQLLKP--ISRCDFERLAKQHHCGQKLRSATRWDQFIAILMSQLSCRQSLRDIQSNL 63 Query: 77 DGEA------GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFA 130 + + G +ARS + + Q A+ + LF Q + + L++ Sbjct: 64 ESQQEKLYHLGAKTIARSTLARINQEQPASLYQQLFTQLLRHCENTKIAHKFRFKNPLYS 123 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +D + + S ++L +N + I +++ Sbjct: 124 LDASHIDLSLSLCEWAKVHESKAS---------IKLTVGLNHSNTIPEFVALGDGIENDM 174 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTAS 250 V + P SI +FDK + + + ++ E+ + Sbjct: 175 VQGRLL--KFPPGSIVVFDKGYVDYQWFAEMTDR--KVSFVTRLRPKTVYEV---KSKRE 227 Query: 251 PGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKI 284 L L + KR P R R ++ Sbjct: 228 VYACKGILADEYIELSSDYAKKRGAPKRLRRIEF 261 >UniRef50_B3JNI1 Putative uncharacterized protein n=3 Tax=Bacteroides coprocola DSM 17136 RepID=B3JNI1_9BACE Length = 389 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 28/278 (10%), Positives = 80/278 (28%), Gaps = 27/278 (9%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRL-------- 72 LP + + + S T + ++ ++ + +R L Sbjct: 8 FSQMTSFLPKRYFERLVEKSNDRTKSWSISFWNQLLVLIFGQLDGCNSLRELTDITIAHS 67 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 + S G + RS +++A E +R K+ +A D Sbjct: 68 SKSYHLGFGKTPITRSTLSKANMLRNYRVFESFAYHMVNLAQQKRIDKEFDLNGTFYAFD 127 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 + S + +++ +++ + I + + Sbjct: 128 STTIDLCLSLYDWARFRSTKSG---------IKVHTQLDIRTEISTSFTITDAVVHDVNA 178 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 S+ + +FD+ ++ L +N+ + +++ + E+ + Sbjct: 179 MDSI--AYEPFACYIFDRGYFDLRRLYHINE--VSSFFVIREKRRPKYEITAGEDVLEG- 233 Query: 253 TIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYP 290 +++ + F +R + P ++ P Sbjct: 234 -----TDNVLQDQTIRFTGERNCTNYPSEIRRIVYYSP 266 >UniRef50_Q737L2 IS231-related transposase n=3 Tax=Bacillus cereus group RepID=Q737L2_BACC1 Length = 167 Score = 64.8 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 7/132 (5%) Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 P E Y + + +++ +L S LN P + ++ L Sbjct: 1 MPSALENV--YPGSGGCAQTAG----IKIQLEYDLHSGEFLNFQVGPGKNNDKTFGTECL 54 Query: 198 ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKR 257 T+ + + D ++S + L ++Q+G L N+ I K+ Sbjct: 55 DTLRPEDLCIRDLGYFSLEDLDQMDQRGTYYISRLKLNTNV-YMKNSNPEYFKNSAIKKQ 113 Query: 258 LEHLRGALEVVF 269 E++ ++ + Sbjct: 114 SEYIHIDMKQIL 125 >UniRef50_A4BSI0 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BSI0_9GAMM Length = 406 Score = 63.7 bits (153), Expect = 7e-09, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 32/89 (35%), Gaps = 3/89 (3%) Query: 152 NTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ---SETVLAHSMLATIPDNSITLF 208 +P+ RLV + L S LLNA ++ +E L SM + I + Sbjct: 104 QRGQSPGLGFPIGRLVGITYLASGALLNAAIGRFQGKGGNEQTLLRSMQESFAPGDILIG 163 Query: 209 DKLFYSEDLLLTLNQKGCNRHWLLPAWKN 237 D F + + + KG + + Sbjct: 164 DAFFATYFFIAAMQAKGVDILMEQHGSRK 192 >UniRef50_D1Q0M9 ISGsu1 transpoase n=7 Tax=Bacteroidales RepID=D1Q0M9_9BACT Length = 412 Score = 63.7 bits (153), Expect = 8e-09, Method: Composition-based stats. Identities = 32/234 (13%), Positives = 75/234 (32%), Gaps = 24/234 (10%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADG-- 78 +P ++ C+ RR D + M + +R + Sbjct: 34 LSQLMSLIPDYELRKCVDKYRGDFHARRFTCRDQFLVMSYAQFTSSASLRSIEAQLTAFN 93 Query: 79 ----EAGMNLLARSAVTQARQRVG---AAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAI 131 AG+ ++ +S + ++ +F A+ + Y + + ++A Sbjct: 94 SKLYHAGLKIMPKSTLADMNEKKNWRIYQDYAMIFVDRAKALYKDNYYRLN-IDNMVYAF 152 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 D + + + + ++ L+++ + I + P + Sbjct: 153 DSSTINLCLQLCPWAKFLHDKGA---------FKMHTLVDVKNSIPNFVLLTPGNVHD-S 202 Query: 192 LAHSMLATIPDNSITLFDKLFYSED-LLLTLNQKGCNRHWLLPAWKNIASEMIE 244 A ML I + L DK + D L L Q+ + +++ A N+ + E Sbjct: 203 QAMDMLP-IETGAYYLMDKGYVDFDRLFRILQQQ--HAYFVTRAKDNMKYNVFE 253 >UniRef50_B3PC11 ISCja2, transposase n=5 Tax=Proteobacteria RepID=B3PC11_CELJU Length = 383 Score = 63.3 bits (152), Expect = 9e-09, Method: Composition-based stats. Identities = 39/276 (14%), Positives = 79/276 (28%), Gaps = 22/276 (7%) Query: 19 PSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADG 78 + + L + R D + M + +R + + + Sbjct: 6 TAFHQLLKPLSRHEFEAEAKKHHVGQKLRSATRWDQFVGMAMSQLSGRQSLRDIQSNLEA 65 Query: 79 EA------GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAID 132 + G +ARS + + + A + +F + + + L+++D Sbjct: 66 QQHKLYHLGAKPIARSTLARINEVQPAELYKHVFARLLHRCKSMQGKHKFQFKNPLYSLD 125 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 + + A N ++L +N G+ + + ++++ + Sbjct: 126 ASAIDLSLS-----VFPWAAHRDDTAN----VKLSVGLNHGTQVPEFVALSDGQENDMIE 176 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 P SI FDK + L KG L A E ++ S G Sbjct: 177 GRKF--DFPKGSIVAFDKGYVDYRWFKLLTDKGVFFVTRLRAKAVYRVEERRYADS-SKG 233 Query: 253 TIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTR 288 I ++ L KR P R T Sbjct: 234 IISDQVIQ----LSSAHAIKRGAPKLRRIGYRDATT 265 >UniRef50_B2Q345 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q345_PROST Length = 130 Score = 63.3 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 3/76 (3%) Query: 65 ITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWH 124 + +V L++ +A SAV QARQR+G V +F +T+Q + W+ Sbjct: 1 MAQLVFHLDIVLPSNRPY--VAPSAVVQARQRLGEDAVRKVFEKTSQLWLDKL-PLSHWN 57 Query: 125 GLQLFAIDGAQFRTPD 140 GL L A+DG +R PD Sbjct: 58 GLTLMAVDGTLWRIPD 73 >UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC 25996 RepID=UPI0001BC4BB6 Length = 403 Score = 63.3 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 33/288 (11%), Positives = 80/288 (27%), Gaps = 36/288 (12%) Query: 20 SAQLFAEHLPTEW---IQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSA 76 S F + + Q + + ++I MV + + +R L S Sbjct: 3 SISRFQQIIKPIMHGRFQKHVQQHQADKYSKGFNCHSLLISMVYAHLTHCNSLRTLEQSF 62 Query: 77 DGEAGM-------NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLF 129 + + + S +++A + P + R+ L+ Sbjct: 63 NAHSHHHYHLNLCRRIRHSTLSEALAKRDTRPFTDMLRELMATCSRTLRKHTQDTADLLY 122 Query: 130 AIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSE 189 +D + + +S R + +++ LMN + ++ Sbjct: 123 LLDSTPIILKGRG-----FNQWVSSNGRISG---LKVHVLMNHANGCPTVQSITEASVND 174 Query: 190 TVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTA 249 + + +FDK + + L++ G +++ N A E+IE + Sbjct: 175 ID--QRHIVQPEKGATYVFDKGYCDYNWWAELDRAGA--YFVTRLKANAAVEVIEQFS-- 228 Query: 250 SPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAP 297 + R + R+ K ++ Sbjct: 229 ------------PSETQNAHENSRNDNKNTPILTDEYIRFKHKSNSTR 264 >UniRef50_C5VJA1 Transposase domain protein n=15 Tax=Prevotella RepID=C5VJA1_9BACT Length = 405 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 34/267 (12%), Positives = 84/267 (31%), Gaps = 26/267 (9%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDM-VIWMVVQNEPITDVVRRLNLSADGEAGM 82 + L + I+ + + +RL G ++ M+ D +R + + E Sbjct: 16 LIKLLDKQQIKQISLETPRSEAYVKRLDGWTHLVIMLFGVLKHFDSLREVEIGMKAEVNK 75 Query: 83 -------NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGA------ERYLKDDWHGLQLF 129 ++ RS + A +R ++ + G+ + + W L L+ Sbjct: 76 LHHLGIDYVVRRSTLADANKRRPQEFFASVYAYLLERYGSFLSDSRPKGEQKTWEKL-LY 134 Query: 130 AIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSE 189 +D D + G S K++ + V ++ + + + Sbjct: 135 MMDSTTITLFDNI--LKGVGRHPKSGKKKGGM-KVHTVMKYHV--GVPMVVQLTSAATHD 189 Query: 190 TVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTA 249 L + +P ++ D+ + L ++G ++ KN+ + Sbjct: 190 HYLLKEV--HLPKDATLTMDRAYVDYAQFQRLTEEGV--CYVTKMKKNLTYTELSSVTYV 245 Query: 250 S-PGTIPKRLEHLRGAL-EVVFITKRP 274 S G + + + E+ +R Sbjct: 246 SPDGLVTHTDKKIVFEKGEIRHQARRV 272 >UniRef50_Q11ZL6 Transposase, IS4 family n=22 Tax=Bacteria RepID=Q11ZL6_POLSJ Length = 389 Score = 62.1 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 66/227 (29%), Gaps = 28/227 (12%) Query: 26 EHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGM--- 82 E +P + RR + M + +R + ++ AG Sbjct: 13 EFVPWTSFSRIVQRHGGDAGVRRMNCAEQFRVMAFAQLTWRESLRDIEVTLGANAGKLYS 72 Query: 83 ----NLLARSAVTQARQ----RVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGA 134 + + RS + A R+ + L R+ + E D + ++A+D Sbjct: 73 MGLRHSVHRSTLADANDSRDWRIWSDLAALLIRRARKLYREEDLGLDLTN--TVYALDAT 130 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 + S + +++ L++L I + + + Sbjct: 131 TIDLCLSLFDWAPFRSTKAA---------VKMHTLLDLRGSIPAFIHISDGKMGDVN--- 178 Query: 195 SMLATIP--DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA 239 +L +P + + D+ + L ++Q G N Sbjct: 179 -VLDFLPVEAGAFYVMDRGYLDFARLYKMHQAGAFFVTRAKRGMNAR 224 >UniRef50_B8FXQ3 Transposase IS4 family protein n=8 Tax=Desulfitobacterium hafniense RepID=B8FXQ3_DESHD Length = 414 Score = 62.1 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 31/279 (11%), Positives = 91/279 (32%), Gaps = 27/279 (9%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD--VVRRLNLSADG 78 Q+F + + + R +L + + +++ + + + +R+++ + Sbjct: 12 TQVFQPFFSKDLWKKIDQEVPNLDQRNYKLKTNQLT-LLISHAQLQEYKALRKISSNVQS 70 Query: 79 EA-----GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGL-QLFAID 132 G+ ++ S +++ + + E LF+ ++ L +L+ ID Sbjct: 71 NDFSEAIGLESISHSQISRRLRTLPIKVSEMLFKGVLNKVAQKKGDGKIQQRLGKLYMID 130 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 + + K +RL + + I + P + ++ Sbjct: 131 ASVISLCLSRFPWAVFRKIKAGVKMH-----LRLS--FDEMA-IPDEVIITPAKTADRKK 182 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE-------MIEL 245 ++ + +++T+FD+ + L +K ++ N E + E Sbjct: 183 LDELI-VVDKDALTIFDRGYIDYLLFDEYCEKEIR--FVTRLKNNAVIEFTGVERPVEEE 239 Query: 246 GNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKI 284 G+ I + + +T + P ++ Sbjct: 240 GSIEEDVDIILGTGTRKMKHTLREVTIDDNVNEPFTILT 278 >UniRef50_A4J2U7 Transposase, IS4 family protein n=3 Tax=Desulfotomaculum reducens MI-1 RepID=A4J2U7_DESRM Length = 413 Score = 62.1 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 33/266 (12%), Positives = 84/266 (31%), Gaps = 20/266 (7%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMV-VQNEPITDVVRRLNLSA---- 76 QLF +++ + ++L +I M+ +R ++ S Sbjct: 12 QLFQTIYNEKFLSNVKESE--VDAYAKKLTVIKLIQMISYAQLEQLKGLRHISNSLNDDN 69 Query: 77 -DGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGL-QLFAIDGA 134 G++ ++ S +++ + + + LF G E K L +++ ID + Sbjct: 70 FSSAVGLDSISASQLSRKLRDLSPELTQSLFSDIVHQFGTEIGFKSIRQELGRIYLIDSS 129 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 + + K ++ L + A+ P + ++ Sbjct: 130 TISLCLSRYRWAEFRKTKSGVKLHLRIQLLEQGVLPD-------KAIIKPAKSADKTQMD 182 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTI 254 +++ + +++ +FD+ + + G ++ N E +E T I Sbjct: 183 ALV--VEKDALNVFDRGYLDYKRFDNYSNNGTR--FVSRLKSNAIVETLEEFPTNQDSLI 238 Query: 255 PKRLEHLRGALEVVFITKRPRPSRPR 280 K + + G + R Sbjct: 239 KKDHKVILGKDGTTKMQNPLRLIETE 264 >UniRef50_C9C7H0 Transposase n=5 Tax=Enterococcus faecium RepID=C9C7H0_ENTFC Length = 373 Score = 61.4 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 24/229 (10%), Positives = 67/229 (29%), Gaps = 31/229 (13%) Query: 43 ATVRRRRLPGDMVIWMVV-----QNEPITDVVRR-LNLSADGEAGMNLLARSAVTQARQR 96 ++L + +++ + ++ R L+ E G++ L S++++ Sbjct: 33 FDFYSKKLDFQTTLKVLLHAVYEELPSYREIDRAFLDQRLCKELGIDSLCYSSLSRRAPE 92 Query: 97 VGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTK 156 + + +F Q A++ L ID + ++ A Sbjct: 93 IKQEVLMEIFTQLVARISAQQPSSKTT---SLQLIDSTTIPL-----NKAWFPWAKFRKT 144 Query: 157 RQNAYPVMRLVALM-NLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSE 215 + + L + + + + + ++ + D+ ++ Sbjct: 145 KSGI--KLHLNLCYLDKTNQYPESFTMTNASEHDRNHLEVLVDKTQA--TYVVDRGYFDY 200 Query: 216 DLLLTLNQKGCNRHWLLPAWKNIASEMI----------ELGNTASPGTI 254 LL LN+ G ++ N ++ G S + Sbjct: 201 KLLDKLNRDG--YFFVTRTKSNTKITILDQIEVADTTTRDGTIISDQQV 247 >UniRef50_A1SV49 ISSod7, transposase n=1 Tax=Psychromonas ingrahamii 37 RepID=A1SV49_PSYIN Length = 95 Score = 61.0 bits (146), Expect = 4e-08, Method: Composition-based stats. Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 8/86 (9%) Query: 52 GDMVIWMVVQNE-----PITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLF 106 + ++W VV + +V +L++ G +A SAVTQAR+++G +E + Sbjct: 1 MEKMVWAVVGMALFRKYSMRQLVNQLDIILPN--GEPYVASSAVTQARKKLGYQAIESIS 58 Query: 107 RQTAQDRGAERYLKDDWHGLQLFAID 132 QT + W GL L D Sbjct: 59 NQTQSLWHEKS-EHPMWCGLSLLGGD 83 >UniRef50_Q73GX2 Conserved domain protein n=5 Tax=Wolbachia RepID=Q73GX2_WOLPM Length = 143 Score = 61.0 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 51/128 (39%), Gaps = 3/128 (2%) Query: 31 EWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAV 90 E+I+ S+ +R+R+L + ++++ + + LN + SA Sbjct: 5 EFIES-HKSSSKDFMRKRKLSFIDIFILILRK-SVKSLQVILNEFILYRKKDYTVTASAF 62 Query: 91 TQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGS 150 +QAR+++ + + + G ++ A+D ++ P E++ +GS Sbjct: 63 SQARKKMKHSAFSEINEGVVS-LYYQDQKFKTCFGFRVLALDASKIILPTSVEIKNEFGS 121 Query: 151 ANTSTKRQ 158 ++ Sbjct: 122 RKIRNQKP 129 >UniRef50_C6JAL6 Transposase (Fragment) n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JAL6_9FIRM Length = 237 Score = 60.6 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 57/161 (35%), Gaps = 16/161 (9%) Query: 155 TKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD-----NSITLFD 209 Y + + ++ +L+A + SE A L + D NSI +FD Sbjct: 5 PDPNRRYTMGLASIIYDVLDDYILHASIHKFLSSERAAALEHLKVLEDMGLYNNSIIIFD 64 Query: 210 KLFYSEDLLLTLNQKGCNRHWLLPAWKNIASE-------MIELGNTASPGTIPKRLEHLR 262 + +YSED+ + G L N++ + +++ + +P R+ + Sbjct: 65 RGYYSEDMFRYCVEHGHLCVMRLKEGINLSKKCNGDMISILQGTSKEGTSDVPIRVLEIP 124 Query: 263 GALEVV--FITKRPRPSRPRSVKISK--TRYPVKHSAAPLK 299 T P+ + + R+PV+ LK Sbjct: 125 LDDGTKEYLATNLFDPAVTKDMFRELYFYRWPVELKYKELK 165 >UniRef50_A5II18 Transposase, IS4 n=1 Tax=Legionella pneumophila str. Corby RepID=A5II18_LEGPC Length = 379 Score = 60.6 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 35/268 (13%), Positives = 83/268 (30%), Gaps = 23/268 (8%) Query: 22 QLFAEHLPT---EWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSA-- 76 +F E + + ++ C+T+ + + + M+ + +R L + Sbjct: 4 TVFQEIIKPITTDLLKECVTIFKSDYDYEKFKTYEHLQSMLYVHLNQISSLRTLETAINS 63 Query: 77 DGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQF 136 + RS ++ A +R A W+ Q + ++ + + +D + Sbjct: 64 QDLGLSAKICRSTLSDANRRRKADCFLWILEQLLEMLPKKQKKE---FSKIVRVLDSSPI 120 Query: 137 RTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSM 196 + + + ++L +LG + +++ + Sbjct: 121 QLKGYGYEWAKHNATRRCEG-------LKLHVEYDLGLESPTRVALSFPNFNDSSMGKQ- 172 Query: 197 LATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI----ASEMIELGNTASPG 252 I + I +FDK + D +++QK L I E E G Sbjct: 173 WP-IETDIIYVFDKGYCDYDWWWSIHQKKAFFVSRLKVNAAISIEQKFETNENSPILEDG 231 Query: 253 TIPKRLEHLRGALEVVF--ITKRPRPSR 278 RG + ++ + +R R Sbjct: 232 LFRFSNPKPRGGKKNLYTSLARRISVQR 259 >UniRef50_A1APW2 Transposase, IS4 family n=6 Tax=Deltaproteobacteria RepID=A1APW2_PELPD Length = 391 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 55/183 (30%), Gaps = 17/183 (9%) Query: 54 MVIWMVVQNEPITDVVRRLNLSADGEAGMNLLA---RSAVTQARQRVGAAPVEWLFRQTA 110 ++ + + + +++++ L + + + +SA +A G + +F Sbjct: 52 LIFYHLEEFSSGSELLQALEQNDFAKECVAPPKGIKKSAFFEAINNRGLEQLSEVFGHLV 111 Query: 111 QDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALM 170 + G + G L +IDG+ E Y S + K + Sbjct: 112 KQAGKVLPAEYAHLG-NLVSIDGSLIDAVLSME-WADYRSGSKKAKAHVGF--------- 160 Query: 171 NLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHW 230 ++ I + ++ E ++ + D+ + S D Sbjct: 161 DINRGIPRKIYLSDGKEGERPFVDKIIDK---GETGVMDRGYQSHDHFDKWQAAEKFFVC 217 Query: 231 LLP 233 + Sbjct: 218 RIR 220 >UniRef50_UPI0001C4271A transposase, IS4 family protein n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C4271A Length = 399 Score = 59.4 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 20/219 (9%), Positives = 66/219 (30%), Gaps = 17/219 (7%) Query: 32 WIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMN------LL 85 + + + ++ + +++ T + +++ + + + Sbjct: 4 LFSNLINVIDLDKYVKKLTAYKFLQLLIISQLKETKSLTQMSKKLKDKEELQVQLAFDTI 63 Query: 86 ARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGL-QLFAIDGAQFRTPDKPEL 144 + S +++ + E +F + A+ + +L ID Sbjct: 64 STSQLSRKLGDLSPTLFEKIFHYLVLNIQAKMKQSPIIREIGRLHVIDSTTMSMSVSQYP 123 Query: 145 REYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNS 204 + + +R+V L + + P + ++ ++ ++ Sbjct: 124 WATFRKTKAGIRLH-----LRVVVTKELT--LPDKGILLPAKHADRTQMGDLIEM-DSDA 175 Query: 205 ITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 I LFD+ + L + ++ KN E++ Sbjct: 176 IHLFDRGYIDYKQFDHLC--LHDVRFITRLKKNAQVEVL 212 >UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TN04_ALKMQ Length = 454 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 29/270 (10%), Positives = 84/270 (31%), Gaps = 21/270 (7%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMV-----VQNEPITDVVRRLNLSADG 78 F I + ++ LP ++ + N + + + G Sbjct: 11 FIRLFDNNKIMEIAIGTGLLKRQKGMLPDTILKVFTFGLLNIANPSLNQIASKCQAFQPG 70 Query: 79 EAGMNLLARSAVTQARQRVGA---APVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQ 135 +++ AV + ++ + + +++ + + D + Sbjct: 71 LT----ISKEAVYKRLKKSSLFLQETFKHMMQKSMNSVIPVKTAAILEQFKDVKICDSTK 126 Query: 136 FRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHS 195 PD +L Y + + +++ + +L + ++T Sbjct: 127 ITLPD--KLVALYPGLGGRNAKSS----LKVQGIYSLIPARFSSLEITKAPGADTTYNDK 180 Query: 196 MLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTAS-PGTI 254 +LA + + + D ++S+ L+ KG ++L KN + + G T Sbjct: 181 LLAMVNPGELLITDLGYFSKAFFEKLSTKGS--YYLTRIKKNSIVYVEKSGQLTKVDLTD 238 Query: 255 PKRLEHLRGALEVVFITKRPRPSRPRSVKI 284 + + + + K+ R ++++ Sbjct: 239 LLKGTVVDTEVFLGIAHKKQLKCRFVAIRL 268 >UniRef50_B9YUA6 Transposase, IS4 family protein n=3 Tax='Nostoc azollae' 0708 RepID=B9YUA6_ANAAZ Length = 256 Score = 59.1 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 14/74 (18%), Positives = 37/74 (50%), Gaps = 5/74 (6%) Query: 180 AVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA 239 ++ +PY E A +L ++ + + ++D+ +S ++ ++ C+ +PA N+ Sbjct: 3 SLLSPYGIGERKRAIQILPSVGEGMLLMWDRGLHSFKMVHAAIKQKCHILGRVPA--NVK 60 Query: 240 SEMIEL---GNTAS 250 E+++ G+ S Sbjct: 61 FELVKTLGNGSYLS 74 >UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepID=Q74P20_BACC1 Length = 460 Score = 59.1 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 61/201 (30%), Gaps = 10/201 (4%) Query: 35 HCLTLSAHATVRRRRLPGDMVIWM--VVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQ 92 L + R+R+ ++ + + T+ + L G+ L + + + Sbjct: 33 QLLAVKTGMIRRKRKCRAQDLVSLCVFLSQAIGTESLVSLCAKLTRATGIQL-SSQGLNE 91 Query: 93 ARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSAN 152 ++ LF Q + + + + ++ +D F+ P Y S+ Sbjct: 92 RFNAQTVQFLKELFLQVFRKKFSPMTPLSN-RFTRIRILDSTAFQLP------AQYASSY 144 Query: 153 TSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLF 212 + +++ L S L S+ T+ ++L D + Sbjct: 145 KGVGGGGSEAGVKIQLEYELISGEFLETAVRDGTSSDCRYGQERTQTLEPGELSLRDLGY 204 Query: 213 YSEDLLLTLNQKGCNRHWLLP 233 +S L + + + Sbjct: 205 FSIYDLEKIADRKAFYVSRIR 225 >UniRef50_UPI00003C8608 transposase IS4 family protein n=4 Tax=Ferroplasma acidarmanus fer1 RepID=UPI00003C8608 Length = 349 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 70/199 (35%), Gaps = 20/199 (10%) Query: 85 LARSAVTQARQRVGAAPVEWLFRQTAQDR--GAERYLKDDWHG--LQLFAIDGAQFRTPD 140 +++S +++ + + E +F + + D+ + AID T Sbjct: 62 ISKSQLSKLNNKRPYSIFEKVFYSILRPFIKAHRYDIYHDYIDRLYSVLAIDSTFIET-- 119 Query: 141 KPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATI 200 + Y + + A+ + + L A+ P ++ + +L I Sbjct: 120 MVKGSGIYQRGERRNGIK-----IHTAAIASPYP-LPLKAIITPANVHDSKVFDDLLEYI 173 Query: 201 PD----NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPK 256 + N++ FD +Y+ + L +KG N ++ KN +I+ S + Sbjct: 174 NEYISGNTVLTFDLGYYNLGRFMELKEKGIN--FVSRIKKNADYTVIKEETFNSK-IVRF 230 Query: 257 RLEHLRGALEVVFITKRPR 275 R L L + I R R Sbjct: 231 R-NGLELRLVSLDINNRKR 248 >UniRef50_C6DY52 Transposase IS4 family protein n=1 Tax=Geobacter sp. M21 RepID=C6DY52_GEOSM Length = 394 Score = 58.3 bits (139), Expect = 3e-07, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 63/236 (26%), Gaps = 22/236 (9%) Query: 52 GDMVIWMVVQN-EPITDVVRRLNLSADGEAGMNLLA---RSAVTQARQRVGAAPVEWLFR 107 +I+ + ++++ L + + +SA +A G + LF+ Sbjct: 53 LKALIYFHLHEFSSGRELLQALEQDDFAKECVAPPKGIKKSAFFEAVNNRGLEQLAELFK 112 Query: 108 QTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLV 167 +D + G L AIDG+ Y S + K A+ Sbjct: 113 LLLKDAKNVIPAEFADIG-NLVAIDGSYIDAVMSM-DWADYSSTHNKAKAHVAF------ 164 Query: 168 ALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCN 227 ++ I + + Q+E M+ + + D+ + + Sbjct: 165 ---DINRGIPKDLILTDGNQTERQFVERMIG---PDETAVLDRGYQCNANFDQWQENEKK 218 Query: 228 RHWLLPAWKNIASE----MIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRP 279 + A N + + R EV + R Sbjct: 219 FICRIQARSNKKVIRENPIARGSIIFYDAVVLLGAPSTRAKKEVRVVAYRVEGKDF 274 >UniRef50_C0R4I1 Putative uncharacterized protein n=5 Tax=Wolbachia RepID=C0R4I1_WOLWR Length = 94 Score = 57.1 bits (136), Expect = 7e-07, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 8/85 (9%) Query: 123 WHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVT 182 W G +L A DG+ R P E+ + T+ N + ++L + ++ +A Sbjct: 5 WRGYRLIAADGSGMRLPSSGEIVSEFEPNGTTGTIGNLF--------VDLCTSLICSARL 56 Query: 183 APYRQSETVLAHSMLATIPDNSITL 207 A + E LA L + L Sbjct: 57 AAWNIGEQTLAAEQLPEVITQMRLL 81 >UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geobacillus kaustophilus RepID=Q5L3A2_GEOKA Length = 453 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 91/274 (33%), Gaps = 28/274 (10%) Query: 24 FAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWM--VVQNEPITDVVRRLNLSADGEAG 81 L E ++H R+ +L + + +Q + +L + + Sbjct: 18 LRSVLSCEELEHMARDHQFIQ-RKGKLRAHDFVALCTFLQEGGGQKSLVQLCSALALKQN 76 Query: 82 MNLLARSAVTQARQRVGAAPVEWLFRQT--AQDRGAERYLKDDWHGLQLFAIDGAQFRTP 139 L+ + Q + ++ +F + Q + A R L++ +D F+ P Sbjct: 77 -TSLSAEGLNQRFHEKAVSFLKAVFEKLLIHQTQEARRLCPRHSLFLRIRILDSTSFQLP 135 Query: 140 DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 PE++ Y P +++ +L+ R + S+L+T Sbjct: 136 --PEIQGIY--------EGCTGPGVKIQLEYEWLEGKVLHVDVEDARHHDAAYGASLLST 185 Query: 200 IPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLP------AWKNIASEMIELGNTASPGT 253 I + + L D ++S + L ++ G L + E + + Sbjct: 186 IQEGDLCLKDLGYFSLEGLQAIHDAGAFYISRLKHNVGIYQKEGDRFRKWEPEDFLAVLQ 245 Query: 254 IPKRLEHLRGALEVVFITKRPRPSRPRSVKISKT 287 + + LE +++ + + +PR + T Sbjct: 246 PGETM-----ELEHAYVSGK-KVHQPRLIVYRLT 273 >UniRef50_UPI00016C560B transposase IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C560B Length = 280 Score = 56.4 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 35/76 (46%), Gaps = 5/76 (6%) Query: 176 ILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 +L +T P +SE +A +L + ++ + L+D+ F S DL+ + Q+ + + Sbjct: 1 MLWRTLTKPCHRSEVTMAPYLLRCLQNDMLLLWDRGFLSYDLVQQVRQRCAHLLARI--K 58 Query: 236 KNIASEM---IELGNT 248 N+ + G+ Sbjct: 59 SNLVFRPLHRLPDGSY 74 >UniRef50_Q3M8C5 Transposase, IS4 n=15 Tax=Cyanobacteria RepID=Q3M8C5_ANAVT Length = 340 Score = 56.4 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 16/165 (9%) Query: 88 SAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREY 147 S ++A P + ++++ + + + + ID +L Sbjct: 63 STFSKANLHRSQKPFQEIYQKLNK-LVQNKAENKLHNKYAICPIDSTVITL--TSKLLWV 119 Query: 148 YGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITL 207 G ++L + +NL + + + + M+A +P N++ + Sbjct: 120 LGHH-----------QVKLFSSLNLATGSPEDNLINFGHDHDYKFGSKMIANLPTNAVGV 168 Query: 208 FDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 D+ F + L Q N++++L N E E G Sbjct: 169 MDRGFAGLKFIQELVQ--ENKYFVLRIKNNWKLEFEESSGLIKVG 211 >UniRef50_C3AUM2 Transposase for insertion sequence element IS231B n=3 Tax=Bacillus RepID=C3AUM2_BACMY Length = 192 Score = 56.0 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 9/142 (6%) Query: 142 PELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP 201 +L Y S+ + +++ +L S LN P ++ L T+ Sbjct: 44 RDLAPIYPSSGGCAQTAG----IKIQLEYDLHSGKFLNFQMEPGENNDKTFGTDCLDTLC 99 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA-SEMIELGNTASPGTIPKRLEH 260 + + D + L + K +++ N + + G I K E+ Sbjct: 100 PGDLCIRDLGCFHLKDLQHIQDKMA--YYISGIKSNTRIYQKNPNPDYFQDGRIKKGTEY 157 Query: 261 LRGALEVVFITKRPRPSRPRSV 282 ++ +EV + +P + + Sbjct: 158 IQIDMEV--LMNSLQPGQTCEI 177 >UniRef50_UPI000190F8A2 hypothetical protein SentesTyp_33971 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190F8A2 Length = 85 Score = 54.8 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 41/46 (89%), Positives = 42/46 (91%) Query: 1 MPLLNDLLDFSDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVR 46 M LLNDLLDFSDHPLMPPPSAQ+FAEHLP E IQHCLTLS HATVR Sbjct: 1 MSLLNDLLDFSDHPLMPPPSAQMFAEHLPAECIQHCLTLSKHATVR 46 >UniRef50_Q2JF90 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JF90_FRASC Length = 87 Score = 54.4 bits (129), Expect = 4e-06, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 19/57 (33%) Query: 11 SDHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITD 67 L S + + + + L + R R+LP ++++ + D Sbjct: 11 PAGRLTDHISLGVLTGLVHHDLVDDVLVETGRVEKRSRKLPARVMVYFTLAMWLFFD 67 >UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TD95_HELMI Length = 441 Score = 54.4 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 43/267 (16%), Positives = 90/267 (33%), Gaps = 26/267 (9%) Query: 15 LMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQ--NEPITDVVRRL 72 + P + ++ P EW+ + R R++ + +W +V + + L Sbjct: 6 IEPGLIEEALSKLFPKEWVSEVAAETGFVK-RERKISPVVFLWALVLGFGVGVQRTLGDL 64 Query: 73 NLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQ----- 127 S +AG +++ SA R VE+L R + G + Sbjct: 65 RRSYMEQAGHSVV-PSAFYD---RFTPELVEFLKRCVEKAIGHLVVEPGQVMSERLKDIL 120 Query: 128 -LFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR 186 + ID + R D +L + + T+ ++ L+++ Sbjct: 121 DIAVIDSSLVRLHD--QLAKKWPGPRTNHSPAA----AKVNMLVSVFGATRSQVQIVEGT 174 Query: 187 QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELG 246 + E+ L + + + I LFD ++S + +++ N ++ Sbjct: 175 RGESKLLS--IGSWVKDRILLFDLGYFSFKHFGKIM--NEKGYFVSRLKSNSNPLILRSL 230 Query: 247 NTASPGTIP---KRLEHLRGALEVVFI 270 TI KRL ++G+L I Sbjct: 231 IQHRGRTIAVEGKRLLDIKGSLRREII 257 >UniRef50_Q3M9Z5 Transposase, IS4 n=10 Tax=Cyanobacteria RepID=Q3M9Z5_ANAVT Length = 439 Score = 54.0 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 37/252 (14%), Positives = 85/252 (33%), Gaps = 30/252 (11%) Query: 46 RRRRLPGDMVIWMVVQN-----EPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAA 100 R R L +++ V+ +T++ R L +++ A++Q A Sbjct: 32 RDRILNLPLMVAAVLTLLWRDVAGVTELTRMLAREGFLWCRPLEVSQQAISQRFLTFPAQ 91 Query: 101 PVEWLFR-----------QTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYG 149 E +F+ + Q + +++ +D + + Sbjct: 92 LFEKVFKDLLPHLQASWQRRNQRKIPPSVQFTLTKFEKIWIVDCSIL--------EALFQ 143 Query: 150 SANTSTKRQNAYPVMRLVALMNLGSHILLNAVT-APYRQSETVLAHSMLATIPDNSITLF 208 ++ ++ ++NL + + + R ++T +L + +++ L Sbjct: 144 KLDSLKDAPQGQLAGKIGTVINLVNLLPVEIWFCENPRTADTKFEADILNLVTPHTLLLL 203 Query: 209 DKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHL-RGALEV 267 D+ FY + L L + N ++ K A + ++ + RL L G + Sbjct: 204 DRGFYHFNFWLQLIAQNVN--FITRLKKGAAIHVQQV--FTDSFALRDRLVRLGSGTKKT 259 Query: 268 VFITKRPRPSRP 279 FIT R R Sbjct: 260 TFITLRLVEIRS 271 >UniRef50_C6J7R2 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J7R2_9BACL Length = 399 Score = 54.0 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 23/252 (9%), Positives = 77/252 (30%), Gaps = 19/252 (7%) Query: 46 RRRRLPGDMVIWMVVQNEPITDVVRRLNLSADG-------EAGMNLLARSAVTQARQRVG 98 R+ G ++ + + ++ + G+ ++ S +++ +++ Sbjct: 31 ARKLFVGSSLLLFIEAQLQQRESYAEMSEHLEANEDFQAILGGLESISPSQLSRKMKKLP 90 Query: 99 AAPVEWLFRQTAQDR--GAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTK 156 + LF Q + E +L +D Q P Y ++N K Sbjct: 91 LENLHLLFMQVTRQIQQLTENKPGITTKIGKLAIMDSTQITLPAILSKWAYCSASNHGVK 150 Query: 157 RQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSED 216 + +++ + + + + ++ +A + T+ + D+ + Sbjct: 151 MHTSL------LVVDAKTMVPDKIIASTKDVADHEVAPNF--TVDKEVTYVMDRGYQVHK 202 Query: 217 LLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRP 276 +G ++ N +++ G + + ++ + Sbjct: 203 HFQAWVDQGMK--FVARVKDNTRLTILKERALPKRGDFIRDADVTLPGQQMKLRLIEFQD 260 Query: 277 SRPRSVKISKTR 288 + R ++ +R Sbjct: 261 QQGRLYRLVTSR 272 >UniRef50_Q55566 Putative transposase for insertion sequence element IS4SA n=10 Tax=Synechocystis sp. PCC 6803 RepID=T4SA_SYNY3 Length = 338 Score = 54.0 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 60/188 (31%), Gaps = 22/188 (11%) Query: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 + + +RLNL + S ++A ++ + ++ + Sbjct: 41 SQTSMRSMFKRLNLRG------ETVDISTFSKASKKRDVGVFREIIFSLKKELSKR--KE 92 Query: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 L++F +D + + +++ + +NL + I Sbjct: 93 IKQGELEIFPLDSTIVSIT-------------SKLMWNLGFHQVKVFSGINLSTGIPGGI 139 Query: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 V + + + + P+N + + D+ F + L ++ H +L NI Sbjct: 140 VIHFGQGHDNKYGNETIEETPENGVAVMDRGFCDLQRIKRLQKENNKYH-VLRIKNNIKL 198 Query: 241 EMIELGNT 248 E + N Sbjct: 199 EKLANDNY 206 >UniRef50_A3YGY3 Transposase and inactivated derivative n=1 Tax=Marinomonas sp. MED121 RepID=A3YGY3_9GAMM Length = 66 Score = 54.0 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 29/60 (48%), Positives = 38/60 (63%) Query: 165 RLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQK 224 LVALMN SHI+++ YR+ + LA S A PDNSITLFDK F+S +L L++ Sbjct: 1 MLVALMNTQSHIMMDPQIIHYRRGKIPLAPSTQAKTPDNSITLFDKGFWSTKFMLGLSRA 60 >UniRef50_C3R0J9 Transposase n=4 Tax=Bacteroidales RepID=C3R0J9_9BACE Length = 424 Score = 53.7 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 26/277 (9%), Positives = 78/277 (28%), Gaps = 29/277 (10%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEA 80 + +P + + ++ ++ +R + L + Sbjct: 8 FAQVIDFIPRYQFDKLVKKYKGDWHAKDLSCYSQLLHLLFGQITGCVSIRDICLCLEAHG 67 Query: 81 G-------MNLLARSAVTQARQRVGAAPVEWL---FRQTAQDRGAERYLKDDWHGLQLFA 130 + +S + +A ++ E L + + + + L+A Sbjct: 68 SSIYHLGIRKSVNQSNLCRANEKRDYRIYEGLGMYLISIVRPMYSNTKVTEITIDNVLYA 127 Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +D + + S +++ L++L I N + ++ Sbjct: 128 LDSTTIS---TSIVLAAWALGKYSKGA------VKMHTLLDLRGSIPANIHITDGKWHDS 178 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTAS 250 ++ + + DK + L ++ G +W+ N+ E++ Sbjct: 179 NELDEIVP--EAFAFYMMDKAYVDFIALFRFHKAGA--YWISRPKDNMRYEVVNHRLDFD 234 Query: 251 PGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKT 287 P + G + T + + P +++ Sbjct: 235 P------STGICGDFIIKLTTHKSKKLYPEPIRMVTY 265 >UniRef50_Q4C0I4 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=Q4C0I4_CROWT Length = 211 Score = 53.7 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 66/188 (35%), Gaps = 23/188 (12%) Query: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 + D+ +RLN S ++A +R + + Q ++ ++ + Sbjct: 42 SIVSMQDLFKRLNTQGIDLK------ISNFSKASKRRDSQVFLNIINQLKKELRRQKGKR 95 Query: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 F ID L Y ++L ++ + Sbjct: 96 ---KARSYFPIDSTVISLT-SKLLWS------------QGYHQVKLFCGLDSWTSEPGGI 139 Query: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIAS 240 V + + + +IP+ ++ + D+ F S + + L +K N+ ++L N+ Sbjct: 140 VIHFGQGHDHKYGQKTVESIPEKTVGIMDRGFASSERIKELKEK-QNKAFVLRIKNNVTL 198 Query: 241 EMIELGNT 248 EM++ GN+ Sbjct: 199 EMLDDGNS 206 >UniRef50_A3EIG1 FOG: Transposase and inactivated derivatives n=3 Tax=Vibrio cholerae V51 RepID=A3EIG1_VIBCH Length = 264 Score = 52.9 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 23/95 (24%), Positives = 35/95 (36%), Gaps = 9/95 (9%) Query: 204 SITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPG-------TIPK 256 ++TLFDK FY+ LL +G RHWL+P K + + K Sbjct: 26 ALTLFDKGFYALGLLHRWQSQGKERHWLIPLRKGAQYKTLRKLGRGDGLIELSLTAQAKK 85 Query: 257 RLEHLRGALEVVFITKRPRPSRPRSVK--ISKTRY 289 + LE IT + + + + RY Sbjct: 86 KWADAPDTLEARLITTKVKGKEVQLLTSMTDPKRY 120 >UniRef50_A1BCF6 Transposase, IS4 family protein n=1 Tax=Chlorobium phaeobacteroides DSM 266 RepID=A1BCF6_CHLPD Length = 252 Score = 52.9 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 56/168 (33%), Gaps = 21/168 (12%) Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 +D + + + ++L L + S + V ++S+ Sbjct: 1 MDATVIDLCLRVFPWAEFRQRKGA---------IKLHYLYDHRSSLPAFMVMTDGKKSDI 51 Query: 191 VLAH---SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGN 247 +A + + +SI FD+ + + L TL+Q+ ++ + NI +I Sbjct: 52 RVARSQEKLDFHLLPDSIVSFDRAYIDFEWLYTLDQR--KVWFVTRSKANIQYRII---- 105 Query: 248 TASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKTRYPVKHSA 295 P + + + + I ++ R + +++ A Sbjct: 106 ---GQHQPIKNKQVTRDERIELIIEKSRAKYLKPLRLVCYTDQETGKA 150 >UniRef50_C5EN32 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EN32_9FIRM Length = 127 Score = 52.9 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 36/93 (38%), Gaps = 5/93 (5%) Query: 163 VMRLVALMNLGSHILLNAVTAPYR-QSETVLAHSMLATIPDN--SITLFDKLFYSEDLLL 219 + R ++L +L S L+AV P R ++E ++ I + D+ F S ++ Sbjct: 8 LARTISLYDLLSKRYLDAVIQPGRLKNEFAALCQLIDRYLYGYFPIFVADRSFASYNVFA 67 Query: 220 TLNQKGCNRHWLLPAWKNIASEMIELGNTASPG 252 +KG + + A ++ + Sbjct: 68 NAFEKGG--FFAIRAKDVNIKRLLAADSLPDRL 98 >UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FU81_METHJ Length = 452 Score = 52.5 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 28/242 (11%), Positives = 82/242 (33%), Gaps = 25/242 (10%) Query: 23 LFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEP--ITDVVRRLNLSADGEA 80 +F + ++I+ + R+R+L ++I+ ++ + + ++ Sbjct: 16 VFQKF-TFDFIEKKARETG-FMQRKRKLDPVLLIFSLIFGVSSHLKPTLEEIHRHYVDLD 73 Query: 81 GMNLLARSAVTQA-RQRVGA---APVEWLFRQTAQDRGAERYLKDDWHG-----LQLFAI 131 + S + Q+ R+R ++ L + G + Sbjct: 74 DNPKIETSILNQSFRKRFNYKLVDFLKSLMDHYIDQIVHQSPAH--LKGIVEDFKDILVQ 131 Query: 132 DGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETV 191 D + R +L + + +A + +++ A+ ++ H + NA+ R + Sbjct: 132 DSSIIRI--SKKLYDLHPAARSRDDSAG----LKIHAVYSVVYHSVKNAIITTERVHDYK 185 Query: 192 LAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASP 251 + + +N + + D +YS + + G + N +++ + + Sbjct: 186 MLK--IGPDVENILLINDLGYYSLKTFSKIQEYGG--FFASRVKSNAVFKVVSINSGPPE 241 Query: 252 GT 253 T Sbjct: 242 IT 243 >UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW Length = 417 Score = 52.5 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 71/234 (30%), Gaps = 35/234 (14%) Query: 26 EHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMV------VQNEPITDVVRRLNLSADGE 79 + E + H + R+R L + + + + + + + L L D Sbjct: 14 QLFSPETLTHLAQETGFIQ-RKRALTAEAFLTLCAWGDGSLAQQSLQRLCTSLTLRHD-- 70 Query: 80 AGMNLLARSAVTQARQRVGAAPVEWLF-----RQTAQDRGAERYLKDDWHGLQLFAIDGA 134 L+ + Q A + +F RQ + + + L++ D Sbjct: 71 ---CSLSSEGLNQRFTERAVAFLREVFFLLLQRQPPLLWSTIQTYRTCFTRLRIL--DST 125 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAH 194 F P YG R + ++ +L S L S+ A+ Sbjct: 126 SFLVP------ADYG----EDYRGSVSSGAKIQFEYDLLSGACLQLCAQSANDSDARFAY 175 Query: 195 SMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPA------WKNIASEM 242 TI N + + D F+S L ++ +G L + +N + Sbjct: 176 HAQHTILPNDLCIRDLGFFSVAALTEIDARGAYYITRLRSDMKVYIKENSQWKE 229 >UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZNH0_9PLAN Length = 451 Score = 52.1 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 60/200 (30%), Gaps = 28/200 (14%) Query: 74 LSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGL------- 126 G +A + ++A +E + + +R + G Sbjct: 98 KKIAKLTGGKKVADGSFSEASSIFDPRLLEGIIKDLRSRWHQQRMSSEPRSGRASDRTVE 157 Query: 127 QLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTA--P 184 +L A+DG+ L + G K R AL+++ + + P Sbjct: 158 RLIAVDGSVLT-----ALPQIVGRIAAKEKG-----QWRFHALVHVLDGQPVASKLTEEP 207 Query: 185 YRQS--ETVLAHSML-------ATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 + E + M+ + + L D+ + S +L ++ G + L Sbjct: 208 SAKGRAERDVLAEMIAADQIDIPQSDEGHLFLMDRGYRSAELFNKIHTAGHDYICRLNRT 267 Query: 236 KNIASEMIELGNTASPGTIP 255 + + G P +P Sbjct: 268 DGKLLKPPKKGEVREPIQLP 287 >UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3VMZ1_KLEPN Length = 421 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 65/226 (28%), Gaps = 22/226 (9%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEP-----ITDVVRRLNLSA 76 +F + + + R P + + V I D+ R N Sbjct: 11 TIFEALFSEQQLNSLGVQTHMIERFRLITPAKLCLAFVCALGSGNARTIADIHRYFNHLH 70 Query: 77 DGEAGMNLLARSAVTQARQRVG-AAPVEWLFRQ-TAQDRGAERYLKDDWHGL--QLFAID 132 + ++G + +F Q A A D + G Q+ D Sbjct: 71 SMSVRLKP-----FHNQLVKLGTPEFMRQVFEQALALHLPAMHTFSDAYRGHFKQVLLQD 125 Query: 133 GAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVL 192 G F D L ++ ++ + L +L + + SE Sbjct: 126 GTSFAVHDGLSL--HFPGRFSTHSPAA----VELHVTYDLEKAQPVRVSLSEDTASERDY 179 Query: 193 AHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNI 238 +A + + D ++S+ + +L + + +PA N Sbjct: 180 LP--VAQSLRGCLLMADAGYFSKAYIESLQNEAASFVLRMPASVNP 223 >UniRef50_C3BTW8 Transposase for insertion sequence element IS231B n=13 Tax=Bacillus RepID=C3BTW8_9BACI Length = 387 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 19/181 (10%), Positives = 52/181 (28%), Gaps = 11/181 (6%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMV-VQNEPITDVVRRLNLSADGEA 80 Q L ++ + D+V V + T + +L+ + Sbjct: 18 QELQSFLSPHILRDLARDVGFVQRTSKYQAKDLVALCVWMSQNVATTSLTQLSSCLEAST 77 Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKD---DWHGLQLFAIDGAQFR 137 + L++ + Q + ++ + + + ++ +D F+ Sbjct: 78 EV-LISPEGLNQRFNKSAVQFLQHILAELLNQKLTSSMPISSPYTSVFKRIRILDSTAFQ 136 Query: 138 TPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSML 197 PD + + +++ +L S L+ T P +Q + + Sbjct: 137 LPD------PFSFVYPGAGGCSHTAGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCV 190 Query: 198 A 198 Sbjct: 191 P 191 >UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH7_9BACT Length = 382 Score = 51.3 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 45/302 (14%), Positives = 96/302 (31%), Gaps = 34/302 (11%) Query: 12 DHPLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRL---PGDMVIWMVVQNEPITDV 68 + L Q F E L E H RR+L I + N P+ Sbjct: 9 ERNLKRWCLVQEFREKLTKELEHH--PRHPSEDDPRRKLHYLDYASAILFTLFN-PVLKS 65 Query: 69 VRRLNLS-----ADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQ--DRGAERYLKD 121 +R L + + ++ + ++A+ A ++ L ++ + + Sbjct: 66 MRGLCAASELKKVQEHVTLGKISLGSFSEAQHVFDATSLQHLVQKLSSKIPINKIQDRSL 125 Query: 122 DWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAV 181 L A+DG+ F+T + E+ + + K + +++ A+ +AV Sbjct: 126 LAAVKDLVAVDGSLFQTLTRVLWAEWLDENHKAAKLHLGFSLLKQSAV---------DAV 176 Query: 182 TAPYRQSETVLAHSMLATIPDNSITLFDKLFY-SEDLLLTLNQKGCNRHWLLPAWKNIAS 240 E M + + + D+ + L Q+G + + Sbjct: 177 ITAGNSCERKALLKM---VQPGVMYVCDRYYGLDYSYFEELQQRGA--LFTIRIRNKPKL 231 Query: 241 EMIELGNTAS----PGTIPKRLEHLRGALEVVFITKRPRPSRP--RSVKISKTRYPVKHS 294 +I+ G I +L +L + + R + + + + P K + Sbjct: 232 TVIKEYEITEKDRKEGVISDQLVYLGDTDRELKPIRLVRTGAFNDKEILLVTSEAPEKLN 291 Query: 295 AA 296 AA Sbjct: 292 AA 293 >UniRef50_Q07SJ1 Transposase, mutator type n=22 Tax=Bacteria RepID=Q07SJ1_RHOP5 Length = 616 Score = 51.0 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 35/260 (13%), Positives = 75/260 (28%), Gaps = 47/260 (18%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRR-------LN 73 L W++ + T R + ++ + +++ + V R +N Sbjct: 15 LARVDRVLDLSWLRSEVAELYCETNGRPGIDPEVAVRLMLAGFLLGIVHDRRLMREAQVN 74 Query: 74 LSADGEAGMNL----LARSAVTQARQRVGAAPVEWLFRQTAQDRGAER------------ 117 L+ G L S++T+ RQR GA +F +T + A + Sbjct: 75 LAIRWFVGYGLHEALPDHSSLTRIRQRWGAESFRRIFERTVRACVAAKIAKGEIVHVDAS 134 Query: 118 --YLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTK---------------RQNA 160 W L + +D D K + Sbjct: 135 LIRADVSWESLAVRHVDAVAEANEDVIAEERDSRKTGKHKKVCVTDPDASMATNGRNRRL 194 Query: 161 YPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNS-----ITLFDKLFYSE 215 P + A+++ ++++ +E + + + + S D + Sbjct: 195 EPAYKQHAVVDDAFGVVMDVEVTTGEVNEGQVVLARIDAAAETSGTPIQTVTADAGYAYA 254 Query: 216 DLLLTLNQKGCNRHWLLPAW 235 + L Q+G +PA Sbjct: 255 KVYGGLEQRGIQAV--IPAK 272 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 50.2 bits (118), Expect = 8e-05, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 65/258 (25%), Gaps = 19/258 (7%) Query: 14 PLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLN 73 P++ ++ + R R+ + + + R L Sbjct: 6 SPPDSVVVDRIQRAFPSDELRERARATNLVE-RERKFDIVALFYT-LSFGFAAGSDRSLQ 63 Query: 74 LSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGL-----QL 128 + M + V L + D G + Sbjct: 64 AFLERYVEMADCDDLSYAAFHDWFEPGFVALLREILDDAIENLDTGRADLSGRLERFRDV 123 Query: 129 FAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQS 188 D Y + ++L + +L + + T Sbjct: 124 LIADATIVSLYQDAADV--YAATGEDQAE------LKLHLIESLSTGLPTRFRTTDGTTH 175 Query: 189 ETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNT 248 E + +++ L D FY L ++Q G ++ N E++E T Sbjct: 176 ERSQLPT--GEWVADALILLDLGFYDFWLFDRIDQNGG--WFVSRVKDNANFEIVEELRT 231 Query: 249 ASPGTIPKRLEHLRGALE 266 +IP E L+ L+ Sbjct: 232 WRGNSIPLEGESLQAVLD 249 >UniRef50_C0QMU6 Transposase repeat family IS4 n=1 Tax=Thermosipho africanus TCF52B RepID=C0QMU6_THEAB Length = 254 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 31/84 (36%), Gaps = 4/84 (4%) Query: 165 RLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQK 224 RL L +L++ Y SET A +L + +NSI L D+ ++ L + + Sbjct: 114 RLHVLYEAKEKVLIDFKIGEY--SETEQAELLLEEV-ENSILLADRGYWVWRFLERVKDR 170 Query: 225 GCNRHWLLPAWKNIASEMIELGNT 248 + + EL Sbjct: 171 -MRLYVRPRGKEGKKFLKSELNRY 193 >UniRef50_C1DL03 Transposase inactivated derivative n=1 Tax=Azotobacter vinelandii DJ RepID=C1DL03_AZOVD Length = 54 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 16/54 (29%), Positives = 24/54 (44%), Gaps = 1/54 (1%) Query: 102 VEWLFRQTAQDRGAERYLK-DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTS 154 + LF Q A+ A + + GL++FA DG + PD E RE + Sbjct: 1 MAALFEQLARAWLAVKPPASARFRGLRIFAADGVVWSMPDTAENREAFSGGRNQ 54 >UniRef50_P55729 Putative transposase y4zB n=4 Tax=Rhizobiaceae RepID=Y4ZB_RHISN Length = 356 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 16/161 (9%), Positives = 42/161 (26%), Gaps = 17/161 (10%) Query: 84 LLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPE 143 + + A R A F A + + L ID + Sbjct: 46 SPGQRPLADANARRPVAVFAETFGLLAGQLDRQTRREGRAM---LRLIDSTPIPL---GK 99 Query: 144 LREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDN 203 L + S ++ + + S ++ + ++ I Sbjct: 100 LCGWAKSNGRIRGM-------KMHVVYDPDSDCPRLLDITDANVNDAQIGRTI--AIESG 150 Query: 204 SITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIE 244 + +FDK + + + ++ N+ +++ Sbjct: 151 ATYIFDKGYCHYGWWTAIAE--AKAFFVTRPKSNMGLKVVR 189 >UniRef50_Q05309 Transposase for insertion sequence element IS1151 n=16 Tax=Clostridium perfringens RepID=T1151_CLOPE Length = 473 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 55/184 (29%), Gaps = 14/184 (7%) Query: 87 RSAVTQARQRVGAAPVEWLF----RQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 + A+ + + ++ +F + ++ D F P Sbjct: 75 KQALDKRFNKYSVEFMKEIFIKFLYSQNNTLTNLERTLRTYFD-RVIINDSISFTLP--- 130 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD 202 + + + + +++ L + +N +++ +M Sbjct: 131 ---KEFKKKFPGSGGVASPSSIKVQLQYELLTGSFMNIDIFSGIKNDVEYLKTMKKYKDY 187 Query: 203 NSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA-SEMIELGNTASPGTIPKRLEHL 261 + L D ++ D L L++ G ++ N + GTI K E++ Sbjct: 188 KDLKLADLGYFKIDYLKRLDKSGTA--FISKVKSNTSLYIKNPSPEKYKVGTIKKSSEYI 245 Query: 262 RGAL 265 + + Sbjct: 246 KIDI 249 >UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_METBF Length = 435 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 31/228 (13%), Positives = 70/228 (30%), Gaps = 21/228 (9%) Query: 14 PLMPPPSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLN 73 P PP E P EW++ + VR R++ ++ W++ ++ VR Sbjct: 3 PCPPPTLEDSLREMFPEEWLRQTAKETGLI-VRERKIDPVIIFWVL----TLSFGVRLQR 57 Query: 74 LSADGEAGMNLLARSAVTQAR--QRVGAAPVEWLFRQTAQDRGAERYL------KDDWHG 125 A + ++ ++ + R VE+L + K Sbjct: 58 TLASLKREYETESQKTISDSSWYYRFTPELVEFLHQCVIHGMEELAKEPGRKLSKKLETF 117 Query: 126 LQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY 185 + D R R + +A + T +++ +++ ++ Sbjct: 118 QDVVIQDSTIVRLHSSLADR--FPAARSRTVAAG----VKVGVMVSAIANGPRTIALYSE 171 Query: 186 RQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLP 233 + +E + + I L D FY + + + G + Sbjct: 172 KTAEIKTLK--IGPWIKDHILLVDLGFYKTQMFARVEENGGYFVSRIR 217 >UniRef50_A3YV11 Transposase (Class II) n=2 Tax=Synechococcus sp. WH 5701 RepID=A3YV11_9SYNE Length = 344 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 37/307 (12%), Positives = 88/307 (28%), Gaps = 50/307 (16%) Query: 19 PSAQLFAEHLPTEWIQHCLTL----SAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNL 74 P+ +E +P + + L + R+R+ ++ M+V + L Sbjct: 21 PTLATLSETIPWDAFRPLLEQGYSHERKSNAGRKRIDPIILFKMLVLQQLFNLSDEELEF 80 Query: 75 SADGEAGM----------NLLARSAVTQARQRV-GAAPVEWLFRQTAQDRGAERYLKDDW 123 + + + + R+R+ A V+ LF + + Sbjct: 81 QVNDRRSFEEFVGLGVMNTIPDATTIAFFRERLRKAGVVDELFERFEEHLRTHGLEA--- 137 Query: 124 HGLQLFAIDGAQFRTP-------------DKPELREYYGSANTSTKRQNAYPVMRLVAL- 169 G Q+ ID P D ++ N ++ ++ + Sbjct: 138 RGGQI--IDATLVPVPKQRNSREENKTIKDGAIPEKWLDKPNRLRQKDTDARWVKKNGVN 195 Query: 170 ---------MNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLT 220 ++ + P ++ + +L + D + Sbjct: 196 HYGYKNSICIDATHGFIRRFAITPANIHDSQMLTQVLDPENRDDFVWADSGYAGAQFEDL 255 Query: 221 LNQKGCNRHWLLPAWKNIASEMIELG--NTASPGTIPKRLEHLRGALEV---VFITKRPR 275 L+ G + + + E T+ R+EH+ GA+ +T+R Sbjct: 256 LDLGGFES--RIHEKGSRCHPLSEEAKERNKVRSTVRARVEHVFGAITTCMRGKLTRRIG 313 Query: 276 PSRPRSV 282 +R ++ Sbjct: 314 LARTKAW 320 >UniRef50_D1XZ52 Transposase, IS4 family n=1 Tax=Prevotella bivia JCVIHMP010 RepID=D1XZ52_9BACT Length = 241 Score = 48.7 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 49/127 (38%), Gaps = 11/127 (8%) Query: 164 MRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQ 223 M+L L ++ + I +V + ++ S +FD+ + + + L + + Sbjct: 1 MKLHELYDVKTDIPTFSVITDASVHD-SQVMELIP-YEKESFYIFDRAYMATNKLYIIEE 58 Query: 224 KGCNRHWLLPAWKNIASEMIELGNT--ASPGTIPKRLEHLRGALEVVFITKRPRPSRPRS 281 ++++ ++ E+IE S G + ++ +G TK+ P++ R Sbjct: 59 AEA--YFVVREKHKMSFEVIEDKEYNTPSSGIMADQIIRFKG-----HKTKKQYPNKLRR 111 Query: 282 VKISKTR 288 V Sbjct: 112 VVFYDYD 118 >UniRef50_C6J0N9 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J0N9_9BACL Length = 402 Score = 48.7 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 24/203 (11%), Positives = 72/203 (35%), Gaps = 15/203 (7%) Query: 43 ATVRRRRLPGDMVIWMVVQNE-----PITDVVRRLNLSADGEAGM-NLLARSAVTQARQR 96 A R R+L I + V+++ ++ L + D + ++ S +++ ++ Sbjct: 27 ADHRTRKLTTGKAIQIFVESQLAGRTSYDEISEHLRIMPDLQDDHLKSISASQLSRKIKQ 86 Query: 97 VGAAPVEWLF-RQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTST 155 + ++ +F A+ + + + + +L +D P Y+ + Sbjct: 87 LPTDLLQAIFLCNIARIQEITKQKQGIPNIGKLRILDSTVLTLPTLAGRWAYWSKEQNAV 146 Query: 156 KRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSE 215 K + + + + + S+ +A ++ + D++I + D+ + Sbjct: 147 KIHTQL------VVADRETVFPGKIINSTAAVSDQEVALDLV--VADDAIHVMDRGYIQY 198 Query: 216 DLLLTLNQKGCNRHWLLPAWKNI 238 +L +L + L + Sbjct: 199 ELYESLIHQQMRFVARLQTKNKV 221 >UniRef50_B5ID46 Transposase, IS4 family protein n=9 Tax=Aciduliprofundum boonei T469 RepID=B5ID46_9EURY Length = 282 Score = 48.3 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 57/169 (33%), Gaps = 16/169 (9%) Query: 67 DVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGL 126 +VR + + +S +++ +R+ + + R+ + G Sbjct: 58 QIVREMRKIKRVMRLKKIPHKSTISRELRRIPELWIRIVLREIIRALGIPSK-------- 109 Query: 127 QLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR 186 FA+D + + Y + ++L A +++ + ++ NA+ + Sbjct: 110 --FAVDSTGIQISYRS-----YYYTQRIGEVGKIREGLKLHAAVDIDTKLITNAIVTKWH 162 Query: 187 QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 +++ +L + DK + S + + KG + + Sbjct: 163 TNDSPYLIPLLEEERVKEVY-ADKGYDSLRNIRFVLNKGGTPYIAIRNK 210 >UniRef50_B9MHR2 Transposase IS4 family protein n=1 Tax=Diaphorobacter sp. TPSY RepID=B9MHR2_DIAST Length = 336 Score = 47.9 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 67/237 (28%), Gaps = 21/237 (8%) Query: 57 WMVVQNEPITDVVR-RLNLSADGEAGM---NLLARSAVTQARQRV-GAAPVEWLFRQTAQ 111 W + + + +R RL+ L S + + R R+ A + L Sbjct: 65 WHGLSDTQLEQALRVRLDFMVFTGFEPSAGELPDASTICRFRNRLVKAELEQKLLALINS 124 Query: 112 DRGAERYLKDDWHGLQLFAIDGAQFRTPDKPEL------------REYYGSANTSTKRQN 159 G + ID + +P A K ++ Sbjct: 125 QLEQRGLK---VQGARGAIIDATIIPSAARPRQHVEGEGEQARLVDSADSEARWVKKGKH 181 Query: 160 AYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDN-SITLFDKLFYSEDLL 218 A+ R ++ + + P ++E ++ + L DK + S+ Sbjct: 182 AFFGYRGHTAVDSEDGYVEHVQVHPANEAEINKLPEIVEALSPGIEAVLADKGYASKANR 241 Query: 219 LTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPR 275 L ++G + G+I ++E G ++ F R R Sbjct: 242 QWLAERGIGDLIQHKGSAGKPVHALLKQFNKQIGSIRFKVEQAFGTMKRRFHLGRAR 298 >UniRef50_D2PLH1 Putative uncharacterized protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PLH1_9ACTO Length = 98 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 30/85 (35%), Gaps = 11/85 (12%) Query: 63 EPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQ-----------TAQ 111 P + L+ A ++ SA+TQAR+ +G LF + TA+ Sbjct: 8 RPDQASLGVLDRWNCWNAAWSVPTVSAITQARKWLGRCVFPELFERACGPVVSEAGLTAE 67 Query: 112 DRGAERYLKDDWHGLQLFAIDGAQF 136 +L AIDG + Sbjct: 68 AVALGTARGSFLRRWRLLAIDGFEI 92 >UniRef50_Q3EKJ9 Transposase n=1 Tax=Bacillus thuringiensis serovar israelensis ATCC 35646 RepID=Q3EKJ9_BACTI Length = 122 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 13/84 (15%), Positives = 30/84 (35%) Query: 164 MRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQ 223 M++ +L S L T P +Q + T+ N + + D ++ L + Sbjct: 1 MKIQLEYDLLSGQFLYIHTGPGKQHNRTYGSLCIPTVAPNDLCIRDLGYFHLKDLQHIQH 60 Query: 224 KGCNRHWLLPAWKNIASEMIELGN 247 K + L+ + ++ + Sbjct: 61 KKAYSYLLITVHRLAPLQIKRSAS 84 >UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium RepID=C6AUF2_RHILS Length = 372 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 54/203 (26%), Gaps = 32/203 (15%) Query: 33 IQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARS---A 89 ++ L T R + ++ + + + +R A+ L S Sbjct: 24 LEATARLRGAFTRVREIKNAETLLRLALAYGGLGMSLRETCAWAEAGGIARLSDPSLLER 83 Query: 90 VTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYYG 149 + +A +G V L + A+ + G +L +DG P Sbjct: 84 LCKAAPWLG-DIVAALIAEQAK------VPTGRFAGYRLRVLDGTSICHP---------- 126 Query: 150 SANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFD 209 RL +L + + +E + T I L D Sbjct: 127 --------GADRTTWRLHVGYDLATAQVDQLELTDIHGAEN--LQRL--TYAPGDIVLAD 174 Query: 210 KLFYSEDLLLTLNQKGCNRHWLL 232 + + L + G + Sbjct: 175 RYYARPRDLRPVIDAGADFIVRT 197 >UniRef50_B3E2B5 Transposase IS4 family protein n=7 Tax=Proteobacteria RepID=B3E2B5_GEOLS Length = 360 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 36/300 (12%), Positives = 82/300 (27%), Gaps = 54/300 (18%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLN------LS 75 Q +E P S +++ +L ++ ++ +V ++ Sbjct: 36 QALSEMSP--LFDSMYADSGRSSIAPEKLLKAQLLMILFSIRSNRQLVEQIRYNFLYRWF 93 Query: 76 ADGEAGMNLLARSAVTQARQRV-GAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGA 134 + S+ T+ +R+ G+ + + + F +DG Sbjct: 94 LGMGLDDEIWDHSSFTKNHERLIGSDVAAEFLSRILAQ-----AERKRLLSREHFTVDGT 148 Query: 135 QFRTPDKPELREYYGSANTSTKRQNAY------------------PVMRL---------- 166 + + ++ +N P RL Sbjct: 149 LIEAWASIKSFKPKDGPPSAGGGKNKSVDFKGKQLKNDTHSSSTDPNARLYRKGNTKEAK 208 Query: 167 -----VALMNLGSHILLNAVTAPYR-QSETVLAHSMLATIPDNSITL---FDKLFYSEDL 217 LM + +++ E A +M+ +P + + DK + +E Sbjct: 209 LCYQGHTLMENRNGLIVKTSVTTATGTGEREAAKAMIRQLPRTTRRITVGADKGYDTEGF 268 Query: 218 LLTLNQKGCNRHWL---LPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRP 274 + L Q H I N + I KR+E G ++ + ++ Sbjct: 269 VKELRQINVTPHVAQNNTRRKSGIDGRTTAHPNYSISQRIRKRIEEGFGWMKTIGRLRKT 328 >UniRef50_B3JEV4 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JEV4_9BACE Length = 169 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 4/128 (3%) Query: 166 LVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKG 225 + L++ S + + + + +P +S+ + D+ + LL + + Sbjct: 1 MHTLLDYDSLLPEFVNITEGKCGDNR--GDLDIPVPPHSVVVADRGYCDFSLLDYWDSR- 57 Query: 226 CNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKIS 285 N +++ N+ IE ++ + K RP R +V Sbjct: 58 -NVFFVVRHRDNLLYSQIEERLLPETRAQNVLIDEIIELTGEQTKKKYTRPLRRIAVWND 116 Query: 286 KTRYPVKH 293 + Y V+ Sbjct: 117 EHGYVVQL 124 >UniRef50_B8CMP9 Transposase OrfB, putative n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CMP9_SHEPW Length = 75 Score = 46.0 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 14/42 (33%), Positives = 26/42 (61%) Query: 166 LVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITL 207 +V LM L SH+L+++ ++E LA +++ PDN+I + Sbjct: 1 MVCLMELSSHLLVDSSFGSVAENEMALAANLINNTPDNNIVI 42 >UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae RepID=Q7ULM3_RHOBA Length = 458 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 54/185 (29%), Gaps = 19/185 (10%) Query: 60 VQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYL 119 + ++ + N ++ +A + L+ + + + +E L + A Sbjct: 107 ISQAS--ELTKVRNKLSNEKASLGSLSEAGGLFSADHLKP-VIEALSAEVND---AAPDP 160 Query: 120 KDDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLN 179 + + A+DG+ +A RL + + + Sbjct: 161 RLSSIQQTITAVDGSLVNALPSLIAASILKQT-----TGSALVRWRLHTHFEVNNLLPAR 215 Query: 180 AVTAP---YRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWK 236 P + E + +L ++ + + D+ + L ++ + L Sbjct: 216 VDVTPDGGGQHDERAVLKRVLE---EDRLYVMDRGYAKFSLFNSIVASSSSYVCRLR--D 270 Query: 237 NIASE 241 N E Sbjct: 271 NTVYE 275 >UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AZS8_HERA2 Length = 442 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 39/115 (33%), Gaps = 7/115 (6%) Query: 128 LFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ 187 + D PD L Y ++ R A ++ ++L + L R Sbjct: 117 VRIHDSTTIGLPD--ALATTYRGCGNASARGTA--GLKCGVQLDLLTGTLCGIDLTDGRA 172 Query: 188 SETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEM 242 S+ VL+ +P S+ L D FY+ + L +WL + + Sbjct: 173 SDQVLSVQRAP-LPAGSLRLADLGFYNIRIFRELAA--AEVYWLSRVQSHSRIRL 224 >UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuringiensis serovar pakistani str. T13001 RepID=C3EBZ9_BACTU Length = 221 Score = 45.6 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 38/129 (29%), Gaps = 2/129 (1%) Query: 170 MNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRH 229 ++ S L + ++ T+ + + D ++ +NQKG Sbjct: 3 YDVISGDFLQLDITNGISHDAKYGQELIHTVEKRDLCIRDLGYFYLPDFHEINQKGAYYL 62 Query: 230 WLLPAWKNI--ASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRPRSVKISKT 287 LP + ++ +V + P+R K++ Sbjct: 63 SRLPINTQVYRKKGILYERLYLEDFIKKVSEGKTIEWFDVYIRKQHKVPTRLIIYKLTGA 122 Query: 288 RYPVKHSAA 296 Y K++ + Sbjct: 123 GYDGKNNVS 131 >UniRef50_Q55646 Transposase n=1 Tax=Synechocystis sp. PCC 6803 RepID=Q55646_SYNY3 Length = 227 Score = 45.2 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 18/163 (11%), Positives = 51/163 (31%), Gaps = 21/163 (12%) Query: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 + + +RLNL + S ++A ++ + ++ + Sbjct: 41 SQTSMRSMFKRLNLRG------ETVDISTFSKASKKRDVGVFREIIFSLKKELSKR--KE 92 Query: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNA 180 L++F +D + + +++ + +NL + I Sbjct: 93 IKQGELEIFPLDSTIVSIT-------------SKLMWNLGFHQVKVFSGINLSTGIPGGI 139 Query: 181 VTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQ 223 V + + + + P+N + + D+ F + L + Sbjct: 140 VIHFGQGHDNKYGNETIEETPENGVAVMDRGFCDLQRIKRLQK 182 >UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4SUB1_AERS4 Length = 420 Score = 45.2 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 59/220 (26%), Gaps = 15/220 (6%) Query: 26 EHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDV-----VRRLNLSADGEA 80 + L + C+ +R R + M++ +++ V + + Sbjct: 12 QLLTPAETE-CIARLCKFCLRLRAITPWMLVTSLLRAFGGGKVGAIACLHQHFNGLQLAH 70 Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQ-TAQDRGAERYLKDDWHGLQLFAIDGAQFRTP 139 + + Q R+ A ++ L + A G + Q+ DG F Sbjct: 71 THQVSYKPFHNQLRKPAFAQFMKALVERAIALRIGQQVTDVAQGAFKQVLLQDGTSFAV- 129 Query: 140 DKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT 199 L + + + M+L L + SE A Sbjct: 130 -HKRLATVFPGRFKTISPAA----IECHMTMSLLEQKPLCMQLSADTASERQFLPD--AK 182 Query: 200 IPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA 239 S+ L D + +N+ GC N Sbjct: 183 KLTGSLLLADAGYIDRAYFAEVNKAGCFYLVRGRKGLNPK 222 >UniRef50_B7IHA7 Putative uncharacterized protein n=5 Tax=Thermosipho africanus TCF52B RepID=B7IHA7_THEAB Length = 130 Score = 45.2 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 27/69 (39%), Gaps = 4/69 (5%) Query: 176 ILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAW 235 +LL+ R E A ML I + SI L D+ ++ + + +N+K + Sbjct: 1 MLLDFEIGNNR--EVEHAEIMLEDI-EGSILLADRGYWQWEFIEKMNEK-MKLYIRSRGR 56 Query: 236 KNIASEMIE 244 K E Sbjct: 57 KGKEFMERE 65 >UniRef50_A8YN96 Similar to the central part of tr|Q3M9Z5|Q3M9Z5_ANAVT Transposase n=1 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YN96_MICAE Length = 148 Score = 44.8 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 12/99 (12%), Positives = 38/99 (38%), Gaps = 10/99 (10%) Query: 128 LFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPY-R 186 ++ DG+ +LR+ ++ + + R++ ++ + + + R Sbjct: 43 IWIADGSTLE-----QLRKSLKASEKESGKLAG----RIMMIVEAFTQVPVTVWYQKNER 93 Query: 187 QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKG 225 ++ V ++ +P + + + D F+S +G Sbjct: 94 CNDKVWVEQLINELPTSGLLVVDLGFFSFPWFDLTRMRG 132 >UniRef50_B9LW44 Transposase IS4 family protein n=12 Tax=Halobacteriaceae RepID=B9LW44_HALLT Length = 273 Score = 44.8 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 31/257 (12%), Positives = 82/257 (31%), Gaps = 32/257 (12%) Query: 31 EWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSA------DGEAGMNL 84 + + + +RR ++ + ++ T L+ L Sbjct: 18 HLARRAVARYSSKFSKRRYTLHQHIVLLCLKVRKNTTYRTLLDELIEMPRIRSAIDLEEL 77 Query: 85 LARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPEL 144 + S + +A R+G A L + + ID + F + Sbjct: 78 PSPSTLCKAFNRLGMAVWRVLLNLSVTLLPTNG----------VVGIDASGFDRSHASK- 126 Query: 145 REYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDN- 203 + + + + +++ L++ + +++ R+ ++ +A S++ D+ Sbjct: 127 -------HYTKRTKLTIQQLKVTLLVDTRVNAIIDLHVTTTRKHDSKIAPSLIRRNTDDV 179 Query: 204 SITLFDKLFYSEDLLLTLNQKGCNRHWL------LPAWKNIASEMIELGNTASPGTIPKR 257 +I L DK + + + + G L N+ + G + T+ R Sbjct: 180 TILLGDKGYDDQKIRTLAREDGVRPVIKHRGFSSLHKAWNVRLDADIHGQRSQNETVNSR 239 Query: 258 LEHLRGA-LEVVFITKR 273 ++ G + K+ Sbjct: 240 IKRKYGEFVRSRRWWKQ 256 >UniRef50_A7C2A8 Transposase of IS641 n=1 Tax=Beggiatoa sp. PS RepID=A7C2A8_9GAMM Length = 304 Score = 44.4 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 7/75 (9%), Positives = 23/75 (30%), Gaps = 3/75 (4%) Query: 164 MRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQ 223 ++L L + + + SE M + + D+ + S + + + Sbjct: 13 LKLHLCFELNRMLAVEFLVTAANFSERAALIKM---LKAGVTYIADRGYMSFKVGDEVLK 69 Query: 224 KGCNRHWLLPAWKNI 238 + + + + Sbjct: 70 AKAHFVFRVKTGLRL 84 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 44.0 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 67/210 (31%), Gaps = 27/210 (12%) Query: 83 NLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKP 142 L + ++ + + F+Q G + + AIDG R Sbjct: 66 GLPSHDTFSRVFRLLDPVAFSRCFQQFLDHLGEDGAG--------VLAIDGKTLR----- 112 Query: 143 ELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP- 201 S R + +V+ G+ +++ ++E V A ++L Sbjct: 113 ----------RSFDRAAGRSALHVVSAFASGARMIVGQRAVAAGENEIVAARALLELFDL 162 Query: 202 DNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHL 261 + D L E T+ ++G + WL P N + E+ + + H+ Sbjct: 163 KGVLVTGDALHAQERTAQTILERGGD--WLFPLKDNRPALRAEVERYFADPATVLAVPHV 220 Query: 262 RGALEVVFIT-KRPRPSRPRSVKISKTRYP 290 + I +R S + S R+P Sbjct: 221 TTDADHGRIEVRRHWVSHDVAWLASDRRFP 250 >UniRef50_A1RJM4 Transposase, IS4 family n=17 Tax=Gammaproteobacteria RepID=A1RJM4_SHESW Length = 253 Score = 44.0 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 15/135 (11%), Positives = 43/135 (31%), Gaps = 13/135 (9%) Query: 99 AAPV---EWLFRQTAQDRGAERYLKDDWHGLQLFA------IDGAQFRTPDKPELREYYG 149 ++R+ + L + L+ A IDG+ + Sbjct: 51 PEAFGDWSAIYRRFN-LWSKKGILMQLFAELRQLADLEWEFIDGSIVKAHQHATGAR--S 107 Query: 150 SANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFD 209 + + ++ ++ + ++ + ++ A +L +PD + D Sbjct: 108 EDKEAIGKSRGGNTTKIHMAVD-SCGLPIDFIVTGGEVHDSKAAIELLKQLPDAEHVIAD 166 Query: 210 KLFYSEDLLLTLNQK 224 + + SE + + +K Sbjct: 167 RGYDSEKIREQIREK 181 >UniRef50_Q9F9K7 Transposase n=2 Tax=Gammaproteobacteria RepID=Q9F9K7_PISSA Length = 239 Score = 44.0 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 44/119 (36%), Gaps = 8/119 (6%) Query: 128 LFAIDGAQFRTPDKPE---LREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAP 184 ++ +D R + R + G A S Y +L ++N L+ + Sbjct: 111 IYFVDSTILRVCHEKRASQNRAFKGLAKKSKSTMGWYYGFKLHIIVNDM-GELMAFKMSK 169 Query: 185 YRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 + V+ M + + DK + S+ L L +KG + KN+ ++++ Sbjct: 170 ATTDDRVVLPKMAENLTGK--IIGDKGYISQKLFDQLYEKGLQL--ITKIRKNMKNKLV 224 >UniRef50_B0VT13 Transposase of ISAba6, IS982 family n=32 Tax=Acinetobacter baumannii SDF RepID=B0VT13_ACIBS Length = 315 Score = 43.3 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 8/119 (6%), Positives = 38/119 (31%), Gaps = 6/119 (5%) Query: 128 LFAIDGAQFRTPDKPELREY--YGSANTSTKRQN-AYPVMRLVALMNLGSHILLNAVTAP 184 + ID + + ++ + + + K + ++ + + L++ Sbjct: 127 IAFIDSTKLAVCHNKRIHQHRVFADSASRGKTSVDWFYGFKIHLICDHI-GRLVSYCITT 185 Query: 185 YRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMI 243 + + ++ D+ + ++ L + G + +N+ +++ Sbjct: 186 GNVDDRKVLPDLIEHSKLKGKLFGDRGYVGKNWKSRLAEVGVQL--ITRVKRNMKPQVL 242 >UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae RepID=B0R9A9_HALS3 Length = 424 Score = 43.3 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 23/227 (10%), Positives = 61/227 (26%), Gaps = 27/227 (11%) Query: 21 AQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVV----QNEPITDVVRRLNLSA 76 + P+++++ R +L +++W +V E T R Sbjct: 1 MRRLTTLFPSKFLEEHAEELGVVE-REGKLQIPVLVWALVFGFAAGESRTLAGFRRCY-- 57 Query: 77 DGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDW-HGLQLFAIDGAQ 135 ++ A + L + + D + DG Sbjct: 58 -NSTADETISPGGFYHRLTPTLAEYLRDLVEHGLDEVAVPDTVDADIDRFRDVMIADGTV 116 Query: 136 FRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHS 195 R + A + +L L N + + ++ L + Sbjct: 117 LRLHEFLSDE---FQARHEEQAG-----AKLHLLHNATDETIERIDVTDEKTHDSTLFKT 168 Query: 196 ---MLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA 239 + + LFD+ ++ +++ + +++ +N Sbjct: 169 GSWLQERL-----VLFDRAYFKYRRFALIDEN--DGYFVSRLKENAN 208 >UniRef50_D0J4K5 Transposase, IS4 n=16 Tax=Proteobacteria RepID=D0J4K5_COMTE Length = 264 Score = 43.3 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 33/107 (30%), Gaps = 5/107 (4%) Query: 131 IDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSET 190 IDG + R + L + + + ++ Sbjct: 90 IDGTYAKAHQHSSGAASDQPEAIGKSRAGNTSKIHLAVDAH---GLPVAFDITGGEINDC 146 Query: 191 VLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN 237 A ++A +P + + DK + SE L + +G +P +N Sbjct: 147 TAAPELIAQLPSAEVIVADKGYDSERLRQQIEVQGARPV--IPRKRN 191 >UniRef50_A0LL64 ISGsu1, transposase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LL64_SYNFM Length = 179 Score = 42.9 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 16/141 (11%), Positives = 35/141 (24%), Gaps = 7/141 (4%) Query: 26 EHLPTEWIQHCLTLSAHATVRRRRLPGD-----MVIWMVVQNEPITDVVRRLNLSADG-- 78 + + + R L M+ + + D+ + A Sbjct: 3 KFVDRHDFNRIEQGGFKPRRKSRTLNRWNQFVAMMFAQLTGRCSLRDIADQFRSQASRLY 62 Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRT 138 G+ + RS ++ A A + LF + A K + + D + Sbjct: 63 HLGVRPVKRSTLSDANHDRPADFFQALFDRQYARCAAIAPKKKFRFKYKPNSFDSSVVNL 122 Query: 139 PDKPELREYYGSANTSTKRQN 159 R + + Sbjct: 123 CLSLFPRARFQETKGGIELHT 143 >UniRef50_P11901 Transposase for insertion sequence element IS421 n=41 Tax=cellular organisms RepID=T421_ECOLX Length = 371 Score = 42.9 bits (99), Expect = 0.013, Method: Composition-based stats. Identities = 35/280 (12%), Positives = 72/280 (25%), Gaps = 35/280 (12%) Query: 22 QLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPI-TDVVRRLNLSADGEA 80 + A E + + T RR ++ + + P +R + A Sbjct: 10 AILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSSLREVTAWAQLHD 69 Query: 81 GMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPD 140 L + + + R +W AQ + G +L +DG P Sbjct: 70 VATLSDVALLKRLRN-----AADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAP- 123 Query: 141 KPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATI 200 RL + + + R +E L Sbjct: 124 -----------------GGGSAEWRLHMGYDPHTCQFTDFELTDSRDAER------LDRF 160 Query: 201 --PDNSITLFDKLFYSE-DLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKR 257 + I + D+ F S + + +L + + W+ + E G + Sbjct: 161 AQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVH-WRGLRWLTAEGMRFDMMGFLRGL 219 Query: 258 LEHLRGALEVVFITK-RPRPSRPRSVKISKTRYPVKHSAA 296 G V+ + P ++ P + + Sbjct: 220 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALI 259 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 42.5 bits (98), Expect = 0.018, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 41/117 (35%), Gaps = 1/117 (0%) Query: 164 MRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNS-ITLFDKLFYSEDLLLTLN 222 + +V + GS ++ +A + ++SE L +L + S + FD L L + Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 223 QKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPRPSRP 279 ++G + + + E ++ T P + KR P Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCP 236 >UniRef50_A6UM74 Transposase and inactivated derivatives-like protein n=83 Tax=Bacteria RepID=A6UM74_SINMW Length = 254 Score = 42.5 bits (98), Expect = 0.018, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 48/149 (32%), Gaps = 14/149 (9%) Query: 127 QLFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYR 186 + ID R + + R ++ AL++ +N + Sbjct: 89 DIVMIDSTCVRVHQHAATGKKGDGDDGGMGRSRGGLTSKIHALVD-AEGRPVNLRLTGGQ 147 Query: 187 QSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELG 246 ++ A ++ + + I L DK + S + + ++ AW NI + G Sbjct: 148 IADCTEADALTDELGEGDILLADKGYDSNAIRAKVAER--------KAWANIPPKTNRKG 199 Query: 247 NTASPGTIPKRLEHLRGALEVVFITKRPR 275 + + R + + L F + + Sbjct: 200 SF-----VFSRWVYRQRNLVERFFNRIKQ 223 >UniRef50_Q5P589 Transposase, is4 family n=3 Tax=Rhodocyclaceae RepID=Q5P589_AZOSE Length = 363 Score = 42.1 bits (97), Expect = 0.024, Method: Composition-based stats. Identities = 36/283 (12%), Positives = 73/283 (25%), Gaps = 53/283 (18%) Query: 41 AHATVRRRRLPGDMVIWMVVQNEPITDVVRRLN------LSADGEAGMNLLARSAVTQAR 94 ++ RL ++ + +L+ D L +S ++ R Sbjct: 53 GRPSIAPERLLKGQLLIALYSIRSDRQFCEQLDYNILFRWFLDMNLESTSLDQSNFSRLR 112 Query: 95 QRV-GAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYY-GSAN 152 +R+ F + E+ L D F +DG + + G Sbjct: 113 ERLVQTDIARRFFDEVVSLARREKLLSSDH-----FTVDGTLIDAWASFKSFKRKDGEPP 167 Query: 153 TSTKRQNAY------------------PVMRL---------------VALMNLGSHILLN 179 P RL LM + + ++ Sbjct: 168 KDGGDGTGMVDFKGEKRSNATHQSTTDPESRLMRKGNGQPAKLSYGGHVLMENRNGLCVD 227 Query: 180 AVTAPYRQSETVLAHSMLATIPDN----SITLFDKLFYSEDLLLTLNQKGCNRHW-LL-- 232 + Q+E A +L DK ++ ++ + L + H + Sbjct: 228 ILITESTQAEHRAARQLLTRARRRRIHPKTLGADKGYHVKEFVSHLREHRVRPHIARIAN 287 Query: 233 PAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVFITKRPR 275 + KR+E + G L+ V ++ R Sbjct: 288 RTTPGLDGRTTRTEGYQISQRKRKRVEEIFGWLKTVGGMRKTR 330 >UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0AF19_9BACT Length = 362 Score = 41.3 bits (95), Expect = 0.043, Method: Composition-based stats. Identities = 27/218 (12%), Positives = 61/218 (27%), Gaps = 25/218 (11%) Query: 19 PSAQLFAEHLPTEWIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADG 78 L LP W + + + ++ +++ + +R ++ Sbjct: 8 EEWGLVKGLLPEGW-EVAAREQGAFKQAKGIRTAEELLRLILMHAGSGLSLRH-AVARGA 65 Query: 79 EAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRT 138 AG+ ++ A+ + R R + W+ + + + + G A+D Sbjct: 66 AAGLPEVSDVALLK-RLRNAEGWLRWMSVRLLEQQAGQPRWSRLPEGWTAVAVDSTTI-- 122 Query: 139 PDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLA 198 E G++ T RL + L S A + E L Sbjct: 123 -------EESGASGTDW---------RLHYAIGLPSLFCEQAELTDNKGGE-SLCR---Y 162 Query: 199 TIPDNSITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWK 236 + + L D+ F + + + Sbjct: 163 KVRKGDLFLGDRNFCRAPQIRHVMDHQGAVLLRWHSTS 200 >UniRef50_A8KXB4 Transposase n=1 Tax=Frankia sp. EAN1pec RepID=A8KXB4_FRASN Length = 449 Score = 41.3 bits (95), Expect = 0.045, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 42/131 (32%), Gaps = 9/131 (6%) Query: 148 YGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPD----N 203 + S ++ P + ++ L I + P S+ + + + Sbjct: 195 FRKYGKSKDHRDDLPQV-VIGLAVTREGIPVRVWCWPGNTSDQTVLAQVKDDLRSWKLGR 253 Query: 204 SITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKN---IASEMIELGNTAS-PGTIPKRLE 259 +T+ D+ F S L L + G + + + + + G + + + Sbjct: 254 VVTVVDRGFSSAANLAYLRRAGGHYLAGMRMRDGNPLVDTVLAHQGRYQTVRDNLRVKEI 313 Query: 260 HLRGALEVVFI 270 L A + F+ Sbjct: 314 KLPEAGDTRFV 324 >UniRef50_A8UR40 Transposase n=2 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UR40_9AQUI Length = 298 Score = 40.9 bits (94), Expect = 0.057, Method: Composition-based stats. Identities = 11/126 (8%), Positives = 36/126 (28%), Gaps = 14/126 (11%) Query: 128 LFAIDGAQFRTPDKPELREYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQ 187 + +D + R + + + + ++L +L ++ R Sbjct: 111 VIVLDSTGIKV----TNRGEWLRKKHGKRARKGW--IKLHVAFDLKRKKVVEIEVTDERV 164 Query: 188 SETVLAHSMLATIPDN--------SITLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIA 239 ++ A ++ S + D + + + L+ +G L+ + Sbjct: 165 HDSQKAKKLVEGAKREAKDKGKKVSKVVADSGYDTHEFFRYLHDEGICAGVLVRKGAKVR 224 Query: 240 SEMIEL 245 + Sbjct: 225 GNPLRD 230 >UniRef50_Q00840 Transposase for insertion sequence element IS1106 n=276 Tax=Proteobacteria RepID=T1106_NEIMB Length = 288 Score = 40.6 bits (93), Expect = 0.065, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 51/184 (27%), Gaps = 12/184 (6%) Query: 100 APVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELR-EYYGSANTSTKRQ 158 +E + RQ + + G++ R + + + G S + Sbjct: 72 ELLELINRQLTEKGLKVEKASAAVVDATIIQTAGSKQRQAIEVDEEGQISGQTTPSKDKD 131 Query: 159 NAYPVMRLVALMNLG--SHILLNAV-------TAPYRQSETVLAHSMLATIPDNSITLFD 209 + ++ L LG H +A P E +L +P + D Sbjct: 132 ARW--IKKNGLYKLGYKQHTRTDAEGYTEKLHITPANAHECKHLPPLLEGLPKGTTVYAD 189 Query: 210 KLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRGALEVVF 269 K + S + L + + A +N + +E G L F Sbjct: 190 KGYDSAENRQHLKEHQLQDGIMRKACRNRPLTETQTKRNRYLSKTRYVVEQSFGTLHRKF 249 Query: 270 ITKR 273 R Sbjct: 250 RYAR 253 >UniRef50_D2SEZ5 Transposase IS4 family protein n=5 Tax=Actinomycetales RepID=D2SEZ5_9ACTO Length = 297 Score = 40.6 bits (93), Expect = 0.067, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 59/210 (28%), Gaps = 20/210 (9%) Query: 32 WIQHCLTLSAHATVRRRRLPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVT 91 WI L R +++ V Q R + + Sbjct: 15 WIDDYLGPRRRPGRPPRLTDAELLTLAVAQALLDVRSEARWLRLIPQRLPGAFPSLPEQS 74 Query: 92 QARQRVGAAPVEWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFR------TPDKPELR 145 +R+ A L R+ +D A W ++ +D + EL Sbjct: 75 GYNKRLRGAVP--LLRRVIRDLAAATD---LWTD-PVWLVDSTPVECARSRPAARRSELA 128 Query: 146 EYYGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLAT------ 199 G + +RL L+ + + + A + E + ++L Sbjct: 129 GAAGY-GYCASHSRYFWGLRLH-LICTPAGLPITWALAHPKLDERQVLMAVLDHDAHLLT 186 Query: 200 IPDNSITLFDKLFYSEDLLLTLNQKGCNRH 229 + + DK + S +L L+ +G Sbjct: 187 ARPGLLIIADKGYASAELDDYLHARGVELL 216 >UniRef50_C9YUP4 Putative transposase (Fragment) n=4 Tax=Streptomyces RepID=C9YUP4_STRSW Length = 312 Score = 40.6 bits (93), Expect = 0.072, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 54/191 (28%), Gaps = 22/191 (11%) Query: 50 LPGDMVIWMVVQNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQT 109 L V+ ++ +RR+ D + + + R R ++ + + R Sbjct: 52 LVTLAVMSALLGYTSERRWLRRVGK--DFGRLFPYVPQQSGYSKRLRAASSLLTSMIRIL 109 Query: 110 AQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREY-----YGSANTSTKRQNAYPVM 164 A+ W ++ +D E + + + + Sbjct: 110 AR-------DTSLWSD-DVWLVDSTPVGCGCSRETAKRSDLAGWAQYGYCASHSRYFWGL 161 Query: 165 RLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIP------DNSITLFDKLFYSEDLL 218 RL + LG + + + E ML T P + DK +Y + Sbjct: 162 RLHLVCTLG-GLPVLFALTGAKADERETLRDMLDTAPDVTAARPGQTIIGDKNYYGREFE 220 Query: 219 LTLNQKGCNRH 229 L ++ Sbjct: 221 HDLAERHLELL 231 >UniRef50_B9XA94 ISPg4, transposase n=1 Tax=bacterium Ellin514 RepID=B9XA94_9BACT Length = 166 Score = 40.6 bits (93), Expect = 0.072, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 25/98 (25%) Query: 61 QNEPITDVVRRLNLSADGEAGMNLLARSAVTQARQRVGAAPVEWLFRQTAQDRGAERYLK 120 I+D ++ N RS ++ A E LFR A K Sbjct: 55 SLREISDGLKSCEGRLKHLGLENEPRRSTLSYANVHRPWELFERLFRDLLAQCQALSPKK 114 Query: 121 DDWHGLQLFAIDGAQFRTPDKPELREYYGSANTSTKRQ 158 +L ++D + + K Sbjct: 115 KFRFKNRLLSLDSTTVDLCANMFDWARWRRTKDAVKLH 152 >UniRef50_Q3STN4 Transposase, IS4 family n=13 Tax=Rhizobiales RepID=Q3STN4_NITWN Length = 210 Score = 40.2 bits (92), Expect = 0.090, Method: Composition-based stats. Identities = 12/62 (19%), Positives = 26/62 (41%), Gaps = 1/62 (1%) Query: 165 RLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSITLFDKLFYSEDLLLTLNQK 224 ++ AL++ + + + + A M+ T+ + L D+ + S L TL + Sbjct: 79 KIHALVD-ACGLPIVLKITEGQAHDGRSAQDMIDTVERGDVLLADRAYDSNALRQTLAAR 137 Query: 225 GC 226 G Sbjct: 138 GA 139 >UniRef50_B1Y0W6 Transposase IS4 family protein n=3 Tax=Burkholderiales RepID=B1Y0W6_LEPCP Length = 360 Score = 40.2 bits (92), Expect = 0.100, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 57/209 (27%), Gaps = 38/209 (18%) Query: 50 LPGDMVIWMVVQNEPI--------TDVVRRLNLSAD--GEAGMNLLARSAVTQARQRVGA 99 P ++++ MV V+ RL+ + +N+ + RQR+ Sbjct: 71 YPSEVLVRMVFLQGLYNLSDEQCEHQVLDRLSFQRFCRLDGALNIPDARTLWSFRQRLAQ 130 Query: 100 APV--EWLFRQTAQDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREYY--------- 148 + +F +Q + +D + P Sbjct: 131 GGLGGRAIFETLSQQLQQHG-----FIPRGGQIVDASIVAAPITQANTAEREALNKGETP 185 Query: 149 ------------GSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSM 196 G A + K +Y +L A + ++ + + Sbjct: 186 EGWSKKRMAHTDGDARWTQKHGKSYYGYKLHANADARYKLIRTLKITAANADDGQQLPHV 245 Query: 197 LATIPDNSITLFDKLFYSEDLLLTLNQKG 225 L + L D+ + S+ L Q+G Sbjct: 246 LQSANTRDRLLADRGYDSQANRQVLAQQG 274 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.122 0.279 Lambda K H 0.267 0.0372 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,396,934,321 Number of Sequences: 3077464 Number of extensions: 45872870 Number of successful extensions: 148309 Number of sequences better than 1.0e-01: 215 Number of HSP's better than 0.1 without gapping: 192 Number of HSP's successfully gapped in prelim test: 191 Number of HSP's that attempted gapping in prelim test: 147650 Number of HSP's gapped (non-prelim): 435 length of query: 299 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 171 effective length of database: 646,480,964 effective search space: 110548244844 effective search space used: 110548244844 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 92 (40.2 bits)