BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (402 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE 825 0.0 UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID... 417 e-115 UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobact... 402 e-110 UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae Re... 352 2e-95 UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 1... 295 2e-78 UniRef50_A7N7H3 Putative uncharacterized protein n=31 Tax=Vibrio... 265 3e-69 UniRef50_C6MY57 Putative transposase, IS4 family protein n=1 Tax... 254 6e-66 UniRef50_Q6LPG7 Hypothetical transposase n=7 Tax=Photobacterium ... 247 6e-64 UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii ... 243 1e-62 UniRef50_Q07YD1 Transposase, IS4 family n=6 Tax=Shewanella RepID... 239 2e-61 UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromon... 236 1e-60 UniRef50_Q9UH48 Gastric cancer-related protein GCYS-20 n=1 Tax=H... 234 3e-60 UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2... 204 7e-51 UniRef50_Q17U39 Transposase n=11 Tax=Gammaproteobacteria RepID=Q... 197 7e-49 UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellula... 187 6e-46 UniRef50_Q5X8W8 Putative uncharacterized protein n=1 Tax=Legione... 173 8e-42 UniRef50_A7MW84 Putative uncharacterized protein n=2 Tax=Vibrio ... 158 4e-37 UniRef50_B4UH67 Transposase IS4 family protein n=3 Tax=Proteobac... 112 3e-23 UniRef50_D0LPB8 Transposase IS4 family protein n=4 Tax=Haliangiu... 106 2e-21 UniRef50_A7MYH1 Putative uncharacterized protein n=4 Tax=Vibrio ... 95 5e-18 UniRef50_Q47076 BfpT, bfpV, bfpW and transposase genes, complete... 95 5e-18 UniRef50_Q5GUK2 ISxac1 transposase n=1 Tax=Xanthomonas oryzae pv... 84 1e-14 UniRef50_Q6LGR5 Putative transposase similar to Tn10 n=1 Tax=Pho... 76 2e-12 UniRef50_A9AVJ1 Transposase IS4 family protein n=1 Tax=Herpetosi... 70 2e-10 UniRef50_Q72IB6 Transposase n=3 Tax=Thermus thermophilus HB27 Re... 67 1e-09 UniRef50_B2JAE4 Transposase, IS4 family protein n=8 Tax=Cyanobac... 66 2e-09 UniRef50_B4WTK1 Putative uncharacterized protein n=6 Tax=Synecho... 58 5e-07 UniRef50_C7QY62 Transposase IS4 family protein n=9 Tax=Cyanothec... 53 2e-05 UniRef50_Q1QFL8 Putative uncharacterized protein n=1 Tax=Nitroba... 51 9e-05 UniRef50_A7N4N2 Putative uncharacterized protein n=1 Tax=Vibrio ... 49 5e-04 UniRef50_Q7NHH4 Gll2563 protein n=2 Tax=Gloeobacter violaceus Re... 48 5e-04 UniRef50_Q10V90 Transposase, IS4 family n=7 Tax=Trichodesmium er... 47 0.001 UniRef50_Q1ARL9 Transposase, IS4 family n=1 Tax=Rubrobacter xyla... 46 0.003 UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=... 44 0.009 UniRef50_B4ABV5 Transposase n=1 Tax=Salmonella enterica subsp. e... 42 0.059 UniRef50_B0BZT8 Transposase, IS4 family n=21 Tax=Cyanobacteria R... 41 0.072 >UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE Length = 402 Score = 825 bits (2132), Expect = 0.0, Method: Compositional matrix adjust. Identities = 398/402 (99%), Positives = 399/402 (99%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR Sbjct: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS Sbjct: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL Sbjct: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK Sbjct: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 NQRSTRTHCHHPSPKIYSASAKEPW+LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS Sbjct: 241 NQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS Sbjct: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 Query: 361 TVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLGKL 402 TVRLGMEVLRHSGYTITRED LVAATLL QNLFTHGY LGKL Sbjct: 361 TVRLGMEVLRHSGYTITREDLLVAATLLAQNLFTHGYALGKL 402 >UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID=A3D336_SHEB5 Length = 460 Score = 417 bits (1073), Expect = e-115, Method: Compositional matrix adjust. Identities = 214/398 (53%), Positives = 283/398 (71%), Gaps = 2/398 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLP-TKARTKHNIK 59 M L ILH SLYQ CPE+H KRLN+L + C AL++ LTLT LGR++ T TKH+IK Sbjct: 1 MQVLTILHQSLYQHCPEIHQKRLNTLMVTCRALINADCLTLTHLGRHIDGTSTHTKHSIK 60 Query: 60 RIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGR 119 R+DRLLGN HLH ER+AVY+WHA ++ + +TMP +LVDWSD+RE + L+ LRAS+A+ GR Sbjct: 61 RMDRLLGNPHLHHERMAVYQWHAKWLLTAHTMPTILVDWSDMREGRELIALRASIAIKGR 120 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYW 179 S+TLYE+ FPL Q ++ AH+QFL +L +LP N TPLIV+DAGF+ PW++ VE+LGWYW Sbjct: 121 SITLYERTFPLVLQGTQTAHNQFLNELRKVLPDNITPLIVTDAGFRNPWFRKVEQLGWYW 180 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 L RVRG Y + L+ + +K +G L+ P+ C+++L+++ SKGR Sbjct: 181 LGRVRGLSVYRPHPFGRQFSLKALYPQARRRAKHVGRVALSVKKPLLCEMVLFRAPSKGR 240 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 K QRST T CHH + Y +AKEPW L TNL ++ +P++LVNIY KRMQ+EETFRDLK Sbjct: 241 KGQRSTTTDCHHTAQWTYELTAKEPWALVTNLTMKAMSPQKLVNIYQKRMQMEETFRDLK 300 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 SPAYG GLRHSRT + R DI+LLIAL++QL W G++ + Q +HFQANTV+ RNVL Sbjct: 301 SPAYGFGLRHSRTRYAARMDILLLIALLVQLAFWWIGLYGETQQLQRHFQANTVKKRNVL 360 Query: 360 STVRLGMEVLRHS-GYTITREDSLVAATLLTQNLFTHG 396 ST+R+G E+LR Y I+ +D L AA L + THG Sbjct: 361 STIRMGKELLRRRHDYPISADDLLCAAKKLAELSLTHG 398 >UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobacteria RepID=Q15UH5_PSEA6 Length = 420 Score = 402 bits (1034), Expect = e-110, Method: Compositional matrix adjust. Identities = 193/369 (52%), Positives = 265/369 (71%), Gaps = 1/369 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ ILHD L + CP LH KRL++L +A +LLD + L+LTELGRN+ KHNIKR Sbjct: 19 MRDIHILHDLLKKQCPNLHAKRLSALMVATQSLLDGQQLSLTELGRNISGSVAPKHNIKR 78 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGN +LH ERL +YRWHA +C N MP+VLVDWSD+REQ R + LRASV++ GRS Sbjct: 79 IDRLLGNNNLHNERLDIYRWHARLLCGANPMPVVLVDWSDVREQLRHLTLRASVSVQGRS 138 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYE+ F E S +H+ FL +LASILP PLIV+DAG++ PW++ VEK GW+WL Sbjct: 139 VTLYERVFSFGEYNSPVSHNPFLRELASILPLGCCPLIVTDAGYRNPWFREVEKHGWFWL 198 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVRG V + G +W+ + + ++S +K LG +L + +P+ + LYK+++K RK Sbjct: 199 GRVRGDVGFKRDGQASWQSNKSFYPSANSRAKYLGCGQLGRKSPLHAHLHLYKAKAKHRK 258 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIR-TPKQLVNIYSKRMQIEETFRDLK 299 + RS++ +H + + Y A +KEPW+LATNLP + KQLV++Y++RMQIEETFRD+K Sbjct: 259 DNRSSKAGRNHTAQQSYRAGSKEPWLLATNLPENDKLNSKQLVSLYARRMQIEETFRDIK 318 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 SP YG+GLRHS + ++RFDI+LLIA++ + L G+ A KQ W++ FQANT+R+R VL Sbjct: 319 SPQYGMGLRHSNSRCTKRFDILLLIAMLAEWLLRLLGIIAVKQNWERAFQANTIRHRRVL 378 Query: 360 STVRLGMEV 368 S +RLG EV Sbjct: 379 SIIRLGREV 387 >UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae RepID=Q6LJK0_PHOPR Length = 394 Score = 352 bits (902), Expect = 2e-95, Method: Compositional matrix adjust. Identities = 175/349 (50%), Positives = 232/349 (66%), Gaps = 5/349 (1%) Query: 30 CHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGN 89 C L L L GR+LP+KA+TKH IKR+DRLLGN HLH +RL +YRWH CS N Sbjct: 29 CKHCLAMMHLRLLYFGRSLPSKAKTKHCIKRVDRLLGNNHLHHDRLDIYRWHCHQFCSVN 88 Query: 90 TMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASI 149 PIVLVDW+DIRE +RLMVLRAS+A+ GRSVTL+E+ F S ++H QFL D ++ Sbjct: 89 PQPIVLVDWADIREYERLMVLRASIAVEGRSVTLFEQTFTFKNYNSPRSHQQFLDDFKAV 148 Query: 150 LPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSS 209 LPS+ P+IV+DAGF+ W++ V+ + W +L RVRG V L W+ I L ++S Sbjct: 149 LPSHVIPIIVTDAGFRNTWFRQVDDMDWCYLGRVRGDVNV--LIKNQWQHIKQLFIKANS 206 Query: 210 HSKTLGYKRLTKSNPISCQILLYKSRS-KGRKNQRSTRTHCHHPSPKIYSASAKEPWILA 268 K +G+ +L K P+ C + LYK ++ K RK++ R H + ++ SA EPW+LA Sbjct: 207 KPKYVGFTQLAKRKPLQCHLHLYKKQTPKKRKDRPKGRE--HFSAQAVHKKSALEPWVLA 264 Query: 269 TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALML 328 TNLP +I + + +V +Y+KRMQIEETFRDLKSP YG GLR SRT +RFDI+LLI L+ Sbjct: 265 TNLPTDIFSSRCIVRLYTKRMQIEETFRDLKSPQYGFGLRQSRTHDPKRFDILLLIGLLA 324 Query: 329 QLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTIT 377 + W G+ A+ GW +HFQAN+V++R VLS VRLG EV R Y I Sbjct: 325 FMVYWWFGIIAEHNGWHRHFQANSVKDRRVLSFVRLGKEVFRRLEYHIN 373 >UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0I6N0_VIBHO Length = 345 Score = 295 bits (756), Expect = 2e-78, Method: Compositional matrix adjust. Identities = 145/318 (45%), Positives = 204/318 (64%), Gaps = 5/318 (1%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ IL ++L P +H KRL SL LA + L LTLT+LGR+L T KH IKR Sbjct: 1 MRDIQILQETLTNHYPTIHKKRLQSLLLATESALGGADLTLTKLGRSLNTFTAAKHAIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 +DRLLGN LH+E+ +Y+W+A I N P++L+DWSD+REQ R M LRAS+AL GR+ Sbjct: 61 VDRLLGNTRLHREKEDIYKWNARLIAGANPCPVILLDWSDVREQLRFMTLRASIALDGRA 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYE+AF ++ S K H FL L ILP + TP+I+SDAGF+ W++ V+ GW+WL Sbjct: 121 VTLYEQAFEYAQYNSPKTHQYFLGKLQEILPPSATPIIISDAGFRNTWFRQVQSKGWFWL 180 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVRG V + +W+ L+ ++S +LG +L + +P++C + K +K Sbjct: 181 GRVRGDVSI-KMTQSDWQSNKTLYPDATSKPHSLGQCQLARRSPLTCNGYVVKQ----QK 235 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 QR +RT H + ++++ +A EPW+L TN+P E Q+ +Y+KRMQIEE FRDLKS Sbjct: 236 AQRHSRTGQKHTASRLFAKNANEPWLLVTNIPTETLNAVQICRLYAKRMQIEEAFRDLKS 295 Query: 301 PAYGLGLRHSRTSSSERF 318 AYGL LRH+RT + R Sbjct: 296 TAYGLALRHNRTHHNRRL 313 >UniRef50_A7N7H3 Putative uncharacterized protein n=31 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N7H3_VIBHB Length = 397 Score = 265 bits (676), Expect = 3e-69, Method: Compositional matrix adjust. Identities = 140/368 (38%), Positives = 216/368 (58%), Gaps = 8/368 (2%) Query: 21 KRLNSLTLACHALLDCKTLTLTELGRNLP-TKARTKHNIKRIDRLLGNRHLHKERLAVYR 79 +R+ ++ +AL + TLTLT LGR + TK + KH IKR+ RLLGN HLH+ER VY Sbjct: 23 RRITAVLDCINALNEKDTLTLTGLGRGMKNTKTKVKHCIKRVYRLLGNPHLHRERTGVYA 82 Query: 80 WHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAH 139 + F+ PI++VDWS + + +LRA++ + GR+ TLYE+ P + S H Sbjct: 83 YITDFLLKNVKHPIIIVDWSPVNHVDK-QILRATIPIGGRAFTLYEEVHPECKLGSLAVH 141 Query: 140 DQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRG--KVQYADLGAENW 197 F+ LA+++P P++ +DAGFKVPW+K +E+ GWYWL RVRG K++ D W Sbjct: 142 KAFIRRLATMVPKGVIPIVTTDAGFKVPWFKPIEQQGWYWLGRVRGNSKLRVND----RW 197 Query: 198 KPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIY 257 + + + LG LTK + CQ+ LY+ +SKGRK + + + + + Sbjct: 198 CSADEVFVQAQYKPQHLGTAELTKQHQYPCQVCLYRKKSKGRKAKNWSGSLQRNTVSLSH 257 Query: 258 SASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSER 317 + +EPW+L +NLP E +++V +Y++RM IEE FRD K+ YGL L S ++S +R Sbjct: 258 AKGEREPWLLVSNLPGETWFAERVVALYTQRMSIEEGFRDTKNERYGLALNFSGSASPKR 317 Query: 318 FDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTIT 377 +I+L+I ++ Q + G A +G+ K FQANT+R R VLS LG E++ Y+ + Sbjct: 318 IEILLMIGMLTQFALLVVGKVAYLKGYYKDFQANTIRTRRVLSYFFLGKELIGREAYSFS 377 Query: 378 REDSLVAA 385 +D +A Sbjct: 378 VKDLALAV 385 >UniRef50_C6MY57 Putative transposase, IS4 family protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MY57_9GAMM Length = 397 Score = 254 bits (648), Expect = 6e-66, Method: Compositional matrix adjust. Identities = 143/378 (37%), Positives = 220/378 (58%), Gaps = 1/378 (0%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRL 64 +LH+ L Q +H KRL+SL A A + + +T+T LGR L + K+ IK+IDRL Sbjct: 5 QLLHNHL-QKSVVMHSKRLDSLMCAVTAGMKDRCVTVTGLGRRLRMSIKVKNKIKKIDRL 63 Query: 65 LGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLY 124 +GN HLH+E ++Y+ I PI++VDWS + + +LRA++ GR++TLY Sbjct: 64 VGNSHLHQEIPSIYQCMTGLILGNIRRPIIIVDWSPLGQGTEHQLLRATLPSGGRALTLY 123 Query: 125 EKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVR 184 E A+P S S+K H +FLA L ILP+ TP+IV+DAGF+ W++ V LGW W+ RVR Sbjct: 124 ESAYPESLLTSRKVHQEFLAKLCQILPAGCTPIIVTDAGFRNTWFEDVSSLGWDWVGRVR 183 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 + Y AE W PI +L+ ++S + +G+ L++ +SC + LYK + KGR + Sbjct: 184 NRTHYLAANAEQWVPIKSLYHHATSRPQYIGHGNLSRRTSVSCGLYLYKKQPKGRVLKTL 243 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 C + + +EPW++AT+L ++++ IY+KR QIE FRD K+ G Sbjct: 244 KGAKCRQATSLKIAQREREPWLIATSLHHNTTLSRKIIKIYAKRAQIENGFRDTKNQRLG 303 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 L S+TS + R +++L+I + WL G +++ FQANT++NRNVLS V L Sbjct: 304 FSLNDSKTSHTARLNVLLIIIAIATFGLWLLGGLLKQKQLHFQFQANTIKNRNVLSNVFL 363 Query: 365 GMEVLRHSGYTITREDSL 382 G +++ +S R D L Sbjct: 364 GWQIINNSSPRFKRADWL 381 >UniRef50_Q6LPG7 Hypothetical transposase n=7 Tax=Photobacterium profundum RepID=Q6LPG7_PHOPR Length = 402 Score = 247 bits (630), Expect = 6e-64, Method: Compositional matrix adjust. Identities = 136/393 (34%), Positives = 212/393 (53%), Gaps = 3/393 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKART--KHNI 58 M +IL+ L + P++H RL +L + + + +++T LGR L + + T KH+I Sbjct: 1 MKATEILYQDLRSYYPQIHSSRLKTLCTFIESGIKDQRVSVTYLGRGLESGSVTTKKHDI 60 Query: 59 KRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHG 118 KR DRL+GN HLH ER Y + + PI+L+DWS I Q+ +LRAS+ + G Sbjct: 61 KRADRLIGNAHLHCERHDYYEYMTEQLIGREKHPIILIDWSPINGQEIYQLLRASIPMQG 120 Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWY 178 R + LYEK F SE ++KAH FL +L +LP P+I +DA ++ PW+K+VE GWY Sbjct: 121 RGLVLYEKTFHESELNTEKAHQSFLDELEQVLPEGCQPVITTDAIYRSPWFKAVELKGWY 180 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 W+ RVRG+V + + + ++ LG K C+ +L+K KG Sbjct: 181 WIGRVRGQVSLSQDKETWYTSYQWFKAAKVNKAEHLGVLYYGKVAKFKCEGVLFKRNKKG 240 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQL-VNIYSKRMQIEETFRD 297 R ++ + K + A E W+L LP + + V++Y +RMQIEE FRD Sbjct: 241 RSAKKKRGGVSQRTTDKTHEKDANEAWLLVFKLPPRYKNNANIAVSLYRQRMQIEENFRD 300 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 K+ G+ L ++ + S ERFD +LLIA ++ W G A + QAN+++ R Sbjct: 301 TKNGKLGISLEYANSKSVERFDNLLLIAGLILFIIWCVGRAAVMKKIHYSLQANSLKFRA 360 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLTQ 390 VLST+ +G EV++ YTIT ++ + L++ Sbjct: 361 VLSTIYIGREVVKDGRYTITIDEYVYVLAHLSE 393 >UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii DJ RepID=C1DIQ1_AZOVD Length = 400 Score = 243 bits (619), Expect = 1e-62, Method: Compositional matrix adjust. Identities = 148/382 (38%), Positives = 222/382 (58%), Gaps = 6/382 (1%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M + LH + + P +H +RL +L A ALL + LTLT LGR+LP A +H IKR Sbjct: 1 MQTVQFLHAAFAKALPTIHARRLEALMAAVAALLQGRCLTLTALGRSLPGSAWPRHAIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGNR L ER Y + P++LVDWS I +L +LRA++ L GRS Sbjct: 61 IDRLLGNRQLQAERGLFYWVMLRALLGSFRHPLILVDWSPIDAAGKLFLLRAALPLAGRS 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 + + E P E C + + L LA++LP++ P++V+DAGF+ PW+++VE GW+++ Sbjct: 121 LPVCEVVHP-REGCPR-CQKRLLEALAAMLPADCRPVLVTDAGFQRPWFQAVEIRGWHYV 178 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVR + LG + W P+ +L+ ++S+ K LG +T+S P S Q+ + K +GR+ Sbjct: 179 GRVRNR-DLCRLGEQPWGPVKSLYALASASPKRLGCVEMTRSAPWSTQLCVVKHAPRGRQ 237 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 ++R T T + + EPW+LA+NLP Q+V IY +R QIEE FRDLKS Sbjct: 238 HRRITGTLARDKRSRQSAQRESEPWLLASNLPEAQWNAAQVVAIYRRRTQIEEGFRDLKS 297 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLT---CWLAGVHAQKQGWDKHFQANTVRNRN 357 G+GL R+ R +I+LLIA++ L G+ A++ G ++ FQ+N+++ + Sbjct: 298 HRLGIGLGLHRSRCPRRIEILLLIAVLANYALCLLGLLGLQAREAGHERRFQSNSLKCKR 357 Query: 358 VLSTVRLGMEVLRHSGYTITRE 379 VLS RLG+E R I+RE Sbjct: 358 VLSLWRLGLEYARTGVGAISRE 379 >UniRef50_Q07YD1 Transposase, IS4 family n=6 Tax=Shewanella RepID=Q07YD1_SHEFN Length = 397 Score = 239 bits (609), Expect = 2e-61, Method: Compositional matrix adjust. Identities = 127/366 (34%), Positives = 212/366 (57%), Gaps = 4/366 (1%) Query: 6 ILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLL 65 +L L P +H R SL A + ++ L++T LGR++ +KA+ KH IKR+DRL Sbjct: 6 VLSKCLSLVTPLMHKTRRQSLFSAIESSMNGGALSITGLGRDIESKAKEKHKIKRVDRLC 65 Query: 66 GNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYE 125 N +LH++ +Y + PI+ +DWSD+ ++K+ ++RAS+A GRS+TLYE Sbjct: 66 SNPYLHRDIEFIYTRMTCLLVGKMKQPIIHIDWSDLDDRKQHFLIRASLAAQGRSLTLYE 125 Query: 126 KAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRG 185 + PL+++ K H FL L ++LP++ P+IV+DAGF++PW+K + L W ++ R R Sbjct: 126 EIHPLNKKEKPKTHLSFLTKLKAMLPNDCKPIIVTDAGFRIPWFKQILSLDWDYVGRFRN 185 Query: 186 KVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRST 245 + +W P+ L+ +S+ +K LG L + +++++K KGRK++ +T Sbjct: 186 RTHCRKTIVHHWYPVKRLYIQASARAKNLGVYFLGEQASFCSRLVIFKRTDKGRKDRTAT 245 Query: 246 RTHCHHPSPKIYSAS-AKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 SA KEPW+LAT+L T K++V IY+ RMQIEE+FRD+K+ G Sbjct: 246 GDRTRRSKQSRSSAEREKEPWLLATSLCHSSATAKRVVKIYATRMQIEESFRDVKT---G 302 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 L + S + ++ ++LLIA + Q L G+ + + +QAN++++RNVLS+ + Sbjct: 303 LKMNDSGSRIKDKLSVLLLIACLSQFMLNLLGLAVKAADKHRQYQANSIKHRNVLSSQFI 362 Query: 365 GMEVLR 370 G+ R Sbjct: 363 GLRAYR 368 >UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5E2_9GAMM Length = 397 Score = 236 bits (602), Expect = 1e-60, Method: Compositional matrix adjust. Identities = 136/356 (38%), Positives = 203/356 (57%), Gaps = 5/356 (1%) Query: 39 LTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDW 98 +++ LGR L + A+ KHNIKRIDRL GN + R Y+ + P V +DW Sbjct: 39 ISIAALGRKLKSNAKVKHNIKRIDRLFGNPRVQFARYHYYQEITHRVIGQIRRPCVTIDW 98 Query: 99 SDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLI 158 S + +LRA+V + GR++T+YE++F E + H F+ L SILPS+ P+I Sbjct: 99 SGLTPCGEFHLLRAAVPVKGRAMTIYEQSFRECEYMKQSVHKDFIKTLKSILPSDCKPII 158 Query: 159 VSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAE-NWKPISNLHDMSSSHSKTLGYK 217 V+DAGF+ PW+K V K GW ++ RVR + QY + +W P+ L+ +++ L Sbjct: 159 VTDAGFRNPWFKLVLKFGWDFVGRVRHQTQYQKPEDDTSWLPVKTLYSKATAKPVYLFET 218 Query: 218 RLTKSNPISCQILLY--KSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 +L K+N +S L+ K + + +KN R C S K ++ A EPW+L T+L Sbjct: 219 QLAKANSLSGHFYLFKSKPKQRKKKNLRGKTIRC-SVSLK-HAKGATEPWLLFTSLCNIN 276 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 + + +V IYS+RMQIEE+FRDLK+ + GL LRH R+ R ++ LLIAL+ WLA Sbjct: 277 YSAQDMVKIYSQRMQIEESFRDLKNTSNGLNLRHCRSYEKGRLNVALLIALIANFILWLA 336 Query: 336 GVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQN 391 G+ A+ + FQANT++NRNVLS+ LG + GY I + L A L ++ Sbjct: 337 GLTAKILNVHRSFQANTIKNRNVLSSFSLGTQYFEKFGYKIKLKTFLEALKQLNKD 392 >UniRef50_Q9UH48 Gastric cancer-related protein GCYS-20 n=1 Tax=Homo sapiens RepID=Q9UH48_HUMAN Length = 332 Score = 234 bits (598), Expect = 3e-60, Method: Compositional matrix adjust. Identities = 110/112 (98%), Positives = 110/112 (98%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR Sbjct: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRA 112 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMV R Sbjct: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVFRV 112 >UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2NZH2_XANOM Length = 407 Score = 204 bits (518), Expect = 7e-51, Method: Compositional matrix adjust. Identities = 123/371 (33%), Positives = 205/371 (55%), Gaps = 8/371 (2%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRL 64 ++L L +H R +L A AL+ LTL +L R P R + +K DRL Sbjct: 8 EVLQKCLSNSLSGMHALRQRTLLRAVEALVHGGRLTLIDLARAWPGATRVRAPLKACDRL 67 Query: 65 LGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLY 124 L NR L ER A+ + A ++ G+ P++++DWSD++ K +LRA+V + GR++TL Sbjct: 68 LCNRTLQVERSAIEQDMAHWLLRGD-QPVIVIDWSDLKPDKSWCLLRAAVPVGGRTLTLL 126 Query: 125 EKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVR 184 + +Q S A +FL L +++P + P++V+DAGF+ PW+++V +GW W+ R+R Sbjct: 127 DMVVSRKQQGSPGAEKRFLQQLRALIPDDVRPILVTDAGFRTPWFRAVSAMGWDWVGRLR 186 Query: 185 GKVQY--ADL--GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLY-KSRSKGR 239 G+ Q D+ A W LH ++S+ ++ L + +S+P+ C+++LY K+R + Sbjct: 187 GRTQVKPQDVPDDAVQWIDSRRLHALASNRARALPPMQANRSDPLDCRLVLYAKTRQGRQ 246 Query: 240 KNQRSTRTHCHHPSPKIYSAS-AKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 + R + S + +A+ +EPW++ + + + KQLVN+Y++RMQIE FRDL Sbjct: 247 QRNRRSSAKVSRASSSLKAAAREREPWLIVASPQLHAPSAKQLVNLYARRMQIELAFRDL 306 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNV 358 KS YG + S T ER I+LL+ + WLAG+ + G + + R + Sbjct: 307 KSHRYGQAMEDSLTRRGERLQILLLLNTLATFASWLAGLGCEATGIAQWLSPRSS-TRKL 365 Query: 359 LSTVRLGMEVL 369 ST+R+G E L Sbjct: 366 YSTLRVGREAL 376 >UniRef50_Q17U39 Transposase n=11 Tax=Gammaproteobacteria RepID=Q17U39_ECOLX Length = 394 Score = 197 bits (500), Expect = 7e-49, Method: Compositional matrix adjust. Identities = 131/375 (34%), Positives = 197/375 (52%), Gaps = 16/375 (4%) Query: 6 ILHDSLYQFCPE-LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRL 64 +L D L P+ +H R + L A AL T+T +GR +P + K +IKR DRL Sbjct: 9 MLADFLTFVTPKSMHKARFSVLLDAVTALAKDACCTVTAIGRAMPGSS-DKVSIKRADRL 67 Query: 65 LGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLY 124 L N +L +E +Y + I T P++LVDWS+ KR +LRAS+A GR++TL Sbjct: 68 LNNPNLQRELPLIYAALTASIVGHKTKPMILVDWSNADTAKRHFILRASIAADGRALTLL 127 Query: 125 EKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVR 184 +K + H FL L ++LP + P+IV+DAGFKVPW K V KLGW++++RVR Sbjct: 128 QKIAAAEDYTCPHLHGAFLKQLKAMLPKDCKPVIVTDAGFKVPWLKQVRKLGWHYVARVR 187 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 G V+ + + ++ L+ + K++G L ++ Q +L KG K + Sbjct: 188 GNVKLKLAEQDKFISVNQLYRQAKKDPKSVGKIMLAQTQHYETQAVLV---GKGYKLLKR 244 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 + + KEPW+L ++L ++ YS RMQIEE+FRD KS YG Sbjct: 245 DKNKTY-----------KEPWLLVSSLADCHGYADKIAKCYSSRMQIEESFRDQKSHRYG 293 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 LG T R +I+LL+A ++ +L G A+K G +QANTV+NR VL+ L Sbjct: 294 LGSDLHGTKKKSRLEILLLLAALVNWFHYLLGSAAEKAGLHLRYQANTVKNRRVLALNFL 353 Query: 365 GMEVLRHSGYTITRE 379 G+ + + I R+ Sbjct: 354 GILLCKEPKQRIRRQ 368 >UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellular organisms RepID=B8B8E6_ORYSI Length = 753 Score = 187 bits (475), Expect = 6e-46, Method: Compositional matrix adjust. Identities = 88/89 (98%), Positives = 89/89 (100%) Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK 346 +RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK Sbjct: 358 ERMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK 417 Query: 347 HFQANTVRNRNVLSTVRLGMEVLRHSGYT 375 HFQANTVRNRNVLSTVRLGMEVLRHSGYT Sbjct: 418 HFQANTVRNRNVLSTVRLGMEVLRHSGYT 446 >UniRef50_Q5X8W8 Putative uncharacterized protein n=1 Tax=Legionella pneumophila str. Paris RepID=Q5X8W8_LEGPA Length = 398 Score = 173 bits (439), Expect = 8e-42, Method: Compositional matrix adjust. Identities = 105/356 (29%), Positives = 180/356 (50%), Gaps = 9/356 (2%) Query: 18 LHLKRLNSLTLACHALLDCKT-LTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLA 76 +H KR L L+D T L++TE+G+ L +K K I + N L ++ + Sbjct: 19 IHAKRKQCLVRFLSDLMDYDTTLSVTEIGKKLTSKTTVKSKIYAAQTFVNNFKLERDIVC 78 Query: 77 VYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSK 136 +Y+ F S +VL+DW+ + VL AS+A HGRS+ +Y + SEQ + Sbjct: 79 IYKSLTHFFWSHAKEIVVLIDWTGGCSEG-YHVLEASIAAHGRSIPIYHEVHSESEQENA 137 Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAEN 196 + H QFL L ++PS+ + I++DAGF W++ V +LGW + R+ Y G N Sbjct: 138 EIHRQFLLRLKEVIPSSLSVTIITDAGFHREWFQQVLELGWDVIGRIYSLYCYQIEGETN 197 Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKS-NPISCQILLYKSRSKGRKNQRSTRTHCHHPSPK 255 W + ++ + LG +L K+ + + YK + G+ ++ + H K Sbjct: 198 WHKVKDILFEGIGKASALGKVKLGKTKKAVEGYLYTYKEKLSGKVRKKKNKYPSH---DK 254 Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 +S K W+L ++L R LV+ Y KRMQIE+ F+D+K+ G+G R +++S Sbjct: 255 AHSNYYKNGWVLFSSLNKHARF---LVSYYKKRMQIEQNFKDIKNEQLGMGFRRNQSSGK 311 Query: 316 ERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRH 371 R +++ +A++L + W G+ + + +QANT++N+ V S + L RH Sbjct: 312 TRVNMLFFLAVLLIMIAWWFGLMIESLNKHRSYQANTIKNKRVRSFIHLARMAYRH 367 >UniRef50_A7MW84 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7MW84_VIBHB Length = 235 Score = 158 bits (399), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 84/228 (36%), Positives = 130/228 (57%), Gaps = 6/228 (2%) Query: 163 GFKVPWYKSVEKLGWYWLSRVRG--KVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLT 220 GFKVPW+K +E+ GWYWL RVRG K++ D W + + + LG LT Sbjct: 3 GFKVPWFKPIEQQGWYWLGRVRGNSKLRVND----RWCSADEVFVQAQYKPQHLGTAELT 58 Query: 221 KSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQ 280 K + CQ+ LY+ +SKGRK + + + + ++ +EPW+L +NLP E ++ Sbjct: 59 KQHQYPCQVCLYRKKSKGRKAKNWSGSLQRNTVSLSHAKGEREPWLLVSNLPGETWFAER 118 Query: 281 LVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQ 340 +V +Y++RM IEE FRD K+ YGL L S ++ +R +I+L+I ++ Q + G A Sbjct: 119 VVALYTQRMSIEEGFRDTKNERYGLALNFSGSACPKRIEILLMIGMLTQFALLVVGKVAY 178 Query: 341 KQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLL 388 +G+ K FQANT+R R VLS LG E++ Y+ + +D +A L Sbjct: 179 LKGYYKDFQANTIRTRRVLSYFFLGKELIGREAYSFSVKDLALAVGGL 226 >UniRef50_B4UH67 Transposase IS4 family protein n=3 Tax=Proteobacteria RepID=B4UH67_ANASK Length = 384 Score = 112 bits (280), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 98/364 (26%), Positives = 171/364 (46%), Gaps = 47/364 (12%) Query: 14 FCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA--RTKHNIKRIDRLLGNRHLH 71 F +LH KR+ SL A +L+ L + +G +L ++KH +K++DR+L N Sbjct: 19 FAEDLHAKRVASLAGAAVGVLEGAALGIHAIGNSLAVAEGLKSKHAVKQVDRMLSN---- 74 Query: 72 KERLAVYRWHASFI--CSGNTMPIVL-VDWSDIREQKRLMVLRASVALHGRSVTLYEKAF 128 E + V+R S++ G+ + IV+ +DW+D E + + + + HGR+ L K Sbjct: 75 -EGIPVWRLFGSWVPCVVGDRLEIVVALDWTDFDEDDQSTIALSMITSHGRATPLLWKTV 133 Query: 129 PLSE-QCSKKAH-DQFLADLASILPSNTTPLIVSDAGF--KVPWYKSVEKLGWYWLSRVR 184 SE + + H D L +LP +++D GF + + ++LG+ ++ R R Sbjct: 134 MKSELKGWRNEHEDVLLERFREVLPEGVKVTVLADRGFGDQALYELLKDQLGFGFIVRFR 193 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYK--RLTKSNPISCQILLYKSRSKGRKNQ 242 G V+ E +P D S+ +TL + R+TKS ++ K++ Sbjct: 194 GVVKVTSAEGET-RPA---KDWVPSNGRTLRLRSARVTKSRREIGAVVCVKAKG------ 243 Query: 243 RSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPA 302 KE W LAT+ + ++V +Y++R IEE+FRD K+ Sbjct: 244 ------------------MKEAWHLATSHG--DKPGSEIVALYARRFTIEESFRDQKNLR 283 Query: 303 YGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTV 362 +G+GL +R + R D +LL++ + + G + G DK + NTV+ R + S + Sbjct: 284 FGMGLSETRIADPARRDRLLLVSAVAIALLTILGAAGEALGLDKWLKTNTVKRRTI-SLL 342 Query: 363 RLGM 366 R GM Sbjct: 343 RQGM 346 >UniRef50_D0LPB8 Transposase IS4 family protein n=4 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPB8_HALO1 Length = 418 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 92/352 (26%), Positives = 153/352 (43%), Gaps = 44/352 (12%) Query: 14 FCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT--KARTKHNIKRIDRLLGNRHLH 71 F +H KR+ SL+ A + L++ +G L +KH IK++DRLL N L Sbjct: 55 FEGNMHSKRVESLSNAVVGVTHASALSVQAIGHGLAVALDKNSKHAIKQVDRLLSNARL- 113 Query: 72 KERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 + V+ ++ S T I+ +DW++ + + + +HGRS L K S Sbjct: 114 -DPWQVFSVWVPYVLSERTEAIIALDWTEFAKDGQSTCAAHLMTMHGRSTALAWKTVEKS 172 Query: 132 EQCSKK--AHDQFLADLASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQ 188 + ++ D+ + L I+P + +++D GF + + LGW ++ R R + Sbjct: 173 QLRGQQTAVEDEVIDHLHRIIPPDIEVTLLADRGFAAAERFIHLTTLGWNYVIRFRENIH 232 Query: 189 YADLG----AENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 + G A +W P S K L ++ P+ + ++K Sbjct: 233 ISHQGQTQPARDWVP------KSGRAKKLLDVGITCRAEPLEAVVCVHK----------- 275 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 A K+ W LATNL + +V +Y++R IEETFRD K +G Sbjct: 276 --------------AQMKQAWCLATNLVDA--SASHVVKLYARRFTIEETFRDQKDLRFG 319 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 LGL + R D +LL++ + L G A+ G+D+ +ANTVR R Sbjct: 320 LGLSATHIRDCGRRDRLLLLSAIAHALLTLLGAAAESIGFDRMMKANTVRRR 371 >UniRef50_A7MYH1 Putative uncharacterized protein n=4 Tax=Vibrio RepID=A7MYH1_VIBHB Length = 145 Score = 94.7 bits (234), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 47/119 (39%), Positives = 69/119 (57%), Gaps = 1/119 (0%) Query: 146 LASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHD 205 LA+ + TP+IVSDAGF+ W++ V GW+WL RVRG+V G ++W+ + Sbjct: 28 LATKTVNGCTPIIVSDAGFRNTWFRQVANKGWFWLGRVRGEVSI-KCGEDSWQWNKTFYP 86 Query: 206 MSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP 264 ++ + LG +L K +P+ C LYKS KGRK R +RT H + K++ AKEP Sbjct: 87 QATDKPQFLGESQLAKRSPLECFAYLYKSHPKGRKAHRHSRTCQKHSAGKVFHKGAKEP 145 >UniRef50_Q47076 BfpT, bfpV, bfpW and transposase genes, complete cds n=53 Tax=Enterobacteriaceae RepID=Q47076_ECOLX Length = 186 Score = 94.7 bits (234), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 46/128 (35%), Positives = 81/128 (63%), Gaps = 2/128 (1%) Query: 251 HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHS 310 + + K S SAKE W++ + + R ++++ +YS+RMQIE+ FRD K+ +G GLR S Sbjct: 40 NKTDKEQSKSAKEAWLIFSRTN-DFR-AREIIKLYSRRMQIEQNFRDEKNGRFGFGLRAS 97 Query: 311 RTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLR 370 ++ S+ R ++ L+A + + WL G HA+ +G + +Q N++++R V+S + L VLR Sbjct: 98 KSRSTGRILVLSLLATLSTIVMWLLGYHAENKGLHQKYQVNSIKSRRVISYLTLAKNVLR 157 Query: 371 HSGYTITR 378 HS + + R Sbjct: 158 HSPFILRR 165 >UniRef50_Q5GUK2 ISxac1 transposase n=1 Tax=Xanthomonas oryzae pv. oryzae RepID=Q5GUK2_XANOR Length = 361 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 59/184 (32%), Positives = 93/184 (50%), Gaps = 10/184 (5%) Query: 192 LGAENW----KPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR--KNQRST 245 L A NW P++ S ++T+ R S+P C+++LY +GR +N+RS Sbjct: 150 LAAGNWVSVGPPLAAGAPSSGLVARTMQANR---SDPRDCRLVLYAKTPQGRQQRNRRSP 206 Query: 246 RTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGL 305 S +A +EPW++ + + + KQLVN+Y++RMQIE FR+LKS YG Sbjct: 207 AKVSRASSSLKAAAREREPWLIVASPQLHAPSAKQLVNLYARRMQIELAFRNLKSHRYGQ 266 Query: 306 GLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLG 365 + S T ER I+LL+ + WLAG+ + G + + R + T+R+G Sbjct: 267 AMEDSLTRRGERLQILLLLTTLASFASWLAGLGCEATGIARWLSPRSS-TRKLYLTLRVG 325 Query: 366 MEVL 369 E L Sbjct: 326 REAL 329 >UniRef50_Q6LGR5 Putative transposase similar to Tn10 n=1 Tax=Photobacterium profundum RepID=Q6LGR5_PHOPR Length = 105 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 36/55 (65%), Positives = 42/55 (76%) Query: 32 ALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFIC 86 ALL LTLT LGR+LP+KA+TKH IKR+DRLLGN HLH +RL +YRWH C Sbjct: 6 ALLSNDALTLTLLGRSLPSKAKTKHCIKRVDRLLGNNHLHHDRLDIYRWHCHQFC 60 >UniRef50_A9AVJ1 Transposase IS4 family protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVJ1_HERA2 Length = 378 Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 87/370 (23%), Positives = 153/370 (41%), Gaps = 45/370 (12%) Query: 10 SLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRH 69 +L+QF P LH +RL + LL +++ L+ + +L + A I RI R L N Sbjct: 22 TLHQFHPTLHARRLATWAWVIVGLLHARSVHLSAVALHLASDAEAAGRIARIRRWLANPW 81 Query: 70 LHKERLAVYRWHASFICSG--NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKA 127 L + L YR + + + N +++D + K L ++R S++ R++ L + Sbjct: 82 LDTQFL--YRPLITHVLTAWRNRDITIMIDGCYVNHDK-LQMVRLSLSHCYRAIPLAWQV 138 Query: 128 FPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFK-VPWYKSVEKLGWYWLSRVRG- 185 S ++ + L + +L ++D GF+ W S ++ GW ++ R+ Sbjct: 139 MSHHGNVSVESCQRMLNRVQQLLIGTRRVTFLADRGFRDWAWAASCQRRGWDYIIRIANT 198 Query: 186 -KVQYADLGAENWKPISNLHDMSSSHSKTLGYKR--LTKSNPISCQILLYKSRSKGRKNQ 242 +++ D P ++ M+ K++ + LT+ C I + +R+ K Sbjct: 199 TTIRWDD------GPWMAINTMAVKPGKSVYLRNVLLTQDGEWRCTIAITWTRATKTK-- 250 Query: 243 RSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPA 302 P+ + + +EP K ++N Y +RM IEE+FRD KS Sbjct: 251 ---------PAERCAVITNREP-------------SKWILNHYLRRMHIEESFRDDKSG- 287 Query: 303 YGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTV 362 G L SR +R D +LL + L + G K H R LS Sbjct: 288 -GFDLDASRLRDPQRLDRLLLAIAVATLWMYELGERVLKDEQRAHVDPGYQRQ---LSVF 343 Query: 363 RLGMEVLRHS 372 +LG LR + Sbjct: 344 QLGWRWLRRA 353 >UniRef50_Q72IB6 Transposase n=3 Tax=Thermus thermophilus HB27 RepID=Q72IB6_THET2 Length = 365 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 88/332 (26%), Positives = 139/332 (41%), Gaps = 47/332 (14%) Query: 40 TLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERL---AVYRWHASFICSGNTMPIVLV 96 TL++L R P + + R+ R L + L A+ A +P++ V Sbjct: 45 TLSDLARRTPLPTLAQSRLNRLWRFLHHPTLQNPWALTEALLPLLARRFPKDRPLPLI-V 103 Query: 97 DWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSK-KAHDQFLADLA-SILPSNT 154 DW+ E R L A++ L GR++ + PLS S+ + ++FL L ++ Sbjct: 104 DWT-FAEDGRHQALVAALPLKGRALVVAFALHPLSPFPSQNRVEEEFLHRLGRAVQDLGY 162 Query: 155 TPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKT 213 TPL + D GF +V + ++ G +L R+R + G + P+ Sbjct: 163 TPLFLLDRGFDRVSLMRKLQGWGMGFLIRLRQNREVEPRGGKR-LPLKE----------- 210 Query: 214 LGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPV 273 GY+R+ +P+ ++ L+ G + T +P +EPW LA + P Sbjct: 211 -GYRRVV--HPLREEVRLF-----GHGGEEVEVTLLVYPG-------GREPWYLAYSGPF 255 Query: 274 EIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCW 333 + P Y RM IEE FRDLK +GL RT +S R + L+AL + L Sbjct: 256 GGKPP------YGWRMWIEEGFRDLKGQGFGLDRHRLRTGASLR-GWLWLLALGMALLI- 307 Query: 334 LAGVHAQKQGWDKHFQANTVRNRNVLSTVRLG 365 L G Q + W A+ R S RLG Sbjct: 308 LLGARLQGREWLPRLLAHPERQ----SLFRLG 335 >UniRef50_B2JAE4 Transposase, IS4 family protein n=8 Tax=Cyanobacteria RepID=B2JAE4_NOSP7 Length = 448 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 89/347 (25%), Positives = 150/347 (43%), Gaps = 49/347 (14%) Query: 16 PELHLKRLNSL---------TLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLG 66 PEL+ K L SL TL + + + K + L ++ +LP + + K++ R L Sbjct: 3 PELYQKHLQSLLSQSELIFLTLVINVVQNIKDVKLEKISESLPLFIQCQSRRKKLQRFLL 62 Query: 67 NRHLHKERL---AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTL 123 L+ E L + RW A I GN + +D ++ + + LM+ SV R++ + Sbjct: 63 LPILNIEELWFPIIERWLAQ-IFLGNHRIYLAIDRTNWKRKNLLMI---SVIFQKRAIPI 118 Query: 124 YEKAFPLSEQCSKKAHDQFLADLASILP--SNTTPLIVSDAGF-KVPWYKSVEKLGWYWL 180 Y K L++ S +Q A L I+P N +++ D F V K +++ G+ + Sbjct: 119 YFKL--LAKLGSSNLSEQTKA-LTKIIPLFKNYKTVVLGDREFCSVSLAKWLDEQGFEFC 175 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 R++ K + +L A W I +L + S + +TK+ Q+ + K +K Sbjct: 176 LRLK-KNENIELKAHLWCEIKDL-GLKPGTSFFVSDATVTKTK----QVKGFNVACKWKK 229 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 N R + AKE W + TN+ +I + Y KR IEE FRD KS Sbjct: 230 NYRQNK--------------AKEGWFILTNMNSKITA----IQAYQKRFDIEEMFRDFKS 271 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKH 347 Y L +RF ++LI + L G + + +G K+ Sbjct: 272 GGYNL---EKTNVEGKRFIALVLIISLADTIATLQGQNIKSKGIAKY 315 >UniRef50_B4WTK1 Putative uncharacterized protein n=6 Tax=Synechococcus sp. PCC 7335 RepID=B4WTK1_9SYNE Length = 411 Score = 58.2 bits (139), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 88/379 (23%), Positives = 148/379 (39%), Gaps = 65/379 (17%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA-RTKHNIKRIDR 63 D L L Q CP HL L + AL+ +++LT+ LP + + +R+ R Sbjct: 28 DALKAWLGQDCPWAHLSHLTTCCWMVFALIQTGSVSLTKWTTYLPCRGLYAQSKQRRVRR 87 Query: 64 LLGNRHLHKERL-------AVYRWHAS--FICSGNTMPIVLVDWSDIREQKRLMVLRASV 114 LGN ++ RL A+ W A ++C +D S EQ ++R +V Sbjct: 88 WLGNSRINIHRLYKPLIQAALATWEAECLYLC---------LDTSLFWEQ--YCLIRLAV 136 Query: 115 ALHGRSVTLYEKAFPL-SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSV 172 GRS+ L + S + +A+++ L LPSN ++++D GF + Sbjct: 137 VYRGRSIPLAWRVLEHNSASVAFEAYEELLRQSTQYLPSNANMILLADRGFVHTRAMTLI 196 Query: 173 EKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLY 232 ++LGW++ R++ W+P S S H L Sbjct: 197 KQLGWHYRIRIKSDTWI-------WRPGSGWCQPKSFH--------------------LE 229 Query: 233 KSRSKGRKNQRSTRTHCHHPSPKIYSAS--AKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 + R+ + R R + P I + E W + ++ P T Q Y+ R Sbjct: 230 RGRALCFHHIRLHRHEQYGPVHVIIGRNNINGELWAVVSDQP----TSPQTFMEYALRFD 285 Query: 291 IEETFRDLKSPAYGLGLRHSRT-SSSERFDIMLLIALMLQLTCWLAGVHAQKQGW-DKHF 348 IEE F D +S + L R + R +L +A + +A V + ++ W D H+ Sbjct: 286 IEEGFLDDQSAGWNLQRSEIRGLTDLSRLWFILAVATLYVTAQGVAVVQSGRRRWIDTHW 345 Query: 349 QANTVRNRNVLSTVRLGME 367 S R+G+E Sbjct: 346 DRGN-------SYFRIGLE 357 >UniRef50_C7QY62 Transposase IS4 family protein n=9 Tax=Cyanothece RepID=C7QY62_CYAP0 Length = 365 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 77/337 (22%), Positives = 133/337 (39%), Gaps = 37/337 (10%) Query: 26 LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERL---AVYRWHA 82 L++ + L + L EL P + + IK++ R L + E L + W Sbjct: 24 LSILVNLLQSLHLVRLEELANRFPHPIQLRSRIKKLQRFLSLPQFNLETLWIPIIESWIK 83 Query: 83 SFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQF 142 G + +V +D S RE + V S+ + R++ L P + + Sbjct: 84 QEWKRGEIIYLV-IDRSQWREINLIFV---SLIYNHRAIPLCVDWLPKKGNSNLEQQKAI 139 Query: 143 LADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISN 202 L + S L L+ V K + + + S K +YA+L + W Sbjct: 140 LEVILSRLKDYKIVLLGDREFCGVDLAKWLSEAKEVYFSLRLKKNEYAELAPQIW---FQ 196 Query: 203 LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAK 262 L D+ + ++ Y+ + + K++ G N + + SAK Sbjct: 197 LKDLGLNPGMSVYYRGVK----------ITKTKGFGEVNLAAKWKRNYQ------GKSAK 240 Query: 263 EPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIML 322 EPW++ TNL + Q ++ YSKRM IEE FRD K Y L ++ + ++L Sbjct: 241 EPWLIMTNL----ESLSQAMSAYSKRMGIEEMFRDFKRGGY--QLEGTQVTKERLISLVL 294 Query: 323 LIALMLQLTCW--LAGVHAQKQGWDKHFQANTVRNRN 357 LI L CW +G +++G K+ T +R+ Sbjct: 295 LICLA---YCWSTFSGQSLKRKGVAKYVSRPTSGHRS 328 >UniRef50_Q1QFL8 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QFL8_NITHX Length = 191 Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 29/101 (28%), Positives = 49/101 (48%), Gaps = 2/101 (1%) Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 +++ K W LA + T +++ N Y++R IE FRD K +G+GL R + Sbjct: 46 VHARDMKAAWCLAASNAEA--TAREITNHYARRWTIEPGFRDTKDLRFGMGLGVLRIADP 103 Query: 316 ERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 +R D +LL+ + L G + G D+H + T + R Sbjct: 104 QRRDRLLLLNAFAIVLLTLLGAAGESLGMDRHLKVATAKRR 144 >UniRef50_A7N4N2 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N4N2_VIBHB Length = 54 Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 24/50 (48%), Positives = 33/50 (66%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT 50 M ++ IL ++ CP++H KRL SL LA A+LD LTLT++GR L T Sbjct: 1 MRDIQILQQTIENQCPDIHKKRLRSLMLATKAVLDGSNLTLTKIGRALST 50 >UniRef50_Q7NHH4 Gll2563 protein n=2 Tax=Gloeobacter violaceus RepID=Q7NHH4_GLOVI Length = 212 Score = 48.1 bits (113), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 36/107 (33%), Positives = 49/107 (45%), Gaps = 9/107 (8%) Query: 263 EPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIML 322 EPW L TNL TP++ + Y R IEE FRD KS Y L +RF ML Sbjct: 42 EPWWLLTNLS----TPQEAITWYRCRWGIEEMFRDCKSGGYNL---EKLRVQPKRFKRML 94 Query: 323 LIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNV--LSTVRLGME 367 ++ M + G ++QG K+ R V ST R+G++ Sbjct: 95 MVLAMAMSLSVMHGKQLKRQGLQKYVSRVAEPGRVVKRRSTFRVGLQ 141 >UniRef50_Q10V90 Transposase, IS4 family n=7 Tax=Trichodesmium erythraeum IMS101 RepID=Q10V90_TRIEI Length = 275 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 21/108 (19%) Query: 251 HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHS 310 + PK S E W + TNL + P ++ IYS+RM IE F+D K+ AY L + Sbjct: 123 YKKPKYRDKSVSEKWYILTNLSL----PGKIKKIYSQRMGIEAMFKDYKTGAY--NLESA 176 Query: 311 RTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNV 358 + + + +++LLIA+ ++ FQ ++N+ V Sbjct: 177 KANETRLNNLILLIAISYAISS---------------FQVQKIKNKGV 209 >UniRef50_Q1ARL9 Transposase, IS4 family n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1ARL9_RUBXD Length = 335 Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust. Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 11/74 (14%) Query: 256 IYSASAKEP-WILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 ++ KEP W++ LP P +LV +Y +RM+IE+TFRD KS LG+ + + Sbjct: 240 VWRKGCKEPLWVMGNFLP-----PDELVEVYEERMKIEQTFRDAKSL---LGM--EKVMN 289 Query: 315 SERFDIMLLIALML 328 +R + + +ALML Sbjct: 290 KKRVQLEITLALML 303 >UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=Q73IB8_WOLPM Length = 442 Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust. Identities = 44/159 (27%), Positives = 76/159 (47%), Gaps = 10/159 (6%) Query: 146 LASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADLGA-ENWKPISNL 203 L++IL ++ L++SD G+ VP +K + ++G Y++SR + D+ + + + L Sbjct: 183 LSNILSND---LLISDLGYFVPSSFKQINEIGAYFISRYKSDTNIYDVETNQKMELLECL 239 Query: 204 HDMSSSHSKTLGYKRLTKSNPISCQILLY-KSRSKGRKNQRSTRTHCHHPSPKIYSASAK 262 D ++ L K I CQ L +S ++ RK R R+ + S + Sbjct: 240 EDKLFLENEVLLGKEAKIRVRIICQKLTEEQSMARRRKANRLARSQGYTSSKR---NQKL 296 Query: 263 EPW-ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 W I TN+P + +Q++ IY R QIE F+ KS Sbjct: 297 LNWSIFITNVPENKISAEQVLTIYRVRWQIELLFKLYKS 335 >UniRef50_B4ABV5 Transposase n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL317 RepID=B4ABV5_SALNE Length = 63 Score = 41.6 bits (96), Expect = 0.059, Method: Compositional matrix adjust. Identities = 17/37 (45%), Positives = 28/37 (75%), Gaps = 2/37 (5%) Query: 259 ASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETF 295 +SAKEPW++ +N + TP+ ++ +YS+RMQIE+ F Sbjct: 7 SSAKEPWLIFSN--INDITPRSIMKLYSRRMQIEQNF 41 >UniRef50_B0BZT8 Transposase, IS4 family n=21 Tax=Cyanobacteria RepID=B0BZT8_ACAM1 Length = 397 Score = 41.2 bits (95), Expect = 0.072, Method: Compositional matrix adjust. Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 9/105 (8%) Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRD 297 G+++Q + + K+PW + T+LP T +Q +++Y+ R IE F+D Sbjct: 230 GKRDQLGPFNLAFYWKRQYRGKGGKDPWFIMTSLP----TLEQALSLYACRWGIEMMFKD 285 Query: 298 LKSPAYGLGLRHSRTSSSE-RFDIMLLIALMLQLTCWLAGVHAQK 341 KS Y L RT ++ RF ++L+ M LAG +K Sbjct: 286 CKSGGYNL----ERTKVNDARFLALVLVMAMAYCLATLAGYGLKK 326 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE 549 e-155 UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID... 451 e-125 UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobact... 451 e-125 UniRef50_Q6LPG7 Hypothetical transposase n=7 Tax=Photobacterium ... 426 e-117 UniRef50_A7N7H3 Putative uncharacterized protein n=31 Tax=Vibrio... 423 e-117 UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae Re... 410 e-113 UniRef50_C6MY57 Putative transposase, IS4 family protein n=1 Tax... 409 e-112 UniRef50_Q07YD1 Transposase, IS4 family n=6 Tax=Shewanella RepID... 406 e-112 UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromon... 406 e-111 UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii ... 374 e-102 UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 1... 374 e-102 UniRef50_Q17U39 Transposase n=11 Tax=Gammaproteobacteria RepID=Q... 353 6e-96 UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2... 348 2e-94 UniRef50_Q5X8W8 Putative uncharacterized protein n=1 Tax=Legione... 347 5e-94 UniRef50_B4UH67 Transposase IS4 family protein n=3 Tax=Proteobac... 259 1e-67 UniRef50_A7MW84 Putative uncharacterized protein n=2 Tax=Vibrio ... 259 1e-67 UniRef50_D0LPB8 Transposase IS4 family protein n=4 Tax=Haliangiu... 256 1e-66 UniRef50_A9AVJ1 Transposase IS4 family protein n=1 Tax=Herpetosi... 251 2e-65 UniRef50_B4WTK1 Putative uncharacterized protein n=6 Tax=Synecho... 237 7e-61 UniRef50_B2JAE4 Transposase, IS4 family protein n=8 Tax=Cyanobac... 210 7e-53 UniRef50_C7QY62 Transposase IS4 family protein n=9 Tax=Cyanothec... 206 2e-51 UniRef50_Q72IB6 Transposase n=3 Tax=Thermus thermophilus HB27 Re... 203 8e-51 UniRef50_Q9UH48 Gastric cancer-related protein GCYS-20 n=1 Tax=H... 167 8e-40 UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellula... 144 5e-33 UniRef50_A7MYH1 Putative uncharacterized protein n=4 Tax=Vibrio ... 142 2e-32 UniRef50_Q47076 BfpT, bfpV, bfpW and transposase genes, complete... 141 6e-32 UniRef50_Q5GUK2 ISxac1 transposase n=1 Tax=Xanthomonas oryzae pv... 139 2e-31 UniRef50_Q1QFL8 Putative uncharacterized protein n=1 Tax=Nitroba... 110 1e-22 UniRef50_Q10V90 Transposase, IS4 family n=7 Tax=Trichodesmium er... 109 2e-22 UniRef50_Q7NHH4 Gll2563 protein n=2 Tax=Gloeobacter violaceus Re... 95 6e-18 UniRef50_Q6LGR5 Putative transposase similar to Tn10 n=1 Tax=Pho... 81 5e-14 UniRef50_A7N4N2 Putative uncharacterized protein n=1 Tax=Vibrio ... 65 4e-09 Sequences not found previously or not previously below threshold: UniRef50_A7NGF0 Transposase IS4 family protein n=1 Tax=Roseiflex... 154 4e-36 UniRef50_B0JUB6 Transposase n=18 Tax=Cyanobacteria RepID=B0JUB6_... 139 1e-31 UniRef50_B7KME5 Transposase IS4 family protein n=42 Tax=Cyanobac... 120 1e-25 UniRef50_B0BZT8 Transposase, IS4 family n=21 Tax=Cyanobacteria R... 118 4e-25 UniRef50_C7RIL9 Transposase IS4 family protein n=1 Tax=Candidatu... 116 1e-24 UniRef50_A8ZRP2 Transposase IS4 family protein n=1 Tax=Deinococc... 114 5e-24 UniRef50_A5UQG7 Transposase, IS4 family n=1 Tax=Roseiflexus sp. ... 112 3e-23 UniRef50_B5VWL5 Transposase IS4 family protein n=6 Tax=Arthrospi... 110 7e-23 UniRef50_UPI000038476B hypothetical protein Magn03010330 n=1 Tax... 108 3e-22 UniRef50_Q1VRR5 Putative uncharacterized protein n=8 Tax=Bactero... 106 2e-21 UniRef50_D2SUD5 Transposase n=1 Tax=uncultured bacterium psy1 Re... 103 2e-20 UniRef50_Q2S0J1 Putative transposase n=1 Tax=Salinibacter ruber ... 102 3e-20 UniRef50_Q5ZXB3 ORF2 transposase n=9 Tax=Legionella RepID=Q5ZXB3... 100 2e-19 UniRef50_A9AUQ0 Transposase IS4 family protein n=3 Tax=Herpetosi... 99 2e-19 UniRef50_A8ZMZ5 Putative uncharacterized protein n=1 Tax=Acaryoc... 99 3e-19 UniRef50_B4WNR8 Putative uncharacterized protein n=1 Tax=Synecho... 98 5e-19 UniRef50_B4W0I0 Transposase, IS4 family protein n=1 Tax=Microcol... 97 9e-19 UniRef50_A8ZQL6 Putative uncharacterized protein n=1 Tax=Acaryoc... 97 1e-18 UniRef50_Q6ZER7 Putative uncharacterized protein sll5063 n=1 Tax... 95 4e-18 UniRef50_D1C6P8 Putative uncharacterized protein n=2 Tax=Sphaero... 89 3e-16 UniRef50_B5VUF1 Transposase IS4 family protein n=17 Tax=Arthrosp... 87 1e-15 UniRef50_C4YZ17 Transposase, IS4 family protein n=4 Tax=Ricketts... 84 8e-15 UniRef50_Q3M186 Putative uncharacterized protein n=2 Tax=Anabaen... 84 9e-15 UniRef50_C4ILZ9 Putative iso-IS10R ORF n=1 Tax=Clostridium butyr... 82 4e-14 UniRef50_Q1AUS1 Putative uncharacterized protein n=4 Tax=Rubroba... 82 4e-14 UniRef50_C1XLC5 Transposase family protein n=1 Tax=Meiothermus r... 81 6e-14 UniRef50_B4VLK2 Transposase, IS4 family protein n=1 Tax=Microcol... 81 6e-14 UniRef50_B1XQT5 Tn10-like transposase (IS4 family) n=14 Tax=Cyan... 81 1e-13 UniRef50_A5UPF7 Transposase, IS4 family n=1 Tax=Roseiflexus sp. ... 80 1e-13 UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus a... 79 3e-13 UniRef50_Q1IXF5 Transposase, IS4 n=6 Tax=Bacteria RepID=Q1IXF5_D... 77 8e-13 UniRef50_C5UVK8 Putative transposase n=1 Tax=Clostridium botulin... 76 4e-12 UniRef50_A5UY16 Transposase, IS4 family n=9 Tax=Roseiflexus sp. ... 74 9e-12 UniRef50_B5K928 Transposase, IS4 n=23 Tax=Alphaproteobacteria Re... 73 2e-11 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 72 4e-11 UniRef50_Q1QJQ9 Putative uncharacterized protein n=1 Tax=Nitroba... 71 6e-11 UniRef50_C8Q1E5 Transposase, IS4 family n=1 Tax=Enhydrobacter ae... 71 6e-11 UniRef50_Q1QGK1 Putative uncharacterized protein n=1 Tax=Nitroba... 69 3e-10 UniRef50_UPI000197B669 hypothetical protein BACCOPRO_01365 n=1 T... 68 7e-10 UniRef50_Q9RZJ3 Transposase, putative n=9 Tax=Deinococcus radiod... 67 1e-09 UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliph... 67 1e-09 UniRef50_B7I4U9 Transposase 1 n=31 Tax=Bacteria RepID=B7I4U9_ACIB5 66 3e-09 UniRef50_Q7NIQ3 Glr2130 protein n=2 Tax=Gloeobacter violaceus Re... 65 4e-09 UniRef50_B1QZ52 Putative transposase n=2 Tax=Clostridium butyric... 65 4e-09 UniRef50_Q10VE7 Putative uncharacterized protein n=1 Tax=Trichod... 64 7e-09 UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=... 64 1e-08 UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW 63 2e-08 UniRef50_Q1ARL9 Transposase, IS4 family n=1 Tax=Rubrobacter xyla... 59 3e-07 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 54 9e-06 UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=... 54 9e-06 UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales Rep... 54 1e-05 UniRef50_B0JGV7 Putative uncharacterized protein n=1 Tax=Microcy... 54 1e-05 UniRef50_D0SG98 Transposase n=3 Tax=Gammaproteobacteria RepID=D0... 54 1e-05 UniRef50_Q6ZER8 Putative uncharacterized protein sll5062 n=1 Tax... 53 2e-05 UniRef50_A7HFH6 Putative uncharacterized protein n=1 Tax=Anaerom... 53 2e-05 UniRef50_UPI0001C16BE8 Transposase, IS4 protein n=1 Tax=Cylindro... 52 3e-05 UniRef50_A5WBL3 Transposase, IS4 family n=2 Tax=Bacteria RepID=A... 52 4e-05 UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candida... 52 5e-05 UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geoba... 51 6e-05 UniRef50_Q6MB98 Putative uncharacterized protein n=1 Tax=Candida... 51 8e-05 UniRef50_A5FWE3 Transposase, IS4 family protein n=2 Tax=Acidiphi... 51 8e-05 UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostri... 51 8e-05 UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_... 50 1e-04 UniRef50_A3IP38 Putative uncharacterized protein n=1 Tax=Cyanoth... 50 1e-04 UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacter... 50 1e-04 UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula balt... 50 2e-04 UniRef50_C4YUK3 Transcription-repair-coupling factor n=29 Tax=Ri... 49 2e-04 UniRef50_C1D0Y0 Putative transposase n=1 Tax=Deinococcus deserti... 49 3e-04 UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuri... 49 3e-04 UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium... 49 4e-04 UniRef50_Q8A4P1 Transposase n=5 Tax=Bacteroides RepID=Q8A4P1_BACTN 49 4e-04 UniRef50_B2AKB8 Transposase, IS4 family n=40 Tax=cellular organi... 49 4e-04 UniRef50_UPI00016C424B Transposase n=1 Tax=Gemmata obscuriglobus... 49 5e-04 UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosi... 49 5e-04 UniRef50_B4ABV5 Transposase n=1 Tax=Salmonella enterica subsp. e... 48 5e-04 UniRef50_P11901 Transposase for insertion sequence element IS421... 48 6e-04 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 48 7e-04 UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfati... 48 8e-04 UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostrid... 48 8e-04 UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4... 47 9e-04 UniRef50_A5UVL8 Putative uncharacterized protein n=1 Tax=Roseifl... 47 0.001 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 47 0.001 UniRef50_C3M9W9 Modified transposase for insertion sequence NGRI... 47 0.001 UniRef50_A8YLR7 Genome sequencing data, contig C326 n=5 Tax=Micr... 47 0.002 UniRef50_A8YMK7 Genome sequencing data, contig C327 n=1 Tax=Micr... 47 0.002 UniRef50_C6LGD4 Transposase, IS4 family protein n=3 Tax=Lachnosp... 46 0.002 UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC... 46 0.002 UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae Rep... 46 0.002 UniRef50_C1P7N3 Transposase IS4 family protein n=5 Tax=Bacillus ... 46 0.002 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 46 0.002 UniRef50_UPI0001C171A4 Putative transposase n=1 Tax=Raphidiopsis... 46 0.002 UniRef50_P12249 Transposase for insertion sequence element IS231... 46 0.002 UniRef50_D1RLN9 Putative uncharacterized protein n=1 Tax=Legione... 46 0.002 UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DS... 46 0.003 UniRef50_C4YUW4 Transposase subunit n=4 Tax=Rickettsia endosymbi... 45 0.005 UniRef50_B0JP83 Transposase n=112 Tax=Cyanobacteria RepID=B0JP83... 45 0.005 UniRef50_B7I4G0 Transposase subunit n=16 Tax=Bacteria RepID=B7I4... 45 0.005 UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepI... 45 0.005 UniRef50_D2KXE5 Putative transposase n=1 Tax=Lactobacillus ferme... 45 0.006 UniRef50_Q1J2M1 Transposase IS4 family protein n=4 Tax=Deinococc... 45 0.006 UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID... 45 0.006 UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfoto... 44 0.007 UniRef50_C9LIM0 Putative transposase n=1 Tax=Prevotella tannerae... 44 0.008 UniRef50_Q1J3A6 IS1 related protein n=4 Tax=Deinococcus geotherm... 44 0.008 UniRef50_C6PFH6 Transposase IS4 family protein n=2 Tax=Thermoana... 44 0.009 UniRef50_D1XVT7 Putative uncharacterized protein n=2 Tax=Bactero... 44 0.010 UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipe... 44 0.010 UniRef50_C0VKT5 IS4 family transposase ORF 2 n=5 Tax=Acinetobact... 44 0.012 UniRef50_A7B2R8 Putative uncharacterized protein n=2 Tax=Clostri... 44 0.014 UniRef50_A5EC94 Putative transposase n=1 Tax=Bradyrhizobium sp. ... 44 0.015 UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001... 43 0.016 UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostri... 43 0.017 UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungat... 43 0.017 UniRef50_Q3M187 Putative transposase n=10 Tax=Nostocaceae RepID=... 43 0.017 UniRef50_Q7UY96 Similar to transposase n=1 Tax=Rhodopirellula ba... 43 0.019 UniRef50_B6FLV1 Putative uncharacterized protein (Fragment) n=1 ... 43 0.020 UniRef50_B6FVK0 Putative uncharacterized protein (Fragment) n=2 ... 43 0.020 UniRef50_A7IQF9 Putative uncharacterized protein n=1 Tax=Xanthob... 43 0.021 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 43 0.023 UniRef50_C6N6I3 Transposase n=2 Tax=Gammaproteobacteria RepID=C6... 43 0.024 UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostrid... 43 0.024 UniRef50_Q0H069 ISEc13 transposase n=23 Tax=Bacteria RepID=Q0H06... 43 0.024 UniRef50_UPI000190437B putative insertion sequence transposase p... 42 0.028 UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0... 42 0.029 UniRef50_A6DTQ2 Putative transposase insL for insertion sequence... 42 0.031 UniRef50_Q4BVH8 Putative uncharacterized protein n=1 Tax=Crocosp... 42 0.034 UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID... 42 0.037 UniRef50_C4RAB7 Putative uncharacterized protein n=1 Tax=magneti... 42 0.037 UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 Rep... 42 0.039 UniRef50_Q05309 Transposase for insertion sequence element IS115... 42 0.040 UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coproco... 42 0.042 UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3... 42 0.043 UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium m... 42 0.046 UniRef50_A7C1C1 IS231-related transposase n=6 Tax=Beggiatoa sp. ... 42 0.047 UniRef50_A6FXH7 Transposase, IS4 n=1 Tax=Plesiocystis pacifica S... 42 0.052 UniRef50_Q04QP0 Transposase, ISLbp11 n=2 Tax=Leptospira borgpete... 42 0.053 UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfati... 42 0.054 UniRef50_C1DPR7 Transposase n=7 Tax=Proteobacteria RepID=C1DPR7_... 42 0.055 UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax... 42 0.058 UniRef50_C3BTW8 Transposase for insertion sequence element IS231... 41 0.059 UniRef50_C8VXW5 Transposase IS4 family protein n=3 Tax=Desulfoto... 41 0.065 UniRef50_C6J5I2 Transposase n=1 Tax=Paenibacillus sp. oral taxon... 41 0.066 UniRef50_D1RLH5 Putative uncharacterized protein n=3 Tax=Legione... 41 0.071 UniRef50_Q8PF48 Transposase n=1 Tax=Xanthomonas axonopodis pv. c... 41 0.080 UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula mar... 41 0.091 >UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE Length = 402 Score = 549 bits (1415), Expect = e-155, Method: Composition-based stats. Identities = 398/402 (99%), Positives = 399/402 (99%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR Sbjct: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS Sbjct: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL Sbjct: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK Sbjct: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 NQRSTRTHCHHPSPKIYSASAKEPW+LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS Sbjct: 241 NQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS Sbjct: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 Query: 361 TVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLGKL 402 TVRLGMEVLRHSGYTITRED LVAATLL QNLFTHGY LGKL Sbjct: 361 TVRLGMEVLRHSGYTITREDLLVAATLLAQNLFTHGYALGKL 402 >UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID=A3D336_SHEB5 Length = 460 Score = 451 bits (1160), Expect = e-125, Method: Composition-based stats. Identities = 214/398 (53%), Positives = 283/398 (71%), Gaps = 2/398 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLP-TKARTKHNIK 59 M L ILH SLYQ CPE+H KRLN+L + C AL++ LTLT LGR++ T TKH+IK Sbjct: 1 MQVLTILHQSLYQHCPEIHQKRLNTLMVTCRALINADCLTLTHLGRHIDGTSTHTKHSIK 60 Query: 60 RIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGR 119 R+DRLLGN HLH ER+AVY+WHA ++ + +TMP +LVDWSD+RE + L+ LRAS+A+ GR Sbjct: 61 RMDRLLGNPHLHHERMAVYQWHAKWLLTAHTMPTILVDWSDMREGRELIALRASIAIKGR 120 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYW 179 S+TLYE+ FPL Q ++ AH+QFL +L +LP N TPLIV+DAGF+ PW++ VE+LGWYW Sbjct: 121 SITLYERTFPLVLQGTQTAHNQFLNELRKVLPDNITPLIVTDAGFRNPWFRKVEQLGWYW 180 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 L RVRG Y + L+ + +K +G L+ P+ C+++L+++ SKGR Sbjct: 181 LGRVRGLSVYRPHPFGRQFSLKALYPQARRRAKHVGRVALSVKKPLLCEMVLFRAPSKGR 240 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 K QRST T CHH + Y +AKEPW L TNL ++ +P++LVNIY KRMQ+EETFRDLK Sbjct: 241 KGQRSTTTDCHHTAQWTYELTAKEPWALVTNLTMKAMSPQKLVNIYQKRMQMEETFRDLK 300 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 SPAYG GLRHSRT + R DI+LLIAL++QL W G++ + Q +HFQANTV+ RNVL Sbjct: 301 SPAYGFGLRHSRTRYAARMDILLLIALLVQLAFWWIGLYGETQQLQRHFQANTVKKRNVL 360 Query: 360 STVRLGMEVLRHS-GYTITREDSLVAATLLTQNLFTHG 396 ST+R+G E+LR Y I+ +D L AA L + THG Sbjct: 361 STIRMGKELLRRRHDYPISADDLLCAAKKLAELSLTHG 398 >UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobacteria RepID=Q15UH5_PSEA6 Length = 420 Score = 451 bits (1159), Expect = e-125, Method: Composition-based stats. Identities = 195/389 (50%), Positives = 270/389 (69%), Gaps = 2/389 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ ILHD L + CP LH KRL++L +A +LLD + L+LTELGRN+ KHNIKR Sbjct: 19 MRDIHILHDLLKKQCPNLHAKRLSALMVATQSLLDGQQLSLTELGRNISGSVAPKHNIKR 78 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGN +LH ERL +YRWHA +C N MP+VLVDWSD+REQ R + LRASV++ GRS Sbjct: 79 IDRLLGNNNLHNERLDIYRWHARLLCGANPMPVVLVDWSDVREQLRHLTLRASVSVQGRS 138 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYE+ F E S +H+ FL +LASILP PLIV+DAG++ PW++ VEK GW+WL Sbjct: 139 VTLYERVFSFGEYNSPVSHNPFLRELASILPLGCCPLIVTDAGYRNPWFREVEKHGWFWL 198 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVRG V + G +W+ + + ++S +K LG +L + +P+ + LYK+++K RK Sbjct: 199 GRVRGDVGFKRDGQASWQSNKSFYPSANSRAKYLGCGQLGRKSPLHAHLHLYKAKAKHRK 258 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIR-TPKQLVNIYSKRMQIEETFRDLK 299 + RS++ +H + + Y A +KEPW+LATNLP + KQLV++Y++RMQIEETFRD+K Sbjct: 259 DNRSSKAGRNHTAQQSYRAGSKEPWLLATNLPENDKLNSKQLVSLYARRMQIEETFRDIK 318 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 SP YG+GLRHS + ++RFDI+LLIA++ + L G+ A KQ W++ FQANT+R+R VL Sbjct: 319 SPQYGMGLRHSNSRCTKRFDILLLIAMLAEWLLRLLGIIAVKQNWERAFQANTIRHRRVL 378 Query: 360 STVRLGMEVLRHSG-YTITREDSLVAATL 387 S +RLG EV + + Y + A Sbjct: 379 SIIRLGREVRKRAKDYRMNSAQMTWAIAQ 407 >UniRef50_Q6LPG7 Hypothetical transposase n=7 Tax=Photobacterium profundum RepID=Q6LPG7_PHOPR Length = 402 Score = 426 bits (1094), Expect = e-117, Method: Composition-based stats. Identities = 136/397 (34%), Positives = 213/397 (53%), Gaps = 3/397 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKART--KHNI 58 M +IL+ L + P++H RL +L + + + +++T LGR L + + T KH+I Sbjct: 1 MKATEILYQDLRSYYPQIHSSRLKTLCTFIESGIKDQRVSVTYLGRGLESGSVTTKKHDI 60 Query: 59 KRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHG 118 KR DRL+GN HLH ER Y + + PI+L+DWS I Q+ +LRAS+ + G Sbjct: 61 KRADRLIGNAHLHCERHDYYEYMTEQLIGREKHPIILIDWSPINGQEIYQLLRASIPMQG 120 Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWY 178 R + LYEK F SE ++KAH FL +L +LP P+I +DA ++ PW+K+VE GWY Sbjct: 121 RGLVLYEKTFHESELNTEKAHQSFLDELEQVLPEGCQPVITTDAIYRSPWFKAVELKGWY 180 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 W+ RVRG+V + + + ++ LG K C+ +L+K KG Sbjct: 181 WIGRVRGQVSLSQDKETWYTSYQWFKAAKVNKAEHLGVLYYGKVAKFKCEGVLFKRNKKG 240 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQL-VNIYSKRMQIEETFRD 297 R ++ + K + A E W+L LP + + V++Y +RMQIEE FRD Sbjct: 241 RSAKKKRGGVSQRTTDKTHEKDANEAWLLVFKLPPRYKNNANIAVSLYRQRMQIEENFRD 300 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 K+ G+ L ++ + S ERFD +LLIA ++ W G A + QAN+++ R Sbjct: 301 TKNGKLGISLEYANSKSVERFDNLLLIAGLILFIIWCVGRAAVMKKIHYSLQANSLKFRA 360 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFT 394 VLST+ +G EV++ YTIT ++ + L++ + Sbjct: 361 VLSTIYIGREVVKDGRYTITIDEYVYVLAHLSELAVS 397 >UniRef50_A7N7H3 Putative uncharacterized protein n=31 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N7H3_VIBHB Length = 397 Score = 423 bits (1088), Expect = e-117, Method: Composition-based stats. Identities = 139/370 (37%), Positives = 215/370 (58%), Gaps = 4/370 (1%) Query: 20 LKRLNSLTLACHALLDCKTLTLTELGRNLP-TKARTKHNIKRIDRLLGNRHLHKERLAVY 78 +R+ ++ +AL + TLTLT LGR + TK + KH IKR+ RLLGN HLH+ER VY Sbjct: 22 KRRITAVLDCINALNEKDTLTLTGLGRGMKNTKTKVKHCIKRVYRLLGNPHLHRERTGVY 81 Query: 79 RWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKA 138 + F+ PI++VDWS + + +LRA++ + GR+ TLYE+ P + S Sbjct: 82 AYITDFLLKNVKHPIIIVDWSPVNHVDK-QILRATIPIGGRAFTLYEEVHPECKLGSLAV 140 Query: 139 HDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWK 198 H F+ LA+++P P++ +DAGFKVPW+K +E+ GWYWL RVRG + + W Sbjct: 141 HKAFIRRLATMVPKGVIPIVTTDAGFKVPWFKPIEQQGWYWLGRVRGNSKLR--VNDRWC 198 Query: 199 PISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYS 258 + + + LG LTK + CQ+ LY+ +SKGRK + + + + ++ Sbjct: 199 SADEVFVQAQYKPQHLGTAELTKQHQYPCQVCLYRKKSKGRKAKNWSGSLQRNTVSLSHA 258 Query: 259 ASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 +EPW+L +NLP E +++V +Y++RM IEE FRD K+ YGL L S ++S +R Sbjct: 259 KGEREPWLLVSNLPGETWFAERVVALYTQRMSIEEGFRDTKNERYGLALNFSGSASPKRI 318 Query: 319 DIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITR 378 +I+L+I ++ Q + G A +G+ K FQANT+R R VLS LG E++ Y+ + Sbjct: 319 EILLMIGMLTQFALLVVGKVAYLKGYYKDFQANTIRTRRVLSYFFLGKELIGREAYSFSV 378 Query: 379 EDSLVAATLL 388 +D +A L Sbjct: 379 KDLALAVGGL 388 >UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae RepID=Q6LJK0_PHOPR Length = 394 Score = 410 bits (1053), Expect = e-113, Method: Composition-based stats. Identities = 175/350 (50%), Positives = 231/350 (66%), Gaps = 5/350 (1%) Query: 29 ACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSG 88 C L L L GR+LP+KA+TKH IKR+DRLLGN HLH +RL +YRWH CS Sbjct: 28 LCKHCLAMMHLRLLYFGRSLPSKAKTKHCIKRVDRLLGNNHLHHDRLDIYRWHCHQFCSV 87 Query: 89 NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLAS 148 N PIVLVDW+DIRE +RLMVLRAS+A+ GRSVTL+E+ F S ++H QFL D + Sbjct: 88 NPQPIVLVDWADIREYERLMVLRASIAVEGRSVTLFEQTFTFKNYNSPRSHQQFLDDFKA 147 Query: 149 ILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 +LPS+ P+IV+DAGF+ W++ V+ + W +L RVRG V L W+ I L ++ Sbjct: 148 VLPSHVIPIIVTDAGFRNTWFRQVDDMDWCYLGRVRGDVNV--LIKNQWQHIKQLFIKAN 205 Query: 209 SHSKTLGYKRLTKSNPISCQILLYKSR-SKGRKNQRSTRTHCHHPSPKIYSASAKEPWIL 267 S K +G+ +L K P+ C + LYK + K RK++ R H + ++ SA EPW+L Sbjct: 206 SKPKYVGFTQLAKRKPLQCHLHLYKKQTPKKRKDRPKGRE--HFSAQAVHKKSALEPWVL 263 Query: 268 ATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALM 327 ATNLP +I + + +V +Y+KRMQIEETFRDLKSP YG GLR SRT +RFDI+LLI L+ Sbjct: 264 ATNLPTDIFSSRCIVRLYTKRMQIEETFRDLKSPQYGFGLRQSRTHDPKRFDILLLIGLL 323 Query: 328 LQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTIT 377 + W G+ A+ GW +HFQAN+V++R VLS VRLG EV R Y I Sbjct: 324 AFMVYWWFGIIAEHNGWHRHFQANSVKDRRVLSFVRLGKEVFRRLEYHIN 373 >UniRef50_C6MY57 Putative transposase, IS4 family protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MY57_9GAMM Length = 397 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. Identities = 144/391 (36%), Positives = 224/391 (57%), Gaps = 1/391 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +LH+ L Q +H KRL+SL A A + + +T+T LGR L + K+ IK+ Sbjct: 1 MRVEQLLHNHL-QKSVVMHSKRLDSLMCAVTAGMKDRCVTVTGLGRRLRMSIKVKNKIKK 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRL+GN HLH+E ++Y+ I PI++VDWS + + +LRA++ GR+ Sbjct: 60 IDRLVGNSHLHQEIPSIYQCMTGLILGNIRRPIIIVDWSPLGQGTEHQLLRATLPSGGRA 119 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 +TLYE A+P S S+K H +FLA L ILP+ TP+IV+DAGF+ W++ V LGW W+ Sbjct: 120 LTLYESAYPESLLTSRKVHQEFLAKLCQILPAGCTPIIVTDAGFRNTWFEDVSSLGWDWV 179 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVR + Y AE W PI +L+ ++S + +G+ L++ +SC + LYK + KGR Sbjct: 180 GRVRNRTHYLAANAEQWVPIKSLYHHATSRPQYIGHGNLSRRTSVSCGLYLYKKQPKGRV 239 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 + C + + +EPW++AT+L ++++ IY+KR QIE FRD K+ Sbjct: 240 LKTLKGAKCRQATSLKIAQREREPWLIATSLHHNTTLSRKIIKIYAKRAQIENGFRDTKN 299 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 G L S+TS + R +++L+I + WL G +++ FQANT++NRNVLS Sbjct: 300 QRLGFSLNDSKTSHTARLNVLLIIIAIATFGLWLLGGLLKQKQLHFQFQANTIKNRNVLS 359 Query: 361 TVRLGMEVLRHSGYTITREDSLVAATLLTQN 391 V LG +++ +S R D L ++ + Sbjct: 360 NVFLGWQIINNSSPRFKRADWLSVIDSISID 390 >UniRef50_Q07YD1 Transposase, IS4 family n=6 Tax=Shewanella RepID=Q07YD1_SHEFN Length = 397 Score = 406 bits (1043), Expect = e-112, Method: Composition-based stats. Identities = 128/393 (32%), Positives = 218/393 (55%), Gaps = 4/393 (1%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +L L P +H R SL A + ++ L++T LGR++ +KA+ KH IKR Sbjct: 1 MNAKQVLSKCLSLVTPLMHKTRRQSLFSAIESSMNGGALSITGLGRDIESKAKEKHKIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 +DRL N +LH++ +Y + PI+ +DWSD+ ++K+ ++RAS+A GRS Sbjct: 61 VDRLCSNPYLHRDIEFIYTRMTCLLVGKMKQPIIHIDWSDLDDRKQHFLIRASLAAQGRS 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 +TLYE+ PL+++ K H FL L ++LP++ P+IV+DAGF++PW+K + L W ++ Sbjct: 121 LTLYEEIHPLNKKEKPKTHLSFLTKLKAMLPNDCKPIIVTDAGFRIPWFKQILSLDWDYV 180 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 R R + +W P+ L+ +S+ +K LG L + +++++K KGRK Sbjct: 181 GRFRNRTHCRKTIVHHWYPVKRLYIQASARAKNLGVYFLGEQASFCSRLVIFKRTDKGRK 240 Query: 241 NQRSTRTHCHHPSP-KIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 ++ +T + + KEPW+LAT+L T K++V IY+ RMQIEE+FRD+K Sbjct: 241 DRTATGDRTRRSKQSRSSAEREKEPWLLATSLCHSSATAKRVVKIYATRMQIEESFRDVK 300 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 + GL + S + ++ ++LLIA + Q L G+ + + +QAN++++RNVL Sbjct: 301 T---GLKMNDSGSRIKDKLSVLLLIACLSQFMLNLLGLAVKAADKHRQYQANSIKHRNVL 357 Query: 360 STVRLGMEVLRHSGYTITREDSLVAATLLTQNL 392 S+ +G+ R + + L L + Sbjct: 358 SSQFIGLRAYRDKYLRLLKSHWLAGIKTLQSLI 390 >UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5E2_9GAMM Length = 397 Score = 406 bits (1042), Expect = e-111, Method: Composition-based stats. Identities = 140/392 (35%), Positives = 211/392 (53%), Gaps = 1/392 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +LH + + + +L A L +++ LGR L + A+ KHNIKR Sbjct: 1 MHLNKLLHKTFSNTVGVIDKRNHCTLMKAAATLCQHTFISIAALGRKLKSNAKVKHNIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRL GN + R Y+ + P V +DWS + +LRA+V + GR+ Sbjct: 61 IDRLFGNPRVQFARYHYYQEITHRVIGQIRRPCVTIDWSGLTPCGEFHLLRAAVPVKGRA 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 +T+YE++F E + H F+ L SILPS+ P+IV+DAGF+ PW+K V K GW ++ Sbjct: 121 MTIYEQSFRECEYMKQSVHKDFIKTLKSILPSDCKPIIVTDAGFRNPWFKLVLKFGWDFV 180 Query: 181 SRVRGKVQYADLGAE-NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 RVR + QY + +W P+ L+ +++ L +L K+N +S L+KS+ K R Sbjct: 181 GRVRHQTQYQKPEDDTSWLPVKTLYSKATAKPVYLFETQLAKANSLSGHFYLFKSKPKQR 240 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 K + ++ A EPW+L T+L + + +V IYS+RMQIEE+FRDLK Sbjct: 241 KKKNLRGKTIRCSVSLKHAKGATEPWLLFTSLCNINYSAQDMVKIYSQRMQIEESFRDLK 300 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 + + GL LRH R+ R ++ LLIAL+ WLAG+ A+ + FQANT++NRNVL Sbjct: 301 NTSNGLNLRHCRSYEKGRLNVALLIALIANFILWLAGLTAKILNVHRSFQANTIKNRNVL 360 Query: 360 STVRLGMEVLRHSGYTITREDSLVAATLLTQN 391 S+ LG + GY I + L A L ++ Sbjct: 361 SSFSLGTQYFEKFGYKIKLKTFLEALKQLNKD 392 >UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii DJ RepID=C1DIQ1_AZOVD Length = 400 Score = 374 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 146/384 (38%), Positives = 220/384 (57%), Gaps = 6/384 (1%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M + LH + + P +H +RL +L A ALL + LTLT LGR+LP A +H IKR Sbjct: 1 MQTVQFLHAAFAKALPTIHARRLEALMAAVAALLQGRCLTLTALGRSLPGSAWPRHAIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGNR L ER Y + P++LVDWS I +L +LRA++ L GRS Sbjct: 61 IDRLLGNRQLQAERGLFYWVMLRALLGSFRHPLILVDWSPIDAAGKLFLLRAALPLAGRS 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 + + E P + + L LA++LP++ P++V+DAGF+ PW+++VE GW+++ Sbjct: 121 LPVCEVVHPRE--GCPRCQKRLLEALAAMLPADCRPVLVTDAGFQRPWFQAVEIRGWHYV 178 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVR + LG + W P+ +L+ ++S+ K LG +T+S P S Q+ + K +GR+ Sbjct: 179 GRVRNR-DLCRLGEQPWGPVKSLYALASASPKRLGCVEMTRSAPWSTQLCVVKHAPRGRQ 237 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 ++R T T + + EPW+LA+NLP Q+V IY +R QIEE FRDLKS Sbjct: 238 HRRITGTLARDKRSRQSAQRESEPWLLASNLPEAQWNAAQVVAIYRRRTQIEEGFRDLKS 297 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLT---CWLAGVHAQKQGWDKHFQANTVRNRN 357 G+GL R+ R +I+LLIA++ L G+ A++ G ++ FQ+N+++ + Sbjct: 298 HRLGIGLGLHRSRCPRRIEILLLIAVLANYALCLLGLLGLQAREAGHERRFQSNSLKCKR 357 Query: 358 VLSTVRLGMEVLRHSGYTITREDS 381 VLS RLG+E R I+RE Sbjct: 358 VLSLWRLGLEYARTGVGAISRETL 381 >UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0I6N0_VIBHO Length = 345 Score = 374 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 145/318 (45%), Positives = 204/318 (64%), Gaps = 5/318 (1%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ IL ++L P +H KRL SL LA + L LTLT+LGR+L T KH IKR Sbjct: 1 MRDIQILQETLTNHYPTIHKKRLQSLLLATESALGGADLTLTKLGRSLNTFTAAKHAIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 +DRLLGN LH+E+ +Y+W+A I N P++L+DWSD+REQ R M LRAS+AL GR+ Sbjct: 61 VDRLLGNTRLHREKEDIYKWNARLIAGANPCPVILLDWSDVREQLRFMTLRASIALDGRA 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYE+AF ++ S K H FL L ILP + TP+I+SDAGF+ W++ V+ GW+WL Sbjct: 121 VTLYEQAFEYAQYNSPKTHQYFLGKLQEILPPSATPIIISDAGFRNTWFRQVQSKGWFWL 180 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVRG V + +W+ L+ ++S +LG +L + +P++C + K +K Sbjct: 181 GRVRGDVSI-KMTQSDWQSNKTLYPDATSKPHSLGQCQLARRSPLTCNGYVVKQ----QK 235 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 QR +RT H + ++++ +A EPW+L TN+P E Q+ +Y+KRMQIEE FRDLKS Sbjct: 236 AQRHSRTGQKHTASRLFAKNANEPWLLVTNIPTETLNAVQICRLYAKRMQIEEAFRDLKS 295 Query: 301 PAYGLGLRHSRTSSSERF 318 AYGL LRH+RT + R Sbjct: 296 TAYGLALRHNRTHHNRRL 313 >UniRef50_Q17U39 Transposase n=11 Tax=Gammaproteobacteria RepID=Q17U39_ECOLX Length = 394 Score = 353 bits (906), Expect = 6e-96, Method: Composition-based stats. Identities = 133/391 (34%), Positives = 200/391 (51%), Gaps = 16/391 (4%) Query: 1 MCELDILHDSLYQFCPE-LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIK 59 M +L D L P+ +H R + L A AL T+T +GR +P + K +IK Sbjct: 4 MNVKAMLADFLTFVTPKSMHKARFSVLLDAVTALAKDACCTVTAIGRAMPGSS-DKVSIK 62 Query: 60 RIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGR 119 R DRLL N +L +E +Y + I T P++LVDWS+ KR +LRAS+A GR Sbjct: 63 RADRLLNNPNLQRELPLIYAALTASIVGHKTKPMILVDWSNADTAKRHFILRASIAADGR 122 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYW 179 ++TL +K + H FL L ++LP + P+IV+DAGFKVPW K V KLGW++ Sbjct: 123 ALTLLQKIAAAEDYTCPHLHGAFLKQLKAMLPKDCKPVIVTDAGFKVPWLKQVRKLGWHY 182 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 ++RVRG V+ + + ++ L+ + K++G L ++ Q +L KG Sbjct: 183 VARVRGNVKLKLAEQDKFISVNQLYRQAKKDPKSVGKIMLAQTQHYETQAVLV---GKGY 239 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 K + + + KEPW+L ++L ++ YS RMQIEE+FRD K Sbjct: 240 KLLKRDKN-----------KTYKEPWLLVSSLADCHGYADKIAKCYSSRMQIEESFRDQK 288 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 S YGLG T R +I+LL+A ++ +L G A+K G +QANTV+NR VL Sbjct: 289 SHRYGLGSDLHGTKKKSRLEILLLLAALVNWFHYLLGSAAEKAGLHLRYQANTVKNRRVL 348 Query: 360 STVRLGMEVLRHSGYTITREDSLVAATLLTQ 390 + LG+ + + I R+ + Q Sbjct: 349 ALNFLGILLCKEPKQRIRRQYYQQGLKQILQ 379 >UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2NZH2_XANOM Length = 407 Score = 348 bits (892), Expect = 2e-94, Method: Composition-based stats. Identities = 125/394 (31%), Positives = 205/394 (52%), Gaps = 8/394 (2%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++L L +H R +L A AL+ LTL +L R P R + +K Sbjct: 4 MRASEVLQKCLSNSLSGMHALRQRTLLRAVEALVHGGRLTLIDLARAWPGATRVRAPLKA 63 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 DRLL NR L ER A+ + A ++ G+ P++++DWSD++ K +LRA+V + GR+ Sbjct: 64 CDRLLCNRTLQVERSAIEQDMAHWLLRGD-QPVIVIDWSDLKPDKSWCLLRAAVPVGGRT 122 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 +TL + +Q S A +FL L +++P + P++V+DAGF+ PW+++V +GW W+ Sbjct: 123 LTLLDMVVSRKQQGSPGAEKRFLQQLRALIPDDVRPILVTDAGFRTPWFRAVSAMGWDWV 182 Query: 181 SRVRGKVQYA----DLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR- 235 R+RG+ Q A W LH ++S+ ++ L + +S+P+ C+++LY Sbjct: 183 GRLRGRTQVKPQDVPDDAVQWIDSRRLHALASNRARALPPMQANRSDPLDCRLVLYAKTR 242 Query: 236 -SKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEET 294 + ++N+RS+ S +A +EPW++ + + + KQLVN+Y++RMQIE Sbjct: 243 QGRQQRNRRSSAKVSRASSSLKAAAREREPWLIVASPQLHAPSAKQLVNLYARRMQIELA 302 Query: 295 FRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVR 354 FRDLKS YG + S T ER I+LL+ + WLAG+ + G + + Sbjct: 303 FRDLKSHRYGQAMEDSLTRRGERLQILLLLNTLATFASWLAGLGCEATGIAQWLSPRSS- 361 Query: 355 NRNVLSTVRLGMEVLRHSGYTITREDSLVAATLL 388 R + ST+R+G E L L L Sbjct: 362 TRKLYSTLRVGREALVRCWPMEPVSRWLERLRAL 395 >UniRef50_Q5X8W8 Putative uncharacterized protein n=1 Tax=Legionella pneumophila str. Paris RepID=Q5X8W8_LEGPA Length = 398 Score = 347 bits (889), Expect = 5e-94, Method: Composition-based stats. Identities = 107/375 (28%), Positives = 185/375 (49%), Gaps = 9/375 (2%) Query: 18 LHLKRLNSLTLACHALLDCKT-LTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLA 76 +H KR L L+D T L++TE+G+ L +K K I + N L ++ + Sbjct: 19 IHAKRKQCLVRFLSDLMDYDTTLSVTEIGKKLTSKTTVKSKIYAAQTFVNNFKLERDIVC 78 Query: 77 VYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSK 136 +Y+ F S +VL+DW+ + VL AS+A HGRS+ +Y + SEQ + Sbjct: 79 IYKSLTHFFWSHAKEIVVLIDWTGGCSEG-YHVLEASIAAHGRSIPIYHEVHSESEQENA 137 Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAEN 196 + H QFL L ++PS+ + I++DAGF W++ V +LGW + R+ Y G N Sbjct: 138 EIHRQFLLRLKEVIPSSLSVTIITDAGFHREWFQQVLELGWDVIGRIYSLYCYQIEGETN 197 Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKSNP-ISCQILLYKSRSKGRKNQRSTRTHCHHPSPK 255 W + ++ + LG +L K+ + + YK + G+ ++ + H K Sbjct: 198 WHKVKDILFEGIGKASALGKVKLGKTKKAVEGYLYTYKEKLSGKVRKKKNKYPSH---DK 254 Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 +S K W+L ++L R LV+ Y KRMQIE+ F+D+K+ G+G R +++S Sbjct: 255 AHSNYYKNGWVLFSSLNKHARF---LVSYYKKRMQIEQNFKDIKNEQLGMGFRRNQSSGK 311 Query: 316 ERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYT 375 R +++ +A++L + W G+ + + +QANT++N+ V S + L RH Sbjct: 312 TRVNMLFFLAVLLIMIAWWFGLMIESLNKHRSYQANTIKNKRVRSFIHLARMAYRHEPEL 371 Query: 376 ITREDSLVAATLLTQ 390 + + + L Q Sbjct: 372 LNWDLFQYIMSDLKQ 386 >UniRef50_B4UH67 Transposase IS4 family protein n=3 Tax=Proteobacteria RepID=B4UH67_ANASK Length = 384 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 93/365 (25%), Positives = 163/365 (44%), Gaps = 43/365 (11%) Query: 14 FCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA--RTKHNIKRIDRLLGNRHLH 71 F +LH KR+ SL A +L+ L + +G +L ++KH +K++DR+L N Sbjct: 19 FAEDLHAKRVASLAGAAVGVLEGAALGIHAIGNSLAVAEGLKSKHAVKQVDRMLSN---- 74 Query: 72 KERLAVYRWHASFI---CSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAF 128 E + V+R S++ +V +DW+D E + + + + HGR+ L K Sbjct: 75 -EGIPVWRLFGSWVPCVVGDRLEIVVALDWTDFDEDDQSTIALSMITSHGRATPLLWKTV 133 Query: 129 PLSEQ-CSKKAHDQ-FLADLASILPSNTTPLIVSDAGF--KVPWYKSVEKLGWYWLSRVR 184 SE + H+ L +LP +++D GF + + ++LG+ ++ R R Sbjct: 134 MKSELKGWRNEHEDVLLERFREVLPEGVKVTVLADRGFGDQALYELLKDQLGFGFIVRFR 193 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 G V+ E +P + S+ + L R+TKS ++ K Sbjct: 194 GVVKVTSAEGET-RPAKDWVP-SNGRTLRLRSARVTKSRREIGAVVCVK----------- 240 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 + KE W LAT+ + ++V +Y++R IEE+FRD K+ +G Sbjct: 241 -------------AKGMKEAWHLATS--HGDKPGSEIVALYARRFTIEESFRDQKNLRFG 285 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 +GL +R + R D +LL++ + + G + G DK + NTV+ R + S +R Sbjct: 286 MGLSETRIADPARRDRLLLVSAVAIALLTILGAAGEALGLDKWLKTNTVKRRTI-SLLRQ 344 Query: 365 GMEVL 369 GM Sbjct: 345 GMMHY 349 >UniRef50_A7MW84 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7MW84_VIBHB Length = 235 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 82/226 (36%), Positives = 128/226 (56%), Gaps = 2/226 (0%) Query: 163 GFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKS 222 GFKVPW+K +E+ GWYWL RVRG + + W + + + LG LTK Sbjct: 3 GFKVPWFKPIEQQGWYWLGRVRGNSKLR--VNDRWCSADEVFVQAQYKPQHLGTAELTKQ 60 Query: 223 NPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLV 282 + CQ+ LY+ +SKGRK + + + + ++ +EPW+L +NLP E +++V Sbjct: 61 HQYPCQVCLYRKKSKGRKAKNWSGSLQRNTVSLSHAKGEREPWLLVSNLPGETWFAERVV 120 Query: 283 NIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQ 342 +Y++RM IEE FRD K+ YGL L S ++ +R +I+L+I ++ Q + G A + Sbjct: 121 ALYTQRMSIEEGFRDTKNERYGLALNFSGSACPKRIEILLMIGMLTQFALLVVGKVAYLK 180 Query: 343 GWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLL 388 G+ K FQANT+R R VLS LG E++ Y+ + +D +A L Sbjct: 181 GYYKDFQANTIRTRRVLSYFFLGKELIGREAYSFSVKDLALAVGGL 226 >UniRef50_D0LPB8 Transposase IS4 family protein n=4 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPB8_HALO1 Length = 418 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 93/365 (25%), Positives = 154/365 (42%), Gaps = 37/365 (10%) Query: 14 FCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT--KARTKHNIKRIDRLLGNRHLH 71 F +H KR+ SL+ A + L++ +G L +KH IK++DRLL N L Sbjct: 55 FEGNMHSKRVESLSNAVVGVTHASALSVQAIGHGLAVALDKNSKHAIKQVDRLLSNARL- 113 Query: 72 KERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 + V+ ++ S T I+ +DW++ + + + +HGRS L K S Sbjct: 114 -DPWQVFSVWVPYVLSERTEAIIALDWTEFAKDGQSTCAAHLMTMHGRSTALAWKTVEKS 172 Query: 132 EQCSKK--AHDQFLADLASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQ 188 + ++ D+ + L I+P + +++D GF + + LGW ++ R R + Sbjct: 173 QLRGQQTAVEDEVIDHLHRIIPPDIEVTLLADRGFAAAERFIHLTTLGWNYVIRFRENIH 232 Query: 189 YADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTH 248 + G P + S K L ++ P+ + ++K Sbjct: 233 ISHQGQTQ--PARDWVPKSGRAKKLLDVGITCRAEPLEAVVCVHK--------------- 275 Query: 249 CHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLR 308 A K+ W LATNL + +V +Y++R IEETFRD K +GLGL Sbjct: 276 ----------AQMKQAWCLATNLVDA--SASHVVKLYARRFTIEETFRDQKDLRFGLGLS 323 Query: 309 HSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEV 368 + R D +LL++ + L G A+ G+D+ +ANTVR R S R G Sbjct: 324 ATHIRDCGRRDRLLLLSAIAHALLTLLGAAAESIGFDRMMKANTVR-RRTHSLFRQGCYW 382 Query: 369 LRHSG 373 Sbjct: 383 FWRMP 387 >UniRef50_A9AVJ1 Transposase IS4 family protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVJ1_HERA2 Length = 378 Score = 251 bits (642), Expect = 2e-65, Method: Composition-based stats. Identities = 86/368 (23%), Positives = 149/368 (40%), Gaps = 41/368 (11%) Query: 10 SLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRH 69 +L+QF P LH +RL + LL +++ L+ + +L + A I RI R L N Sbjct: 22 TLHQFHPTLHARRLATWAWVIVGLLHARSVHLSAVALHLASDAEAAGRIARIRRWLANPW 81 Query: 70 LHKERLAVYRWHASFICSG--NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKA 127 L + L YR + + + N +++D + K L ++R S++ R++ L + Sbjct: 82 LDTQFL--YRPLITHVLTAWRNRDITIMIDGCYVNHDK-LQMVRLSLSHCYRAIPLAWQV 138 Query: 128 FPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFK-VPWYKSVEKLGWYWLSRVRGK 186 S ++ + L + +L ++D GF+ W S ++ GW ++ R+ Sbjct: 139 MSHHGNVSVESCQRMLNRVQQLLIGTRRVTFLADRGFRDWAWAASCQRRGWDYIIRIANT 198 Query: 187 VQYADLGAENWKPISNLHDMSSSHSKTLGYKR--LTKSNPISCQILLYKSRSKGRKNQRS 244 P ++ M+ K++ + LT+ C I + +R+ K Sbjct: 199 TTIRWDDG----PWMAINTMAVKPGKSVYLRNVLLTQDGEWRCTIAITWTRATKTK---- 250 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 P+ + + +EP K ++N Y +RM IEE+FRD KS G Sbjct: 251 -------PAERCAVITNREP-------------SKWILNHYLRRMHIEESFRDDKSG--G 288 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 L SR +R D +LL + L + G K H R LS +L Sbjct: 289 FDLDASRLRDPQRLDRLLLAIAVATLWMYELGERVLKDEQRAHVDPGYQRQ---LSVFQL 345 Query: 365 GMEVLRHS 372 G LR + Sbjct: 346 GWRWLRRA 353 >UniRef50_B4WTK1 Putative uncharacterized protein n=6 Tax=Synechococcus sp. PCC 7335 RepID=B4WTK1_9SYNE Length = 411 Score = 237 bits (604), Expect = 7e-61, Method: Composition-based stats. Identities = 85/378 (22%), Positives = 146/378 (38%), Gaps = 51/378 (13%) Query: 2 CELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA-RTKHNIKR 60 D L L Q CP HL L + AL+ +++LT+ LP + + +R Sbjct: 25 RLYDALKAWLGQDCPWAHLSHLTTCCWMVFALIQTGSVSLTKWTTYLPCRGLYAQSKQRR 84 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICS--GNTMPIVLVDWSDIREQKRLMVLRASVALHG 118 + R LGN ++ RL Y+ + + +D S EQ ++R +V G Sbjct: 85 VRRWLGNSRINIHRL--YKPLIQAALATWEAECLYLCLDTSLFWEQ--YCLIRLAVVYRG 140 Query: 119 RSVTLYEKAFPL-SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLG 176 RS+ L + S + +A+++ L LPSN ++++D GF +++LG Sbjct: 141 RSIPLAWRVLEHNSASVAFEAYEELLRQSTQYLPSNANMILLADRGFVHTRAMTLIKQLG 200 Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 W++ R++ W+P S S H L + R+ Sbjct: 201 WHYRIRIKSDTWI-------WRPGSGWCQPKSFH--------------------LERGRA 233 Query: 237 KGRKNQRSTRTHCHHPSPKIYSAS--AKEPWILATNLPVEIRTPKQLVNIYSKRMQIEET 294 + R R + P I + E W + ++ P T Q Y+ R IEE Sbjct: 234 LCFHHIRLHRHEQYGPVHVIIGRNNINGELWAVVSDQP----TSPQTFMEYALRFDIEEG 289 Query: 295 FRDLKSPAYGLGLRHSRT-SSSERFDIMLLIALMLQLTCWLAGVHAQKQGW-DKHFQANT 352 F D +S + L R + R +L +A + +A V + ++ W D H+ Sbjct: 290 FLDDQSAGWNLQRSEIRGLTDLSRLWFILAVATLYVTAQGVAVVQSGRRRWIDTHWDRGN 349 Query: 353 VRNRNVLSTVRLGMEVLR 370 S R+G+E + Sbjct: 350 -------SYFRIGLEWTK 360 >UniRef50_B2JAE4 Transposase, IS4 family protein n=8 Tax=Cyanobacteria RepID=B2JAE4_NOSP7 Length = 448 Score = 210 bits (535), Expect = 7e-53, Method: Composition-based stats. Identities = 89/369 (24%), Positives = 152/369 (41%), Gaps = 51/369 (13%) Query: 15 CPELHLKRLNSLT---------LACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLL 65 PEL+ K L SL L + + + K + L ++ +LP + + K++ R L Sbjct: 2 LPELYQKHLQSLLSQSELIFLTLVINVVQNIKDVKLEKISESLPLFIQCQSRRKKLQRFL 61 Query: 66 GNRHLHKERL---AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVT 122 L+ E L + RW A GN + +D ++ + + LM+ SV R++ Sbjct: 62 LLPILNIEELWFPIIERWLAQIFL-GNHRIYLAIDRTNWKRKNLLMI---SVIFQKRAIP 117 Query: 123 LYEKAFPLSEQCSKKAHDQFLADLASILP--SNTTPLIVSDAGF-KVPWYKSVEKLGWYW 179 +Y K L++ S +Q L I+P N +++ D F V K +++ G+ + Sbjct: 118 IYFKL--LAKLGSSNLSEQT-KALTKIIPLFKNYKTVVLGDREFCSVSLAKWLDEQGFEF 174 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 R++ K + +L A W I +L + S + +TK+ Q+ + K + Sbjct: 175 CLRLK-KNENIELKAHLWCEIKDL-GLKPGTSFFVSDATVTKTK----QVKGFNVACKWK 228 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 KN R + AKE W + TN+ +I + Y KR IEE FRD K Sbjct: 229 KNYRQNK--------------AKEGWFILTNMNSKIT----AIQAYQKRFDIEEMFRDFK 270 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR--N 357 S Y L +RF ++LI + L G + + +G K+ R Sbjct: 271 SGGYNL---EKTNVEGKRFIALVLIISLADTIATLQGQNIKSKGIAKYLARPKEYGRSHR 327 Query: 358 VLSTVRLGM 366 S +G+ Sbjct: 328 RHSNFYIGL 336 >UniRef50_C7QY62 Transposase IS4 family protein n=9 Tax=Cyanothece RepID=C7QY62_CYAP0 Length = 365 Score = 206 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 80/366 (21%), Positives = 140/366 (38%), Gaps = 44/366 (12%) Query: 15 CPELHLKRLNS---------LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLL 65 PEL+ L L++ + L + L EL P + + IK++ R L Sbjct: 4 LPELYSNHLKKHLDNHQYLMLSILVNLLQSLHLVRLEELANRFPHPIQLRSRIKKLQRFL 63 Query: 66 GNRHLHKERL---AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVT 122 + E L + W G + +V +D S RE + V S+ + R++ Sbjct: 64 SLPQFNLETLWIPIIESWIKQEWKRGEIIYLV-IDRSQWREINLIFV---SLIYNHRAIP 119 Query: 123 LYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSR 182 L P + + L + S L L+ V K + + + S Sbjct: 120 LCVDWLPKKGNSNLEQQKAILEVILSRLKDYKIVLLGDREFCGVDLAKWLSEAKEVYFSL 179 Query: 183 VRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQ 242 K +YA+L + W L D+ + ++ Y+ + + K++ G N Sbjct: 180 RLKKNEYAELAPQIWF---QLKDLGLNPGMSVYYRG----------VKITKTKGFGEVNL 226 Query: 243 RSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPA 302 + + SAKEPW++ TNL + Q ++ YSKRM IEE FRD K Sbjct: 227 AAKWKRNYQ------GKSAKEPWLIMTNL----ESLSQAMSAYSKRMGIEEMFRDFKRGG 276 Query: 303 YGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR--NVLS 360 Y L ++ + ++LLI + +G +++G K+ T +R S Sbjct: 277 Y--QLEGTQVTKERLISLVLLIC-LAYCWSTFSGQSLKRKGVAKYVSRPTSGHRSHRQHS 333 Query: 361 TVRLGM 366 + +G+ Sbjct: 334 SFYIGL 339 >UniRef50_Q72IB6 Transposase n=3 Tax=Thermus thermophilus HB27 RepID=Q72IB6_THET2 Length = 365 Score = 203 bits (517), Expect = 8e-51, Method: Composition-based stats. Identities = 93/394 (23%), Positives = 150/394 (38%), Gaps = 48/394 (12%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCK-TLTLTELGRNLPTKARTKHNIKRIDR 63 ++ +++ L ++L L LL TL++L R P + + R+ R Sbjct: 9 QVITLWVHKAFASLRKTIRSNLALFLSTLLTAPLDPTLSDLARRTPLPTLAQSRLNRLWR 68 Query: 64 LLGNRHLHKERL---AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 L + L A+ A +P++ VDW+ E R L A++ L GR+ Sbjct: 69 FLHHPTLQNPWALTEALLPLLARRFPKDRPLPLI-VDWT-FAEDGRHQALVAALPLKGRA 126 Query: 121 VTLYEKAFPLSEQCSKK-AHDQFLADL-ASILPSNTTPLIVSDAGF-KVPWYKSVEKLGW 177 + + PLS S+ ++FL L ++ TPL + D GF +V + ++ G Sbjct: 127 LVVAFALHPLSPFPSQNRVEEEFLHRLGRAVQDLGYTPLFLLDRGFDRVSLMRKLQGWGM 186 Query: 178 YWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK 237 +L R+R + G + P+ + +P+ ++ L+ Sbjct: 187 GFLIRLRQNREVEPRGGKR-LPLKEGYRRVV--------------HPLREEVRLF----- 226 Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRD 297 G + T +P +EPW LA + P + P Y RM IEE FRD Sbjct: 227 GHGGEEVEVTLLVYP-------GGREPWYLAYSGPFGGKPP------YGWRMWIEEGFRD 273 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 LK +GL RT +S R + L+AL + L L G Q + W A+ R Sbjct: 274 LKGQGFGLDRHRLRTGASLR-GWLWLLALGMALLI-LLGARLQGREWLPRLLAHPERQ-- 329 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLTQN 391 S RLG L LL + Sbjct: 330 --SLFRLGRIALAQGPPPWREAVVEELIRLLQEL 361 >UniRef50_Q9UH48 Gastric cancer-related protein GCYS-20 n=1 Tax=Homo sapiens RepID=Q9UH48_HUMAN Length = 332 Score = 167 bits (422), Expect = 8e-40, Method: Composition-based stats. Identities = 110/111 (99%), Positives = 110/111 (99%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR Sbjct: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLR 111 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMV R Sbjct: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVFR 111 >UniRef50_A7NGF0 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NGF0_ROSCS Length = 383 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 76/361 (21%), Positives = 133/361 (36%), Gaps = 44/361 (12%) Query: 19 HLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVY 78 H L++ +L ++ L+++ LP A+ + I RI R L N ++ + Y Sbjct: 23 HAHHLSNWLWIVCGILLSGSVALSKIALYLPLTAQAEGRIARIRRWLKN--VYVDVWQFY 80 Query: 79 RWHASFICSG--NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSK 136 R + G V++D + RL + R S+ R++ L P S Sbjct: 81 RPLLEKVLQGWQAAEAAVILDGVMVFGD-RLQIFRLSLRHGSRAIPLSWVVVPGKGLTSV 139 Query: 137 KAHDQFLADLASIL-PSNTTPLIVSDAGFK-VPWYKSVEKLGWYWLSRVRGKVQYADLGA 194 + + A L P + +D GF+ V W ++GW+++ R+ Sbjct: 140 ERLRPLIQRAAEFLAPRVGAVVFPADRGFRDVEWAALCLEVGWHYVIRLANNTLITLEDG 199 Query: 195 ENWKPISNLHDMSSSHSKTLGYKR--LTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHP 252 ++ + ++ +T+S + + ++ Sbjct: 200 RR----LSIAAPGVPPGEACYWRNAAITQSQDWPANLSVTWTKG---------------- 239 Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 A + P +LA + + R Q + Y RM IEE+FRD KS G L H+R Sbjct: 240 ------ARGQAPELLA--VMSDRRACNQRLREYGWRMSIEESFRDDKSG--GFDLEHTRL 289 Query: 313 SSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA--NTVRNRNVLSTVRLGMEVLR 370 +R + +LL + L G A D QA + R LS +LG+ LR Sbjct: 290 QDPQRLERLLLAVAIATLWRHELGEQALH---DHSVQAELDPGGKRRELSIFQLGLRFLR 346 Query: 371 H 371 Sbjct: 347 R 347 >UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellular organisms RepID=B8B8E6_ORYSI Length = 753 Score = 144 bits (364), Expect = 5e-33, Method: Composition-based stats. Identities = 88/89 (98%), Positives = 89/89 (100%) Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK 346 +RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK Sbjct: 358 ERMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK 417 Query: 347 HFQANTVRNRNVLSTVRLGMEVLRHSGYT 375 HFQANTVRNRNVLSTVRLGMEVLRHSGYT Sbjct: 418 HFQANTVRNRNVLSTVRLGMEVLRHSGYT 446 >UniRef50_A7MYH1 Putative uncharacterized protein n=4 Tax=Vibrio RepID=A7MYH1_VIBHB Length = 145 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 51/148 (34%), Positives = 80/148 (54%), Gaps = 6/148 (4%) Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADL--ASILPSNTTPLIVSDAGFKVPWYKSVEKLG 176 R + + ++ E + H + L+ L A+ + TP+IVSDAGF+ W++ V G Sbjct: 2 RDIQILQQTI---ENQCPEIHKKRLSSLILATKTVNGCTPIIVSDAGFRNTWFRQVANKG 58 Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 W+WL RVRG+V G ++W+ + ++ + LG +L K +P+ C LYKS Sbjct: 59 WFWLGRVRGEVSI-KCGEDSWQWNKTFYPQATDKPQFLGESQLAKRSPLECFAYLYKSHP 117 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEP 264 KGRK R +RT H + K++ AKEP Sbjct: 118 KGRKAHRHSRTCQKHSAGKVFHKGAKEP 145 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 38/95 (40%), Gaps = 6/95 (6%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ IL ++ CPE+H KRL+SL LA + C + +++ G + + Sbjct: 1 MRDIQILQQTIENQCPEIHKKRLSSLILATKTVNGCTPIIVSDAGFR---NTWFRQVANK 57 Query: 61 IDRLLGNRHLHKER---LAVYRWHASFICSGNTMP 92 LG ++W+ +F P Sbjct: 58 GWFWLGRVRGEVSIKCGEDSWQWNKTFYPQATDKP 92 >UniRef50_Q47076 BfpT, bfpV, bfpW and transposase genes, complete cds n=53 Tax=Enterobacteriaceae RepID=Q47076_ECOLX Length = 186 Score = 141 bits (354), Expect = 6e-32, Method: Composition-based stats. Identities = 55/170 (32%), Positives = 92/170 (54%), Gaps = 2/170 (1%) Query: 221 KSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQ 280 + I YK KGRK +RS + + K S SAKE W++ + ++ Sbjct: 10 RKKSIRGHFYTYKKSVKGRKKKRSKGQRGLNKTDKEQSKSAKEAWLIFSRTND--FRARE 67 Query: 281 LVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQ 340 ++ +YS+RMQIE+ FRD K+ +G GLR S++ S+ R ++ L+A + + WL G HA+ Sbjct: 68 IIKLYSRRMQIEQNFRDEKNGRFGFGLRASKSRSTGRILVLSLLATLSTIVMWLLGYHAE 127 Query: 341 KQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQ 390 +G + +Q N++++R V+S + L VLRHS + + R L + Sbjct: 128 NKGLHQKYQVNSIKSRRVISYLTLAKNVLRHSPFILRRTVLSTVLNHLAR 177 >UniRef50_B0JUB6 Transposase n=18 Tax=Cyanobacteria RepID=B0JUB6_MICAN Length = 382 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 114/359 (31%), Gaps = 36/359 (10%) Query: 17 ELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKE--R 74 EL R L + L K L L LP + K++ R L L+ E Sbjct: 13 ELGRARYLLLLMIVGTLQILKQAKLEILAEALPIPILFESRRKKLKRFLKLEILNIEKIW 72 Query: 75 LAVYRWHA--SFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSE 132 V + + + + +D + LMV S+ R++ +Y + Sbjct: 73 FPVLKEMLKQQQRFTTKGLAYIAIDRTSWGAINILMV---SLIYDKRAMPIYWEILDKKG 129 Query: 133 QCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADL 192 + + + L ++L + ++ V K ++K Y+ R + Sbjct: 130 SSNLEEQQRVLEKTLTVLSGHKIVVLGDREFCSVSLGKWLQKQSLYFCLRQKKSTNVKTK 189 Query: 193 GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHP 252 + + S L + + K + G N + Sbjct: 190 EGI----YQEMRALGLSPGTKLFLNDVN----------ITKEKGFGEFNLAGKWKKTYRG 235 Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 P KEPW + TN + Y KR IEE FRD KS Y L S+ Sbjct: 236 FPT------KEPWYILTNFGDLET----AIMAYQKRFDIEEMFRDFKSGGY--SLEGSQL 283 Query: 313 SSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL--STVRLGMEVL 369 + + ++++ + + L G + G K+ R + S+ +G + Sbjct: 284 A-PKYLSKLIIVIAIAYTSATLQGKKIKDMGIQKYVTRPEKRYKRQRRHSSFYVGQHLY 341 >UniRef50_Q5GUK2 ISxac1 transposase n=1 Tax=Xanthomonas oryzae pv. oryzae RepID=Q5GUK2_XANOR Length = 361 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. Identities = 58/198 (29%), Positives = 97/198 (48%), Gaps = 4/198 (2%) Query: 177 WYWLSRVRGKVQYADLGAENWKPI-SNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 235 W + ++ +++ L A NW + L + S + +S+P C+++LY Sbjct: 135 WSIVRQLLPRLRTGLLAAGNWVSVGPPLAAGAPSSGLVARTMQANRSDPRDCRLVLYAKT 194 Query: 236 SKGRK--NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEE 293 +GR+ N+RS S +A +EPW++ + + + KQLVN+Y++RMQIE Sbjct: 195 PQGRQQRNRRSPAKVSRASSSLKAAAREREPWLIVASPQLHAPSAKQLVNLYARRMQIEL 254 Query: 294 TFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV 353 FR+LKS YG + S T ER I+LL+ + WLAG+ + G + + Sbjct: 255 AFRNLKSHRYGQAMEDSLTRRGERLQILLLLTTLASFASWLAGLGCEATGIARWLSPRSS 314 Query: 354 RNRNVLSTVRLGMEVLRH 371 R + T+R+G E L Sbjct: 315 -TRKLYLTLRVGREALVR 331 >UniRef50_B7KME5 Transposase IS4 family protein n=42 Tax=Cyanobacteria RepID=B7KME5_CYAP7 Length = 387 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 62/330 (18%), Positives = 120/330 (36%), Gaps = 40/330 (12%) Query: 37 KTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKE--RLAVYRWHASFICSGNTMPIV 94 K + + LG LP + ++I R L ++ L + + G ++ Sbjct: 34 KQVKIERLGACLPIPILYESRRRKIQRFLKSKKLSLSLFWFPLIKLIIEQEFKGQERLVL 93 Query: 95 LVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNT 154 ++D + + +M+ SV R++ +Y + S + + +L S+ Sbjct: 94 VLDRTQWKSNNIIMI---SVIWRKRALPIYWLILNKKGRSSLSEQQAIIRPILKLL-SDW 149 Query: 155 TPLIVSDAGFK----VPWYKSVEKLG---WYWLSRVRGKVQYADLGAENWKPISNLHDMS 207 +I+ D F W K +K Y+ R +G V + G + ++ + L Sbjct: 150 EIVILGDREFHGIELAYWLKQQDKKRKNPIYFAFREKGDVNFKK-GKKGYQTMKELCKDP 208 Query: 208 SSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWIL 267 + + + K + G+ N + + K+PW + Sbjct: 209 GFKAFY-------------SDVEVTKKKGFGKFNLGFYWKRNY------KNYKEKQPWFI 249 Query: 268 ATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALM 327 TNLP + + + Y KR IE F+D K+ Y L S+ + +++LLIA+ Sbjct: 250 LTNLP----SLNETIKYYRKRSGIEAMFKDCKTGGYN--LEGSQANQVRLTNLILLIAIA 303 Query: 328 LQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 + L G + QG K+ R Sbjct: 304 YTNSA-LKGKSIKNQGHQKYITRLREARRK 332 >UniRef50_B0BZT8 Transposase, IS4 family n=21 Tax=Cyanobacteria RepID=B0BZT8_ACAM1 Length = 397 Score = 118 bits (296), Expect = 4e-25, Method: Composition-based stats. Identities = 60/355 (16%), Positives = 113/355 (31%), Gaps = 51/355 (14%) Query: 37 KTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTM----- 91 + + L L P I+ + R L L RL + ++ Sbjct: 33 RQIQLARLASMFPQPIHYSSRIRNLQRFLVLPQLSV-RLLWFPILKHWLSEEFKTGHGNR 91 Query: 92 --------------PIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKK 137 ++ VD + + + +MV +V ++ +Y P S S K Sbjct: 92 AYRRARLKRTIDGYVVMAVDRTQWKGRNLMMV---TVVWGKHALPVYWAPLPKSGSSSLK 148 Query: 138 AHDQFLADLASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAEN 196 + L I ++++D F P + + ++ R + G + Sbjct: 149 QQLRLLKTALKIFKP-YPVVVLADRDFHSPKLALWLSQRQVEFVLRQKKSAYVQLQGEVD 207 Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 ++P+ C + K G N + Sbjct: 208 YQPLKE-------------RGFAPGQKGFLCDVYWGKRDQLGPFNLAFYWKRQYR----- 249 Query: 257 YSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE 316 K+PW + T+LP T +Q +++Y+ R IE F+D KS Y L ++ + Sbjct: 250 -GKGGKDPWFIMTSLP----TLEQALSLYACRWGIEMMFKDCKSGGYN--LERTKVND-A 301 Query: 317 RFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRH 371 RF ++L+ M LAG +K + + +R G + H Sbjct: 302 RFLALVLVMAMAYCLATLAGYGLKKLKVNHYVARLNEHSRRRPRHSDFGTALYGH 356 >UniRef50_C7RIL9 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RIL9_9PROT Length = 352 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 117/331 (35%), Gaps = 40/331 (12%) Query: 11 LYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR-IDRLLGNRH 69 L + P + L L LA A++ +T EL L ++ + RLL NR Sbjct: 15 LEEALPGMRKTILKKLPLAVAAMIQARTPNTMELSTLLALNTERADMREQWLRRLLTNRL 74 Query: 70 LHKERL--AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKA 127 + + R +G ++ +D +D+ R VL SV RS+ L + Sbjct: 75 IRSAGVLEPFARRALEQAAAGGQTILLSMDQTDL--GDRFAVLMISVRRGDRSLPLVWRI 132 Query: 128 FPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGK 186 L ++ + LP ++++D + V ++ + GW + R++G Sbjct: 133 EEGEANIGFAGQQVLLEEVRAWLPEGAAVMLLADRFYPSVALFEWLLATGWQYRLRLKGN 192 Query: 187 VQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTR 246 + +G + + ++ + R + N R Sbjct: 193 LLVD-----------------------VGCAGIGTTGELAAGV-----RERYEANARLFE 224 Query: 247 THCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLG 306 ++ EPWI+A + P + V Y R IE F D KS G Sbjct: 225 AGIPMAIGVLHEPGHPEPWIIAMDCP----PNRAAVRDYGARWAIEPMFSDFKSR--GFR 278 Query: 307 LRHSRTSSSERFDIMLLIALMLQLTCWLAGV 337 L ++ + +R D ++LI + C AG Sbjct: 279 LEDTKLEAPKRLDCLILIMALAMYWCVQAGQ 309 >UniRef50_A8ZRP2 Transposase IS4 family protein n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=A8ZRP2_DEIGD Length = 398 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 71/379 (18%), Positives = 137/379 (36%), Gaps = 53/379 (13%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKAR-TKHNIKRIDRLL 65 L L+ H +L LL + L ++ ++A + +R+ R L Sbjct: 21 LQTGLWNDVRNAH-----TLAWMVTGLLLSQCSFLPAWLPHIHSRATFAQSTERRLRRWL 75 Query: 66 GNRHLHKERLAVYRWHASFICS--GNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTL 123 N + + A+Y + G I+ +D S + E + ++R SV GR+V L Sbjct: 76 ENPAI--DPTAIYGPLVTRALRDWGGHTLILALDTSRLFE--KFCLIRVSVLFRGRAVPL 131 Query: 124 YEKAFPL-SEQCSKKAHDQFLADLASILP--SNTTPLIVSDAGF-KVPWYKSVEKLGWYW 179 + S Q S LA++ +L +++D GF + GW++ Sbjct: 132 VSRVLEHPSAQVSTAQLLPVLAEVKGLLDFLGQPEVRLLADRGFCDTQLMAWLRVCGWHF 191 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 R++ + A + + + ++ ++ LT + + L + Sbjct: 192 RIRIKSSLILAAPDGQRLCKVGEV-RLAPRETRYFHNVTLTGQHFGPVHVALGRP----- 245 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 E W + ++ P T + Y +R QIEE F D K Sbjct: 246 -------------------MDGPELWQVVSDEP----TSIETFAEYGERFQIEEGFLDEK 282 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 S +G L SR + + ++L+ + L GV G + + R L Sbjct: 283 SGLFG--LEDSRLRDAASLERLILVLTVATLLLVSEGVQIVHCGDRRVVDPHWQR---AL 337 Query: 360 STVRLGMEVLRHSGYTITR 378 S +++G+ ++ Y ++R Sbjct: 338 SYLKIGLRAVQ---YALSR 353 >UniRef50_A5UQG7 Transposase, IS4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UQG7_ROSS1 Length = 372 Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats. Identities = 68/379 (17%), Positives = 132/379 (34%), Gaps = 48/379 (12%) Query: 1 MCEL----DILHDSLYQFCPEL--HLKR-LNSLTLACHALLDCKTLTLTELGRNLPT-KA 52 M + + L Q P++ H +R L +L L ++ + L ++ P +A Sbjct: 1 MRDTYRRYRAIAQCLLQLYPQVGGHQRRHLATLALLICGIVGSQHTQLPKVVERTPGGRA 60 Query: 53 RTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRA 112 + + R R L + ++ +R + A G + ++D S + M L Sbjct: 61 ADESVVMRFRRWLKHDNVTYKRWMLPVAQALIAMLGRRPLVFVIDGSTVGRG--CMCLMI 118 Query: 113 SVALHGRSVTLYEKAFPLSEQCSKKA-HDQFLADLASILPSNTTPLIVSDAGF-KVPWYK 170 SV R++ + + +A H L LA ++P+ + I+ D + W Sbjct: 119 SVLYQRRALPITWLVVKARKGHLPEALHCALLEQLAQLVPAEASVTILGDGEYDGADWQA 178 Query: 171 SVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQIL 230 ++ GW ++ R + A L D++ + + +++ + + Sbjct: 179 AITARGWKYVCRTASNILLTLAEATI-----ALGDLAPKRGEVIAVEQVCITAAQYGPV- 232 Query: 231 LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 ++ A+ + P L T + +Y +R Q Sbjct: 233 ---------------------NVLAVWEAAYEHPIHLVTTHADVAY----ALALYRRRAQ 267 Query: 291 IEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA 350 IE F D KS G + S S R +L+ + L GV A++ Sbjct: 268 IETYFSDQKSR--GFRINRSHISDPTRLARLLIATALAYLWVVYLGVVARRDALRGRIHR 325 Query: 351 NTVRNRNVLSTVRLGMEVL 369 +R LS LG+ +L Sbjct: 326 ---PDRCDLSLFSLGLRLL 341 >UniRef50_B5VWL5 Transposase IS4 family protein n=6 Tax=Arthrospira maxima CS-328 RepID=B5VWL5_SPIMA Length = 398 Score = 110 bits (276), Expect = 7e-23, Method: Composition-based stats. Identities = 70/399 (17%), Positives = 125/399 (31%), Gaps = 62/399 (15%) Query: 17 ELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKE--- 73 L + +L L L + + L+ L P + ++ I R L L + Sbjct: 13 NLSESQAQTLELLVLMLQSYRQVRLSTLANVFPQPIQYSSRLRNIQRFLKLPQLSAKLLW 72 Query: 74 RLAVYRWHASFI-----------------CSGNTMPIVLVDWSDIREQKRLMVLRASVAL 116 + S V +D + R++ LMV ++ Sbjct: 73 FPIIKAALKSEFREKHLNREQRRKRSKFRLKTKNYVAVALDRTQWRDRNLLMV---TIIW 129 Query: 117 HGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLG 176 ++ +Y + P S + + L + ++L +++ D F + Sbjct: 130 GHHALPIYWELLPKLGSSSFREQKRVLGPVLALLKP-YPVVVIGDREFHSA-----QLAD 183 Query: 177 WYWLSRVRGKVQYADLG-----AENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILL 231 W RVRG A + +P +L ++ +K +T I Sbjct: 184 W---LRVRGVNVVFRQKKSAFVATSCQPGKSLKTQGFKSGESHFFKNVTLQK--FAPIHG 238 Query: 232 YKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQI 291 + +K R K+PW L T L PK + +Y R I Sbjct: 239 FNLGVYWQKIHR--------------GKKVKKPWYLLTTL----DNPKLVKQLYQARWGI 280 Query: 292 EETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQAN 351 E FRD +S Y + S S RF ++L+ L G + + + Sbjct: 281 EMMFRDCQSGGYNM---ESTRVDSTRFLALVLLITFAYWLATLGGHEWEANHLVAYLGRS 337 Query: 352 --TVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLL 388 T N S LG+ S + ++ ++A L Sbjct: 338 EKTPNNFPHHSIFGLGLSGYAWSQSLVFWQEEMLALMAL 376 >UniRef50_Q1QFL8 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QFL8_NITHX Length = 191 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 32/115 (27%), Positives = 53/115 (46%), Gaps = 3/115 (2%) Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 +++ K W LA + T +++ N Y++R IE FRD K +G+GL R + Sbjct: 46 VHARDMKAAWCLAASNAEA--TAREITNHYARRWTIEPGFRDTKDLRFGMGLGVLRIADP 103 Query: 316 ERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLR 370 +R D +LL+ + L G + G D+H + T + R S R G + Sbjct: 104 QRRDRLLLLNAFAIVLLTLLGAAGESLGMDRHLKVATAK-RRTHSLFRQGCMLYE 157 >UniRef50_Q10V90 Transposase, IS4 family n=7 Tax=Trichodesmium erythraeum IMS101 RepID=Q10V90_TRIEI Length = 275 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 48/261 (18%), Positives = 101/261 (38%), Gaps = 37/261 (14%) Query: 113 SVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVS-DAGFKVP---- 167 S+A R++ ++ K + + + +L +I++ D F Sbjct: 3 SLAWKKRALPIHWKILTHKGASNLAEQKAVIRPVIRLLK--CQKIILTADREFHSIFLCY 60 Query: 168 WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISC 227 W K +K Y++ R + + + +S L +G + Sbjct: 61 WLKKYQKQDVYFVLRTKKSTMIKR--GKKYCKLSEL-------PANIGECK--------- 102 Query: 228 QILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSK 287 L+ ++ + + T + PK S E W + TNL + P ++ IYS+ Sbjct: 103 ---LFLNQKITKILRVGTYNLLIYKKPKYRDKSVSEKWYILTNLSL----PGKIKKIYSQ 155 Query: 288 RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKH 347 RM IE F+D K+ AY L ++ + + +++LLIA+ ++ + +G ++ Sbjct: 156 RMGIEAMFKDYKTGAYN--LESAKANETRLNNLILLIAISYAISS-FQVQKIKNKGVQEY 212 Query: 348 FQANTVRNRNVL--STVRLGM 366 ++R S+ +G+ Sbjct: 213 ISRTNEKSRKERRHSSFFVGL 233 >UniRef50_UPI000038476B hypothetical protein Magn03010330 n=1 Tax=Magnetospirillum magnetotacticum MS-1 RepID=UPI000038476B Length = 333 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 58/329 (17%), Positives = 115/329 (34%), Gaps = 42/329 (12%) Query: 15 CPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR-IDRLLGNRHLHKE 73 P + K+ L L +LD ++ L ++ +LP +A + I R+LGN + + Sbjct: 20 LPRQNKKQREGLALLAATMLDVRSANLMDVAASLPRQAERLDMRYQWISRVLGNALIDVD 79 Query: 74 RL--AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 + R + ++++D + E ++ +V+ + RS+ L + Sbjct: 80 EVMAPYVRDILGRLVGDGRRLVLIIDQTQANEVQQAVVVAVR--VGERSLPLAWRVKKTQ 137 Query: 132 EQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYA 190 + L +A +LP P+++ D + P W W R++ + Sbjct: 138 GAIGFAEQREALEVVAGLLPEGVRPVLMGDRFYGSPDLIAWCRTQSWDWRLRLKQDLLVF 197 Query: 191 DLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH 250 + G E+ + + G + LT T Sbjct: 198 EDGGES----------TLAECFARGERMLT--------------------GVELTGKRVP 227 Query: 251 HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHS 310 ++ A EPWI+A + + Y R IE F D K+ +G L S Sbjct: 228 TNVAMVHEAGHPEPWIIALSEAPTVHRAFD----YGLRWGIEAMFSDFKTRGFG--LEDS 281 Query: 311 RTSSSERFDIMLLIALMLQLTCWLAGVHA 339 ++R D ++++ + G+ A Sbjct: 282 HIQRADRMDRLIMVMALALFWAVSTGMWA 310 >UniRef50_Q1VRR5 Putative uncharacterized protein n=8 Tax=Bacteroidetes RepID=Q1VRR5_9FLAO Length = 372 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 57/359 (15%), Positives = 130/359 (36%), Gaps = 46/359 (12%) Query: 17 ELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLA 76 +++L RL ++ AL +T+T +L +++ + +++RI R + + L + +A Sbjct: 32 KINLARLKLISHFVIALCKVQTVTFEKLANAFNSQSDSGSSLRRIQRFIASYSLDSDLIA 91 Query: 77 VYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGRSVTLYEKAFPLSEQCS 135 + + I+ +D ++ + Q + + V G + L + Sbjct: 92 L---LVFNLLPSRDKLILSIDRTNWKFGQTNINIFMLGVVYKGVAFPLLFTMLDKRGNSN 148 Query: 136 KKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGA 194 + L + + +V+D F W + + + R+R + Sbjct: 149 SQERIDLLNRFIRLFGKHVIESVVADREFVGKDWLAFLNRNEIRYYIRIRNNFKV----- 203 Query: 195 ENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSP 254 + P N + L + T + + Y K R C+ Sbjct: 204 --FLPHKN----KEIKASHLFNRFKTN------EFVYY------HKIVRVNGELCYLSGC 245 Query: 255 KIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 K+ + K+ +++ + P+ Y KR QIE F+ +KS G + + Sbjct: 246 KLNPKNLKQEFLIIVSFNK----PENAQQDYQKRWQIEMCFKAMKSS--GFDIEKTHLQD 299 Query: 315 SERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVR----NRNVLSTVRLGMEVL 369 +R + ++L+ ++ + C+ G++ Q N ++ R S + G+ L Sbjct: 300 IQRIEKLILLVMIAFVWCYKIGIYLH--------QINPIKIKKHGRKAKSIFKYGLTFL 350 >UniRef50_D2SUD5 Transposase n=1 Tax=uncultured bacterium psy1 RepID=D2SUD5_9BACT Length = 367 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 71/407 (17%), Positives = 128/407 (31%), Gaps = 54/407 (13%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +L +L HL R+ L+ AL KT+ EL A+ + +R Sbjct: 1 MDHTTMLTHTLKLHFGW-HLARIKCLSCLIIALFKVKTVNFAELATAFSGSAKVDSHYRR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGR 119 I R L ++ LA R S + ++ +D ++ + L S+ G Sbjct: 60 IQRFFKEVELKQDTLA--RLVTSLLPYD--QFVLSIDRTNWMLGCFAINFLVLSIVHQGT 115 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWY 178 + ++ P + K + + + S+ + D F W+ + K Sbjct: 116 AFPVFWLLLPKKGNSNTKERIELINQFLDVFGSHKIQYLTGDREFIGQQWFAYLIKHQIE 175 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 + R++ + + + P N +S P+S L R Sbjct: 176 FRLRIKKNMMISRSNG-QFSPAENFF----------------RSLPLSTACQLIDRRWVC 218 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 T A +++ ++ Y+KR +IE F L Sbjct: 219 GHLLWVTGMRL-----------ASGDYLIVVTHDD----SAHTMSDYAKRWKIEVLFESL 263 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVR--NR 356 KS + E +L + + + G W + V+ R Sbjct: 264 KSRGFNF--EDVNLKDQESLKRLLAVITIAFCWAYHVGA------WLNEVKPIRVKKHQR 315 Query: 357 NVLSTVRLGMEVLRH--SGYTITREDSLVAATLLTQNLFTH--GYVL 399 S R G + +RH R++ TLL +N T GY+ Sbjct: 316 PAKSVFRYGFDWIRHVLFNPEDKRDELKQVLTLL-KNTITRPKGYIF 361 >UniRef50_Q2S0J1 Putative transposase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S0J1_SALRD Length = 248 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 83/235 (35%), Gaps = 19/235 (8%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLG 66 + +L Q PE R +L L + + L+++ + A+ + + + R LG Sbjct: 1 MESTLSQLLPEALATRRRALAQMITGLHLAEHVHLSKVAGRIAGTAQLESKTRHLRRFLG 60 Query: 67 NRHLHKERLA------VYRWHASFICSGNTMPI-VLVDWSDIREQKRLMVLRASVALHGR 119 N ++ ER + W A + + PI +LVD ++ VL A +A R Sbjct: 61 NENVDPERFYSPVRDRLIEWAAQGAETQGSGPIRLLVDTVELS--GERQVLMAGIAYRRR 118 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFK-VPWYKSVEKLGWY 178 ++ + + + + + L L P ++V D F +E GW+ Sbjct: 119 ALPICWETYRREGVTNAEQQISLLKALVGRFPDEAEVVVVGDGAFHSTDLMDFIEDQGWH 178 Query: 179 WLSRVRGKVQYAD--------LGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPI 225 + R+ WK + +L + L +TK N Sbjct: 179 FCLRLHADTYIRSFKDSSKGFPKEGTWKQLRDLVPEEGER-RYLQDVIVTKDNEY 232 >UniRef50_Q5ZXB3 ORF2 transposase n=9 Tax=Legionella RepID=Q5ZXB3_LEGPH Length = 361 Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 62/377 (16%), Positives = 131/377 (34%), Gaps = 51/377 (13%) Query: 3 ELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRID 62 + L D L + + R+ +L+ +T+ LTE+ + A+ RI Sbjct: 2 SITELSDILNGYFSW-NKSRIECFATMLISLIKVRTVNLTEIACGFSSPAKQDSRYTRIK 60 Query: 63 RLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDI-REQKRLMVLRASVALHGRSV 121 R R + +V W + +D ++ +K + +L SV G ++ Sbjct: 61 RFF--REFKIDFSSVSVWVIHCFGLSGQALYLSMDRTNWRWGKKDINILMLSVVYKGIAI 118 Query: 122 TLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWL 180 L+ + + + + + +++D F W+ + + Sbjct: 119 PLFWTLLAKGGNSDTRERIEIVQRFITKFGKSMIAGLLADREFVGDNWFAWLLTEKIPFC 178 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTL-GYKRLTKSNPISCQILLYKSRSKGR 239 R++ V + + +D+ S + L G ++L + + L Sbjct: 179 IRIKNNVITTNSRGLEVSIDALFYDLKSGEQRILQGLRKLWRQKIYLSALRL-------- 230 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 + E I+AT+ ++ + Y+ R +IE F LK Sbjct: 231 --------------------ADGELLIVATDHLMDEP-----IEHYALRWEIETLFSCLK 265 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQ-ANTVRN--R 356 G + + +R + +L++ + CW A K G +H Q A ++ R Sbjct: 266 --GRGFNFEDTHMTQPDRIEKLLVLLTIA--FCW-----AHKTGEWRHVQKAIKIKKHGR 316 Query: 357 NVLSTVRLGMEVLRHSG 373 +S R G+++LR + Sbjct: 317 KGVSFFRYGLDLLRDAA 333 >UniRef50_A9AUQ0 Transposase IS4 family protein n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AUQ0_HERA2 Length = 414 Score = 99.4 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 68/370 (18%), Positives = 128/370 (34%), Gaps = 46/370 (12%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELG--RNLPTKARTKHNIKRID 62 IL ++ P +RL ++ L+ T + G L T A + +R+ Sbjct: 38 TILRTAVPTLSPWT-ARRLTDWLVSI-LLMPSITTRVVAWGCALGLSTAAHAASHERRLR 95 Query: 63 RLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVT 122 R + L +++R + V V + R +L A++ HGR++ Sbjct: 96 RTYRDSQLS---WSLHRAILATTLHIAPTESVTVIIDETTHTDRWTLLTAALWYHGRAIP 152 Query: 123 LYEKAFPLSEQCSKKAHDQ---FLADLASILPSNTTPLIVSDAGFKVPWYK-SVEKLGWY 178 L P + + L + +LP+ + ++V+D F P + V GW Sbjct: 153 LAWVLHPGYTRRATAFWTDVATLLERVQQVLPNAMSVVVVADRAFGCPAFTDQVAAYGWG 212 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 W+ RV+G + G H++T+ +T+ + + + +K Sbjct: 213 WVVRVQGHTRIQLRG----------------HTETMIRTLVTRGHRVVRRGHAFKKAGWR 256 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 + H EP +L +NL + Y +R IE FRD Sbjct: 257 TVTVVAAWEATCH-----------EPLLLVSNLEGIGA----IRQAYGRRSAIEALFRDW 301 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN--R 356 K+ + S++ S + ++L + + L G + + + R Sbjct: 302 KTAGW--QWEASQSRSQTTQEALVLGMAIATVLVLLVGTAEAQAVLAERGDRPSPRRPWA 359 Query: 357 NVLSTVRLGM 366 S RLG Sbjct: 360 ARESLFRLGR 369 >UniRef50_A8ZMZ5 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMZ5_ACAM1 Length = 258 Score = 99.0 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 47/264 (17%), Positives = 95/264 (35%), Gaps = 37/264 (14%) Query: 109 VLRASVALHGRSVTLYEKAFPL-SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKV- 166 ++ SV GR+V S + + L L + ++++D GF Sbjct: 1 MIHLSVVCCGRAVPFLWLVLAHKSAAVGFEEYQPLLRRARWFLRKHPDVMLLADRGFANH 60 Query: 167 PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPIS 226 +++ W++ R+ V H + + + P Sbjct: 61 QLMSWLQQSRWHYCLRIPCDV--------------------ILHGPRRCPREVRRLWPSK 100 Query: 227 CQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYS 286 + +LY++ C KEPW + T+ ++T Q Y+ Sbjct: 101 GEAILYRNVGLWEDG------VCRCNLVLANIRGVKEPWAVITDESPTLQTLWQ----YA 150 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK 346 R ++EE F D KS A+ L S+ ++ + + L+A + L G+ Q +G + Sbjct: 151 LRFRVEELFLDSKSGAF--ELEDSKIRCADALERLYLVAAVALLYSTTHGMAVQIEGLRE 208 Query: 347 HFQANTVRNRNVLSTVRLGMEVLR 370 + R +S +++G+ L+ Sbjct: 209 QVDPH---WRRGISYLKIGLRWLK 229 >UniRef50_B4WNR8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WNR8_9SYNE Length = 271 Score = 98.2 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 49/279 (17%), Positives = 89/279 (31%), Gaps = 31/279 (11%) Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYW 179 ++ L + Q L +L +N ++ V + + + G Y+ Sbjct: 6 AIPLSWRLMENLGNSDYVEQTQLLTKALPMLSANKIVVLGDREFCSVDLARWLGEKGHYF 65 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 R + G +W+ ++ L T G+ + L KS+ G Sbjct: 66 CLRQKQSTWM-KAGETDWQKLTTL----GLRPGTQGFYN---------ALTLTKSKGFGA 111 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 + + EPW + T+L T + Y KR IEE FRD K Sbjct: 112 AHLVGKWKRRYQSFAPA------EPWFILTSL----DTLDVAIWAYQKRFDIEEMFRDFK 161 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANT---VRNR 356 Y L R I+LL+A+ G +++ K+ N+ Sbjct: 162 LGGY--SLERCRAQDKRFLSIVLLVAIAYTCA-TSQGQTLKQKALQKYIARPERYDQPNK 218 Query: 357 NVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTH 395 S +G+ R + + + +L +N + Sbjct: 219 R-HSAFYIGLAAHRWVPFWPRCQQQVFELLVLDRNKLPY 256 >UniRef50_B4W0I0 Transposase, IS4 family protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4W0I0_9CYAN Length = 390 Score = 97.1 bits (240), Expect = 9e-19, Method: Composition-based stats. Identities = 50/284 (17%), Positives = 102/284 (35%), Gaps = 39/284 (13%) Query: 74 RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQ 133 + +W S S + + +D ++I + VL V G + + K +E+ Sbjct: 90 FAPLLKWILSLWKSEDKCLPLAIDATNIGQN--FTVLSLHVLYQGCGIPVAWKIVKGTEK 147 Query: 134 CSKKAH-DQFLADLASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYAD 191 S K H + ++P ++++D G W ++ + L W+ R+ + Y Sbjct: 148 GSWKPHWLHIFHYVKDVVPDYWQVIVLADRGLYADWLFEVICSLNWHPFLRINKQGYYQL 207 Query: 192 LGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHH 251 + W+ + + + + GY K + + C +L Sbjct: 208 RQEQEWRCLDTVAPK--TRTDWSGYVTCFKDHSLECTLL--------------------- 244 Query: 252 PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSR 311 + KEPW++ T+L + Y R IE ++RD+KS + + +R Sbjct: 245 ---ARWDEGYKEPWLIVTDLELTQAQSFW----YGLRAWIESSYRDIKSDGW--QWQKTR 295 Query: 312 TSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN 355 +R + + L + L G + + Q N +N Sbjct: 296 LREPDRAERIWLAMAIATLWTVTVGSEEKS---HQSQQFNEQQN 336 >UniRef50_A8ZQL6 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZQL6_ACAM1 Length = 383 Score = 97.1 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 61/375 (16%), Positives = 118/375 (31%), Gaps = 55/375 (14%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA-RTKHNIKRIDRLL 65 L L Q+ + L + A+L ++ ++ L ++ + + ++R+ L Sbjct: 10 LQTQLSQWISPKDHRHLTVFSENIAAILQAQSGCMSHWLSYLSHRSCQARSQMERLSYFL 69 Query: 66 GNRHLHKERLAVYRWHASFICSG--NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTL 123 N + E Y + +D S + ++ +A GRS L Sbjct: 70 HNPRILSET--FYAPLLKQFLHAWEGMSMTLTLDTSMLW--DTYCLIEVCLAWGGRSFPL 125 Query: 124 YEKAFPL-SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLS 181 +K S + + L+ +LP +++D GF + + W W Sbjct: 126 AQKVMEHGSATVAFVDYCSVLSMTQGVLPPRCHITLLADRGFEHGELIRWLRSSEWSWAI 185 Query: 182 RVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKN 241 R + +Q + KP+S L S L I C + Sbjct: 186 RAKSDLQITLANGRS-KPVSKLLPEVEQASLFRDVMIL---EDIHCHLAT---------- 231 Query: 242 QRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ-IEETFRDLKS 300 ++ +E W + T+ P Q +Y +R IE F+D KS Sbjct: 232 --------------ASVSTTQEAWAVITDTP----PSLQTFAVYGQRFGGIEPHFKDYKS 273 Query: 301 PAYGLGLRHSRTS----SSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 A+ + H R + + + +A + W H Sbjct: 274 AAFEVPRSHIRDTAALERLLMLLAAATLIAISVAFQVIAQDALKTIDWHTH--------- 324 Query: 357 NVLSTVRLGMEVLRH 371 LS +++G+ + Sbjct: 325 RGLSFLQIGLRQINQ 339 >UniRef50_Q6ZER7 Putative uncharacterized protein sll5063 n=1 Tax=Synechocystis sp. PCC 6803 RepID=Q6ZER7_SYNY3 Length = 217 Score = 95.2 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 72/181 (39%), Gaps = 11/181 (6%) Query: 32 ALLDCKTLTLTELGRNLPTKA-RTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNT 90 ALL ++LT +P + + + +R+ R L N L+ RL Y+ + Sbjct: 3 ALLQRGEVSLTLWLPYIPCRGVQAQSKQRRLSRWLHNSRLNVHRL--YKSLIQAALADWQ 60 Query: 91 MPIV--LVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPL-SEQCSKKAHDQFLADLA 147 I+ +D S ++R +V GR++ + + S + + + + L A Sbjct: 61 EEILYLSLDTSLFW--DEYCLVRLAVVYRGRALPVVWRVLKHRSASIAFRQYWEMLYQAA 118 Query: 148 SILPSNTTPLIVSDAGF-KVPWYKSVEK-LGWYWLSRVRGKVQYADLGAENWKPISNLHD 205 + L ++++D GF +V LGW++ R++ G W ++H Sbjct: 119 NRLSQGVKVVLLADRGFIHTDAMTAVTTHLGWHYRIRLKRNTWIWRAG-HGWCQFKDIHL 177 Query: 206 M 206 Sbjct: 178 Q 178 >UniRef50_Q7NHH4 Gll2563 protein n=2 Tax=Gloeobacter violaceus RepID=Q7NHH4_GLOVI Length = 212 Score = 94.8 bits (234), Expect = 6e-18, Method: Composition-based stats. Identities = 40/139 (28%), Positives = 54/139 (38%), Gaps = 14/139 (10%) Query: 231 LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 L K + G N EPW L TNL TP++ + Y R Sbjct: 15 LVKRKPFGPVNIAGKLGFQPGKKEAYC-----EPWWLLTNLS----TPQEAITWYRCRWG 65 Query: 291 IEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA 350 IEE FRD KS Y L +RF ML++ M + G ++QG K+ Sbjct: 66 IEEMFRDCKSGGYNL---EKLRVQPKRFKRMLMVLAMAMSLSVMHGKQLKRQGLQKYVSR 122 Query: 351 NTVRNRNV--LSTVRLGME 367 R V ST R+G++ Sbjct: 123 VAEPGRVVKRRSTFRVGLQ 141 >UniRef50_D1C6P8 Putative uncharacterized protein n=2 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C6P8_SPHTD Length = 371 Score = 89.0 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 72/376 (19%), Positives = 131/376 (34%), Gaps = 46/376 (12%) Query: 4 LDILHDSLYQF---CPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 L +L D + F P L + R+ L L L+ T L + LP + ++R Sbjct: 5 LHLLQDWTHHFQALLPGLRVTRVRGLALLSLGLIWAGTPQLGHIAATLPLPVQQLSTVRR 64 Query: 61 IDRLLGNRHLHKERLAVYRWHASFIC--SGNTMPIVLVDWSDIREQKRLMVLRASVALHG 118 + R L R + +A ++ A G +++VD + E+ +L + +H Sbjct: 65 LRRWLATRAV--PVVATWQPLARAFLAHRGQRELLLVVDPTP--ERDDATLLVLGLVVHR 120 Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADL----ASILPSNTTPLIVSDAGFKVPWYK-SVE 173 R + L P + +LA L A++LP T V D G + Sbjct: 121 RVLPLAWHIVP-GQTAWAHPTTAYLARLGQRVAAVLPPGVTVTRVVDRGLASAAVIDWCQ 179 Query: 174 KLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYK 233 +LGW+WL R + A G P + + + + ++ Sbjct: 180 RLGWHWLMR---RNVDARQGVHVRLPDGTVCPAWACVP--------GPGRRWAGPVAAFQ 228 Query: 234 SRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEE 293 ++ S ++EPW+L ++ P V Y +R Q+E Sbjct: 229 TQGWYAAELTSIWPV-----------RSREPWVLLSDRP----AGPARVREYRRRQQVEA 273 Query: 294 TFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV 353 +D + + L S ++ R + + L + L G ++G + Sbjct: 274 VSQDGTTRGWN--LEASTRTARNRLNRLPLALFLALWWSHLRGQQVVRRGERR---RFDR 328 Query: 354 RNRNVLSTVRLGMEVL 369 +R VRLG + Sbjct: 329 TDRRDGRLVRLGRRWM 344 >UniRef50_B5VUF1 Transposase IS4 family protein n=17 Tax=Arthrospira maxima CS-328 RepID=B5VUF1_SPIMA Length = 382 Score = 87.1 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 72/375 (19%), Positives = 119/375 (31%), Gaps = 45/375 (12%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M EL+ L D+L P H RLN + L AL KT+ L E+ + N +R Sbjct: 1 MNELNRLRDTLRPHLPW-HGARLNFVCLFLMALFQTKTVNLMEIATVFANPVQISSNYQR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMP-IVLVDWSDIREQKRLM-VLRASVALHG 118 + R R +R + R+ S I P + +D + + +L +V G Sbjct: 60 LQRFF--REFKFDRAEIARFVVSLI--DIPQPWTLSLDRTCWSFGQTHFNILMLAVVHEG 115 Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGW 177 + L + ++ P + +D F W + Sbjct: 116 IAFPLLWTMLDKKGNSNSGERMDLFDRFEALFPDVEVACLTADREFVGRDWLSYLLIDPE 175 Query: 178 Y-WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 + R+R + + + D S + ++L+ Sbjct: 176 VPFRLRIRHSELISPKLGGTRRSGERMFD-SLRPGEF---RQLS---------------- 215 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFR 296 +R + A + E IL TN E P Y++R IE F Sbjct: 216 ----GRRWVWGRQVYVIGS-RLADSGELLILITNACPETALPD-----YARRWGIENLFG 265 Query: 297 DLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 LK+ G L + ER +L + + G+ QG +A+ R Sbjct: 266 ALKTR--GFCLESTHFKDPERLSRLLALLSLAFTWAMKVGLWI-HQGSPIPLKAH---GR 319 Query: 357 NVLSTVRLGMEVLRH 371 S R G + LR Sbjct: 320 RSQSLFRTGFDFLRR 334 >UniRef50_C4YZ17 Transposase, IS4 family protein n=4 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YZ17_9RICK Length = 372 Score = 84.0 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 61/355 (17%), Positives = 124/355 (34%), Gaps = 45/355 (12%) Query: 21 KRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRW 80 RL L +L++ +++ + + KA+ +RI R ++++ Y Sbjct: 23 SRLECLAGMIMSLIENCSVSGKNMALGILGKAKHSSRTQRIYRFF------RDQIFNYDQ 76 Query: 81 HASFICS--GNTMPIVLVDWSDIREQKR-LMVLRASVALHGRSVTLYEKAFPLSEQCSKK 137 A FI + N I+++D + + K + +L ++ SV +Y S CS Sbjct: 77 VAKFILNIFANDKYIIVLDRTCWKFGKSDINILFLAIVFGKISVPIYWYPLEHSGACSSW 136 Query: 138 AHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAEN 196 + L + + +++D F W + + VR + A Sbjct: 137 LMEAMLERFINNFGVHKIKYLLADREFMGKEWLNFLTTKQIKFAIPVRKDMLIRITNALQ 196 Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 KP+ K+ Y + + + + H + Sbjct: 197 TKPV----------GKSFDYVKALEYIEVKGMLW------------------DHAVTLSA 228 Query: 257 YSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE 316 Y E ++A + +++ + +Y R IE F+ LKS G + S ++ + Sbjct: 229 YRNDKNELMVVAASGDIDVS----IFALYKFRWSIERLFKHLKSG--GFDIEKSHITNPD 282 Query: 317 RFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRH 371 RF ++ + + G+ + K A T R V S G++ LR+ Sbjct: 283 RFVKLVTVCAIASALIIKNGLIQHEIQPIKIRTAKTNPKRLV-SFFTYGLDHLRN 336 >UniRef50_Q3M186 Putative uncharacterized protein n=2 Tax=Anabaena RepID=Q3M186_ANAVT Length = 257 Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 43/284 (15%), Positives = 85/284 (29%), Gaps = 30/284 (10%) Query: 6 ILHDSLYQFCPE-LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRL 64 +L F + L+ +L +L + L + K + + L LP + + + R Sbjct: 1 MLASFYQNFLEKYLNKAQLITLKMLVWLLQNQKQVRIERLAATLPLPIQQNSRRRHLQRF 60 Query: 65 LGNRHLHKE--RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVT 122 L L + + + + +D + +E LMV SV R+ Sbjct: 61 LTLNALSVVLLWFPIIEAIINQHFKVGSQLTIAMDRTQWKENNVLMV---SVIYQKRAWP 117 Query: 123 LYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSR 182 +Y + + L + +L +I + + K ++ R Sbjct: 118 IYWCLLEKDGCSNLTEQQKVLRPVIRLLKKYKLVIIGDREFHSIELGSWLHKQNIGFVLR 177 Query: 183 VRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQ 242 + + + W+ L ++ + + + R GR N Sbjct: 178 QKKDTTFC----QKWQKFQPLSNIEIYPG----------VRQFYTNVKVTQKRGFGRFNL 223 Query: 243 RSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYS 286 K K+ W L TNLP + IY+ Sbjct: 224 GVYWKR------KYRGKQEKDAWHLLTNLPDLNT----ALKIYA 257 >UniRef50_C4ILZ9 Putative iso-IS10R ORF n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ILZ9_CLOBU Length = 174 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 43/174 (24%), Positives = 88/174 (50%), Gaps = 14/174 (8%) Query: 18 LHLKRLNSLTLACHALLDCKTLTLTELGRNLP---TKARTKHNIKRIDRLLGNRHLHKE- 73 L RLN+L ++ +++ L+ + + L ++ + IKRI L N+ + +E Sbjct: 4 LKSTRLNNLVAVIIGIIVSRSVILSNISQGLKDCYSRGNEESKIKRIQSFLNNKDIDQES 63 Query: 74 --RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 VY+ S+ N + ++ D + I + R ++L+ S+ + R+V L+ K F Sbjct: 64 TYEFFVYKLLKSYKSKSNRINVIF-DHTTI--EDRFVILQFSLKIGKRAVPLWYKVFKYK 120 Query: 132 EQCS--KKAHDQFLADLASILPS-NTTPLIVSDAGFK-VPWYKSVEK-LGWYWL 180 EQ + K ++ L L +L + N ++++D GFK + +K +++ LGW ++ Sbjct: 121 EQGNKDFKHVNEGLIFLHKVLKNYNYNVVLLADRGFKSIDLFKFIDETLGWNYV 174 >UniRef50_Q1AUS1 Putative uncharacterized protein n=4 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AUS1_RUBXD Length = 248 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 37/179 (20%), Positives = 72/179 (40%), Gaps = 21/179 (11%) Query: 40 TLTELGRNLPTKARTK---------HNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNT 90 L+EL R PT + H +KR+ R N + + + + C G Sbjct: 38 RLSELARAYPTPKERRVASPKHDLLHRLKRLWRFTDNERVDPLAVQLALVPHTVACLGFP 97 Query: 91 MPI-VLVDWS------DIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQF- 142 + + VDW+ E+ R +LR SV GR++ L + A+ K+ ++ Sbjct: 98 RLLGLAVDWTFSDTTLPSGERMRYQILRISVPRKGRALPLLQLAYNRDNLSPNKSQNRIE 157 Query: 143 ---LADLASILPSNTTPLIVSDAGFKVPWY-KSVEKLGWYWLSRVRGKVQYADLGAENW 197 L + LP+ P++++D GF+ + + + +++ R+R + W Sbjct: 158 QDALLAVVGALPTGVRPVVLADRGFRRASFIAWLARHHLHYVVRIRKGTCVPEASGHRW 216 >UniRef50_Q6LGR5 Putative transposase similar to Tn10 n=1 Tax=Photobacterium profundum RepID=Q6LGR5_PHOPR Length = 105 Score = 81.3 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 36/60 (60%), Positives = 43/60 (71%) Query: 27 TLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFIC 86 + ALL LTLT LGR+LP+KA+TKH IKR+DRLLGN HLH +RL +YRWH C Sbjct: 1 MDSVQALLSNDALTLTLLGRSLPSKAKTKHCIKRVDRLLGNNHLHHDRLDIYRWHCHQFC 60 >UniRef50_C1XLC5 Transposase family protein n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XLC5_MEIRU Length = 354 Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 65/378 (17%), Positives = 126/378 (33%), Gaps = 50/378 (13%) Query: 19 HLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVY 78 H RL+ L+ AL+ +++ L ++ L N +R R L L +E + Sbjct: 18 HRARLDFLSAFVLALIRVRSVNLAQIALALNPWVHIASNYRRCQRFLAEFRLQQE--VIG 75 Query: 79 RWHASFICSGNTMPIVL-VDWSDIR-EQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSK 136 R + +VL +D ++ Q + +L VA G + L + + Sbjct: 76 RLILKLLPQDPAHKLVLSLDRTEWTLGQASINLLFIGVAHQGVAYPLVWCFLGKAGSSNL 135 Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKVPWY-KSVEKLGWYWLSRVRGKVQYADLGAE 195 + L L + LP + +D F + + + + R++ + G Sbjct: 136 QERLGLLRRLLTFLPKERIQSLCADREFACTGFLRYLRWQQLPYTLRIKAGNRVTYKGRS 195 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPK 255 P+ L + L Y + P+ Sbjct: 196 R--PVQQLF-------RHLDYGAW--------------------EALPKPVKLWGQPTYL 226 Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 + S K ++L R P + Y++R +IE F+ KS G + + + Sbjct: 227 MGSRLRKGEYLLLITEAEPERAPAR----YARRWEIETLFKACKSQ--GFDFESTHLTRA 280 Query: 316 ERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN--RNVLSTVRLGMEVLRH-- 371 ER + ++ + + + G W Q ++N R + ST R G++ LR Sbjct: 281 ERIESLVALMSIALVWAHRVG------EWRLQTQPIPIKNHARKLYSTFRYGLDYLRQLL 334 Query: 372 SGYTITREDSLVAATLLT 389 + + LL+ Sbjct: 335 FAPEARKAELYACVRLLS 352 >UniRef50_B4VLK2 Transposase, IS4 family protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VLK2_9CYAN Length = 199 Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 68/194 (35%), Gaps = 26/194 (13%) Query: 113 SVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKS 171 +V RS +Y + + + + + +L S +++ D F +V Sbjct: 3 AVIWKKRSFPVYWQFLDKAGSSNISEQIAVIRPVLKLL-SRYQVVLIGDREFRRVELAYW 61 Query: 172 VEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILL 231 ++K ++ R++ L ++ + L + + + Sbjct: 62 LKKKKVFFALRIKQDTYIRQSEGN----YQQLSELGLTPGMKLFHSGVNYT--------- 108 Query: 232 YKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQI 291 K + GR N + K + K+ W + TNL + +++ +Y R I Sbjct: 109 -KKKGFGRFNLAAYWKR------KYRGSYEKQGWFILTNLS----SIDEVIQVYQSRSGI 157 Query: 292 EETFRDLKSPAYGL 305 E F+D K+ Y L Sbjct: 158 ESLFKDCKTGGYNL 171 >UniRef50_B1XQT5 Tn10-like transposase (IS4 family) n=14 Tax=Cyanobacteria RepID=B1XQT5_SYNP2 Length = 366 Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 65/379 (17%), Positives = 118/379 (31%), Gaps = 42/379 (11%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ + L H RL+ + L AL KT+ L +L A + N KR Sbjct: 1 MNQISEIRRQLRPHLGW-HGARLSFIALFLVALFRAKTVNLAKLATVWGGNAAEESNYKR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFI--CSGNTMPIVL-VDWSDIR-EQKRLMVLRASVAL 116 + R + ++ ++ A + + P VL +D ++ +L V Sbjct: 60 MQRFFQSFDVNMDK------IARMVMNIAAIPQPWVLSIDRTNWSLGTTDFNILMLCVVH 113 Query: 117 HGRSVTLYEKAFPLSEQCSKKAHD-QFLADLASILPSNTTPLIVSDAGF-KVPWYKSV-E 173 G L S L ++ P+ + D F PW + Sbjct: 114 EGIGYPLMWTMLKKKRGNSNSTERMDLLERFETLFPNIEIAYLTGDREFIGKPWLSYLML 173 Query: 174 KLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYK 233 + R+R + + + S+L S +G R+ +Y Sbjct: 174 DKPIPFRLRLRQTDKISKGKGQPAIAGSHLF-----RSLAIGETRILSGKRWVWGRQVY- 227 Query: 234 SRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEE 293 + + E I+ T P+ + Y +R IE Sbjct: 228 ------------VMGTRLDPKRRAHKNEDEFLIIIT-----THDPQNALADYRRRWGIET 270 Query: 294 TFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV 353 F LK+ G L + + R +L + + + AG+ Q + +A+ Sbjct: 271 LFGALKTR--GFCLESTHFTDKVRLSKLLALLAIGFVWAMQAGLWRHTQKPIRIIKAH-- 326 Query: 354 RNRNVLSTVRLGMEVLRHS 372 R S R ++LR Sbjct: 327 -GRRARSLFRYDFDLLRRF 344 >UniRef50_A5UPF7 Transposase, IS4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UPF7_ROSS1 Length = 397 Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 53/291 (18%), Positives = 97/291 (33%), Gaps = 42/291 (14%) Query: 23 LNSLTLACHALLDCKTLTLTELGRNLPT----KARTKHNIKRIDRLLGNRHLHKERLAVY 78 L L +L T+ L + A+ + +R+ R++ + L Sbjct: 27 RQRLALFVVGVLLAGTVVLRRVATTQTHIALGAAQAASHERRLRRIVNDPQLGAAAPMDG 86 Query: 79 RWHASFI--CSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPL---SEQ 133 R + + ++VD S + L A++ GR++ L +P Q Sbjct: 87 RVVRRVLQRLRPDQRVWLIVDESG--HSDVVRTLVAALWYRGRALPLAWVRWPAQQPHPQ 144 Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYK-SVEKLGWYWLSRVRGKVQYADL 192 S LA +A+ILP+ + +++D F P + V GW L R + + Sbjct: 145 ASWTDCQTLLAQVAAILPAGPSVTVLADRAFGCPAFTDLVAAHGWQDLVRAQRQTCLRHD 204 Query: 193 GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHP 252 + S L + + + +K + + ++ Sbjct: 205 DG-RMQAFSTLIPQAGTR--------------WCGRGQAFKKQGWRPVSVVASWRV---- 245 Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAY 303 EP +L +NLP LV Y +R I FRD K+ + Sbjct: 246 -------GCPEPLLLVSNLP----PAWDLVRPYRRRAAIAALFRDWKTSGW 285 >UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7AA71_THEAQ Length = 393 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 65/331 (19%), Positives = 119/331 (35%), Gaps = 43/331 (12%) Query: 29 ACHALLDCKTLTLTELGRNLPTKARTK-HNIKRIDRLLGNRHLHKERLA--VYRWHASFI 85 L + ++E+ +LP+ + + H K + R L N + E L VY+ A+ + Sbjct: 19 VVRGALAAGSARVSEMVASLPSPLQNRFHQAKALYRFLSNPRVEAEALLDRVYQESATAL 78 Query: 86 CSGNTMPIVLVDWSDI-----------------REQKRLMVLRASVALHGRSVTLYEKAF 128 +VL+D S + R ++ + GR Y Sbjct: 79 EGE--EVLVLLDLSPVAKPYARALEGIARVGKDRRPGYELLTALGLDPAGRLALGYAHLV 136 Query: 129 PLSEQCSKKAHDQF---LADLASILPS-NTTPLIVSDAGFK-VPWYKSVEKLGWYWLSRV 183 E+ + + L + V+D GF + V LG ++ RV Sbjct: 137 AYGERGFASLPKEVEGAIEAARERLGGVGRRLVYVADRGFDDRKVFGQVLALGEEFVVRV 196 Query: 184 RGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQR 243 + + G +L ++SS + G + + ++ L+ G + Sbjct: 197 YRDRKLGEGG--------SLAKVASSLALPCGEEVELRVGGRYQRVRLH----FGWREVE 244 Query: 244 STRTHCHHPSPKIYSASAKEPWILATNLPVEIRT-PKQLVNIYSKRMQIEETFRDLKSPA 302 H ++ + + W L T+LPV R Q+V Y +R ++E FR LK+ Sbjct: 245 VEGRRLHLVVCRVPALGRRGEWWLLTSLPVRGREEAAQVVEAYRRRWEVERFFRLLKT-- 302 Query: 303 YGLGLRHSRTSSSERFDIMLLIALMLQLTCW 333 GLGL + R ++ + L L + W Sbjct: 303 -GLGLETFQVRGLARIRKVVAVLLGLAVFLW 332 >UniRef50_Q1IXF5 Transposase, IS4 n=6 Tax=Bacteria RepID=Q1IXF5_DEIGD Length = 352 Score = 77.4 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 58/366 (15%), Positives = 120/366 (32%), Gaps = 51/366 (13%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLG 66 L L + P L +RL L+ A++ +++ L +L + + +R+ R + Sbjct: 13 LTALLAEHFP-LDPRRLTVLSALILAVIQARSVVLYQLVQIVDLPGSNDTVYQRLKRFV- 70 Query: 67 NRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGRSVTLYE 125 L V R+ + + ++++D ++ + Q+ + +L SV S L Sbjct: 71 --QFALPDLLVARFVLAHL-RDEQHLLLVLDRTNWKLGQQDINILLLSVRWQTFSFPLVW 127 Query: 126 KAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVR 184 P S + + L +L T + +D F W+ ++ ++ + R+R Sbjct: 128 TLLPHSGNSNMATRIALVERLLPLL-QGKTLFLAADREFVGGEWFVALRRMSLSPVIRLR 186 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 G+ W L G Sbjct: 187 ADSMVE--GSPVWVRFKKLKP--------------------------------GEVRVWY 212 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 TH + + ++ + L ++ + Y+ R E + LKS G Sbjct: 213 KPTHVYGVTLRVLACQNVHGQTLFLAYQGH---AEKALKRYALRWTAENMHQALKSR--G 267 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 L + + R +L + + + C L G Q++ + + S R Sbjct: 268 FFLESTHLTDPSRVSTLLAVVALAFVWCCLVGEFEQQRDPSRCLRHGYPPK----SLFRR 323 Query: 365 GMEVLR 370 G++ LR Sbjct: 324 GLDALR 329 >UniRef50_C5UVK8 Putative transposase n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UVK8_CLOBO Length = 250 Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats. Identities = 50/274 (18%), Positives = 87/274 (31%), Gaps = 65/274 (23%) Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLASILP-SNTTPLIVSDAGFKVPWYKSVEKLGW 177 ++V L E A L + S + + L L + P EKL W Sbjct: 23 KTVVLSEIAQELKDSYSSGTEESKIKRLQRFLSNKSINP----------------EKLKW 66 Query: 178 YWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK 237 + R + G K + ++ + S+ K +LT N +C + Sbjct: 67 KYCIRCTKDLCVTIKGKLKIKKLEDIKAL-SNKGKNFYNIKLTAQN-YNCNL-------- 116 Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRD 297 + A+E W + NL + Y KR QIEE F+D Sbjct: 117 ----------------SVCKAKDAEETWFIVHNLEKSF-----AIREYKKRFQIEEMFKD 155 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV---- 353 KS G L + + + + ++ + G+ K + NT+ Sbjct: 156 FKSG--GFNLESTWSMNIQYIKMLYFCISIAYCFIITLGISCGKD------KNNTIIGVI 207 Query: 354 -----RNRNVLSTVRLGMEVLRHSGYTITREDSL 382 + + S R G++ + Y+ E L Sbjct: 208 KDLNGKKVRIYSLFRAGLKWFKRCYYSKRNEYYL 241 Score = 43.9 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 3/61 (4%) Query: 18 LHLKRLNSLTLACHALLDCKTLTLTELGRNLP---TKARTKHNIKRIDRLLGNRHLHKER 74 L KRLN+L ++ KT+ L+E+ + L + + IKR+ R L N+ ++ E+ Sbjct: 4 LSSKRLNNLVAMIIGIIISKTVVLSEIAQELKDSYSSGTEESKIKRLQRFLSNKSINPEK 63 Query: 75 L 75 L Sbjct: 64 L 64 >UniRef50_A5UY16 Transposase, IS4 family n=9 Tax=Roseiflexus sp. RS-1 RepID=A5UY16_ROSS1 Length = 416 Score = 74.0 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 56/315 (17%), Positives = 109/315 (34%), Gaps = 57/315 (18%) Query: 76 AVYRWHASFICSGNTMPIVL-VDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQC 134 + RW + G P+VL +D S R+ +++LR SV G ++ + P ++ Sbjct: 99 DLLRWIRAHWTGG---PLVLGLDASHRRDD--VVLLRMSVLYRGTALPVAWVIVPANQPG 153 Query: 135 SKKAH-DQFLADLASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADL 192 + + H ++ L S LP + L+++D G P + ++ ++ + RVR +A Sbjct: 154 AWEPHWERMLRWARSALPLDQEVLVLADQGLWSPRLWHAIRSQQFHPIMRVRTTSTFAPT 213 Query: 193 GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHP 252 G ++ ++ + + +K K Sbjct: 214 GQAR----QSVLRLAPGPG-----------HGWVGVGVAFKHAPK----------RIAGT 248 Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 + A EPW+L T+LP Y+ R E FR KS + + Sbjct: 249 LAVAWGADHAEPWVLLTDLPPAQVDAAW----YALRSWDEAGFRQSKSMGWDWQRG--QV 302 Query: 313 SSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN----------------- 355 + + L+ + L G + + ++ Sbjct: 303 TDPDAVAWQYLVVATVTLWTVAVGTRIEDAE-QQGVPPGRLKRAPPTTGAPPRRRWSGTA 361 Query: 356 RNVLSTVRLGMEVLR 370 + V+S +R GM+ LR Sbjct: 362 QRVISLLRRGMQHLR 376 >UniRef50_B5K928 Transposase, IS4 n=23 Tax=Alphaproteobacteria RepID=B5K928_9RHOB Length = 364 Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 53/362 (14%), Positives = 119/362 (32%), Gaps = 53/362 (14%) Query: 18 LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAV 77 ++L ++ L +++ +T+ L+ L PT ++ + +R+ R + L + A Sbjct: 24 IYLNLFKTMCLLIMGMVNARTVNLSHLACEFPTDSKVESTYRRLQRFFQHVDLGSDWAA- 82 Query: 78 YRWHASFICSGNTMPIVLVDWSDIREQKRL--MVLRASVALHGRSVTLYEKAFPLSEQCS 135 + + +D ++ + +R ++ A V R + L + Sbjct: 83 --PLLVEMIGSGPTWHLCLDRTNWKIGQRHVNFLVLALVTRRHR-IPLMWSVLGRAGNSD 139 Query: 136 KKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGA 194 + S+ +T +++D F W + K ++ R+ Sbjct: 140 TAQRIALMKRYLSVFEVSTIKFLLADREFIGAQWLDFLHKNNVPFVIRI----------- 188 Query: 195 ENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSP 254 + L + K+ +S + + + + + Sbjct: 189 ---------------KANQLVTTQDGKTQNLSTLLRTCRGK-RNFDARFGGNNLGEATWF 232 Query: 255 KIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 + K +L + V R + + Y KR IE F D K+ G L +R + Sbjct: 233 SFAAKRIKGGELL---IVVSNRPAHRALATYKKRWAIESLFGDTKTR--GFNLEDTRLTI 287 Query: 315 SERFDIMLLIALMLQLTCW-----LAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVL 369 S++ +++L + + L G K+ +F S R+G + L Sbjct: 288 SKKLELLLGLVALAVAWASKTATKLIGGGKMKRKKHGYF---------AKSFFRIGFDQL 338 Query: 370 RH 371 R Sbjct: 339 RK 340 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 72.1 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 79/215 (36%), Gaps = 8/215 (3%) Query: 152 SNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSH 210 ++T+ + ++D G++ + VE G Y+L RV+ P S D + Sbjct: 181 NDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKDITSNGITSKLTMLPESGEFDEWVNV 240 Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATN 270 + T K+NP + ++ K + + + + TN Sbjct: 241 TLTKKQTNEVKANPKKYR-VIDKKTPFDYLDLHFNNFYEMKMRVIRFPIPQGSYECIITN 299 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF-DIMLLIALMLQ 329 LP + ++ +Y+KR IE +FR+LK Y LGL + E + + Sbjct: 300 LPQDKFNSDEIKRLYAKRWGIETSFRELK---YALGLTRFHSKKPEYIMQEIWSRMTLYN 356 Query: 330 LTCWLA--GVHAQKQGWDKHFQANTVRNRNVLSTV 362 +A V +K+G +Q N R + Sbjct: 357 FCEIIATNVVINEKKGCKHTYQLNYTRAIRICCYF 391 >UniRef50_Q1QJQ9 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QJQ9_NITHX Length = 152 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 44/113 (38%), Gaps = 5/113 (4%) Query: 45 GRNLPTKART-KHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIRE 103 PT+ KH IK++DRLL N+ + ++ + I +V +DW+D Sbjct: 10 ATRWPTRGLLGKHAIKQVDRLLSNQGIVV--WDMFAAWVTQIVGQRKAIVVAMDWTDFDA 67 Query: 104 QKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKA--HDQFLADLASILPSNT 154 + + + HGR+ L E + D LA LA LP Sbjct: 68 DDQTTLALNLASNHGRATPLLWLTVLKDELKDSRNDFEDLCLARLAESLPDGI 120 >UniRef50_C8Q1E5 Transposase, IS4 family n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8Q1E5_9GAMM Length = 288 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 43/280 (15%), Positives = 92/280 (32%), Gaps = 39/280 (13%) Query: 93 IVLVDWSDI-REQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILP 151 + +D ++ + L + V G ++ LY + + + + Sbjct: 12 TLTIDRTNWKWGKSNLNIFMLGVVYKGIAIPLYWQMLDKRGNTNHLERCELIDRFIKQFG 71 Query: 152 SNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSH 210 + +IV+D F W+ + + R++ + + + I L S Sbjct: 72 KDNLEMIVADREFVGEKWFNWLTNNHIPFAIRIKKNSKVKNHHGKL-VQIKELLRHVSHQ 130 Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATN 270 + LT C + ++ R K I+ATN Sbjct: 131 ETYRHGRILTVDG---CLVRVFAKRDKDYGLV-----------------------IVATN 164 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQL 330 + + Y+KR +IE F LK G L + + +R ++ + + Sbjct: 165 QLETV----DAMTSYAKRWEIETLFACLK--GRGFNLEDTHLTHLDRVSKLVAVNALAFC 218 Query: 331 TCWLAGVHA-QKQGWDKHFQANTVRNRNVLSTVRLGMEVL 369 + G++ + + + ++N R S LG++VL Sbjct: 219 WAYHVGIYKDKDKPLKRKLKSNA---RPQASLFALGLDVL 255 >UniRef50_Q1QGK1 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QGK1_NITHX Length = 155 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 1/69 (1%) Query: 295 FRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVR 354 FRD K +G+GL R + +R D +LL+ + L G + G D+H + NT + Sbjct: 88 FRDTKDLRFGMGLGVLRIADPQRRDRLLLLNAFAIVLLTLLGPAGESLGMDRHLKVNTAK 147 Query: 355 NRNVLSTVR 363 R S R Sbjct: 148 -RRTHSLFR 155 >UniRef50_UPI000197B669 hypothetical protein BACCOPRO_01365 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B669 Length = 270 Score = 67.8 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 63/162 (38%), Gaps = 7/162 (4%) Query: 2 CELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRI 61 L I+ + + ++L R+ + HAL +T++L +L +PT N++RI Sbjct: 32 QILPIMQEYFGKS---MNLARIKLMAYMLHALCVVQTVSLHKLASAMPTSVERDSNLRRI 88 Query: 62 DRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGRS 120 R + N L+ + +A+ + ++ +D ++ + + + +L + G + Sbjct: 89 QRFIANYALNLDLVAM---MIFSLLPVKNGLVLSMDRTNWKFGEFNINILTLGITYKGVA 145 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDA 162 L + + + + + +V+D Sbjct: 146 FPLLFSLLNKRGNSNWEERKDIMERFIRLFGHDCIDCLVADR 187 >UniRef50_Q9RZJ3 Transposase, putative n=9 Tax=Deinococcus radiodurans RepID=Q9RZJ3_DEIRA Length = 327 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 51/340 (15%), Positives = 115/340 (33%), Gaps = 51/340 (15%) Query: 33 LLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMP 92 ++D +++ +L ++P + + +R DR + L +R + G Sbjct: 1 MIDARSVNHHDLSAHMPGMSTPQGKKRRADRTFRDEQL--DRGFFIALLVVHLPPG--KV 56 Query: 93 IVLVDWSDIREQKR-LMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILP 151 ++ +D ++ + + L +HG ++ L S A + L P Sbjct: 57 LLSLDRTNWEHGETPINFLVLGAVVHGFTLPLIWVPLDQSGNSHTYARMWLVLKLLRAWP 116 Query: 152 SNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSH 210 + +V+D F W++ + + G R+R D+ + W Sbjct: 117 AKRWLGLVADREFIGAEWFRFLRRQGIKRAIRIRQTDMLDDMNGKEWFE----------- 165 Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATN 270 + + + I ++ ++ + P + I+AT+ Sbjct: 166 -----HVQHGHFHEIGEKVFVF--------GELMRVVATRSPVGDLV--------IIATD 204 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQL 330 + ++ +Y +R IE TF K G L + + R + + + + Sbjct: 205 -----FSARKTWRLYKQRWSIECTFSSFK--KRGFDLERTGMTERSRLQRLFGLVTLAWM 257 Query: 331 TCWLAGVHAQKQGWDKHFQANTVRN-RNVLSTVRLGMEVL 369 C GV + + +++ R +S VR G + L Sbjct: 258 FCLRLGVWLSQT-----WPIPVLKHGRRAVSLVRHGAQHL 292 >UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TN04_ALKMQ Length = 454 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 78/197 (39%), Gaps = 12/197 (6%) Query: 144 ADLASILPSNTTPLIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISN 202 L +++ L+++D G F +++ + G Y+L+R++ + + Sbjct: 179 DKLLAMVNPGE--LLITDLGYFSKAFFEKLSTKGSYYLTRIKKNSIVYVEKSGQLTKVDL 236 Query: 203 LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAK 262 + + T + + + C+ + + K +R K SA Sbjct: 237 TDLLKGTVVDTEVFLGIAHKKQLKCRFVAIRLPEKVVNQRRRKANQQAKAQGKQLSAKET 296 Query: 263 E--PW-ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD 319 E W I+ TN+ + +P+ ++Y R QIE F+ LKS L + + + + Sbjct: 297 ELLAWNIIVTNVTKDKLSPEAACDLYRARWQIELVFKSLKSY---LNIDKIGSCGKYQLE 353 Query: 320 IML---LIALMLQLTCW 333 ++ LIA++ + + Sbjct: 354 CLIYGRLIAVVAMFSLY 370 >UniRef50_B7I4U9 Transposase 1 n=31 Tax=Bacteria RepID=B7I4U9_ACIB5 Length = 189 Score = 65.5 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 66/189 (34%), Gaps = 11/189 (5%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M L+ L+ L + + L L ++ +T L+ + LP K + +R Sbjct: 1 MTHLNELYLILNKSLKW-NKSHLKCFALIMLVIILKQTCNLSSASKALPIKCLPQSFYRR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICS--GNTMPIVLVDWSDIREQKR-LMVLRASVALH 117 + R ++ YR + I + + +D ++ + KR + +L ++ Sbjct: 60 MQRFFAGQYFD------YRQISQLIFNMFSFDQVQLTLDRTNWKWGKRNINILMLAIVYR 113 Query: 118 GRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLG 176 G ++ + K + +I + + +D F W+ + + Sbjct: 114 GIAIPILWTLLNKRGNSDTKERIALIQRFIAIFGKDRIVNVFADREFIGEQWFTWLIEQD 173 Query: 177 WYWLSRVRG 185 + RV+ Sbjct: 174 INFCIRVKK 182 >UniRef50_A7N4N2 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N4N2_VIBHB Length = 54 Score = 65.1 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 33/50 (66%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT 50 M ++ IL ++ CP++H KRL SL LA A+LD LTLT++GR L T Sbjct: 1 MRDIQILQQTIENQCPDIHKKRLRSLMLATKAVLDGSNLTLTKIGRALST 50 >UniRef50_Q7NIQ3 Glr2130 protein n=2 Tax=Gloeobacter violaceus RepID=Q7NIQ3_GLOVI Length = 190 Score = 65.1 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 49/158 (31%), Gaps = 7/158 (4%) Query: 26 LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRH--LHKERLAVYRWHAS 83 L + H L + L L L LP K + R L L L + Sbjct: 30 LHILLHTLQTQQNLCLERLANALPLPITVDSRRKAVQRFLLLPSLCLWHLWLPLLAQIIE 89 Query: 84 FICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFL 143 ++ +D ++ + LMV S+ R++ ++ + + S L Sbjct: 90 HFAVQPQRLVLAIDRTNWWKYNLLMV---SLVWDRRALPVFWRLLNHAGNSSLPERRSVL 146 Query: 144 ADLASILPSNTTPLIVSDAGFKVPWYK-SVEKLGWYWL 180 + + +++ D F + ++ + Sbjct: 147 LPVLKYF-HHKQIIVLGDREFGSVGFANWLQSQKVSYC 183 >UniRef50_B1QZ52 Putative transposase n=2 Tax=Clostridium butyricum RepID=B1QZ52_CLOBU Length = 190 Score = 65.1 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 43/132 (32%), Gaps = 10/132 (7%) Query: 254 PKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTS 313 +A A E W +A N I + Y K IEE F+D K G L + + Sbjct: 57 SVCKAADADEVWYIANNFDEAID-----IREYKKIFDIEEMFKDFKGG--GFNLEDTWSQ 109 Query: 314 SSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV---RNRNVLSTVRLGMEVLR 370 +M L + G K +K A + + S R G + + Sbjct: 110 DIHYIKMMYLCISIAYCWIITLGTSCTKDKKNKLIGAVKFLKGKKVRIYSLFRAGYKWFK 169 Query: 371 HSGYTITREDSL 382 Y+ E L Sbjct: 170 RGYYSNRSEYYL 181 >UniRef50_Q10VE7 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VE7_TRIEI Length = 159 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 67/177 (37%), Gaps = 31/177 (17%) Query: 160 SDAGFKV----PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 +D F W K +K Y++ R + + + +S L G Sbjct: 4 ADREFHSIFLSHWLKKYQKQDLYFVFRQKKTTIIKR--GKKYCKVSELKV-------NFG 54 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 +L ++ + + T ++ K S + W + +NL Sbjct: 55 ETKLLLNHIFP------------KILKVGTYNLLNYKKQKYRQKSVVDKWYILSNLS--- 99 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTC 332 +PK++ IY +RM IE F+ K+ +Y L ++ + +++LLIA+ ++ Sbjct: 100 -SPKKIKKIYIQRMGIEAMFKYYKTGSYN--LESAKANKMRLSNLILLIAISYTISS 153 >UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=Q73IB8_WOLPM Length = 442 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 85/234 (36%), Gaps = 14/234 (5%) Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADL 192 ++ + L++IL ++ L++SD G+ VP +K + ++G Y++SR + D+ Sbjct: 171 EGVRSDQGYRKHLSNILSND---LLISDLGYFVPSSFKQINEIGAYFISRYKSDTNIYDV 227 Query: 193 GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHP 252 L + L K I +I+ K + +R Sbjct: 228 ETNQ---KMELLECLEDKLFLENEVLLGKEAKIRVRIICQKLTEEQSMARRRKANRLARS 284 Query: 253 SPKIYSASAKEP--WILA-TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRH 309 S ++ W + TN+P + +Q++ IY R QIE F+ KS + L Sbjct: 285 QGYTSSKRNQKLLNWSIFITNVPENKISAEQVLTIYRVRWQIELLFKLYKSH---IRLDK 341 Query: 310 SRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHF-QANTVRNRNVLSTV 362 + + + + + G K+ + +A R V+ Sbjct: 342 LKGKPCRVLCELYAKLCAILIFHGIVGCTEVKKNTELSLTKAFIELKRRVIELF 395 >UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW Length = 417 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 67/180 (37%), Gaps = 13/180 (7%) Query: 147 ASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISN--- 202 +ILP++ + D GF V ++ G Y+++R+R ++ WK Sbjct: 179 HTILPNDLC---IRDLGFFSVAALTEIDARGAYYITRLRSDMKVYIKENSQWKEWDWESL 235 Query: 203 ---LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSA 259 L + S + + P L + + R R + + Sbjct: 236 GNQLKEGESVEMEHVYIGHERLYIPRLIFRRLTEEEWQKRMAYVRKREKRKGKALTRQTL 295 Query: 260 SAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD 319 K+ IL TNLP E +Q+ +YS R QIE F+ KS L + ERF+ Sbjct: 296 EQKKYHILLTNLPQESFDGQQVYELYSLRWQIELLFKAWKSV---FDLEKVKKMKKERFE 352 >UniRef50_Q1ARL9 Transposase, IS4 family n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1ARL9_RUBXD Length = 335 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 47/250 (18%), Positives = 84/250 (33%), Gaps = 50/250 (20%) Query: 99 SDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHD-----QFLADLASILPSN 153 SD + MV+ A GR++ + + + + + L ++ ++ Sbjct: 110 SDGKSLGFWMVVFAQ-PYRGRAIPFHFGIYSEATLKEQVTSRNLRWRELLWEIEELV--G 166 Query: 154 TTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKT 213 TPLI W K++E+ W+ R+ Sbjct: 167 DTPLIFDREFSAQAWLKALEEAQCKWVVRLNKGSGVK----------------------- 203 Query: 214 LGYKRLTKSNPISCQILLYKSRSKGRKNQRST---RTHCHHPSPKIYSASAKEPWILATN 270 + L + P+ + KG K R ++ KEP + N Sbjct: 204 -FFDELGEEIPLLIE--------KGEKRNIEGCYYRGETKANVAGVWRKGCKEPLWVMGN 254 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQL 330 P +LV +Y +RM+IE+TFRD KS LG+ + +I L + L+ Sbjct: 255 ----FLPPDELVEVYEERMKIEQTFRDAKSL---LGMEKVMNKKRVQLEITLALMLLAYG 307 Query: 331 TCWLAGVHAQ 340 + G + Sbjct: 308 LGLMVGEAVR 317 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 54.0 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 71/192 (36%), Gaps = 11/192 (5%) Query: 157 LIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 LI+ D GF W + +++ G +++SRV+ + + +++ S L Sbjct: 190 LILLDLGFYDFWLFDRIDQNGGWFVSRVKDNANFEIVEELRTWRGNSIPLEGESLQAVLD 249 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 + I +I L R +G + + + TNL + Sbjct: 250 DLQ---RQEIDVRITLSFERKRGSGASATRTFRLVGLRNEETEEYH----LYLTNLGNDD 302 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 + + +Y R ++E F++LKS GL T+ + + ++++A + + + Sbjct: 303 YSAPDIAQLYRARWEVELLFKELKSR---FGLDEINTTDAYIIEALIIMAAISLMMSRVI 359 Query: 336 GVHAQKQGWDKH 347 + + Sbjct: 360 VDELRSLEARQR 371 >UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=Streptococcus RepID=A4W4J4_STRS2 Length = 440 Score = 54.0 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 60/322 (18%), Positives = 114/322 (35%), Gaps = 30/322 (9%) Query: 15 CPELHLKRLNSLT--LACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHK 72 PE R + LT A+L TL + +L ++R +L H+ Sbjct: 37 HPEKDFSRKSQLTMETMIQAILTMGGNTLAKELLDLDLPVSQSAFVQRRYQL-----KHQ 91 Query: 73 ERLAVYRWHASFICSGNTMPIVLVDWSDI-------REQKRLMVLRASVAL---HGRSVT 122 A++ S I + +PI+ VD SD+ + H ++ Sbjct: 92 AFKALFANITSKIPTFKDLPILAVDGSDVVLPRNRSDKTTTFQTGPHHTPYTLIHINALY 151 Query: 123 LYEK--AFPLSEQCSKKAHDQ--FLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGW 177 E+ L Q +++ ++ F+ + S L++ D G++ ++ W Sbjct: 152 NLEQEIYHDLRIQNNREVDERAAFIDMMESC--PFEQALVIMDRGYESYNVMAHCQERNW 209 Query: 178 YWLSRVRGKVQYADLGAE--NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 235 ++ R+R G + D++ +T K L + P L + + Sbjct: 210 SYIIRIRDGNHSMKSGFNLPDTPCFDEEFDLNICRKQTNVMKELYRDFPNQYHFLPHNAS 269 Query: 236 -SKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEET 294 + R + + +P T + +P++L ++Y+ R IE + Sbjct: 270 FDLLPNSSRKSDPISFYDLHFRMVRLEIKPGFFETLVTNTDYSPEKLKDLYAYRWGIETS 329 Query: 295 FRDLKSPAYGLGLRHSRTSSSE 316 FRDLK Y +GL H E Sbjct: 330 FRDLK---YSIGLTHFHAKKKE 348 >UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales RepID=A2RJ55_LACLM Length = 439 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 73/210 (34%), Gaps = 8/210 (3%) Query: 157 LIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 +I++D G++ Y+ ++K G +L R + L + P D + L Sbjct: 196 IIIADRGYESFNVYEHIKKSGQKFLIRAKDTKSNGLLNGLD-LPSDGTFDKKIT--LQLT 252 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 ++ K L+K + RS T+ + L TNL + Sbjct: 253 RRQTNKVKKDKHYHFLHKRANFDYLPIRSKETYPISLRVVRIKLNEDTYESLVTNLDPFL 312 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE-RFDIMLLIALMLQLTCWL 334 T + L +Y R IE +FR+LK Y LGL H + + + +M + + Sbjct: 313 FTSEDLKVLYHLRWGIETSFRELK---YALGLSHFHSKKLDFIIQEIFARLIMYNFSMTI 369 Query: 335 AGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 +Q N + + L Sbjct: 370 TLAVVLSNRLKHSYQINFTQAFGICRRFFL 399 >UniRef50_B0JGV7 Putative uncharacterized protein n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JGV7_MICAN Length = 141 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 48/125 (38%), Gaps = 15/125 (12%) Query: 18 LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAV 77 +H ++++L LA L+ KT+ L+EL KA + N KR+ R N L Sbjct: 12 IHQNKIDALLLA---LIKVKTVNLSELAVGFGGKALKESNYKRLQRFFRNFELD------ 62 Query: 78 YRWHASFICSGNTMP---IVLVDWSDIREQKRLM--VLRASVALHGRSVTLYEKAFPLSE 132 Y A + +P ++ +D + E +L + G ++ + Sbjct: 63 YSEIAKIVVGWLKLPQPWVLSLDRTTW-ELGEHCYNILTVGIVHEGVAIPILWWLLKKKG 121 Query: 133 QCSKK 137 + + Sbjct: 122 NSNSE 126 >UniRef50_D0SG98 Transposase n=3 Tax=Gammaproteobacteria RepID=D0SG98_ACIJO Length = 426 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 51/318 (16%), Positives = 97/318 (30%), Gaps = 48/318 (15%) Query: 60 RIDRLLGNRHLHKERL--AVYRWHASFICSGNTMPIVLV-DWSDI--------------R 102 + R L N + +L + I + ++V DWS + Sbjct: 27 AVWRFLNNNKISFSQLNQPIKLLACEQIKTSPHQYALIVHDWSQLQYVKHSHKVQRLQRT 86 Query: 103 EQKRLMVLRASVALHGRS-VTL------------YEKAFPLSEQCSKKAHDQFLAD---- 145 E L++S+ S + + F + +K+H LA+ Sbjct: 87 EANSGYELQSSLLFDASSGLPIAPLAQTLTDASGCYSTFSE-QYSERKSHLDSLAEQIKT 145 Query: 146 LASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHD 205 + T I+ G + + + G+ WL R + + G Sbjct: 146 IEQYPIEKTKVHIIDREGDSIAHLREISSHGFKWLIRAKESHRIEHQGETYKVAEVAEKV 205 Query: 206 MSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKN---QRSTRTHCHHPSPKIYSASAK 262 ++ + I + ++ RK+ QR + ++ A K Sbjct: 206 VTQQVKPIAYKGNRHMLHVGETDIRITRAAKPKRKDDLGQRVAPQPGKAVTARLIVAVVK 265 Query: 263 EP-------WILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 + W L +N+ EI +L Y R IE F+ LK + + ++ Sbjct: 266 DAQGKTVARWSLISNVSSEID-AVELTTWYYWRWTIECYFKLLKQAGHNV--ESWLQTTP 322 Query: 316 ERFDIMLLIALMLQLTCW 333 LLI+ M + W Sbjct: 323 AAILRRLLISSMACVLTW 340 >UniRef50_Q6ZER8 Putative uncharacterized protein sll5062 n=1 Tax=Synechocystis sp. PCC 6803 RepID=Q6ZER8_SYNY3 Length = 151 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 23/107 (21%), Positives = 39/107 (36%), Gaps = 9/107 (8%) Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 +L ++ P + T + Y R IEE F D +S + L R + + + Sbjct: 6 LLFSDEPTCLHT----IQEYGLRFDIEEAFLDDQSNGWNLQKSEIRFVCA--LSRLFFLL 59 Query: 326 LMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHS 372 + L GV G + + R S R+G + L+ S Sbjct: 60 ALATLYATAQGVEVFATGKHRWVAPHWFRGN---SYFRIGWDWLKTS 103 >UniRef50_A7HFH6 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HFH6_ANADF Length = 131 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 18/59 (30%), Positives = 29/59 (49%) Query: 306 GLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 GL +R R D LLI+ + + G + G D++ +ANTV+ R +L R+ Sbjct: 64 GLSATRIGDPGRRDRELLISAIAIALHTILGASGEAIGIDRYLKANTVKPRIILLLNRV 122 >UniRef50_UPI0001C16BE8 Transposase, IS4 protein n=1 Tax=Cylindrospermopsis raciborskii CS-505 RepID=UPI0001C16BE8 Length = 231 Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 49/154 (31%), Gaps = 28/154 (18%) Query: 34 LDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMP- 92 + + L++L P + + + + R LG L + L + + + P Sbjct: 30 QAYRQVKLSKLASLFPQPIKYESRKRNLQRFLGINKLCVKLL--WFPLIKYWIRQSLTPQ 87 Query: 93 ---------------------IVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 +V +D + + + MV ++ ++ LY + Sbjct: 88 QLNREQRRYFHKKQYQKYGYWMVALDRTQWKGRNIFMV---TLVWGTHALPLYWETLNHV 144 Query: 132 EQCSKKAHDQFLADLASILPSNTTPLIVSDAGFK 165 + + + +L ++++D F+ Sbjct: 145 GNSNLSTQKRLIKTAIKLLKK-CRIVVLADREFR 177 >UniRef50_A5WBL3 Transposase, IS4 family n=2 Tax=Bacteria RepID=A5WBL3_PSYWF Length = 427 Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 58/176 (32%), Gaps = 13/176 (7%) Query: 169 YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQ 228 + ++ + +++RV+ + G S + + K + + Sbjct: 185 MRQWDEHDYKFITRVKAGSYLSYEGKSQRCSQIAGQLNFSYQRQVNYKGKAAKQYIATAK 244 Query: 229 ILLYKS-------RSKGRKNQRSTRTHCHH--PSPKIYSASAK--EPWILATNLPVEIRT 277 ++L +S + G++ +IY K W L +NL Sbjct: 245 VVLTRSAKPQAIDPATGKRIAPIKGKPLSLLLTVSRIYDDQDKRLATWYLLSNLQEPSVN 304 Query: 278 PKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCW 333 + Y R QIE F+ LKS GL L S + + LLIA W Sbjct: 305 GADISQWYYWRWQIESYFKLLKSA--GLQLESWLQQSGDAYFKRLLIASQACTLVW 358 >UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PXV1_9BACT Length = 449 Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 42/177 (23%), Positives = 66/177 (37%), Gaps = 19/177 (10%) Query: 157 LIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 L++ D G F + +E G Y+LSR + P + D+ S K +G Sbjct: 187 LLLRDLGYFDLSVLGDIEGKGAYYLSRFFKSTKVYLSAD----PGAEAIDLVSYVKKHIG 242 Query: 216 YKRLTK------SNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI--- 266 K L I +++ Y++ +R S K S E W+ Sbjct: 243 NKGLADMEVYLGEERICSRLIAYRAPGHVINERRRKAKRAVQKSGKTLSREYLE-WLDYS 301 Query: 267 -LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIML 322 TN+ EI +P+ + IY R QIE F+ K + R + ER +L Sbjct: 302 FYITNVGAEIWSPEVVGTIYRIRWQIELVFKQWKQL---FRMDVMRGTREERIRCLL 355 >UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geobacillus kaustophilus RepID=Q5L3A2_GEOKA Length = 453 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 77/204 (37%), Gaps = 9/204 (4%) Query: 164 FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAEN---WKPISNLHDMSSSHSKTLGYKRLT 220 F + +++ G +++SR++ V + W+P L + + L + ++ Sbjct: 199 FSLEGLQAIHDAGAFYISRLKHNVGIYQKEGDRFRKWEPEDFLAVLQPGETMELEHAYVS 258 Query: 221 KSNPISCQILLYK-SRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPK 279 ++++Y+ + + R+ + + + ++ TN+P + Sbjct: 259 GKKVHQPRLIVYRLTEEQERQKEGQWKQKAKQKGAAYVTRRPHPIYVYITNIPAIYTSLH 318 Query: 280 QLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHA 339 ++ +YS R QIE F+ KS + + RF L L+ L + V Sbjct: 319 EIHTLYSLRWQIEVVFKTWKSL---FHIHRFKPMKGARFQCHLYGTLIALLIS--STVMF 373 Query: 340 QKQGWDKHFQANTVRNRNVLSTVR 363 + + W Q + +S ++ Sbjct: 374 KMREWLYRKQKKELSEYKAMSMIK 397 >UniRef50_Q6MB98 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MB98_PARUW Length = 146 Score = 50.9 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 34/147 (23%), Positives = 55/147 (37%), Gaps = 6/147 (4%) Query: 26 LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFI 85 +T L KT+ L+EL L +KA+ N KRI R + V I Sbjct: 1 MTNLLLGLFIVKTVNLSELATVLYSKAKIDSNFKRIQRFFNWLTFLNDYQEVITDLVIII 60 Query: 86 C-SGNTMPIVLVDWSDIREQKRLM-VLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFL 143 N + +D +D + K+ + +L V G S+ L L + K D+ L Sbjct: 61 LDLKNKKNDLALDRTDWKFGKKHINILTLGVNFKGISIPLAW--ISLGRAGNSKTLDR-L 117 Query: 144 ADLASILPSNTTPLIVSDAGF-KVPWY 169 + L ++ +D F W+ Sbjct: 118 SVLKRVMDKIHINSFTADREFIGSEWF 144 >UniRef50_A5FWE3 Transposase, IS4 family protein n=2 Tax=Acidiphilium cryptum JF-5 RepID=A5FWE3_ACICJ Length = 453 Score = 50.9 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 50/126 (39%), Gaps = 5/126 (3%) Query: 208 SSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP--W 265 + S+T R+ + + + + R G + T + ++ P W Sbjct: 232 VAPSRTGIAARVATVALRAGTVTICRPRHGGDVGGPAHLTLTMVEAREVDWNGEGTPLLW 291 Query: 266 ILATNL-PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLI 324 L T + ++ ++V +Y R +IEE FR LKS GL L ++ + R + LI Sbjct: 292 RLLTTIETIDADGAAEIVRLYRLRWRIEEVFRSLKSD--GLRLEETQMQDAGRLFKLALI 349 Query: 325 ALMLQL 330 L Sbjct: 350 GLAAAT 355 >UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostridium sp. SS2/1 RepID=B0NXD2_9CLOT Length = 439 Score = 50.9 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 9/162 (5%) Query: 155 TPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKT 213 L ++D G++ ++ G Y+L R R + P D + ++ T Sbjct: 190 KALFIADRGYESYLLMAQIQHDGNYFLIRAREDFGQGSMIKGYPFPRDGTFDKTVTYIYT 249 Query: 214 LGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYS---ASAKEPWILATN 270 + TK+NP LYK + + H + + + L TN Sbjct: 250 KTQNKRTKANPE-----LYKRVATRNSPYFINKEHPYVKMTLRFVMIVLPNGQKECLITN 304 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 LP + L +Y R +IE +FR +K A L + Sbjct: 305 LPANKFPSETLKKLYCIRWKIETSFRLIKYSANLLEFHSKKI 346 >UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_METBF Length = 435 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 69/181 (38%), Gaps = 16/181 (8%) Query: 157 LIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQ---YADLGAENWKPISNLHDMSSSHSK 212 +++ D GF K + VE+ G Y++SR+R + + + + Sbjct: 189 ILLVDLGFYKTQMFARVEENGGYFVSRIRKNMDPILVSIEEELSKTKSKEF----AGKPV 244 Query: 213 TLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWIL-ATNL 271 + K+L+ + + K K R+ + + E + + TN+ Sbjct: 245 SECIKQLSGKD----IDAVVKIEFKRREYKGKQKQDEMIVRLVAVYNDEDEKYHIYITNI 300 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLT 331 +I K + N+Y R IE F++LKS L T + + + ++ A++ + Sbjct: 301 QKDILNAKDIANLYGARWDIELLFKELKSK---YSLDVLETKNVQVIEALIWTAILTLIV 357 Query: 332 C 332 Sbjct: 358 S 358 >UniRef50_A3IP38 Putative uncharacterized protein n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IP38_9CHRO Length = 101 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 18/110 (16%), Positives = 37/110 (33%), Gaps = 17/110 (15%) Query: 74 RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQ 133 + +W GN + IV +D + + +L S+ + R + LY + + Sbjct: 3 IPIIKQWLNQSFDPGNVLHIV-IDRTQW---GLINILMVSLIIDNRGIPLYFELLDHTGN 58 Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRV 183 + L+ + +L +++ D G G W R Sbjct: 59 SNFDTQKSILSRVLPLLKE-YKTVVLGDRG------------GHGWRIRF 95 >UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacterium Mst37 RepID=Q8VV93_9GAMM Length = 423 Score = 50.1 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 55/145 (37%), Gaps = 19/145 (13%) Query: 157 LIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 L+++D G FK+ + +++ G ++ VR K + + + + K Sbjct: 187 LLLADRGYFKLSYLDEIDQAGGAYV--VRAKTTVNPM-------VVAGFNKAGKPLKRFQ 237 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 + + + +G+ N R + W ATNL E Sbjct: 238 KIKQKAVKKHIRRSGIVDMDVEGKTNYRL-------IASWPEGKDEPTYW--ATNLDREQ 288 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKS 300 + ++++ +Y R QIE F++ KS Sbjct: 289 FSAEKVMKLYQLRWQIELLFKEWKS 313 >UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula baltica RepID=Q7UPU9_RHOBA Length = 656 Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 54/253 (21%), Positives = 84/253 (33%), Gaps = 50/253 (19%) Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTP---LIVSDAGFKVPWY-KSVEKLG 176 +T K P S++AH + +L + P L DAGF + KS+ G Sbjct: 274 LTWCWKLGP--SNASERAH------VQEMLENGEFPEKTLFTGDAGFVGYEFWKSIIDGG 325 Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 ++L RV V +LGY P ++ + Sbjct: 326 HHFLVRVGANVNLLH---------------------SLGY----DVEPDEDNLVYCWPKD 360 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFR 296 K R+ R + K+ +L + L + T KQ + IY R IE FR Sbjct: 361 KRREGMRPLKLRM-----IQIQLGRKKAVLLTSVLDEKKLTDKQALVIYKSRWGIELEFR 415 Query: 297 DLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 +LK G R R S R + L +++ L L + + + R Sbjct: 416 NLKQT---YGRRQLRCRQSVRALVELHWSILSILIVKLYALKVHLAKKRRRCDPVAMPGR 472 Query: 357 -----NVLSTVRL 364 V S R+ Sbjct: 473 ISFAGVVRSFQRI 485 >UniRef50_C4YUK3 Transcription-repair-coupling factor n=29 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YUK3_9RICK Length = 287 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 57/178 (32%), Gaps = 35/178 (19%) Query: 146 LASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLH 204 + N + +D F W + ++ RVR QY I L Sbjct: 4 FLEVFDKNRIEALTADREFIGKEWLSWLRTNQIRYVFRVRENRQYISNARGKMVKIQELF 63 Query: 205 -DMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKE 263 ++ +L +R+ I + G +N++S H Sbjct: 64 RPLAIGSHVSLSQRRIGTKGEIFNVV--------GIRNKKSELAVLIHS----------- 104 Query: 264 PWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIM 321 EI+ P ++ Y++R QIE F+ KS G + ++ R D + Sbjct: 105 ---------DEIKNPAEI---YAQRWQIETMFKAFKSA--GFNCEATHITNDLRLDTL 148 >UniRef50_C1D0Y0 Putative transposase n=1 Tax=Deinococcus deserti VCD115 RepID=C1D0Y0_DEIDV Length = 333 Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats. Identities = 50/341 (14%), Positives = 105/341 (30%), Gaps = 65/341 (19%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRL 64 D L +L P L +RL LT A++ +++ L L ++ +R+ R Sbjct: 10 DSLQTALRCAFP-LDGRRLEVLTALILAMVQARSVVLYTLKTHVHLPGSFDTRYQRLRRF 68 Query: 65 LGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKR-LMVLRASVALHGRSVTL 123 + E + + + +++D ++ + K+ + +L S S+ L Sbjct: 69 V-----RFEFPDHFFVRFALFSLPDGELNLILDRTNWKLGKQDVNILLLSAVWDSFSLPL 123 Query: 124 YEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSR 182 P S + L +++D F W+ +++ G R Sbjct: 124 VWALLPHGGSSSHQERFAHLLRFVRCCSERHIGSLLADREFIGKSWFTFLDQHGIAPCIR 183 Query: 183 VRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQ 242 + +++ T+G +L S +S ++ K Sbjct: 184 L----------------------PATA---TIGTGKLPVSYGVSSRLCATK--------- 209 Query: 243 RSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPA 302 +A E LA + Y++R Q E LK+ Sbjct: 210 ----------------NTADEVLYLAYR-----GYASVNLRRYAQRWQAENLHSALKTR- 247 Query: 303 YGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQG 343 G L + + +ER +L + ++ + Sbjct: 248 -GFNLEDTGLTQAERVSTVLTCVSAAFIWAYVTCQVLAAKQ 287 >UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuringiensis serovar pakistani str. T13001 RepID=C3EBZ9_BACTU Length = 221 Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 67/175 (38%), Gaps = 15/175 (8%) Query: 164 FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENW-KPISNLHDMSSSHSKTL--GYKRLT 220 F +P + + + G Y+LSR+ Q + + S KT+ + Sbjct: 46 FYLPDFHEINQKGAYYLSRLPINTQVYRKKGILYERLYLEDFIKKVSEGKTIEWFDVYIR 105 Query: 221 KSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQ 280 K + + ++++YK G + + T + IL TN+P +I ++ Sbjct: 106 KQHKVPTRLIIYKLTGAGYDGKNNVSTATKYKRQVS---------ILMTNIPSDILQKEE 156 Query: 281 LVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 + +Y+ R QIE F+ KS G+ + ERF L L L + Sbjct: 157 IYPLYTVRGQIEILFKTWKSL---CGIHLCKHVKLERFQCHLYGQLTAILLHSML 208 >UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium RepID=C6AUF2_RHILS Length = 372 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 70/189 (37%), Gaps = 13/189 (6%) Query: 157 LIVSDAGFKVPW-YKSVEKLGWYWLSRV-RGKVQYADLGAENWKPISNLHDMSSSHSK-- 212 ++++D + P + V G ++ R ++ E + + L + Sbjct: 170 IVLADRYYARPRDLRPVIDAGADFIVRTGWNSLRLLQTNGEPFDLFAALAAQQEQEGEVQ 229 Query: 213 ---TLGYKRLTKSNPISCQILLYKS---RSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI 266 G P+ ++++ + +++ + + P S A + + Sbjct: 230 VRVHEGMTGTPPPPPLVLRLIVRRKDPQQAQAEQERLLKDARKRGKKPDPRSLEAAKYIL 289 Query: 267 LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIM--LLI 324 L T+LP P ++ +Y R QIE F+ KS A GL ++ R + L++ Sbjct: 290 LLTSLPTATFPPADILTLYRFRWQIELAFKRFKSLA-GLDSLPAKKPELARAWLYARLIV 348 Query: 325 ALMLQLTCW 333 A++ + Sbjct: 349 AIIAEQIAG 357 >UniRef50_Q8A4P1 Transposase n=5 Tax=Bacteroides RepID=Q8A4P1_BACTN Length = 440 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 42/263 (15%), Positives = 92/263 (34%), Gaps = 25/263 (9%) Query: 86 CSGNTMPIVLVDWS-----DIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHD 140 C P+++VD S K+ A+++ R+ + P+ E+ S + + Sbjct: 87 CGTFLHPVLVVDASSHIPIGFSSVKQWNRSPAALSREERN----YRYQPIEEKESYRWIE 142 Query: 141 QFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKP 199 +A + P + I+ D + + + + L R + + Sbjct: 143 SGMAASEQM-PRDAVKTIIGDREADIFELFSRIPTDNVHLLIRSVHERNCRLDDPDCSVH 201 Query: 200 ISNLHDMSSSHSKTLGYK--RLTKSNPISC-QILLYKSRSKGRKNQRSTRTH-----CHH 251 ++ L + + ++ + ++C ++ + N + + C H Sbjct: 202 LNTLMEQAVLRAEYSFEVLPGSGRKKRVACMELRFERVTLCAPVNGPAKGSPPVSLYCIH 261 Query: 252 PSPKIYSASAKEP---W-ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGL 307 K S E W +L T++ + + + Y R IEE FR LK G + Sbjct: 262 VKEKSSSTPVNESPIEWRLLTTHVVETVEQAIECIGWYRCRWLIEELFRVLK--RKGFMI 319 Query: 308 RHSRTSSSERFDIMLLIALMLQL 330 ++ + ++LI+L L Sbjct: 320 EDAQLETVSALQKLILISLQAAL 342 >UniRef50_B2AKB8 Transposase, IS4 family n=40 Tax=cellular organisms RepID=B2AKB8_CUPTR Length = 442 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 54/286 (18%), Positives = 92/286 (32%), Gaps = 26/286 (9%) Query: 80 WHASFICSGNTMPIVLVD-WSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKA 138 H ++ + + P+ ++D W RE K R + R + YE+ Sbjct: 119 LHPTYAVTPDREPLGVIDAWMWAREPKDADGNRGGIKESVRWIEGYERVAEQ-------- 170 Query: 139 HDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGW--YWLSRVRGKVQYADLGAEN 196 A++LP + G ++LG WL R + A+ G + Sbjct: 171 --------AALLPQTRLVYMTDREGDIAELMARAQELGQPADWLIRSQHNRNLAE-GGKL 221 Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 W + + + L + K+ + ++ + + G T C Sbjct: 222 WDSVDA-SPVLGEITFILPGRAGQKAREVKQELRAQRMKLPGLVGAEFT---CVAAREIE 277 Query: 257 YSASAKEP-WILATNLPVEIRTP-KQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 A K W L TN + +LV Y R +IE F LK+ L+ S Sbjct: 278 APAGVKPVVWRLVTNREAQDADAVNKLVEWYRARWEIEMFFHVLKTGCKVEALQLSHMDR 337 Query: 315 SERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 ER + ++ G F A+ +R VLS Sbjct: 338 VERALALYMVVAWRIARLMRLGRTCPDLDASLFFDADEIRGAYVLS 383 >UniRef50_UPI00016C424B Transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C424B Length = 472 Score = 48.6 bits (114), Expect = 5e-04, Method: Composition-based stats. Identities = 55/311 (17%), Positives = 101/311 (32%), Gaps = 45/311 (14%) Query: 112 ASVALHGRSVTLYEKAFPLSEQCSKKA---------HDQFL---ADLASI--LPSNTTPL 157 +V GR + L + S+KA H + L +I P+ + Sbjct: 133 LAVTASGRILGLLHQILFTPRNASRKAPKSERRHDPHKASVLWRDALEAIGPAPAGKRWV 192 Query: 158 IVSDAGFKVPWYKSVE-KLGWYWLSRVRGKVQYADL-GAENWKPIS-----NLHDMSSSH 210 V+D G V ++ + ++ RV L A W Sbjct: 193 HVADRGADVTEFRDYAHENRMEYVVRVNHNRNVTVLDEAGEWTVAKLHDTLQCQPALGRR 252 Query: 211 SKTLG--YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP---- 264 ++ +G R + ++ + L R+ + +++ Sbjct: 253 TQEVGTQKGRTGGTATVAVRALTLSLIPPRPPRGRARGVPLLVTAIRVWEVDPPAGEKPL 312 Query: 265 -WILATNLPVEIRTPKQL-VNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIML 322 W+L TN+P + Y+KR ++EE + LK+ GLGL + ++ L Sbjct: 313 EWLLVTNVPGADVASAWARADWYAKRWRVEEYHKSLKT---GLGLEELQLTTKVGLQNAL 369 Query: 323 LIALMLQLTCWLAGVHAQ-----KQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTIT 377 + ++ + + A+ Q D Q + V VLS R H T Sbjct: 370 SLLSVVAVGLVMLRELARDPVTAAQPIDGWVQRSWVE---VLSQWR-----HDHGEQLAT 421 Query: 378 REDSLVAATLL 388 +D + A L Sbjct: 422 VKDWVWALARL 432 >UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AZS8_HERA2 Length = 442 Score = 48.6 bits (114), Expect = 5e-04, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 71/186 (38%), Gaps = 10/186 (5%) Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAE 195 +A DQ L+ + LP+ + L ++D GF ++ + YWLSRV+ + G + Sbjct: 171 RASDQVLSVQRAPLPAGS--LRLADLGFYNIRIFRELAAAEVYWLSRVQSHSRIRLPGQK 228 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK--GRKNQRSTRTHCHHPS 253 + I + G + ++ ++L+ + ++ QR Sbjct: 229 E-QSILEVVTGLGDADHWEGTVLVGSKERLAARLLVQRVPDAVAAQRRQRVQDEAHDKCR 287 Query: 254 PKIYSASAKEPWILA-TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 P +A W + TN P + + + + R QIE F+ KS + + RT Sbjct: 288 PVSNAAMDLAAWTVVITNAPEDKLGLTEAMVLLKMRWQIELLFKLWKSHGH---VDEWRT 344 Query: 313 SSSERF 318 R Sbjct: 345 KKPARI 350 >UniRef50_B4ABV5 Transposase n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL317 RepID=B4ABV5_SALNE Length = 63 Score = 48.2 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 17/36 (47%), Positives = 27/36 (75%), Gaps = 2/36 (5%) Query: 260 SAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETF 295 SAKEPW++ +N+ TP+ ++ +YS+RMQIE+ F Sbjct: 8 SAKEPWLIFSNINDI--TPRSIMKLYSRRMQIEQNF 41 >UniRef50_P11901 Transposase for insertion sequence element IS421 n=41 Tax=cellular organisms RepID=T421_ECOLX Length = 371 Score = 48.2 bits (113), Expect = 6e-04, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 3/67 (4%) Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 +L T+LP + + +Q+ + Y R QIE F+ LKS L L R E + Sbjct: 287 LLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELAKAWIFAN 343 Query: 326 LMLQLTC 332 L+ Sbjct: 344 LLAAFLI 350 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 70/193 (36%), Gaps = 22/193 (11%) Query: 135 SKKAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLG 193 + H F + + ++D G++ ++ V G +L RVR ++ Sbjct: 130 KRDEHGAFCQLVDRY--DGQKAIFIADRGYESYNGFEHVVHSGHKYLIRVR-DIESQSSI 186 Query: 194 AENWKPISNL-HDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG------RKNQRSTR 246 ++ P + D+ S TL ++ K+ P +YK K K Sbjct: 187 TKSLGPFPDGEFDVDVSRMLTLKQTKMIKACPD-----VYKFVPKNMRFDFMNKQNPWYE 241 Query: 247 THCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLG 306 +C KI + + + TNL + + + IY+ R E +FR+LK Y +G Sbjct: 242 FNCRVVRLKITENTYET---VITNLSRNEFSMEDICEIYNMRWGEETSFRELK---YAIG 295 Query: 307 LRHSRTSSSERFD 319 L E Sbjct: 296 LNALHAKKRELIQ 308 >UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F976_DESAA Length = 371 Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 64/162 (39%), Gaps = 8/162 (4%) Query: 157 LIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 + ++D G+ V K G + R+ + + D+ + + I NL ++ + Sbjct: 172 VFLADRGYYHRTGMLHVVKGGGDLIVRMIHQYKLYDINGQEFGLIKNLRSLTVNQIGDWD 231 Query: 216 YKRLTKSNPISCQIL-LYKSRSKGRKNQRSTRTHCHHPSPKIYSAS--AKEPWILATNLP 272 K IS ++ + KS+ K +R+ K + A E + T L Sbjct: 232 AFIHHKKEVISGRVCAIKKSKEAAEKAKRAILRENSKKGHKTKPETLVAAEYVFVFTTLS 291 Query: 273 VEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 E Q++ Y R Q+E F+ LKS +GL H + + Sbjct: 292 RE-WKASQVLEAYRGRWQVELAFKRLKSL---IGLGHLKKTD 329 >UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1E5_CLOB8 Length = 460 Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats. Identities = 42/188 (22%), Positives = 77/188 (40%), Gaps = 14/188 (7%) Query: 153 NTTPLIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQYADL--GAENWKPISNLHDMSSS 209 NT +++ D G F +K +EK ++LS+++ N++ + + + S Sbjct: 192 NTNEILLVDLGYFDKKCFKMLEKKSAFFLSKIKYNTALYKENYKKGNFEKVEMIDFLKKS 251 Query: 210 HSKTLGYKRLT-KSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSP-KIYSASAKE--PW 265 Y + K N ++ K + N R R + + KE W Sbjct: 252 SGVIDTYLYVGMKQNNREEFRVIGKRLPEEIVNLRIRRAREKAKAQGRAPKKIDKELMSW 311 Query: 266 ILA-TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIML-- 322 ++ TN+ E L++IY R QIE F+ KS YG + H +++ + + +L Sbjct: 312 VIMITNIEKEQADVDMLLDIYRLRWQIELLFKCWKS--YG-KIDHVKSAGIDYLNCLLYG 368 Query: 323 -LIALMLQ 329 LI +L Sbjct: 369 RLIITLLI 376 >UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4SUB1_AERS4 Length = 420 Score = 47.4 bits (111), Expect = 9e-04, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 64/201 (31%), Gaps = 28/201 (13%) Query: 111 RASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADL-------ASILPSNTTPL---IVS 160 R + GR T+ A S L LP +++ Sbjct: 132 RLATVFPGRFKTISPAAIECHMTMSLLEQKPLCMQLSADTASERQFLPDAKKLTGSLLLA 191 Query: 161 DAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRL 219 DAG+ ++ V K G ++L VRG+ W+ + L L Sbjct: 192 DAGYIDRAYFAEVNKAGCFYL--VRGRKGLNPKILRAWRDD-------GRAVEKLTGMSL 242 Query: 220 TKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPK 279 + C+ + + +S + + W TNL E + Sbjct: 243 KEEGRRHCRAEVLDM------DVKSGKYEYRLIRRWFAEETRFCVW--MTNLARETWPAE 294 Query: 280 QLVNIYSKRMQIEETFRDLKS 300 +++ +Y R Q+E F++ KS Sbjct: 295 RVMRLYRCRWQVELLFKERKS 315 >UniRef50_A5UVL8 Putative uncharacterized protein n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UVL8_ROSS1 Length = 185 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 49/130 (37%), Gaps = 2/130 (1%) Query: 109 VLRASVALHGRSVTLYEKAFPLSEQC-SKKAHDQFLADLASILPSNTTPLIVSDAGFKVP 167 +L V G ++ + P ++ + A + L L +P++ T L+++D G Sbjct: 1 MLALCVVDRGCAIPVAWTILPAGQKRAWRCAWFRMLRLLRPAVPASWTVLVLADCGVDAR 60 Query: 168 W-YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPIS 226 W ++ + +LGW+ R+ + G S L + + G + + Sbjct: 61 WRFRRMARLGWHPFLRINQGGTFRLAGQARCVWWSTLVGAAGRRWRGRGTAFASSDCRLD 120 Query: 227 CQILLYKSRS 236 + + S Sbjct: 121 GTLAAWWSDG 130 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 67/178 (37%), Gaps = 22/178 (12%) Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAE 195 + H + + + ++++D G++ + GW +L R++ V + + Sbjct: 170 QEHRACIQMIERVTLD--KVILIADRGYENYNIMSHAIEKGWKFLIRIK-DVHSNGIASG 226 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK------NQRSTRTHC 249 P + + DM + LT++ S + YK + + Sbjct: 227 LELPQTAVFDMDIN-------LILTRNQTKSKKQAGYKFMPTVQTFDYLPIGSKEDYPIS 279 Query: 250 HHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGL 307 + + + E + TNL + ++L +Y R IE +FR+LK Y +GL Sbjct: 280 FRIARFKIADDSYET--VITNLDRFCFSAEKLKELYHLRWGIETSFRELK---YAIGL 332 >UniRef50_C3M9W9 Modified transposase for insertion sequence NGRIS-18c n=7 Tax=Rhizobiales RepID=C3M9W9_RHISN Length = 445 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 64/334 (19%), Positives = 108/334 (32%), Gaps = 53/334 (15%) Query: 32 ALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTM 91 ALL L T R L + ++ R R LGN + E + + Sbjct: 11 ALLQGMVLRETVCLRRLAAGSHSQEI--RFGRFLGNDAVTVEWIIAGWGEPTGAAVAGRH 68 Query: 92 PIVLVDWSDIR----EQKRL------------MVLRASVAL-----------HGRSVTLY 124 + L D S+IR R ++L +AL GR T Sbjct: 69 VLALQDTSEIRFQTTPDNRRDLGKIKKGNCWGLLLHPMLALDAETGSCLGLVGGRVWTRG 128 Query: 125 EKAFPLSEQCSKKAHD-----QFLADLASILPSNTTPLIVSDAG--FKVPWYKSVEKLGW 177 +A P A + + ++L S T ++D F V W + E+ + Sbjct: 129 TEALPPHASRPLSAKESRRWVETAEAAKAVLASATRVTAITDREGDFFVMWARLPEEC-F 187 Query: 178 YWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK 237 + LSRV + G +++ S+T+ + + L + Sbjct: 188 HLLSRVMHDHALSGGGTLR----RAAAEVAFCDSRTVELRERADRPARQADLSLRFGEAT 243 Query: 238 GRKNQRSTRTHCHHPS---------PKIYSASAKEPWILATNLPVEIRT-PKQLVNIYSK 287 R+ Q P + W++ T V Q+V Y + Sbjct: 244 IRRPQNLEAAALPDGVTLRWVEVVEPSPPAGVEPLSWLILTTHAVATFADAWQIVAWYKQ 303 Query: 288 RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIM 321 R IE+ FR +K GL + S+ S+ R + + Sbjct: 304 RWVIEQFFRVMKQQ--GLKVEDSQLQSAARLEKL 335 >UniRef50_A8YLR7 Genome sequencing data, contig C326 n=5 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YLR7_MICAE Length = 132 Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 47/124 (37%), Gaps = 7/124 (5%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA-RTKHNIKRIDRLL 65 ++ L Q + + L L+ ALL ++L + ++A + +R +R Sbjct: 11 VYSYLEQGSRFVDKRHLTVLSWMVTALLSSQSLNQARWEPFVQSRAEQANSYQRRWNRFC 70 Query: 66 GNRHLHKE---RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVT 122 N + E + + ++ G + + +D + + + + +V GR+V Sbjct: 71 QNGRVAVEKIYIPLILKAIETWKEKGERLYL-AIDTTLLW--NQYCFVYLAVVCGGRAVP 127 Query: 123 LYEK 126 L Sbjct: 128 LMWM 131 >UniRef50_A8YMK7 Genome sequencing data, contig C327 n=1 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YMK7_MICAE Length = 438 Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 61/204 (29%), Gaps = 29/204 (14%) Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 W +R+ K + + + W H + ++L K K + Sbjct: 239 WDNRIENKSKKSFILDGKW---KLAHLLKEFKPQSLSVKIYGKFTQVEA----------- 284 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 + + AKEP IL + T Q++ IY R IE RDL Sbjct: 285 -VEREVYTRGFQPKVKVVVMKGAKEPIILMS--TDITLTAIQIIEIYGSRFSIELAIRDL 341 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNV 358 K GL + D + +A + L Q + ++ + + ++ Sbjct: 342 KQH---FGLGDYQCYLGIAIDRFVQLACVAYCLFRLF----QIKEIEQSWMPKVSPSCSL 394 Query: 359 LSTVRLGMEVLRHSGYTITREDSL 382 S RL R L Sbjct: 395 FSFSRL-----RRGLQHFAITQVL 413 >UniRef50_C6LGD4 Transposase, IS4 family protein n=3 Tax=Lachnospiraceae RepID=C6LGD4_9FIRM Length = 422 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 42/342 (12%), Positives = 103/342 (30%), Gaps = 37/342 (10%) Query: 40 TLTELGRNLPTKA---RTKHNIKRIDRLLGN-RHLHKERLAV-----YRWHASFICSGNT 90 T+ L R+L K +K++ + + +H + V Y++ A + Sbjct: 70 TVDRLSRHLAKGTPKDALKAYLKQVKKWCPDQPVIHIDDSDVVKPDGYQFEAPGWVRDGS 129 Query: 91 MPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL 150 ++ ++ + + V+++ + E+ +D + + Sbjct: 130 EST---KTKNVYKKGYHVTEATVLTTSNHPVSIFSEIHSSVEKDFTSINDVTFSAMERAK 186 Query: 151 PSNTTPLIVSDAGFK-VPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSS 209 V D G+ + ++ + ++ R+ K + L W + L + Sbjct: 187 ALFGKATFVMDRGYDDNKMFLKLDSMKQDYVIRLTAKRRL--LYHNKWTLATELRNRRKG 244 Query: 210 HSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILAT 269 K + + K + + + S+ HP + + K Sbjct: 245 KVKLPLFYKGKKHEAYLSHVKVQITASRKDMYPVLVYGITEHPMMLAANKAIKSK----- 299 Query: 270 NLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIM-----LLI 324 ++ +Y R +IEE FR K + R + + + L + Sbjct: 300 ------EDVIKVAKLYFSRWKIEEYFRCKKQM---FQFENFRVRRLKAINALNFYTTLCM 350 Query: 325 ALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGM 366 A + ++ + AN ++ + RL Sbjct: 351 AFLAHISMKAETNALKTAIIHT---ANPIKEKTAFCYYRLAK 389 >UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC 25996 RepID=UPI0001BC4BB6 Length = 403 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 79/229 (34%), Gaps = 26/229 (11%) Query: 114 VALHGRSVTLYEKAFPLSEQCSKKAH---DQFLADLAS--ILPSNTTPLIVSDAGF-KVP 167 V+ +GR L + + D+ I+ V D G+ Sbjct: 140 VSSNGRISGLKVHVLMNHANGCPTVQSITEASVNDIDQRHIVQPEKGATYVFDKGYCDYN 199 Query: 168 WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISC 227 W+ +++ G Y+++R++ + S + + + K+ PI Sbjct: 200 WWAELDRAGAYFVTRLKANAAVEVIEQ---------FSPSETQNAHENSRNDNKNTPILT 250 Query: 228 QILL-YKSRS-KGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIY 285 + +K +S R N +T E +L +N + +++ Y Sbjct: 251 DEYIRFKHKSNSTRPNHYHNKTLRRITVE----REGTEALVLVSN--NLTASAQEIAENY 304 Query: 286 SKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWL 334 +R QIE F+ LK L L+ S+ + LL A+M L L Sbjct: 305 KRRWQIELLFKWLKQH---LKLKRFLGRSANAVKLQLLCAMMAYLLLKL 350 >UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae RepID=B0R9A9_HALS3 Length = 424 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 10/196 (5%) Query: 125 EKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVR 184 ++ + +K HD L S L ++ A FK + +++ Y++SR++ Sbjct: 147 DETIERIDVTDEKTHDSTLFKTGSWLQE--RLVLFDRAYFKYRRFALIDENDGYFVSRLK 204 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 L E + +++ + + +G+ Sbjct: 205 ENA--NPLITEELREWRGRAIPLEGKQIHDVVDDISRKY---IDVEVEAEFKRGQYEGTR 259 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 + + + A + + TNLP + P+ L +Y R ++E FR+LK+ Sbjct: 260 SLDTKRFRVVGVRDSDADDYHLYITNLPRDEFFPEDLATLYRCRWEVETLFRELKTQ--- 316 Query: 305 LGLRHSRTSSSERFDI 320 L TS + I Sbjct: 317 YELDEFNTSDPDVVKI 332 >UniRef50_C1P7N3 Transposase IS4 family protein n=5 Tax=Bacillus coagulans 36D1 RepID=C1P7N3_BACCO Length = 437 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 79/218 (36%), Gaps = 29/218 (13%) Query: 140 DQFLADLASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAENWK 198 + + P T + D+GF VP Y E+ ++ R++ Q L A+ + Sbjct: 206 RPLIKHYNEMFPET-TLFLRGDSGFAVPGLYDLCEEESVLYIIRLKSNSQLQSL-AKEYH 263 Query: 199 PISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYS 258 P S + S ++T + + ++ S K +R ++ Sbjct: 264 PSSA--PLDVSKTETYYEETIYQAKSWS-------------KPRRVIIQSVRPAGELFFT 308 Query: 259 ASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 S TN E+ P+ +V Y KR +E ++ K+ Y H + + Sbjct: 309 HS-----FFVTN--FELAFPQDIVRAYQKRGTMENYIKEAKNGFY---FDHMNSHAFLVN 358 Query: 319 DIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 ++ +++ L+ +G K Q +T+R R Sbjct: 359 EVKMMLTLLAYNLTNWLRTLCFPEG-QKTMQIDTIRTR 395 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 48/250 (19%), Positives = 78/250 (31%), Gaps = 47/250 (18%) Query: 123 LYEKAF--PLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYW 179 ++E P+ E KA Q + +S P+ + ++D G++ + +E+ G + Sbjct: 164 IFEDVVFQPICECNEHKALAQMVDRRSSAFPA----IFMADRGYESYNTFAHIEQKGDKY 219 Query: 180 LSRVRGK-----VQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKS 234 + R R E + L+ K K+ Sbjct: 220 VVRGRESGTGICSGLNLPDTEEYDIEKELYICKKHSKK-------------------VKT 260 Query: 235 RSKGRKNQRSTRTHCHHPSP----------KIYSASAKEPWILATNLPVEIRTPKQLVNI 284 + K RS T S +L TNL E + L + Sbjct: 261 NPRKYKRIRSDATFDFFTDDCEEYRLNLRIVKIKLSETTTEVLFTNLSKEKFSADDLKRL 320 Query: 285 YSKRMQIEETFRDLKSPAYGLGLRHSRTSSSER-FDIMLLIALMLQLTCWLAGVHAQKQG 343 Y R IE F LK Y LG + +SE + +M + G A KQ Sbjct: 321 YHMRWGIETAFDQLK---YALGAASVHSKNSELIIQELYGKLIMFNFCKTIVGGIAVKQQ 377 Query: 344 --WDKHFQAN 351 W ++ N Sbjct: 378 EYWKYEYKLN 387 >UniRef50_UPI0001C171A4 Putative transposase n=1 Tax=Raphidiopsis brookii D9 RepID=UPI0001C171A4 Length = 90 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 30/84 (35%), Gaps = 11/84 (13%) Query: 13 QFCPELHLKRLNS---------LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDR 63 + P + K L S L + + L K + L L +P + + KRI R Sbjct: 2 KMLPLFYQKHLKSQLSLAEYLFLKILVNILQSIKNVNLERLANGVPLPIKFESRRKRIQR 61 Query: 64 LLGNRHLHKERLAVYRWHASFICS 87 L +L E+ ++ S Sbjct: 62 FLSLPNLTIEK--IWFPIIQEWLS 83 >UniRef50_P12249 Transposase for insertion sequence element IS231A n=411 Tax=Bacillus RepID=T231A_BACTB Length = 478 Score = 45.9 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 66/181 (36%), Gaps = 22/181 (12%) Query: 171 SVEKLGWYWLSRVR-GKVQYADLGAENWKPISNLHDM---------------SSSHSKTL 214 +++ G Y++SR++ Y + + + + + Sbjct: 209 QMDQRGAYYISRLKLNHTVYIKNPSPEYFRNGTVKKQSQYIQVDLEHIMNHLKPGQTYEI 268 Query: 215 GYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWIL---ATNL 271 + K+ + ++++Y+ K + +R + + +S +K + +N Sbjct: 269 KEAYIGKNQKLFTRVIIYRLTEKQIQERRKKQAYTESKKGITFSEKSKRLTGINIYVSNT 328 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLT 331 P I +Q+ + YS R QIE F+ KS + H + ER + + L+ Sbjct: 329 PEGIVPMEQIHDFYSLRWQIEIIFKTWKSL---FQIHHWQNIKQERLECHVYGRLIAIFI 385 Query: 332 C 332 C Sbjct: 386 C 386 >UniRef50_D1RLN9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RLN9_LEGLO Length = 58 Score = 45.9 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 14/40 (35%), Positives = 21/40 (52%) Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVS 160 +T+YE+ + H FL L SILP N P++V+ Sbjct: 1 MTIYEEVHLQKHYANNTFHTHFLRKLRSILPENCRPIVVT 40 >UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017465B5 Length = 382 Score = 45.9 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 75/218 (34%), Gaps = 22/218 (10%) Query: 124 YEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSR 182 + + P+ + ++ F + P + ++D GF + V + G + + R Sbjct: 144 FFRLNPVRGSGNGESLKHF-----EVAPGDC---FLADRGFSHLLGIEHVYRGGAHVIMR 195 Query: 183 VRGKVQYADLGAEN----WKPISNLHDMSSSHSKTLGYK-----RLTKSNPISCQILLYK 233 + + + + L ++ L + L K P+ + Sbjct: 196 LNEQNTPLEDEQGRPVVLLPWLRKLKQPGAAAGLDLWVRPRKEDSLEKRVPVRLCAVRKS 255 Query: 234 SRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILA-TNLPVEIRTPKQLVNIYSKRMQIE 292 + ++ R + + WI+ T +P + + +++ Y R QIE Sbjct: 256 VEAAALAQRKVQRRAQQDQTKLRAATLEHTAWIVVLTTVPRDTLSDVEVLQWYRVRWQIE 315 Query: 293 ETFRDLKSPAYGLGLRHSRTSSSERFDIM--LLIALML 328 F+ LKS L S S R + LLIAL+ Sbjct: 316 LAFKRLKSLGDVGHLPKSDERS-SRAWVYAKLLIALLS 352 >UniRef50_C4YUW4 Transposase subunit n=4 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YUW4_9RICK Length = 96 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 38/94 (40%), Gaps = 10/94 (10%) Query: 278 PKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGV 337 K L+ +Y++R QIE F+ KS G + ++ R D ++ I + + G Sbjct: 3 SKILLRLYAQRWQIETMFKAFKSA--GFNCEATHITNDLRLDTLMQILSIAFCLAYQTGE 60 Query: 338 HAQKQGWDKHFQANTVRNRNVL--STVRLGMEVL 369 + ++ S R+G++++ Sbjct: 61 IIVLD------KPIVIKKHGYRQNSIFRVGLDII 88 >UniRef50_B0JP83 Transposase n=112 Tax=Cyanobacteria RepID=B0JP83_MICAN Length = 422 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 46/270 (17%), Positives = 92/270 (34%), Gaps = 31/270 (11%) Query: 117 HGRSVTLYEKAF-PLSEQCSKKAHDQFLAD-------LASILPSNTTP-LIVSDAGFKVP 167 +SV L + + P S K +F + L P +++ DAG+ Sbjct: 142 GKKSVPLDREIYQPASSLAEGKEDKEFKKKPEIAIDLIDRSLTRGYRPKIVLIDAGYGNN 201 Query: 168 --WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSS-HSKTLGYKRLTKSNP 224 + K++E+ +L + + + L ++ S K L Sbjct: 202 TNFLKALEERKLKYLGGLAKNRKVIIEKEGGVEETIQLEQLAKSLSEKDWEKITLNLDKE 261 Query: 225 ISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIR-TPKQLVN 283 + + +++++ + +R+ + S + A E TN+ T +V Sbjct: 262 KTVWVAVFRAKISQLEGERNLAIVMNASSMEK----ATEVDYFITNVVEADTVTASWIVR 317 Query: 284 IYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI-MLLIALMLQLTCWLAGVHAQKQ 342 Y++R +E +R+ K LGLR + +L+ W H Sbjct: 318 TYTERNWVEVFYREAKGW---LGLREYQVRDKRSLLRHFILVFCAYTFILW----HKLTG 370 Query: 343 GWDKHFQANTVRNRNVLSTVRLGMEVLRHS 372 G + + AN L+T +E R + Sbjct: 371 GLQRQW-AN-----RPLNTFVEALEAFRTA 394 >UniRef50_B7I4G0 Transposase subunit n=16 Tax=Bacteria RepID=B7I4G0_ACIB5 Length = 148 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 47/122 (38%), Gaps = 9/122 (7%) Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQL 330 L V + + Y+ R +IE F LK G L ++R + R LIA++ Sbjct: 21 LVVSPQFNANAIQDYALRWEIETLFSCLK--GRGFNLENTRLTDPRRVKK--LIAVLAIS 76 Query: 331 TCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTI----TREDSLVAAT 386 CW + + Q K R +S R G++ ++ + + +E+ Sbjct: 77 FCWCY-LTGEWQHDQKKVIKIKKHGRLSMSLFRYGLDYVQMAIQRLIGFGKKEEFKEILA 135 Query: 387 LL 388 +L Sbjct: 136 IL 137 >UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepID=Q74P20_BACC1 Length = 460 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 55/155 (35%), Gaps = 13/155 (8%) Query: 170 KSVEKLGWYWLSRVRGKVQ-YADLGAENWKPI---SNLHDMSSSHSKTLGYKRLTKSNPI 225 + + +++SR+R Q Y W + D+S L + Sbjct: 211 EKIADRKAFYVSRIRWNTQVYQKEKGGKWTLLDLEKLTKDLSEGQILELPEIYIGLHQKH 270 Query: 226 SCQILLYK-SRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNI 284 ++++Y+ ++++ K + + +L TN+ + ++ + Sbjct: 271 KTRLVIYRLTQTEWTKRLEHHKKAKKKMPKYASRIN-----LLITNVSSKHLPHNEVYEL 325 Query: 285 YSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD 319 YS R QIE F+ KS + + ERF Sbjct: 326 YSLRWQIEIIFKTWKSI---FKIHEVKPVKLERFQ 357 >UniRef50_D2KXE5 Putative transposase n=1 Tax=Lactobacillus fermentum RepID=D2KXE5_LACFE Length = 373 Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 44/252 (17%), Positives = 85/252 (33%), Gaps = 45/252 (17%) Query: 148 SILPSNTTPLIVSDAGFKVPWY-KSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDM 206 S LP +T+ +I D+GF P + + + G +L R++ + L + Sbjct: 151 SALPQSTSVIIRGDSGFAAPKFYEMCDTHGVDFLVRLKANSKLGKLAET---------AL 201 Query: 207 SSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPW- 265 K C +K ++ R H + ++ PW Sbjct: 202 VDCPPKY---------EESKCVYHEFKYQAASWGKARRVIICSTHTADELV------PWN 246 Query: 266 --ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLL 323 + T+L E P+ L Y +R E ++LK G G + +S+ R L Sbjct: 247 HAFVVTSLASEA--PETLFKTYRQRGNAENQIKELKC---GFGFDKTDSSTFARNTARAL 301 Query: 324 IALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGM-EVLRHSGYTITREDSL 382 I + L + R+ +S++R + + + R Sbjct: 302 ITGIAYNLVQLFKQLFV-----------SEDRRSTISSLRFSLFHIAGRITWHARRVVIH 350 Query: 383 VAATLLTQNLFT 394 +A+ + +N F+ Sbjct: 351 LASNYVNKNWFS 362 >UniRef50_Q1J2M1 Transposase IS4 family protein n=4 Tax=Deinococcus RepID=Q1J2M1_DEIGD Length = 331 Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 37/244 (15%), Positives = 73/244 (29%), Gaps = 41/244 (16%) Query: 95 LVDWSDIR-EQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSN 153 ++D ++ + K L +L VAL + L + P + + L LP+ Sbjct: 63 VLDRTNWKLGAKDLNLLVLGVALGDVVLPLTWQVLPHGGNSDMRGRMLLVGLLLKRLPAR 122 Query: 154 TTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSK 212 ++++D F W+ + R+R D A N + + Sbjct: 123 RWAVLIADREFIGQEWFNFLRDRKIKRCIRIRESTLLDDEPARN-------AFQNLKPGE 175 Query: 213 TLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLP 272 G Q++ + R ++A++L Sbjct: 176 VRGVFERVWVYGSWMQVVATLAPQGERV-------------------------LVASDLS 210 Query: 273 VEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTC 332 + + Y R IE TF +K+ GL L + + R + + + Sbjct: 211 LWDT-----LTTYRLRWAIECTFSAMKTR--GLNLEQTHMTQPNRLSRLFGLLSLALAWM 263 Query: 333 WLAG 336 G Sbjct: 264 VRIG 267 >UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID=Q64B41_9ARCH Length = 439 Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 75/175 (42%), Gaps = 9/175 (5%) Query: 157 LIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 +++ D G FK ++ ++ G Y++SR++G + +++ + L Sbjct: 196 ILLIDLGYFKYLFFDRIDGYGGYFVSRLKGNANPLIVRVNRKCRGNSVDVVGKKLRDVLP 255 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 K + ++ + R + K ++ST S S K L TN+ V+I Sbjct: 256 RL---KREILDVEVEVEFKR-RKYKGKQSTVKRRFRMVCAFNSDSGKYHSYL-TNIRVDI 310 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQL 330 + +++ +Y R +IE F++LKS + +++ ++ IA++ + Sbjct: 311 LSAEEIALLYGARWEIELIFKELKSH---YRMDQIPSANPNIVKCLIWIAILTLM 362 >UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W6S4_DESAS Length = 465 Score = 44.3 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 32/78 (41%), Gaps = 3/78 (3%) Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 L TN + + +L+ Y +R QIE F+DLK L L ER + + Sbjct: 306 ALLTNYDADRVSANKLIKKYRERNQIEVNFKDLKGL---LDLERIFLQLPERIEAYVFPK 362 Query: 326 LMLQLTCWLAGVHAQKQG 343 + +A+++G Sbjct: 363 TLAYFVLAFLRWYAEEKG 380 >UniRef50_C9LIM0 Putative transposase n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LIM0_9BACT Length = 418 Score = 44.3 bits (103), Expect = 0.008, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 57/184 (30%), Gaps = 7/184 (3%) Query: 139 HDQFLADLASILPSNTTPLIVSDAGFKVPWYKS-VEKLGWYWLSRVRGKVQYADLGAENW 197 + + L L S LIV+DA F + S V+ LG+ +SR R V L + Sbjct: 170 YLRMLNRNCKQLLSICK-LIVADAYFSKESFVSGVKSLGFNLISRFRDDVNLKYLYSGP- 227 Query: 198 KPISNLHDMSSSHSKTLGYKRLTK-SNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 K + + + S + + L S Sbjct: 228 KTGKRGRPQKFAGKVDVNNLDMNVFSEDYTAEGKLVYKMYTAVVWAVSLGCEVRVVLVDY 287 Query: 257 YSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE 316 A K + + + +IY R Q+E FRD K GL H + + E Sbjct: 288 EDAEKKRQTRKVFFSTDTAMSARDIFDIYRTRFQLEFVFRDAKQFT---GLTHCQARNKE 344 Query: 317 RFDI 320 Sbjct: 345 ALSF 348 >UniRef50_Q1J3A6 IS1 related protein n=4 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J3A6_DEIGD Length = 219 Score = 44.3 bits (103), Expect = 0.008, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 36/93 (38%), Gaps = 6/93 (6%) Query: 278 PKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGV 337 ++ + Y+ R E + LKS G L + + R +L + + + C L G Sbjct: 110 AEKALKRYALRWTAENMHQALKSR--GFFLESTHLTDPSRVSTLLAVVALAFVWCCLVGE 167 Query: 338 HAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLR 370 Q++ + + S R G++ LR Sbjct: 168 FEQQRDPSRCLRHGYPPK----SLFRRGLDALR 196 >UniRef50_C6PFH6 Transposase IS4 family protein n=2 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PFH6_CLOTS Length = 398 Score = 44.3 bits (103), Expect = 0.009, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 6/68 (8%) Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 + TN ++ T ++N YS+R IE FR K+ LGL + S++ D +L + Sbjct: 289 FICTNTELDTET---ILNYYSQRWPIEIFFRQTKN---NLGLNTYQVRSTKSIDRLLWLI 342 Query: 326 LMLQLTCW 333 + + C Sbjct: 343 SLTYMYCT 350 >UniRef50_D1XVT7 Putative uncharacterized protein n=2 Tax=Bacteroidales RepID=D1XVT7_9BACT Length = 235 Score = 43.9 bits (102), Expect = 0.010, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 54/169 (31%), Gaps = 13/169 (7%) Query: 153 NTTPLIVSDAGFKVPWY-KSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHS 211 + T ++V+DA F + K + +LG+Y +SR R + Sbjct: 10 SITDIVVADAFFSTSTFEKGMSELGFYLVSRFRDNACLHYISKRE---------KRRGRP 60 Query: 212 KTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNL 271 + G K + +SC L+ G+ ++ Sbjct: 61 RVKGDKIDLANLNLSCMEELHIDGLDGKAYTLEVYAKALKKRVRLVIWRMPGGKCKLFFS 120 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI 320 T ++++ Y R QIE +RD K GL + + D Sbjct: 121 TKLSMTGEEVLKTYRTRFQIEFCYRDSKQFT---GLMDCQARHKRQLDF 166 >UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipelotrichaceae RepID=B7C7E2_9FIRM Length = 446 Score = 43.9 bits (102), Expect = 0.010, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 80/225 (35%), Gaps = 16/225 (7%) Query: 152 SNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSH 210 + ++D G+ + ++ + G +L RV+ + + + + D+ Sbjct: 196 KGDKAIFIADRGYESINSFEKIHLSGNKYLVRVK-DIHSTGMLRSFGPFLDDEFDLIVKR 254 Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATN 270 + T K++P + + R ++ C KI + + + TN Sbjct: 255 TLTTKQTNEIKAHPEIYKFVPQNQRFDYFEDAPFYDFECRVVRFKITEDTYE---CIVTN 311 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI-MLLIALMLQ 329 L + + + +Y R +IE ++R+LK Y L L + + + ++ Sbjct: 312 LDKNEFSMQDIKELYHLRWEIETSYRELK---YDLDLNTLHSKKRNLIEQEIYAKMILYN 368 Query: 330 LTCWLA-GVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSG 373 + G+ K+ +Q N V+ + E L+ + Sbjct: 369 FCSRITNGIDIAKRKRKYEYQLNFVQG------FHIIREHLKKAK 407 >UniRef50_C0VKT5 IS4 family transposase ORF 2 n=5 Tax=Acinetobacter RepID=C0VKT5_9GAMM Length = 178 Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 53/140 (37%), Gaps = 8/140 (5%) Query: 231 LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 L++ G+ R R I + ++ +L P+ + + Y+ R + Sbjct: 14 LFRHLQVGQTECRKRRIWVGRVKLYISALRLEDGELLLVVSPMFNASA---IRDYALRWE 70 Query: 291 IEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA 350 IE F LK G L ++R + R ++ + + C+L G + + Sbjct: 71 IETLFSCLK--GRGFNLENTRLTDPGRVKKLIAVLAIGFCWCYLTGEWQHDRKKAIKIKK 128 Query: 351 NTVRNRNVLSTVRLGMEVLR 370 + R +S R G++ ++ Sbjct: 129 H---GRLSVSLFRYGLDYVQ 145 >UniRef50_A7B2R8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A7B2R8_RUMGN Length = 441 Score = 43.6 bits (101), Expect = 0.014, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 81/228 (35%), Gaps = 38/228 (16%) Query: 152 SNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSH 210 L+ D+GF P YK E+ G ++ R++ + + + + + Sbjct: 217 PTIQILLRGDSGFATPDLYKQCEENGTSYVIRLKENGILREKASHLVDELDEITRNNKVD 276 Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATN 270 + + + K+ P Y+ R + + + + + TN Sbjct: 277 YAVVYGEFMYKAGPWP-----YERRVVCKVEKPENQMVYMYT-------------FVVTN 318 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSP-AYGLGLRHSRTSSSERFDIMLLIALMLQ 329 + +P+ L+ Y KR ++E ++ KS + H+R ++ R + AL Sbjct: 319 M---DSSPEYLIKFYCKRGRMENFIKESKSGFDFASVSSHARIVNANRLQVH---ALAYN 372 Query: 330 LTCWLAGVHAQKQGWDKHFQANTVRNR---NVLSTVRLGMEVLRHSGY 374 + W + AN + R L +++ +++R + Y Sbjct: 373 IFNWFRRLA---------LSANMRKQRIDTVRLKLLKIAAKIIRSARY 411 >UniRef50_A5EC94 Putative transposase n=1 Tax=Bradyrhizobium sp. BTAi1 RepID=A5EC94_BRASB Length = 395 Score = 43.6 bits (101), Expect = 0.015, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 3/53 (5%) Query: 265 W-ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE 316 W +L T++ + +++V++Y KR IEE FR LKS G + + E Sbjct: 233 WRLLTTHVVRSSKQARRIVDLYRKRWTIEEFFRTLKSA--GFDIEEADIGEPE 283 >UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001905F7C Length = 367 Score = 43.2 bits (100), Expect = 0.016, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 1/55 (1%) Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI 320 +L T+L + P++L + Y R QIE F+ +KS GL ++ + R I Sbjct: 285 VLLTSLNADDWPPERLASTYRLRWQIELAFKRMKSL-IGLEGLRAKDADLARLWI 338 >UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostridium nexile DSM 1787 RepID=B6FTH4_9CLOT Length = 224 Score = 43.2 bits (100), Expect = 0.017, Method: Composition-based stats. Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 3/41 (7%) Query: 267 LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGL 307 + TNLP E +++ +Y+ R IE +FR+LK Y +GL Sbjct: 89 ILTNLPKEDFPVEEIKKVYAMRWGIETSFRELK---YAIGL 126 >UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FU81_METHJ Length = 452 Score = 43.2 bits (100), Expect = 0.017, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 81/226 (35%), Gaps = 20/226 (8%) Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADL 192 +++ HD + I P L+++D G+ + + +++ G ++ SRV+ + + Sbjct: 177 TTERVHDY---KMLKIGPDVENILLINDLGYYSLKTFSKIQEYGGFFASRVKSNAVFKVV 233 Query: 193 GAENWKP-----ISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT 247 + P + + S + L + C + G K+ +T Sbjct: 234 SINSGPPEITSIVDHNCFKSINGDDFLDRMPKKGVYDLICSFHI------GDKHINKIKT 287 Query: 248 HCHHPSPKIYS-ASAKEPWIL-ATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGL 305 I S E W L TNL E+ + + +Y R IE F++LK Y L Sbjct: 288 PIFQEFRVICSWNPLTEKWHLYITNLGKEVFSADDIYELYRFRWVIELIFKELK-GDYDL 346 Query: 306 GLRHSRTSSSERFDI--MLLIALMLQLTCWLAGVHAQKQGWDKHFQ 349 G I MLL ++ + +K K+ Q Sbjct: 347 GKMLLNNEPMAFIHIYSMLLRFIISRDLFTWIVSTTRKNDKGKYTQ 392 >UniRef50_Q3M187 Putative transposase n=10 Tax=Nostocaceae RepID=Q3M187_ANAVT Length = 116 Score = 43.2 bits (100), Expect = 0.017, Method: Composition-based stats. Identities = 14/75 (18%), Positives = 29/75 (38%), Gaps = 5/75 (6%) Query: 294 TFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV 353 F+D K+ Y L S +S +R ++ + + + WL G + Q + + Sbjct: 1 MFKDCKTGGYNL---ESSQASPDRLVRIIFLIALAMTSAWLQGQKIKLQRQQSYVCRSQE 57 Query: 354 --RNRNVLSTVRLGM 366 + S +G+ Sbjct: 58 QGKTEKRHSNFWIGL 72 >UniRef50_Q7UY96 Similar to transposase n=1 Tax=Rhodopirellula baltica RepID=Q7UY96_RHOBA Length = 403 Score = 43.2 bits (100), Expect = 0.019, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 57/168 (33%), Gaps = 19/168 (11%) Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT------HCHHPSPKIYSASAKEP 264 K L + + + + LY+ S+ ++ T Sbjct: 203 GKALYHGKKVQRQVAEKTVTLYRPHSEVIDGEKKAVTGEPIEVRTVFVRLVDADGWILAE 262 Query: 265 WILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLI 324 W L TN+P + + Y R +IE F+ LKS G L + + S E LL+ Sbjct: 263 WTLLTNVPADQANASDVGRWYYFRWRIESFFKLLKSH--GQELEYWQQESGEAITKRLLM 320 Query: 325 ALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHS 372 A M + + + +T++ R L +RL ++ Sbjct: 321 ASMACVLV---------KQLEASESESTIKFRRHL--IRLSGRRMKRG 357 >UniRef50_B6FLV1 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FLV1_9CLOT Length = 135 Score = 43.2 bits (100), Expect = 0.020, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 19/34 (55%) Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 + TNL E K++ +Y R IE +FR+LK Sbjct: 29 CIITNLEEEDFPMKEIKKLYEWRWGIERSFRELK 62 >UniRef50_B6FVK0 Putative uncharacterized protein (Fragment) n=2 Tax=Clostridium nexile DSM 1787 RepID=B6FVK0_9CLOT Length = 383 Score = 43.2 bits (100), Expect = 0.020, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 87/243 (35%), Gaps = 37/243 (15%) Query: 78 YRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKK 137 +R GNT ++ V+ + K ++ A R TL K L++ + + Sbjct: 91 FRMLTVSWSDGNT--LIPVNSCLLASSKESNIIGPVKAFDKR--TLAGKRRKLAQTKAPE 146 Query: 138 AHDQFLADLASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADLGAEN 196 A L S S L D+ F P +++ G ++ ++ + E+ Sbjct: 147 AMLTLLDTAVSAGLSAEYMLF--DSWFSNPAQITALKSRGMDVIAMIKKSSRIKYTYGED 204 Query: 197 WKPISNLHD---MSSSHSKTL--GYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHH 251 I ++ S+ L + K NPI +I+ ++++ Sbjct: 205 QLNIKEIYSRNKKRRGRSRYLLSVDVMVGKENPIPAKIVCVRNKA--------------- 249 Query: 252 PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSR 311 ++ W LA + ++++ IY KR +IE F+ KS +G HS Sbjct: 250 ---------NRKDW-LAFICTDTSLSEEEIIRIYGKRWKIEVFFKTCKSMLNLIGECHSL 299 Query: 312 TSS 314 + Sbjct: 300 SYD 302 >UniRef50_A7IQF9 Putative uncharacterized protein n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7IQF9_XANP2 Length = 183 Score = 42.8 bits (99), Expect = 0.021, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 48/111 (43%), Gaps = 11/111 (9%) Query: 261 AKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI 320 +KE ++ATN P+ + Y +R +IE F K+ GL L + +S ER Sbjct: 50 SKELLVVATN-----TDPRIALTNYRRRWEIETLFAASKTR--GLNLEDTHITSPERIAK 102 Query: 321 MLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRH 371 ++ + + + G + + + T + R S R+G ++LR Sbjct: 103 LIAVLAVAFIFAHATGEWSAR---HRPIIIKTHK-RKAKSIFRIGFDLLRK 149 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 42.8 bits (99), Expect = 0.023, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 72/184 (39%), Gaps = 11/184 (5%) Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSV---EKLGWYWLSRVRGKVQY-ADL 192 H ++ + + + + D G++ + ++ ++ WY++ R + + L Sbjct: 118 NEHKALVSMVDQSEING-NVIAIMDRGYES--FNNIAHFQEKSWYYIIRAKESYGIISRL 174 Query: 193 GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHP 252 ++ ++ + +T L K+ P + + + K + S H Sbjct: 175 SLPDYPEYDEEIMLTLTRRQTKETLPLLKAYPHRYRWIQPHTTFDFIKPKDSKFYDLHFR 234 Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 + + A + TNL E P++L +Y+ R IE +F++LK Y +GL + Sbjct: 235 AVRFAIADGVYE-TVYTNLNAEDFPPEKLKQLYNLRWGIETSFKELK---YAVGLASLHS 290 Query: 313 SSSE 316 + Sbjct: 291 KKKD 294 >UniRef50_C6N6I3 Transposase n=2 Tax=Gammaproteobacteria RepID=C6N6I3_9GAMM Length = 488 Score = 42.8 bits (99), Expect = 0.024, Method: Composition-based stats. Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 4/109 (3%) Query: 228 QILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLP-VEIRTPKQLVNIYS 286 QI ++ K +K + T H + + K W LATNLP + + ++ Y+ Sbjct: 292 QINVFPPIGKQKKYPSLSLTIIHAEEEQDPTNRDKIVWKLATNLPITSLEQAIEKLDWYA 351 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 R +IE + LKS G S+ +++R ++ I +L + Sbjct: 352 LRWRIETFHKILKS---GCKAEASKLRTAQRISNLIAIFCILSWRIFWM 397 >UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostridiales RepID=C7GFW6_9FIRM Length = 436 Score = 42.8 bits (99), Expect = 0.024, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 70/202 (34%), Gaps = 6/202 (2%) Query: 153 NTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHS 211 +P+ ++D GF + + +L R + + P + + Sbjct: 184 GASPIFIADRGFSSYNVFAHAIENNVDFLIRAK-DLNVQRFLGGGTLPDKLDTTIELILT 242 Query: 212 KTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI--LAT 269 +T K+ S + K+ + N + + + +I + + T Sbjct: 243 RTQSKKKHKHPEKESQYRYIGKNIAFDYLN-PADISDEYLLKLRIVRVEVSDGVFENIIT 301 Query: 270 NLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 329 L E TP + Y+ R IE +FRDLK G HS+ + F++ + L Sbjct: 302 TLSEEDFTPDDIKYCYNLRWGIETSFRDLK-HTIGATNLHSKKTEYVAFELWSKLILYNF 360 Query: 330 LTCWLAGVHAQKQGWDKHFQAN 351 + + V + + +Q N Sbjct: 361 CSIIILHVPVKSRNRKYEYQVN 382 >UniRef50_Q0H069 ISEc13 transposase n=23 Tax=Bacteria RepID=Q0H069_ECOLX Length = 457 Score = 42.8 bits (99), Expect = 0.024, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 69/218 (31%), Gaps = 9/218 (4%) Query: 129 PLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFK-VPWYKSVEKLGWYWLSRVRGKV 187 ++A ++ L I + V D + G ++ R Sbjct: 164 EKESYRWQQASERMAERLGEIQK---RVITVCDREADIWHYLHYKVSHGQRFVVRAAQNR 220 Query: 188 QYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT 247 + + + ++ L +++ S TL + ++ + S + S + Sbjct: 221 RLEEAPGKLFELPEVL---ATAGSHTLNVMQKGGRAARQARMFISYSEVSIKNPDNSGQA 277 Query: 248 HCHHPSPKIYSASAKEPWILATNLPVEIRT-PKQLVNIYSKRMQIEETFRDLKSPAYGLG 306 A W L T+ V +++V+ Y +R IEE + KS + Sbjct: 278 LPLTYVCCREQAEDGACWHLLTSEKVASAADARRIVSHYERRWLIEEYHKAWKSGGTCVE 337 Query: 307 LRHSRTSSS-ERFDIMLLIALMLQLTCWLAGVHAQKQG 343 +T + ER ++ + L G+ + Q Sbjct: 338 SLRMQTRDNLERMVVIKAFIAVRVLGLRQGGISEETQN 375 >UniRef50_UPI000190437B putative insertion sequence transposase protein n=1 Tax=Rhizobium etli Brasil 5 RepID=UPI000190437B Length = 330 Score = 42.4 bits (98), Expect = 0.028, Method: Composition-based stats. Identities = 23/95 (24%), Positives = 36/95 (37%), Gaps = 10/95 (10%) Query: 233 KSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP-------WILATNLPVEIRT-PKQLVNI 284 + RS GRK R + P W+L T + T Q+V Sbjct: 7 RRRSGGRKTLREEGLPDGVRLSWVEVVEPDAPDGVEPLHWLLLTTHALSSATDAWQIVAW 66 Query: 285 YSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD 319 Y +R IE+ FR +K+ G + S+ + R + Sbjct: 67 YKQRWMIEQFFRVMKTQ--GFKIEDSQLQLAARLE 99 >UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0AF19_9BACT Length = 362 Score = 42.4 bits (98), Expect = 0.029, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 46/146 (31%), Gaps = 2/146 (1%) Query: 157 LIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 L + D F + P + V L R + + S + Sbjct: 169 LFLGDRNFCRAPQIRHVMDHQGAVLLRWHSTSLPLFDQQGHALDVPAWLAQLRSRQCSEL 228 Query: 216 YKRLTKSNPIS-CQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVE 274 L + C + + ++ + + + P + ++ T+LP Sbjct: 229 PVFLKDGTALRLCALRVSPQAAQRERAKIRLSAKKNGRKPSCQCLCMADYIVVVTSLPSS 288 Query: 275 IRTPKQLVNIYSKRMQIEETFRDLKS 300 + ++ +Y R QIE F+ LKS Sbjct: 289 CLDSRGILQLYRLRWQIELAFKRLKS 314 >UniRef50_A6DTQ2 Putative transposase insL for insertion sequence IS186 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTQ2_9BACT Length = 375 Score = 42.4 bits (98), Expect = 0.031, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 5/70 (7%) Query: 231 LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 L ++ RK +++R P Y+ ++ T L + +P++++NIY R Q Sbjct: 259 LKAQKAIHRKASKNSRKGSTRPETLEYAGYI----LILTTLAESV-SPEKILNIYRSRWQ 313 Query: 291 IEETFRDLKS 300 IE F+ LKS Sbjct: 314 IELLFKRLKS 323 >UniRef50_Q4BVH8 Putative uncharacterized protein n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4BVH8_CROWT Length = 168 Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats. Identities = 12/93 (12%), Positives = 31/93 (33%), Gaps = 5/93 (5%) Query: 74 RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQ 133 + +G+ + I+ +D + +++ MV +V ++ +Y Sbjct: 49 FPVIKAIICKEFKTGSRL-IITIDRTQWKDKNVFMV---AVIWKKLALPIYWTLLGKRGA 104 Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGFKV 166 + + +L N +I+ D F Sbjct: 105 SRLSEQQALIQPVLCLL-KNYELVILGDREFHS 136 >UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID=Q4V248_BACCZ Length = 140 Score = 42.0 bits (97), Expect = 0.037, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 3/88 (3%) Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWIL---ATNLP 272 + K + ++++Y+ K + +R + + YS +K + TN P Sbjct: 42 EAYIGKDQKLFTRVIIYRLTEKQIQERRKKQNYTESKKGITYSEKSKRLTGINIYVTNTP 101 Query: 273 VEIRTPKQLVNIYSKRMQIEETFRDLKS 300 EI +Q+ + YS R QIE TF+ KS Sbjct: 102 WEIVPMEQIHDFYSLRWQIEITFKTWKS 129 >UniRef50_C4RAB7 Putative uncharacterized protein n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAB7_9PROT Length = 192 Score = 42.0 bits (97), Expect = 0.037, Method: Composition-based stats. Identities = 10/98 (10%), Positives = 32/98 (32%), Gaps = 2/98 (2%) Query: 94 VLVDWSDIREQKRLM-VLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPS 152 + +D ++ + + + +L + + L+ + + L + P Sbjct: 10 LALDRTNWKFGRCHINILMLGIVHEKVCIPLFWSLRDKAGNSNAPERTALLERMIKTFPD 69 Query: 153 NTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQY 189 + D F W + + G ++ R++ + Sbjct: 70 QPISSLSGDREFIGEKWMGWLHERGIPFVLRLKENMHV 107 >UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 RepID=A9DPK2_9GAMM Length = 269 Score = 42.0 bits (97), Expect = 0.039, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 65/176 (36%), Gaps = 21/176 (11%) Query: 157 LIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 L++ DAG F + + +K G + + R GK+ I D + L Sbjct: 34 LLLMDAGYFNIDYCYQADKHGGHVIMRTNGKIN---------PDIKAAFDSQGLAIEGLI 84 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 K+L + QI+ + + H + + L TNL Sbjct: 85 GKKLKQLKWHREQII--------DLDVQWKSKPGTHRLIAFWDRNKSAIGYLITNLKRAQ 136 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLT 331 + ++ +Y R QIE F++LKS + GL+ T + ++ +++ L Sbjct: 137 FSADKVSKLYGLRWQIELFFKELKSYS---GLKTFNTRDKSIAESLVWASMLTLLL 189 >UniRef50_Q05309 Transposase for insertion sequence element IS1151 n=16 Tax=Clostridium perfringens RepID=T1151_CLOPE Length = 473 Score = 42.0 bits (97), Expect = 0.040, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 67/198 (33%), Gaps = 26/198 (13%) Query: 160 SDAG-FKVPWYKSVEKLGWYWLSRVRGKVQY-------------ADLGAENWKPISNLH- 204 +D G FK+ + K ++K G ++S+V+ + + I + Sbjct: 193 ADLGYFKIDYLKRLDKSGTAFISKVKSNTSLYIKNPSPEKYKVGTIKKSSEYIKIDIIKL 252 Query: 205 --DMSSSHSKTLGYKRLTKSNPISCQILLYK---SRSKGRKNQRSTRTHCHHPSPKIYSA 259 +++ + L + + ++++ K R + Sbjct: 253 AEPLAAGETIELTDIYIGSKKELKSRLIITKLTEENKSKRIFNHIEGIKKKRLTLNQRRL 312 Query: 260 SAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD 319 TN+ I T Q+ +YS R QIE F+ KS + + ERF Sbjct: 313 DFNSINAYITNVSSNIITMNQVHELYSLRWQIEIIFKVWKSI---FKINQVKKVKLERFM 369 Query: 320 IML---LIALMLQLTCWL 334 L LIAL+L T Sbjct: 370 CFLYGRLIALLLSSTIVF 387 >UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coprococcus comes ATCC 27758 RepID=C0BDH6_9FIRM Length = 204 Score = 42.0 bits (97), Expect = 0.042, Method: Composition-based stats. Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 3/51 (5%) Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE 316 + TNLP + +++ +Y+ R IE +FR LK Y +GL + E Sbjct: 143 CIVTNLPRDEFPVERIKTLYNARWSIESSFRKLK---YTIGLSNFHAYKPE 190 >UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3VMZ1_KLEPN Length = 421 Score = 42.0 bits (97), Expect = 0.043, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 44/147 (29%), Gaps = 15/147 (10%) Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 + P+ + + + + + + + +P L TNL Sbjct: 227 CNQTGLCQPLRSWLAVLPKHGELDLDVQWPDGPVYRCVLFASTDHKDKPVCLCTNLDRHT 286 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 + Y R QIE F++ KS L T S + ++ +L+ Sbjct: 287 FPAATVGEWYRLRWQIELLFKEWKSLN---SLNKFNTEYSTIAETLIWGSLLAATLKRWL 343 Query: 336 GVHAQKQGWDKHFQANTVRNRNVLSTV 362 AQ+ + R VLS Sbjct: 344 INGAQQ------------KYRRVLSMF 358 >UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TD95_HELMI Length = 441 Score = 41.6 bits (96), Expect = 0.046, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 50/152 (32%), Gaps = 20/152 (13%) Query: 157 LIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQ-------YADLGAENWKPISNLHDMSS 208 +++ D G F + + Y++SR++ G L D+ Sbjct: 191 ILLFDLGYFSFKHFGKIMNEKGYFVSRLKSNSNPLILRSLIQHRGRTIAVEGKRLLDIKG 250 Query: 209 SHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILA 268 S + + + SN S + L K + R + Sbjct: 251 SLRREIIDFEVLVSNSQSSNMDLVKRTAL---QLRVVGILNEETKDYHFY---------I 298 Query: 269 TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 TNLP E + + +Y R IE F++LKS Sbjct: 299 TNLPAERFPAEDIATLYRARWTIELLFKELKS 330 >UniRef50_A7C1C1 IS231-related transposase n=6 Tax=Beggiatoa sp. PS RepID=A7C1C1_9GAMM Length = 445 Score = 41.6 bits (96), Expect = 0.047, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 60/166 (36%), Gaps = 12/166 (7%) Query: 180 LSRVRGKVQYADLGAENW-KPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYK----- 233 LSR+R D E + + L ++ + L + + ++ + + Sbjct: 212 LSRLRHDAVLFDEQEEEFDLSLYTLFMKKNNRLRAELNVLLVRYEKLPVRLFIERVPEMI 271 Query: 234 SRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEE 293 S + R+ + + S S + +L T P + + + +Y R QIE Sbjct: 272 SSKRRRQANKGASKKKKGKTASKKSLSLCDFTLLVTTAPSVQLSFDEALVLYGARWQIEL 331 Query: 294 TFRDLKSPAYGLGLRHSRTSSSERFDI---MLLIALMLQLTCWLAG 336 F+ KS A L S + R + L+A ++Q L G Sbjct: 332 LFKLWKSHA---KLDTSIRPNPWRICRYIYIKLLACLVQHWIILMG 374 >UniRef50_A6FXH7 Transposase, IS4 n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6FXH7_9DELT Length = 377 Score = 41.6 bits (96), Expect = 0.052, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 35/119 (29%), Gaps = 11/119 (9%) Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIML-LIALMLQLTCWL 334 + +++ YS+R IE FRDLK G + + + L Sbjct: 249 MSTSRILEAYSRRWAIEVCFRDLKQE---FGFEDCQARKQNAVERTCPFLGYCYSLLVLW 305 Query: 335 AGVHAQKQGWDKHFQANTVRNRNVLST---VRLGMEVLRHSGYTITREDSLVAATLLTQ 390 + + + R + S +R VL G D L L + Sbjct: 306 FASLTESERRAAEVERPWYRTKQNYSFADVLRAARLVLSQPG----VSDLLCDLADLGE 360 >UniRef50_Q04QP0 Transposase, ISLbp11 n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QP0_LEPBJ Length = 243 Score = 41.6 bits (96), Expect = 0.053, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 31/78 (39%), Gaps = 4/78 (5%) Query: 259 ASAKEPWILATNLP-VEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSER 317 W T +P K++++ Y R IE F+ LKS G + ++ +R Sbjct: 81 KEESIDWKFLTTIPIHNSENAKRVISYYKSRWGIEVFFKVLKS---GCNIESTQFKFGDR 137 Query: 318 FDIMLLIALMLQLTCWLA 335 F + ++ ++ + Sbjct: 138 FKACIAVSAIVAWRVTML 155 >UniRef50_B8FI31 Transposase IS4 family protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FI31_DESAA Length = 386 Score = 41.6 bits (96), Expect = 0.054, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 77/215 (35%), Gaps = 35/215 (16%) Query: 150 LPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 LP + ++V D + W+K+ + ++++R++ ++Y L + + Sbjct: 183 LPKGS--ILVEDRAYVDFTWFKNWHENKQFFVTRLKKNIKYKVLERRDVPQNKGVTSD-- 238 Query: 209 SHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILA 268 +LT C N R + + K+ ++ Sbjct: 239 ------QIIKLTGKKAADCP------------NLRRVG---------YWDKTTKKHYVYL 271 Query: 269 TNLPVEIRTPKQLVNIYSKRMQIEETFRDLK-SPAYGLGLRHSRTSSSERFDIMLLIALM 327 TNL + + + +IY R QIE F+ +K + L +SR + + ++ L+ Sbjct: 272 TNLTK--LSARTIADIYKDRWQIELFFKWIKQNLRIKSFLGNSRNAVLTQIWTAMISMLI 329 Query: 328 LQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTV 362 L W A + A K Q + RN+ Sbjct: 330 LAYYKWRAKIGATLTEMLKLLQLTLMERRNLYELF 364 >UniRef50_C1DPR7 Transposase n=7 Tax=Proteobacteria RepID=C1DPR7_AZOVD Length = 460 Score = 41.6 bits (96), Expect = 0.055, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 57/184 (30%), Gaps = 16/184 (8%) Query: 179 WLSRVRGKVQYADLGAEN-WKPIS------NLHDMSSSHSKTLGYKRLTKSNPISCQILL 231 W+ R + A+ W ++ L + K R + S ++L Sbjct: 209 WIVRAAQDRRVLTGDADKLWASLAMAPGLGQLAVEVRARPKR--PARQARVTLRSATVVL 266 Query: 232 YKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI-RTPKQLVNIYSKRMQ 290 GR + W+L T+LPV +V Y+ R Sbjct: 267 RPPARIGRHLPEVSVNAVLAREENPPEGVEPLEWLLLTSLPVGSLEQASTIVAWYAVRWY 326 Query: 291 IEETFRDLKSPAYGLGLRHSRTSSSERF---DIMLLIALMLQLTCWLAGVHAQKQGWDKH 347 IE F LK+ G + + + ER + L+ L + G + + Sbjct: 327 IEIYFHVLKN---GCQINCLQLETEERLLPCIGLYLVVAWRVLYSLMLGRACPELNCELI 383 Query: 348 FQAN 351 F+A Sbjct: 384 FEAR 387 >UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B70E Length = 479 Score = 41.6 bits (96), Expect = 0.058, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 63/197 (31%), Gaps = 26/197 (13%) Query: 129 PLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYK-SVEKLGWYWLSRVRGKV 187 P SE AH L +L N + ++D + +E L + ++ R + Sbjct: 183 PYSETPLAFAH---LYRTREML-ENQKVIYLADRYYGSAEIISHLEDLRYSYVIRGKSNF 238 Query: 188 QYADLGA----ENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQR 243 + + W + + K L + L K + Sbjct: 239 YKKQVAGMESDDEWIEVE------------VDEKWLKRFRFSPEAKKLRKENPTLKIRVI 286 Query: 244 STRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAY 303 K + + I TNL E T +++ IYS+R IE +++ +K+ Sbjct: 287 KREYRYTDNKNKEHCENL----IYFTNLSSESFTTDEIMEIYSRRWDIEVSYKTMKTTQ- 341 Query: 304 GLGLRHSRTSSSERFDI 320 + S R DI Sbjct: 342 EVERHISSDGDVARNDI 358 >UniRef50_C3BTW8 Transposase for insertion sequence element IS231B n=13 Tax=Bacillus RepID=C3BTW8_9BACI Length = 387 Score = 41.2 bits (95), Expect = 0.059, Method: Composition-based stats. Identities = 28/102 (27%), Positives = 40/102 (39%), Gaps = 3/102 (2%) Query: 231 LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 L K + + R ++ R S + TN P +I QL + YS R Q Sbjct: 199 LTKEQQQKRLQDQTVREKKKGMKYSARSKRLSGINVYMTNTPTDIVPMGQLHDWYSLRWQ 258 Query: 291 IEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTC 332 IE F+ KS Y + H + ER + L L+ L C Sbjct: 259 IEILFKTWKSFFY---IHHCKKIKRERLECHLYGQLIAILLC 297 >UniRef50_C8VXW5 Transposase IS4 family protein n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VXW5_DESAS Length = 587 Score = 41.2 bits (95), Expect = 0.065, Method: Composition-based stats. Identities = 36/268 (13%), Positives = 84/268 (31%), Gaps = 12/268 (4%) Query: 124 YEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSD-AGFKVPWYKSVEKLGWYWLSR 182 YE+ + E + FL + + ++ D A F + + + ++ Sbjct: 306 YEQIMVMQEFLPFD-ENAFL-YYREFIIDDRRYILTFDVARFFDEHHAQLNNVAYFVQWL 363 Query: 183 VRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQ 242 + + + + + K L P +++ R R Q Sbjct: 364 TVKNQSLREAKKKRCQSLLEREVAAMLKRKHLKKWVSVNIEPYDFEVI--NKRGNSRTIQ 421 Query: 243 RSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPA 302 ++ + + TNL V T ++ Y ++ ++EE F ++K Sbjct: 422 SFQLSYTINTVAQKNEQRIHGITCFITNLDVTSHTAIDIIQWYRRKNKVEEAFHEIKDH- 480 Query: 303 YGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQ--ANTVRNRNV-- 358 L LR + +R ++I ++ ++ + T+R V Sbjct: 481 --LDLRPIYLTREQRVMAHVIICVLAYFIFNDIEYRLKQNDLAYSTEEVIGTLRECLVNR 538 Query: 359 LSTVRLGMEVLRHSGYTITREDSLVAAT 386 L+ + L + + ++ L A Sbjct: 539 LAIQQTNRSWLSITQPSSQLKEILHALK 566 >UniRef50_C6J5I2 Transposase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5I2_9BACL Length = 190 Score = 41.2 bits (95), Expect = 0.066, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 50/149 (33%), Gaps = 26/149 (17%) Query: 154 TTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKT 213 P+++ P V K G + + V+ + L + L+ + Sbjct: 48 VKPILMDSWFTHAPLIGEVVKRGLHVIGMVKNDNK-RFLVQGRKLSLKELYAAAPRVDSK 106 Query: 214 LGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI--LATNL 271 + + ++C I ++ + + K W+ L T+L Sbjct: 107 KRHILRSIRTELACGIPIH--------------------VVFVRHRTNKNEWLAILMTDL 146 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 T ++++ IY+ R IE F+ KS Sbjct: 147 ---TLTVEEVIQIYAMRWDIEVFFKCTKS 172 >UniRef50_D1RLH5 Putative uncharacterized protein n=3 Tax=Legionella longbeachae D-4968 RepID=D1RLH5_LEGLO Length = 96 Score = 41.2 bits (95), Expect = 0.071, Method: Composition-based stats. Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 2/54 (3%) Query: 318 FDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRH 371 I+LL+A + WLA + G FQA++ + + LS V LG VL+ Sbjct: 1 MKILLLVAAIATFAAWLA--DIKSIGKTSDFQAHSAKYTSALSIVFLGKHVLKK 52 >UniRef50_Q8PF48 Transposase n=1 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PF48_XANAC Length = 136 Score = 40.9 bits (94), Expect = 0.080, Method: Composition-based stats. Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 6/94 (6%) Query: 306 GLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLG 365 S T ER + +LL + WLAG+ A+ +H +R + +RLG Sbjct: 17 AFEGSLTRKRERIETLLLPHALAMFASWLAGMAAEAID-AQHNLNPYRTSRRLYCLMRLG 75 Query: 366 ME-----VLRHSGYTITREDSLVAATLLTQNLFT 394 E L H R L+A + +F+ Sbjct: 76 QEASCRGWLEHLSTHAGRTTPLLAGSQSEPAIFS 109 >UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZNH0_9PLAN Length = 451 Score = 40.9 bits (94), Expect = 0.091, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 60/185 (32%), Gaps = 13/185 (7%) Query: 157 LIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 L + D G++ + + G ++ R+ + L + + ++ + Sbjct: 237 LFLMDRGYRSAELFNKIHTAGHDYICRL-NRTDGKLLKPPKKGEVREPIQLPPLSAEAIA 295 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRST------RTHCHHPSPKIYSASAKEPWILAT 269 + +R R + ++ +LAT Sbjct: 296 MGIVADELITMGGNCGASKIGSDHPMRRIKLIPPADRPSSARQGRVRTDQTGRDELVLAT 355 Query: 270 NLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 329 L T +++V +Y R ++E FR LK LG + ++ + I L A++ Sbjct: 356 TL--MDLTAEEIVRLYEHRWEVELFFRFLKQV---LGCKKLLSAKTAGVQIQLYCAIIAS 410 Query: 330 LTCWL 334 L L Sbjct: 411 LLLAL 415 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE 387 e-106 UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobact... 326 1e-87 UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID... 306 1e-81 UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromon... 304 4e-81 UniRef50_A7N7H3 Putative uncharacterized protein n=31 Tax=Vibrio... 304 5e-81 UniRef50_Q6LPG7 Hypothetical transposase n=7 Tax=Photobacterium ... 297 5e-79 UniRef50_Q07YD1 Transposase, IS4 family n=6 Tax=Shewanella RepID... 292 2e-77 UniRef50_C6MY57 Putative transposase, IS4 family protein n=1 Tax... 285 2e-75 UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae Re... 285 2e-75 UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii ... 263 1e-68 UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 1... 262 2e-68 UniRef50_Q17U39 Transposase n=11 Tax=Gammaproteobacteria RepID=Q... 259 2e-67 UniRef50_Q5X8W8 Putative uncharacterized protein n=1 Tax=Legione... 246 1e-63 UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2... 245 2e-63 UniRef50_B2JAE4 Transposase, IS4 family protein n=8 Tax=Cyanobac... 199 2e-49 UniRef50_C7QY62 Transposase IS4 family protein n=9 Tax=Cyanothec... 197 5e-49 UniRef50_Q5ZXB3 ORF2 transposase n=9 Tax=Legionella RepID=Q5ZXB3... 195 3e-48 UniRef50_D2SUD5 Transposase n=1 Tax=uncultured bacterium psy1 Re... 195 3e-48 UniRef50_A9AVJ1 Transposase IS4 family protein n=1 Tax=Herpetosi... 186 8e-46 UniRef50_B4UH67 Transposase IS4 family protein n=3 Tax=Proteobac... 186 1e-45 UniRef50_B4WTK1 Putative uncharacterized protein n=6 Tax=Synecho... 186 1e-45 UniRef50_Q1VRR5 Putative uncharacterized protein n=8 Tax=Bactero... 186 1e-45 UniRef50_A7MW84 Putative uncharacterized protein n=2 Tax=Vibrio ... 185 2e-45 UniRef50_D0LPB8 Transposase IS4 family protein n=4 Tax=Haliangiu... 183 7e-45 UniRef50_B1XQT5 Tn10-like transposase (IS4 family) n=14 Tax=Cyan... 181 3e-44 UniRef50_B0JUB6 Transposase n=18 Tax=Cyanobacteria RepID=B0JUB6_... 181 3e-44 UniRef50_B5VUF1 Transposase IS4 family protein n=17 Tax=Arthrosp... 172 2e-41 UniRef50_B5VWL5 Transposase IS4 family protein n=6 Tax=Arthrospi... 168 3e-40 UniRef50_A7NGF0 Transposase IS4 family protein n=1 Tax=Roseiflex... 167 8e-40 UniRef50_C4YZ17 Transposase, IS4 family protein n=4 Tax=Ricketts... 166 1e-39 UniRef50_B7KME5 Transposase IS4 family protein n=42 Tax=Cyanobac... 165 2e-39 UniRef50_B0BZT8 Transposase, IS4 family n=21 Tax=Cyanobacteria R... 165 3e-39 UniRef50_C7RIL9 Transposase IS4 family protein n=1 Tax=Candidatu... 160 1e-37 UniRef50_Q1IXF5 Transposase, IS4 n=6 Tax=Bacteria RepID=Q1IXF5_D... 157 6e-37 UniRef50_Q72IB6 Transposase n=3 Tax=Thermus thermophilus HB27 Re... 157 6e-37 UniRef50_A5UQG7 Transposase, IS4 family n=1 Tax=Roseiflexus sp. ... 156 2e-36 UniRef50_C1XLC5 Transposase family protein n=1 Tax=Meiothermus r... 155 3e-36 UniRef50_UPI000038476B hypothetical protein Magn03010330 n=1 Tax... 149 2e-34 UniRef50_A8ZRP2 Transposase IS4 family protein n=1 Tax=Deinococc... 149 2e-34 UniRef50_B5K928 Transposase, IS4 n=23 Tax=Alphaproteobacteria Re... 141 4e-32 UniRef50_A9AUQ0 Transposase IS4 family protein n=3 Tax=Herpetosi... 139 1e-31 UniRef50_Q3M186 Putative uncharacterized protein n=2 Tax=Anabaen... 138 3e-31 UniRef50_C1D0Y0 Putative transposase n=1 Tax=Deinococcus deserti... 138 5e-31 UniRef50_B4WNR8 Putative uncharacterized protein n=1 Tax=Synecho... 137 6e-31 UniRef50_B4W0I0 Transposase, IS4 family protein n=1 Tax=Microcol... 133 2e-29 UniRef50_A8ZQL6 Putative uncharacterized protein n=1 Tax=Acaryoc... 132 2e-29 UniRef50_C8Q1E5 Transposase, IS4 family n=1 Tax=Enhydrobacter ae... 132 2e-29 UniRef50_Q9RZJ3 Transposase, putative n=9 Tax=Deinococcus radiod... 130 9e-29 UniRef50_Q9UH48 Gastric cancer-related protein GCYS-20 n=1 Tax=H... 129 2e-28 UniRef50_Q2S0J1 Putative transposase n=1 Tax=Salinibacter ruber ... 126 1e-27 UniRef50_D1C6P8 Putative uncharacterized protein n=2 Tax=Sphaero... 123 1e-26 UniRef50_Q10V90 Transposase, IS4 family n=7 Tax=Trichodesmium er... 121 4e-26 UniRef50_A8ZMZ5 Putative uncharacterized protein n=1 Tax=Acaryoc... 121 4e-26 UniRef50_B7I4U9 Transposase 1 n=31 Tax=Bacteria RepID=B7I4U9_ACIB5 119 1e-25 UniRef50_A5UPF7 Transposase, IS4 family n=1 Tax=Roseiflexus sp. ... 118 4e-25 UniRef50_UPI000197B669 hypothetical protein BACCOPRO_01365 n=1 T... 117 1e-24 UniRef50_Q5GUK2 ISxac1 transposase n=1 Tax=Xanthomonas oryzae pv... 116 2e-24 UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=... 115 3e-24 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 114 8e-24 UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellula... 110 7e-23 UniRef50_Q47076 BfpT, bfpV, bfpW and transposase genes, complete... 109 2e-22 UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus a... 109 2e-22 UniRef50_C5UVK8 Putative transposase n=1 Tax=Clostridium botulin... 109 3e-22 UniRef50_A5UY16 Transposase, IS4 family n=9 Tax=Roseiflexus sp. ... 106 2e-21 UniRef50_B2AKB8 Transposase, IS4 family n=40 Tax=cellular organi... 101 4e-20 UniRef50_B1QZ52 Putative transposase n=2 Tax=Clostridium butyric... 99 2e-19 UniRef50_B4VLK2 Transposase, IS4 family protein n=1 Tax=Microcol... 99 3e-19 UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geoba... 98 6e-19 UniRef50_Q7NHH4 Gll2563 protein n=2 Tax=Gloeobacter violaceus Re... 98 8e-19 UniRef50_Q7NIQ3 Glr2130 protein n=2 Tax=Gloeobacter violaceus Re... 96 2e-18 UniRef50_D0SG98 Transposase n=3 Tax=Gammaproteobacteria RepID=D0... 96 2e-18 UniRef50_Q8A4P1 Transposase n=5 Tax=Bacteroides RepID=Q8A4P1_BACTN 96 3e-18 UniRef50_C6LGD4 Transposase, IS4 family protein n=3 Tax=Lachnosp... 95 5e-18 UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=... 94 1e-17 UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW 93 3e-17 UniRef50_C3M9W9 Modified transposase for insertion sequence NGRI... 91 7e-17 UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliph... 90 1e-16 UniRef50_Q1QFL8 Putative uncharacterized protein n=1 Tax=Nitroba... 90 2e-16 UniRef50_Q6ZER7 Putative uncharacterized protein sll5063 n=1 Tax... 89 4e-16 UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales Rep... 89 4e-16 UniRef50_A7MYH1 Putative uncharacterized protein n=4 Tax=Vibrio ... 86 2e-15 UniRef50_UPI00016C424B Transposase n=1 Tax=Gemmata obscuriglobus... 86 2e-15 UniRef50_C4ILZ9 Putative iso-IS10R ORF n=1 Tax=Clostridium butyr... 85 4e-15 UniRef50_Q1AUS1 Putative uncharacterized protein n=4 Tax=Rubroba... 85 5e-15 UniRef50_Q6MB98 Putative uncharacterized protein n=1 Tax=Candida... 85 7e-15 UniRef50_Q1ARL9 Transposase, IS4 family n=1 Tax=Rubrobacter xyla... 82 4e-14 UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostrid... 81 5e-14 UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostri... 80 1e-13 UniRef50_A5WBL3 Transposase, IS4 family n=2 Tax=Bacteria RepID=A... 80 1e-13 UniRef50_A8YLR7 Genome sequencing data, contig C326 n=5 Tax=Micr... 80 2e-13 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 80 2e-13 UniRef50_B0JGV7 Putative uncharacterized protein n=1 Tax=Microcy... 79 2e-13 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 78 5e-13 UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC... 77 9e-13 UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosi... 76 2e-12 UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candida... 76 2e-12 UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuri... 75 4e-12 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 75 4e-12 UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_... 72 4e-11 UniRef50_A5FWE3 Transposase, IS4 family protein n=2 Tax=Acidiphi... 70 1e-10 UniRef50_A8YMK7 Genome sequencing data, contig C327 n=1 Tax=Micr... 70 1e-10 UniRef50_UPI0001C16BE8 Transposase, IS4 protein n=1 Tax=Cylindro... 70 2e-10 UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfati... 69 3e-10 UniRef50_C4YUK3 Transcription-repair-coupling factor n=29 Tax=Ri... 68 8e-10 UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4... 67 1e-09 UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium... 67 1e-09 UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula balt... 67 1e-09 UniRef50_A5UVL8 Putative uncharacterized protein n=1 Tax=Roseifl... 67 1e-09 UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacter... 67 1e-09 UniRef50_Q1QJQ9 Putative uncharacterized protein n=1 Tax=Nitroba... 65 4e-09 UniRef50_Q10VE7 Putative uncharacterized protein n=1 Tax=Trichod... 64 9e-09 UniRef50_Q6LGR5 Putative transposase similar to Tn10 n=1 Tax=Pho... 61 8e-08 UniRef50_Q6ZER8 Putative uncharacterized protein sll5062 n=1 Tax... 61 1e-07 UniRef50_A3IP38 Putative uncharacterized protein n=1 Tax=Cyanoth... 59 2e-07 UniRef50_Q1QGK1 Putative uncharacterized protein n=1 Tax=Nitroba... 57 1e-06 UniRef50_P11901 Transposase for insertion sequence element IS421... 57 1e-06 UniRef50_A7N4N2 Putative uncharacterized protein n=1 Tax=Vibrio ... 55 4e-06 UniRef50_A7HFH6 Putative uncharacterized protein n=1 Tax=Anaerom... 47 0.001 Sequences not found previously or not previously below threshold: UniRef50_Q1J2M1 Transposase IS4 family protein n=4 Tax=Deinococc... 95 7e-18 UniRef50_C6YRC6 Transposase ISFtu5 n=6 Tax=Francisella tularensi... 87 9e-16 UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipe... 79 2e-13 UniRef50_Q1J3A6 IS1 related protein n=4 Tax=Deinococcus geotherm... 76 2e-12 UniRef50_C4YZA5 Transposase n=28 Tax=Rickettsia endosymbiont of ... 73 2e-11 UniRef50_B7I4G0 Transposase subunit n=16 Tax=Bacteria RepID=B7I4... 71 9e-11 UniRef50_P12249 Transposase for insertion sequence element IS231... 69 2e-10 UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipe... 69 3e-10 UniRef50_C4RAB7 Putative uncharacterized protein n=1 Tax=magneti... 68 8e-10 UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostri... 68 9e-10 UniRef50_Q7UY96 Similar to transposase n=1 Tax=Rhodopirellula ba... 67 1e-09 UniRef50_A7IQF9 Putative uncharacterized protein n=1 Tax=Xanthob... 67 1e-09 UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepI... 66 2e-09 UniRef50_C0VKT5 IS4 family transposase ORF 2 n=5 Tax=Acinetobact... 66 2e-09 UniRef50_UPI00016AD9A8 transposase Tn5 n=1 Tax=Burkholderia thai... 66 2e-09 UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coproco... 66 2e-09 UniRef50_A1RNX9 Transposase, Tn5 family n=93 Tax=Gammaproteobact... 66 3e-09 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 64 7e-09 UniRef50_Q5ZTU2 Transposase Tn5 n=9 Tax=root RepID=Q5ZTU2_LEGPH 64 1e-08 UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae Rep... 63 2e-08 UniRef50_Q05309 Transposase for insertion sequence element IS115... 63 3e-08 UniRef50_C4YUW4 Transposase subunit n=4 Tax=Rickettsia endosymbi... 62 4e-08 UniRef50_Q5ZXP7 Putative uncharacterized protein n=5 Tax=Gammapr... 61 5e-08 UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostrid... 61 6e-08 UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostri... 61 8e-08 UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungat... 61 1e-07 UniRef50_A9F243 Transposase, IS4 family n=4 Tax=Sorangium cellul... 61 1e-07 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 60 1e-07 UniRef50_C6N6I3 Transposase n=2 Tax=Gammaproteobacteria RepID=C6... 60 2e-07 UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID... 60 2e-07 UniRef50_Q4BVH8 Putative uncharacterized protein n=1 Tax=Crocosp... 59 2e-07 UniRef50_Q7M7G3 Gll0371 protein n=1 Tax=Gloeobacter violaceus Re... 59 3e-07 UniRef50_Q6MB99 Putative uncharacterized protein n=2 Tax=Candida... 59 3e-07 UniRef50_Q6LRT4 Similar to transposase n=38 Tax=Photobacterium p... 59 3e-07 UniRef50_B0NZ84 Putative uncharacterized protein n=1 Tax=Clostri... 59 3e-07 UniRef50_A8F1V7 Transposase and inactivated derivative n=1 Tax=R... 58 5e-07 UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax... 58 7e-07 UniRef50_B6BYZ1 Putative uncharacterized protein n=1 Tax=Nitroso... 58 7e-07 UniRef50_B9K450 Transposase n=6 Tax=cellular organisms RepID=B9K... 58 8e-07 UniRef50_Q0H069 ISEc13 transposase n=23 Tax=Bacteria RepID=Q0H06... 58 9e-07 UniRef50_C1DPR7 Transposase n=7 Tax=Proteobacteria RepID=C1DPR7_... 58 9e-07 UniRef50_Q1Q2K2 Putative uncharacterized protein n=5 Tax=Candida... 57 1e-06 UniRef50_A7C1C1 IS231-related transposase n=6 Tax=Beggiatoa sp. ... 57 1e-06 UniRef50_C3BTW8 Transposase for insertion sequence element IS231... 57 1e-06 UniRef50_B4WVP9 Putative uncharacterized protein n=1 Tax=Synecho... 57 2e-06 UniRef50_C8VXW5 Transposase IS4 family protein n=3 Tax=Desulfoto... 56 2e-06 UniRef50_A8YDI5 Similarity. Hypothetical start n=1 Tax=Microcyst... 56 2e-06 UniRef50_C0INS6 Transposase n=1 Tax=uncultured bacterium BLR10 R... 56 2e-06 UniRef50_Q04QP0 Transposase, ISLbp11 n=2 Tax=Leptospira borgpete... 56 2e-06 UniRef50_A5II18 Transposase, IS4 n=1 Tax=Legionella pneumophila ... 56 3e-06 UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3... 56 3e-06 UniRef50_A6L0R8 Transposase n=13 Tax=Bacteroidales RepID=A6L0R8_... 55 4e-06 UniRef50_A4J2U7 Transposase, IS4 family protein n=3 Tax=Desulfot... 55 4e-06 UniRef50_C8W0R4 Transposase IS4 family protein n=4 Tax=Desulfoto... 55 5e-06 UniRef50_A1WHR7 Transposase, IS4 family n=11 Tax=Proteobacteria ... 55 6e-06 UniRef50_Q6MCG2 Putative uncharacterized protein n=1 Tax=Candida... 54 7e-06 UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q0... 54 7e-06 UniRef50_Q18HG4 Tn5-like transposase n=1 Tax=Haloquadratum walsb... 54 8e-06 UniRef50_C7RPQ7 Putative uncharacterized protein n=1 Tax=Candida... 54 8e-06 UniRef50_B0JGI1 Transposase n=37 Tax=Bacteria RepID=B0JGI1_MICAN 54 1e-05 UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DS... 54 1e-05 UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium m... 53 2e-05 UniRef50_A6DTQ2 Putative transposase insL for insertion sequence... 53 2e-05 UniRef50_B6FLV1 Putative uncharacterized protein (Fragment) n=1 ... 53 2e-05 UniRef50_UPI0001C15C40 hypothetical protein CRC_03218 n=1 Tax=Cy... 53 2e-05 UniRef50_C6JEA3 Putative uncharacterized protein n=1 Tax=Ruminoc... 52 3e-05 UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 Rep... 52 4e-05 UniRef50_B8FXQ3 Transposase IS4 family protein n=8 Tax=Desulfito... 51 5e-05 UniRef50_Q1Q5J6 Putative uncharacterized protein n=6 Tax=Candida... 51 6e-05 UniRef50_B7KKS2 Putative uncharacterized protein n=3 Tax=Cyanoth... 51 6e-05 UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID... 51 6e-05 UniRef50_C1P7N3 Transposase IS4 family protein n=5 Tax=Bacillus ... 51 7e-05 UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacte... 51 9e-05 UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobiu... 51 1e-04 UniRef50_C6PFH6 Transposase IS4 family protein n=2 Tax=Thermoana... 50 1e-04 UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula mar... 50 1e-04 UniRef50_Q647P2 Transposase n=1 Tax=uncultured archaeon GZfos9E5... 50 1e-04 UniRef50_UPI0001C171A4 Putative transposase n=1 Tax=Raphidiopsis... 50 1e-04 UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001... 50 1e-04 UniRef50_P55729 Putative transposase y4zB n=4 Tax=Rhizobiaceae R... 50 2e-04 UniRef50_B7KMB2 Transposase IS4 family protein n=2 Tax=Cyanothec... 50 2e-04 UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_... 49 2e-04 UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanoth... 49 2e-04 UniRef50_Q8KKS9 Putative insertion sequence transposase protein ... 49 2e-04 UniRef50_C4XGQ6 Putative transposase for insertion sequence elem... 49 2e-04 UniRef50_C3KKH4 Putative transposase Y4ZB n=2 Tax=Rhizobium sp. ... 49 2e-04 UniRef50_A5N5R2 Transposase n=6 Tax=Clostridium RepID=A5N5R2_CLOK5 49 3e-04 UniRef50_C5EN31 Putative uncharacterized protein n=1 Tax=Clostri... 49 3e-04 UniRef50_Q0F098 ISGsu1, transposase n=6 Tax=Mariprofundus ferroo... 49 3e-04 UniRef50_B0C3Q4 Putative uncharacterized protein n=5 Tax=Cyanoba... 49 3e-04 UniRef50_Q64E61 Transposase n=1 Tax=uncultured archaeon GZfos14B... 49 3e-04 UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneo... 49 3e-04 UniRef50_Q1PWW4 Putative uncharacterized protein n=2 Tax=Candida... 49 4e-04 UniRef50_C5VJA1 Transposase domain protein n=15 Tax=Prevotella R... 49 4e-04 UniRef50_C6JFT0 Transposase family protein n=3 Tax=Clostridiales... 49 4e-04 UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM ... 48 4e-04 UniRef50_Q115Q8 Putative uncharacterized protein n=5 Tax=Trichod... 48 4e-04 UniRef50_C3FBK7 Transposase for insertion sequence element IS231... 48 4e-04 UniRef50_C7PAE4 Transposase IS4 family protein n=4 Tax=Chitinoph... 48 4e-04 UniRef50_Q3M187 Putative transposase n=10 Tax=Nostocaceae RepID=... 48 5e-04 UniRef50_Q2NQE6 Putative uncharacterized protein n=1 Tax=Sodalis... 48 5e-04 UniRef50_C9C7H0 Transposase n=5 Tax=Enterococcus faecium RepID=C... 48 5e-04 UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfoto... 48 6e-04 UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae R... 48 6e-04 UniRef50_A5EC94 Putative transposase n=1 Tax=Bradyrhizobium sp. ... 48 7e-04 UniRef50_Q10XV7 Putative uncharacterized protein n=2 Tax=Trichod... 48 7e-04 UniRef50_D0DW10 Transposase IS4 family protein n=5 Tax=Lactobaci... 48 8e-04 UniRef50_UPI00016C4547 putative transposase n=1 Tax=Gemmata obsc... 48 8e-04 UniRef50_A7B2R8 Putative uncharacterized protein n=2 Tax=Clostri... 47 0.001 UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0... 47 0.001 UniRef50_B1XIB9 Transposase n=2 Tax=Cyanobacteria RepID=B1XIB9_S... 47 0.001 UniRef50_C9KS84 Transposase domain protein n=5 Tax=Bacteroidales... 47 0.001 UniRef50_Q6MCG1 Putative uncharacterized protein n=1 Tax=Candida... 47 0.001 UniRef50_Q04TU6 Transposase, ISLbp10 n=3 Tax=cellular organisms ... 47 0.001 UniRef50_Q9X6I5 Putative uncharacterized protein n=2 Tax=Bacillu... 47 0.002 UniRef50_C9LFX6 Transposase domain protein n=14 Tax=Bacteroidale... 47 0.002 UniRef50_A3H523 Transposase (IS4 family) protein (Fragment) n=1 ... 47 0.002 UniRef50_Q18EK5 Probable transposase (ISH8/ISH26) n=5 Tax=Haloqu... 46 0.002 UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium... 46 0.002 UniRef50_C8NDH5 Transposase n=1 Tax=Cardiobacterium hominis ATCC... 46 0.002 UniRef50_A6DKD2 ISPg4, transposase n=7 Tax=Chlamydiae/Verrucomic... 46 0.002 UniRef50_UPI00016C58D0 Transposase n=1 Tax=Gemmata obscuriglobus... 46 0.002 UniRef50_Q6MS13 Transposase IS1634BQ n=39 Tax=Mycoplasma RepID=Q... 46 0.002 UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltapr... 46 0.002 UniRef50_Q46731 Transposase for transposon Tn5 n=15 Tax=root Rep... 46 0.002 UniRef50_A3DKE5 Transposase, IS4 n=14 Tax=Clostridium RepID=A3DK... 46 0.002 UniRef50_C0VKK7 ISCja2 transposase n=8 Tax=Acinetobacter RepID=C... 46 0.003 UniRef50_A4BSI6 Putative transposase n=3 Tax=Nitrococcus mobilis... 46 0.003 UniRef50_C5V7Z6 Transposase IS4 family protein n=3 Tax=root RepI... 46 0.003 UniRef50_D1N6R0 Transposase IS4 family protein n=1 Tax=Victivall... 46 0.003 UniRef50_Q2S608 Putative uncharacterized protein n=1 Tax=Salinib... 46 0.003 UniRef50_Q4C3L4 Putative uncharacterized protein n=1 Tax=Crocosp... 46 0.003 >UniRef50_B9EHT2 Olfr780 protein n=158 Tax=root RepID=B9EHT2_MOUSE Length = 402 Score = 387 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 398/402 (99%), Positives = 399/402 (99%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR Sbjct: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS Sbjct: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL Sbjct: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK Sbjct: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 NQRSTRTHCHHPSPKIYSASAKEPW+LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS Sbjct: 241 NQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS Sbjct: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 Query: 361 TVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLGKL 402 TVRLGMEVLRHSGYTITRED LVAATLL QNLFTHGY LGKL Sbjct: 361 TVRLGMEVLRHSGYTITREDLLVAATLLAQNLFTHGYALGKL 402 >UniRef50_Q15UH5 Transposase, IS4 family n=36 Tax=Gammaproteobacteria RepID=Q15UH5_PSEA6 Length = 420 Score = 326 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 195/389 (50%), Positives = 270/389 (69%), Gaps = 2/389 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ ILHD L + CP LH KRL++L +A +LLD + L+LTELGRN+ KHNIKR Sbjct: 19 MRDIHILHDLLKKQCPNLHAKRLSALMVATQSLLDGQQLSLTELGRNISGSVAPKHNIKR 78 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGN +LH ERL +YRWHA +C N MP+VLVDWSD+REQ R + LRASV++ GRS Sbjct: 79 IDRLLGNNNLHNERLDIYRWHARLLCGANPMPVVLVDWSDVREQLRHLTLRASVSVQGRS 138 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYE+ F E S +H+ FL +LASILP PLIV+DAG++ PW++ VEK GW+WL Sbjct: 139 VTLYERVFSFGEYNSPVSHNPFLRELASILPLGCCPLIVTDAGYRNPWFREVEKHGWFWL 198 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVRG V + G +W+ + + ++S +K LG +L + +P+ + LYK+++K RK Sbjct: 199 GRVRGDVGFKRDGQASWQSNKSFYPSANSRAKYLGCGQLGRKSPLHAHLHLYKAKAKHRK 258 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPV-EIRTPKQLVNIYSKRMQIEETFRDLK 299 + RS++ +H + + Y A +KEPW+LATNLP + KQLV++Y++RMQIEETFRD+K Sbjct: 259 DNRSSKAGRNHTAQQSYRAGSKEPWLLATNLPENDKLNSKQLVSLYARRMQIEETFRDIK 318 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 SP YG+GLRHS + ++RFDI+LLIA++ + L G+ A KQ W++ FQANT+R+R VL Sbjct: 319 SPQYGMGLRHSNSRCTKRFDILLLIAMLAEWLLRLLGIIAVKQNWERAFQANTIRHRRVL 378 Query: 360 STVRLGMEVLRHSG-YTITREDSLVAATL 387 S +RLG EV + + Y + A Sbjct: 379 SIIRLGREVRKRAKDYRMNSAQMTWAIAQ 407 >UniRef50_A3D336 Transposase, IS4 family n=6 Tax=Shewanella RepID=A3D336_SHEB5 Length = 460 Score = 306 bits (782), Expect = 1e-81, Method: Composition-based stats. Identities = 213/398 (53%), Positives = 282/398 (70%), Gaps = 2/398 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT-KARTKHNIK 59 M L ILH SLYQ CPE+H KRLN+L + C AL++ LTLT LGR++ TKH+IK Sbjct: 1 MQVLTILHQSLYQHCPEIHQKRLNTLMVTCRALINADCLTLTHLGRHIDGTSTHTKHSIK 60 Query: 60 RIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGR 119 R+DRLLGN HLH ER+AVY+WHA ++ + +TMP +LVDWSD+RE + L+ LRAS+A+ GR Sbjct: 61 RMDRLLGNPHLHHERMAVYQWHAKWLLTAHTMPTILVDWSDMREGRELIALRASIAIKGR 120 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYW 179 S+TLYE+ FPL Q ++ AH+QFL +L +LP N TPLIV+DAGF+ PW++ VE+LGWYW Sbjct: 121 SITLYERTFPLVLQGTQTAHNQFLNELRKVLPDNITPLIVTDAGFRNPWFRKVEQLGWYW 180 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 L RVRG Y + L+ + +K +G L+ P+ C+++L+++ SKGR Sbjct: 181 LGRVRGLSVYRPHPFGRQFSLKALYPQARRRAKHVGRVALSVKKPLLCEMVLFRAPSKGR 240 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 K QRST T CHH + Y +AKEPW L TNL ++ +P++LVNIY KRMQ+EETFRDLK Sbjct: 241 KGQRSTTTDCHHTAQWTYELTAKEPWALVTNLTMKAMSPQKLVNIYQKRMQMEETFRDLK 300 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 SPAYG GLRHSRT + R DI+LLIAL++QL W G++ + Q +HFQANTV+ RNVL Sbjct: 301 SPAYGFGLRHSRTRYAARMDILLLIALLVQLAFWWIGLYGETQQLQRHFQANTVKKRNVL 360 Query: 360 STVRLGMEVLRHS-GYTITREDSLVAATLLTQNLFTHG 396 ST+R+G E+LR Y I+ +D L AA L + THG Sbjct: 361 STIRMGKELLRRRHDYPISADDLLCAAKKLAELSLTHG 398 >UniRef50_A4C5E2 Hypothetical transposase n=2 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5E2_9GAMM Length = 397 Score = 304 bits (778), Expect = 4e-81, Method: Composition-based stats. Identities = 140/392 (35%), Positives = 210/392 (53%), Gaps = 1/392 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +LH + + + +L A L +++ LGR L + A+ KHNIKR Sbjct: 1 MHLNKLLHKTFSNTVGVIDKRNHCTLMKAAATLCQHTFISIAALGRKLKSNAKVKHNIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRL GN + R Y+ + P V +DWS + +LRA+V + GR+ Sbjct: 61 IDRLFGNPRVQFARYHYYQEITHRVIGQIRRPCVTIDWSGLTPCGEFHLLRAAVPVKGRA 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 +T+YE++F E + H F+ L SILPS+ P+IV+DAGF+ PW+K V K GW ++ Sbjct: 121 MTIYEQSFRECEYMKQSVHKDFIKTLKSILPSDCKPIIVTDAGFRNPWFKLVLKFGWDFV 180 Query: 181 SRVRGKVQYA-DLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 RVR + QY +W P+ L+ +++ L +L K+N +S L+KS+ K R Sbjct: 181 GRVRHQTQYQKPEDDTSWLPVKTLYSKATAKPVYLFETQLAKANSLSGHFYLFKSKPKQR 240 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 K + ++ A EPW+L T+L + + +V IYS+RMQIEE+FRDLK Sbjct: 241 KKKNLRGKTIRCSVSLKHAKGATEPWLLFTSLCNINYSAQDMVKIYSQRMQIEESFRDLK 300 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 + + GL LRH R+ R ++ LLIAL+ WLAG+ A+ + FQANT++NRNVL Sbjct: 301 NTSNGLNLRHCRSYEKGRLNVALLIALIANFILWLAGLTAKILNVHRSFQANTIKNRNVL 360 Query: 360 STVRLGMEVLRHSGYTITREDSLVAATLLTQN 391 S+ LG + GY I + L A L ++ Sbjct: 361 SSFSLGTQYFEKFGYKIKLKTFLEALKQLNKD 392 >UniRef50_A7N7H3 Putative uncharacterized protein n=31 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N7H3_VIBHB Length = 397 Score = 304 bits (777), Expect = 5e-81, Method: Composition-based stats. Identities = 143/392 (36%), Positives = 219/392 (55%), Gaps = 6/392 (1%) Query: 1 MCELDILHDSL--YQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNL-PTKARTKHN 57 M IL+ F +R+ ++ +AL + TLTLT LGR + TK + KH Sbjct: 1 MNLSTILNTFFLMSSFSAIKDKRRITAVLDCINALNEKDTLTLTGLGRGMKNTKTKVKHC 60 Query: 58 IKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALH 117 IKR+ RLLGN HLH+ER VY + F+ PI++VDWS + + +LRA++ + Sbjct: 61 IKRVYRLLGNPHLHRERTGVYAYITDFLLKNVKHPIIIVDWSPVNHVDK-QILRATIPIG 119 Query: 118 GRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGW 177 GR+ TLYE+ P + S H F+ LA+++P P++ +DAGFKVPW+K +E+ GW Sbjct: 120 GRAFTLYEEVHPECKLGSLAVHKAFIRRLATMVPKGVIPIVTTDAGFKVPWFKPIEQQGW 179 Query: 178 YWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK 237 YWL RVRG + W + + + LG LTK + CQ+ LY+ +SK Sbjct: 180 YWLGRVRGNSKLRVND--RWCSADEVFVQAQYKPQHLGTAELTKQHQYPCQVCLYRKKSK 237 Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRD 297 GRK + + + + ++ +EPW+L +NLP E +++V +Y++RM IEE FRD Sbjct: 238 GRKAKNWSGSLQRNTVSLSHAKGEREPWLLVSNLPGETWFAERVVALYTQRMSIEEGFRD 297 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 K+ YGL L S ++S +R +I+L+I ++ Q + G A +G+ K FQANT+R R Sbjct: 298 TKNERYGLALNFSGSASPKRIEILLMIGMLTQFALLVVGKVAYLKGYYKDFQANTIRTRR 357 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLT 389 VLS LG E++ Y+ + +D +A L Sbjct: 358 VLSYFFLGKELIGREAYSFSVKDLALAVGGLK 389 >UniRef50_Q6LPG7 Hypothetical transposase n=7 Tax=Photobacterium profundum RepID=Q6LPG7_PHOPR Length = 402 Score = 297 bits (760), Expect = 5e-79, Method: Composition-based stats. Identities = 135/397 (34%), Positives = 211/397 (53%), Gaps = 3/397 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTK--ARTKHNI 58 M +IL+ L + P++H RL +L + + + +++T LGR L + KH+I Sbjct: 1 MKATEILYQDLRSYYPQIHSSRLKTLCTFIESGIKDQRVSVTYLGRGLESGSVTTKKHDI 60 Query: 59 KRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHG 118 KR DRL+GN HLH ER Y + + PI+L+DWS I Q+ +LRAS+ + G Sbjct: 61 KRADRLIGNAHLHCERHDYYEYMTEQLIGREKHPIILIDWSPINGQEIYQLLRASIPMQG 120 Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWY 178 R + LYEK F SE ++KAH FL +L +LP P+I +DA ++ PW+K+VE GWY Sbjct: 121 RGLVLYEKTFHESELNTEKAHQSFLDELEQVLPEGCQPVITTDAIYRSPWFKAVELKGWY 180 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 W+ RVRG+V + + + ++ LG K C+ +L+K KG Sbjct: 181 WIGRVRGQVSLSQDKETWYTSYQWFKAAKVNKAEHLGVLYYGKVAKFKCEGVLFKRNKKG 240 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQL-VNIYSKRMQIEETFRD 297 R ++ + K + A E W+L LP + + V++Y +RMQIEE FRD Sbjct: 241 RSAKKKRGGVSQRTTDKTHEKDANEAWLLVFKLPPRYKNNANIAVSLYRQRMQIEENFRD 300 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 K+ G+ L ++ + S ERFD +LLIA ++ W G A + QAN+++ R Sbjct: 301 TKNGKLGISLEYANSKSVERFDNLLLIAGLILFIIWCVGRAAVMKKIHYSLQANSLKFRA 360 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFT 394 VLST+ +G EV++ YTIT ++ + L++ + Sbjct: 361 VLSTIYIGREVVKDGRYTITIDEYVYVLAHLSELAVS 397 >UniRef50_Q07YD1 Transposase, IS4 family n=6 Tax=Shewanella RepID=Q07YD1_SHEFN Length = 397 Score = 292 bits (747), Expect = 2e-77, Method: Composition-based stats. Identities = 128/393 (32%), Positives = 218/393 (55%), Gaps = 4/393 (1%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +L L P +H R SL A + ++ L++T LGR++ +KA+ KH IKR Sbjct: 1 MNAKQVLSKCLSLVTPLMHKTRRQSLFSAIESSMNGGALSITGLGRDIESKAKEKHKIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 +DRL N +LH++ +Y + PI+ +DWSD+ ++K+ ++RAS+A GRS Sbjct: 61 VDRLCSNPYLHRDIEFIYTRMTCLLVGKMKQPIIHIDWSDLDDRKQHFLIRASLAAQGRS 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 +TLYE+ PL+++ K H FL L ++LP++ P+IV+DAGF++PW+K + L W ++ Sbjct: 121 LTLYEEIHPLNKKEKPKTHLSFLTKLKAMLPNDCKPIIVTDAGFRIPWFKQILSLDWDYV 180 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 R R + +W P+ L+ +S+ +K LG L + +++++K KGRK Sbjct: 181 GRFRNRTHCRKTIVHHWYPVKRLYIQASARAKNLGVYFLGEQASFCSRLVIFKRTDKGRK 240 Query: 241 NQRSTRTHCHH-PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 ++ +T + + KEPW+LAT+L T K++V IY+ RMQIEE+FRD+K Sbjct: 241 DRTATGDRTRRSKQSRSSAEREKEPWLLATSLCHSSATAKRVVKIYATRMQIEESFRDVK 300 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 + GL + S + ++ ++LLIA + Q L G+ + + +QAN++++RNVL Sbjct: 301 T---GLKMNDSGSRIKDKLSVLLLIACLSQFMLNLLGLAVKAADKHRQYQANSIKHRNVL 357 Query: 360 STVRLGMEVLRHSGYTITREDSLVAATLLTQNL 392 S+ +G+ R + + L L + Sbjct: 358 SSQFIGLRAYRDKYLRLLKSHWLAGIKTLQSLI 390 >UniRef50_C6MY57 Putative transposase, IS4 family protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MY57_9GAMM Length = 397 Score = 285 bits (729), Expect = 2e-75, Method: Composition-based stats. Identities = 143/391 (36%), Positives = 224/391 (57%), Gaps = 1/391 (0%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +LH+ L + +H KRL+SL A A + + +T+T LGR L + K+ IK+ Sbjct: 1 MRVEQLLHNHLQKSV-VMHSKRLDSLMCAVTAGMKDRCVTVTGLGRRLRMSIKVKNKIKK 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRL+GN HLH+E ++Y+ I PI++VDWS + + +LRA++ GR+ Sbjct: 60 IDRLVGNSHLHQEIPSIYQCMTGLILGNIRRPIIIVDWSPLGQGTEHQLLRATLPSGGRA 119 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 +TLYE A+P S S+K H +FLA L ILP+ TP+IV+DAGF+ W++ V LGW W+ Sbjct: 120 LTLYESAYPESLLTSRKVHQEFLAKLCQILPAGCTPIIVTDAGFRNTWFEDVSSLGWDWV 179 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVR + Y AE W PI +L+ ++S + +G+ L++ +SC + LYK + KGR Sbjct: 180 GRVRNRTHYLAANAEQWVPIKSLYHHATSRPQYIGHGNLSRRTSVSCGLYLYKKQPKGRV 239 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 + C + + +EPW++AT+L ++++ IY+KR QIE FRD K+ Sbjct: 240 LKTLKGAKCRQATSLKIAQREREPWLIATSLHHNTTLSRKIIKIYAKRAQIENGFRDTKN 299 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 G L S+TS + R +++L+I + WL G +++ FQANT++NRNVLS Sbjct: 300 QRLGFSLNDSKTSHTARLNVLLIIIAIATFGLWLLGGLLKQKQLHFQFQANTIKNRNVLS 359 Query: 361 TVRLGMEVLRHSGYTITREDSLVAATLLTQN 391 V LG +++ +S R D L ++ + Sbjct: 360 NVFLGWQIINNSSPRFKRADWLSVIDSISID 390 >UniRef50_Q6LJK0 Hypothetical transposase n=2 Tax=Vibrionaceae RepID=Q6LJK0_PHOPR Length = 394 Score = 285 bits (728), Expect = 2e-75, Method: Composition-based stats. Identities = 173/356 (48%), Positives = 230/356 (64%), Gaps = 3/356 (0%) Query: 29 ACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSG 88 C L L L GR+LP+KA+TKH IKR+DRLLGN HLH +RL +YRWH CS Sbjct: 28 LCKHCLAMMHLRLLYFGRSLPSKAKTKHCIKRVDRLLGNNHLHHDRLDIYRWHCHQFCSV 87 Query: 89 NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLAS 148 N PIVLVDW+DIRE +RLMVLRAS+A+ GRSVTL+E+ F S ++H QFL D + Sbjct: 88 NPQPIVLVDWADIREYERLMVLRASIAVEGRSVTLFEQTFTFKNYNSPRSHQQFLDDFKA 147 Query: 149 ILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 +LPS+ P+IV+DAGF+ W++ V+ + W +L RVRG V L W+ I L ++ Sbjct: 148 VLPSHVIPIIVTDAGFRNTWFRQVDDMDWCYLGRVRGDVNV--LIKNQWQHIKQLFIKAN 205 Query: 209 SHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILA 268 S K +G+ +L K P+ C + LYK ++ ++ R H + ++ SA EPW+LA Sbjct: 206 SKPKYVGFTQLAKRKPLQCHLHLYKKQTPKKRKDRPKG-REHFSAQAVHKKSALEPWVLA 264 Query: 269 TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALML 328 TNLP +I + + +V +Y+KRMQIEETFRDLKSP YG GLR SRT +RFDI+LLI L+ Sbjct: 265 TNLPTDIFSSRCIVRLYTKRMQIEETFRDLKSPQYGFGLRQSRTHDPKRFDILLLIGLLA 324 Query: 329 QLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVA 384 + W G+ A+ GW +HFQAN+V++R VLS VRLG EV R Y I A Sbjct: 325 FMVYWWFGIIAEHNGWHRHFQANSVKDRRVLSFVRLGKEVFRRLEYHINEPAIRWA 380 >UniRef50_C1DIQ1 Transposase, IS4 n=2 Tax=Azotobacter vinelandii DJ RepID=C1DIQ1_AZOVD Length = 400 Score = 263 bits (671), Expect = 1e-68, Method: Composition-based stats. Identities = 149/403 (36%), Positives = 224/403 (55%), Gaps = 6/403 (1%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M + LH + + P +H +RL +L A ALL + LTLT LGR+LP A +H IKR Sbjct: 1 MQTVQFLHAAFAKALPTIHARRLEALMAAVAALLQGRCLTLTALGRSLPGSAWPRHAIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 IDRLLGNR L ER Y + P++LVDWS I +L +LRA++ L GRS Sbjct: 61 IDRLLGNRQLQAERGLFYWVMLRALLGSFRHPLILVDWSPIDAAGKLFLLRAALPLAGRS 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 + + E P + + L LA++LP++ P++V+DAGF+ PW+++VE GW+++ Sbjct: 121 LPVCEVVHPREG--CPRCQKRLLEALAAMLPADCRPVLVTDAGFQRPWFQAVEIRGWHYV 178 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVR LG + W P+ +L+ ++S+ K LG +T+S P S Q+ + K +GR+ Sbjct: 179 GRVR-NRDLCRLGEQPWGPVKSLYALASASPKRLGCVEMTRSAPWSTQLCVVKHAPRGRQ 237 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 ++R T T + + EPW+LA+NLP Q+V IY +R QIEE FRDLKS Sbjct: 238 HRRITGTLARDKRSRQSAQRESEPWLLASNLPEAQWNAAQVVAIYRRRTQIEEGFRDLKS 297 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLT---CWLAGVHAQKQGWDKHFQANTVRNRN 357 G+GL R+ R +I+LLIA++ L G+ A++ G ++ FQ+N+++ + Sbjct: 298 HRLGIGLGLHRSRCPRRIEILLLIAVLANYALCLLGLLGLQAREAGHERRFQSNSLKCKR 357 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG 400 VLS RLG+E R I+RE L + + LG Sbjct: 358 VLSLWRLGLEYARTGVGAISRETLRNLELALRREVHRQAQELG 400 >UniRef50_D0I6N0 Transposase IS4 n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0I6N0_VIBHO Length = 345 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 145/318 (45%), Positives = 204/318 (64%), Gaps = 5/318 (1%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ IL ++L P +H KRL SL LA + L LTLT+LGR+L T KH IKR Sbjct: 1 MRDIQILQETLTNHYPTIHKKRLQSLLLATESALGGADLTLTKLGRSLNTFTAAKHAIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 +DRLLGN LH+E+ +Y+W+A I N P++L+DWSD+REQ R M LRAS+AL GR+ Sbjct: 61 VDRLLGNTRLHREKEDIYKWNARLIAGANPCPVILLDWSDVREQLRFMTLRASIALDGRA 120 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 VTLYE+AF ++ S K H FL L ILP + TP+I+SDAGF+ W++ V+ GW+WL Sbjct: 121 VTLYEQAFEYAQYNSPKTHQYFLGKLQEILPPSATPIIISDAGFRNTWFRQVQSKGWFWL 180 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 RVRG V + +W+ L+ ++S +LG +L + +P++C + K +K Sbjct: 181 GRVRGDVSI-KMTQSDWQSNKTLYPDATSKPHSLGQCQLARRSPLTCNGYVVK----QQK 235 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 QR +RT H + ++++ +A EPW+L TN+P E Q+ +Y+KRMQIEE FRDLKS Sbjct: 236 AQRHSRTGQKHTASRLFAKNANEPWLLVTNIPTETLNAVQICRLYAKRMQIEEAFRDLKS 295 Query: 301 PAYGLGLRHSRTSSSERF 318 AYGL LRH+RT + R Sbjct: 296 TAYGLALRHNRTHHNRRL 313 >UniRef50_Q17U39 Transposase n=11 Tax=Gammaproteobacteria RepID=Q17U39_ECOLX Length = 394 Score = 259 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 133/402 (33%), Positives = 202/402 (50%), Gaps = 16/402 (3%) Query: 1 MCELDILHDSLYQFCPE-LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIK 59 M +L D L P+ +H R + L A AL T+T +GR +P + IK Sbjct: 4 MNVKAMLADFLTFVTPKSMHKARFSVLLDAVTALAKDACCTVTAIGRAMPGSSDKVS-IK 62 Query: 60 RIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGR 119 R DRLL N +L +E +Y + I T P++LVDWS+ KR +LRAS+A GR Sbjct: 63 RADRLLNNPNLQRELPLIYAALTASIVGHKTKPMILVDWSNADTAKRHFILRASIAADGR 122 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYW 179 ++TL +K + H FL L ++LP + P+IV+DAGFKVPW K V KLGW++ Sbjct: 123 ALTLLQKIAAAEDYTCPHLHGAFLKQLKAMLPKDCKPVIVTDAGFKVPWLKQVRKLGWHY 182 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 ++RVRG V+ + + ++ L+ + K++G L ++ Q +L KG Sbjct: 183 VARVRGNVKLKLAEQDKFISVNQLYRQAKKDPKSVGKIMLAQTQHYETQAVLV---GKGY 239 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 K + + + KEPW+L ++L ++ YS RMQIEE+FRD K Sbjct: 240 KLLKRDKN-----------KTYKEPWLLVSSLADCHGYADKIAKCYSSRMQIEESFRDQK 288 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 S YGLG T R +I+LL+A ++ +L G A+K G +QANTV+NR VL Sbjct: 289 SHRYGLGSDLHGTKKKSRLEILLLLAALVNWFHYLLGSAAEKAGLHLRYQANTVKNRRVL 348 Query: 360 STVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLGK 401 + LG+ + + I R+ + Q + + + K Sbjct: 349 ALNFLGILLCKEPKQRIRRQYYQQGLKQILQWVVQWDWAVIK 390 >UniRef50_Q5X8W8 Putative uncharacterized protein n=1 Tax=Legionella pneumophila str. Paris RepID=Q5X8W8_LEGPA Length = 398 Score = 246 bits (627), Expect = 1e-63, Method: Composition-based stats. Identities = 107/393 (27%), Positives = 189/393 (48%), Gaps = 10/393 (2%) Query: 1 MCELDILHDSLYQFCPE-LHLKRLNSLTLACHALLDC-KTLTLTELGRNLPTKARTKHNI 58 M + + L +H KR L L+D TL++TE+G+ L +K K I Sbjct: 1 MHKKIFCQNLLDSALKNSIHAKRKQCLVRFLSDLMDYDTTLSVTEIGKKLTSKTTVKSKI 60 Query: 59 KRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHG 118 + N L ++ + +Y+ F S +VL+DW+ + VL AS+A HG Sbjct: 61 YAAQTFVNNFKLERDIVCIYKSLTHFFWSHAKEIVVLIDWTGGCSEGYH-VLEASIAAHG 119 Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWY 178 RS+ +Y + SEQ + + H QFL L ++PS+ + I++DAGF W++ V +LGW Sbjct: 120 RSIPIYHEVHSESEQENAEIHRQFLLRLKEVIPSSLSVTIITDAGFHREWFQQVLELGWD 179 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNP-ISCQILLYKSRSK 237 + R+ Y G NW + ++ + LG +L K+ + + YK + Sbjct: 180 VIGRIYSLYCYQIEGETNWHKVKDILFEGIGKASALGKVKLGKTKKAVEGYLYTYKEKLS 239 Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRD 297 G+ ++ + H + +S K W+L ++L + LV+ Y KRMQIE+ F+D Sbjct: 240 GKVRKKKNKYPSHDKA---HSNYYKNGWVLFSSLNKH---ARFLVSYYKKRMQIEQNFKD 293 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 +K+ G+G R +++S R +++ +A++L + W G+ + + +QANT++N+ Sbjct: 294 IKNEQLGMGFRRNQSSGKTRVNMLFFLAVLLIMIAWWFGLMIESLNKHRSYQANTIKNKR 353 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLTQ 390 V S + L RH + + + L Q Sbjct: 354 VRSFIHLARMAYRHEPELLNWDLFQYIMSDLKQ 386 >UniRef50_Q2NZH2 ISXoo8 transposase n=73 Tax=Xanthomonas RepID=Q2NZH2_XANOM Length = 407 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 125/394 (31%), Positives = 203/394 (51%), Gaps = 8/394 (2%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++L L +H R +L A AL+ LTL +L R P R + +K Sbjct: 4 MRASEVLQKCLSNSLSGMHALRQRTLLRAVEALVHGGRLTLIDLARAWPGATRVRAPLKA 63 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 DRLL NR L ER A+ + A ++ G P++++DWSD++ K +LRA+V + GR+ Sbjct: 64 CDRLLCNRTLQVERSAIEQDMAHWLLRG-DQPVIVIDWSDLKPDKSWCLLRAAVPVGGRT 122 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWL 180 +TL + +Q S A +FL L +++P + P++V+DAGF+ PW+++V +GW W+ Sbjct: 123 LTLLDMVVSRKQQGSPGAEKRFLQQLRALIPDDVRPILVTDAGFRTPWFRAVSAMGWDWV 182 Query: 181 SRVRGKVQYADLG----AENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 R+RG+ Q A W LH ++S+ ++ L + +S+P+ C+++LY Sbjct: 183 GRLRGRTQVKPQDVPDDAVQWIDSRRLHALASNRARALPPMQANRSDPLDCRLVLYAKTR 242 Query: 237 KGRKNQR--STRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEET 294 +GR+ + S+ S +A +EPW++ + + + KQLVN+Y++RMQIE Sbjct: 243 QGRQQRNRRSSAKVSRASSSLKAAAREREPWLIVASPQLHAPSAKQLVNLYARRMQIELA 302 Query: 295 FRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVR 354 FRDLKS YG + S T ER I+LL+ + WLAG+ + G + + Sbjct: 303 FRDLKSHRYGQAMEDSLTRRGERLQILLLLNTLATFASWLAGLGCEATGIAQWLSPRSS- 361 Query: 355 NRNVLSTVRLGMEVLRHSGYTITREDSLVAATLL 388 R + ST+R+G E L L L Sbjct: 362 TRKLYSTLRVGREALVRCWPMEPVSRWLERLRAL 395 >UniRef50_B2JAE4 Transposase, IS4 family protein n=8 Tax=Cyanobacteria RepID=B2JAE4_NOSP7 Length = 448 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. Identities = 77/392 (19%), Positives = 145/392 (36%), Gaps = 43/392 (10%) Query: 15 CPELHLKRLNSLT---------LACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLL 65 PEL+ K L SL L + + + K + L ++ +LP + + K++ R L Sbjct: 2 LPELYQKHLQSLLSQSELIFLTLVINVVQNIKDVKLEKISESLPLFIQCQSRRKKLQRFL 61 Query: 66 GNRHLHKE--RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTL 123 L+ E + + I GN + +D ++ + + +L SV R++ + Sbjct: 62 LLPILNIEELWFPIIERWLAQIFLGNHRIYLAIDRTNWKRK---NLLMISVIFQKRAIPI 118 Query: 124 YEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRV 183 Y K + + L + + + T ++ V K +++ G+ + R+ Sbjct: 119 YFKLLAKLGSSNLSEQTKALTKIIPLFKNYKTVVLGDREFCSVSLAKWLDEQGFEFCLRL 178 Query: 184 RGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQR 243 + +L A W I +L + S + +TK+ + + K +KN R Sbjct: 179 KKNENI-ELKAHLWCEIKDL-GLKPGTSFFVSDATVTKTKQVKG----FNVACKWKKNYR 232 Query: 244 STRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAY 303 + AKE W + TN+ +I + Y KR IEE FRD KS Y Sbjct: 233 QNK--------------AKEGWFILTNMNSKIT----AIQAYQKRFDIEEMFRDFKSGGY 274 Query: 304 GLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR--NVLST 361 L +RF ++LI + L G + + +G K+ R S Sbjct: 275 NL---EKTNVEGKRFIALVLIISLADTIATLQGQNIKSKGIAKYLARPKEYGRSHRRHSN 331 Query: 362 VRLGMEVLRHSGYTITREDSLVAATLLTQNLF 393 +G+ + + L+++ Sbjct: 332 FYIGLYAQNWVNFIGDCWSLVQDLMRLSRHKL 363 >UniRef50_C7QY62 Transposase IS4 family protein n=9 Tax=Cyanothece RepID=C7QY62_CYAP0 Length = 365 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 76/388 (19%), Positives = 145/388 (37%), Gaps = 42/388 (10%) Query: 15 CPELHLKRLNS---------LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLL 65 PEL+ L L++ + L + L EL P + + IK++ R L Sbjct: 4 LPELYSNHLKKHLDNHQYLMLSILVNLLQSLHLVRLEELANRFPHPIQLRSRIKKLQRFL 63 Query: 66 GNRHLHKE--RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTL 123 + E + + + +++D S RE + ++ S+ + R++ L Sbjct: 64 SLPQFNLETLWIPIIESWIKQEWKRGEIIYLVIDRSQWRE---INLIFVSLIYNHRAIPL 120 Query: 124 YEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRV 183 P + + L + S L L+ V K + + + S Sbjct: 121 CVDWLPKKGNSNLEQQKAILEVILSRLKDYKIVLLGDREFCGVDLAKWLSEAKEVYFSLR 180 Query: 184 RGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQR 243 K +YA+L + W L D+ + ++ Y+ + + K++ G N Sbjct: 181 LKKNEYAELAPQIWF---QLKDLGLNPGMSVYYRG----------VKITKTKGFGEVNLA 227 Query: 244 STRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAY 303 + + SAKEPW++ TNL + Q ++ YSKRM IEE FRD K Y Sbjct: 228 AKWKRNYQ------GKSAKEPWLIMTNLE----SLSQAMSAYSKRMGIEEMFRDFKRGGY 277 Query: 304 GLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR--NVLST 361 L ++ + ER ++L+ + +G +++G K+ T +R S+ Sbjct: 278 --QLEGTQVTK-ERLISLVLLICLAYCWSTFSGQSLKRKGVAKYVSRPTSGHRSHRQHSS 334 Query: 362 VRLGMEVLRHSGYTITREDSLVAATLLT 389 +G+ +D++ + Sbjct: 335 FYIGLHGQNWLDSLTFFQDAMQQLMSFS 362 >UniRef50_Q5ZXB3 ORF2 transposase n=9 Tax=Legionella RepID=Q5ZXB3_LEGPH Length = 361 Score = 195 bits (494), Expect = 3e-48, Method: Composition-based stats. Identities = 57/373 (15%), Positives = 124/373 (33%), Gaps = 45/373 (12%) Query: 4 LDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDR 63 + L D L + + R+ +L+ +T+ LTE+ + A+ RI R Sbjct: 3 ITELSDILNGYFSW-NKSRIECFATMLISLIKVRTVNLTEIACGFSSPAKQDSRYTRIKR 61 Query: 64 LLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDI-REQKRLMVLRASVALHGRSVT 122 R + +V W + +D ++ +K + +L SV G ++ Sbjct: 62 FF--REFKIDFSSVSVWVIHCFGLSGQALYLSMDRTNWRWGKKDINILMLSVVYKGIAIP 119 Query: 123 LYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLS 181 L+ + + + + + +++D F W+ + + Sbjct: 120 LFWTLLAKGGNSDTRERIEIVQRFITKFGKSMIAGLLADREFVGDNWFAWLLTEKIPFCI 179 Query: 182 RVRGKVQYADLGAENWKPISNLHDMSSSHSKTL-GYKRLTKSNPISCQILLYKSRSKGRK 240 R++ V + + +D+ S + L G ++L + + L Sbjct: 180 RIKNNVITTNSRGLEVSIDALFYDLKSGEQRILQGLRKLWRQKIYLSALRL--------- 230 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 + E I+AT+ ++ + Y+ R +IE F LK Sbjct: 231 -------------------ADGELLIVATDHLMDEP-----IEHYALRWEIETLFSCLK- 265 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 G + + +R + +L++ + G Q K + R +S Sbjct: 266 -GRGFNFEDTHMTQPDRIEKLLVLLTIAFCWAHKTGEWRHVQKAIKIKK----HGRKGVS 320 Query: 361 TVRLGMEVLRHSG 373 R G+++LR + Sbjct: 321 FFRYGLDLLRDAA 333 >UniRef50_D2SUD5 Transposase n=1 Tax=uncultured bacterium psy1 RepID=D2SUD5_9BACT Length = 367 Score = 195 bits (494), Expect = 3e-48, Method: Composition-based stats. Identities = 70/405 (17%), Positives = 130/405 (32%), Gaps = 50/405 (12%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +L +L HL R+ L+ AL KT+ EL A+ + +R Sbjct: 1 MDHTTMLTHTLKLHFGW-HLARIKCLSCLIIALFKVKTVNFAELATAFSGSAKVDSHYRR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGR 119 I R L ++ LA R S + ++ +D ++ + L S+ G Sbjct: 60 IQRFFKEVELKQDTLA--RLVTSLLPYD--QFVLSIDRTNWMLGCFAINFLVLSIVHQGT 115 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWY 178 + ++ P + K + + + S+ + D F W+ + K Sbjct: 116 AFPVFWLLLPKKGNSNTKERIELINQFLDVFGSHKIQYLTGDREFIGQQWFAYLIKHQIE 175 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 + R++ + + + P N +S P+S L R Sbjct: 176 FRLRIKKNMMISRSNG-QFSPAENFF----------------RSLPLSTACQLIDRRWVC 218 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 T ++ + I+ T+ ++ Y+KR +IE F L Sbjct: 219 GHLLWVTGMRL----------ASGDYLIVVTH-----DDSAHTMSDYAKRWKIEVLFESL 263 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNV 358 KS + E +L + + + G + + + R Sbjct: 264 KSRGFNF--EDVNLKDQESLKRLLAVITIAFCWAYHVGAWLNEVKPIRVKK----HQRPA 317 Query: 359 LSTVRLGMEVLRH--SGYTITREDSLVAATLLTQNLFTH--GYVL 399 S R G + +RH R++ TLL +N T GY+ Sbjct: 318 KSVFRYGFDWIRHVLFNPEDKRDELKQVLTLL-KNTITRPKGYIF 361 >UniRef50_A9AVJ1 Transposase IS4 family protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVJ1_HERA2 Length = 378 Score = 186 bits (473), Expect = 8e-46, Method: Composition-based stats. Identities = 83/382 (21%), Positives = 143/382 (37%), Gaps = 41/382 (10%) Query: 9 DSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNR 68 +L+QF P LH +RL + LL +++ L+ + +L + A I RI R L N Sbjct: 21 TTLHQFHPTLHARRLATWAWVIVGLLHARSVHLSAVALHLASDAEAAGRIARIRRWLANP 80 Query: 69 HLHKERLAVYRWHASFICSG--NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEK 126 L + +YR + + + N +++D + K L ++R S++ R++ L + Sbjct: 81 WL--DTQFLYRPLITHVLTAWRNRDITIMIDGCYVNHDK-LQMVRLSLSHCYRAIPLAWQ 137 Query: 127 AFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRG 185 S ++ + L + +L ++D GF W S ++ GW ++ R+ Sbjct: 138 VMSHHGNVSVESCQRMLNRVQQLLIGTRRVTFLADRGFRDWAWAASCQRRGWDYIIRIAN 197 Query: 186 KVQYADLGAENWKPISNLHDMSSSHSKTLGYKR--LTKSNPISCQILLYKSRSKGRKNQR 243 P ++ M+ K++ + LT+ C I + +R+ K Sbjct: 198 TTTIRWDDG----PWMAINTMAVKPGKSVYLRNVLLTQDGEWRCTIAITWTRATKTK--- 250 Query: 244 STRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAY 303 + + WIL N Y +RM IEE+FRD KS Sbjct: 251 ------PAERCAVITNREPSKWIL---------------NHYLRRMHIEESFRDDKSG-- 287 Query: 304 GLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVR 363 G L SR +R D +LL + L + G K H LS + Sbjct: 288 GFDLDASRLRDPQRLDRLLLAIAVATLWMYELGERVLKDEQRAHVDPGYQ---RQLSVFQ 344 Query: 364 LGMEVLRHSGYTITREDSLVAA 385 LG LR + + Sbjct: 345 LGWRWLRRALSLADIPKWNLTL 366 >UniRef50_B4UH67 Transposase IS4 family protein n=3 Tax=Proteobacteria RepID=B4UH67_ANASK Length = 384 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 88/362 (24%), Positives = 158/362 (43%), Gaps = 37/362 (10%) Query: 14 FCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA--RTKHNIKRIDRLLGNRHLH 71 F +LH KR+ SL A +L+ L + +G +L ++KH +K++DR+L N + Sbjct: 19 FAEDLHAKRVASLAGAAVGVLEGAALGIHAIGNSLAVAEGLKSKHAVKQVDRMLSNEGIP 78 Query: 72 KERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 ++ + +V +DW+D E + + + + HGR+ L K S Sbjct: 79 V--WRLFGSWVPCVVGDRLEIVVALDWTDFDEDDQSTIALSMITSHGRATPLLWKTVMKS 136 Query: 132 E-QCSKKAHDQ-FLADLASILPSNTTPLIVSDAGFKVPWYKSV--EKLGWYWLSRVRGKV 187 E + + H+ L +LP +++D GF + ++LG+ ++ R RG V Sbjct: 137 ELKGWRNEHEDVLLERFREVLPEGVKVTVLADRGFGDQALYELLKDQLGFGFIVRFRGVV 196 Query: 188 QYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT 247 + +P + S+ + L R+TKS ++ K Sbjct: 197 KVTSAEG-ETRPAKDWVP-SNGRTLRLRSARVTKSRREIGAVVCVK-------------- 240 Query: 248 HCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGL 307 + KE W LAT+ + ++V +Y++R IEE+FRD K+ +G+GL Sbjct: 241 ----------AKGMKEAWHLATSHGD--KPGSEIVALYARRFTIEESFRDQKNLRFGMGL 288 Query: 308 RHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGME 367 +R + R D +LL++ + + G + G DK + NTV+ R +S +R GM Sbjct: 289 SETRIADPARRDRLLLVSAVAIALLTILGAAGEALGLDKWLKTNTVK-RRTISLLRQGMM 347 Query: 368 VL 369 Sbjct: 348 HY 349 >UniRef50_B4WTK1 Putative uncharacterized protein n=6 Tax=Synechococcus sp. PCC 7335 RepID=B4WTK1_9SYNE Length = 411 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 80/374 (21%), Positives = 138/374 (36%), Gaps = 43/374 (11%) Query: 2 CELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKAR-TKHNIKR 60 D L L Q CP HL L + AL+ +++LT+ LP + + +R Sbjct: 25 RLYDALKAWLGQDCPWAHLSHLTTCCWMVFALIQTGSVSLTKWTTYLPCRGLYAQSKQRR 84 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 + R LGN ++ RL A+ + +D S E + ++R +V GRS Sbjct: 85 VRRWLGNSRINIHRLYKPLIQAALATWEAECLYLCLDTSLFWE--QYCLIRLAVVYRGRS 142 Query: 121 VTLYEKAFPL-SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWY 178 + L + S + +A+++ L LPSN ++++D GF +++LGW+ Sbjct: 143 IPLAWRVLEHNSASVAFEAYEELLRQSTQYLPSNANMILLADRGFVHTRAMTLIKQLGWH 202 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 + R++ G+ +P S L + R+ Sbjct: 203 YRIRIKSDTWIWRPGSGWCQPKS---------------------------FHLERGRALC 235 Query: 239 RKNQRSTRTHCHHPSPKIYSAS--AKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFR 296 + R R + P I + E W + ++ P T Q Y+ R IEE F Sbjct: 236 FHHIRLHRHEQYGPVHVIIGRNNINGELWAVVSDQP----TSPQTFMEYALRFDIEEGFL 291 Query: 297 DLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 D +S + L R + I + L GV + G + + R Sbjct: 292 DDQSAGWNLQRSEIRG--LTDLSRLWFILAVATLYVTAQGVAVVQSGRRRWIDTHWDRGN 349 Query: 357 NVLSTVRLGMEVLR 370 S R+G+E + Sbjct: 350 ---SYFRIGLEWTK 360 >UniRef50_Q1VRR5 Putative uncharacterized protein n=8 Tax=Bacteroidetes RepID=Q1VRR5_9FLAO Length = 372 Score = 186 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 54/371 (14%), Positives = 128/371 (34%), Gaps = 39/371 (10%) Query: 4 LDILHDSLYQFC-PELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRID 62 L L +++L RL ++ AL +T+T +L +++ + +++RI Sbjct: 18 NSELTSVLNTHLQGKINLARLKLISHFVIALCKVQTVTFEKLANAFNSQSDSGSSLRRIQ 77 Query: 63 RLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGRSV 121 R + + L + + + I+ +D ++ + Q + + V G + Sbjct: 78 RFIASYSLDSD---LIALLVFNLLPSRDKLILSIDRTNWKFGQTNINIFMLGVVYKGVAF 134 Query: 122 TLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWL 180 L + + L + + +V+D F W + + + Sbjct: 135 PLLFTMLDKRGNSNSQERIDLLNRFIRLFGKHVIESVVADREFVGKDWLAFLNRNEIRYY 194 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 R+R + S+L + ++ +K + Sbjct: 195 IRIRNNFKVFLPHKNKEIKASHLFNRFKTNEFVYYHKIV--------------------- 233 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 R C+ K+ + K+ +++ + P+ Y KR QIE F+ +KS Sbjct: 234 --RVNGELCYLSGCKLNPKNLKQEFLIIVSFNK----PENAQQDYQKRWQIEMCFKAMKS 287 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 G + + +R + ++L+ ++ + C+ G++ + K + R S Sbjct: 288 S--GFDIEKTHLQDIQRIEKLILLVMIAFVWCYKIGIYLHQINPIKIKK----HGRKAKS 341 Query: 361 TVRLGMEVLRH 371 + G+ L + Sbjct: 342 IFKYGLTFLAN 352 >UniRef50_A7MW84 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7MW84_VIBHB Length = 235 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 82/228 (35%), Positives = 127/228 (55%), Gaps = 2/228 (0%) Query: 162 AGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTK 221 GFKVPW+K +E+ GWYWL RVRG + W + + + LG LTK Sbjct: 2 LGFKVPWFKPIEQQGWYWLGRVRGNSKLRVND--RWCSADEVFVQAQYKPQHLGTAELTK 59 Query: 222 SNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQL 281 + CQ+ LY+ +SKGRK + + + + ++ +EPW+L +NLP E +++ Sbjct: 60 QHQYPCQVCLYRKKSKGRKAKNWSGSLQRNTVSLSHAKGEREPWLLVSNLPGETWFAERV 119 Query: 282 VNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQK 341 V +Y++RM IEE FRD K+ YGL L S ++ +R +I+L+I ++ Q + G A Sbjct: 120 VALYTQRMSIEEGFRDTKNERYGLALNFSGSACPKRIEILLMIGMLTQFALLVVGKVAYL 179 Query: 342 QGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLT 389 +G+ K FQANT+R R VLS LG E++ Y+ + +D +A L Sbjct: 180 KGYYKDFQANTIRTRRVLSYFFLGKELIGREAYSFSVKDLALAVGGLK 227 >UniRef50_D0LPB8 Transposase IS4 family protein n=4 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPB8_HALO1 Length = 418 Score = 183 bits (465), Expect = 7e-45, Method: Composition-based stats. Identities = 93/365 (25%), Positives = 155/365 (42%), Gaps = 37/365 (10%) Query: 14 FCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLP--TKARTKHNIKRIDRLLGNRHLH 71 F +H KR+ SL+ A + L++ +G L +KH IK++DRLL N L Sbjct: 55 FEGNMHSKRVESLSNAVVGVTHASALSVQAIGHGLAVALDKNSKHAIKQVDRLLSNARL- 113 Query: 72 KERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 + V+ ++ S T I+ +DW++ + + + +HGRS L K S Sbjct: 114 -DPWQVFSVWVPYVLSERTEAIIALDWTEFAKDGQSTCAAHLMTMHGRSTALAWKTVEKS 172 Query: 132 EQCSKKA--HDQFLADLASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQ 188 + ++ D+ + L I+P + +++D GF + + LGW ++ R R + Sbjct: 173 QLRGQQTAVEDEVIDHLHRIIPPDIEVTLLADRGFAAAERFIHLTTLGWNYVIRFRENIH 232 Query: 189 YADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTH 248 + G +P + S K L ++ P+ + ++K Sbjct: 233 ISHQGQT--QPARDWVPKSGRAKKLLDVGITCRAEPLEAVVCVHK--------------- 275 Query: 249 CHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLR 308 A K+ W LATNL + +V +Y++R IEETFRD K +GLGL Sbjct: 276 ----------AQMKQAWCLATNLVDA--SASHVVKLYARRFTIEETFRDQKDLRFGLGLS 323 Query: 309 HSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEV 368 + R D +LL++ + L G A+ G+D+ +ANTVR R S R G Sbjct: 324 ATHIRDCGRRDRLLLLSAIAHALLTLLGAAAESIGFDRMMKANTVR-RRTHSLFRQGCYW 382 Query: 369 LRHSG 373 Sbjct: 383 FWRMP 387 >UniRef50_B1XQT5 Tn10-like transposase (IS4 family) n=14 Tax=Cyanobacteria RepID=B1XQT5_SYNP2 Length = 366 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 62/376 (16%), Positives = 118/376 (31%), Gaps = 36/376 (9%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ + L H RL+ + L AL KT+ L +L A + N KR Sbjct: 1 MNQISEIRRQLRPHLGW-HGARLSFIALFLVALFRAKTVNLAKLATVWGGNAAEESNYKR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGR 119 + R + ++ +++A I + ++ +D ++ +L V G Sbjct: 60 MQRFFQSFDVNMDKIA---RMVMNIAAIPQPWVLSIDRTNWSLGTTDFNILMLCVVHEGI 116 Query: 120 SVTLYEKAFPLS-EQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSV-EKLG 176 L + L ++ P+ + D F PW + Sbjct: 117 GYPLMWTMLKKKRGNSNSTERMDLLERFETLFPNIEIAYLTGDREFIGKPWLSYLMLDKP 176 Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 + R+R + + + S+L S +G R+ +Y Sbjct: 177 IPFRLRLRQTDKISKGKGQPAIAGSHLF-----RSLAIGETRILSGKRWVWGRQVY---- 227 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFR 296 + + E I+ T P+ + Y +R IE F Sbjct: 228 ---------VMGTRLDPKRRAHKNEDEFLIIITTHD-----PQNALADYRRRWGIETLFG 273 Query: 297 DLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 LK+ G L + + R +L + + + AG+ Q + +A+ R Sbjct: 274 ALKTR--GFCLESTHFTDKVRLSKLLALLAIGFVWAMQAGLWRHTQKPIRIIKAH---GR 328 Query: 357 NVLSTVRLGMEVLRHS 372 S R ++LR Sbjct: 329 RARSLFRYDFDLLRRF 344 >UniRef50_B0JUB6 Transposase n=18 Tax=Cyanobacteria RepID=B0JUB6_MICAN Length = 382 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. Identities = 60/380 (15%), Positives = 120/380 (31%), Gaps = 36/380 (9%) Query: 17 ELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKE--R 74 EL R L + L K L L LP + K++ R L L+ E Sbjct: 13 ELGRARYLLLLMIVGTLQILKQAKLEILAEALPIPILFESRRKKLKRFLKLEILNIEKIW 72 Query: 75 LAVYRWHA--SFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSE 132 V + + + + +D + + +L S+ R++ +Y + Sbjct: 73 FPVLKEMLKQQQRFTTKGLAYIAIDRTSW---GAINILMVSLIYDKRAMPIYWEILDKKG 129 Query: 133 QCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADL 192 + + + L ++L + ++ V K ++K Y+ R + Sbjct: 130 SSNLEEQQRVLEKTLTVLSGHKIVVLGDREFCSVSLGKWLQKQSLYFCLRQKKSTNVKTK 189 Query: 193 GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHP 252 + + S L + + K + G N + Sbjct: 190 EGI----YQEMRALGLSPGTKLFLNDVN----------ITKEKGFGEFNLAGKWKKTYRG 235 Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 P KEPW + TN + + Y KR IEE FRD KS Y L S+ Sbjct: 236 FPT------KEPWYILTNFGD----LETAIMAYQKRFDIEEMFRDFKSGGY--SLEGSQL 283 Query: 313 SSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV--RNRNVLSTVRLGMEVLR 370 + + ++++ + + L G + G K+ + + S+ +G + Sbjct: 284 A-PKYLSKLIIVIAIAYTSATLQGKKIKDMGIQKYVTRPEKRYKRQRRHSSFYVGQHLYH 342 Query: 371 HSGYTITREDSLVAATLLTQ 390 ++ +++ Sbjct: 343 WLQLHQMFPKNIEELMQISR 362 >UniRef50_B5VUF1 Transposase IS4 family protein n=17 Tax=Arthrospira maxima CS-328 RepID=B5VUF1_SPIMA Length = 382 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 72/394 (18%), Positives = 121/394 (30%), Gaps = 45/394 (11%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M EL+ L D+L P H RLN + L AL KT+ L E+ + N +R Sbjct: 1 MNELNRLRDTLRPHLPW-HGARLNFVCLFLMALFQTKTVNLMEIATVFANPVQISSNYQR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRL-MVLRASVALHGR 119 + R R +R + R+ S I + +D + + +L +V G Sbjct: 60 LQRFF--REFKFDRAEIARFVVSLID-IPQPWTLSLDRTCWSFGQTHFNILMLAVVHEGI 116 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKL-GW 177 + L + ++ P + +D F W + Sbjct: 117 AFPLLWTMLDKKGNSNSGERMDLFDRFEALFPDVEVACLTADREFVGRDWLSYLLIDPEV 176 Query: 178 YWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK 237 + R+R + + + D L + Sbjct: 177 PFRLRIRHSELISPKLGGTRRSGERMFD------------------------SLRPGEFR 212 Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRD 297 +R + + + E IL TN P+ + Y++R IE F Sbjct: 213 QLSGRRWVWGRQVYVIGSRLA-DSGELLILITNAC-----PETALPDYARRWGIENLFGA 266 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 LK+ G L + ER +L + + G+ QG +A+ R Sbjct: 267 LKTR--GFCLESTHFKDPERLSRLLALLSLAFTWAMKVGLWIH-QGSPIPLKAH---GRR 320 Query: 358 VLSTVRLGMEVLRH--SGYTITREDSLVAATLLT 389 S R G + LR S + A LL+ Sbjct: 321 SQSLFRTGFDFLRRTFSNLPLFSGRFHQALQLLS 354 >UniRef50_B5VWL5 Transposase IS4 family protein n=6 Tax=Arthrospira maxima CS-328 RepID=B5VWL5_SPIMA Length = 398 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 65/407 (15%), Positives = 122/407 (29%), Gaps = 53/407 (13%) Query: 6 ILHDSLYQFCP-ELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRL 64 +L + L + +L L L + + L+ L P + ++ I R Sbjct: 1 MLRTLYQKLLRINLSESQAQTLELLVLMLQSYRQVRLSTLANVFPQPIQYSSRLRNIQRF 60 Query: 65 LGNRHLHKE--RLAVYRWHASFICSGNTM------------------PIVLVDWSDIREQ 104 L L + + + + V +D + R++ Sbjct: 61 LKLPQLSAKLLWFPIIKAALKSEFREKHLNREQRRKRSKFRLKTKNYVAVALDRTQWRDR 120 Query: 105 KRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF 164 +L ++ ++ +Y + P S + + L + ++L +I Sbjct: 121 ---NLLMVTIIWGHHALPIYWELLPKLGSSSFREQKRVLGPVLALLKPYPVVVIGDREFH 177 Query: 165 KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNP 224 + G + R + A + +P +L ++ +K +T Sbjct: 178 SAQLADWLRVRGVNVVFRQKKSAFV----ATSCQPGKSLKTQGFKSGESHFFKNVT---- 229 Query: 225 ISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNI 284 L K N H K+PW L T L PK + + Sbjct: 230 ------LQKFAPIHGFNLGVYWQKIHR------GKKVKKPWYLLTTLD----NPKLVKQL 273 Query: 285 YSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGW 344 Y R IE FRD +S Y + S S RF ++L+ L G + Sbjct: 274 YQARWGIEMMFRDCQSGGYNM---ESTRVDSTRFLALVLLITFAYWLATLGGHEWEANHL 330 Query: 345 DKHFQA--NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLT 389 + T N S LG+ S + ++ ++A L Sbjct: 331 VAYLGRSEKTPNNFPHHSIFGLGLSGYAWSQSLVFWQEEMLALMALK 377 >UniRef50_A7NGF0 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NGF0_ROSCS Length = 383 Score = 167 bits (422), Expect = 8e-40, Method: Composition-based stats. Identities = 83/403 (20%), Positives = 141/403 (34%), Gaps = 44/403 (10%) Query: 1 MCELDILHDSLYQFCPEL----HLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKH 56 M D LH + + L H L++ +L ++ L+++ LP A+ + Sbjct: 1 MSACDELHSKMEEPIRPLVDVKHAHHLSNWLWIVCGILLSGSVALSKIALYLPLTAQAEG 60 Query: 57 NIKRIDRLLGNRHLHKERLAVYRWHASFICSGNT--MPIVLVDWSDIREQKRLMVLRASV 114 I RI R L N ++ + YR + G V++D + RL + R S+ Sbjct: 61 RIARIRRWLKN--VYVDVWQFYRPLLEKVLQGWQAAEAAVILDGVMVFG-DRLQIFRLSL 117 Query: 115 ALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL-PSNTTPLIVSDAGF-KVPWYKSV 172 R++ L P S + + A L P + +D GF V W Sbjct: 118 RHGSRAIPLSWVVVPGKGLTSVERLRPLIQRAAEFLAPRVGAVVFPADRGFRDVEWAALC 177 Query: 173 EKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLY 232 ++GW+++ R+ + + +T+S + + Sbjct: 178 LEVGWHYVIRLANNTLITLEDGRRLSIAAPGVP--PGEACYWRNAAITQSQDWPANLSVT 235 Query: 233 KSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIE 292 ++ A + P +LA + + R Q + Y RM IE Sbjct: 236 WTKG----------------------ARGQAPELLA--VMSDRRACNQRLREYGWRMSIE 271 Query: 293 ETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA-- 350 E+FRD KS G L H+R +R + +LL + L G A D QA Sbjct: 272 ESFRDDKSG--GFDLEHTRLQDPQRLERLLLAVAIATLWRHELGEQALH---DHSVQAEL 326 Query: 351 NTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLF 393 + R LS +LG+ LR +T +L+ Sbjct: 327 DPGGKRRELSIFQLGLRFLRRCLLALTTARLPKLRLVLSNLAL 369 >UniRef50_C4YZ17 Transposase, IS4 family protein n=4 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YZ17_9RICK Length = 372 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. Identities = 64/396 (16%), Positives = 135/396 (34%), Gaps = 44/396 (11%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M + +L + F RL L +L++ +++ + + KA+ +R Sbjct: 6 MHNI-LLQSLILNFF--WKRSRLECLAGMIMSLIENCSVSGKNMALGILGKAKHSSRTQR 62 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQK-RLMVLRASVALHGR 119 I R ++ + + V ++ + N I+++D + + K + +L ++ Sbjct: 63 IYRFFRDQIFNYD--QVAKFILNIF--ANDKYIIVLDRTCWKFGKSDINILFLAIVFGKI 118 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWY 178 SV +Y S CS + L + + +++D F W + Sbjct: 119 SVPIYWYPLEHSGACSSWLMEAMLERFINNFGVHKIKYLLADREFMGKEWLNFLTTKQIK 178 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 + VR + A KP+ K+ Y + + + + Sbjct: 179 FAIPVRKDMLIRITNALQTKPV----------GKSFDYVKALEYIEVKGMLW-------- 220 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 H + Y E ++A + +++ + +Y R IE F+ L Sbjct: 221 ----------DHAVTLSAYRNDKNELMVVAASGDIDVS----IFALYKFRWSIERLFKHL 266 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNV 358 KS G + S ++ +RF ++ + + G+ + K A T R V Sbjct: 267 KSG--GFDIEKSHITNPDRFVKLVTVCAIASALIIKNGLIQHEIQPIKIRTAKTNPKRLV 324 Query: 359 LSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFT 394 S G++ LR+ + V +L + T Sbjct: 325 -SFFTYGLDHLRNCLKQASSIAKSVLKRILEYDSIT 359 >UniRef50_B7KME5 Transposase IS4 family protein n=42 Tax=Cyanobacteria RepID=B7KME5_CYAP7 Length = 387 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 57/369 (15%), Positives = 118/369 (31%), Gaps = 40/369 (10%) Query: 37 KTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKE--RLAVYRWHASFICSGNTMPIV 94 K + + LG LP + ++I R L ++ L + + G ++ Sbjct: 34 KQVKIERLGACLPIPILYESRRRKIQRFLKSKKLSLSLFWFPLIKLIIEQEFKGQERLVL 93 Query: 95 LVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNT 154 ++D + + +M+ SV R++ +Y + S + + +L Sbjct: 94 VLDRTQWKSNNIIMI---SVIWRKRALPIYWLILNKKGRSSLSEQQAIIRPILKLLSDWE 150 Query: 155 TPLIVSDAGFKVPWYKSVEKLG------WYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 ++ + +++ Y+ R +G V + G + ++ + L Sbjct: 151 IVILGDREFHGIELAYWLKQQDKKRKNPIYFAFREKGDVNF-KKGKKGYQTMKELCKDPG 209 Query: 209 SHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILA 268 + + + K + G+ N + K+PW + Sbjct: 210 FKAFY-------------SDVEVTKKKGFGKFNLGFYWKRNYKN------YKEKQPWFIL 250 Query: 269 TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALML 328 TNLP + + + Y KR IE F+D K+ Y L + R ++L+ + Sbjct: 251 TNLP----SLNETIKYYRKRSGIEAMFKDCKTGGYNLEGSQANQV---RLTNLILLIAIA 303 Query: 329 QLTCWLAGVHAQKQGWDKHFQANTVRNR--NVLSTVRLGMEVLRHSGYTITREDSLVAAT 386 L G + QG K+ R S +G+ + Sbjct: 304 YTNSALKGKSIKNQGHQKYITRLREARRKNKRHSDFWVGLYGDSWIFAVEFCFHFIQELM 363 Query: 387 LLTQNLFTH 395 L +N T Sbjct: 364 ALNKNKLTF 372 >UniRef50_B0BZT8 Transposase, IS4 family n=21 Tax=Cyanobacteria RepID=B0BZT8_ACAM1 Length = 397 Score = 165 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 57/358 (15%), Positives = 108/358 (30%), Gaps = 49/358 (13%) Query: 35 DCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTM--- 91 + + L L P I+ + R L L RL + ++ Sbjct: 31 SHRQIQLARLASMFPQPIHYSSRIRNLQRFLVLPQLSV-RLLWFPILKHWLSEEFKTGHG 89 Query: 92 ----------------PIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCS 135 ++ VD + + + +MV +V ++ +Y P S S Sbjct: 90 NRAYRRARLKRTIDGYVVMAVDRTQWKGRNLMMV---TVVWGKHALPVYWAPLPKSGSSS 146 Query: 136 KKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAE 195 K + L I ++ + + ++ R + G Sbjct: 147 LKQQLRLLKTALKIFKPYPVVVLADRDFHSPKLALWLSQRQVEFVLRQKKSAYVQLQGEV 206 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPK 255 +++P+ C + K G N + Sbjct: 207 DYQPLKE-------------RGFAPGQKGFLCDVYWGKRDQLGPFNLAFYWKRQYR---- 249 Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 K+PW + T+LP T +Q +++Y+ R IE F+D KS Y L ++ + Sbjct: 250 --GKGGKDPWFIMTSLP----TLEQALSLYACRWGIEMMFKDCKSGGYN--LERTKVND- 300 Query: 316 ERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSG 373 RF ++L+ M LAG +K + + +R G + H Sbjct: 301 ARFLALVLVMAMAYCLATLAGYGLKKLKVNHYVARLNEHSRRRPRHSDFGTALYGHLW 358 >UniRef50_C7RIL9 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RIL9_9PROT Length = 352 Score = 160 bits (404), Expect = 1e-37, Method: Composition-based stats. Identities = 66/338 (19%), Positives = 119/338 (35%), Gaps = 40/338 (11%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR-IDR 63 D + L + P + L L LA A++ +T EL L ++ + R Sbjct: 9 DEVRFGLEEALPGMRKTILKKLPLAVAAMIQARTPNTMELSTLLALNTERADMREQWLRR 68 Query: 64 LLGNRHLHKERL--AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSV 121 LL NR + + R +G ++ +D +D+ R VL SV RS+ Sbjct: 69 LLTNRLIRSAGVLEPFARRALEQAAAGGQTILLSMDQTDL--GDRFAVLMISVRRGDRSL 126 Query: 122 TLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWL 180 L + L ++ + LP ++++D + V ++ + GW + Sbjct: 127 PLVWRIEEGEANIGFAGQQVLLEEVRAWLPEGAAVMLLADRFYPSVALFEWLLATGWQYR 186 Query: 181 SRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK 240 R++G + +G + + ++ + R + Sbjct: 187 LRLKGNLLVD-----------------------VGCAGIGTTGELAAGV-----RERYEA 218 Query: 241 NQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 N R ++ EPWI+A + P + V Y R IE F D KS Sbjct: 219 NARLFEAGIPMAIGVLHEPGHPEPWIIAMDCPPN----RAAVRDYGARWAIEPMFSDFKS 274 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVH 338 G L ++ + +R D ++LI + C AG Sbjct: 275 R--GFRLEDTKLEAPKRLDCLILIMALAMYWCVQAGQE 310 >UniRef50_Q1IXF5 Transposase, IS4 n=6 Tax=Bacteria RepID=Q1IXF5_DEIGD Length = 352 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. Identities = 57/365 (15%), Positives = 117/365 (32%), Gaps = 49/365 (13%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLG 66 L L + P L +RL L+ A++ +++ L +L + + +R+ R + Sbjct: 13 LTALLAEHFP-LDPRRLTVLSALILAVIQARSVVLYQLVQIVDLPGSNDTVYQRLKRFV- 70 Query: 67 NRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGRSVTLYE 125 L V R+ + + ++++D ++ + Q+ + +L SV S L Sbjct: 71 --QFALPDLLVARFVLAHL-RDEQHLLLVLDRTNWKLGQQDINILLLSVRWQTFSFPLVW 127 Query: 126 KAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRG 185 P S + + L +L T L W+ ++ ++ + R+R Sbjct: 128 TLLPHSGNSNMATRIALVERLLPLLQGKTLFLAADREFVGGEWFVALRRMSLSPVIRLRA 187 Query: 186 KVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRST 245 G+ W L G Sbjct: 188 DSMV--EGSPVWVRFKKLKP--------------------------------GEVRVWYK 213 Query: 246 RTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGL 305 TH + + ++ + L ++ + Y+ R E + LKS G Sbjct: 214 PTHVYGVTLRVLACQNVHGQTLFLAYQGH---AEKALKRYALRWTAENMHQALKSR--GF 268 Query: 306 GLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLG 365 L + + R +L + + + C L G Q++ + + S R G Sbjct: 269 FLESTHLTDPSRVSTLLAVVALAFVWCCLVGEFEQQRDPSRCLRHGYPPK----SLFRRG 324 Query: 366 MEVLR 370 ++ LR Sbjct: 325 LDALR 329 >UniRef50_Q72IB6 Transposase n=3 Tax=Thermus thermophilus HB27 RepID=Q72IB6_THET2 Length = 365 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. Identities = 90/394 (22%), Positives = 141/394 (35%), Gaps = 48/394 (12%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCK-TLTLTELGRNLPTKARTKHNIKRIDR 63 ++ +++ L ++L L LL TL++L R P + + R+ R Sbjct: 9 QVITLWVHKAFASLRKTIRSNLALFLSTLLTAPLDPTLSDLARRTPLPTLAQSRLNRLWR 68 Query: 64 LLGNRHLHKERL---AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 L + L A+ A +P++ VDW+ E R L A++ L GR+ Sbjct: 69 FLHHPTLQNPWALTEALLPLLARRFPKDRPLPLI-VDWT-FAEDGRHQALVAALPLKGRA 126 Query: 121 VTLYEKAFPLSEQCSKK-AHDQFLADL-ASILPSNTTPLIVSDAGFK-VPWYKSVEKLGW 177 + + PLS S+ ++FL L ++ TPL + D GF V + ++ G Sbjct: 127 LVVAFALHPLSPFPSQNRVEEEFLHRLGRAVQDLGYTPLFLLDRGFDRVSLMRKLQGWGM 186 Query: 178 YWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK 237 +L R+R + G + P+ + + + +L+Y Sbjct: 187 GFLIRLRQNREVEPRGGKR-LPLKEGYRRVVHPLREEVRLFGHGGEEVEVTLLVY----- 240 Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRD 297 +EPW LA + P + P Y RM IEE FRD Sbjct: 241 ---------------------PGGREPWYLAYSGPFGGKPP------YGWRMWIEEGFRD 273 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 LK +GL RT +S R + LL M L L G Q + W A+ R Sbjct: 274 LKGQGFGLDRHRLRTGASLRGWLWLLALGMALLI--LLGARLQGREWLPRLLAHPERQ-- 329 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLTQN 391 S RLG L LL + Sbjct: 330 --SLFRLGRIALAQGPPPWREAVVEELIRLLQEL 361 >UniRef50_A5UQG7 Transposase, IS4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UQG7_ROSS1 Length = 372 Score = 156 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 65/382 (17%), Positives = 129/382 (33%), Gaps = 44/382 (11%) Query: 2 CELDILHDSLYQFCPELH---LKRLNSLTLACHALLDCKTLTLTELGRNLPT-KARTKHN 57 + L Q P++ + L +L L ++ + L ++ P +A + Sbjct: 6 RRYRAIAQCLLQLYPQVGGHQRRHLATLALLICGIVGSQHTQLPKVVERTPGGRAADESV 65 Query: 58 IKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALH 117 + R R L + ++ +R + A G + ++D S + M L SV Sbjct: 66 VMRFRRWLKHDNVTYKRWMLPVAQALIAMLGRRPLVFVIDGSTVGRG--CMCLMISVLYQ 123 Query: 118 GRSVTLYEKAFPLSEQCSKKA-HDQFLADLASILPSNTTPLIVSDAGFK-VPWYKSVEKL 175 R++ + + +A H L LA ++P+ + I+ D + W ++ Sbjct: 124 RRALPITWLVVKARKGHLPEALHCALLEQLAQLVPAEASVTILGDGEYDGADWQAAITAR 183 Query: 176 GWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 235 GW ++ R + A L D++ + + +++ + + Sbjct: 184 GWKYVCRTASNILLTLAEATI-----ALGDLAPKRGEVIAVEQVCITAAQYGPV------ 232 Query: 236 SKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETF 295 ++ A+ + P L T + +Y +R QIE F Sbjct: 233 ----------------NVLAVWEAAYEHPIHLVTTHADVAY----ALALYRRRAQIETYF 272 Query: 296 RDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN 355 D KS G + S S R +L+ + L GV A++ + Sbjct: 273 SDQKSR--GFRINRSHISDPTRLARLLIATALAYLWVVYLGVVARRDALRGRIHR---PD 327 Query: 356 RNVLSTVRLGMEVLRHSGYTIT 377 R LS LG+ +L + Sbjct: 328 RCDLSLFSLGLRLLAYCLRHRR 349 >UniRef50_C1XLC5 Transposase family protein n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XLC5_MEIRU Length = 354 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 65/396 (16%), Positives = 127/396 (32%), Gaps = 51/396 (12%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M L + L H RL+ L+ AL+ +++ L ++ L N +R Sbjct: 1 MHHHSELIEVLRPHLKW-HRARLDFLSAFVLALIRVRSVNLAQIALALNPWVHIASNYRR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTM-PIVLVDWSDIR-EQKRLMVLRASVALHG 118 R L L +E + R + ++ +D ++ Q + +L VA G Sbjct: 60 CQRFLAEFRLQQEV--IGRLILKLLPQDPAHKLVLSLDRTEWTLGQASINLLFIGVAHQG 117 Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWY-KSVEKLGW 177 + L + + + L L + LP + +D F + + + Sbjct: 118 VAYPLVWCFLGKAGSSNLQERLGLLRRLLTFLPKERIQSLCADREFACTGFLRYLRWQQL 177 Query: 178 YWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK 237 + R++ + G +P+ L + L Y Sbjct: 178 PYTLRIKAGNRVTYKG--RSRPVQQLF-------RHLDYGA--------------WEALP 214 Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRD 297 T+ + E +L T E + Y++R +IE F+ Sbjct: 215 KPVKLWGQPTYLMGSRLRK-----GEYLLLITEAEPERAPAR-----YARRWEIETLFKA 264 Query: 298 LKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN-- 355 KS G + + +ER + ++ + + + G W Q ++N Sbjct: 265 CKSQ--GFDFESTHLTRAERIESLVALMSIALVWAHRVG------EWRLQTQPIPIKNHA 316 Query: 356 RNVLSTVRLGMEVLRH--SGYTITREDSLVAATLLT 389 R + ST R G++ LR + + LL+ Sbjct: 317 RKLYSTFRYGLDYLRQLLFAPEARKAELYACVRLLS 352 >UniRef50_UPI000038476B hypothetical protein Magn03010330 n=1 Tax=Magnetospirillum magnetotacticum MS-1 RepID=UPI000038476B Length = 333 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 58/329 (17%), Positives = 115/329 (34%), Gaps = 42/329 (12%) Query: 15 CPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR-IDRLLGNRHLHKE 73 P + K+ L L +LD ++ L ++ +LP +A + I R+LGN + + Sbjct: 20 LPRQNKKQREGLALLAATMLDVRSANLMDVAASLPRQAERLDMRYQWISRVLGNALIDVD 79 Query: 74 RL--AVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 + R + ++++D + E ++ +V+ + RS+ L + Sbjct: 80 EVMAPYVRDILGRLVGDGRRLVLIIDQTQANEVQQAVVVAV--RVGERSLPLAWRVKKTQ 137 Query: 132 EQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYA 190 + L +A +LP P+++ D + P W W R++ + Sbjct: 138 GAIGFAEQREALEVVAGLLPEGVRPVLMGDRFYGSPDLIAWCRTQSWDWRLRLKQDLLVF 197 Query: 191 DLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH 250 + G E+ + + G + LT T Sbjct: 198 EDGGES----------TLAECFARGERMLT--------------------GVELTGKRVP 227 Query: 251 HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHS 310 ++ A EPWI+A + + Y R IE F D K+ +G L S Sbjct: 228 TNVAMVHEAGHPEPWIIALSEAPTVHRAFD----YGLRWGIEAMFSDFKTRGFG--LEDS 281 Query: 311 RTSSSERFDIMLLIALMLQLTCWLAGVHA 339 ++R D ++++ + G+ A Sbjct: 282 HIQRADRMDRLIMVMALALFWAVSTGMWA 310 >UniRef50_A8ZRP2 Transposase IS4 family protein n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=A8ZRP2_DEIGD Length = 398 Score = 149 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 68/371 (18%), Positives = 132/371 (35%), Gaps = 50/371 (13%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKAR-TKHNIKRIDRLL 65 L L+ H +L LL + L ++ ++A + +R+ R L Sbjct: 21 LQTGLWNDVRNAH-----TLAWMVTGLLLSQCSFLPAWLPHIHSRATFAQSTERRLRRWL 75 Query: 66 GNRHLHKERLAVYRWHASFICS--GNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTL 123 N + + A+Y + G I+ +D S + E + ++R SV GR+V L Sbjct: 76 ENPAI--DPTAIYGPLVTRALRDWGGHTLILALDTSRLFE--KFCLIRVSVLFRGRAVPL 131 Query: 124 YEKAFPL-SEQCSKKAHDQFLADLASILP--SNTTPLIVSDAGF-KVPWYKSVEKLGWYW 179 + S Q S LA++ +L +++D GF + GW++ Sbjct: 132 VSRVLEHPSAQVSTAQLLPVLAEVKGLLDFLGQPEVRLLADRGFCDTQLMAWLRVCGWHF 191 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 R++ + A + + + ++ ++ LT + + L + Sbjct: 192 RIRIKSSLILAAPDGQRLCKVGEV-RLAPRETRYFHNVTLTGQHFGPVHVALGRPM---- 246 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 E W + ++ P T + Y +R QIEE F D K Sbjct: 247 --------------------DGPELWQVVSDEP----TSIETFAEYGERFQIEEGFLDEK 282 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 S +G L SR + + ++L+ + L GV G + + L Sbjct: 283 SGLFG--LEDSRLRDAASLERLILVLTVATLLLVSEGVQIVHCGDRRVVDPHWQ---RAL 337 Query: 360 STVRLGMEVLR 370 S +++G+ ++ Sbjct: 338 SYLKIGLRAVQ 348 >UniRef50_B5K928 Transposase, IS4 n=23 Tax=Alphaproteobacteria RepID=B5K928_9RHOB Length = 364 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 49/356 (13%), Positives = 113/356 (31%), Gaps = 41/356 (11%) Query: 18 LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAV 77 ++L ++ L +++ +T+ L+ L PT ++ + +R+ R + L + A Sbjct: 24 IYLNLFKTMCLLIMGMVNARTVNLSHLACEFPTDSKVESTYRRLQRFFQHVDLGSDWAA- 82 Query: 78 YRWHASFICSGNTMPIVLVDWSDIREQKRL-MVLRASVALHGRSVTLYEKAFPLSEQCSK 136 + + +D ++ + +R L ++ + L + Sbjct: 83 --PLLVEMIGSGPTWHLCLDRTNWKIGQRHVNFLVLALVTRRHRIPLMWSVLGRAGNSDT 140 Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAE 195 + S+ +T +++D F W + K ++ R++ Sbjct: 141 AQRIALMKRYLSVFEVSTIKFLLADREFIGAQWLDFLHKNNVPFVIRIKAN--------- 191 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPK 255 L + K+ +S + + + + + Sbjct: 192 -----------------QLVTTQDGKTQNLSTLLRTCRGK-RNFDARFGGNNLGEATWFS 233 Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 + K +L V R + + Y KR IE F D K+ G L +R + S Sbjct: 234 FAAKRIKGGELLI---VVSNRPAHRALATYKKRWAIESLFGDTKTR--GFNLEDTRLTIS 288 Query: 316 ERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRH 371 ++ +++L + + G K + S R+G + LR Sbjct: 289 KKLELLLGLVALAVAWASKTATKLIGGGKMKRKKHGYF----AKSFFRIGFDQLRK 340 >UniRef50_A9AUQ0 Transposase IS4 family protein n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AUQ0_HERA2 Length = 414 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 76/413 (18%), Positives = 140/413 (33%), Gaps = 53/413 (12%) Query: 3 ELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGR--NLPTKARTKHNIKR 60 IL ++ P +RL ++ L+ T + G L T A + +R Sbjct: 36 INTILRTAVPTLSPWT-ARRLTDWLVSIL-LMPSITTRVVAWGCALGLSTAAHAASHERR 93 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 + R + L +++R + V V + R +L A++ HGR+ Sbjct: 94 LRRTYRDSQLS---WSLHRAILATTLHIAPTESVTVIIDETTHTDRWTLLTAALWYHGRA 150 Query: 121 VTLYEKAFPLSEQCSKKAHDQ---FLADLASILPSNTTPLIVSDAGFKVPWYK-SVEKLG 176 + L P + + L + +LP+ + ++V+D F P + V G Sbjct: 151 IPLAWVLHPGYTRRATAFWTDVATLLERVQQVLPNAMSVVVVADRAFGCPAFTDQVAAYG 210 Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 W W+ RV+G + G H++T+ +T+ + + + +K Sbjct: 211 WGWVVRVQGHTRIQLRG----------------HTETMIRTLVTRGHRVVRRGHAFKKAG 254 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFR 296 + A+ EP +L +NL + Y +R IE FR Sbjct: 255 WRTV-----------TVVAAWEATCHEPLLLVSNLEGIGA----IRQAYGRRSAIEALFR 299 Query: 297 DLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN- 355 D K+ + S++ S + ++L + + L G + + + R Sbjct: 300 DWKTAGW--QWEASQSRSQTTQEALVLGMAIATVLVLLVGTAEAQAVLAERGDRPSPRRP 357 Query: 356 -RNVLSTVRLG----MEVLRHSGYTITREDSLVAATLLTQ---NLFTHGYVLG 400 S RLG + L +A T L + T G LG Sbjct: 358 WAARESLFRLGRYGVLRWLWTGTQPALGARLSLAGTALHERWATTVTRGGRLG 410 >UniRef50_Q3M186 Putative uncharacterized protein n=2 Tax=Anabaena RepID=Q3M186_ANAVT Length = 257 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 43/284 (15%), Positives = 85/284 (29%), Gaps = 30/284 (10%) Query: 6 ILHDSLYQFCPE-LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRL 64 +L F + L+ +L +L + L + K + + L LP + + + R Sbjct: 1 MLASFYQNFLEKYLNKAQLITLKMLVWLLQNQKQVRIERLAATLPLPIQQNSRRRHLQRF 60 Query: 65 LGNRHLHK--ERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVT 122 L L + + + + +D + +E VL SV R+ Sbjct: 61 LTLNALSVVLLWFPIIEAIINQHFKVGSQLTIAMDRTQWKEN---NVLMVSVIYQKRAWP 117 Query: 123 LYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSR 182 +Y + + L + +L +I + + K ++ R Sbjct: 118 IYWCLLEKDGCSNLTEQQKVLRPVIRLLKKYKLVIIGDREFHSIELGSWLHKQNIGFVLR 177 Query: 183 VRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQ 242 + + + W+ L ++ Y + + + R GR N Sbjct: 178 QKKDTTFC----QKWQKFQPLSNIEIYPGVRQFYTN----------VKVTQKRGFGRFNL 223 Query: 243 RSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYS 286 K K+ W L TNLP + IY+ Sbjct: 224 GVYWKR------KYRGKQEKDAWHLLTNLPDLNT----ALKIYA 257 >UniRef50_C1D0Y0 Putative transposase n=1 Tax=Deinococcus deserti VCD115 RepID=C1D0Y0_DEIDV Length = 333 Score = 138 bits (346), Expect = 5e-31, Method: Composition-based stats. Identities = 52/361 (14%), Positives = 107/361 (29%), Gaps = 69/361 (19%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRL 64 D L +L P L +RL LT A++ +++ L L ++ +R+ R Sbjct: 10 DSLQTALRCAFP-LDGRRLEVLTALILAMVQARSVVLYTLKTHVHLPGSFDTRYQRLRRF 68 Query: 65 LGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIR-EQKRLMVLRASVALHGRSVTL 123 + E + + + +++D ++ + ++ + +L S S+ L Sbjct: 69 -----VRFEFPDHFFVRFALFSLPDGELNLILDRTNWKLGKQDVNILLLSAVWDSFSLPL 123 Query: 124 YEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSR 182 P S + L +++D F W+ +++ G R Sbjct: 124 VWALLPHGGSSSHQERFAHLLRFVRCCSERHIGSLLADREFIGKSWFTFLDQHGIAPCIR 183 Query: 183 VRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQ 242 + + T+G +L S +S ++ K Sbjct: 184 LPA-------------------------TATIGTGKLPVSYGVSSRLCATK--------- 209 Query: 243 RSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPA 302 +A E LA + Y++R Q E LK+ Sbjct: 210 ----------------NTADEVLYLAY-----RGYASVNLRRYAQRWQAENLHSALKTR- 247 Query: 303 YGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTV 362 G L + + +ER +L + ++ + K +S Sbjct: 248 -GFNLEDTGLTQAERVSTVLTCVSAAFIWAYVTCQVLAAKQPVKR----KEDGYRAVSVF 302 Query: 363 R 363 R Sbjct: 303 R 303 >UniRef50_B4WNR8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WNR8_9SYNE Length = 271 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. Identities = 45/278 (16%), Positives = 88/278 (31%), Gaps = 29/278 (10%) Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYW 179 ++ L + Q L +L +N ++ V + + + G Y+ Sbjct: 6 AIPLSWRLMENLGNSDYVEQTQLLTKALPMLSANKIVVLGDREFCSVDLARWLGEKGHYF 65 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 R + G +W+ ++ L + + L KS+ G Sbjct: 66 CLRQKQSTWM-KAGETDWQKLTTLGLRPGTQGFYN-------------ALTLTKSKGFGA 111 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 + + EPW + T+L T + Y KR IEE FRD K Sbjct: 112 AHLVGKWKRRYQSFA------PAEPWFILTSLD----TLDVAIWAYQKRFDIEEMFRDFK 161 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR--N 357 Y L R +RF ++L+ + G +++ K+ ++ Sbjct: 162 LGGY--SLERCRAQD-KRFLSIVLLVAIAYTCATSQGQTLKQKALQKYIARPERYDQPNK 218 Query: 358 VLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTH 395 S +G+ R + + + +L +N + Sbjct: 219 RHSAFYIGLAAHRWVPFWPRCQQQVFELLVLDRNKLPY 256 >UniRef50_B4W0I0 Transposase, IS4 family protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4W0I0_9CYAN Length = 390 Score = 133 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 61/373 (16%), Positives = 126/373 (33%), Gaps = 55/373 (14%) Query: 1 MCELDILHDS---LYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLP--TKARTK 55 M L D + P L L + L ++ + +LT L + + Sbjct: 1 MLRNKYLKDWSKLVSYHFPHLSLPEVVGLATWSFGIVMTGSSSLTRLSEFIAKINGEKRN 60 Query: 56 HNIKRIDRLLGNRH-----------LHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQ 104 +R+ + + + + +W S S + + +D ++I Sbjct: 61 TVRQRLKEWYQDSKDKKGAKRRELDVSQCFAPLLKWILSLWKSEDKCLPLAIDATNI--G 118 Query: 105 KRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHD-QFLADLASILPSNTTPLIVSDAG 163 + VL V G + + K +E+ S K H + ++P ++++D G Sbjct: 119 QNFTVLSLHVLYQGCGIPVAWKIVKGTEKGSWKPHWLHIFHYVKDVVPDYWQVIVLADRG 178 Query: 164 FKVPW-YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKS 222 W ++ + L W+ R+ + Y + W+ + + + + GY K Sbjct: 179 LYADWLFEVICSLNWHPFLRINKQGYYQLRQEQEWRCLDTVAPK--TRTDWSGYVTCFKD 236 Query: 223 NPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLV 282 + + C +L + KEPW++ T+L + Sbjct: 237 HSLECTLLA------------------------RWDEGYKEPWLIVTDLELTQAQSF--- 269 Query: 283 NIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQ 342 Y R IE ++RD+KS + + +R +R + + L + L G + Sbjct: 270 -WYGLRAWIESSYRDIKSDGW--QWQKTRLREPDRAERIWLAMAIATLWTVTVGSEEKS- 325 Query: 343 GWDKHFQANTVRN 355 + Q N +N Sbjct: 326 --HQSQQFNEQQN 336 >UniRef50_A8ZQL6 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZQL6_ACAM1 Length = 383 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. Identities = 61/375 (16%), Positives = 117/375 (31%), Gaps = 55/375 (14%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA-RTKHNIKRIDRLL 65 L L Q+ + L + A+L ++ ++ L ++ + + ++R+ L Sbjct: 10 LQTQLSQWISPKDHRHLTVFSENIAAILQAQSGCMSHWLSYLSHRSCQARSQMERLSYFL 69 Query: 66 GNRHLHKERLAVYRWHASFICSG--NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTL 123 N + E Y + +D S + ++ +A GRS L Sbjct: 70 HNPRILSETF--YAPLLKQFLHAWEGMSMTLTLDTSMLW--DTYCLIEVCLAWGGRSFPL 125 Query: 124 YEKAFPL-SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLS 181 +K S + + L+ +LP +++D GF + + W W Sbjct: 126 AQKVMEHGSATVAFVDYCSVLSMTQGVLPPRCHITLLADRGFEHGELIRWLRSSEWSWAI 185 Query: 182 RVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKN 241 R + +Q KP+S L S L I C + Sbjct: 186 RAKSDLQITLANG-RSKPVSKLLPEVEQASLFRDVMIL---EDIHCHLAT---------- 231 Query: 242 QRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRM-QIEETFRDLKS 300 ++ +E W + T+ P Q +Y +R IE F+D KS Sbjct: 232 --------------ASVSTTQEAWAVITDTP----PSLQTFAVYGQRFGGIEPHFKDYKS 273 Query: 301 PAYGLGLRHSRTS----SSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 A+ + H R + + + +A + W H Sbjct: 274 AAFEVPRSHIRDTAALERLLMLLAAATLIAISVAFQVIAQDALKTIDWHTH--------- 324 Query: 357 NVLSTVRLGMEVLRH 371 LS +++G+ + Sbjct: 325 RGLSFLQIGLRQINQ 339 >UniRef50_C8Q1E5 Transposase, IS4 family n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8Q1E5_9GAMM Length = 288 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 46/313 (14%), Positives = 98/313 (31%), Gaps = 39/313 (12%) Query: 91 MPIVLVDWSDI-REQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASI 149 + +D ++ + L + V G ++ LY + + + + Sbjct: 10 KVTLTIDRTNWKWGKSNLNIFMLGVVYKGIAIPLYWQMLDKRGNTNHLERCELIDRFIKQ 69 Query: 150 LPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 + +IV+D F W+ + + R++ + + I L S Sbjct: 70 FGKDNLEMIVADREFVGEKWFNWLTNNHIPFAIRIKKNSKVKNHHG-KLVQIKELLRHVS 128 Query: 209 SHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILA 268 + LT C + ++ R K I+A Sbjct: 129 HQETYRHGRILTVDG---CLVRVFAKRDKDYGLV-----------------------IVA 162 Query: 269 TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALML 328 TN + + Y+KR +IE F LK G L + + +R ++ + + Sbjct: 163 TNQLETV----DAMTSYAKRWEIETLFACLK--GRGFNLEDTHLTHLDRVSKLVAVNALA 216 Query: 329 QLTCWLAGVHA-QKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATL 387 + G++ + + + ++N R S LG++VL + + + Sbjct: 217 FCWAYHVGIYKDKDKPLKRKLKSNA---RPQASLFALGLDVLIEGLHLVFFNNDKTVFRQ 273 Query: 388 LTQNLFTHGYVLG 400 L L +G Sbjct: 274 LVSFLTPKPMKIG 286 >UniRef50_Q9RZJ3 Transposase, putative n=9 Tax=Deinococcus radiodurans RepID=Q9RZJ3_DEIRA Length = 327 Score = 130 bits (326), Expect = 9e-29, Method: Composition-based stats. Identities = 51/360 (14%), Positives = 114/360 (31%), Gaps = 50/360 (13%) Query: 33 LLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMP 92 ++D +++ +L ++P + + +R DR + L +R + G Sbjct: 1 MIDARSVNHHDLSAHMPGMSTPQGKKRRADRTFRDEQL--DRGFFIALLVVHLPPG--KV 56 Query: 93 IVLVDWSDIREQKR-LMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILP 151 ++ +D ++ + + L +HG ++ L S A + L P Sbjct: 57 LLSLDRTNWEHGETPINFLVLGAVVHGFTLPLIWVPLDQSGNSHTYARMWLVLKLLRAWP 116 Query: 152 SNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSH 210 + +V+D F W++ + + G R+R D+ + W Sbjct: 117 AKRWLGLVADREFIGAEWFRFLRRQGIKRAIRIRQTDMLDDMNGKEWFE----------- 165 Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATN 270 + + + I ++ ++ + + I+AT+ Sbjct: 166 -----HVQHGHFHEIGEKVFVF----------------GELMRVVATRSPVGDLVIIATD 204 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQL 330 + ++ +Y +R IE TF K G L + + R + + + + Sbjct: 205 -----FSARKTWRLYKQRWSIECTFSSFKKR--GFDLERTGMTERSRLQRLFGLVTLAWM 257 Query: 331 TCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQ 390 C GV R +S VR G + L + + + + Q Sbjct: 258 FCLRLGVWLS----QTWPIPVLKHGRRAVSLVRHGAQHLVDAL-RWKPQQFMAILEVFIQ 312 >UniRef50_Q9UH48 Gastric cancer-related protein GCYS-20 n=1 Tax=Homo sapiens RepID=Q9UH48_HUMAN Length = 332 Score = 129 bits (323), Expect = 2e-28, Method: Composition-based stats. Identities = 110/116 (94%), Positives = 110/116 (94%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR Sbjct: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVAL 116 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMV R Sbjct: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVFRVCFEY 116 >UniRef50_Q2S0J1 Putative transposase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S0J1_SALRD Length = 248 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 83/235 (35%), Gaps = 19/235 (8%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLG 66 + +L Q PE R +L L + + L+++ + A+ + + + R LG Sbjct: 1 MESTLSQLLPEALATRRRALAQMITGLHLAEHVHLSKVAGRIAGTAQLESKTRHLRRFLG 60 Query: 67 NRHLHKERLA------VYRWHASFICSGNTMPI-VLVDWSDIREQKRLMVLRASVALHGR 119 N ++ ER + W A + + PI +LVD ++ VL A +A R Sbjct: 61 NENVDPERFYSPVRDRLIEWAAQGAETQGSGPIRLLVDTVELS--GERQVLMAGIAYRRR 118 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWY 178 ++ + + + + + L L P ++V D F +E GW+ Sbjct: 119 ALPICWETYRREGVTNAEQQISLLKALVGRFPDEAEVVVVGDGAFHSTDLMDFIEDQGWH 178 Query: 179 WLSRVRGKVQYAD--------LGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPI 225 + R+ WK + +L + L +TK N Sbjct: 179 FCLRLHADTYIRSFKDSSKGFPKEGTWKQLRDLVPEEGER-RYLQDVIVTKDNEY 232 >UniRef50_D1C6P8 Putative uncharacterized protein n=2 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C6P8_SPHTD Length = 371 Score = 123 bits (309), Expect = 1e-26, Method: Composition-based stats. Identities = 70/376 (18%), Positives = 127/376 (33%), Gaps = 42/376 (11%) Query: 4 LDILHDSLYQF---CPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 L +L D + F P L + R+ L L L+ T L + LP + ++R Sbjct: 5 LHLLQDWTHHFQALLPGLRVTRVRGLALLSLGLIWAGTPQLGHIAATLPLPVQQLSTVRR 64 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRS 120 + R L R + A G +++VD + E+ +L + +H R Sbjct: 65 LRRWLATRAVPVVATWQPLARAFLAHRGQRELLLVVDPTP--ERDDATLLVLGLVVHRRV 122 Query: 121 VTLYEKAFPLSEQCSKKAHDQFL----ADLASILPSNTTPLIVSDAGFKVPW-YKSVEKL 175 + L P + +L +A++LP T V D G ++L Sbjct: 123 LPLAWHIVP-GQTAWAHPTTAYLARLGQRVAAVLPPGVTVTRVVDRGLASAAVIDWCQRL 181 Query: 176 GWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 235 GW+WL R + A G P + + + + ++++ Sbjct: 182 GWHWLMR---RNVDARQGVHVRLPDGTVCPAWACVP--------GPGRRWAGPVAAFQTQ 230 Query: 236 SKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETF 295 S ++EPW+L ++ P V Y +R Q+E Sbjct: 231 GWYAAELTSIWPV-----------RSREPWVLLSDRP----AGPARVREYRRRQQVEAVS 275 Query: 296 RDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN 355 +D + + L S ++ R + + L + L G ++G + F R+ Sbjct: 276 QDGTTRGWN--LEASTRTARNRLNRLPLALFLALWWSHLRGQQVVRRGERRRFDRTDRRD 333 Query: 356 RNVLSTVRLGMEVLRH 371 VRLG + Sbjct: 334 GR---LVRLGRRWMTW 346 >UniRef50_Q10V90 Transposase, IS4 family n=7 Tax=Trichodesmium erythraeum IMS101 RepID=Q10V90_TRIEI Length = 275 Score = 121 bits (303), Expect = 4e-26, Method: Composition-based stats. Identities = 48/294 (16%), Positives = 100/294 (34%), Gaps = 37/294 (12%) Query: 111 RASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKV---P 167 S+A R++ ++ K + + + +L L + Sbjct: 1 MISLAWKKRALPIHWKILTHKGASNLAEQKAVIRPVIRLLKCQKIILTADREFHSIFLCY 60 Query: 168 WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISC 227 W K +K Y++ R + Y +L++ Sbjct: 61 WLKKYQKQDVYFVLRTKKSTMIKRGKK---------------------YCKLSELPANIG 99 Query: 228 QILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSK 287 + L+ ++ + + T + PK S E W + TNL + P ++ IYS+ Sbjct: 100 ECKLFLNQKITKILRVGTYNLLIYKKPKYRDKSVSEKWYILTNLSL----PGKIKKIYSQ 155 Query: 288 RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKH 347 RM IE F+D K+ AY L S ++ R + ++L+ + + +G ++ Sbjct: 156 RMGIEAMFKDYKTGAYNL---ESAKANETRLNNLILLIAISYAISSFQVQKIKNKGVQEY 212 Query: 348 FQANTVRNR--NVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVL 399 ++R S+ +G+ + Y +D + L H ++ Sbjct: 213 ISRTNEKSRKERRHSSFFVGLSRM----YWAINDDFIWGLVENLMRLNPHKFLY 262 >UniRef50_A8ZMZ5 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMZ5_ACAM1 Length = 258 Score = 121 bits (303), Expect = 4e-26, Method: Composition-based stats. Identities = 47/264 (17%), Positives = 94/264 (35%), Gaps = 37/264 (14%) Query: 109 VLRASVALHGRSVTLYEKAFPL-SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKV- 166 ++ SV GR+V S + + L L + ++++D GF Sbjct: 1 MIHLSVVCCGRAVPFLWLVLAHKSAAVGFEEYQPLLRRARWFLRKHPDVMLLADRGFANH 60 Query: 167 PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPIS 226 +++ W++ R+ V H + + + P Sbjct: 61 QLMSWLQQSRWHYCLRIPCDVIL--------------------HGPRRCPREVRRLWPSK 100 Query: 227 CQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYS 286 + +LY++ C KEPW + T+ ++T Q Y+ Sbjct: 101 GEAILYRNVGLWEDGV------CRCNLVLANIRGVKEPWAVITDESPTLQTLWQ----YA 150 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK 346 R ++EE F D KS A L S+ ++ + + L+A + L G+ Q +G + Sbjct: 151 LRFRVEELFLDSKSGA--FELEDSKIRCADALERLYLVAAVALLYSTTHGMAVQIEGLRE 208 Query: 347 HFQANTVRNRNVLSTVRLGMEVLR 370 + R +S +++G+ L+ Sbjct: 209 QVDPHW---RRGISYLKIGLRWLK 229 >UniRef50_B7I4U9 Transposase 1 n=31 Tax=Bacteria RepID=B7I4U9_ACIB5 Length = 189 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 66/191 (34%), Gaps = 11/191 (5%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M L+ L+ L + + L L ++ +T L+ + LP K + +R Sbjct: 1 MTHLNELYLILNKSLKW-NKSHLKCFALIMLVIILKQTCNLSSASKALPIKCLPQSFYRR 59 Query: 61 IDRLLGNRHLHKERLAVYRWHASFICS--GNTMPIVLVDWSDIREQKR-LMVLRASVALH 117 + R ++ YR + I + + +D ++ + KR + +L ++ Sbjct: 60 MQRFFAGQYFD------YRQISQLIFNMFSFDQVQLTLDRTNWKWGKRNINILMLAIVYR 113 Query: 118 GRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVP-WYKSVEKLG 176 G ++ + K + +I + + +D F W+ + + Sbjct: 114 GIAIPILWTLLNKRGNSDTKERIALIQRFIAIFGKDRIVNVFADREFIGEQWFTWLIEQD 173 Query: 177 WYWLSRVRGKV 187 + RV+ Sbjct: 174 INFCIRVKKTS 184 >UniRef50_A5UPF7 Transposase, IS4 family n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UPF7_ROSS1 Length = 397 Score = 118 bits (295), Expect = 4e-25, Method: Composition-based stats. Identities = 64/371 (17%), Positives = 119/371 (32%), Gaps = 51/371 (13%) Query: 1 MCELDILHDSLYQFCP----ELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT----KA 52 M + L D++ L L L +L T+ L + A Sbjct: 1 MSSIPHLTDAVETLLRAGDDPLPRATRQRLALFVVGVLLAGTVVLRRVATTQTHIALGAA 60 Query: 53 RTKHNIKRIDRLLGNRHLHKERLAVYRWHASFI--CSGNTMPIVLVDWSDIREQKRLMVL 110 + + +R+ R++ + L R + + ++VD S + L Sbjct: 61 QAASHERRLRRIVNDPQLGAAAPMDGRVVRRVLQRLRPDQRVWLIVDESG--HSDVVRTL 118 Query: 111 RASVALHGRSVTLYEKAFPL---SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVP 167 A++ GR++ L +P Q S LA +A+ILP+ + +++D F P Sbjct: 119 VAALWYRGRALPLAWVRWPAQQPHPQASWTDCQTLLAQVAAILPAGPSVTVLADRAFGCP 178 Query: 168 WYK-SVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPIS 226 + V GW L R + + + S L + + Sbjct: 179 AFTDLVAAHGWQDLVRAQRQTCLRHDDG-RMQAFSTLIPQAGTR--------------WC 223 Query: 227 CQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYS 286 + +K + + ++ EP +L +NLP LV Y Sbjct: 224 GRGQAFKKQGWRPVSVVASWRV-----------GCPEPLLLVSNLP----PAWDLVRPYR 268 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDK 346 +R I FRD K+ + S+ +++L+ + L G A + + Sbjct: 269 RRAAIAALFRDWKTSGW--QWEASQVRDVAHQSVLVLVLALATLITRCLGEEAAQTILN- 325 Query: 347 HFQANTVRNRN 357 Q R Sbjct: 326 --QPPRAGRRR 334 >UniRef50_UPI000197B669 hypothetical protein BACCOPRO_01365 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B669 Length = 270 Score = 117 bits (292), Expect = 1e-24, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 63/166 (37%), Gaps = 7/166 (4%) Query: 2 CELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRI 61 L I+ + + ++L R+ + HAL +T++L +L +PT N++RI Sbjct: 32 QILPIMQEYFGKS---MNLARIKLMAYMLHALCVVQTVSLHKLASAMPTSVERDSNLRRI 88 Query: 62 DRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQK-RLMVLRASVALHGRS 120 R + N L+ + +A+ + ++ +D ++ + + + +L + G + Sbjct: 89 QRFIANYALNLDLVAM---MIFSLLPVKNGLVLSMDRTNWKFGEFNINILTLGITYKGVA 145 Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKV 166 L + + + + + +V+D Sbjct: 146 FPLLFSLLNKRGNSNWEERKDIMERFIRLFGHDCIDCLVADRESSA 191 >UniRef50_Q5GUK2 ISxac1 transposase n=1 Tax=Xanthomonas oryzae pv. oryzae RepID=Q5GUK2_XANOR Length = 361 Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats. Identities = 77/330 (23%), Positives = 134/330 (40%), Gaps = 34/330 (10%) Query: 63 RLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVT 122 R + +LA+ R S++ + P + D +++R + R + R + Sbjct: 50 RRTKDAQFGGVQLAILRPRDSWLSGVISTPALHRDATNLRSRDRANIARPAT-------- 101 Query: 123 LYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSR 182 S A + A L S P++ T + + W + + Sbjct: 102 --------QSASSLAARTRRWAYLISTAPASHTAVSRA-------------TPRWSIVRQ 140 Query: 183 VRGKVQYADLGAENWKPI-SNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR-- 239 + +++ L A NW + L + S + +S+P C+++LY +GR Sbjct: 141 LLPRLRTGLLAAGNWVSVGPPLAAGAPSSGLVARTMQANRSDPRDCRLVLYAKTPQGRQQ 200 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 +N+RS S +A +EPW++ + + + KQLVN+Y++RMQIE FR+LK Sbjct: 201 RNRRSPAKVSRASSSLKAAAREREPWLIVASPQLHAPSAKQLVNLYARRMQIELAFRNLK 260 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 S YG + S T ER I+LL+ + WLAG+ + G + + R + Sbjct: 261 SHRYGQAMEDSLTRRGERLQILLLLTTLASFASWLAGLGCEATGIARWLSPRSS-TRKLY 319 Query: 360 STVRLGMEVLRHSGYTITREDS-LVAATLL 388 T+R+G E L L L Sbjct: 320 LTLRVGREALVRCWPMEPVSRWTLERLRTL 349 >UniRef50_A4W4J4 Transposase and inactivated derivative n=29 Tax=Streptococcus RepID=A4W4J4_STRS2 Length = 440 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 62/348 (17%), Positives = 117/348 (33%), Gaps = 30/348 (8%) Query: 15 CPELHLKRLNSLTL--ACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHK 72 PE R + LT+ A+L TL + +L ++R +L H+ Sbjct: 37 HPEKDFSRKSQLTMETMIQAILTMGGNTLAKELLDLDLPVSQSAFVQRRYQL-----KHQ 91 Query: 73 ERLAVYRWHASFICSGNTMPIVLVDWSDI-------REQKRLMVLRASVAL---HGRSVT 122 A++ S I + +PI+ VD SD+ + H ++ Sbjct: 92 AFKALFANITSKIPTFKDLPILAVDGSDVVLPRNRSDKTTTFQTGPHHTPYTLIHINALY 151 Query: 123 LYEK--AFPLSEQCSK--KAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGW 177 E+ L Q ++ F+ + S L++ D G++ ++ W Sbjct: 152 NLEQEIYHDLRIQNNREVDERAAFIDMMESC--PFEQALVIMDRGYESYNVMAHCQERNW 209 Query: 178 YWLSRVRGKVQYADLGAE--NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 235 ++ R+R G + D++ +T K L + P L + + Sbjct: 210 SYIIRIRDGNHSMKSGFNLPDTPCFDEEFDLNICRKQTNVMKELYRDFPNQYHFLPHNAS 269 Query: 236 -SKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEET 294 + R + + +P T + +P++L ++Y+ R IE + Sbjct: 270 FDLLPNSSRKSDPISFYDLHFRMVRLEIKPGFFETLVTNTDYSPEKLKDLYAYRWGIETS 329 Query: 295 FRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQ 342 FRDLK Y +GL H E + + C H + Sbjct: 330 FRDLK---YSIGLTHFHAKKKEGILQEIYAHFINFNVCKWLTSHVAIK 374 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 114 bits (284), Expect = 8e-24, Method: Composition-based stats. Identities = 42/230 (18%), Positives = 79/230 (34%), Gaps = 10/230 (4%) Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAE 195 + ++T+ + ++D G++ + VE G Y+L RV+ Sbjct: 168 NERRAMCEMIDRY--NDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKDITSNGITSKL 225 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPK 255 P S D + + T K+NP ++ + K + + Sbjct: 226 TMLPESGEFDEWVNVTLTKKQTNEVKANPKKYRV-IDKKTPFDYLDLHFNNFYEMKMRVI 284 Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 + + TNLP + ++ +Y+KR IE +FR+LK Y LGL + Sbjct: 285 RFPIPQGSYECIITNLPQDKFNSDEIKRLYAKRWGIETSFRELK---YALGLTRFHSKKP 341 Query: 316 ERF-DIMLLIALMLQLTCWLAGVHA--QKQGWDKHFQANTVRNRNVLSTV 362 E + + +A +K+G +Q N R + Sbjct: 342 EYIMQEIWSRMTLYNFCEIIATNVVINEKKGCKHTYQLNYTRAIRICCYF 391 >UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellular organisms RepID=B8B8E6_ORYSI Length = 753 Score = 110 bits (275), Expect = 7e-23, Method: Composition-based stats. Identities = 88/94 (93%), Positives = 89/94 (94%) Query: 288 RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKH 347 RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKH Sbjct: 359 RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKH 418 Query: 348 FQANTVRNRNVLSTVRLGMEVLRHSGYTITREDS 381 FQANTVRNRNVLSTVRLGMEVLRHSGYT + Sbjct: 419 FQANTVRNRNVLSTVRLGMEVLRHSGYTNNKGRL 452 >UniRef50_Q47076 BfpT, bfpV, bfpW and transposase genes, complete cds n=53 Tax=Enterobacteriaceae RepID=Q47076_ECOLX Length = 186 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 55/170 (32%), Positives = 92/170 (54%), Gaps = 2/170 (1%) Query: 221 KSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQ 280 + I YK KGRK +RS + + K S SAKE W++ + ++ Sbjct: 10 RKKSIRGHFYTYKKSVKGRKKKRSKGQRGLNKTDKEQSKSAKEAWLIFSR--TNDFRARE 67 Query: 281 LVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQ 340 ++ +YS+RMQIE+ FRD K+ +G GLR S++ S+ R ++ L+A + + WL G HA+ Sbjct: 68 IIKLYSRRMQIEQNFRDEKNGRFGFGLRASKSRSTGRILVLSLLATLSTIVMWLLGYHAE 127 Query: 341 KQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQ 390 +G + +Q N++++R V+S + L VLRHS + + R L + Sbjct: 128 NKGLHQKYQVNSIKSRRVISYLTLAKNVLRHSPFILRRTVLSTVLNHLAR 177 >UniRef50_B7AA71 Transposase IS4 family protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7AA71_THEAQ Length = 393 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 64/341 (18%), Positives = 117/341 (34%), Gaps = 39/341 (11%) Query: 19 HLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTK-HNIKRIDRLLGNRHLHKERLAV 77 + L + ++E+ +LP+ + + H K + R L N + E L Sbjct: 9 DKRLHRRFEEVVRGALAAGSARVSEMVASLPSPLQNRFHQAKALYRFLSNPRVEAEALLD 68 Query: 78 YRWHASFICSGNTMPIVLVDWSDI-----------------REQKRLMVLRASVALHGRS 120 + S +VL+D S + R ++ + GR Sbjct: 69 RVYQESATALEGEEVLVLLDLSPVAKPYARALEGIARVGKDRRPGYELLTALGLDPAGRL 128 Query: 121 VTLYEKAFPLSEQCS---KKAHDQFLADLASILPS-NTTPLIVSDAGFK-VPWYKSVEKL 175 Y E+ K + + L + V+D GF + V L Sbjct: 129 ALGYAHLVAYGERGFASLPKEVEGAIEAARERLGGVGRRLVYVADRGFDDRKVFGQVLAL 188 Query: 176 GWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 235 G ++ RV + + G +L ++SS + G + + ++ L+ Sbjct: 189 GEEFVVRVYRDRKLGEGG--------SLAKVASSLALPCGEEVELRVGGRYQRVRLH--- 237 Query: 236 SKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRT-PKQLVNIYSKRMQIEET 294 G + H ++ + + W L T+LPV R Q+V Y +R ++E Sbjct: 238 -FGWREVEVEGRRLHLVVCRVPALGRRGEWWLLTSLPVRGREEAAQVVEAYRRRWEVERF 296 Query: 295 FRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 FR LK+ GLGL + R ++ + L L + W Sbjct: 297 FRLLKT---GLGLETFQVRGLARIRKVVAVLLGLAVFLWEV 334 >UniRef50_C5UVK8 Putative transposase n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UVK8_CLOBO Length = 250 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 48/267 (17%), Positives = 86/267 (32%), Gaps = 51/267 (19%) Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWY 178 ++V L E A L + S + + L L + + EKL W Sbjct: 23 KTVVLSEIAQELKDSYSSGTEESKIKRLQRFLSNKSIN---------------PEKLKWK 67 Query: 179 WLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKG 238 + R + G K + ++ + S+ K +LT N +C + + K Sbjct: 68 YCIRCTKDLCVTIKGKLKIKKLEDIKAL-SNKGKNFYNIKLTAQN-YNCNLSVCK----- 120 Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 + A+E W + NL + Y KR QIEE F+D Sbjct: 121 -------------------AKDAEETWFIVHNLEKSF-----AIREYKKRFQIEEMFKDF 156 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQG---WDKHFQANTVRN 355 KS G L + + + + ++ + G+ K + + Sbjct: 157 KSG--GFNLESTWSMNIQYIKMLYFCISIAYCFIITLGISCGKDKNNTIIGVIKDLNGKK 214 Query: 356 RNVLSTVRLGMEVLRHSGYTITREDSL 382 + S R G++ + Y+ E L Sbjct: 215 VRIYSLFRAGLKWFKRCYYSKRNEYYL 241 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 3/61 (4%) Query: 18 LHLKRLNSLTLACHALLDCKTLTLTELGRNLP---TKARTKHNIKRIDRLLGNRHLHKER 74 L KRLN+L ++ KT+ L+E+ + L + + IKR+ R L N+ ++ E+ Sbjct: 4 LSSKRLNNLVAMIIGIIISKTVVLSEIAQELKDSYSSGTEESKIKRLQRFLSNKSINPEK 63 Query: 75 L 75 L Sbjct: 64 L 64 >UniRef50_A5UY16 Transposase, IS4 family n=9 Tax=Roseiflexus sp. RS-1 RepID=A5UY16_ROSS1 Length = 416 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 64/406 (15%), Positives = 124/406 (30%), Gaps = 75/406 (18%) Query: 5 DILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHN------- 57 +L + P L R +L L + L L T + Sbjct: 8 RLLQQQIQTLFPRLSRHRRRALARWVLGALLAGSANRPALVHALATAGIARAATLADAWD 67 Query: 58 ------IKRIDRLLGNRHLHKERL-------AVYRWHASFICSGNTMPIVLVDWSDIREQ 104 RID + + RW + G ++ +D S + Sbjct: 68 AWIAAPAHRIDTADPPAAGAPPVVSPLACGADLLRWIRAHWTGGP--LVLGLDASH--RR 123 Query: 105 KRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAH-DQFLADLASILPSNTTPLIVSDAG 163 +++LR SV G ++ + P ++ + + H ++ L S LP + L+++D G Sbjct: 124 DDVVLLRMSVLYRGTALPVAWVIVPANQPGAWEPHWERMLRWARSALPLDQEVLVLADQG 183 Query: 164 FKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKS 222 P + ++ ++ + RVR +A G ++ ++ Sbjct: 184 LWSPRLWHAIRSQQFHPIMRVRTTSTFAPTGQAR----QSVLRLAPGPG----------- 228 Query: 223 NPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLV 282 + + +K K + A EPW+L T+LP Sbjct: 229 HGWVGVGVAFKHAPK----------RIAGTLAVAWGADHAEPWVLLTDLPPAQVDAA--- 275 Query: 283 NIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQ 342 Y+ R E FR KS + + + + L+ + L G + Sbjct: 276 -WYALRSWDEAGFRQSKSMGWDWQRG--QVTDPDAVAWQYLVVATVTLWTVAVGTRIEDA 332 Query: 343 GWDKHFQANTVRN-----------------RNVLSTVRLGMEVLRH 371 + ++ + V+S +R GM+ LR Sbjct: 333 E-QQGVPPGRLKRAPPTTGAPPRRRWSGTAQRVISLLRRGMQHLRW 377 >UniRef50_B2AKB8 Transposase, IS4 family n=40 Tax=cellular organisms RepID=B2AKB8_CUPTR Length = 442 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 54/286 (18%), Positives = 91/286 (31%), Gaps = 26/286 (9%) Query: 80 WHASFICSGNTMPIVLVD-WSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKA 138 H ++ + + P+ ++D W RE K R + R + YE+ Sbjct: 119 LHPTYAVTPDREPLGVIDAWMWAREPKDADGNRGGIKESVRWIEGYERVAEQ-------- 170 Query: 139 HDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGW--YWLSRVRGKVQYADLGAEN 196 A++LP + G ++LG WL R + A G + Sbjct: 171 --------AALLPQTRLVYMTDREGDIAELMARAQELGQPADWLIRSQHNRNLA-EGGKL 221 Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 W + + + L + K+ + ++ + + G T C Sbjct: 222 WDSVDA-SPVLGEITFILPGRAGQKAREVKQELRAQRMKLPGLVGAEFT---CVAAREIE 277 Query: 257 YSASAKEP-WILATNLPVEIRTP-KQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 A K W L TN + +LV Y R +IE F LK+ L+ S Sbjct: 278 APAGVKPVVWRLVTNREAQDADAVNKLVEWYRARWEIEMFFHVLKTGCKVEALQLSHMDR 337 Query: 315 SERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 ER + ++ G F A+ +R VLS Sbjct: 338 VERALALYMVVAWRIARLMRLGRTCPDLDASLFFDADEIRGAYVLS 383 >UniRef50_B1QZ52 Putative transposase n=2 Tax=Clostridium butyricum RepID=B1QZ52_CLOBU Length = 190 Score = 99 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 33/159 (20%), Positives = 48/159 (30%), Gaps = 10/159 (6%) Query: 235 RSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEET 294 K N T +A A E W +A N I + Y K IEE Sbjct: 38 SGKYFYNIELTAQKYICNMSVCKAADADEVWYIANNFDEAID-----IREYKKIFDIEEM 92 Query: 295 FRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV- 353 F+D K G L + + +M L + G K +K A Sbjct: 93 FKDFK--GGGFNLEDTWSQDIHYIKMMYLCISIAYCWIITLGTSCTKDKKNKLIGAVKFL 150 Query: 354 --RNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQ 390 + + S R G + + Y+ E L L + Sbjct: 151 KGKKVRIYSLFRAGYKWFKRGYYSNRSEYYLKITFTLYE 189 >UniRef50_B4VLK2 Transposase, IS4 family protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VLK2_9CYAN Length = 199 Score = 98.8 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 63/196 (32%), Gaps = 24/196 (12%) Query: 111 RASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYK 170 +V RS +Y + + + + + +L LI +V Sbjct: 1 MVAVIWKKRSFPVYWQFLDKAGSSNISEQIAVIRPVLKLLSRYQVVLIGDREFRRVELAY 60 Query: 171 SVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQIL 230 ++K ++ R++ L ++ + L + + Sbjct: 61 WLKKKKVFFALRIKQDTYIRQSEGN----YQQLSELGLTPGMKLFHSGVN---------- 106 Query: 231 LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 K + GR N + K + K+ W + TNL + +++ +Y R Sbjct: 107 YTKKKGFGRFNLAAYWKR------KYRGSYEKQGWFILTNLS----SIDEVIQVYQSRSG 156 Query: 291 IEETFRDLKSPAYGLG 306 IE F+D K+ Y L Sbjct: 157 IESLFKDCKTGGYNLE 172 >UniRef50_Q5L3A2 Transposase of IS231E-like element n=1 Tax=Geobacillus kaustophilus RepID=Q5L3A2_GEOKA Length = 453 Score = 97.6 bits (241), Expect = 6e-19, Method: Composition-based stats. Identities = 51/385 (13%), Positives = 123/385 (31%), Gaps = 29/385 (7%) Query: 24 NSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHAS 83 SL C AL + +L+ G N + +K + E+L +++ + Sbjct: 62 KSLVQLCSALALKQNTSLSAEGLNQRFHEKAVSFLKAV----------FEKLLIHQTQEA 111 Query: 84 FICSGNTMPIV---LVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFP----LSEQCSK 136 + ++D + + + + G + L + + Sbjct: 112 RRLCPRHSLFLRIRILDSTSFQLPPEIQGIYEGCTGPGVKIQLEYEWLEGKVLHVDVEDA 171 Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAEN 196 + HD + F + +++ G +++SR++ V + Sbjct: 172 RHHDAAYGASLLSTIQEGDLCLKDLGYFSLEGLQAIHDAGAFYISRLKHNVGIYQKEGDR 231 Query: 197 ---WKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT-HCHHP 252 W+P L + + L + ++ ++++Y+ + + + Sbjct: 232 FRKWEPEDFLAVLQPGETMELEHAYVSGKKVHQPRLIVYRLTEEQERQKEGQWKQKAKQK 291 Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 + ++ TN+P + ++ +YS R QIE F+ KS + + Sbjct: 292 GAAYVTRRPHPIYVYITNIPAIYTSLHEIHTLYSLRWQIEVVFKTWKSL---FHIHRFKP 348 Query: 313 SSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVR-LGMEVLRH 371 RF L L+ L + + W Q + +S ++ GM++ + Sbjct: 349 MKGARFQCHLYGTLIALLISSTV--MFKMREWLYRKQKKELSEYKAMSMIKEFGMDLFQ- 405 Query: 372 SGYTITREDSLVAATLLTQNLFTHG 396 + ++ L + HG Sbjct: 406 -ALWCSEALAVQLLLKLCDIIAQHG 429 >UniRef50_Q7NHH4 Gll2563 protein n=2 Tax=Gloeobacter violaceus RepID=Q7NHH4_GLOVI Length = 212 Score = 97.6 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 42/168 (25%), Positives = 59/168 (35%), Gaps = 14/168 (8%) Query: 225 ISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNI 284 L K + G N EPW L TNL TP++ + Sbjct: 9 WFKGAKLVKRKPFGPVNIAGKLGFQPGKKEAYC-----EPWWLLTNLS----TPQEAITW 59 Query: 285 YSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGW 344 Y R IEE FRD KS Y L + +RF ML++ M + G ++QG Sbjct: 60 YRCRWGIEEMFRDCKSGGYNLEKLRVQ---PKRFKRMLMVLAMAMSLSVMHGKQLKRQGL 116 Query: 345 DKHFQANTVRNRNV--LSTVRLGMEVLRHSGYTITREDSLVAATLLTQ 390 K+ R V ST R+G++ S + L + Sbjct: 117 QKYVSRVAEPGRVVKRRSTFRVGLQSEVWSQGMERCRQLVEKLMSLRR 164 >UniRef50_Q7NIQ3 Glr2130 protein n=2 Tax=Gloeobacter violaceus RepID=Q7NIQ3_GLOVI Length = 190 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 45/158 (28%), Gaps = 5/158 (3%) Query: 26 LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHL--HKERLAVYRWHAS 83 L + H L + L L L LP K + R L L L + Sbjct: 30 LHILLHTLQTQQNLCLERLANALPLPITVDSRRKAVQRFLLLPSLCLWHLWLPLLAQIIE 89 Query: 84 FICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFL 143 ++ +D ++ + +L S+ R++ ++ + + S L Sbjct: 90 HFAVQPQRLVLAIDRTNWWK---YNLLMVSLVWDRRALPVFWRLLNHAGNSSLPERRSVL 146 Query: 144 ADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLS 181 + ++ V + ++ + Sbjct: 147 LPVLKYFHHKQIIVLGDREFGSVGFANWLQSQKVSYCL 184 >UniRef50_D0SG98 Transposase n=3 Tax=Gammaproteobacteria RepID=D0SG98_ACIJO Length = 426 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 50/336 (14%), Positives = 95/336 (28%), Gaps = 46/336 (13%) Query: 56 HNIKRIDRLLGNRHLHKERL--AVYRWHASFICSGNTMPIVLV-DWSDI----------- 101 + + R L N + +L + I + ++V DWS + Sbjct: 23 STTQAVWRFLNNNKISFSQLNQPIKLLACEQIKTSPHQYALIVHDWSQLQYVKHSHKVQR 82 Query: 102 ---REQKRLMVLRASVALHGRS----VTLYEKAFPLSEQCSKKAHD------------QF 142 E L++S+ S L + S S + + Sbjct: 83 LQRTEANSGYELQSSLLFDASSGLPIAPLAQTLTDASGCYSTFSEQYSERKSHLDSLAEQ 142 Query: 143 LADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISN 202 + + T I+ G + + + G+ WL R + + G Sbjct: 143 IKTIEQYPIEKTKVHIIDREGDSIAHLREISSHGFKWLIRAKESHRIEHQGETYKVAEVA 202 Query: 203 LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK---NQRSTRTHCHHPSPKIYSA 259 ++ + I + ++ RK QR + ++ A Sbjct: 203 EKVVTQQVKPIAYKGNRHMLHVGETDIRITRAAKPKRKDDLGQRVAPQPGKAVTARLIVA 262 Query: 260 SAKEP-------WILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 K+ W L +N+ EI +L Y R IE F+ LK + + Sbjct: 263 VVKDAQGKTVARWSLISNVSSEI-DAVELTTWYYWRWTIECYFKLLKQAGHN--VESWLQ 319 Query: 313 SSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHF 348 ++ LLI+ M + W +Q Sbjct: 320 TTPAAILRRLLISSMACVLTWRIQRSEDEQNQKIRI 355 >UniRef50_Q8A4P1 Transposase n=5 Tax=Bacteroides RepID=Q8A4P1_BACTN Length = 440 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 44/327 (13%), Positives = 105/327 (32%), Gaps = 33/327 (10%) Query: 26 LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFI 85 L+ H + L ++ + +++R+ + + + Sbjct: 37 LSGLIHTCCKNASGRQHLLCIQDTSEINYEAHVERMKKKTASPGI----------VGQKQ 86 Query: 86 CSGNTMPIVLVDWS-----DIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHD 140 C P+++VD S K+ A+++ R+ + P+ E+ S + + Sbjct: 87 CGTFLHPVLVVDASSHIPIGFSSVKQWNRSPAALSREERN----YRYQPIEEKESYRWIE 142 Query: 141 QFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI 200 +A + +I + + + L R + + + Sbjct: 143 SGMAASEQMPRDAVKTIIGDREADIFELFSRIPTDNVHLLIRSVHERNCRLDDPDCSVHL 202 Query: 201 SNLHDMSSSHSKTLGYK--RLTKSNPISC-QILLYKSRSKGRKNQRSTRTH-----CHHP 252 + L + + ++ + ++C ++ + N + + C H Sbjct: 203 NTLMEQAVLRAEYSFEVLPGSGRKKRVACMELRFERVTLCAPVNGPAKGSPPVSLYCIHV 262 Query: 253 SPKIYSASAKEP---W-ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLR 308 K S E W +L T++ + + + Y R IEE FR LK G + Sbjct: 263 KEKSSSTPVNESPIEWRLLTTHVVETVEQAIECIGWYRCRWLIEELFRVLK--RKGFMIE 320 Query: 309 HSRTSSSERFDIMLLIALMLQLTCWLA 335 ++ + ++LI+L L + Sbjct: 321 DAQLETVSALQKLILISLQAALQVMVL 347 >UniRef50_C6LGD4 Transposase, IS4 family protein n=3 Tax=Lachnospiraceae RepID=C6LGD4_9FIRM Length = 422 Score = 94.9 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 41/342 (11%), Positives = 103/342 (30%), Gaps = 37/342 (10%) Query: 40 TLTELGRNLPTKA---RTKHNIKRIDRLLGN-RHLHKERLAV-----YRWHASFICSGNT 90 T+ L R+L K +K++ + + +H + V Y++ A + Sbjct: 70 TVDRLSRHLAKGTPKDALKAYLKQVKKWCPDQPVIHIDDSDVVKPDGYQFEAPGWVRDGS 129 Query: 91 MPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASIL 150 ++ ++ + + V+++ + E+ +D + + Sbjct: 130 EST---KTKNVYKKGYHVTEATVLTTSNHPVSIFSEIHSSVEKDFTSINDVTFSAMERAK 186 Query: 151 PSNTTPLIVSDAGFK-VPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSS 209 V D G+ + ++ + ++ R+ K + W + L + Sbjct: 187 ALFGKATFVMDRGYDDNKMFLKLDSMKQDYVIRLTAKRRLLYHN--KWTLATELRNRRKG 244 Query: 210 HSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILAT 269 K + + K + + + S+ HP + + K + Sbjct: 245 KVKLPLFYKGKKHEAYLSHVKVQITASRKDMYPVLVYGITEHPMMLAANKAIKSKEDVI- 303 Query: 270 NLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIM-----LLI 324 ++ +Y R +IEE FR K + R + + + L + Sbjct: 304 ----------KVAKLYFSRWKIEEYFRCKKQM---FQFENFRVRRLKAINALNFYTTLCM 350 Query: 325 ALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGM 366 A + ++ + AN ++ + RL Sbjct: 351 AFLAHISMKAETNALKTAIIHT---ANPIKEKTAFCYYRLAK 389 >UniRef50_Q1J2M1 Transposase IS4 family protein n=4 Tax=Deinococcus RepID=Q1J2M1_DEIGD Length = 331 Score = 94.5 bits (233), Expect = 7e-18, Method: Composition-based stats. Identities = 45/345 (13%), Positives = 98/345 (28%), Gaps = 48/345 (13%) Query: 29 ACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSG 88 +L ++ ++ ++ A + + R +L + + Sbjct: 1 MLFGILQAESTLHRKIAAHINRTASPASITRMVARTFHETNLTPQDVQD----VLLPLLP 56 Query: 89 NTMPIVLVDWSDIR-EQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLA 147 +++D ++ + K L +L VAL + L + P + + L Sbjct: 57 PGKLTLVLDRTNWKLGAKDLNLLVLGVALGDVVLPLTWQVLPHGGNSDMRGRMLLVGLLL 116 Query: 148 SILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDM 206 LP+ ++++D F W+ + R+R D A N Sbjct: 117 KRLPARRWAVLIADREFIGQEWFNFLRDRKIKRCIRIRESTLLDDEPARN---------- 166 Query: 207 SSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI 266 +++ G R + ++ + A + Sbjct: 167 ------------------------AFQNLKPGEVRGVFERVWVYGSWMQVVATLAPQGER 202 Query: 267 LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIAL 326 + L + + Y R IE TF +K+ GL L + + R + + Sbjct: 203 V---LVASDLSLWDTLTTYRLRWAIECTFSAMKTR--GLNLEQTHMTQPNRLSRLFGLLS 257 Query: 327 MLQLTCWLAGVHAQKQGW---DKHFQANTVRNRNVLSTVRLGMEV 368 + G +Q KH + R R + + Sbjct: 258 LALAWMVRIGEWRAEQQPIPRKKHGRPAWGRARYGHELLSAALRW 302 >UniRef50_Q73IB8 Transposase, IS4 family n=9 Tax=Wolbachia RepID=Q73IB8_WOLPM Length = 442 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 43/231 (18%), Positives = 85/231 (36%), Gaps = 14/231 (6%) Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADLGAE 195 ++ + L++IL ++ L++SD G+ VP +K + ++G Y++SR + D+ Sbjct: 174 RSDQGYRKHLSNILSND---LLISDLGYFVPSSFKQINEIGAYFISRYKSDTNIYDVETN 230 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPK 255 L + L K I +I+ K + +R Sbjct: 231 Q---KMELLECLEDKLFLENEVLLGKEAKIRVRIICQKLTEEQSMARRRKANRLARSQGY 287 Query: 256 IYSASAKEP--WIL-ATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 S ++ W + TN+P + +Q++ IY R QIE F+ KS + L + Sbjct: 288 TSSKRNQKLLNWSIFITNVPENKISAEQVLTIYRVRWQIELLFKLYKSH---IRLDKLKG 344 Query: 313 SSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHF-QANTVRNRNVLSTV 362 + + + + G K+ + +A R V+ Sbjct: 345 KPCRVLCELYAKLCAILIFHGIVGCTEVKKNTELSLTKAFIELKRRVIELF 395 >UniRef50_B7GET6 Transposase n=2 Tax=Bacillaceae RepID=B7GET6_ANOFW Length = 417 Score = 92.6 bits (228), Expect = 3e-17, Method: Composition-based stats. Identities = 47/223 (21%), Positives = 76/223 (34%), Gaps = 11/223 (4%) Query: 144 ADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPIS-- 201 +ILP++ I F V ++ G Y+++R+R ++ WK Sbjct: 176 HAQHTILPNDLC--IRDLGFFSVAALTEIDARGAYYITRLRSDMKVYIKENSQWKEWDWE 233 Query: 202 ----NLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIY 257 L + S + + P L + + R R + Sbjct: 234 SLGNQLKEGESVEMEHVYIGHERLYIPRLIFRRLTEEEWQKRMAYVRKREKRKGKALTRQ 293 Query: 258 SASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSER 317 + K+ IL TNLP E +Q+ +YS R QIE F+ KS L + ER Sbjct: 294 TLEQKKYHILLTNLPQESFDGQQVYELYSLRWQIELLFKAWKS---VFDLEKVKKMKKER 350 Query: 318 FDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLS 360 F+ + L+ L A+ F RN+ S Sbjct: 351 FECHVYGTLIAILVTQTFLFQARTYWQQTAFAPIQGRNQPFFS 393 >UniRef50_C3M9W9 Modified transposase for insertion sequence NGRIS-18c n=7 Tax=Rhizobiales RepID=C3M9W9_RHISN Length = 445 Score = 91.1 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 62/335 (18%), Positives = 108/335 (32%), Gaps = 53/335 (15%) Query: 31 HALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNT 90 ALL L T R L + ++ R R LGN + E + + Sbjct: 10 AALLQGMVLRETVCLRRLAAGSHSQEI--RFGRFLGNDAVTVEWIIAGWGEPTGAAVAGR 67 Query: 91 MPIVLVDWSDIREQKR----------------LMVLRASVA-----------LHGRSVTL 123 + L D S+IR Q ++L +A + GR T Sbjct: 68 HVLALQDTSEIRFQTTPDNRRDLGKIKKGNCWGLLLHPMLALDAETGSCLGLVGGRVWTR 127 Query: 124 YEKAFPLSEQCSKKAHD-----QFLADLASILPSNTTPLIVSDAG--FKVPWYKSVEKLG 176 +A P A + + ++L S T ++D F V W + E+ Sbjct: 128 GTEALPPHASRPLSAKESRRWVETAEAAKAVLASATRVTAITDREGDFFVMWARLPEEC- 186 Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 ++ LSRV + G +++ S+T+ + + L + Sbjct: 187 FHLLSRVMHDHALSGGGTLR----RAAAEVAFCDSRTVELRERADRPARQADLSLRFGEA 242 Query: 237 KGRKNQRSTRTHCHHPSPKIY---------SASAKEPWILATNLPVEIR-TPKQLVNIYS 286 R+ Q + + W++ T V Q+V Y Sbjct: 243 TIRRPQNLEAAALPDGVTLRWVEVVEPSPPAGVEPLSWLILTTHAVATFADAWQIVAWYK 302 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIM 321 +R IE+ FR +K GL + S+ S+ R + + Sbjct: 303 QRWVIEQFFRVMKQQ--GLKVEDSQLQSAARLEKL 335 >UniRef50_A6TN04 Transposase, IS4 family protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TN04_ALKMQ Length = 454 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 88/228 (38%), Gaps = 12/228 (5%) Query: 113 SVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAG-FKVPWYKS 171 S+ + G + + L + A + L +++ L+++D G F +++ Sbjct: 148 SLKVQGIYSLIPARFSSLEITKAPGADTTYNDKLLAMVNPGE--LLITDLGYFSKAFFEK 205 Query: 172 VEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILL 231 + G Y+L+R++ + + + + T + + + C+ + Sbjct: 206 LSTKGSYYLTRIKKNSIVYVEKSGQLTKVDLTDLLKGTVVDTEVFLGIAHKKQLKCRFVA 265 Query: 232 YKSRSKGRKNQRSTRTHCHHPSPKIYSASAKE--PW-ILATNLPVEIRTPKQLVNIYSKR 288 + K +R K SA E W I+ TN+ + +P+ ++Y R Sbjct: 266 IRLPEKVVNQRRRKANQQAKAQGKQLSAKETELLAWNIIVTNVTKDKLSPEAACDLYRAR 325 Query: 289 MQIEETFRDLKSPAYGLGLRHSRTSSSERFDIML---LIALMLQLTCW 333 QIE F+ LKS L + + + + ++ LIA++ + + Sbjct: 326 WQIELVFKSLKSY---LNIDKIGSCGKYQLECLIYGRLIAVVAMFSLY 370 >UniRef50_Q1QFL8 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QFL8_NITHX Length = 191 Score = 89.5 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 32/114 (28%), Positives = 53/114 (46%), Gaps = 3/114 (2%) Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 +++ K W LA + T +++ N Y++R IE FRD K +G+GL R + Sbjct: 46 VHARDMKAAWCLAAS--NAEATAREITNHYARRWTIEPGFRDTKDLRFGMGLGVLRIADP 103 Query: 316 ERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVL 369 +R D +LL+ + L G + G D+H + T + R S R G + Sbjct: 104 QRRDRLLLLNAFAIVLLTLLGAAGESLGMDRHLKVATAK-RRTHSLFRQGCMLY 156 >UniRef50_Q6ZER7 Putative uncharacterized protein sll5063 n=1 Tax=Synechocystis sp. PCC 6803 RepID=Q6ZER7_SYNY3 Length = 217 Score = 88.8 bits (218), Expect = 4e-16, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 68/183 (37%), Gaps = 7/183 (3%) Query: 30 CHALLDCKTLTLTELGRNLPTKA-RTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSG 88 ALL ++LT +P + + + +R+ R L N L+ RL A+ Sbjct: 1 MIALLQRGEVSLTLWLPYIPCRGVQAQSKQRRLSRWLHNSRLNVHRLYKSLIQAALADWQ 60 Query: 89 NTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQC-SKKAHDQFLADLA 147 + + +D S ++R +V GR++ + + + + + + L A Sbjct: 61 EEILYLSLDTSLFW--DEYCLVRLAVVYRGRALPVVWRVLKHRSASIAFRQYWEMLYQAA 118 Query: 148 SILPSNTTPLIVSDAGFK--VPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHD 205 + L ++++D GF LGW++ R++ G W ++H Sbjct: 119 NRLSQGVKVVLLADRGFIHTDAMTAVTTHLGWHYRIRLKRNTWIWRAGHG-WCQFKDIHL 177 Query: 206 MSS 208 Sbjct: 178 QRG 180 >UniRef50_A2RJ55 Putative transposase n=7 Tax=Lactobacillales RepID=A2RJ55_LACLM Length = 439 Score = 88.8 bits (218), Expect = 4e-16, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 73/210 (34%), Gaps = 8/210 (3%) Query: 157 LIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 +I++D G++ Y+ ++K G +L R + L + P D + L Sbjct: 196 IIIADRGYESFNVYEHIKKSGQKFLIRAKDTKSNGLLNGLD-LPSDGTFDKKI--TLQLT 252 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 ++ K L+K + RS T+ + L TNL + Sbjct: 253 RRQTNKVKKDKHYHFLHKRANFDYLPIRSKETYPISLRVVRIKLNEDTYESLVTNLDPFL 312 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE-RFDIMLLIALMLQLTCWL 334 T + L +Y R IE +FR+LK Y LGL H + + + +M + + Sbjct: 313 FTSEDLKVLYHLRWGIETSFRELK---YALGLSHFHSKKLDFIIQEIFARLIMYNFSMTI 369 Query: 335 AGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 +Q N + + L Sbjct: 370 TLAVVLSNRLKHSYQINFTQAFGICRRFFL 399 >UniRef50_C6YRC6 Transposase ISFtu5 n=6 Tax=Francisella tularensis RepID=C6YRC6_FRATT Length = 169 Score = 87.2 bits (214), Expect = 9e-16, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 57/163 (34%), Gaps = 5/163 (3%) Query: 4 LDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDR 63 ++ L SL + H ++ + LL +T+ L+++ + + N +RI Sbjct: 8 VNQLKRSLSKIFNW-HKSSIDCFSNILLGLLSAQTINLSKIAIHFKNNNKIASNYRRIQS 66 Query: 64 LLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIRE-QKRLMVLRASVALHGRSVT 122 L + + + A+ ++ S + + D ++ + + + +L S ++ Sbjct: 67 FLKDMKIDFD--AITEFNVRQPLSQKSKFNFIPDRTNWQFVKNNINILVFSAVYEVIAIP 124 Query: 123 LYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFK 165 LY K + + + I+ D F Sbjct: 125 LYYK-LDHRGNSDSQTRIDLVKKFVNKFGKECIGSILGDREFG 166 >UniRef50_A7MYH1 Putative uncharacterized protein n=4 Tax=Vibrio RepID=A7MYH1_VIBHB Length = 145 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 50/148 (33%), Positives = 79/148 (53%), Gaps = 6/148 (4%) Query: 119 RSVTLYEKAFPLSEQCSKKAHDQFLADLA--SILPSNTTPLIVSDAGFKVPWYKSVEKLG 176 R + + ++ E + H + L+ L + + TP+IVSDAGF+ W++ V G Sbjct: 2 RDIQILQQTI---ENQCPEIHKKRLSSLILATKTVNGCTPIIVSDAGFRNTWFRQVANKG 58 Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 W+WL RVRG+V G ++W+ + ++ + LG +L K +P+ C LYKS Sbjct: 59 WFWLGRVRGEVSI-KCGEDSWQWNKTFYPQATDKPQFLGESQLAKRSPLECFAYLYKSHP 117 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEP 264 KGRK R +RT H + K++ AKEP Sbjct: 118 KGRKAHRHSRTCQKHSAGKVFHKGAKEP 145 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 38/95 (40%), Gaps = 6/95 (6%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M ++ IL ++ CPE+H KRL+SL LA + C + +++ G + + Sbjct: 1 MRDIQILQQTIENQCPEIHKKRLSSLILATKTVNGCTPIIVSDAGFR---NTWFRQVANK 57 Query: 61 IDRLLGNRHLHKER---LAVYRWHASFICSGNTMP 92 LG ++W+ +F P Sbjct: 58 GWFWLGRVRGEVSIKCGEDSWQWNKTFYPQATDKP 92 >UniRef50_UPI00016C424B Transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C424B Length = 472 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 65/410 (15%), Positives = 130/410 (31%), Gaps = 67/410 (16%) Query: 35 DCKTLTLTELGRN----LPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNT 90 + + L ELG LP K + + R + + + H + + T Sbjct: 34 SARLVALAELGDRPQGTLPNKIPDPYQLDAAYRFFRTEQVTPDAIQQPHRHHTRVELDRT 93 Query: 91 MPIVLV--DWSDI----------------REQKRLMVLRASVALHGRSVTLYEKAF---- 128 +VL+ D +++ +++ + +V GR + L + Sbjct: 94 EDVVLIAHDGTELNFTGLGVPELGVLAGPKQRGFVAHNSLAVTASGRILGLLHQILFTPR 153 Query: 129 ------PLSEQCSKKAHDQFLAD----LASILPSNTTPLIVSDAGFKVPWYK-SVEKLGW 177 P SE+ L P+ + V+D G V ++ + Sbjct: 154 NASRKAPKSERRHDPHKASVLWRDALEAIGPAPAGKRWVHVADRGADVTEFRDYAHENRM 213 Query: 178 YWLSRVRGKVQYA-DLGAENWKPIS-----NLHDMSSSHSKTLG--YKRLTKSNPISCQI 229 ++ RV A W ++ +G R + ++ + Sbjct: 214 EYVVRVNHNRNVTVLDEAGEWTVAKLHDTLQCQPALGRRTQEVGTQKGRTGGTATVAVRA 273 Query: 230 LLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP-----WILATNLP-VEIRTPKQLVN 283 L R+ + +++ W+L TN+P ++ + + Sbjct: 274 LTLSLIPPRPPRGRARGVPLLVTAIRVWEVDPPAGEKPLEWLLVTNVPGADVASAWARAD 333 Query: 284 IYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQ--- 340 Y+KR ++EE + LK+ GLGL + ++ L + ++ + + A+ Sbjct: 334 WYAKRWRVEEYHKSLKT---GLGLEELQLTTKVGLQNALSLLSVVAVGLVMLRELARDPV 390 Query: 341 --KQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLL 388 Q D Q + V VLS R H T +D + A L Sbjct: 391 TAAQPIDGWVQRSWVE---VLSQWR-----HDHGEQLATVKDWVWALARL 432 >UniRef50_C4ILZ9 Putative iso-IS10R ORF n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ILZ9_CLOBU Length = 174 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 42/174 (24%), Positives = 87/174 (50%), Gaps = 14/174 (8%) Query: 18 LHLKRLNSLTLACHALLDCKTLTLTELGRNLP---TKARTKHNIKRIDRLLGNRHLHKE- 73 L RLN+L ++ +++ L+ + + L ++ + IKRI L N+ + +E Sbjct: 4 LKSTRLNNLVAVIIGIIVSRSVILSNISQGLKDCYSRGNEESKIKRIQSFLNNKDIDQES 63 Query: 74 --RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLS 131 VY+ S+ N + ++ D + I + R ++L+ S+ + R+V L+ K F Sbjct: 64 TYEFFVYKLLKSYKSKSNRINVIF-DHTTI--EDRFVILQFSLKIGKRAVPLWYKVFKYK 120 Query: 132 EQCS--KKAHDQFLADLASILPS-NTTPLIVSDAGF-KVPWYKSVEK-LGWYWL 180 EQ + K ++ L L +L + N ++++D GF + +K +++ LGW ++ Sbjct: 121 EQGNKDFKHVNEGLIFLHKVLKNYNYNVVLLADRGFKSIDLFKFIDETLGWNYV 174 >UniRef50_Q1AUS1 Putative uncharacterized protein n=4 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AUS1_RUBXD Length = 248 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 77/213 (36%), Gaps = 21/213 (9%) Query: 35 DCKTLTLTELGRNLPTKARTK---------HNIKRIDRLLGNRHLHKERLAVYRWHASFI 85 L+EL R PT + H +KR+ R N + + + + Sbjct: 33 QKADPRLSELARAYPTPKERRVASPKHDLLHRLKRLWRFTDNERVDPLAVQLALVPHTVA 92 Query: 86 CSGNTMPI-VLVDWS------DIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKA 138 C G + + VDW+ E+ R +LR SV GR++ L + A+ K+ Sbjct: 93 CLGFPRLLGLAVDWTFSDTTLPSGERMRYQILRISVPRKGRALPLLQLAYNRDNLSPNKS 152 Query: 139 HDQF----LADLASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADLG 193 ++ L + LP+ P++++D GF+ + + + +++ R+R + Sbjct: 153 QNRIEQDALLAVVGALPTGVRPVVLADRGFRRASFIAWLARHHLHYVVRIRKGTCVPEAS 212 Query: 194 AENWKPISNLHDMSSSHSKTLGYKRLTKSNPIS 226 WK + L + P Sbjct: 213 GHRWKLGDEELGLGELRLVAGVRHGLYHNRPRE 245 >UniRef50_Q6MB98 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MB98_PARUW Length = 146 Score = 84.5 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 50/146 (34%), Gaps = 5/146 (3%) Query: 26 LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWH-ASF 84 +T L KT+ L+EL L +KA+ N KRI R + V Sbjct: 1 MTNLLLGLFIVKTVNLSELATVLYSKAKIDSNFKRIQRFFNWLTFLNDYQEVITDLVIII 60 Query: 85 ICSGNTMPIVLVDWSDIREQKRL-MVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFL 143 + N + +D +D + K+ +L V G S+ L + + L Sbjct: 61 LDLKNKKNDLALDRTDWKFGKKHINILTLGVNFKGISIPLAWISLGRAGNSKTLDR---L 117 Query: 144 ADLASILPSNTTPLIVSDAGFKVPWY 169 + L ++ +D F + Sbjct: 118 SVLKRVMDKIHINSFTADREFIGSEW 143 >UniRef50_Q1ARL9 Transposase, IS4 family n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1ARL9_RUBXD Length = 335 Score = 81.8 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 56/328 (17%), Positives = 108/328 (32%), Gaps = 53/328 (16%) Query: 27 TLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFIC 86 ++L+ ++ ++ + +++ N + IDR+L K + +Y + F+ Sbjct: 31 ARILWSILESRSPRKSDWSQVF-SESSEGANYRTIDRVLPKLDAKKALMRLYDPESPFVL 89 Query: 87 SGNTMP-------IVLVDW-SDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKA 138 T V SD + MV+ A GR++ + + + + Sbjct: 90 VDPTEMERPQAKKTDYVGRLSDGKSLGFWMVVFAQ-PYRGRAIPFHFGIYSEATLKEQVT 148 Query: 139 -----HDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLG 193 + L ++ ++ TPLI W K++E+ W+ R+ Sbjct: 149 SRNLRWRELLWEIEELVGD--TPLIFDREFSAQAWLKALEEAQCKWVVRLNKGSGVK--- 203 Query: 194 AENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPS 253 D L K ++ R Sbjct: 204 ---------FFDELGEEIPLLIEKGEKRNIEGCY-----------------YRGETKANV 237 Query: 254 PKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTS 313 ++ KEP + N P +LV +Y +RM+IE+TFRD KS LG+ Sbjct: 238 AGVWRKGCKEPLWVMGNF----LPPDELVEVYEERMKIEQTFRDAKSL---LGMEKVMNK 290 Query: 314 SSERFDIMLLIALMLQLTCWLAGVHAQK 341 + +I L + L+ + G + Sbjct: 291 KRVQLEITLALMLLAYGLGLMVGEAVRD 318 >UniRef50_A6M1E5 Transposase, IS4 family protein n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M1E5_CLOB8 Length = 460 Score = 81.4 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 73/191 (38%), Gaps = 11/191 (5%) Query: 153 NTTPLIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQYADLG--AENWKPISNLHDMSSS 209 NT +++ D G F +K +EK ++LS+++ N++ + + + S Sbjct: 192 NTNEILLVDLGYFDKKCFKMLEKKSAFFLSKIKYNTALYKENYKKGNFEKVEMIDFLKKS 251 Query: 210 HSKTLGYKRLT-KSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKE---PW 265 Y + K N ++ K + N R R + + W Sbjct: 252 SGVIDTYLYVGMKQNNREEFRVIGKRLPEEIVNLRIRRAREKAKAQGRAPKKIDKELMSW 311 Query: 266 ILA-TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLI 324 ++ TN+ E L++IY R QIE F+ KS + H +++ + + +L Sbjct: 312 VIMITNIEKEQADVDMLLDIYRLRWQIELLFKCWKSYG---KIDHVKSAGIDYLNCLLYG 368 Query: 325 ALMLQLTCWLA 335 L++ L Sbjct: 369 RLIITLLINTV 379 >UniRef50_B0NXD2 Putative uncharacterized protein n=5 Tax=Clostridium sp. SS2/1 RepID=B0NXD2_9CLOT Length = 439 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 66/196 (33%), Gaps = 6/196 (3%) Query: 155 TPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKT 213 L ++D G++ ++ G Y+L R R + P D + ++ T Sbjct: 190 KALFIADRGYESYLLMAQIQHDGNYFLIRAREDFGQGSMIKGYPFPRDGTFDKTVTYIYT 249 Query: 214 LGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPV 273 + TK+NP + + ++ + + KE L TNLP Sbjct: 250 KTQNKRTKANPELYKRVATRNSPYFINKEHPYVKMTLRFVMIVLPNGQKE--CLITNLPA 307 Query: 274 EIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCW 333 + L +Y R +IE +FR +K Y L + E + ++ Sbjct: 308 NKFPSETLKKLYCIRWKIETSFRLIK---YSANLLEFHSKKIEFLQQEIWAKMIFYNFTT 364 Query: 334 LAGVHAQKQGWDKHFQ 349 H + + +Q Sbjct: 365 TITQHLRYKRDRGKYQ 380 >UniRef50_A5WBL3 Transposase, IS4 family n=2 Tax=Bacteria RepID=A5WBL3_PSYWF Length = 427 Score = 79.9 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 77/285 (27%), Gaps = 28/285 (9%) Query: 131 SEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYA 190 ++Q + L I+ + ++ + +++RV+ + Sbjct: 147 AKQTHLNELTTRIEYLEQQGFDKPLIHIIDREADSAYQMRQWDEHDYKFITRVKAGSYLS 206 Query: 191 DLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKS---------RSKGRKN 241 G S + + K + +++L +S K Sbjct: 207 YEGKSQRCSQIAGQLNFSYQRQVNYKGKAAKQYIATAKVVLTRSAKPQAIDPATGKRIAP 266 Query: 242 QRSTRTHCHHPSPKIYSASAK--EPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 + +IY K W L +NL + Y R QIE F+ LK Sbjct: 267 IKGKPLSLLLTVSRIYDDQDKRLATWYLLSNLQEPSVNGADISQWYYWRWQIESYFKLLK 326 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 S GL L S + + LLIA W T + Sbjct: 327 SA--GLQLESWLQQSGDAYFKRLLIASQACTLVW-------------RIMQKTDKQSKEF 371 Query: 360 STV--RLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLGKL 402 + RL ++ S L N + Y KL Sbjct: 372 ALFLVRLSGRQMKRSKPITAPAVMAGLFQWLNFNELVNHYSPDKL 416 >UniRef50_A8YLR7 Genome sequencing data, contig C326 n=5 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YLR7_MICAE Length = 132 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 19/128 (14%), Positives = 44/128 (34%), Gaps = 5/128 (3%) Query: 2 CELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA-RTKHNIKR 60 ++ L Q + + L L+ ALL ++L + ++A + +R Sbjct: 6 RIFSQVYSYLEQGSRFVDKRHLTVLSWMVTALLSSQSLNQARWEPFVQSRAEQANSYQRR 65 Query: 61 IDRLLGNRHLHKERLAVYRWH--ASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHG 118 +R N + E++ + + +D + + + + +V G Sbjct: 66 WNRFCQNGRVAVEKIYIPLILKAIETWKEKGERLYLAIDTTLLW--NQYCFVYLAVVCGG 123 Query: 119 RSVTLYEK 126 R+V L Sbjct: 124 RAVPLMWM 131 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 76/205 (37%), Gaps = 14/205 (6%) Query: 147 ASILPSNTTP---LIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISN 202 S LP+ LI+ D GF W + +++ G +++SRV+ + + ++ Sbjct: 177 RSQLPTGEWVADALILLDLGFYDFWLFDRIDQNGGWFVSRVKDNANFEIVEELRTWRGNS 236 Query: 203 LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAK 262 + S L + I +I L R +G + + Sbjct: 237 IPLEGESLQAVLDDLQ---RQEIDVRITLSFERKRGSGASATRTFRLVGLRNEETE---- 289 Query: 263 EPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIML 322 E + TNL + + + +Y R ++E F++LKS GL T+ + + ++ Sbjct: 290 EYHLYLTNLGNDDYSAPDIAQLYRARWEVELLFKELKSR---FGLDEINTTDAYIIEALI 346 Query: 323 LIALMLQLTCWLAGVHAQKQGWDKH 347 ++A + + + + + Sbjct: 347 IMAAISLMMSRVIVDELRSLEARQR 371 >UniRef50_B0JGV7 Putative uncharacterized protein n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JGV7_MICAN Length = 141 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 45/121 (37%), Gaps = 7/121 (5%) Query: 18 LHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAV 77 +H ++++L L AL+ KT+ L+EL KA + N KR+ R N L +A Sbjct: 12 IHQNKIDALLL---ALIKVKTVNLSELAVGFGGKALKESNYKRLQRFFRNFELDYSEIA- 67 Query: 78 YRWHASFICSGNTMPIVLVDWSDIREQKR-LMVLRASVALHGRSVTLYEKAFPLSEQCSK 136 ++ +D + + +L + G ++ + + Sbjct: 68 --KIVVGWLKLPQPWVLSLDRTTWELGEHCYNILTVGIVHEGVAIPILWWLLKKKGNSNS 125 Query: 137 K 137 + Sbjct: 126 E 126 >UniRef50_B7C7E2 Putative uncharacterized protein n=3 Tax=Erysipelotrichaceae RepID=B7C7E2_9FIRM Length = 446 Score = 79.1 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 83/245 (33%), Gaps = 18/245 (7%) Query: 132 EQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYA 190 Q + D + + ++D G+ + ++ + G +L RV+ + Sbjct: 178 GQAVQDERDALNKMVERY--KGDKAIFIADRGYESINSFEKIHLSGNKYLVRVK-DIHST 234 Query: 191 DLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH 250 + + + D+ + T K++P + + R ++ C Sbjct: 235 GMLRSFGPFLDDEFDLIVKRTLTTKQTNEIKAHPEIYKFVPQNQRFDYFEDAPFYDFECR 294 Query: 251 HPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHS 310 KI + + TNL + + + +Y R +IE ++R+LK Y L L Sbjct: 295 VVRFKI---TEDTYECIVTNLDKNEFSMQDIKELYHLRWEIETSYRELK---YDLDLNTL 348 Query: 311 RTSSSERFD-IMLLIALMLQLTCWLA-GVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEV 368 + + + ++ + G+ K+ +Q N V+ + E Sbjct: 349 HSKKRNLIEQEIYAKMILYNFCSRITNGIDIAKRKRKYEYQLNFVQG------FHIIREH 402 Query: 369 LRHSG 373 L+ + Sbjct: 403 LKKAK 407 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 78.0 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 70/212 (33%), Gaps = 11/212 (5%) Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADL 192 + + H + + + ++++D G++ + GW +L R++ V + Sbjct: 167 STYQEHRACIQMIERVTLD--KVILIADRGYENYNIMSHAIEKGWKFLIRIK-DVHSNGI 223 Query: 193 GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHP 252 + P + + DM + T + K + ++ + Sbjct: 224 ASGLELPQTAVFDMDINLILTRNQTKSKKQAGYK-FMPTVQTFDYLPIGSKEDYPISFRI 282 Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 + + + E + TNL + ++L +Y R IE +FR+LK Y +GL Sbjct: 283 ARFKIADDSYE--TVITNLDRFCFSAEKLKELYHLRWGIETSFRELK---YAIGLTSFHA 337 Query: 313 SSSERF-DIMLLIALMLQLTCWLAGVHAQKQG 343 + + + + + Sbjct: 338 KKVDYIKQEIFARLALYNYCELITTYVVEHTE 369 >UniRef50_UPI0001BC4BB6 transposase n=2 Tax=Neisseria mucosa ATCC 25996 RepID=UPI0001BC4BB6 Length = 403 Score = 77.2 bits (188), Expect = 9e-13, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 73/227 (32%), Gaps = 22/227 (9%) Query: 114 VALHGRSVTLYEKAFPLSEQCSKKAH---DQFLADLAS--ILPSNTTPLIVSDAGF-KVP 167 V+ +GR L + + D+ I+ V D G+ Sbjct: 140 VSSNGRISGLKVHVLMNHANGCPTVQSITEASVNDIDQRHIVQPEKGATYVFDKGYCDYN 199 Query: 168 WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISC 227 W+ +++ G Y+++R++ + S + + + K+ PI Sbjct: 200 WWAELDRAGAYFVTRLKANAAVEVIEQ---------FSPSETQNAHENSRNDNKNTPILT 250 Query: 228 QILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSK 287 + R H E +L +N + +++ Y + Sbjct: 251 DEYIRFKHKSNST--RPNHYHNKTLRRITVEREGTEALVLVSN--NLTASAQEIAENYKR 306 Query: 288 RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWL 334 R QIE F+ LK L L+ S+ + LL A+M L L Sbjct: 307 RWQIELLFKWLKQH---LKLKRFLGRSANAVKLQLLCAMMAYLLLKL 350 >UniRef50_Q1J3A6 IS1 related protein n=4 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J3A6_DEIGD Length = 219 Score = 76.4 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 47/139 (33%), Gaps = 9/139 (6%) Query: 232 YKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQI 291 Y+ G TH + + ++ + L ++ + Y+ R Sbjct: 67 YQKLKPGEVRVWYKPTHVYGVTLRVLACQNVHGQTLFLAYQGH---AEKALKRYALRWTA 123 Query: 292 EETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQAN 351 E + LKS G L + + R +L + + + C L G Q++ + + Sbjct: 124 ENMHQALKSR--GFFLESTHLTDPSRVSTLLAVVALAFVWCCLVGEFEQQRDPSRCLRHG 181 Query: 352 TVRNRNVLSTVRLGMEVLR 370 S R G++ LR Sbjct: 182 YPPK----SLFRRGLDALR 196 >UniRef50_A9AZS8 Transposase IS4 family protein n=3 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AZS8_HERA2 Length = 442 Score = 76.4 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 81/219 (36%), Gaps = 11/219 (5%) Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAE 195 +A DQ L+ + LP+ + L ++D GF ++ + YWLSRV+ + G + Sbjct: 171 RASDQVLSVQRAPLPAGS--LRLADLGFYNIRIFRELAAAEVYWLSRVQSHSRIRLPGQK 228 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK--GRKNQRSTRTHCHHPS 253 + I + G + ++ ++L+ + ++ QR Sbjct: 229 E-QSILEVVTGLGDADHWEGTVLVGSKERLAARLLVQRVPDAVAAQRRQRVQDEAHDKCR 287 Query: 254 PKIYSASAKEPW-ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 P +A W ++ TN P + + + + R QIE F+ KS + RT Sbjct: 288 PVSNAAMDLAAWTVVITNAPEDKLGLTEAMVLLKMRWQIELLFKLWKSHG---HVDEWRT 344 Query: 313 SSSERFDIMLLIALMLQLT-CWLAGVHAQKQGWDKHFQA 350 R + L+ + W+ A F+A Sbjct: 345 KKPARILCEIYAKLLGLVFQQWILVASAWDTAERSLFKA 383 >UniRef50_Q1PXV1 Putative uncharacterized protein n=3 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PXV1_9BACT Length = 449 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 43/194 (22%), Positives = 69/194 (35%), Gaps = 19/194 (9%) Query: 153 NTTPLIVSDAG-FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHS 211 L++ D G F + +E G Y+LSR + P + D+ S Sbjct: 183 GERDLLLRDLGYFDLSVLGDIEGKGAYYLSRFFKSTKVYLSAD----PGAEAIDLVSYVK 238 Query: 212 KTLGYKRLT------KSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPW 265 K +G K L I +++ Y++ +R S K S E W Sbjct: 239 KHIGNKGLADMEVYLGEERICSRLIAYRAPGHVINERRRKAKRAVQKSGKTLSREYLE-W 297 Query: 266 I----LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIM 321 + TN+ EI +P+ + IY R QIE F+ K + R + ER + Sbjct: 298 LDYSFYITNVGAEIWSPEVVGTIYRIRWQIELVFKQWKQL---FRMDVMRGTREERIRCL 354 Query: 322 LLIALMLQLTCWLA 335 L L++ Sbjct: 355 LYGRLIMICIVTRI 368 >UniRef50_C3EBZ9 IS231-related transposase n=1 Tax=Bacillus thuringiensis serovar pakistani str. T13001 RepID=C3EBZ9_BACTU Length = 221 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 68/176 (38%), Gaps = 15/176 (8%) Query: 163 GFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI---SNLHDMSSSHSKTLGYKRL 219 F +P + + + G Y+LSR+ Q ++ + + +S + + Sbjct: 45 YFYLPDFHEINQKGAYYLSRLPINTQVYRKKGILYERLYLEDFIKKVSEGKTIEWFDVYI 104 Query: 220 TKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPK 279 K + + ++++YK G + + T + IL TN+P +I + Sbjct: 105 RKQHKVPTRLIIYKLTGAGYDGKNNVSTATKYKRQVS---------ILMTNIPSDILQKE 155 Query: 280 QLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 ++ +Y+ R QIE F+ KS G+ + ERF L L L + Sbjct: 156 EIYPLYTVRGQIEILFKTWKSLC---GIHLCKHVKLERFQCHLYGQLTAILLHSML 208 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 46/258 (17%), Positives = 92/258 (35%), Gaps = 27/258 (10%) Query: 93 IVLVDWSDIREQKRLMVLRASVALHGR------------SVTLYEKAFP---LSEQCSKK 137 ++ +D S++ + +V HG S L E+ + + + + Sbjct: 73 LLAIDGSELPIDNTIFDDETTVLRHGTLAKTFSAYHLNASYDLMERTYDDIIIQGEAKRD 132 Query: 138 AHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAEN 196 H F + + ++D G++ ++ V G +L RVR ++ ++ Sbjct: 133 EHGAFCQLVDRY--DGQKAIFIADRGYESYNGFEHVVHSGHKYLIRVR-DIESQSSITKS 189 Query: 197 WKPISN-LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR-SKGRKNQRSTRTHCHHPSP 254 P + D+ S TL ++ K+ P + + R K +C Sbjct: 190 LGPFPDGEFDVDVSRMLTLKQTKMIKACPDVYKFVPKNMRFDFMNKQNPWYEFNCRVVRL 249 Query: 255 KIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 KI + + TNL + + + IY+ R E +FR+LK Y +GL Sbjct: 250 KITENT---YETVITNLSRNEFSMEDICEIYNMRWGEETSFRELK---YAIGLNALHAKK 303 Query: 315 SERFDIMLLIALMLQLTC 332 E + +++ C Sbjct: 304 RELIQQEIYARMLMYNFC 321 >UniRef50_C4YZA5 Transposase n=28 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YZA5_9RICK Length = 151 Score = 72.6 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 51/115 (44%), Gaps = 2/115 (1%) Query: 11 LYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHL 70 L Q L+L R L L ++L+ +T+ ++ L +P+ + +R+ R + + Sbjct: 22 LLQKHISLNLSRAKCLGLFIISMLNSRTVNMSLLCNRMPSGIKAASWYRRMQRFISEISI 81 Query: 71 HKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRL-MVLRASVALHGRSVTLY 124 L V + ++ +D ++ + KR +L +V+ HG ++ L+ Sbjct: 82 SWRVLPVMLVMMTGFE-QEQKWVLCLDRTNWKFGKRHINILYLAVSFHGIAIPLF 135 >UniRef50_Q46GC6 Transposase n=7 Tax=Methanosarcina RepID=Q46GC6_METBF Length = 435 Score = 71.8 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 71/205 (34%), Gaps = 15/205 (7%) Query: 156 PLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKV---QYADLGAENWKPISNLHDMSSSHSK 212 L+V +K + VE+ G Y++SR+R + + + Sbjct: 189 ILLVDLGFYKTQMFARVEENGGYFVSRIRKNMDPILVSIEEELSKTKSKEFA----GKPV 244 Query: 213 TLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWIL-ATNL 271 + K+L+ + + K K R+ + + E + + TN+ Sbjct: 245 SECIKQLSGKDID----AVVKIEFKRREYKGKQKQDEMIVRLVAVYNDEDEKYHIYITNI 300 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLT 331 +I K + N+Y R IE F++LKS L T + + + ++ A++ + Sbjct: 301 QKDILNAKDIANLYGARWDIELLFKELKSK---YSLDVLETKNVQVIEALIWTAILTLIV 357 Query: 332 CWLAGVHAQKQGWDKHFQANTVRNR 356 +K A + R Sbjct: 358 SRRIYSLVRKSTTHPEKMARYTQLR 382 >UniRef50_B7I4G0 Transposase subunit n=16 Tax=Bacteria RepID=B7I4G0_ACIB5 Length = 148 Score = 70.6 bits (171), Expect = 9e-11, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 45/116 (38%), Gaps = 9/116 (7%) Query: 279 KQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVH 338 + Y+ R +IE F LK G L ++R + R ++ + + C+L G Sbjct: 29 ANAIQDYALRWEIETLFSCLK--GRGFNLENTRLTDPRRVKKLIAVLAISFCWCYLTGEW 86 Query: 339 AQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTI----TREDSLVAATLLTQ 390 Q + + R +S R G++ ++ + + +E+ +L + Sbjct: 87 QHDQKKVIKIKKH---GRLSMSLFRYGLDYVQMAIQRLIGFGKKEEFKEILAILRR 139 >UniRef50_A5FWE3 Transposase, IS4 family protein n=2 Tax=Acidiphilium cryptum JF-5 RepID=A5FWE3_ACICJ Length = 453 Score = 70.3 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 67/207 (32%), Gaps = 9/207 (4%) Query: 139 HDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQY----ADLGA 194 + + A L + ++V+D + + + R A Sbjct: 159 WLEGASHAADRLTDAASVVVVADREGDIYAGFARRPASIEMIVRAAQDRVLDDGRRLFAA 218 Query: 195 ENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSP 254 P ++ + S+T R+ + + + + R G + T + Sbjct: 219 PEAWPELVRSEVRVAPSRTGIAARVATVALRAGTVTICRPRHGGDVGGPAHLTLTMVEAR 278 Query: 255 KIYSASAKEP--WILATNLPV-EIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSR 311 ++ P W L T + + ++V +Y R +IEE FR LKS GL L ++ Sbjct: 279 EVDWNGEGTPLLWRLLTTIETIDADGAAEIVRLYRLRWRIEEVFRSLKSD--GLRLEETQ 336 Query: 312 TSSSERFDIMLLIALMLQLTCWLAGVH 338 + R + LI L Sbjct: 337 MQDAGRLFKLALIGLAAATRTVQLVDA 363 >UniRef50_A8YMK7 Genome sequencing data, contig C327 n=1 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YMK7_MICAE Length = 438 Score = 70.3 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 44/244 (18%), Positives = 74/244 (30%), Gaps = 20/244 (8%) Query: 140 DQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKP 199 + +L L N+ ++V K P+ + ++R+R A K Sbjct: 189 IPLILELKKTLGLNSLRVVVDAYFSKAPFLSPLVDKLINVITRMRKD-AVAWDNRIENKS 247 Query: 200 ISNLHDMSSSHSKTLGYKRLTKSNPISCQILLY-KSRSKGRKNQRSTRTHCHHPSPKIYS 258 + L L + P S + +Y K + + Sbjct: 248 KKSFILDGKWKLAHL----LKEFKPQSLSVKIYGKFTQVEAVEREVYTRGFQPKVKVVVM 303 Query: 259 ASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 AKEP IL + T Q++ IY R IE RDLK GL + Sbjct: 304 KGAKEPIILMST--DITLTAIQIIEIYGSRFSIELAIRDLKQH---FGLGDYQCYLGIAI 358 Query: 319 DIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITR 378 D + +A + L Q + ++ + + ++ S R LR Sbjct: 359 DRFVQLACVAYCLFRLF----QIKEIEQSWMPKVSPSCSLFSFSR-----LRRGLQHFAI 409 Query: 379 EDSL 382 L Sbjct: 410 TQVL 413 >UniRef50_UPI0001C16BE8 Transposase, IS4 protein n=1 Tax=Cylindrospermopsis raciborskii CS-505 RepID=UPI0001C16BE8 Length = 231 Score = 69.9 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 14/150 (9%), Positives = 44/150 (29%), Gaps = 23/150 (15%) Query: 34 LDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKE--RLAVYRWHASFICSGNTM 91 + + L++L P + + + + R LG L + + ++ + + Sbjct: 30 QAYRQVKLSKLASLFPQPIKYESRKRNLQRFLGINKLCVKLLWFPLIKYWIRQSLTPQQL 89 Query: 92 ------------------PIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQ 133 +V +D + + + MV ++ ++ LY + Sbjct: 90 NREQRRYFHKKQYQKYGYWMVALDRTQWKGRNIFMV---TLVWGTHALPLYWETLNHVGN 146 Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAG 163 + + + +L ++ Sbjct: 147 SNLSTQKRLIKTAIKLLKKCRIVVLADREF 176 >UniRef50_P12249 Transposase for insertion sequence element IS231A n=411 Tax=Bacillus RepID=T231A_BACTB Length = 478 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 68/189 (35%), Gaps = 22/189 (11%) Query: 163 GFKVPWYKSVEKLGWYWLSRVRGKVQYADLG-------------AENWKPISNLHD---M 206 F + +++ G Y++SR++ + + H + Sbjct: 201 YFSLEDLDQMDQRGAYYISRLKLNHTVYIKNPSPEYFRNGTVKKQSQYIQVDLEHIMNHL 260 Query: 207 SSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAK---E 263 + + + K+ + ++++Y+ K + +R + + +S +K Sbjct: 261 KPGQTYEIKEAYIGKNQKLFTRVIIYRLTEKQIQERRKKQAYTESKKGITFSEKSKRLTG 320 Query: 264 PWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLL 323 I +N P I +Q+ + YS R QIE F+ KS + H + ER + + Sbjct: 321 INIYVSNTPEGIVPMEQIHDFYSLRWQIEIIFKTWKSL---FQIHHWQNIKQERLECHVY 377 Query: 324 IALMLQLTC 332 L+ C Sbjct: 378 GRLIAIFIC 386 >UniRef50_B8F976 Transposase IS4 family protein n=2 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F976_DESAA Length = 371 Score = 69.1 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 51/318 (16%), Positives = 100/318 (31%), Gaps = 28/318 (8%) Query: 26 LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFI 85 L + L+ +L +T + + ++ + RL R + Sbjct: 47 LRVLLIHLVQGCSLRVTS-ALSKAGGLASASDVALLKRL--KASGEWMRWMAVELMKQWF 103 Query: 86 CSGNTMPIV------LVDWSDIREQKRLMVLRASVALHGRSVTLYEKA-FPLSEQCSKKA 138 + +VD S + E G + ++ P + Sbjct: 104 GKQPEKILGMGRTVRVVDGSTVSEPGST----------GTTWKIHYSIQLPSLQCDEVYV 153 Query: 139 HDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENW 197 D + + + ++D G+ V K G + R+ + + D+ + + Sbjct: 154 TDPKTGEDLKNFNVHPGDVFLADRGYYHRTGMLHVVKGGGDLIVRMIHQYKLYDINGQEF 213 Query: 198 KPISNLHDMSSSHSKTLGYKRLTKSNPISCQIL-LYKSRSKGRKNQRSTRTHCHHPSPKI 256 I NL ++ + K IS ++ + KS+ K +R+ K Sbjct: 214 GLIKNLRSLTVNQIGDWDAFIHHKKEVISGRVCAIKKSKEAAEKAKRAILRENSKKGHKT 273 Query: 257 YSAS--AKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 + A E + T L E Q++ Y R Q+E F+ LKS +GL H + + Sbjct: 274 KPETLVAAEYVFVFTTLSRE-WKASQVLEAYRGRWQVELAFKRLKSL---IGLGHLKKTD 329 Query: 315 SERFDIMLLIALMLQLTC 332 E L + Sbjct: 330 FEGAKAWLHGKIFAAFLV 347 >UniRef50_B7CEB8 Putative uncharacterized protein n=2 Tax=Erysipelotrichaceae RepID=B7CEB8_9FIRM Length = 431 Score = 68.7 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 65/195 (33%), Gaps = 8/195 (4%) Query: 154 TTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSK 212 +I +D G++ + ++ R++ + + + P D+ + Sbjct: 182 ENSIITADRGYEKYNLIACCIENNQKFVFRIKDIDVFGSILSNLNLP-DEEFDLDVTKIL 240 Query: 213 TLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLP 272 T TK+N + KS + + + KI + L TNL Sbjct: 241 TRKQTNETKANKHKYTFISNKSEFNYFGTKEFYKMNLRVVRFKI---TDDTYECLVTNLT 297 Query: 273 VEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTC 332 + +L +Y R IE F+ LK Y +G+ + + A++L Sbjct: 298 RDEFDLNELKKMYHMRWDIETAFKVLK---YIIGMMAFHSKKRNFIQQEIYAAILLHCLT 354 Query: 333 WLAGVHAQKQGWDKH 347 + + + DK Sbjct: 355 NIITERIEVEQSDKR 369 >UniRef50_C4YUK3 Transcription-repair-coupling factor n=29 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YUK3_9RICK Length = 287 Score = 67.6 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 55/181 (30%), Gaps = 35/181 (19%) Query: 143 LADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPIS 201 + + N + +D F W + ++ RVR QY I Sbjct: 1 MECFLEVFDKNRIEALTADREFIGKEWLSWLRTNQIRYVFRVRENRQYISNARGKMVKIQ 60 Query: 202 NLH-DMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSAS 260 L ++ +L +R+ I + + Sbjct: 61 ELFRPLAIGSHVSLSQRRIGTKGEIFNVVGI----------------------------R 92 Query: 261 AKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI 320 K+ + EI+ P ++ Y++R QIE F+ KS G + ++ R D Sbjct: 93 NKKSELAVLIHSDEIKNPAEI---YAQRWQIETMFKAFKSA--GFNCEATHITNDLRLDT 147 Query: 321 M 321 + Sbjct: 148 L 148 >UniRef50_C4RAB7 Putative uncharacterized protein n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAB7_9PROT Length = 192 Score = 67.6 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 10/104 (9%), Positives = 31/104 (29%), Gaps = 2/104 (1%) Query: 88 GNTMPIVLVDWSDIREQKRL-MVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADL 146 + +D ++ + + +L + + L+ + + L + Sbjct: 4 SGKPWHLALDRTNWKFGRCHINILMLGIVHEKVCIPLFWSLRDKAGNSNAPERTALLERM 63 Query: 147 ASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQY 189 P + D F W + + G ++ R++ + Sbjct: 64 IKTFPDQPISSLSGDREFIGEKWMGWLHERGIPFVLRLKENMHV 107 >UniRef50_B6FTH4 Putative uncharacterized protein n=3 Tax=Clostridium nexile DSM 1787 RepID=B6FTH4_9CLOT Length = 224 Score = 67.6 bits (163), Expect = 9e-10, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 66/188 (35%), Gaps = 7/188 (3%) Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 Y+L RV+ + P N D T + K+NP + + KS Sbjct: 1 MYYLIRVKDGGG-GSMTGSFDLPDDNEFDHDMQLILTRKQTKDVKANPQKFKFIA-KSSP 58 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFR 296 + + + + ++ S + TNLP E +++ +Y+ R IE +FR Sbjct: 59 FDYLDLYDKKFYTLNFRVVRFAISEDSYESILTNLPKEDFPVEEIKKVYAMRWGIETSFR 118 Query: 297 DLKSPAYGLGLRHSRTSSSERF-DIMLLIALML-QLTCWLAGVHAQKQGWDKHFQANTVR 354 +LK Y +GL + E + ++ V Q++G Q N Sbjct: 119 ELK---YAIGLCCFHSKKVEYIMQEIYARLILYNYCELITMHVIIQQKGTKHVCQMNYTI 175 Query: 355 NRNVLSTV 362 ++ Sbjct: 176 AIHICRYF 183 >UniRef50_A4SUB1 IS element transposase n=8 Tax=Bacteria RepID=A4SUB1_AERS4 Length = 420 Score = 67.2 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 52/327 (15%), Positives = 98/327 (29%), Gaps = 42/327 (12%) Query: 77 VYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSK 136 + + ++L D + KR + GR T+ A S Sbjct: 104 IGQQVTDVAQGAFKQ-VLLQDGTSFAVHKR-----LATVFPGRFKTISPAAIECHMTMSL 157 Query: 137 KAHDQFLADL-------ASILPSNTTP---LIVSDAGF-KVPWYKSVEKLGWYWLSRVRG 185 L LP L+++DAG+ ++ V K G ++L R R Sbjct: 158 LEQKPLCMQLSADTASERQFLPDAKKLTGSLLLADAGYIDRAYFAEVNKAGCFYLVRGRK 217 Query: 186 KVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRST 245 W+ + L L + C+ + K S Sbjct: 218 G--LNPKILRAWRDD-------GRAVEKLTGMSLKEEGRRHCRAEVLDMDVK------SG 262 Query: 246 RTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGL 305 + + W TNL E ++++ +Y R Q+E F++ KS Sbjct: 263 KYEYRLIRRWFAEETRFCVW--MTNLARETWPAERVMRLYRCRWQVELLFKERKSYN--- 317 Query: 306 GLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLG 365 L+ T + ++ +L+ + K+G A + ++ Sbjct: 318 NLKGFVTGQKAITEGLVWDSLLSLVLKRRVAQTLVKEGLSTLKAAKS-----GMTWWLPI 372 Query: 366 MEVLRHSGYTITREDSLVAATLLTQNL 392 +E + H + RE L++N Sbjct: 373 LEAVAHRALSEIREKLEWVVDFLSKNA 399 >UniRef50_C6AUF2 Transposase IS4 family protein n=7 Tax=Rhizobium RepID=C6AUF2_RHILS Length = 372 Score = 67.2 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 67/192 (34%), Gaps = 12/192 (6%) Query: 157 LIVSDAGFKVPW-YKSVEKLGWYWLSRV-RGKVQYADLGAENWKPISNLHDMSSSHSK-- 212 ++++D + P + V G ++ R ++ E + + L + Sbjct: 170 IVLADRYYARPRDLRPVIDAGADFIVRTGWNSLRLLQTNGEPFDLFAALAAQQEQEGEVQ 229 Query: 213 ---TLGYKRLTKSNPISCQILLYKSRSK---GRKNQRSTRTHCHHPSPKIYSASAKEPWI 266 G P+ ++++ + + + + P S A + + Sbjct: 230 VRVHEGMTGTPPPPPLVLRLIVRRKDPQQAQAEQERLLKDARKRGKKPDPRSLEAAKYIL 289 Query: 267 LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIAL 326 L T+LP P ++ +Y R QIE F+ KS A GL ++ R + + Sbjct: 290 LLTSLPTATFPPADILTLYRFRWQIELAFKRFKSLA-GLDSLPAKKPELARA-WLYARLI 347 Query: 327 MLQLTCWLAGVH 338 + + +AG Sbjct: 348 VAIIAEQIAGQV 359 >UniRef50_Q7UY96 Similar to transposase n=1 Tax=Rhodopirellula baltica RepID=Q7UY96_RHOBA Length = 403 Score = 67.2 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 69/239 (28%), Gaps = 13/239 (5%) Query: 129 PLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQ 188 P + + + ++A ++ + +S G +L R + Sbjct: 118 PAHDDHHLDQLEPTMDEVAQWELPRRVVHVIDREADSLGRLRSWHAKGHLFLVRC-DDRR 176 Query: 189 YADLGAENWKPISNLHDMSSSH----SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 G N S K L + + + + LY+ S+ ++ Sbjct: 177 VRCEGRSVLLSELNDELDSQCEYADAGKALYHGKKVQRQVAEKTVTLYRPHSEVIDGEKK 236 Query: 245 TRTHCHHPSPKIYSASAKE------PWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 T ++ W L TN+P + + Y R +IE F+ L Sbjct: 237 AVTGEPIEVRTVFVRLVDADGWILAEWTLLTNVPADQANASDVGRWYYFRWRIESFFKLL 296 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRN 357 KS G L + + S E LL+A M + + + R Sbjct: 297 KSH--GQELEYWQQESGEAITKRLLMASMACVLVKQLEASESESTIKFRRHLIRLSGRR 353 >UniRef50_A7IQF9 Putative uncharacterized protein n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7IQF9_XANP2 Length = 183 Score = 67.2 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 54/154 (35%), Gaps = 12/154 (7%) Query: 218 RLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRT 277 + ++ + + R + R + AS KE ++ATN Sbjct: 8 ERPRLPASQRRLFGFLAGWACRLDSRGSVLGPPVKLAATRLAS-KELLVVATN-----TD 61 Query: 278 PKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGV 337 P+ + Y +R +IE F K+ GL L + +S ER ++ + + + G Sbjct: 62 PRIALTNYRRRWEIETLFAASKTR--GLNLEDTHITSPERIAKLIAVLAVAFIFAHATGE 119 Query: 338 HAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRH 371 + R S R+G ++LR Sbjct: 120 WS----ARHRPIIIKTHKRKAKSIFRIGFDLLRK 149 >UniRef50_Q7UPU9 Probable transposase n=2 Tax=Rhodopirellula baltica RepID=Q7UPU9_RHOBA Length = 656 Score = 66.8 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 50/253 (19%), Positives = 80/253 (31%), Gaps = 50/253 (19%) Query: 121 VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTP---LIVSDAGF-KVPWYKSVEKLG 176 +T K P S++AH + +L + P L DAGF ++KS+ G Sbjct: 274 LTWCWKLGPS--NASERAH------VQEMLENGEFPEKTLFTGDAGFVGYEFWKSIIDGG 325 Query: 177 WYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRS 236 ++L RV V + +P + Sbjct: 326 HHFLVRVGANVNLLHSLGYDVEPDED------------------------------NLVY 355 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFR 296 K++R K+ +L + L + T KQ + IY R IE FR Sbjct: 356 CWPKDKRREGMRPLKLRMIQIQLGRKKAVLLTSVLDEKKLTDKQALVIYKSRWGIELEFR 415 Query: 297 DLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 +LK G R R S R + L +++ L L + + + R Sbjct: 416 NLKQT---YGRRQLRCRQSVRALVELHWSILSILIVKLYALKVHLAKKRRRCDPVAMPGR 472 Query: 357 -----NVLSTVRL 364 V S R+ Sbjct: 473 ISFAGVVRSFQRI 485 >UniRef50_A5UVL8 Putative uncharacterized protein n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UVL8_ROSS1 Length = 185 Score = 66.8 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 70/214 (32%), Gaps = 32/214 (14%) Query: 109 VLRASVALHGRSVTLYEKAFPLSEQC-SKKAHDQFLADLASILPSNTTPLIVSDAGFKVP 167 +L V G ++ + P ++ + A + L L +P++ T L+++D G Sbjct: 1 MLALCVVDRGCAIPVAWTILPAGQKRAWRCAWFRMLRLLRPAVPASWTVLVLADCGVDAR 60 Query: 168 W-YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPIS 226 W ++ + +LGW+ R+ + G S L + + G + + Sbjct: 61 WRFRRMARLGWHPFLRINQGGTFRLAGQARCVWWSTLVGAAGRRWRGRGTAFASSDCRLD 120 Query: 227 CQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYS 286 + + S EPW++ T+ + + Sbjct: 121 GTLAAWWSDG------------------------HAEPWLVLTDRDPDGCDAPWD----A 152 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI 320 R ++ + K + + ++ ++ R Sbjct: 153 LRSWCDQRGKGAKRGGW--QWQQTQMTNPARTSR 184 >UniRef50_Q8VV93 Transposase n=1 Tax=marine psychrotrophic bacterium Mst37 RepID=Q8VV93_9GAMM Length = 423 Score = 66.8 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 65/188 (34%), Gaps = 21/188 (11%) Query: 155 TPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTL 214 L+ FK+ + +++ G ++ R + V + + + K Sbjct: 186 RLLLADRGYFKLSYLDEIDQAGGAYVVRAKTTVN---------PMVVAGFNKAGKPLKRF 236 Query: 215 GYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVE 274 + + + +G+ N R + W ATNL E Sbjct: 237 QKIKQKAVKKHIRRSGIVDMDVEGKTNYRL-------IASWPEGKDEPTYW--ATNLDRE 287 Query: 275 IRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWL 334 + ++++ +Y R QIE F++ KS L+ T + + ++ +L+ L Sbjct: 288 QFSAEKVMKLYQLRWQIELLFKEWKSYC---NLQKFNTRKATMMEGLVWSSLLSLLVKRR 344 Query: 335 AGVHAQKQ 342 G+ Q+ Sbjct: 345 IGLSVQQL 352 >UniRef50_Q74P20 IS231-related transposase n=15 Tax=Bacillus RepID=Q74P20_BACC1 Length = 460 Score = 66.4 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 63/198 (31%), Gaps = 11/198 (5%) Query: 163 GFKVPWYKSVEKLGWYWLSRVRGKVQYA-DLGAENWKPI---SNLHDMSSSHSKTLGYKR 218 F + + + +++SR+R Q W + D+S L Sbjct: 204 YFSIYDLEKIADRKAFYVSRIRWNTQVYQKEKGGKWTLLDLEKLTKDLSEGQILELPEIY 263 Query: 219 LTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTP 278 + ++++Y+ + PK S +L TN+ + Sbjct: 264 IGLHQKHKTRLVIYRLTQTEWTKRLEHHKKAKKKMPKYASRIN----LLITNVSSKHLPH 319 Query: 279 KQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVH 338 ++ +YS R QIE F+ KS + + ERF L L+ Sbjct: 320 NEVYELYSLRWQIEIIFKTWKSI---FKIHEVKPVKLERFQCHLYGQLIGLCLVASITYR 376 Query: 339 AQKQGWDKHFQANTVRNR 356 ++ W+K + + Sbjct: 377 MRRLIWEKKQKEVSEYKC 394 >UniRef50_C0VKT5 IS4 family transposase ORF 2 n=5 Tax=Acinetobacter RepID=C0VKT5_9GAMM Length = 178 Score = 66.4 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 61/164 (37%), Gaps = 12/164 (7%) Query: 231 LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 L++ G+ R R I + ++ +L P+ + + Y+ R + Sbjct: 14 LFRHLQVGQTECRKRRIWVGRVKLYISALRLEDGELLLVVSPMFNASA---IRDYALRWE 70 Query: 291 IEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA 350 IE F LK G L ++R + R ++ + + C+L G + + Sbjct: 71 IETLFSCLK--GRGFNLENTRLTDPGRVKKLIAVLAIGFCWCYLTGEWQHDRKKAIKIKK 128 Query: 351 NTVRNRNVLSTVRLGMEVLRHSGYTI----TREDSLVAATLLTQ 390 + R +S R G++ ++ + + +E+ +L + Sbjct: 129 H---GRLSVSLFRYGLDYVQMAILRLIGFGKKEEFKKVLAILRK 169 >UniRef50_UPI00016AD9A8 transposase Tn5 n=1 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016AD9A8 Length = 460 Score = 66.4 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 40/349 (11%), Positives = 80/349 (22%), Gaps = 55/349 (15%) Query: 40 TLTELGRNLPTK-------ARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMP 92 L L R L + +K R N + + + H + + Sbjct: 28 RLVALARRLACSPQCSFPQSLKAAELKAAYRFFDNAQVDTDG--ILAPHITQTLNRMAEV 85 Query: 93 IVLV---DWSDIR-------------------EQKRLMVLRASVALHGRSVTLY------ 124 V++ D ++ ++ +M +V G + L Sbjct: 86 PVVLAVQDTTEFNLSHLPATEGLGYGSRNSIHQRGFMMHSLLAVTPEGLPLGLLGMKTWV 145 Query: 125 -------------EKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKS 171 + E LA L + + Sbjct: 146 RPDEGFGKKHQRKTRPIHEKESAKWIEGIAHLAALKKRCAEPRFVCVGDRESDLYELFTI 205 Query: 172 VEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSS----HSKTLGYKRLTKSNPISC 227 G WL R + W+ + + + +R Sbjct: 206 ERPAGVDWLIRAAVNRRACHPEGYLWEAVQATVPLGRTELLVPGGHNFPQRTAGLTLRCA 265 Query: 228 QILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRT-PKQLVNIYS 286 + L R + R + H W+L +++ + + Y+ Sbjct: 266 TVRLQPPRGRARGLPKVDVFAIHAIEDAPPDGVEPIEWMLLSSVETTTFDDALERLAWYA 325 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 +R IE R LKS + R + + L + Sbjct: 326 RRWTIESWHRVLKSGCQVEARQFGSLDRFVRATALFAVISWRILYATML 374 >UniRef50_C0BDH6 Putative uncharacterized protein n=2 Tax=Coprococcus comes ATCC 27758 RepID=C0BDH6_9FIRM Length = 204 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 64/184 (34%), Gaps = 6/184 (3%) Query: 149 ILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 +L + I V + G Y+L R + + L P + D++ Sbjct: 25 VLDGIKSVYIGDRGYCSYNNMAHVVEQGQYFLFRTK-DIHSKGLVGNFNFPDAESFDINV 83 Query: 209 SH--SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI 266 S ++ K L + + +S + S T+ + S Sbjct: 84 SVILVRSHSKKILADIHTEGYIRFVDQSAAFDYIEYGSYDTYELSFRILRFPISTSTYEC 143 Query: 267 LATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIAL 326 + TNLP + +++ +Y+ R IE +FR LK Y +GL + E + L Sbjct: 144 IVTNLPRDEFPVERIKTLYNARWSIESSFRKLK---YTIGLSNFHAYKPEYVKQEIWARL 200 Query: 327 MLQL 330 + L Sbjct: 201 LASL 204 >UniRef50_A1RNX9 Transposase, Tn5 family n=93 Tax=Gammaproteobacteria RepID=A1RNX9_SHESW Length = 461 Score = 65.6 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 56/422 (13%), Positives = 123/422 (29%), Gaps = 51/422 (12%) Query: 8 HDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGN 67 L+Q +R+ L L +G+++P ++T +I+ R + N Sbjct: 11 AQHLFQHASLGDARRVKRLIALSATLA-------AHMGKSVPQASQTTADIEAAYRFIRN 63 Query: 68 RHLHKERLAVYRWHAS-FICSGNTMPIVLVDWSDIR-------------------EQKRL 107 + + +A + A+ + L D S + + Sbjct: 64 EAISADAIAEAGFMATKQEALAFDTLLALEDTSSLNFSHKGVRDELGHITSHLSSRGFQA 123 Query: 108 MVLRASVALHGRSVTLYEKAF-----------------PLSEQCSKKAHDQFLADLASIL 150 + H + + L E+ P E+ S K ++A+ L Sbjct: 124 HSVLLYAPSHNQVIGLIEQHLWTRDIATMGKHKNATKRPYWEKESFKWQQAS-QNVANRL 182 Query: 151 PSN--TTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 + + + + + + ++ R + + +L + Sbjct: 183 GQHMAHAVSVCDREADLIEYLQYKLEHQQRFVVRSMISRHIEEAEGKLHLYGHSLQS-AG 241 Query: 209 SHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILA 268 + K K+ +C++ K N+ T + S W L Sbjct: 242 ERMVEVVQKGGRKARVATCEVRFAPVTLKMPSNKVGESTPLFYVSCIEKGNDDGLCWHLL 301 Query: 269 TNLPVE-IRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALM 327 T+ PV ++ Y KR IE+ + KS G + R S + M+ I Sbjct: 302 TSEPVTRTEQALTILCWYEKRWLIEDFHKSWKSG--GTQVEDLRLQSKGNLERMITILAF 359 Query: 328 LQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATL 387 + + ++ K T+ + L +E + + + ++ Sbjct: 360 IAVRVQQLRHLGLQEEQAKQQSCETLLGCKAWKLLWLKVEQCKPPKQAPSVHWAYLSLGK 419 Query: 388 LT 389 L Sbjct: 420 LA 421 >UniRef50_Q1QJQ9 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QJQ9_NITHX Length = 152 Score = 65.3 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 44/112 (39%), Gaps = 5/112 (4%) Query: 45 GRNLPTKART-KHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIRE 103 PT+ KH IK++DRLL N+ + ++ + I +V +DW+D Sbjct: 10 ATRWPTRGLLGKHAIKQVDRLLSNQGIVV--WDMFAAWVTQIVGQRKAIVVAMDWTDFDA 67 Query: 104 QKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQ--FLADLASILPSN 153 + + + HGR+ L E + + LA LA LP Sbjct: 68 DDQTTLALNLASNHGRATPLLWLTVLKDELKDSRNDFEDLCLARLAESLPDG 119 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 64.5 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 72/204 (35%), Gaps = 6/204 (2%) Query: 133 QCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQY-AD 191 + H ++ + + I+ ++ WY++ R + + Sbjct: 114 KKGMNEHKALVSMVDQSEINGNVIAIMDRGYESFNNIAHFQEKSWYYIIRAKESYGIISR 173 Query: 192 LGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHH 251 L ++ ++ + +T L K+ P + + + + ++ + H Sbjct: 174 LSLPDYPEYDEEIMLTLTRRQTKETLPLLKAYPHRYR-WIQPHTTFDFIKPKDSKFYDLH 232 Query: 252 PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSR 311 ++ + + TNL E P++L +Y+ R IE +F++LK Y +GL Sbjct: 233 FRAVRFAIADGVYETVYTNLNAEDFPPEKLKQLYNLRWGIETSFKELK---YAVGLASLH 289 Query: 312 TSSSE-RFDIMLLIALMLQLTCWL 334 + + + ++ + + Sbjct: 290 SKKKDFILQEIFARLILYNYSSII 313 >UniRef50_Q10VE7 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VE7_TRIEI Length = 159 Score = 64.1 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 61/175 (34%), Gaps = 28/175 (16%) Query: 164 FKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSN 223 F W K +K Y++ R + + + +S L G +L ++ Sbjct: 12 FLSHWLKKYQKQDLYFVFRQKKTTIIKR--GKKYCKVSELKV-------NFGETKLLLNH 62 Query: 224 PISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVN 283 + + T ++ K S + W + +NL +PK++ Sbjct: 63 IFPKILKV------------GTYNLLNYKKQKYRQKSVVDKWYILSNLS----SPKKIKK 106 Query: 284 IYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVH 338 IY +RM IE F+ K+ +Y L S ++ R ++L+ + Sbjct: 107 IYIQRMGIEAMFKYYKTGSYNL---ESAKANKMRLSNLILLIAISYTISSFQVQK 158 >UniRef50_Q5ZTU2 Transposase Tn5 n=9 Tax=root RepID=Q5ZTU2_LEGPH Length = 483 Score = 64.1 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 47/352 (13%), Positives = 97/352 (27%), Gaps = 72/352 (20%) Query: 56 HNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLV------------------- 96 H K R N + ER + I P +L Sbjct: 55 HQSKAAYRFFQNDAVS-ERKILDSHITKTIERAKNYPTILAIQDTSYISYKNHKKTEGLG 113 Query: 97 --------DWSDIREQKRLMVLRASVALHGRSV----------TLYEKAFPLSEQCSKK- 137 ++ + +M +V G + L ++ ++ S Sbjct: 114 IIAARVRSKTTNFQTHGLVMHTTFAVTTEGLPIGLLDQKISSRPLLDETVKELKKRSHNI 173 Query: 138 ----AHDQFLADLASI--------LPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRG 185 + + + S+ L + + ++ ++ R R Sbjct: 174 ALPIEEKESMRWIESLEHSNNYPDLKNAKVVTVCDREADIYDLFEVASTNQCPFVVRARQ 233 Query: 186 KVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLT---------KSNPISCQILLYKSRS 236 + K L D+ S G ++T ++ + + + Sbjct: 234 DRTVNKTSIYSKKSGEKLWDLVSGSP-CRGEIQVTIPARDNKPKRTATLEVRFDHFVMNP 292 Query: 237 KGRKNQRSTRTHCHHPSPKIYSASAKEP-------WILATNLPVEIRT-PKQLVNIYSKR 288 +R TRT +Y P W+L TN+ + + + Y R Sbjct: 293 PKNNVKRKTRTLPDLKLNAVYVIEQSPPLGEEPMNWMLLTNIDINNFEEAVEKIQWYCLR 352 Query: 289 MQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQ 340 +IE + LKS G + R +++R L I ++ + + A+ Sbjct: 353 WRIEIFHKILKS---GFKVEECRLGAADRLVRFLTIMSVIAWRIFFITLVAR 401 >UniRef50_B0R9A9 Transposase (ISH8) n=22 Tax=Halobacteriaceae RepID=B0R9A9_HALS3 Length = 424 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 67/195 (34%), Gaps = 10/195 (5%) Query: 126 KAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRG 185 + + +K HD L S L ++ A FK + +++ Y++SR++ Sbjct: 148 ETIERIDVTDEKTHDSTLFKTGSWL--QERLVLFDRAYFKYRRFALIDENDGYFVSRLKE 205 Query: 186 KVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRST 245 + + + K + + + + +G ++ + Sbjct: 206 NANPLITEELREWRGRAI-PLEGKQIHDVVDDISRKYIDVEVEAEFKRGQYEGTRSLDTK 264 Query: 246 RTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGL 305 R A + + TNLP + P+ L +Y R ++E FR+LK+ Sbjct: 265 RFRVVGVRDS----DADDYHLYITNLPRDEFFPEDLATLYRCRWEVETLFRELKTQ---Y 317 Query: 306 GLRHSRTSSSERFDI 320 L TS + I Sbjct: 318 ELDEFNTSDPDVVKI 332 >UniRef50_Q05309 Transposase for insertion sequence element IS1151 n=16 Tax=Clostridium perfringens RepID=T1151_CLOPE Length = 473 Score = 62.6 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 63/198 (31%), Gaps = 22/198 (11%) Query: 163 GFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAE----------------NWKPISNLHDM 206 FK+ + K ++K G ++S+V+ I + Sbjct: 197 YFKIDYLKRLDKSGTAFISKVKSNTSLYIKNPSPEKYKVGTIKKSSEYIKIDIIKLAEPL 256 Query: 207 SSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI 266 ++ + L + + ++++ K + + + + + Sbjct: 257 AAGETIELTDIYIGSKKELKSRLIITKLTEENKSKRIFNHIEGIKKKRLTLNQRRLDFNS 316 Query: 267 L---ATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLL 323 + TN+ I T Q+ +YS R QIE F+ KS + + ERF L Sbjct: 317 INAYITNVSSNIITMNQVHELYSLRWQIEIIFKVWKSI---FKINQVKKVKLERFMCFLY 373 Query: 324 IALMLQLTCWLAGVHAQK 341 L+ L ++ Sbjct: 374 GRLIALLLSSTIVFTSKS 391 >UniRef50_C4YUW4 Transposase subunit n=4 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YUW4_9RICK Length = 96 Score = 62.2 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 17/89 (19%), Positives = 34/89 (38%), Gaps = 6/89 (6%) Query: 281 LVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQ 340 L+ +Y++R QIE F+ KS G + ++ R D ++ I + + G Sbjct: 6 LLRLYAQRWQIETMFKAFKSA--GFNCEATHITNDLRLDTLMQILSIAFCLAYQTGEIIV 63 Query: 341 KQGWDKHFQANTVRNRNVLSTVRLGMEVL 369 S R+G++++ Sbjct: 64 LDKPI----VIKKHGYRQNSIFRVGLDII 88 >UniRef50_Q5ZXP7 Putative uncharacterized protein n=5 Tax=Gammaproteobacteria RepID=Q5ZXP7_LEGPH Length = 468 Score = 61.4 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 49/349 (14%), Positives = 95/349 (27%), Gaps = 60/349 (17%) Query: 62 DRLLGNRHLHKE--RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGR 119 R + +++ E L I + ++ D +++ R + + R Sbjct: 54 YRFFNHVNVNPESILLPHSEAILERI-KAEKIVLIPQDTTEVDFTGRKSLSGMGYLSNER 112 Query: 120 SVTLYEK-----------------------------------AFPLSEQCSKKAHDQFLA 144 S LY C K ++ Sbjct: 113 SRGLYLHPSIAFTPERVCLGVVEMQHWIRKEIGTRNSRKGKSIEEKETYCWLKGYNAA-N 171 Query: 145 DLASILPSNTTPLIVSDAGFKVPWYKSV--EKLGWYWLSRVRGKVQYADLGAENWKPIS- 201 +A +P I G + + E+ YWL R + + ++ + Sbjct: 172 KIALAVPDTMVVSISDREGDIYEVLEKLPSEENKAYWLIRCQHDRAVLNEETNQFELLLK 231 Query: 202 --------------NLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT 247 + +S + + R +S +I R RK+++ Sbjct: 232 KEVSKACVLGTIEFEIPAGTSYRNCKKRHTRKARSVRQEIRICSVSLRPPRRKSKKLNVI 291 Query: 248 HCHHPSPKIYSASAKE---PWILATNLPVEIRT-PKQLVNIYSKRMQIEETFRDLKSPAY 303 K + E W L T++P++ ++VN Y R IE + LKS Sbjct: 292 EIQVVHCKEINTPEGEQPVEWFLITSVPIKTLDRAVEIVNWYLCRWLIEMYIKILKSGCK 351 Query: 304 GLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANT 352 LR ++ +I + G F+ N Sbjct: 352 IEELRFETYEATLNCIAFYMIVAWRVFYLTMLGRTCPDIDCTTVFEDNE 400 >UniRef50_C7GFW6 Transposase, IS4 family protein n=4 Tax=Clostridiales RepID=C7GFW6_9FIRM Length = 436 Score = 61.4 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 74/228 (32%), Gaps = 8/228 (3%) Query: 126 KAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVR 184 + P + +A + A +P+ ++D GF + + +L R + Sbjct: 161 EVQPGRLKNEFQAICNLMDRYA----YGASPIFIADRGFSSYNVFAHAIENNVDFLIRAK 216 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQR- 243 + P + ++T K+ S + K+ + N Sbjct: 217 -DLNVQRFLGGGTLPDKLDTTIELILTRTQSKKKHKHPEKESQYRYIGKNIAFDYLNPAD 275 Query: 244 STRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAY 303 + + S + T L E TP + Y+ R IE +FRDLK Sbjct: 276 ISDEYLLKLRIVRVEVSDGVFENIITTLSEEDFTPDDIKYCYNLRWGIETSFRDLKHTIG 335 Query: 304 GLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQAN 351 L HS+ + F++ + L + + V + + +Q N Sbjct: 336 ATNL-HSKKTEYVAFELWSKLILYNFCSIIILHVPVKSRNRKYEYQVN 382 >UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=A7B831_RUMGN Length = 366 Score = 61.0 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 51/166 (30%), Gaps = 8/166 (4%) Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAEN 196 + T I + V++ G Y+L RV+ + Sbjct: 177 NESLAMTQMIDRYKGEKKTIFIADRGYETYNIFAHVQEKGMYYLIRVKDGGG-GSMTGSF 235 Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 P N D T + K+ P + + KS + + + + Sbjct: 236 DLPDENEFDHDMQLILTRKQTKDVKAKPKKFKFIA-KSSPFDYLDLYDKKFYTLNFRVVR 294 Query: 257 YSASAKEPWILATNLPVEIRTPKQLVNIYSKRM------QIEETFR 296 ++ S + TNLP E +++ +Y+ R IE +R Sbjct: 295 FAISEDSYESIITNLPKEDFPVEEIKKVYAMRWHRNIVQGIEICYR 340 >UniRef50_Q6LGR5 Putative transposase similar to Tn10 n=1 Tax=Photobacterium profundum RepID=Q6LGR5_PHOPR Length = 105 Score = 61.0 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 36/60 (60%), Positives = 43/60 (71%) Query: 27 TLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFIC 86 + ALL LTLT LGR+LP+KA+TKH IKR+DRLLGN HLH +RL +YRWH C Sbjct: 1 MDSVQALLSNDALTLTLLGRSLPSKAKTKHCIKRVDRLLGNNHLHHDRLDIYRWHCHQFC 60 >UniRef50_Q6ZER8 Putative uncharacterized protein sll5062 n=1 Tax=Synechocystis sp. PCC 6803 RepID=Q6ZER8_SYNY3 Length = 151 Score = 60.6 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 23/107 (21%), Positives = 38/107 (35%), Gaps = 9/107 (8%) Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 +L ++ P + T + Y R IEE F D +S + L R + + Sbjct: 6 LLFSDEPTCLHT----IQEYGLRFDIEEAFLDDQSNGWNLQKSEIRFVC--ALSRLFFLL 59 Query: 326 LMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHS 372 + L GV G + + R S R+G + L+ S Sbjct: 60 ALATLYATAQGVEVFATGKHRWVAPHWFRGN---SYFRIGWDWLKTS 103 >UniRef50_Q2FU81 Transposase, IS4 n=4 Tax=Methanospirillum hungatei JF-1 RepID=Q2FU81_METHJ Length = 452 Score = 60.6 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 57/362 (15%), Positives = 107/362 (29%), Gaps = 18/362 (4%) Query: 46 RNLPTKARTKHNIKRIDR----LLGNRHLHKERLAVYRWHASFICSGNT-MPIVLVDWSD 100 + T + KR + L + H V++ A I++ D S Sbjct: 76 PKIETSILNQSFRKRFNYKLVDFLKSLMDHYIDQIVHQSPAHLKGIVEDFKDILVQDSSI 135 Query: 101 IREQKRLMVLRASVALHGRSVTL-----YEKAFPLSEQCSKKAHDQFLADLASILPSNTT 155 IR K+L L + S L Y + + + I P Sbjct: 136 IRISKKLYDLHPAARSRDDSAGLKIHAVYSVVYHSVKNAIITTERVHDYKMLKIGPDVEN 195 Query: 156 PLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTL 214 L+++D G+ + +++ G ++ SRV+ + + + P + Sbjct: 196 ILLINDLGYYSLKTFSKIQEYGGFFASRVKSNAVFKVVSINSGPPEITSIVDHNCFKSIN 255 Query: 215 GYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWIL-ATNLPV 273 G L + L+ + + E W L TNL Sbjct: 256 GDDFLDRMPKKGVYDLICSFHIGDKHINKIKTPIFQEFRVICSWNPLTEKWHLYITNLGK 315 Query: 274 EIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI--MLLIALMLQLT 331 E+ + + +Y R IE F++LK Y LG I MLL ++ + Sbjct: 316 EVFSADDIYELYRFRWVIELIFKELK-GDYDLGKMLLNNEPMAFIHIYSMLLRFIISRDL 374 Query: 332 CWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGM---EVLRHSGYTITREDSLVAATLL 388 +K K+ Q + + L + + + + L Sbjct: 375 FTWIVSTTRKNDKGKYTQMLWSKVFSEKGLEFLSILNQNLFGTGNVKKRWDKLERSLRHL 434 Query: 389 TQ 390 + Sbjct: 435 GK 436 >UniRef50_A9F243 Transposase, IS4 family n=4 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F243_SORC5 Length = 461 Score = 60.6 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 40/356 (11%), Positives = 93/356 (26%), Gaps = 50/356 (14%) Query: 54 TKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLV-DWSD--IREQKRLMVL 110 + ++ R GN + + A+ ++ + D + R + L Sbjct: 50 SDSELEAAYRFFGNDAVTPAAILAPHVRATLARMEAEPVVLAIHDTTTLSFRSDGQRQGL 109 Query: 111 --------------RASVALHGRSVTLYE---KAFPLSEQCSKKAHDQFLADLASIL--- 150 +V+ G L + + HD++ + + Sbjct: 110 GRLRSSGQTFFAHFTLAVSGDGTRRPLGVLDLSTHVRDDGTTDNEHDRWGEQVERVAVLG 169 Query: 151 -PSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLH----- 204 + ++ G + + G ++ R+ + Sbjct: 170 AAPHDVVHVMDREGDDYGLFAQLLSAGHRFVIRLAHNRLVEADALGAEAKLEQALAHVQA 229 Query: 205 ------DMSSSHSKTLGYKRLTKSNPISCQIL-----LYKSRSKGRKNQRSTRTHCHHPS 253 ++S + ++ P + ++ + + ++Q Sbjct: 230 VAVREVELSPRPAGNRSPQQKRLHPPRAGRLAKLALGSTRVTLRRPRSQPRELPATLSLR 289 Query: 254 PKIYSASAKEP------WILATNLP-VEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLG 306 P W+L T+ P + QLV+ Y R +EE F+ LK+ G Sbjct: 290 VVRVWEIEPPPGEAPVEWVLLTSEPVESVEQLTQLVDWYRARWMVEELFKALKT---GCA 346 Query: 307 LRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTV 362 + +L + + L A++ T VL Sbjct: 347 YEKRQIEDLHGLRNVLALFAPIAWQLLLLRSEARRAPEQPATAVLTPTQLEVLRVF 402 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 60/218 (27%), Gaps = 19/218 (8%) Query: 137 KAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGK-----VQYAD 191 H + + + + +E+ G ++ R R Sbjct: 177 NEHKALAQMVDRRSSAFPAIFMADRGYESYNTFAHIEQKGDKYVVRGRESGTGICSGLNL 236 Query: 192 LGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHH 251 E + L+ K R K + + + Sbjct: 237 PDTEEYDIEKELYICKKHSKKVKTNPRKYKRIRSDATFDFFTDDCEEYRLN--------- 287 Query: 252 PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSR 311 S +L TNL E + L +Y R IE F LK Y LG Sbjct: 288 LRIVKIKLSETTTEVLFTNLSKEKFSADDLKRLYHMRWGIETAFDQLK---YALGAASVH 344 Query: 312 TSSSERFDIMLLIALMLQLTCWLA--GVHAQKQGWDKH 347 + +SE L L++ C G+ ++Q + K+ Sbjct: 345 SKNSELIIQELYGKLIMFNFCKTIVGGIAVKQQEYWKY 382 >UniRef50_C6N6I3 Transposase n=2 Tax=Gammaproteobacteria RepID=C6N6I3_9GAMM Length = 488 Score = 59.9 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 44/272 (16%), Positives = 89/272 (32%), Gaps = 22/272 (8%) Query: 128 FPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKV 187 P+ ++ S + + A + I + ++LG ++L R Sbjct: 189 IPIEQKESYRWLENLKQSTALLQEPKRCVHIGDRESDIYELFCIAQELGTHFLVRTCVNR 248 Query: 188 QYADLGAENWKPISNLHDMSSSHSK---TLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 + + +S++ + G K QI ++ K +K Sbjct: 249 LIENGSSTIADKMSSVEVKGLHRIELQDADGNKTEIPLAIKYQQINVFPPIGKQKKYPSL 308 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIR-TPKQLVNIYSKRMQIEETFRDLKSPAY 303 + T H + + K W LATNLP+ + ++ Y+ R +IE + LKS Sbjct: 309 SLTIIHAEEEQDPTNRDKIVWKLATNLPITSLEQAIEKLDWYALRWRIETFHKILKSGCK 368 Query: 304 GLGLRHSRTSSSERFDIMLLIALML---QLTCWLAG------------VHAQKQGWDKHF 348 S+ +++R ++ I +L + + DK Sbjct: 369 A---EASKLRTAQRISNLIAIFCILSWRIFWMTMINRCSPNASPQLVFTSEEIHLLDKLV 425 Query: 349 QANTVRNRNVLSTVRLGMEVLRHSGYTITRED 380 + N R + + +V + GY + D Sbjct: 426 KGNIQNERRIKNLSYYLTKVAQLGGYLARKSD 457 >UniRef50_Q64B41 Transposase n=11 Tax=environmental samples RepID=Q64B41_9ARCH Length = 439 Score = 59.9 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 84/206 (40%), Gaps = 8/206 (3%) Query: 155 TPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTL 214 L++ FK ++ ++ G Y++SR++G + +++ D+ + + Sbjct: 195 RILLIDLGYFKYLFFDRIDGYGGYFVSRLKGNANPLIVRVNRKCRGNSV-DVVGKKLRDV 253 Query: 215 GYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVE 274 + + + ++ + + KG+++ R +++ + + TN+ V+ Sbjct: 254 LPRLKREILDVEVEVEFKRRKYKGKQSTVKRRFRMVC----AFNSDSGKYHSYLTNIRVD 309 Query: 275 IRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWL 334 I + +++ +Y R +IE F++LKS + +++ ++ IA++ + Sbjct: 310 ILSAEEIALLYGARWEIELIFKELKSH---YRMDQIPSANPNIVKCLIWIAILTLMCSRR 366 Query: 335 AGVHAQKQGWDKHFQANTVRNRNVLS 360 + + + +R V S Sbjct: 367 ILRLIRNANPENANRYTPLRWAKVFS 392 >UniRef50_A3IP38 Putative uncharacterized protein n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IP38_9CHRO Length = 101 Score = 59.5 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 33/93 (35%), Gaps = 4/93 (4%) Query: 74 RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQ 133 + +W GN + IV +D + + +L S+ + R + LY + + Sbjct: 3 IPIIKQWLNQSFDPGNVLHIV-IDRTQW---GLINILMVSLIIDNRGIPLYFELLDHTGN 58 Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGFKV 166 + L+ + +L T ++ G Sbjct: 59 SNFDTQKSILSRVLPLLKEYKTVVLGDRGGHGW 91 >UniRef50_Q4BVH8 Putative uncharacterized protein n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4BVH8_CROWT Length = 168 Score = 59.5 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 9/98 (9%), Positives = 27/98 (27%), Gaps = 3/98 (3%) Query: 74 RLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQ 133 V + + I+ +D + +++ MV +V ++ +Y Sbjct: 48 WFPVIKAIICKEFKTGSRLIITIDRTQWKDKNVFMV---AVIWKKLALPIYWTLLGKRGA 104 Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKS 171 + + +L + ++ V Sbjct: 105 SRLSEQQALIQPVLCLLKNYELVILGDREFHSVKLAYW 142 >UniRef50_Q7M7G3 Gll0371 protein n=1 Tax=Gloeobacter violaceus RepID=Q7M7G3_GLOVI Length = 467 Score = 59.1 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 48/387 (12%), Positives = 106/387 (27%), Gaps = 60/387 (15%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKR 60 M +D + Q +R L +L +L P +K Sbjct: 1 MPTIDPWTEHELQHLDLGDARRHTRLKQLLSSLARQPHASL-------PQACEDAAALKA 53 Query: 61 IDRLLGNRHLHK-ERLAVYRWHASFICSGNTMPIVLVDWSDIR----------------- 102 R L N H + + + + + +++ D +++ Sbjct: 54 AYRFLDNPQCHPADIRQAHASATAGRLAALPLVLLIQDTTELNFTAHPHTTGLGHLSKPE 113 Query: 103 EQKRLMVLRASVALHGRSVTLYEKA----------FPLSEQCSKKAHDQF------LADL 146 Q L+ +V ++ L + Q + H + LA Sbjct: 114 SQGLLVHSLLAVRPDAVALGLCWQKVWTRAAQVEHLAARRQKRPQQHKESQRWLDGLAAA 173 Query: 147 ASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHD 205 +P +++ D + + + + L R + + A + + Sbjct: 174 KGYVPPGGEAVLIGDRESDMFGLFAAPRPPQLHLLVRAARQRRLATPKTLLF----EVFA 229 Query: 206 MSSSHSKTLGY-KRLTKSNPISCQILLYKSRSK---GRKNQRSTRTHCHHPSPKIYSASA 261 +S+ R ++ +Y + GR +Q Sbjct: 230 GASAQGHYRCDLARRPGRAARQARLQVYWQAVQLQPGRNDQTHKGQPPVSVWVVRAWEVD 289 Query: 262 KEP------WILATNLPVEIR-TPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSS 314 W L ++ PV V+ Y+ R IE LKS G + + + Sbjct: 290 PPAGEEGIDWWLLSSQPVTTLEQALACVHRYTLRWLIERYHYILKS---GCAVEQLQLET 346 Query: 315 SERFDIMLLIALMLQLTCWLAGVHAQK 341 ++R + L + ++ ++ Sbjct: 347 AQRLERALAVYCIVAWRLLQLTYQSRS 373 >UniRef50_Q6MB99 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MB99_PARUW Length = 164 Score = 59.1 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 50/167 (29%), Gaps = 15/167 (8%) Query: 236 SKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETF 295 K + + E I+ TN+ P + + +Y KR IE F Sbjct: 13 GKKKVFKGHHTLLGVSVQIAASRNYQGELLIVLTNVC-----PYKALKMYKKRWAIETLF 67 Query: 296 RDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRN 355 LK+ G + + ++ D +L+ + + + + Sbjct: 68 GYLKT--KGFCFEDTHMTDLKKIDAWMLVLTLAVVW------TIKTNEIIQSKTNQASHG 119 Query: 356 RNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLGKL 402 R S R E +R + E + + L +L +L Sbjct: 120 RKRKSIFRTCFEGIRKC--LLCLELYMNEILHYIRLLRKKNSILNRL 164 >UniRef50_Q6LRT4 Similar to transposase n=38 Tax=Photobacterium profundum RepID=Q6LRT4_PHOPR Length = 426 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 44/361 (12%), Positives = 91/361 (25%), Gaps = 50/361 (13%) Query: 8 HDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGN 67 L KR +L + + + L + + A T+ R N Sbjct: 6 QHQLPCILESRLSKRYQTLIMEHMTVNSSNAPGVKSLRHHTQSWASTQAT----WRFYHN 61 Query: 68 RHLHK---ERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRL---------------MV 109 + + + S + ++ DW I K Sbjct: 62 EDVTFPMLSGPMLGLARSGVKESQSRYVLMAHDWCHINFAKHHSKLDKTKMSHALDVGYE 121 Query: 110 LRASVALHGRS-VTLYEKAFP---------------LSEQCSKKAHDQFLADLASILPSN 153 L+AS+ + + + +Q + + + Sbjct: 122 LQASLLVDANTGAPIAPAGLNLLTSNGIYQCRSQELQPKQSHLDSLFDSIHWQEQLHLDK 181 Query: 154 TTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKT 213 +V + + +WL+R + + I + Sbjct: 182 PLVHVVDREADSAKDLRRLGS--VHWLTRTKKGSTFRHEDQFKTAEIISRTISPDLKGVI 239 Query: 214 LGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP--WILATNL 271 + + L++ K S C + KE W L +N+ Sbjct: 240 SLRGKEGYLFVGETTVELHRKSEK----LASAAPTCRFVMSLVTDDEGKELARWYLLSNV 295 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLT 331 ++ Y R IE F+ LKS + L + +++E L+ A + Sbjct: 296 --LDVDATEIATWYCHRWNIESWFKLLKSDGH--QLEKWQQTTAESILKRLITASVATTL 351 Query: 332 C 332 Sbjct: 352 I 352 >UniRef50_B0NZ84 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0NZ84_9CLOT Length = 244 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 64/202 (31%), Gaps = 8/202 (3%) Query: 169 YKSVEKLGWYWLSRVRGKVQYADLGAE--NWKPISNLHDMSSSHSKTLGYKRLTKSNPIS 226 + K ++L R++ +G E + + +T K+L K Sbjct: 3 WLIYRKKDGFFLIRIKDGRNGIKMGLELPRRNEFDLDVSLKLTRKQTNDVKKLLKDKNHY 62 Query: 227 CQIL---LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVN 283 I + + TR + + + + + T+L V PK+L Sbjct: 63 RYIASSATFDFLPSHSRKSEQTRFYEINFRIVRFEITPGNYETVLTSLDVNKYPPKELKR 122 Query: 284 IYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQG 343 +Y+ R E +FRDLK L + + ++ + + A + Sbjct: 123 LYALRWGTETSFRDLKYTVGMLNFHSKKVMCIH--QEIYAHLIIYNFSEMITSHVAISKK 180 Query: 344 WDKH-FQANTVRNRNVLSTVRL 364 + ++AN +V Sbjct: 181 KRLYTYKANFSMAVHVCRLFFY 202 >UniRef50_A8F1V7 Transposase and inactivated derivative n=1 Tax=Rickettsia massiliae MTU5 RepID=A8F1V7_RICM5 Length = 478 Score = 58.3 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 28/228 (12%), Positives = 61/228 (26%), Gaps = 19/228 (8%) Query: 124 YEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRV 183 Y P+ E+ S + + + I YK ++ +L R Sbjct: 165 YVHTKPIQEKESYRWIEAIRDINNLNIVDKEIVHIADREADIYEMYKYCDEKNIKFLIRA 224 Query: 184 RGKVQYADLGAENWKPISNL--------HDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 235 + +S + + ++ Sbjct: 225 KENRAINKQKRREKPKYKLFDYFHSLPEMLKTSIKFQINKDVKYREATLSISFAEFTLPP 284 Query: 236 SKGRKNQRSTRTHCHHPSPKIYSASAKEP-------WILATNL-PVEIRTPKQLVNIYSK 287 R + + + I + P W+L +N+ + VN Y++ Sbjct: 285 PPSRTCNKDGKELANLKLWGIIAKEDNPPEGAEAINWLLISNIKVNTTDEAIEKVNWYTR 344 Query: 288 RMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 R IE + LKS G + ++ ER + + ++ + Sbjct: 345 RWSIEIFHKILKS---GCSVEAAQLRERERLIKYITMKSIVAWRIFWL 389 >UniRef50_UPI000196B70E hypothetical protein CATMIT_00144 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B70E Length = 479 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 53/173 (30%), Gaps = 12/173 (6%) Query: 128 FPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKV 187 + L +L + + +E L + ++ R + Sbjct: 179 LRQAPYSETPLAFAHLYRTREMLENQKVIYLADRYYGSAEIISHLEDLRYSYVIRGKSNF 238 Query: 188 QYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT 247 + D + + + +P + ++ K R +R R Sbjct: 239 YKKQVAG------MESDDEWIEVEVDEKWLKRFRFSPEAKKLRKENPTLKIRVIKREYRY 292 Query: 248 HCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 + E I TNL E T +++ IYS+R IE +++ +K+ Sbjct: 293 TDNKN------KEHCENLIYFTNLSSESFTTDEIMEIYSRRWDIEVSYKTMKT 339 >UniRef50_B6BYZ1 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6BYZ1_9GAMM Length = 191 Score = 57.6 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 21/102 (20%), Positives = 40/102 (39%), Gaps = 10/102 (9%) Query: 273 VEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTC 332 P+Q + Y + Q E +K G RH+ + R +L + + C Sbjct: 49 HPTHAPEQALEDYRQCWQTECLLAVMKLR--GFPRRHANITDPNRVARLLAVMTLALCWC 106 Query: 333 WLAGVHAQKQGWDKHFQANTVRN--RNVLSTVRLGMEVLRHS 372 + G+ ++Q Q ++ R S RLG++ +R + Sbjct: 107 YKVGLWLERQ------QPIEIKKHQRRACSGARLGLDTVRQA 142 >UniRef50_B9K450 Transposase n=6 Tax=cellular organisms RepID=B9K450_AGRVS Length = 473 Score = 57.6 bits (137), Expect = 8e-07, Method: Composition-based stats. Identities = 53/378 (14%), Positives = 108/378 (28%), Gaps = 67/378 (17%) Query: 42 TELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLA--VYRWHASFICSGNTMPIVLVDWS 99 +G ++P + N K R N ++ + + + A + +VL D + Sbjct: 36 EHMGGSIPFACQDWANTKAAYRFFSNPNVEEGDILNGHFAATAQRYDASQGPILVLRDTT 95 Query: 100 DIREQKR-----------------------------LMVLRASVALHGRSVTLYE----- 125 + Q+R LM +V L G + L Sbjct: 96 EFTYQRRNPHAVGFTKSVNSGRDKQERLRHHTVCGILMHSSLAVTLDGLPLGLAAVKFWS 155 Query: 126 ------------KAFPLSEQCSKKAHDQFLADLASILP----SNTTPLIVSDAGFKVPWY 169 K P K ++L +L + + + Y Sbjct: 156 RDKFKGTAQLKRKINPTRVPIETKESIRWLENLRQSVGLLGQPDRCIHVGDRESDIYELY 215 Query: 170 KSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQI 229 ++LG +++ R L IS + + L R + Sbjct: 216 CLTKELGTHFVVR----TVVDRLAGNGDHTISAEMRDVETAGRHLIEVRADTDEVTKVHL 271 Query: 230 LLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP-------WILATNLPVEI-RTPKQL 281 + R + + + I++ P W L T+L V + Sbjct: 272 DIRFKRIRVLPPIGKMKRYPALDLTVIHAVEPNPPPGSKRIEWKLLTDLEVHSCEDAVEK 331 Query: 282 VNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQK 341 + Y+ R +IE + LKS +R +++R ++ + ++ + A+ Sbjct: 332 IKWYAMRWKIEVFHKILKSGCRA---EDARLRTADRLANLVAMFCIMSWRVLWLTMLARS 388 Query: 342 QGWDKHFQANTVRNRNVL 359 A T + +L Sbjct: 389 APEIPPTAALTEQEIEIL 406 >UniRef50_Q0H069 ISEc13 transposase n=23 Tax=Bacteria RepID=Q0H069_ECOLX Length = 457 Score = 57.6 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 65/212 (30%), Gaps = 9/212 (4%) Query: 129 PLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVE-KLGWYWLSRVRGKV 187 P E+ S + + + V D + Y + G ++ R Sbjct: 161 PYEEKESYRWQQASERMAERLGEIQKRVITVCDREADIWHYLHYKVSHGQRFVVRAAQNR 220 Query: 188 QYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT 247 + + + ++ L S TL + ++ + S + S + Sbjct: 221 RLEEAPGKLFELPEVLATAGS---HTLNVMQKGGRAARQARMFISYSEVSIKNPDNSGQA 277 Query: 248 HCHHPSPKIYSASAKEPWILATNL-PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLG 306 A W L T+ +++V+ Y +R IEE + KS G Sbjct: 278 LPLTYVCCREQAEDGACWHLLTSEKVASAADARRIVSHYERRWLIEEYHKAWKSG--GTC 335 Query: 307 LRHSRTSSSERFDIMLLIALMLQLTCWLAGVH 338 + R + + + ++ + + + G+ Sbjct: 336 VESLRMQTRDNLER--MVVIKAFIAVRVLGLR 365 >UniRef50_C1DPR7 Transposase n=7 Tax=Proteobacteria RepID=C1DPR7_AZOVD Length = 460 Score = 57.6 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 68/241 (28%), Gaps = 21/241 (8%) Query: 129 PLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEK----LGWYWLSRVR 184 + ++ L +P + G W+ + W+ R Sbjct: 156 EKESRRWIDSYQASC-ALQGQIPKTQLVNLADAEGDLYEWFTEYAEVAPSTRAQWIVRAA 214 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 + A+ + + + K ++ L + R R Sbjct: 215 QDRRVLTGDADKLWASLAMAPGLGQ--LAVEVRARPKRPARQARVTLRSATVVLRPPARI 272 Query: 245 TRTHCHHPSPKIYSASAKEP-------WILATNLP-VEIRTPKQLVNIYSKRMQIEETFR 296 R + + P W+L T+LP + +V Y+ R IE F Sbjct: 273 GRHLPEVSVNAVLAREENPPEGVEPLEWLLLTSLPVGSLEQASTIVAWYAVRWYIEIYFH 332 Query: 297 DLKSPAYGLGLRHSRTSSSERFD---IMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV 353 LK+ G + + + ER + L+ L + G + + F+A Sbjct: 333 VLKN---GCQINCLQLETEERLLPCIGLYLVVAWRVLYSLMLGRACPELNCELIFEAREW 389 Query: 354 R 354 R Sbjct: 390 R 390 >UniRef50_Q1Q2K2 Putative uncharacterized protein n=5 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q2K2_9BACT Length = 457 Score = 57.2 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 41/379 (10%), Positives = 108/379 (28%), Gaps = 51/379 (13%) Query: 4 LDILHDSLYQFCPELHLKRLNSLTLACHALLD-CKTLTLTELGRNLPTKARTKHNIKRID 62 + +L+ L F K+ L +A+ K +L + + + T ++ Sbjct: 41 IPLLNKFLEHFASCFTKKQFAMFLLVVYAMFKDYKRNSLEAMAQAVHTD------YQKFQ 94 Query: 63 RLLG-----NRHLHKERLAVYRWHASFICSGNTMPIVLVDWS------------------ 99 L ++R+ + + + + ++ I+ +D + Sbjct: 95 YFFSESKWDLPALKQKRMDIIQKQRTTALTKDS--ILTIDDTGCPKPYAKNTEAAKWQYC 152 Query: 100 -DIREQKRLMVLR-ASVALHGRSVTLYEKAF----PLSEQCSKKAHDQFLADLASILPSN 153 ++ + V+ A+ + L + E + + + + Sbjct: 153 GPLKRPETCNVVVGAAFVSKTKHFPLDVIPYLPADEFGEGKNDPKFKDKIQIAMDMFDAA 212 Query: 154 TTPLIVSDAGFKV-----PWYKSVEKLGWYWLSRVRGKVQY--ADLGAENWKPISNLHDM 206 + S F + + + ++ S ++ + + I + Sbjct: 213 SNVFDFSAIAFDTWYASQRFLEHIHAKKKHFFSEIKSNRNISMYHPEKQKYCIIKPDELV 272 Query: 207 SSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSA-SAKEPW 265 + G + + YK+ + K ++ K+ Sbjct: 273 TLIKKHYAGKIKYVTLKSADGSEVSYKTYTFDAKLNGCNVPLKFVVILGKWNKEDDKKYH 332 Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 +L TN + K ++ Y R IE F++LK H + ++ + I Sbjct: 333 VLITN--QLDASVKTVITNYLLRWGIEHCFKELKDT---FYFDHYQVRHIDKIERYWNIC 387 Query: 326 LMLQLTCWLAGVHAQKQGW 344 L+ + +A Sbjct: 388 LISWTFVYWIKQNAYLDKI 406 >UniRef50_A7C1C1 IS231-related transposase n=6 Tax=Beggiatoa sp. PS RepID=A7C1C1_9GAMM Length = 445 Score = 57.2 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 62/198 (31%), Gaps = 12/198 (6%) Query: 152 SNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENW-KPISNLHDMSSSH 210 + I F + + LSR+R D E + + L ++ Sbjct: 184 EECSLQIADLGYFSIAKMAENFDANVFCLSRLRHDAVLFDEQEEEFDLSLYTLFMKKNNR 243 Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAK-----EPW 265 + L + + ++ + + +R K +AS K + Sbjct: 244 LRAELNVLLVRYEKLPVRLFIERVPEMISSKRRRQANKGASKKKKGKTASKKSLSLCDFT 303 Query: 266 ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 +L T P + + + +Y R QIE F+ KS A L S + R + I Sbjct: 304 LLVTTAPSVQLSFDEALVLYGARWQIELLFKLWKSHA---KLDTSIRPNPWRICRYIYIK 360 Query: 326 LMLQL---TCWLAGVHAQ 340 L+ L L G Sbjct: 361 LLACLVQHWIILMGCWNH 378 >UniRef50_C3BTW8 Transposase for insertion sequence element IS231B n=13 Tax=Bacillus RepID=C3BTW8_9BACI Length = 387 Score = 57.2 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 27/103 (26%), Positives = 39/103 (37%), Gaps = 3/103 (2%) Query: 230 LLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRM 289 L K + + R ++ R S + TN P +I QL + YS R Sbjct: 198 RLTKEQQQKRLQDQTVREKKKGMKYSARSKRLSGINVYMTNTPTDIVPMGQLHDWYSLRW 257 Query: 290 QIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTC 332 QIE F+ KS + H + ER + L L+ L C Sbjct: 258 QIEILFKTWKSF---FYIHHCKKIKRERLECHLYGQLIAILLC 297 >UniRef50_Q1QGK1 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QGK1_NITHX Length = 155 Score = 57.2 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 1/69 (1%) Query: 295 FRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVR 354 FRD K +G+GL R + +R D +LL+ + L G + G D+H + NT + Sbjct: 88 FRDTKDLRFGMGLGVLRIADPQRRDRLLLLNAFAIVLLTLLGPAGESLGMDRHLKVNTAK 147 Query: 355 NRNVLSTVR 363 R S R Sbjct: 148 -RRTHSLFR 155 >UniRef50_P11901 Transposase for insertion sequence element IS421 n=41 Tax=cellular organisms RepID=T421_ECOLX Length = 371 Score = 56.8 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 56/194 (28%), Gaps = 14/194 (7%) Query: 150 LPSNTTPLIVSDAGFKV--PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDM- 206 + ++D GF +S+ ++ RV + + Sbjct: 160 FAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGL 219 Query: 207 -----SSSHSKTLGYKRLTKSNPISCQILLYKSRSKG---RKNQRSTRTHCHHPSPKIYS 258 + P +++ + K + + + + Sbjct: 220 DCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAET 279 Query: 259 ASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 A +L T+LP + + +Q+ + Y R QIE F+ LKS L L R E Sbjct: 280 LEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSL---LHLDALRAKEPELA 336 Query: 319 DIMLLIALMLQLTC 332 + L+ Sbjct: 337 KAWIFANLLAAFLI 350 >UniRef50_B4WVP9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WVP9_9SYNE Length = 112 Score = 56.8 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 29/89 (32%), Gaps = 4/89 (4%) Query: 38 TLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVD 97 L+EL P +A+ + + K+ R L AV + + ++ D Sbjct: 12 QPHLSELAVAFPGRAKPESHYKQQQRFFRAFELDY---AVIAQLVADWMAIPEPWVLAAD 68 Query: 98 WSDIR-EQKRLMVLRASVALHGRSVTLYE 125 + + +L V G + L Sbjct: 69 RTQWEVGTTTVNILTLGVVHKGIAFPLVW 97 >UniRef50_C8VXW5 Transposase IS4 family protein n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VXW5_DESAS Length = 587 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 50/353 (14%), Positives = 103/353 (29%), Gaps = 21/353 (5%) Query: 46 RNLPTKARTKHNIK------RIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWS 99 R +P + IK + L N + +R V + + + +D Sbjct: 226 RVMPGNTQDITTIKPLIEDVKARFALKNCTMVFDRGMVSTDNIVALECEKWTYVSAMDRD 285 Query: 100 DIREQKRLMV-LRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLI 158 +I++ L SV YE+ + E + FL I+ L Sbjct: 286 EIKKADFFNTALPESVTPDN-----YEQIMVMQEFLPFDE-NAFLYYREFIIDDRRYILT 339 Query: 159 VSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKR 218 A F + + + ++ + + + + + K L K+ Sbjct: 340 FDVARFFDEHHAQLNNVAYFVQWLTVKNQSLREAKKKRCQSLLEREVAAMLKRKHL--KK 397 Query: 219 LTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTP 278 N + R R Q ++ + + TNL V T Sbjct: 398 WVSVNIEPYDFEVINKRGNSRTIQSFQLSYTINTVAQKNEQRIHGITCFITNLDVTSHTA 457 Query: 279 KQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVH 338 ++ Y ++ ++EE F ++K L LR + +R ++I ++ Sbjct: 458 IDIIQWYRRKNKVEEAFHEIKDH---LDLRPIYLTREQRVMAHVIICVLAYFIFNDIEYR 514 Query: 339 AQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTIT--REDSLVAATLLT 389 ++ + + RL ++ S +IT L Sbjct: 515 LKQNDLAYSTE-EVIGTLRECLVNRLAIQQTNRSWLSITQPSSQLKEILHALK 566 >UniRef50_A8YDI5 Similarity. Hypothetical start n=1 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YDI5_MICAE Length = 91 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 31/82 (37%), Gaps = 1/82 (1%) Query: 2 CELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKA-RTKHNIKR 60 ++ L Q + + L L+ ALL ++L + ++A + +R Sbjct: 6 RIFSQVYSYLEQGSRFVDKRHLTVLSWMVTALLSSQSLNQARWEPFVQSRAEQANSYQRR 65 Query: 61 IDRLLGNRHLHKERLAVYRWHA 82 +R N + E++ + H Sbjct: 66 WNRFCQNGRVAVEKIYLNFPHT 87 >UniRef50_C0INS6 Transposase n=1 Tax=uncultured bacterium BLR10 RepID=C0INS6_9BACT Length = 267 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 51/184 (27%), Gaps = 3/184 (1%) Query: 178 YWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSK 237 WL R + + + TL + K+ P+ Q+ + Sbjct: 27 DWLIRAAHNRCL--PDGGKLWDQTMVGEPVGQIEFTLASRHGIKARPVRQQLWAQRVELC 84 Query: 238 GRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI-RTPKQLVNIYSKRMQIEETFR 296 T + + W L TN + +L++ Y R +IE F Sbjct: 85 NGTGSALHVTSIVAREIDAPAGAKPVEWRLLTNRCADTCAEVVELIDWYRARWEIEMLFD 144 Query: 297 DLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNR 356 LK+ L+ ER + L+ G + F +R Sbjct: 145 ILKNACRIESLQLEHIGKLERAIAVYLVVAWRIAHLMRLGRTCPDLDATRFFDWAEIRAA 204 Query: 357 NVLS 360 + S Sbjct: 205 YMRS 208 >UniRef50_Q04QP0 Transposase, ISLbp11 n=2 Tax=Leptospira borgpetersenii serovar Hardjo-bovis RepID=Q04QP0_LEPBJ Length = 243 Score = 56.0 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 57/156 (36%), Gaps = 6/156 (3%) Query: 183 VRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQ 242 +R + G + + +++ T+ K+ ++ Q+ K K + + Sbjct: 3 IRANHERKIEGGGCSWSYLETLEPAHTYTITVPRKKGKEAREAIIQLRFEKLTIKSPQYK 62 Query: 243 RSTRTHCHHPSPKIYSASAKEP--WILATNLP-VEIRTPKQLVNIYSKRMQIEETFRDLK 299 + + + +E W T +P K++++ Y R IE F+ LK Sbjct: 63 KLENIDMYALTATEVDGPKEESIDWKFLTTIPIHNSENAKRVISYYKSRWGIEVFFKVLK 122 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 S G + ++ +RF + ++ ++ + Sbjct: 123 S---GCNIESTQFKFGDRFKACIAVSAIVAWRVTML 155 >UniRef50_A5II18 Transposase, IS4 n=1 Tax=Legionella pneumophila str. Corby RepID=A5II18_LEGPC Length = 379 Score = 56.0 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 59/187 (31%), Gaps = 30/187 (16%) Query: 150 LPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 P T + V D G+ W+ S+ + +++SR++ + Sbjct: 173 WPIETDIIYVFDKGYCDYDWWWSIHQKKAFFVSRLKVNAAISIEQK-------------- 218 Query: 209 SHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILA 268 + +G K + K+P IL Sbjct: 219 ------FETNENSPILEDGLFRFSNPKPRGGK----KNLYTSLARRISVQREDKDPLILV 268 Query: 269 TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALML 328 TNL E + + +Y R +IE F+ +K L ++ S I L+ A++ Sbjct: 269 TNLLDE--PAEMIAQLYKSRWEIELFFKWIKQR---LKIKKILGKSENAVKIQLITAIIA 323 Query: 329 QLTCWLA 335 L +L Sbjct: 324 YLLVFLF 330 >UniRef50_B3VMZ1 Transposase n=6 Tax=Gammaproteobacteria RepID=B3VMZ1_KLEPN Length = 421 Score = 56.0 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 47/158 (29%), Gaps = 17/158 (10%) Query: 205 DMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP 264 S + T L P+ + + + + + + + +P Sbjct: 218 PASVNPMATCNQTGLC--QPLRSWLAVLPKHGELDLDVQWPDGPVYRCVLFASTDHKDKP 275 Query: 265 WILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLI 324 L TNL + Y R QIE F++ KS L T S + ++ Sbjct: 276 VCLCTNLDRHTFPAATVGEWYRLRWQIELLFKEWKSLN---SLNKFNTEYSTIAETLIWG 332 Query: 325 ALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTV 362 +L+ AQ+ + R VLS Sbjct: 333 SLLAATLKRWLINGAQQ------------KYRRVLSMF 358 >UniRef50_A7N4N2 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N4N2_VIBHB Length = 54 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 33/50 (66%) Query: 1 MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPT 50 M ++ IL ++ CP++H KRL SL LA A+LD LTLT++GR L T Sbjct: 1 MRDIQILQQTIENQCPDIHKKRLRSLMLATKAVLDGSNLTLTKIGRALST 50 >UniRef50_A6L0R8 Transposase n=13 Tax=Bacteroidales RepID=A6L0R8_BACV8 Length = 411 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 156 PLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 + + A ++ + + G ++++++ + Y + S L + H Sbjct: 203 IVALDRAYIDYAKFEELSRAGVIYVTKMKKNLVYEVSADTIYMTESGLMALRERHVTFTK 262 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 + +I+ Y + L TN Sbjct: 263 KVKDGDDIKHHARIVTY----------------------VDQKKRGAKLISLLTN--DME 298 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 + + +V IY KR +IE F+ +K LR+ S+ I + I L+ L + Sbjct: 299 MSAEDIVAIYRKRWEIELLFKQIKQ---NFPLRYFYGESANAIKIQIWITLIANLLLMVL 355 Query: 336 GVHAQK 341 ++ Sbjct: 356 KKRIKR 361 >UniRef50_A4J2U7 Transposase, IS4 family protein n=3 Tax=Desulfotomaculum reducens MI-1 RepID=A4J2U7_DESRM Length = 413 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 23/105 (21%), Positives = 41/105 (39%), Gaps = 5/105 (4%) Query: 231 LYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQ 290 L K K + T + + +P I+ TN + +++ +IY R Q Sbjct: 237 LIKKDHKVILGKDGTTKMQNPLRLIETEDTEGKPVIIITN--DFELSAEEISDIYRYRWQ 294 Query: 291 IEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 IE F+ +K ++H S + + L+IAL+ L Sbjct: 295 IELFFKWIKQH---FCVKHFYGLSQQAVENQLMIALITYCLMMLL 336 >UniRef50_C8W0R4 Transposase IS4 family protein n=4 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W0R4_DESAS Length = 467 Score = 54.9 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 40/320 (12%), Positives = 90/320 (28%), Gaps = 44/320 (13%) Query: 56 HNIKRIDRLLGNRHLHKE-RLAVYRWHASFICSGNTMPIVLVDWSDIR------------ 102 ++K R N + E V+R + + + D + Sbjct: 59 SDVKAAYRFFDNEKVTVEAIYDVHRKATIEKIKNQPVVLAIQDTTIFNYTLHRETKGLGP 118 Query: 103 ---EQKRLMVLRASVALHGRSVTL-------YEKAFPLSEQCS---------KKAHDQFL 143 L + +A V L + ++ E+ Sbjct: 119 IGQAGLSGFFLHSCLAASAEGVPLGILAHRLWVRSLEPKEKTHKKRPIEDKESVRWIDVT 178 Query: 144 ADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISN 202 ++A + T ++V D + + + L R + W + + Sbjct: 179 REVAETVSPFTKVVMVGDRESDIFDLFLLASANQYDILVRAAWNRRIDQSHDYLWPVVES 238 Query: 203 LHDMS-----SSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIY 257 + + + + + L + +G++ + + + Sbjct: 239 APVLGRTVINIPRADKRPEREAVVLTLQAATVTLKPPKHRGKEKLAAPTLNALLVQEQSP 298 Query: 258 SASAKE-PWILATNLP-VEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 K W+L T LP I Q + Y+ R +IE LKS G + + + Sbjct: 299 PEGEKPIEWMLLTTLPVTTIDDALQCLTWYTYRWRIERYHYILKS---GCQVEKLQLETK 355 Query: 316 ERFDIMLLI-ALMLQLTCWL 334 +R + + +++ WL Sbjct: 356 DRLMRAIAVYSMVASQLLWL 375 >UniRef50_A1WHR7 Transposase, IS4 family n=11 Tax=Proteobacteria RepID=A1WHR7_VEREI Length = 462 Score = 54.9 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 71/270 (26%), Gaps = 36/270 (13%) Query: 80 WHASFICSGNTMPIVLVD-WSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKA 138 H ++ + P+ + D R K RA V R YE+ Sbjct: 102 LHPTYAVTPEREPLGITDARMWARTPKADDGTRAGVRESLRWSEAYERIGE--------- 152 Query: 139 HDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGW--YWLSRVRGKVQYADLGAEN 196 +A L + V + LG WL R + G++ Sbjct: 153 -------MAQTLTDTRLVCVGDREADIVEMMRRARDLGHPADWLIRSKHNRTL-PDGSKL 204 Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 W LG T + + + + + R + Sbjct: 205 WAQTMEDAP--------LGEIEFTLAARTAEAARVVRQQIWARALNIPDGAGGQLTVTCV 256 Query: 257 YSASAKEP-------WILATN-LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLR 308 + P W L TN + + + ++ Y R +IE F LK+ L+ Sbjct: 257 VAKETNPPAGCKAVQWHLLTNRMASDFAEVVEWIDWYRCRWEIETFFNVLKNGCRVEALQ 316 Query: 309 HSRTSSSERFDIMLLIALMLQLTCWLAGVH 338 + E + ++ G Sbjct: 317 LGSVAKLELALAVYMLVAWRLARLVRLGRT 346 >UniRef50_Q6MCG2 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCG2_PARUW Length = 91 Score = 54.5 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 1/82 (1%) Query: 26 LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWH-ASF 84 +T L KT+ L+EL L +KA+ N KRI R + V Sbjct: 1 MTNLLLGLFIVKTVNLSELATVLYSKAKIDSNFKRIQRFFNWLTSLNDYQEVITDLVIII 60 Query: 85 ICSGNTMPIVLVDWSDIREQKR 106 + N + +D +D + K+ Sbjct: 61 LDLKNKKNDLALDRTDWKFGKK 82 >UniRef50_Q04V25 Transposase, ISLbp1 n=29 Tax=Leptospira RepID=Q04V25_LEPBJ Length = 423 Score = 54.5 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 41/237 (17%), Positives = 79/237 (33%), Gaps = 27/237 (11%) Query: 128 FPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGK 186 SE+ H + L ++++ N +++ D G+ + + G +++ R + Sbjct: 149 HRTSERSMALHHIEKLRSISAL--QNKKLILLFDKGYPSMELIGKLMANGIHFIIRSNTR 206 Query: 187 VQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTR 246 A +K + + L + K K +T+ Sbjct: 207 WLKEAKIAGEYKEYDKVKN-------ILITNNMLKKKEWL-------------KEYANTK 246 Query: 247 THCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLG 306 + + I T+LP + + +V +Y KR IE FR K Y L Sbjct: 247 GNLFSLRFVGSRYKDGQVGIFVTDLPDSEFSREDIVFLYGKRWNIETHFRFEK---YSLE 303 Query: 307 LRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVR 363 L + +S RF ++ L AQ+ +D+ Q V+ + R Sbjct: 304 LENVAPKTSIRFLQEYYAKILTFNLASLLIQEAQE-EYDQSIQNKKVKTKYDYKINR 359 >UniRef50_Q18HG4 Tn5-like transposase n=1 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18HG4_HALWD Length = 476 Score = 54.5 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 87/272 (31%), Gaps = 29/272 (10%) Query: 90 TMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASI 149 ++D + E ++ + +G++ + + E+ S+ Sbjct: 153 HRMTGVIDQQPLIEDQQADEKYDA---NGKAEPI--QLDSEHEKWSRGDRQA-----RDW 202 Query: 150 LPSNTTPLIVSDAGFKVPWY-----KSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLH 204 L + PL + D G + + +E G ++ R + E K Sbjct: 203 LADDIRPLFIHDRGADSFAFYEGVTREMENAG--FIIRASQNRRIWTDDGEPGKLFDWSS 260 Query: 205 DMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP 264 D++ KT+ ++ + ++ + + R + + + E Sbjct: 261 DLAEQGRKTIEIQQGGGREARTAELSIATGTCELRAPKNNPEQEGSIEVNVVRVDEVGED 320 Query: 265 -----WILATNLPVEIRTPK-QLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 W+L T VE L++ Y R +IE+ + LKS G + + ER Sbjct: 321 DDPIQWVLLTTESVEEFEETLTLIDYYGLRWRIEDWHKVLKS---GCNIEERQLQIWERM 377 Query: 319 DIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA 350 +++L M + W + D Sbjct: 378 EVLL---SMYSVIAWKVLELRELARDDSSVSP 406 >UniRef50_C7RPQ7 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RPQ7_9PROT Length = 761 Score = 54.1 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 29/261 (11%), Positives = 68/261 (26%), Gaps = 13/261 (4%) Query: 83 SFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQF 142 + S P+ ++D HG+S E Sbjct: 409 TVAFSVEGTPLGVLDAQCWARDPD---------EHGKSEERKHLPIEEKESMKWLNSFAR 459 Query: 143 LADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRG-KVQYADLGAENWKPIS 201 +A++ ++ P + + + VR + + + E Sbjct: 460 VAEVQALCPETLLVSMGDRESDIHDLFALAARDPAGPKLLVRAERTRQRRVENEALWDFI 519 Query: 202 NLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASA 261 + + + + + + + + + ++ R + Sbjct: 520 SRQSPAGEITLHIPRRGNRPKRTVVLSVRFAEVTLQPPRDSRLPAVELWAVHLYEENTDD 579 Query: 262 KEP--WILATNLPVEIRT-PKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 EP W+L T +PV + Y+ R IE R LKS + + + Sbjct: 580 PEPIEWMLLTTVPVNTFDDAVERAEWYAARWGIEVFHRTLKSGCRIKDRQLGTATRLQAC 639 Query: 319 DIMLLIALMLQLTCWLAGVHA 339 + ++ + G Sbjct: 640 LGIDMVVAWRIYHLTMLGREV 660 >UniRef50_B0JGI1 Transposase n=37 Tax=Bacteria RepID=B0JGI1_MICAN Length = 475 Score = 54.1 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 30/223 (13%), Positives = 69/223 (30%), Gaps = 21/223 (9%) Query: 139 HDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENW 197 L++ +P + + + D + + + L R + L + Sbjct: 169 WLDSLSETQQQIPEDIQVVTIGDCEADIFDLFAQSRSPNSHLLIRGTHNRKVNYLEDKQR 228 Query: 198 -------KPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCH 250 ++ ++ + + + KR ++ + + + + ++ Sbjct: 229 SGHSEPKYLHQSIREIKACGTLDVQVKRNPNHEARLAKLTVRFASFEIQVPSHHSKATPR 288 Query: 251 HPSPKIYSASAKE---------PWILATNLPVEIR-TPKQLVNIYSKRMQIEETFRDLKS 300 P + +E W+L T+L + + V YS R IE LKS Sbjct: 289 QPVKLQVILAEEENPHSGVNPISWLLLTSLDISSFESAITCVRWYSYRWLIERYHFVLKS 348 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQG 343 G GL + + R ++ L ++ A+ G Sbjct: 349 ---GCGLEKLQLETGRRIEMALATYSIVAWRLLWLTYQARLHG 388 >UniRef50_UPI00017465B5 InsL n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017465B5 Length = 382 Score = 53.7 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 64/209 (30%), Gaps = 13/209 (6%) Query: 157 LIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 ++D GF + V + G + + R+ + + + G Sbjct: 169 CFLADRGFSHLLGIEHVYRGGAHVIMRLNEQNTPLEDEQGRPVVLLPWLRKLKQPGAAAG 228 Query: 216 -------YKRLTKSNPISCQILLYKS--RSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI 266 K + + ++ + + ++ R + + WI Sbjct: 229 LDLWVRPRKEDSLEKRVPVRLCAVRKSVEAAALAQRKVQRRAQQDQTKLRAATLEHTAWI 288 Query: 267 LA-TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 + T +P + + +++ Y R QIE F+ LKS L S SS + Sbjct: 289 VVLTTVPRDTLSDVEVLQWYRVRWQIELAFKRLKSLGDVGHLPKSDERSSRA--WVYAKL 346 Query: 326 LMLQLTCWLAGVHAQKQGWDKHFQANTVR 354 L+ L+ + A W + N Sbjct: 347 LIALLSEKMQRHAAALSPWGGRWLENETP 375 >UniRef50_B0TD95 Transposase, is4 family n=3 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TD95_HELMI Length = 441 Score = 53.3 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 49/145 (33%), Gaps = 6/145 (4%) Query: 157 LIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 +++ D G+ + + Y++SR++ L + + K L Sbjct: 191 ILLFDLGYFSFKHFGKIMNEKGYFVSRLKSNSNPLILRSLIQHRGRTIAV----EGKRLL 246 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 + + I +L + + + + K+ TNLP E Sbjct: 247 DIKGSLRREIIDFEVLVSNSQSSNMDLVKRTALQLRVVGILNEET-KDYHFYITNLPAER 305 Query: 276 RTPKQLVNIYSKRMQIEETFRDLKS 300 + + +Y R IE F++LKS Sbjct: 306 FPAEDIATLYRARWTIELLFKELKS 330 >UniRef50_A6DTQ2 Putative transposase insL for insertion sequence IS186 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTQ2_9BACT Length = 375 Score = 53.3 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 54/185 (29%), Gaps = 7/185 (3%) Query: 156 PLIVSDAGFKVPWYKSVEKLGWYWLSRVRGK-VQYADLGAENWKPISNLHDMSSSHSKTL 214 I + V G Y L R + +K +S L + Sbjct: 174 VFIGDRVYPRRNGIIHVHSNGGYILCRFPPSLTPLHNDNGTPFKLLSKLRKLKLGDIGEY 233 Query: 215 GYKRLTKSNPISCQILLYKSRS----KGRKNQRSTRTHCHHPSPKIYSASAKEPWILATN 270 I+ ++ K K +K + +IL Sbjct: 234 NVVIKHNEGQINARVCAMKKDHESTLKAQKAIHRKASKNSRKGSTRPETLEYAGYILILT 293 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQL 330 E +P++++NIY R QIE F+ LKS L + + + L+ L Sbjct: 294 TLAESVSPEKILNIYRSRWQIELLFKRLKSIIGAAPL--YKKNDIGMRSWLAGKILVATL 351 Query: 331 TCWLA 335 ++ Sbjct: 352 IEYII 356 >UniRef50_B6FLV1 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FLV1_9CLOT Length = 135 Score = 52.9 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 38/106 (35%), Gaps = 3/106 (2%) Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 + H + + + TNL E K++ +Y R IE +FR+LK Y Sbjct: 8 NPFYTLHFRVLRFPITESTMECIITNLEEEDFPMKEIKKLYEWRWGIERSFRELK---YT 64 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA 350 +GL + E + L++ C Q + +A Sbjct: 65 IGLTNFHAKKVEYILQEIFARLIIYNFCERIITKIVIQQKKSYAKA 110 >UniRef50_UPI0001C15C40 hypothetical protein CRC_03218 n=1 Tax=Cylindrospermopsis raciborskii CS-505 RepID=UPI0001C15C40 Length = 233 Score = 52.9 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 36/120 (30%), Gaps = 27/120 (22%) Query: 33 LLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMP 92 + + + L++L P + + + + R LG L + ++ + + P Sbjct: 3 IQAHRQVKLSKLASLFPQPIKYESRKRNLQRFLGINKLCIK--LLWFLLIKYWIRQSLTP 60 Query: 93 ----------------------IVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPL 130 +V +D + + + MV ++ LY + Sbjct: 61 QQLNREQRRFFHKKQYQKYGYWMVALDRTQWKGRNIFMVTF---VWGTHALPLYWETLNH 117 >UniRef50_C6JEA3 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JEA3_9FIRM Length = 329 Score = 52.2 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 59/196 (30%), Gaps = 16/196 (8%) Query: 102 REQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASI-LPSNTTPLIVS 160 R ++ + R + + + + F + + S +P + + Sbjct: 139 RGFNQIHINAMFSLFDKRFTDILVQPARK-----RNEYSAFCSMVDSADIPEHYKVIFFG 193 Query: 161 DAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRL 219 D G+ + V + G Y+L R K +G P+ L S L + Sbjct: 194 DRGYTSYNNFAHVIEKGQYFLIRCNDKRASGMMG----YPVDTLPAFDEDISLILTRSKA 249 Query: 220 ----TKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 ++ S +Y++ N + T +ATNLP E Sbjct: 250 VSKYSRPELFSSYRYIYQNAPMDYLNDQRTEYDL-ALRLLRIQLDDGSYENIATNLPEEE 308 Query: 276 RTPKQLVNIYSKRMQI 291 + +Y R I Sbjct: 309 FKAEDFKALYHLRWGI 324 >UniRef50_A9DPK2 Transposase n=8 Tax=Shewanella benthica KT99 RepID=A9DPK2_9GAMM Length = 269 Score = 52.2 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 69/203 (33%), Gaps = 23/203 (11%) Query: 155 TPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTL 214 T L++ F + + +K G + + R GK+ I D + L Sbjct: 33 TLLLMDAGYFNIDYCYQADKHGGHVIMRTNGKIN---------PDIKAAFDSQGLAIEGL 83 Query: 215 GYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVE 274 K+L + QI+ + + H + + L TNL Sbjct: 84 IGKKLKQLKWHREQII--------DLDVQWKSKPGTHRLIAFWDRNKSAIGYLITNLKRA 135 Query: 275 IRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWL 334 + ++ +Y R QIE F++LKS + GL+ T + ++ +++ L Sbjct: 136 QFSADKVSKLYGLRWQIELFFKELKSYS---GLKTFNTRDKSIAESLVWASMLTLLLKRF 192 Query: 335 AGVHAQKQGWDKHFQANTVRNRN 357 A+ G +T + Sbjct: 193 I---ARASGLIHQVTISTQKVAR 212 >UniRef50_B8FXQ3 Transposase IS4 family protein n=8 Tax=Desulfitobacterium hafniense RepID=B8FXQ3_DESHD Length = 414 Score = 51.4 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 36/254 (14%), Positives = 75/254 (29%), Gaps = 43/254 (16%) Query: 98 WSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPL 157 W+ R+ K + + ++ ++ P L ++ + L Sbjct: 143 WAVFRKIKAGVKMHLRLSFDEMAIPDEVIITPAKTAD--------RKKLDELIVVDKDAL 194 Query: 158 IVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGY 216 + D G+ + + +++R++ G E P+ + LG Sbjct: 195 TIFDRGYIDYLLFDEYCEKEIRFVTRLKNNAVIEFTGVER--PVEEEGSIEEDVDIILGT 252 Query: 217 KRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIR 276 + + + EP+ + TN Sbjct: 253 GTRKMKHTL---------------------------REVTIDDNVNEPFTILTN--DFDL 283 Query: 277 TPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAG 336 + ++L +Y R QIE F+ LK A ++H +S + + LM T L Sbjct: 284 SAEELGEVYRYRWQIELFFKWLKQHA---QIKHFYGTSEAAVINQIRLDLMTYCTLILLK 340 Query: 337 VHAQKQGWDKHFQA 350 + + Q Q Sbjct: 341 LEVEHQRDLLTLQR 354 >UniRef50_Q1Q5J6 Putative uncharacterized protein n=6 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q5J6_9BACT Length = 367 Score = 51.4 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 46/161 (28%), Gaps = 8/161 (4%) Query: 148 SILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKP----ISN 202 P I++D G+ + G Y RV + + P I Sbjct: 156 RQFPMKKDDYIIADRGYCTGQGIHHATRKGAYLSVRVNSQSLRIFGEEKKPFPLLKEIQY 215 Query: 203 LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSP---KIYSA 259 L + S + + + + + + + + K K + Sbjct: 216 LKRPLAIKSWNVFIPNVDNTEYVKGSLCIIRKTEEAIKIAHKKLKRHASKKGIELKPETL 275 Query: 260 SAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 + I+ T P T ++ Y R QIE F+ K Sbjct: 276 IYAKYVIVFTTFPENQFTAFDILEWYRVRWQIELVFKRFKQ 316 >UniRef50_B7KKS2 Putative uncharacterized protein n=3 Tax=Cyanothece sp. PCC 7424 RepID=B7KKS2_CYAP7 Length = 457 Score = 51.4 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 56/317 (17%), Positives = 99/317 (31%), Gaps = 47/317 (14%) Query: 32 ALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTM 91 AL L L + G+ L T + ++KR L N E++ + + + N Sbjct: 25 ALYIGDCLRL-KYGQALSTVFKNAGDLKRTYEFLANPKTSFEKVVEPSHYQTAKETKNLP 83 Query: 92 PIVLV-DWS---------------DIREQKRLMVLRASVAL---HGRSVTLYE------- 125 I+ + D + I ++L S+A+ G+ + L Sbjct: 84 LILSIGDTTFLDYKNIKLKREDYGPIGNGGNGLILHTSLAVAPDSGQPLGLLWEKVWKRT 143 Query: 126 ------------KAFPLSEQCSKKAHDQFLADLASILPSNTTP---LIVSDAGFKVPWYK 170 K F E Q ++ + + P I G + Sbjct: 144 QKIKSGKKVNRAKVFEKKESYKWVEAIQKVSSIFQEVSEVEQPKVIHIFDREGDIAEVFA 203 Query: 171 SVEKL-GWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSH-SKTLGYKRLTKSNPISCQ 228 V++ +L R + W + N S T +KR + + + Sbjct: 204 EVQRAEKCSFLVRAAHNRSLNEEENYLWNYVQNQPVSFEREISLTNNHKRKKRIAHLEVR 263 Query: 229 ILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP--WILATNLP-VEIRTPKQLVNIY 285 K RS R + + + +I +EP W+L T + L+ Y Sbjct: 264 FCQVKLRSPQRLKETNGFKIYAVYAKEINPLDGEEPINWMLLTTEVIESPESANTLLRWY 323 Query: 286 SKRMQIEETFRDLKSPA 302 + R IEE + LKS Sbjct: 324 TYRWLIEEYHKILKSGC 340 >UniRef50_Q4V248 Transposase, n=5 Tax=Bacillus cereus group RepID=Q4V248_BACCZ Length = 140 Score = 51.4 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 25/118 (21%), Positives = 45/118 (38%), Gaps = 6/118 (5%) Query: 198 KPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIY 257 + + + + K + ++++Y+ K + +R + + Y Sbjct: 24 VSFMAMRQDPIRQTYEIKEAYIGKDQKLFTRVIIYRLTEKQIQERRKKQNYTESKKGITY 83 Query: 258 SASAK---EPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 S +K I TN P EI +Q+ + YS R QIE TF+ KS + H Sbjct: 84 SEKSKRLTGINIYVTNTPWEIVPMEQIHDFYSLRWQIEITFKTWKSL---FQIHHWHN 138 >UniRef50_C1P7N3 Transposase IS4 family protein n=5 Tax=Bacillus coagulans 36D1 RepID=C1P7N3_BACCO Length = 437 Score = 51.4 bits (121), Expect = 7e-05, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 89/259 (34%), Gaps = 33/259 (12%) Query: 140 DQFLADLASILPSNTTPLIVSDAGFKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAENWK 198 + + P TT + D+GF VP Y E+ ++ R++ Q L A+ + Sbjct: 206 RPLIKHYNEMFPE-TTLFLRGDSGFAVPGLYDLCEEESVLYIIRLKSNSQLQSL-AKEYH 263 Query: 199 PISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYS 258 P S D+S + + KS ++++ R G + Sbjct: 264 PSSAPLDVSKTETYYEETIYQAKSWSKPRRVIIQSVRPAGELFFTHS------------- 310 Query: 259 ASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 TN E+ P+ +V Y KR +E ++ K+ G H + + Sbjct: 311 -------FFVTNF--ELAFPQDIVRAYQKRGTMENYIKEAKN---GFYFDHMNSHAFLVN 358 Query: 319 DIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL---GMEVLRHSGYT 375 ++ +++ L+ +G K Q +T+R R + + ++ G + + Sbjct: 359 EVKMMLTLLAYNLTNWLRTLCFPEG-QKTMQIDTIRTRLIKAASKVVKSGRSLYFKLSSS 417 Query: 376 ITREDSLV-AATLLTQNLF 393 ++ + + Sbjct: 418 FVYQNFFWDVLNRIQKLQL 436 >UniRef50_A8RFU1 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RFU1_9FIRM Length = 443 Score = 50.6 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 34/257 (13%), Positives = 79/257 (30%), Gaps = 16/257 (6%) Query: 89 NTMPIVLVDWSD-----IREQKRLMVLRASVALHGRSVTLYEKA-----FPLSEQCSKKA 138 N I+ D SD + ++ + S+ LY+ + +KK+ Sbjct: 116 NGYYILAQDGSDINLPFWHDDTQISYGQDSIVCQYHLNALYDCINHVFWESRIDLPTKKS 175 Query: 139 HDQFLADLASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENW 197 L D + +I +D G++ + ++ RV+ + + Sbjct: 176 EKSALIDFINHRNYPENSIITADRGYESYNLIAHCIENNQKFVFRVK-DIDTRSGIMTSI 234 Query: 198 KPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIY 257 D++ + + T K N + + + + + R + + Sbjct: 235 SLPDETFDITVTRTLTNLQTNEVKKNENNQFVFVPSTSVFDYLD-ACNRFYNLSFRIVRF 293 Query: 258 SASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSER 317 + + L TNL ++Y R E F LK + +G+ + + Sbjct: 294 KIADDKYETLVTNLDENEFGLSDFKDLYHLRWNEETAFYYLK---HAVGMLYFHCKKRQH 350 Query: 318 FDIMLLIALMLQLTCWL 334 + +++ L Sbjct: 351 IQQEIYASILFYNYANL 367 >UniRef50_B5ZZ25 Transposase IS4 family protein n=11 Tax=Rhizobium RepID=B5ZZ25_RHILW Length = 381 Score = 50.6 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 39/346 (11%), Positives = 101/346 (29%), Gaps = 61/346 (17%) Query: 13 QFCPELHLKRLNSLTLACHAL--LDCKTLTLTELGRNLPTKARTKHNI--KRIDR-LLGN 67 + + H++RL++ + L ++L E+ +L + + +++ + + R + Sbjct: 26 EHQADKHVRRLSTKSQLIALLYGQLAGAVSLREIVGSLESHSARLYHLGARPVSRSTFAD 85 Query: 68 RHLHKERLAVYRWHASFICSGNT-------MPIVLVDWS----------DIREQKRLMVL 110 + + A + + L+D S R + Sbjct: 86 ANGLRPSTVFAELFAQMVARAGRGLKRAIGEAVYLIDGSSLSLAGAGSQWARFSDQACGA 145 Query: 111 RASVALHGRS-VTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWY 169 + V + +Y P + + + + + W+ Sbjct: 146 KMHVVYDANAERPIYAAVTPANVND--------ITAAKEMPIEAGATYVFDLGYYDFGWW 197 Query: 170 KSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQI 229 + G +SR++ + + +++ + L + Sbjct: 198 AKLNAAGCRIVSRLKSHTKLTVSAEQ----------AANADAGILFDRI----------- 236 Query: 230 LLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRM 289 R+ + + + +N +++ +Y +R Sbjct: 237 ----GLLPQRQAKSRRNPMNRPVREIGVRIETGKVLRIFSN--DLTAPAEEIAALYKRRW 290 Query: 290 QIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 IE FR +K L +RH +S I + +AL+ L +A Sbjct: 291 AIELFFRWVKQT---LKIRHFLGNSENAVRIQVAVALIAYLLLQMA 333 >UniRef50_C6PFH6 Transposase IS4 family protein n=2 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PFH6_CLOTS Length = 398 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 29/251 (11%), Positives = 75/251 (29%), Gaps = 26/251 (10%) Query: 107 LMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKV 166 +L ++ + + + + + + ++ L +P + Sbjct: 143 HQLLTMMLSCGKVVLPCCIERYEKGGKSKIERICEMVSML--PIPKGPAYGLCDSWYINK 200 Query: 167 PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPIS 226 ++ + G++ + ++ G + ++ + + Sbjct: 201 KVIEAHFERGYHLIGALKTNRIIYPQG---------IRIQIKDFAQYIEKNEVHLVTVNG 251 Query: 227 CQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYS 286 +Y+ + C + + TN ++ T ++N YS Sbjct: 252 SNYWVYRYEGALNGIDNAVVVLCWPEKAFKNENALHA--FICTNTELDTET---ILNYYS 306 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWL-------AGVHA 339 +R IE FR K+ LGL + S++ D +L + + + C G Sbjct: 307 QRWPIEIFFRQTKN---NLGLNTYQVRSTKSIDRLLWLISLTYMYCTTSDDKYCKFGQGI 363 Query: 340 QKQGWDKHFQA 350 +K + Q Sbjct: 364 KKVRKEVQKQR 374 >UniRef50_A3ZNH0 Probable transposase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZNH0_9PLAN Length = 451 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 81/243 (33%), Gaps = 17/243 (6%) Query: 102 REQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLAS----ILPSNTTPL 157 +E+ + L G+ + K + D +A+ I S+ L Sbjct: 180 KEKGQWRFHALVHVLDGQ--PVASKLTEEPSAKGRAERDVLAEMIAADQIDIPQSDEGHL 237 Query: 158 IVSDAGF-KVPWYKSVEKLGWYWLSRVR---GKVQYADLGAENWKPISNLHDMSSSHSKT 213 + D G+ + + G ++ R+ GK+ E +PI + + + Sbjct: 238 FLMDRGYRSAELFNKIHTAGHDYICRLNRTDGKLLKPPKKGEVREPIQLPPLSAEAIAMG 297 Query: 214 LGYKRLTKSNPISCQILLYKSRSKGRKNQ--RSTRTHCHHPSPKIYSASAKEPWILATNL 271 + L + R + R + ++ +LAT L Sbjct: 298 IVADELITMGGNCGASKIGSDHPMRRIKLIPPADRPSSARQGRVRTDQTGRDELVLATTL 357 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLT 331 T +++V +Y R ++E FR LK LG + ++ + I L A++ L Sbjct: 358 MD--LTAEEIVRLYEHRWEVELFFRFLKQ---VLGCKKLLSAKTAGVQIQLYCAIIASLL 412 Query: 332 CWL 334 L Sbjct: 413 LAL 415 >UniRef50_Q647P2 Transposase n=1 Tax=uncultured archaeon GZfos9E5 RepID=Q647P2_9ARCH Length = 398 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 37/303 (12%), Positives = 92/303 (30%), Gaps = 49/303 (16%) Query: 39 LTLTELGRNLPTKARTKHNIKRIDRLLGNR---HLHKERLAVYRWHASFICSGNTMPIVL 95 L + ++ + + I + R N+ + Y + ++ Sbjct: 108 LVIDDILSQFNGISEELYRIAKKQRWFVNKVDLSIDIHDWMYYGDI-------DDEMVL- 159 Query: 96 VDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTT 155 + + A++ + R + KA P+ + + L ++ Sbjct: 160 --GTQPKNGTSYAYKFATINVVERGIRFTLKALPIGDYSEICGVVEELLKY-AMKKVKIR 216 Query: 156 PLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLG 215 + + + VP + +++L +++ + + + + EN + D + Sbjct: 217 SVYLDRGFYAVPIVRMLKRLSVHFIIQAQKSIGIKKVIEENKDKEVIVVDYKMKRKR--- 273 Query: 216 YKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEI 275 K+ + L+ + +K++R TNL V Sbjct: 274 -----KAPSGKEDVRLFIVPHRLKKDKRV---------------------CFVTNLDVNE 307 Query: 276 RTPKQLVNIYSKRMQIEETFRDL------KSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 329 K Y KR IE ++R K+ + +R S F + ++A ++ Sbjct: 308 ENAKDYAGNYRKRWGIETSYRVKKDAFRPKTTSKNYAIRLFFFLFSVSFYNLWVLASIVL 367 Query: 330 LTC 332 Sbjct: 368 GLV 370 >UniRef50_UPI0001C171A4 Putative transposase n=1 Tax=Raphidiopsis brookii D9 RepID=UPI0001C171A4 Length = 90 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 30/89 (33%), Gaps = 11/89 (12%) Query: 13 QFCPELHLKRLNS---------LTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDR 63 + P + K L S L + + L K + L L +P + + KRI R Sbjct: 2 KMLPLFYQKHLKSQLSLAEYLFLKILVNILQSIKNVNLERLANGVPLPIKFESRRKRIQR 61 Query: 64 LLGNRHLHKE--RLAVYRWHASFICSGNT 90 L +L E + + S + Sbjct: 62 FLSLPNLTIEKIWFPIIQEWLSIYFTNEK 90 >UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001905F7C Length = 367 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 7/169 (4%) Query: 158 IVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYK 217 I A K +V K G +L R G L + Sbjct: 171 IADRAHAKATDLAAVVKAGADFLVRAPSNYPRLLDGDGQLLERLALCREAGDKGVLDRSV 230 Query: 218 RLTKSN---PISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKE---PWILATNL 271 R+ ++ ++++ + R + S + E +L T+L Sbjct: 231 RIQDGKSKVEVAARVVILPLPPEAAAKARRAARRLAAKARYKPSEAGIEMAGYLVLLTSL 290 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI 320 + P++L + Y R QIE F+ +KS GL ++ + R I Sbjct: 291 NADDWPPERLASTYRLRWQIELAFKRMKSL-IGLEGLRAKDADLARLWI 338 >UniRef50_P55729 Putative transposase y4zB n=4 Tax=Rhizobiaceae RepID=Y4ZB_RHISN Length = 356 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 42/144 (29%), Gaps = 8/144 (5%) Query: 195 ENWKPISNLHDMSSSHSKTLGYKRLTKSNPI---SCQILLYKSRSKGRKNQRSTRTHCHH 251 W I+ + K+ ++ + I + R + Sbjct: 163 GWWTAIAEAKAFFVTRPKSNMGLKVVRQRRIKVAEGDGFTVIDDATVRLASKGDSKLPIP 222 Query: 252 PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSR 311 + + L TN R + +Y R QIE FR +K L +R Sbjct: 223 LRRLTVKRADGDTITLLTN--DRKRPAVAIAALYKGRWQIELLFRWIKQH---LKIRSFL 277 Query: 312 TSSSERFDIMLLIALMLQLTCWLA 335 ++ + L A++ +A Sbjct: 278 GNNDNAVRLQLFAAMIAYALLRIA 301 >UniRef50_B7KMB2 Transposase IS4 family protein n=2 Tax=Cyanothece RepID=B7KMB2_CYAP7 Length = 481 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 43/345 (12%), Positives = 89/345 (25%), Gaps = 50/345 (14%) Query: 48 LPTKARTKHNIKRIDRLLGNRHLHKE-RLAVYRWHASFICSGNTMPIVLVDWSDIREQKR 106 +P + + N H+ +A +R G+++ +V+ D +D+ Sbjct: 38 VPQTFEEASQAQAVYEFWSNPHVKPSQIIAAHRDATLLRIKGHSIILVIQDTTDLEFASL 97 Query: 107 LMV-----------------LRASVALHGRSVTLYEKAF-----------------PLSE 132 SV G + L ++ + E Sbjct: 98 ATRRGLGEISKQGVEGIKVHNVLSVTTDGVPLGLIKQIAWVRKKTRKGKGYEERKRKIEE 157 Query: 133 QCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYA-D 191 + S + + F + P + G S + G + L R Sbjct: 158 KESYRWLESFRETQELVPPEMEVVTVCDREGDIFELLASPRREGAHLLIRAAQNRNVKTS 217 Query: 192 LGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPIS----------CQILLYKSRSKGRKN 241 + + L + + + T + + ++ + N Sbjct: 218 TEQGEIQKLFTLLKSQEVVGEIVLDLQKTPRRKARKATIQVKYATVTLQVPSNKPPLKHN 277 Query: 242 QRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRT-PKQLVNIYSKRMQIEETFRDLKS 300 + K W+L T LP+ + + YS R IE LKS Sbjct: 278 EPVEVAAILAEEINPPPKEQKVSWLLLTTLPLNNVSDAFTYLKWYSLRWLIERYHYVLKS 337 Query: 301 PAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWD 345 G + + + ER L ++ ++ Sbjct: 338 ---GCKIEELQLETGERLLRALACYSIVAWRLLWLTYTSRLDPHQ 379 >UniRef50_C6JHT2 Transposase ISLbp1 n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JHT2_9FIRM Length = 424 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 46/176 (26%), Gaps = 24/176 (13%) Query: 143 LADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISN 202 + + + + +I+ P + + ++ R++ + Sbjct: 167 MERIPETIGNIPYIIIMDRGYPSTPAFIHMMDKDLKFIVRLKSSD--YKKEQSSLTENDQ 224 Query: 203 LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAK 262 L + S+ Y+ +R Sbjct: 225 LVKIKLDKSRIRHYEGT-------------------PDGERMKELGEISLRMVKILLENG 265 Query: 263 EPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 +LATNL +++ +Y R IE + LK+ L L + + Sbjct: 266 NLEVLATNLSQTEFHTEEIKELYHMRWGIETAYETLKNR---LQLENFTGTKPILL 318 >UniRef50_A3IS08 Putative uncharacterized protein n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IS08_9CHRO Length = 472 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 67/217 (30%), Gaps = 6/217 (2%) Query: 136 KKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAE 195 K HD LA + L+ A S +K G + R+ Sbjct: 188 FKTHDIKLARKLTDYLDAGDILLGDRAFCSYIDIYSWKKKGIDSVMRLHQGRLQKGKKRP 247 Query: 196 NWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPK 255 + + L + + + + K+ HC+ P Sbjct: 248 KYTVSPPFKKKKKTRKCPHDRLILWEKPKRKPKDISKEDFYSLPKDLVLREVHCYICIPG 307 Query: 256 IYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSS 315 + KE ++ T + ++++Y +R Q E R++K+ G+ + +T Sbjct: 308 FRT---KEIIVVTTLIDAIEYPSSDILDLYDQRWQAEVNLRNIKTT-LGMDILTCQTPEM 363 Query: 316 ERFDIMLLIALMLQLTCWL--AGVHAQKQGWDKHFQA 350 R +I + + L + AG + QA Sbjct: 364 VRKEIYVYLLAYNFLRSIMYDAGDIFNHKPIRLSLQA 400 >UniRef50_Q8KKS9 Putative insertion sequence transposase protein n=3 Tax=Rhizobium etli RepID=Q8KKS9_RHIEC Length = 319 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 48/162 (29%), Gaps = 16/162 (9%) Query: 168 WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISC 227 + + + LSRV + + ++T+ + Sbjct: 2 MWARLPDQRFDLLSRVMHDHALIGGSKLR----DVVETVRFCDTQTIELRERADRPARQA 57 Query: 228 QILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP---------WILATNLP-VEIRT 277 + L ++ R+ Q + + W+L T Sbjct: 58 TLCLRFGQATIRRPQNLREEGLPDGVRLSWVEVVEPDAPDGVEPLHWLLLTTHALSSATD 117 Query: 278 PKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD 319 Q+V Y +R IE+ FR +K+ G + S+ + R + Sbjct: 118 AWQIVAWYKQRWMIEQFFRVMKTQ--GFKIEDSQLQLAPRLE 157 >UniRef50_C4XGQ6 Putative transposase for insertion sequence element n=2 Tax=Desulfovibrio magneticus RS-1 RepID=C4XGQ6_DESMR Length = 376 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 73/215 (33%), Gaps = 37/215 (17%) Query: 146 LASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLH 204 +A +L ++V D G+ W++ + K G + ++R++ ++ + Sbjct: 176 IAKLLKLPKGSIVVFDRGYNDYTWFRHLCKSGVFLVTRLKSNARFRVIE----------- 224 Query: 205 DMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP 264 +T +T + I + K+ + R R T Sbjct: 225 -----RHRTDQATGVTSDHIIQVAV-GEKTMTLRRVGYRDQETGNRLD------------ 266 Query: 265 WILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLI 324 TN + + +IY +R Q+E FR +K L ++ +S + + Sbjct: 267 --FLTNHM--TLPARTIADIYKERWQVEIFFRFIKQ---NLKIKSFLGNSKNAVLSQVYV 319 Query: 325 ALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL 359 AL+ L ++ + RN N+L Sbjct: 320 ALIAYLLLAYQKFMSKIGLSLHYLARLVQRNCNIL 354 >UniRef50_C3KKH4 Putative transposase Y4ZB n=2 Tax=Rhizobium sp. NGR234 RepID=C3KKH4_RHISN Length = 493 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 45/149 (30%), Gaps = 8/149 (5%) Query: 195 ENWKPISNLHDMSSSHSKTLGYKRLTKSNPI---SCQILLYKSRSKGRKNQRSTRTHCHH 251 W I+ + + K ++ + I ++ R + Sbjct: 300 GWWTAIAAAKAVFVTRPKVNMALKVVRKRRITAAEGDGFTVLEDARVRLASKGDSKLPIG 359 Query: 252 PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSR 311 + + L TN R + +Y R QIE FR +K L +R Sbjct: 360 LRRITVKRADGDTITLLTN--DLKRPAVAIGQLYKGRWQIELLFRWIKQH---LKIRKFL 414 Query: 312 TSSSERFDIMLLIALMLQLTCWLAGVHAQ 340 ++ + +L A++ +A + Sbjct: 415 GNNDNAIRLQILAAMVAYALLRIATRLWR 443 >UniRef50_A5N5R2 Transposase n=6 Tax=Clostridium RepID=A5N5R2_CLOK5 Length = 205 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 52/150 (34%), Gaps = 20/150 (13%) Query: 143 LADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISN 202 + + ILP+ ++ + +E+ G +L R+ E Sbjct: 68 INAIRKILPNTNFIVVFDRGYLSIELIHFLEENGVQYLFRLSSND--YKKEREFMITEDE 125 Query: 203 LHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAK 262 + +L +NP +I K+ + + RS + + + Sbjct: 126 V-------------VKLMHTNPRLTKIK--KNHPEIVEELRSKKYTSSRIVLSKLPSGNE 170 Query: 263 EPWILATNLPVEIRTPKQLVNIYSKRMQIE 292 L TNLP + K++ N+Y KR +IE Sbjct: 171 --LALMTNLP-TEFSGKEIENLYFKRWEIE 197 >UniRef50_C5EN31 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EN31_9FIRM Length = 148 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 50/146 (34%), Gaps = 10/146 (6%) Query: 197 WKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 W L K L + C+ + + + R R Sbjct: 10 WTGGRTLSLPGQMPGKRLHPESE-PLYRYICKAVPFDLITDSRPEYR------MQLRVVR 62 Query: 257 YSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE 316 + + + TNLP + + +Q+ +IY E +FRDLK + +G + + S + Sbjct: 63 FQIAEGGYENIITNLPADEFSLEQIKHIYHLLWGQETSFRDLK---HTIGTENFHSGSPK 119 Query: 317 RFDIMLLIALMLQLTCWLAGVHAQKQ 342 + +L + L C + + + Sbjct: 120 YIEFEILCRMTLYNFCTIITMEVPIK 145 >UniRef50_Q0F098 ISGsu1, transposase n=6 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F098_9PROT Length = 383 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 69/226 (30%), Gaps = 44/226 (19%) Query: 157 LIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGY 216 + A W+ + + ++SR++ ++ + L D S G Sbjct: 184 YVFDRAYNDYAWFHDLTQRDIRFVSRMKRNAEFEVVATLPVSDDGVLEDQHIRLSSAKGR 243 Query: 217 KRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIR 276 K C + + + TN R Sbjct: 244 KECPTILRRICFVH----------------------------EEDGKKLVFITN--DLKR 273 Query: 277 TPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAG 336 + + +Y +R QIE FR +K L ++ +S I ++IA++ L +A Sbjct: 274 SAGAIAALYKQRWQIELFFRWIKQ---NLKIKRFIGTSENAVKIQIIIAMIAYLLLHMAR 330 Query: 337 VHAQK----QGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITR 378 Q + N ++ RN+ +E+L S R Sbjct: 331 RILPASRSMQQLARLVSVNLMQRRNL-------LELLADSPPPPKR 369 >UniRef50_B0C3Q4 Putative uncharacterized protein n=5 Tax=Cyanobacteria RepID=B0C3Q4_ACAM1 Length = 447 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 55/178 (30%), Gaps = 5/178 (2%) Query: 129 PLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKL-GWYWLSRVRGKV 187 P ++ S + + A + PS T + G ++ + L + R Sbjct: 106 PFEQKESYRWVEAMQAVEKIVSPSTRTIHVFDREGDIAEVFEQLNHLRNTGVVVRASHNR 165 Query: 188 QYADLGAENWKPISNLHDMSSSH---SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 + W+ + KT T + C ++ + ++ + Sbjct: 166 RLEQDPNRLWEKLEAQRVQLEYEIDLPKTKDRSAHTAKLVVRCCLVQLQRPTRLADSHPL 225 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQ-LVNIYSKRMQIEETFRDLKSP 301 + W+L T+ V Q ++ Y+ R ++EE + LKS Sbjct: 226 QVYAVYATEVDPPEDEDPVSWMLLTSEAVTTVEMAQTILRWYTYRWRVEEYHKILKSG 283 >UniRef50_Q64E61 Transposase n=1 Tax=uncultured archaeon GZfos14B8 RepID=Q64E61_9ARCH Length = 622 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 35/259 (13%), Positives = 82/259 (31%), Gaps = 18/259 (6%) Query: 98 WSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPL 157 W + + + + + K P + S + ++ + Sbjct: 324 WDHVNNKSVHCYKLYAAFELKSNYPVCFKIEPGNTSDSTMLV-EMCERAKKVVGKENIEI 382 Query: 158 IVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYK 217 ++ D GF ++++G + + + + + + K GY Sbjct: 383 VMFDKGFYNA----------KSFNKIKGDLTFNTPAKKYKTIMDAIAGIEPEKFKQTGYN 432 Query: 218 RLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRT 277 R ++ + K R K K ++ + TN + Sbjct: 433 RWISETRVALEGYDGKLRLIVVKKVEPRAKKDKETGEKSWTMEDV-YYSYLTN--NKTLG 489 Query: 278 PKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGV 337 +YSKR +IE F++L++ +R+ ++S + + + + + L Sbjct: 490 TIDAPKLYSKRWRIENFFKELRNH---WNIRNFPSTSLDAVRSHIALLFIQFMVLSLF-K 545 Query: 338 HAQKQGWDKHFQANTVRNR 356 H G ++ Q T+R R Sbjct: 546 HYVLGGEYRNAQLKTLRTR 564 >UniRef50_A6DSH7 Probable transposase n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH7_9BACT Length = 382 Score = 48.7 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 55/198 (27%), Gaps = 38/198 (19%) Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDA--GFKVPWYKSVEKLGWYWLSRVRGKVQYAD 191 + L + + V D G +++ +++ G + R+R K + Sbjct: 180 GNSCERKALLKMVQ------PGVMYVCDRYYGLDYSYFEELQQRGALFTIRIRNKPKLTV 233 Query: 192 LGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHH 251 + + S LG T Sbjct: 234 IKEYEITEKDRKEGVISDQLVYLGD----------------------------TDRELKP 265 Query: 252 PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS--PAYGLGLRH 309 A + +L T+ E + IY +R QIE F+ LKS L Sbjct: 266 IRLVRTGAFNDKEILLVTSEAPEKLNAAIISTIYRQRWQIEVFFKWLKSILGCRKLLAES 325 Query: 310 SRTSSSERFDIMLLIALM 327 S + + + ++ ++ Sbjct: 326 SNGVAIQMYSALIAAIML 343 >UniRef50_Q1PWW4 Putative uncharacterized protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PWW4_9BACT Length = 166 Score = 48.7 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 7/77 (9%) Query: 267 LATNLPVEIR-TPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD---IML 322 L T+LP + +V Y R QIE F+ LKS G + + ++ER + Sbjct: 2 LLTSLPADTFRQACLVVECYLCRWQIEIYFKVLKS---GCKIEERQLETAERIKPCIALY 58 Query: 323 LIALMLQLTCWLAGVHA 339 +I L + G Sbjct: 59 MIVAWRVLFVTMFGREC 75 >UniRef50_C5VJA1 Transposase domain protein n=15 Tax=Prevotella RepID=C5VJA1_9BACT Length = 405 Score = 48.7 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 71/210 (33%), Gaps = 36/210 (17%) Query: 132 EQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYAD 191 + S HD +L LP + T + A ++ + + G ++++++ + Y + Sbjct: 181 QLTSAATHDHYLLK-EVHLPKDATLTM-DRAYVDYAQFQRLTEEGVCYVTKMKKNLTYTE 238 Query: 192 LGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHH 251 L + + L + ++++ + +R Sbjct: 239 LSSVTYVSPDGLVTHTDKK-------------------IVFEKGEIRHQARRVE------ 273 Query: 252 PSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSR 311 ++S ++ + +L TN K L IY +R IE ++ LK L Sbjct: 274 ----LWSDNSHKSVVLLTN--NLELDVKDLEEIYKRRWAIESLYKQLKQ---NFPLHFFY 324 Query: 312 TSSSERFDIMLLIALMLQLTCWLAGVHAQK 341 S I + L+ L C + ++ Sbjct: 325 GDSVNAIQIQTWVVLIANLLCTVISRMIKR 354 >UniRef50_C6JFT0 Transposase family protein n=3 Tax=Clostridiales RepID=C6JFT0_9FIRM Length = 366 Score = 48.7 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 63/179 (35%), Gaps = 17/179 (9%) Query: 174 KLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYK 233 + G++ + ++ G + L ++++ S T L + + Y+ Sbjct: 176 QKGFHTIGALKTNRLLYPSGMK-----KKLSELAAERSTTHKGFDLVTVKKRNYYVYRYE 230 Query: 234 SRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEE 293 G +N ++ + A ++TN+ + +++++ Y R IE Sbjct: 231 GNLSGIENAVVLLSYPEKAFGNPKALRA----FISTNVS---LSTQEILSCYVCRWPIEI 283 Query: 294 TFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALM--LQLTCWLAGVHAQKQGWDKHFQA 350 FR K+ L L + SS+ L+ + G + + G+ + A Sbjct: 284 FFRQCKNH---LALDTYQIRSSKGIQRYWLLMSLTHYICVTGTGGYRSFQDGYHRICSA 339 >UniRef50_Q3A1U3 Transposase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A1U3_PELCD Length = 489 Score = 48.3 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 40/124 (32%), Gaps = 4/124 (3%) Query: 207 SSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWI 266 ++S K LG L + + N T + + Sbjct: 263 AASSLKKLGPDDLLITWERPKYAQILSYSKDAWANLPKKLTLRQIKVKVPHPGFRTRGFY 322 Query: 267 LATNL-PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIA 325 + T L + L +Y KR +E FRD+K+ +G+ R + + +L+ Sbjct: 323 IVTTLIDAARYPAEDLAELYFKRWDVELFFRDIKTT---MGMDVLRCLTPDMIRKEILMH 379 Query: 326 LMLQ 329 + Sbjct: 380 FIAY 383 >UniRef50_Q115Q8 Putative uncharacterized protein n=5 Tax=Trichodesmium erythraeum IMS101 RepID=Q115Q8_TRIEI Length = 77 Score = 48.3 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 9/77 (11%), Positives = 29/77 (37%), Gaps = 3/77 (3%) Query: 81 HASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHD 140 + S + ++++D + + + +L +VA ++ +Y K + Sbjct: 1 MVKNLFSFTSELVLILDRTQW---QNINILMITVAWKKTALPIYWKILSHKGASNLTEQK 57 Query: 141 QFLADLASILPSNTTPL 157 + + +L ++ L Sbjct: 58 SVIRPVLKLLKAHKIIL 74 >UniRef50_C3FBK7 Transposase for insertion sequence element IS231B n=3 Tax=Bacillus thuringiensis RepID=C3FBK7_BACTU Length = 180 Score = 48.3 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 58/148 (39%), Gaps = 10/148 (6%) Query: 239 RKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDL 298 R+ ++S + S I TN P E+ +Q+ + YS R QIE F+ Sbjct: 3 RRKKQSYTESKKGITFSEKSKRLTGINIYVTNAPWEVVPMEQIHDFYSLRWQIEIIFKTW 62 Query: 299 KSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTC-------WLAGVHAQKQGWDKHFQAN 351 KS + H +T ER + + L+ L C W + +K+ ++ Sbjct: 63 KSL---FQMHHWQTIKQERLECHVYEKLIAILLCFSTMFQMWQLLLMKKKRELSEYKAIY 119 Query: 352 TVRNRNVLSTVRLGMEVLRHSGYTITRE 379 +++ + ++ + ++ ++ + Sbjct: 120 MIKDYFLFQVIQNILRLVVKRRSWVSWD 147 >UniRef50_C7PAE4 Transposase IS4 family protein n=4 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PAE4_CHIPD Length = 412 Score = 48.3 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 60/208 (28%), Gaps = 30/208 (14%) Query: 140 DQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKP 199 D+ + +LP + + A K + W++RV ++ L Sbjct: 186 DRIIMPQLELLPGSIIA--MDRAYVNYKLMKEWTEKEITWVTRVTKSMKIKLLTRNR--- 240 Query: 200 ISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSA 259 K L K I ++ + IY Sbjct: 241 ----------------LKILHKRKGILKDWVIQLGNPLTEEKSPVQTAR----VISIYDR 280 Query: 260 SAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD 319 + K+ L TN TP + +Y KR IE F+ +K L + + Sbjct: 281 NTKKKIHLLTN--NFTYTPTTIRKLYQKRWAIEMLFKRIKQ---NSQLNNFLGENKNAIS 335 Query: 320 IMLLIALMLQLTCWLAGVHAQKQGWDKH 347 I L L+ L + ++G K Sbjct: 336 IQLWCTLIKDLLTKIVKDKLTEKGSKKW 363 >UniRef50_Q3M187 Putative transposase n=10 Tax=Nostocaceae RepID=Q3M187_ANAVT Length = 116 Score = 48.3 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 35/104 (33%), Gaps = 5/104 (4%) Query: 294 TFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV 353 F+D K+ Y L S +S +R ++ + + + WL G + Q + + Sbjct: 1 MFKDCKTGGYNL---ESSQASPDRLVRIIFLIALAMTSAWLQGQKIKLQRQQSYVCRSQE 57 Query: 354 RNR--NVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTH 395 + + S +G+ + + +N + Sbjct: 58 QGKTEKRHSNFWIGLYGFNWIVAWYGCQTWVEEMVSSIRNKQAY 101 >UniRef50_Q2NQE6 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NQE6_SODGM Length = 411 Score = 48.3 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 33/289 (11%), Positives = 78/289 (26%), Gaps = 6/289 (2%) Query: 62 DRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLV-DWSDIREQKRLMVLRASVALHG-R 119 R L N + R+ W AS G ++ + D +++ + + + R Sbjct: 51 YRFLRNDDVRWNRVMEPHWQASQARMGQHEVVLCLQDTTELNYNGQDIEGLGPLNYETQR 110 Query: 120 SVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYW 179 + L+ +++ + A + + W +S E++ Sbjct: 111 GLYLHPTYVVSTQREPLGVTNA--WSWARKFKDDEGVRGGITE--SIRWIESYERMAESA 166 Query: 180 LSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGR 239 + + + + L R + P S ++ + Sbjct: 167 AALPTTRHVCVGDRESDMIELMLCARNLGYPVDYLIRNRHNHALPGSGKLWDQVQAAPLL 226 Query: 240 KNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK 299 R + + ++ ++ L+ Y R +IE F LK Sbjct: 227 GRIRFELPRGRGRKTRQVEQEIRLQYLSISDGVGGKLEVSCLIARYRARWEIELFFLVLK 286 Query: 300 SPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHF 348 L+ + E + ++ G + + D F Sbjct: 287 EGCRVERLQLGNKNRLETALALYMVIAWRINRLMRLGRNLPELDADLLF 335 >UniRef50_C9C7H0 Transposase n=5 Tax=Enterococcus faecium RepID=C9C7H0_ENTFC Length = 373 Score = 48.3 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 72/217 (33%), Gaps = 33/217 (15%) Query: 134 CSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLG 193 + HD+ L ++ +V F + + G+++++R + + L Sbjct: 170 TNASEHDR--NHLEVLVDKTQATYVVDRGYFDYKLLDKLNRDGYFFVTRTKSNTKITILD 227 Query: 194 AENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPS 253 +++ + + N ++ + L +KG+K R Sbjct: 228 QIE---VADTTTRDGTIISDQQVILVGGVNHVTERFRLVTVLTKGQKILRM--------- 275 Query: 254 PKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTS 313 TNL +P ++ ++Y R QIE F+ LK L ++ + Sbjct: 276 --------------VTNL--FDVSPNEVADMYQARWQIELLFKHLKQ---NLTIKRLYSH 316 Query: 314 SSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQA 350 S + +++ L+ L ++ + + Sbjct: 317 SEQGAINQVILTLIATLLTYVIKIELNTTATLFQLKR 353 >UniRef50_C8W6S4 Transposase IS4 family protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W6S4_DESAS Length = 465 Score = 47.9 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 52/160 (32%), Gaps = 8/160 (5%) Query: 184 RGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQR 243 R K+ A G + + +L + + + K + ++ + Sbjct: 229 RKKLDMALQGFQKKINLRSLKTVEACENSLRTLLNGYKQIKSFVNV-VFSTNEHNSVIMN 287 Query: 244 STRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAY 303 T + + L TN + + +L+ Y +R QIE F+DLK Sbjct: 288 WTWDE----VALAHEEKLDGIFALLTNYDADRVSANKLIKKYRERNQIEVNFKDLKGL-- 341 Query: 304 GLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQG 343 L L ER + + + +A+++G Sbjct: 342 -LDLERIFLQLPERIEAYVFPKTLAYFVLAFLRWYAEEKG 380 >UniRef50_Q7ULM3 Probable transposase n=5 Tax=Planctomycetaceae RepID=Q7ULM3_RHOBA Length = 458 Score = 47.9 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 71/222 (31%), Gaps = 26/222 (11%) Query: 146 LASILPSNTTPLIVSDAGFKV-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLH 204 L +L + L V D G+ + S+ ++ R+R Y Sbjct: 232 LKRVLEEDR--LYVMDRGYAKFSLFNSIVASSSSYVCRLRDNTVYETTQELELTEGDRAA 289 Query: 205 DMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP 264 + S LG + ++P I L + R +N + + ++ Sbjct: 290 GVLSDTIVKLGGSSSSSNSP-DHPIRLIQIRCTPHQN------RTGGKARGSKAPNSDGI 342 Query: 265 WILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLI 324 +ATNL + + IY+ R IE FR K G H + ++ I + Sbjct: 343 LRIATNL--LNVPAEIIALIYAYRWTIEIFFRFYKQLMGG---DHLISHNANGIQIQVYC 397 Query: 325 ALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGM 366 +++ L L T R ++S G+ Sbjct: 398 SVIACLLINLWTGS-----------RPTKRTFEMISFYFQGL 428 >UniRef50_A5EC94 Putative transposase n=1 Tax=Bradyrhizobium sp. BTAi1 RepID=A5EC94_BRASB Length = 395 Score = 47.9 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 59/168 (35%), Gaps = 12/168 (7%) Query: 150 LPSNTTPLIVSDAGFKVPW--------YKSVEKLGWYWLSRVRGKVQYADLGAENWKPIS 201 L + + ++SD + + + + W +++ + + P + Sbjct: 109 LSAAQSITVISDRESDIYEHFVRRPPNVELLVRANWNRKIKLKSGTFTSQFAFVDGLPEA 168 Query: 202 NLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASA 261 ++ + + + + L + + T ++ S Sbjct: 169 ARFSVTIPAAPGRKER-TAELALRFSPVTLCRPHPSPAPDLPDTVRLTIVDVREVSSTHD 227 Query: 262 KEP--W-ILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLG 306 E W +L T++ + +++V++Y KR IEE FR LKS + + Sbjct: 228 GESIHWRLLTTHVVRSSKQARRIVDLYRKRWTIEEFFRTLKSAGFDIE 275 >UniRef50_Q10XV7 Putative uncharacterized protein n=2 Tax=Trichodesmium erythraeum IMS101 RepID=Q10XV7_TRIEI Length = 144 Score = 47.9 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 12/120 (10%), Positives = 33/120 (27%), Gaps = 3/120 (2%) Query: 90 TMPIVLVDWSDIREQK-RLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLAS 148 ++ + + K +L +A +G ++ + + + A Sbjct: 5 QKLVLSNYRTQWQVGKHTYNILMLGIAEYGLAIPIVGQMLSKKGNSQTAEGLDVIEKFAP 64 Query: 149 ILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSS 208 + + +D F + W + R+ G + + S +S Sbjct: 65 LFSYQRVGYLTADREFVGKQWFKYLSANWGFSIRIFHTGLIG--GGDQYFAFSWFLAVSP 122 >UniRef50_D0DW10 Transposase IS4 family protein n=5 Tax=Lactobacillus RepID=D0DW10_LACFE Length = 452 Score = 47.5 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 35/293 (11%), Positives = 80/293 (27%), Gaps = 34/293 (11%) Query: 57 NIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPI-----VLVDWSDIREQKRLMVLR 111 +R+ ++ + +E A + I + + WSD + Sbjct: 106 RRRRLALIIDDTLFSREYATQTELLARVFDHDKQLYIKGYRALTLGWSDANTFLPINFAL 165 Query: 112 ASVALHGRSVTLYEKAFPLSEQCSKKAHDQ-------FLADLASILPSNTTP-LIVSDAG 163 S + K ++ L + L + ++ D+ Sbjct: 166 MSSKKPQNVLGKSAKTTDQRTIAGRRRRQAQQKMNLVSLQLVKQALINGVPADYVLFDSW 225 Query: 164 FKVP-WYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKS 222 + P + + KLG + ++ + + L+ + +K Sbjct: 226 YSSPKMFYELTKLGLNGVGMLKRSSKVYYQYRGRQYSVKALYK----------RLQASKY 275 Query: 223 NPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLV 282 P + + A++ L P++++ Sbjct: 276 QPKQAYQYSCFVEA-------HVGNQKFKLRLVFVANRARQDDYLVLATTQLSLQPQEII 328 Query: 283 NIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLA 335 +Y++R QIE F+ K L L S+ S + L I ++ Sbjct: 329 QLYARRWQIENYFKVAKQY---LRLDKSQVQSYDGLCGHLAIVMIAYNLLAWQ 378 >UniRef50_UPI00016C4547 putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4547 Length = 372 Score = 47.5 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 48/328 (14%), Positives = 97/328 (29%), Gaps = 50/328 (15%) Query: 21 KRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERL-AVYR 79 +RL +L A +LP K + + +RL+G E + A +R Sbjct: 27 RRLATLADLLVA----------SGAESLPDKFADPADYRAFNRLVGRPEATHEAVTAPHR 76 Query: 80 WHASFICSGNTMPIVLV-DWSDIREQKRL-----------------------------MV 109 H +T ++++ D +++ R ++ Sbjct: 77 AHTRNRMRAHTGAVLVLHDTTELDYSGRALPRMGPIGNGHGTGWECHNSLAVDAGSGAVL 136 Query: 110 LRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLA----DLASILPSNTTPLIVSDAGFK 165 A+ LH R T + +++ ++ + L + P + VSD G Sbjct: 137 GLAAQILHRRPPTRSNRGETKAQRRQRQDRESRLWVTGLEAVGPAPDGRHWVHVSDRGSD 196 Query: 166 V-PWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNP 224 + ++ G ++ R R + A + L +S+ + + Sbjct: 197 TFEYLSALVTGGHRFVVRSRHD-RVRSDEATLHAHLRALSAVSAWSGEVRCGPHGGSTRT 255 Query: 225 ISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP--WILATNLP-VEIRTPKQL 281 + + +P W L T+ +++ Sbjct: 256 ADLSAAGVRVELPDPSGSAPALGVWALRVWEPNPPDGVDPVEWFLLTDRALDTAAGLREV 315 Query: 282 VNIYSKRMQIEETFRDLKSPAYGLGLRH 309 Y +R IEE + LKS L H Sbjct: 316 AGWYCQRPIIEEYHKALKSGCGVEELGH 343 >UniRef50_A7B2R8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A7B2R8_RUMGN Length = 441 Score = 47.2 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 80/254 (31%), Gaps = 30/254 (11%) Query: 142 FLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPIS 201 L + + P+ L YK E+ G ++ R++ + + + Sbjct: 208 ILDEYLNDYPTIQILLRGDSGFATPDLYKQCEENGTSYVIRLKENGILREKASHLVDELD 267 Query: 202 NLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASA 261 + + + + + K+ P Y+ R + + + + Sbjct: 268 EITRNNKVDYAVVYGEFMYKAGPWP-----YERRVVCKVEKPENQMVYMYT--------- 313 Query: 262 KEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSP-AYGLGLRHSRTSSSERFDI 320 + TN+ +P+ L+ Y KR ++E ++ KS + H+R ++ R + Sbjct: 314 ----FVVTNMDS---SPEYLIKFYCKRGRMENFIKESKSGFDFASVSSHARIVNANRLQV 366 Query: 321 MLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVR-----NRNVLSTVRLGMEVLRHSGYT 375 AL + W + + ++ + + S + ++ Y Sbjct: 367 H---ALAYNIFNWFRRLALSANMRKQRIDTVRLKLLKIAAKIIRSARYITFKLCSSCPYK 423 Query: 376 ITREDSLVAATLLT 389 ++L L Sbjct: 424 NEFYETLSNIGKLN 437 >UniRef50_C0AF19 InsL n=9 Tax=Opitutaceae bacterium TAV2 RepID=C0AF19_9BACT Length = 362 Score = 47.2 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 26/212 (12%), Positives = 58/212 (27%), Gaps = 11/212 (5%) Query: 98 WSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFL------ADLASILP 151 WS + E + + ++ + + S L L Sbjct: 105 WSRLPEGWTAVAVDSTTIEESGASGTDWRLHYAIGLPSLFCEQAELTDNKGGESLCRYKV 164 Query: 152 SNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHS 211 + + P + V L R + + S Sbjct: 165 RKGDLFLGDRNFCRAPQIRHVMDHQGAVLLRWHSTSLPLFDQQGHALDVPAWLAQLRSRQ 224 Query: 212 KTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSAS---AKEPWILA 268 + L + ++ + + + +R+ + + S + ++ Sbjct: 225 CSELPVFLKDGTAL--RLCALRVSPQAAQRERAKIRLSAKKNGRKPSCQCLCMADYIVVV 282 Query: 269 TNLPVEIRTPKQLVNIYSKRMQIEETFRDLKS 300 T+LP + ++ +Y R QIE F+ LKS Sbjct: 283 TSLPSSCLDSRGILQLYRLRWQIELAFKRLKS 314 >UniRef50_B1XIB9 Transposase n=2 Tax=Cyanobacteria RepID=B1XIB9_SYNP2 Length = 442 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 63/182 (34%), Gaps = 11/182 (6%) Query: 158 IVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYK 217 + + V + L R + D ++ +S H + + + Sbjct: 160 VGDRENDIYQEWVRVPNDQTHVLVRACRDRRLWDEQQSLYEYLSAQHCEGTYSVQVVADS 219 Query: 218 RLTKSNP------ISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP--WILAT 269 RL ++ + + + + ++ + ++ ++P W L T Sbjct: 220 RLGRTAREAWLAVRMTPVQIQRPDTVEAQDYPEKVQLYAVEAKEVNPPVGQDPIHWRLLT 279 Query: 270 NL-PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALML 328 V + Q++ Y R +IE+ F LK GL L ++ S + + ++AL + Sbjct: 280 THRVVSLEQALQVIEWYRWRWRIEQLFGTLK--RSGLDLESTQLESVSAIERLTVLALSV 337 Query: 329 QL 330 L Sbjct: 338 AL 339 >UniRef50_C9KS84 Transposase domain protein n=5 Tax=Bacteroidales RepID=C9KS84_9BACE Length = 407 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 3/84 (3%) Query: 259 ASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERF 318 K+P +++ + +V IY +R QIE F+ +K LR+ S+ Sbjct: 278 KKGKQPKLISLLTNDFDMELETIVAIYRRRWQIESLFKQIKQ---NFPLRYFYGESANAI 334 Query: 319 DIMLLIALMLQLTCWLAGVHAQKQ 342 I + + L+ L + ++ Sbjct: 335 KIQIWVTLIANLLLSVLQSTLTRR 358 >UniRef50_Q6MCG1 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCG1_PARUW Length = 112 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 13/112 (11%), Positives = 30/112 (26%), Gaps = 4/112 (3%) Query: 110 LRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGF-KVPW 168 + V G S+ L + + L+ L ++ +D F W Sbjct: 1 MTLGVNFKGISIPLAWISLGRAGNSKTLDR---LSVLKRVMDKININSFTADREFIGSEW 57 Query: 169 YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLT 220 ++ + RV+ Q + ++ K + + Sbjct: 58 FEFLINSKIPSYIRVKEDTQVLHTKGNYTVGLRDICKEIKCGKKKVFKATIH 109 >UniRef50_Q04TU6 Transposase, ISLbp10 n=3 Tax=cellular organisms RepID=Q04TU6_LEPBJ Length = 226 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 39/117 (33%), Gaps = 4/117 (3%) Query: 228 QILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLP-VEIRTPKQLVNIYS 286 I + K +K T H + W L TNL Q + Y+ Sbjct: 31 YIRILPPIGKKKKYPELNLTVIHAQEKGTPKNRKRIDWKLTTNLSVKTNLDAIQKIQWYA 90 Query: 287 KRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQG 343 R +IE + LKS S+ +++R ++ I +L + + + Sbjct: 91 LRWKIEVFHKILKSGCKA---EESKLRTADRLVNLISIYCILSWRIFWMTMMNRSTN 144 >UniRef50_A7HFH6 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HFH6_ANADF Length = 131 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 18/59 (30%), Positives = 29/59 (49%) Query: 306 GLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 GL +R R D LLI+ + + G + G D++ +ANTV+ R +L R+ Sbjct: 64 GLSATRIGDPGRRDRELLISAIAIALHTILGASGEAIGIDRYLKANTVKPRIILLLNRV 122 >UniRef50_Q9X6I5 Putative uncharacterized protein n=2 Tax=Bacillus thuringiensis RepID=Q9X6I5_BACTU Length = 118 Score = 46.8 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 37/88 (42%), Gaps = 9/88 (10%) Query: 279 KQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTC------ 332 KQ+ +YS R QIE F+ KS + H RT ER + L L+ C Sbjct: 2 KQVHELYSLRWQIEIVFKTWKSL---FDIDHCRTVKQERIECHLYGKLIAIFLCSSTMFK 58 Query: 333 WLAGVHAQKQGWDKHFQANTVRNRNVLS 360 + +KQ ++++ +R +S Sbjct: 59 MRQLLLQKKQKELSEYKSHWNDSRPPIS 86 >UniRef50_C9LFX6 Transposase domain protein n=14 Tax=Bacteroidales RepID=C9LFX6_9BACT Length = 424 Score = 46.8 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 29/248 (11%), Positives = 73/248 (29%), Gaps = 41/248 (16%) Query: 145 DLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLH 204 L + + + A ++ + + G ++++++ +QY + + Sbjct: 207 MLLPATLNRGDIIAMDRAYIDYAKFQQMTERGVVYVTKMKKNLQYTIEEDVMCQTPEGVM 266 Query: 205 DMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEP 264 + + S +I+ Y K + Sbjct: 267 QVRVQRVTFRKKLKGGSSIVHHARIVTYVDVQKRKLIS---------------------- 304 Query: 265 WILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLI 324 L TN P ++++IY KR IE F+ +K L++ S+ I + + Sbjct: 305 --LLTN--DMTSDPLEIMDIYHKRWAIELLFKQIKQ---NFPLKYFYGESANAIKIQIWV 357 Query: 325 ALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTIT--REDSL 382 L+ L + ++ + + + + VR+ + +D Sbjct: 358 TLIANLLLMIM-----RRRLIRSWSFSGLAT-----MVRITLMYYVDFYSLFNHPEKDWE 407 Query: 383 VAATLLTQ 390 + Sbjct: 408 ACLKEAAE 415 >UniRef50_A3H523 Transposase (IS4 family) protein (Fragment) n=1 Tax=Vibrio cholerae B33 RepID=A3H523_VIBCH Length = 371 Score = 46.8 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 67/211 (31%), Gaps = 8/211 (3%) Query: 141 QFLADLASILPSNTTPLIVSDAGFKVPWYKSVE-KLG--WYWLSRVRGKVQYADLGAENW 197 + + +A ++ +K + KLG + LSR+R + Sbjct: 115 EMVIQVAQHFAGVDIIIVCDSWFGNNGLFKPLRTKLGNFVHLLSRLRSNTVLYSIPKIGS 174 Query: 198 KPISNLHDMSSSHSKTLGYKRLT-KSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKI 256 S + + + + LY + + ++ Sbjct: 175 SKKPGRPKKYGSRLGSCAEMAAAFMAYASTYHVFLYGKYREVNAYSQIVMLKTLKCPVRV 234 Query: 257 YSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSE 316 K WI + + +Q++ Y R +IE F+++K +G S+T +++ Sbjct: 235 VWVFRKTQWIAIFS-TDLKLSVEQIIEYYGARWKIESGFKEIKQ---DIGSSKSQTRNAQ 290 Query: 317 RFDIMLLIALMLQLTCWLAGVHAQKQGWDKH 347 + ++M W+ G + +H Sbjct: 291 AVINHINFSIMAATIIWIYGSRLENIPERRH 321 >UniRef50_Q18EK5 Probable transposase (ISH8/ISH26) n=5 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18EK5_HALWD Length = 417 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 30/91 (32%), Gaps = 3/91 (3%) Query: 253 SPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRT 312 + E T L P ++NIY+ R IE FR+ K L + + + Sbjct: 309 RRIVLETPDGEEIEYLTTLASSEYDPIDVINIYTLRTVIEILFREWKQY---LNIENFHS 365 Query: 313 SSSERFDIMLLIALMLQLTCWLAGVHAQKQG 343 S L AL+ + +G Sbjct: 366 KSLNGVLFELFCALIGYMLVVWFRQRHPVKG 396 >UniRef50_B9XCY0 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XCY0_9BACT Length = 481 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 55/184 (29%), Gaps = 15/184 (8%) Query: 153 NTTPLIVSDAGFKVPW-YKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHS 211 LI+ D GF + G L R+ + + Sbjct: 207 RRRDLIIGDRGFSSYTNLALLLGRGVDCLFRLHQGKKVRHPRRSRLQ-----------RK 255 Query: 212 KTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNL 271 + LG ++ Q Y + + ++ ++ T L Sbjct: 256 QKLGPRQWLVQWKKPYQKPEYMRPKEWAAVPSEMQVRVFEVIVCTRGMRTRKLMLVTTLL 315 Query: 272 PVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLT 331 ++L +Y +R +IE +FRDLK+ LGL R S + + + L+ Sbjct: 316 DPVRYPVEELAELYLRRWEIELSFRDLKTT---LGLEVLRCQSPAMVEKEVWMHLIAFNL 372 Query: 332 CWLA 335 Sbjct: 373 LRRV 376 >UniRef50_C8NDH5 Transposase n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8NDH5_9GAMM Length = 159 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 17/118 (14%), Positives = 28/118 (23%), Gaps = 10/118 (8%) Query: 142 FLADLASILPSNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPI 200 L +D F W + + G + R + Q A G Sbjct: 1 MLELSRETFSEVHIETRYADREFIDNEWLAYLRQAGISYCIRCKENAQIAGKGGTRQALD 60 Query: 201 SNLHDMSSSHSKTLGYKRLTKSNPISCQ---------ILLYKSRSKGRKNQRSTRTHC 249 +L + + K LG L ++ KG + C Sbjct: 61 GHLQALKTGEGKRLGPVMLYGQQHYLEATRLSDGQLLVVCSDKEGKGIEAYGKRWQAC 118 >UniRef50_A6DKD2 ISPg4, transposase n=7 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DKD2_9BACT Length = 412 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 69/241 (28%), Gaps = 41/241 (17%) Query: 125 EKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVR 184 + + A++ + ++ A ++ G W++R + Sbjct: 174 YAIVKEANTHDSTEAKEMCANI-----KDGEIVVFDKAYVDFRHLYHLDSRGVNWVTRSK 228 Query: 185 GKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRS 244 + Y + TK N IS QI+ + G ++ Sbjct: 229 DNMVYDIIEERP-----------------------TKGNIISDQII----KLNGINTEKH 261 Query: 245 TRTHCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYG 304 + + I + + TN P + +IY R IE F+ LK Sbjct: 262 YSQNLRLVTANIEVDGKMKVLMFLTN--NLQWAPSSIASIYQSRWGIEVFFKQLKQ---N 316 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVL-STVR 363 L L + + AL+ + A + W F T R VL S Sbjct: 317 LKLADFLGHNKNAIQWQVWTALLTYVLLRFL---AFRSQWPHSFSRITTLIRGVLWSYFD 373 Query: 364 L 364 L Sbjct: 374 L 374 >UniRef50_UPI00016C58D0 Transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C58D0 Length = 208 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 66/210 (31%), Gaps = 10/210 (4%) Query: 158 IVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYK 217 + + + + G ++ R + + G + + + + G Sbjct: 3 VCDRGADTFEFLEQLVGAGRSFVIRSKSNRR-RVGGEGAGAKLHDHLRTFPARAGWWGQA 61 Query: 218 RLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKE---PWILATNLP-V 273 R ++ + + A + W L T+ Sbjct: 62 REGAGRSRPVKLQGCWATGTVPAPRGRGTVTVPVVRVWEVEIPAGDTGVEWFLRTDRAVG 121 Query: 274 EIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCW 333 +I + +Q+V+IY +R IEE + LK+ G G+ + + + I +L + Sbjct: 122 DIASMRQVVSIYQRRPIIEEYHKALKT---GCGVENLPHRTRAVLATAVGITSVLAVALL 178 Query: 334 LAGVHAQKQGWDKHFQANTVRNR--NVLST 361 A+ G A V R VLST Sbjct: 179 ELRDLARDPGRQDDPAAGAVGGRAVRVLST 208 >UniRef50_Q6MS13 Transposase IS1634BQ n=39 Tax=Mycoplasma RepID=Q6MS13_MYCMS Length = 557 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 47/334 (14%), Positives = 102/334 (30%), Gaps = 54/334 (16%) Query: 110 LRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPW- 168 + +A + L+ K FP F+ ++A I + I++D G V Sbjct: 229 IVIGMATDENGIPLHYKIFP-GNVADPNTLIPFMLEIADIY-EVNSVTIIADKGMSVNRN 286 Query: 169 YKSVEKLGWYWLSRVR--------------------------------GKVQYADLGAEN 196 + +E W ++ R + Sbjct: 287 IRFLESKNWKYIISYRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHF 346 Query: 197 WKPISNLHDMSSSHSKTL-------GYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHC 249 + I + ++ K K++ K N +SC L + + K + Sbjct: 347 RRQIISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPINKGAFYE 406 Query: 250 HHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLK-----SPAYG 304 ++ TN + K+++N+YSK+ QIE F+ LK P Y Sbjct: 407 LDIEKIQEDQKYDGYYVYETN--RTDLSVKEVINLYSKQWQIESNFKTLKGKLSLRPMYL 464 Query: 305 LGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTV-- 362 H F ++ + ++ + G+ + + + N ++ + Sbjct: 465 STWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGKSKITEHKV-INVIKEVKEIEVFVN 523 Query: 363 --RLGMEVLRHSGYTITREDSLVAATLLTQNLFT 394 ++ + + + + + LLT+ T Sbjct: 524 KQKIETIQVYNDELQESWQTYQILLELLTKEKVT 557 >UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltaproteobacteria RepID=A5GAF0_GEOUR Length = 439 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 41/282 (14%), Positives = 87/282 (30%), Gaps = 30/282 (10%) Query: 99 SDIREQKRLMVLRASVAL--HGRSVTLY-EKAFPLSEQ----CSKKAHDQFLADLASILP 151 S+ ++ VL A++ + L E P C + A +FL + P Sbjct: 147 SNGKKLYYQQVLGAALVHPDSRVVIPLAPEMIIPQDGATKNDCERNASKRFLPNFREDFP 206 Query: 152 SNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHS 211 ++ P + +++ ++ + ++ NL D + Sbjct: 207 RLPVIVVEDGLSSNGPHIRDLQQHNMRFILGAKP--------GDHPLLFENLTDAIKKKT 258 Query: 212 KTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSP-KIYSASAKEPWILATN 270 T + K+ I + N + + + W T+ Sbjct: 259 ATTFAQIDPKNPQIMHSYCFLNDTPLNQANPDLKVNFLVYEEHNAKTGKTQRFSW--VTD 316 Query: 271 LPVEIRTPKQLVNIYSKRMQIE-ETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 329 LP+ L+ R +IE ETF LK+ Y L + + +++ ++ Sbjct: 317 LPITEENAYILMRGGRSRWKIENETFNTLKNQGYNLE-HNYGLGKEHLSENFVMLMMLAF 375 Query: 330 -------LTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRL 364 L L ++ G + + RN+ ++ L Sbjct: 376 LVDQAQQLCSPLFQAALERAGSRRSL---WEQQRNLFNSFEL 414 >UniRef50_Q46731 Transposase for transposon Tn5 n=15 Tax=root RepID=TN5P_ECOLX Length = 476 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 55/184 (29%), Gaps = 9/184 (4%) Query: 158 IVSDAGFKVPWYKSVEKLGWYWLSRVRG-----KVQYADLGAENWKPISNLHDMSSSHSK 212 + + + ++ R + + +P + +S Sbjct: 186 VCDREADIHAYLQDKLAHNERFVVRSKHPRKDVESGLYLYDHLKNQPELGGYQISIPQKG 245 Query: 213 TLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLP 272 + + K+ P L +S K T W+L T+ P Sbjct: 246 VVDKRGKRKNRPARKASLSLRSGRITLKQGNITLNAVLAEEINPPKGETPLKWLLLTSEP 305 Query: 273 -VEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLT 331 + ++++IY+ R +IEE + K+ A G R + + M+ I + + Sbjct: 306 VESLAQALRVIDIYTHRWRIEEFHKAWKTGA---GAERQRMEEPDNLERMVSILSFVAVR 362 Query: 332 CWLA 335 Sbjct: 363 LLQL 366 >UniRef50_A3DKE5 Transposase, IS4 n=14 Tax=Clostridium RepID=A3DKE5_CLOTH Length = 405 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 76/251 (30%), Gaps = 27/251 (10%) Query: 99 SDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQF-------LADLASILP 151 SD + V+ + + S+ L K + + +KA F + + P Sbjct: 127 SDGKTMWSHCVVTSHYKISEYSLPLNFKLYLRKQFFGQKAKKLFKNKQELAMQLIDEFTP 186 Query: 152 SNTTPLIVSDAGF-KVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSH 210 T ++ DA + K G++ + R++ I Sbjct: 187 VTETTYLLVDAWYTSGKLMLHALKRGYHTIGRIKSNRVIYP--GGIKTNIKEF------- 237 Query: 211 SKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATN 270 + + +Y+ K + + C + S +I++T+ Sbjct: 238 ATHICSNETCIVTAGDDNYYVYRYEGKINDLENAVILICWSKK----ALSDTPAFIVSTD 293 Query: 271 LPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQL 330 + + T +V Y R IE ++R K+ LG + S + M Sbjct: 294 VSLTTST---IVGYYQNRWDIEVSYRYHKNS---LGFDEYQVESLTSIKRFWSMVFMTYT 347 Query: 331 TCWLAGVHAQK 341 L V ++ Sbjct: 348 FLELFRVSKKR 358 >UniRef50_C0VKK7 ISCja2 transposase n=8 Tax=Acinetobacter RepID=C0VKK7_9GAMM Length = 385 Score = 46.0 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 60/232 (25%), Gaps = 36/232 (15%) Query: 150 LPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSS 209 P + + W+ + +++R+R ++++ S Sbjct: 182 FPPGSIVVF-DKGYVDYQWFAEMTDRKVSFVTRLRP---------------KTVYEVKSK 225 Query: 210 HSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILAT 269 L K + R + + + S + Sbjct: 226 REVYACKGILADEYIELSSDYAKKRGAPKRLRRIEFYDVEKKRTFEFLSNNFH------- 278 Query: 270 NLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 329 + IY R ++E F+ +K L L+ S + IAL+ Sbjct: 279 ------LAASTIAAIYKDRWKVELFFKAIKQ---NLKLKSFLGRSRNAIQTQIWIALIAY 329 Query: 330 LTCWLAGVHAQK----QGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTIT 377 L A A + Q + Q N + + + + + + Sbjct: 330 LLVSFAKHMAHEGWTVQRLLRIIQVNLFERKLLKALFLPDKKWRKQEEPQLR 381 >UniRef50_A4BSI6 Putative transposase n=3 Tax=Nitrococcus mobilis Nb-231 RepID=A4BSI6_9GAMM Length = 620 Score = 46.0 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 38/131 (29%), Gaps = 3/131 (2%) Query: 193 GAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHC--H 250 G W + + S + + + ++ + + K+ R Sbjct: 459 GGGWWPTSMPRAAPACASSPSRAVAKWPRVARLAVRYASVTLKPPREKHTRPDVALYAVR 518 Query: 251 HPSPKIYSASAKEPWILATNLPVEIR-TPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRH 309 S W L T + EI + ++ Y+ R QIE R LKS Sbjct: 519 ATEIDPPSGVKPLAWTLLTTVATEIFADACERLDWYATRWQIEVYHRTLKSGCRIEDTPA 578 Query: 310 SRTSSSERFDI 320 + ++ Sbjct: 579 WQCAAPGELSC 589 >UniRef50_C5V7Z6 Transposase IS4 family protein n=3 Tax=root RepID=C5V7Z6_9PROT Length = 389 Score = 46.0 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 57/328 (17%), Positives = 103/328 (31%), Gaps = 52/328 (15%) Query: 20 LKRLNSLTLACHALLDCKTLTLTELG--RNLPTKARTKHNIKRIDRL---LGNRHLHKER 74 L R + L A L+ + L +G + N +R RL LG+R + Sbjct: 50 LTRRDGLRDLV-ACLNSQKSKLYHIGIRSKVSRSTLADANERRDWRLFEALGHRLISIA- 107 Query: 75 LAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQC 134 L +YR I G P+ +D + I L + + ++ L Sbjct: 108 LELYRD--EDIGLGLKEPLYAMDSTTIDLC--LTLFPWAEFRSTKAAVKAHTIIDLRGSI 163 Query: 135 SK-------KAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKV 187 K HD L D+ P+ T +I ++ + ++ R + + Sbjct: 164 PVFLSITTGKVHDVNLLDVIP-FPAGTIVVI-DRGYLHFARLYALHQRQVTFVIRAKNNL 221 Query: 188 QYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRT 247 ++ + + + K+ + C + + K + Sbjct: 222 RFT----------------------WIASREVDKATGLRCDQTILLATPKSKTAYPERLR 259 Query: 248 HCHHPSPKIYSASAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGL 307 + K L TN + NIY R QIE F+ LK L + Sbjct: 260 R----VSFRDPETGKHLVFL-TN--RFDLPALTIANIYKNRWQIELFFKWLKQ---NLAI 309 Query: 308 RHSRTSSSERFDIMLLIALMLQLTCWLA 335 +H +S + IA+ + L +A Sbjct: 310 KHFYGNSLNAVKSQIWIAICVYLLVSIA 337 >UniRef50_D1N6R0 Transposase IS4 family protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N6R0_9BACT Length = 435 Score = 46.0 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 66/211 (31%), Gaps = 13/211 (6%) Query: 146 LASILPSNTTPLIVSDAGFKVPWYKSVEKL---GWYWLSRVRGKVQYADLGAENWKPISN 202 +A N+ ++ P K V + LSR+R DL K Sbjct: 185 IAGKFTQNSILIVSDSWFCSAPLIKEVRAHISGSVHILSRLRVSAAIFDLPGPRVKKRGR 244 Query: 203 LHDMSSSHSKTLGYKRLTKSNPISCQILLY---KSRSKGRKNQRSTRTHCHHPSPKIYSA 259 ++ + +I +Y + S S C +Y Sbjct: 245 APRYGKRLPNVRELSSQLRNQARTAEIHVYGKEREISFSEIICMSKALKCRVKVIFVYYR 304 Query: 260 SAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFD 319 + P ++ T+L ++++ YS R +IE F+++K + +G S+ + D Sbjct: 305 NFAFP-LVTTDLS---LPAERMIEYYSARWKIESGFKEIK---HEIGSLDSQCRNLSAVD 357 Query: 320 IMLLIALMLQLTCWLAGVHAQKQGWDKHFQA 350 + L W+ +H Sbjct: 358 NHFQLCLFATSIAWVYASKLPLAPPRRHPTR 388 >UniRef50_Q2S608 Putative uncharacterized protein n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S608_SALRD Length = 108 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 10/72 (13%), Positives = 22/72 (30%), Gaps = 2/72 (2%) Query: 29 ACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKE--RLAVYRWHASFIC 86 L + +++ A + +++ R N H+ + V Sbjct: 1 MITGLHQAGHVHFSKVASERFGSATLESKTRQVRRFFSNEHVDPQCCYKPVAELLLKQAI 60 Query: 87 SGNTMPIVLVDW 98 + + VLVD Sbjct: 61 TSGSPIRVLVDT 72 >UniRef50_Q4C3L4 Putative uncharacterized protein n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C3L4_CROWT Length = 133 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 25/92 (27%), Gaps = 2/92 (2%) Query: 7 LHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLG 66 L F L + + + L K + + L P + K I R L Sbjct: 4 LAVYESHFTKHLTQTQFEAFRILLWLLTVHKQVRIERLAACFPLPILYQSRRKHIQRFLV 63 Query: 67 NRHLHKE--RLAVYRWHASFICSGNTMPIVLV 96 L V + ++ I+ + Sbjct: 64 LSALAIPRFWFPVIKAIICKEFKTGSLLIITI 95 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.316 0.119 0.285 Lambda K H 0.267 0.0364 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,836,734,844 Number of Sequences: 3077464 Number of extensions: 61580011 Number of successful extensions: 198972 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 274 Number of HSP's successfully gapped in prelim test: 405 Number of HSP's that attempted gapping in prelim test: 198055 Number of HSP's gapped (non-prelim): 736 length of query: 402 length of database: 1,040,396,356 effective HSP length: 131 effective length of query: 271 effective length of database: 637,248,572 effective search space: 172694363012 effective search space used: 172694363012 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 94 (41.0 bits)