BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (84 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 160 1e-38 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 100 2e-20 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 84 1e-15 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 83 2e-15 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 80 2e-14 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 76 4e-13 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 76 4e-13 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 75 6e-13 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 74 2e-12 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 73 2e-12 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 72 6e-12 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 71 9e-12 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 70 2e-11 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 70 2e-11 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 70 2e-11 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 69 6e-11 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 68 9e-11 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 68 1e-10 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 68 1e-10 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 67 2e-10 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 66 3e-10 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 65 5e-10 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 65 7e-10 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 65 8e-10 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 64 1e-09 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 64 2e-09 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 63 2e-09 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 63 4e-09 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 59 4e-08 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 59 5e-08 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 58 9e-08 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 58 1e-07 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 58 1e-07 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 56 3e-07 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 56 4e-07 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 56 4e-07 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 55 6e-07 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 55 8e-07 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 54 1e-06 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 54 2e-06 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 54 2e-06 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 53 4e-06 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 53 4e-06 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 52 4e-06 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 52 5e-06 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 52 7e-06 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 52 8e-06 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 50 2e-05 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 50 2e-05 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 50 3e-05 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 49 4e-05 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 47 2e-04 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 47 2e-04 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 46 4e-04 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 45 5e-04 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 45 6e-04 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 45 7e-04 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 44 0.002 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 44 0.002 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 43 0.004 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 43 0.004 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 41 0.010 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 41 0.013 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 160 bits (406), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 76/79 (96%), Positives = 76/79 (96%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 MELKKLMEHISI PDYRQ WKV HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVS 79 YGDFENGIPVHDTIARVVS Sbjct: 61 YGDFENGIPVHDTIARVVS 79 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 100 bits (249), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 48/80 (60%), Positives = 59/80 (73%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L L +H + D RQA KV +KL D+L L + AVISGAEGWE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSQ 80 YGDF +GIPVHDTIAR+V + Sbjct: 61 YGDFSHGIPVHDTIARLVCR 80 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 84.3 bits (207), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 41/77 (53%), Positives = 53/77 (68%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H +T D R H L DI+LL I AV+SG+EGWEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSQGK 82 GIP HDTIARV+ + K Sbjct: 67 AGIPRHDTIARVICRLK 83 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 83.2 bits (204), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 39/72 (54%), Positives = 49/72 (68%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+ +SI D RQ KV H L D+L L I AVISG EGWE+I+DFG LD+L++Y F Sbjct: 6 LINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRKYLPFS 65 Query: 66 NGIPVHDTIARV 77 GIP DTI+R+ Sbjct: 66 GGIPTDDTISRI 77 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 80.5 bits (197), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 40/74 (54%), Positives = 53/74 (71%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+E SI D RQ K+ H+L DIL L + AVI GAEGW+DIE+ G L++L++ G F+ Sbjct: 7 LVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFFK 66 Query: 66 NGIPVHDTIARVVS 79 GIPV DTIAR++S Sbjct: 67 KGIPVDDTIARIIS 80 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 75.9 bits (185), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 31/73 (42%), Positives = 50/73 (68%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+EH+++ + R H L D++ L I A++SGAEGW DIE +G++ +D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVV 78 NGIP T+AR++ Sbjct: 69 NGIPRRHTVARIL 81 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 75.9 bits (185), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 35/75 (46%), Positives = 51/75 (68%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++++E + D R A + H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 75.1 bits (183), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 32/74 (43%), Positives = 50/74 (67%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 + H D R H L D++ LT+ A++SGAEGW+DI+ FG++ LD+L+++ F+ Sbjct: 3 FITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAFK 62 Query: 66 NGIPVHDTIARVVS 79 G+PV DTIAR++S Sbjct: 63 EGVPVDDTIARIIS 76 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 73.6 bits (179), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 37/80 (46%), Positives = 50/80 (62%), Gaps = 4/80 (5%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S Y Q H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLSDPRAYNQK----HHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSQ 80 Y FE GIPV DTIARV+ + Sbjct: 57 YRPFECGIPVDDTIARVIKR 76 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 73.2 bits (178), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 37/80 (46%), Positives = 50/80 (62%), Gaps = 4/80 (5%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S Y Q H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLSDPRAYNQK----HHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSQ 80 Y FE GIPV DTIARV+ + Sbjct: 57 YRPFECGIPVDDTIARVIKR 76 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 72.0 bits (175), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 37/77 (48%), Positives = 49/77 (63%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H D R H L +I+LL I AV+SG+EGWE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSQGK 82 GIP HDTIARV+ + K Sbjct: 67 AGIPRHDTIARVICRLK 83 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 71.2 bits (173), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 29/74 (39%), Positives = 50/74 (67%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + +++H S D RQ+W+VV+ L +I LL +CA +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTI 74 + +E G+P HDT+ Sbjct: 77 FLPYERGLPAHDTL 90 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 70.5 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 30/68 (44%), Positives = 46/68 (67%), Gaps = 1/68 (1%) Query: 14 PDYR-QAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 PD R + +H L+DIL + CAVI+GAEGWEDI ++G + F +++ + +NG+P HD Sbjct: 13 PDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLELKNGVPSHD 72 Query: 73 TIARVVSQ 80 T RV ++ Sbjct: 73 TFYRVFTK 80 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 31/73 (42%), Positives = 46/73 (63%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L +H+S+ D R H L D+L L + AV SG +GW +I+ FGE L++L+++ F Sbjct: 3 LFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPFA 62 Query: 66 NGIPVHDTIARVV 78 NGIP TIAR++ Sbjct: 63 NGIPRRHTIARIL 75 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 39/65 (60%), Positives = 42/65 (64%), Gaps = 12/65 (18%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLMEHISI PDYRQAWKV HKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 LDFLK 59 LDFLK Sbjct: 55 LDFLK 59 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 68.6 bits (166), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 33/65 (50%), Positives = 43/65 (66%) Query: 15 DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D R+ WK+ H LSDI+LL A +SGAE W++IE FG+ + LK ENGIP HDT+ Sbjct: 18 DDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVLQLENGIPSHDTL 77 Query: 75 ARVVS 79 RV + Sbjct: 78 QRVFA 82 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 68.2 bits (165), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 28/67 (41%), Positives = 44/67 (65%) Query: 14 PDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDT 73 PD+R++ K ++ L ILL+ I +VI GA+ W ++E++ + +FL+ + D NGIP HDT Sbjct: 15 PDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLDLPNGIPSHDT 74 Query: 74 IARVVSQ 80 RV S Sbjct: 75 FNRVFSN 81 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 67.8 bits (164), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 31/73 (42%), Positives = 46/73 (63%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H+ I D R + H L D++ LT+ A++SGA GW+ IE FG LD+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVV 78 +GIP IA ++ Sbjct: 63 HGIPRRHCIANII 75 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 67.8 bits (164), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 31/56 (55%), Positives = 39/56 (69%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVS 79 HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NGIP HDT RV S Sbjct: 27 HKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNGIPSHDTFGRVFS 82 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 67.0 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 32/69 (46%), Positives = 44/69 (63%), Gaps = 1/69 (1%) Query: 12 ITPDYRQAW-KVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 + PD R+A +H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NGIP Sbjct: 20 LIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANGIPS 79 Query: 71 HDTIARVVS 79 HDT RV S Sbjct: 80 HDTFGRVFS 88 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 66.2 bits (160), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 31/75 (41%), Positives = 44/75 (58%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 + + H S D RQ KV + L +ILLLT+CAV+SGA W I +G L FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVS 79 +G P HD + + + Sbjct: 84 ADGTPSHDQLGNIFA 98 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 65.5 bits (158), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 31/79 (39%), Positives = 49/79 (62%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +H S D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVS 79 F +GIP DTIAR+VS Sbjct: 61 QKMFIDGIPADDTIARIVS 79 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 65.1 bits (157), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 28/70 (40%), Positives = 42/70 (60%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ H + D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPV 70 G G+PV Sbjct: 72 KGILTEGVPV 81 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 64.7 bits (156), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 28/80 (35%), Positives = 49/80 (61%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++EH S D R A ++ + L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 6 FASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQWI 65 Query: 63 DFENGIPVHDTIARVVSQGK 82 NG+P HDT V ++ K Sbjct: 66 ALPNGVPSHDTFEWVFARLK 85 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 64.3 bits (155), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 30/75 (40%), Positives = 48/75 (64%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H D R HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSQ 80 NGIP HDT ARV ++ Sbjct: 69 NGIPSHDTFARVFAR 83 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 63.5 bits (153), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 28/76 (36%), Positives = 49/76 (64%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVV 78 + +NG P HDT+ RV+ Sbjct: 61 ELKNGPPSHDTLRRVM 76 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 63.2 bits (152), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 28/76 (36%), Positives = 49/76 (64%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVV 78 + +NG P HDT+ RV+ Sbjct: 61 ELKNGPPSHDTLRRVM 76 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 62.8 bits (151), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 31/74 (41%), Positives = 45/74 (60%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +E S PD R H +I+ L + +V++GA+ + +IEDF E H+D+LK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVHDTIARVVS 79 NGIP HDT +RV S Sbjct: 65 NGIPSHDTFSRVFS 78 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 59.3 bits (142), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 1/66 (1%) Query: 14 PDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDT 73 PD R+ + H+L ++LL IC VISGAE W + + + LD+L+ Y + +GI HDT Sbjct: 16 PDPRRR-ECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPYAHGIASHDT 74 Query: 74 IARVVS 79 RV S Sbjct: 75 FGRVFS 80 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 58.9 bits (141), Expect = 5e-08, Method: Composition-based stats. Identities = 29/75 (38%), Positives = 44/75 (58%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L+E + D R K+ H+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARV 77 D GIP HDT RV Sbjct: 62 DLPGGIPSHDTFRRV 76 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 58.2 bits (139), Expect = 9e-08, Method: Composition-based stats. Identities = 27/79 (34%), Positives = 44/79 (55%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L+EH D R + H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSQGK 82 +GIP HDT RV + K Sbjct: 67 LRHGIPKHDTFNRVFAALK 85 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 57.8 bits (138), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 26/72 (36%), Positives = 40/72 (55%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H+ D R H + DI L + AVISGA+ W +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARV 77 NGIP +I R+ Sbjct: 61 NGIPSQQSIGRI 72 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 57.8 bits (138), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 24/79 (30%), Positives = 43/79 (54%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M K L++++ PD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVS 79 + GIP HDT R+ + Sbjct: 61 WLSLPGGIPSHDTFNRIFA 79 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 56.2 bits (134), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 28/79 (35%), Positives = 49/79 (62%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++E+ + D R+ H L D+L++ + AVI+GA+G I + E H+++LK + Sbjct: 13 ILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSRLELP 72 Query: 66 NGIPVHDTIARVVSQGKIT 84 +G+P HDTI R+++Q K T Sbjct: 73 SGVPSHDTIGRLLAQLKPT 91 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 56.2 bits (134), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 26/69 (37%), Positives = 39/69 (56%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 ++ D RQ+WK+ + LS IL L ++G E +++EDF E + Y D G P Sbjct: 15 AVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPS 74 Query: 71 HDTIARVVS 79 HDT+ RV+S Sbjct: 75 HDTLERVIS 83 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 55.8 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 47/78 (60%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +L + E PD R + H LS++L + +CAV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVS 79 + G+P HDT RV++ Sbjct: 65 LKLKAGVPSHDTFCRVLA 82 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 55.5 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 23/67 (34%), Positives = 42/67 (62%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ L+EH S D R +++H L +ILLL +C ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIP 69 + +G+P Sbjct: 72 PYAHGVP 78 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 54.7 bits (130), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 23/54 (42%), Positives = 37/54 (68%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARV 77 H L D+L + I AVI+G++GWED+E++G ++L ++ + +GIP DT RV Sbjct: 49 HLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLELPHGIPSDDTFRRV 102 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 53.9 bits (128), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 27/65 (41%), Positives = 35/65 (53%) Query: 15 DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D R K VHK+ I+ ++I AVI GA+ W +IE+FG + F K IP HDT Sbjct: 14 DPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPSLEFIPSHDTF 73 Query: 75 ARVVS 79 R S Sbjct: 74 NRFFS 78 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 29/66 (43%), Positives = 38/66 (57%), Gaps = 2/66 (3%) Query: 15 DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-QYGDFENGIPVHDT 73 D R K VHK+ I+ ++I AVI GA+ W +IE+FG + F K + D E IP HDT Sbjct: 14 DPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPDLE-FIPSHDT 72 Query: 74 IARVVS 79 R S Sbjct: 73 FNRFFS 78 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 53.5 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 25/65 (38%), Positives = 39/65 (60%) Query: 15 DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D R + ++ L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + G+P T Sbjct: 25 DPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVNMRCGVPSTLTF 84 Query: 75 ARVVS 79 ARV S Sbjct: 85 ARVFS 89 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 52.8 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 24/70 (34%), Positives = 40/70 (57%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 S+ R+ H DIL++ +CA+ISGA + +IE FG + ++ + + NGIP Sbjct: 15 SLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQTFLALPNGIPS 74 Query: 71 HDTIARVVSQ 80 HDT V+++ Sbjct: 75 HDTFNNVLAK 84 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 52.8 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 23/57 (40%), Positives = 37/57 (64%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQ 80 H+L DI+ + + AV++GA+ W IE +G+ +L+ + NGIP HDT ARV ++ Sbjct: 32 HQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALPNGIPSHDTFARVFAR 88 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 52.4 bits (124), Expect = 4e-06, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 49/82 (59%), Gaps = 3/82 (3%) Query: 1 MELKKLMEHISI---TPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDF 57 M++ KL + + + D+R A + H+LS++L + +CAV+SGA+ +E+I +G + + Sbjct: 1 MDIGKLADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPW 60 Query: 58 LKQYGDFENGIPVHDTIARVVS 79 L+ + + G+ DT RV + Sbjct: 61 LRGFLRLDYGVASPDTFERVFA 82 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 52.0 bits (123), Expect = 5e-06, Method: Composition-based stats. Identities = 26/56 (46%), Positives = 32/56 (57%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVS 79 H+ S I+L+ I AVI GA+ W IEDFG++ F NGIP HDT R S Sbjct: 34 HQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSHDTFNRFFS 89 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 52.0 bits (123), Expect = 7e-06, Method: Composition-based stats. Identities = 25/69 (36%), Positives = 40/69 (57%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 +I PD R +++ ++I+ + + AVI GA+ W +IE FG+TH + K IP Sbjct: 8 AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKARLPGLVSIPS 67 Query: 71 HDTIARVVS 79 HDT++R S Sbjct: 68 HDTLSRFFS 76 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 2/77 (2%) Query: 3 LKKLMEHISITPDYRQAWK--VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S PD+R+A K + HKLSDI++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARV 77 NGIP T+ R+ Sbjct: 95 LDILVNGIPSEATLCRM 111 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 2/77 (2%) Query: 3 LKKLMEHISITPDYRQAWK--VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S PD+R+A K + HKL D+++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARV 77 NGIP T+ R+ Sbjct: 95 LDILANGIPSEATLCRM 111 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 50.1 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 22/56 (39%), Positives = 34/56 (60%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVS 79 H L +IL + + AV+ GA ++E F + LD L+Q+ E G P HDT +RV++ Sbjct: 20 HPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLERGAPSHDTFSRVLA 75 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 20/64 (31%), Positives = 38/64 (59%) Query: 15 DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQA KV H++ ++L++ C+ + E + D+ DF ++ L +L+ + ++G P HD Sbjct: 13 DPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFLPLKHGAPSHDVF 72 Query: 75 ARVV 78 V+ Sbjct: 73 RNVL 76 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 22/75 (29%), Positives = 43/75 (57%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +E ++ D+R + ++L DILL++ AVI + + ++ F + +L+ + DF Sbjct: 5 FLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCDFR 64 Query: 66 NGIPVHDTIARVVSQ 80 +G P HDT +V+S+ Sbjct: 65 HGPPSHDTFGKVLSR 79 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 47.0 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 23/65 (35%), Positives = 35/65 (53%) Query: 15 DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D+R H L+DIL++ CA++ G + +E FG +L+ + NGIP HDT Sbjct: 25 DWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLALPNGIPSHDTF 84 Query: 75 ARVVS 79 +V S Sbjct: 85 RKVFS 89 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 24/73 (32%), Positives = 39/73 (53%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 27 VLKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLP 86 Query: 66 NGIPVHDTIARVV 78 GIP HDT RV+ Sbjct: 87 KGIPSHDTFGRVL 99 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 19/62 (30%), Positives = 33/62 (53%) Query: 15 DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQ KVVH+ I++ + V + + W ++ DF +DF++++ P HDT+ Sbjct: 29 DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFFPDIQKAPSHDTL 88 Query: 75 AR 76 R Sbjct: 89 RR 90 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 45.4 bits (106), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 2/78 (2%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +L ++S PD+R+A ++ L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 2 QLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQL 61 Query: 65 -ENGIPVHDTIARVVSQG 81 P H +I R QG Sbjct: 62 HRKRAPAHTSI-RYALQG 78 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 2/70 (2%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ--YGD 63 L+E S PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDT 73 E G P HDT Sbjct: 63 LEGGTPSHDT 72 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 45.1 bits (105), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 5/82 (6%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M+LK + I PD+R+A ++ L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 1 MQLKAYLPAI---PDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNA 57 Query: 61 YGDFE-NGIPVHDTIARVVSQG 81 PVH +I R QG Sbjct: 58 LCQLHWKRAPVHTSI-RYALQG 78 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust. Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 2/77 (2%) Query: 3 LKKLMEHISITPDYRQAWK--VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K L E + PDYR+ K +KL DILLL I + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARV 77 G +G+P T+ R+ Sbjct: 78 LGILLDGVPSEPTLCRI 94 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust. Identities = 22/64 (34%), Positives = 32/64 (50%) Query: 13 TPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 T D R+ KV + I+L+T+ V + W DI DF DFL+++ P HD Sbjct: 27 TIDPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHD 86 Query: 73 TIAR 76 T+ R Sbjct: 87 TLRR 90 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 25/40 (62%) Query: 38 ISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARV 77 ++ AE WEDIE +G + +L+ + NGIP HDT RV Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRV 43 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 21/64 (32%), Positives = 30/64 (46%) Query: 14 PDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDT 73 PD R H L D+L + + A I GAE D F +++ + G+P HDT Sbjct: 13 PDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLPSHDT 72 Query: 74 IARV 77 +RV Sbjct: 73 FSRV 76 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 41.2 bits (95), Expect = 0.010, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 7/75 (9%) Query: 10 ISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG-- 67 I++ D R ++ + L ILL+++ A ISG + WE IED+ H + L+ +G Sbjct: 9 IAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGKE 68 Query: 68 -----IPVHDTIARV 77 +P HDT V Sbjct: 69 LKVSRMPTHDTFNHV 83 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 40.8 bits (94), Expect = 0.013, Method: Compositional matrix adjust. Identities = 19/56 (33%), Positives = 35/56 (62%) Query: 22 VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARV 77 V + L+++LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKI 57 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 128 4e-29 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 119 2e-26 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 119 3e-26 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 118 5e-26 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 118 6e-26 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 118 7e-26 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 117 1e-25 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 112 3e-24 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 111 5e-24 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 111 7e-24 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 111 9e-24 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 111 1e-23 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 109 4e-23 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 108 7e-23 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 107 8e-23 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 107 1e-22 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 106 2e-22 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 106 2e-22 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 106 2e-22 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 106 3e-22 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 105 5e-22 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 105 5e-22 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 105 5e-22 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 105 6e-22 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 104 1e-21 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 104 1e-21 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 102 3e-21 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 102 3e-21 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 101 5e-21 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 101 6e-21 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 101 9e-21 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 100 2e-20 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 98 6e-20 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 98 1e-19 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 98 1e-19 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 97 1e-19 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 96 2e-19 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 96 3e-19 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 96 4e-19 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 96 4e-19 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 96 5e-19 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 94 1e-18 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 93 2e-18 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 93 3e-18 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 93 4e-18 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 92 5e-18 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 91 1e-17 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 91 1e-17 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 88 9e-17 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 88 1e-16 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 86 2e-16 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 81 1e-14 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 81 1e-14 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 81 1e-14 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 81 1e-14 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 76 4e-13 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 66 4e-10 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 64 9e-10 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 58 7e-08 Sequences not found previously or not previously below threshold: UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 88 7e-17 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 86 4e-16 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 85 8e-16 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 84 1e-15 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 83 2e-15 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 81 9e-15 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 79 4e-14 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 72 7e-12 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 72 7e-12 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 67 2e-10 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 66 3e-10 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 66 5e-10 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 66 5e-10 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 62 5e-09 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 62 7e-09 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 62 8e-09 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 61 1e-08 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 59 5e-08 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 59 5e-08 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 58 1e-07 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 57 2e-07 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 57 2e-07 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 56 5e-07 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 55 6e-07 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 55 6e-07 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 55 6e-07 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 54 1e-06 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 54 2e-06 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 53 3e-06 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 52 6e-06 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 52 7e-06 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 51 1e-05 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 50 2e-05 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 50 2e-05 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 50 2e-05 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 50 3e-05 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 49 3e-05 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 49 4e-05 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 49 5e-05 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 48 7e-05 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 48 1e-04 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 47 2e-04 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 46 3e-04 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 46 4e-04 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 46 5e-04 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 45 8e-04 UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewane... 45 8e-04 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 45 9e-04 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 45 0.001 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 44 0.001 UniRef50_D1RJD3 Putative uncharacterized protein n=1 Tax=Legione... 44 0.001 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 44 0.002 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 43 0.003 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 41 0.009 UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromon... 41 0.011 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 39 0.034 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 39 0.054 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 38 0.070 UniRef50_C7S7P7 Transposase n=4 Tax=root RepID=C7S7P7_METEA 38 0.072 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 38 0.092 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 128 bits (323), Expect = 4e-29, Method: Composition-based stats. Identities = 76/83 (91%), Positives = 76/83 (91%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 MELKKLMEHISI PDYRQ WKV HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 YGDFENGIPVHDTIARVVS Sbjct: 61 YGDFENGIPVHDTIARVVSCISP 83 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 119 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 49/81 (60%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++EH S D R A ++ + L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 6 FASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQWI 65 Query: 63 DFENGIPVHDTIARVVSQGKI 83 NG+P HDT V ++ K Sbjct: 66 ALPNGVPSHDTFEWVFARLKP 86 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 119 bits (298), Expect = 3e-26, Method: Composition-based stats. Identities = 33/76 (43%), Positives = 43/76 (56%) Query: 8 EHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 ++ D R HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSQGKI 83 IP HDT RV S Sbjct: 71 IPSHDTFGRVFSLLNP 86 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 118 bits (297), Expect = 5e-26, Method: Composition-based stats. Identities = 48/83 (57%), Positives = 59/83 (71%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L L +H + D RQA KV +KL D+L L + AVISGAEGWE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 YGDF +GIPVHDTIAR+V + Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDP 83 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 118 bits (296), Expect = 6e-26, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 43/83 (51%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M K L++++ PD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 + GIP HDT R+ + Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPP 83 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 118 bits (296), Expect = 7e-26, Method: Composition-based stats. Identities = 27/80 (33%), Positives = 44/80 (55%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L+EH D R + H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSQGKI 83 +GIP HDT RV + K Sbjct: 67 LRHGIPKHDTFNRVFAALKP 86 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 117 bits (293), Expect = 1e-25, Method: Composition-based stats. Identities = 30/78 (38%), Positives = 48/78 (61%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H D R HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSQGKI 83 NGIP HDT ARV ++ Sbjct: 69 NGIPSHDTFARVFARIDP 86 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 40/83 (48%), Positives = 50/83 (60%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ +SI D RQ KV H L D+L L I AVISG EGWE+I+DFG LD+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 Y F GIP DTI+R+ Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDP 83 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 111 bits (279), Expect = 5e-24, Method: Composition-based stats. Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 1/83 (1%) Query: 2 ELKKLMEHISITPDYRQAW-KVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K E+ D R+ H DIL++ +CA+ISGA + +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 + NGIP HDT V+++ Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSP 87 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 111 bits (278), Expect = 7e-24, Method: Composition-based stats. Identities = 26/78 (33%), Positives = 42/78 (53%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L ++ D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSQGKI 83 NGIP HDT ARV ++ Sbjct: 74 NGIPSHDTFARVFARLDP 91 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 111 bits (277), Expect = 9e-24, Method: Composition-based stats. Identities = 41/77 (53%), Positives = 53/77 (68%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H +T D R H L DI+LL I AV+SG+EGWEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSQGK 82 GIP HDTIARV+ + K Sbjct: 67 AGIPRHDTIARVICRLK 83 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 30/78 (38%), Positives = 47/78 (60%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +KL PD+R++ K ++ L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSQG 81 NGIP HDT RV S Sbjct: 65 LPNGIPSHDTFNRVFSNI 82 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 109 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 47/82 (57%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +L + E PD R + H LS++L + +CAV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSQGKI 83 + G+P HDT RV++ Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDP 86 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 108 bits (270), Expect = 7e-23, Method: Composition-based stats. Identities = 33/84 (39%), Positives = 50/84 (59%), Gaps = 2/84 (2%) Query: 1 MELKKLMEHISITPDYR-QAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK 59 M L L + PD R + +H L+DIL + CAVI+GAEGWEDI ++G + F + Sbjct: 1 MAL-PLTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFR 59 Query: 60 QYGDFENGIPVHDTIARVVSQGKI 83 ++ + +NG+P HDT RV ++ Sbjct: 60 RFLELKNGVPSHDTFYRVFTKLDP 83 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 107 bits (269), Expect = 8e-23, Method: Composition-based stats. Identities = 32/78 (41%), Positives = 44/78 (56%), Gaps = 1/78 (1%) Query: 7 MEHISITPDYRQA-WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 + PD R+A +H LSDIL + +CAV+SG + WE + +FG T +L+Q+ Sbjct: 15 KPFFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLA 74 Query: 66 NGIPVHDTIARVVSQGKI 83 NGIP HDT RV S Sbjct: 75 NGIPSHDTFGRVFSLIDP 92 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 32/80 (40%), Positives = 51/80 (63%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + H D R H L D++ LT+ A++SGAEGW+DI+ FG++ LD+L+++ Sbjct: 1 MSFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRA 60 Query: 64 FENGIPVHDTIARVVSQGKI 83 F+ G+PV DTIAR++S + Sbjct: 61 FKEGVPVDDTIARIISSLEP 80 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 106 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 25/79 (31%), Positives = 37/79 (46%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L E D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 15 NLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLAL 74 Query: 65 ENGIPVHDTIARVVSQGKI 83 NGIP HDT +V S Sbjct: 75 PNGIPSHDTFRKVFSLLDP 93 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 106 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 37/77 (48%), Positives = 49/77 (63%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H D R H L +I+LL I AV+SG+EGWE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSQGK 82 GIP HDTIARV+ + K Sbjct: 67 AGIPRHDTIARVICRLK 83 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 106 bits (265), Expect = 2e-22, Method: Composition-based stats. Identities = 28/83 (33%), Positives = 51/83 (61%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ ++E+ + D R+ H L D+L++ + AVI+GA+G I + E H+++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSQGKIT 84 + +G+P HDTI R+++Q K T Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPT 91 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 106 bits (264), Expect = 3e-22, Method: Composition-based stats. Identities = 29/81 (35%), Positives = 44/81 (54%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L+E + D R K+ H+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSQGKI 83 D GIP HDT RV Sbjct: 62 DLPGGIPSHDTFRRVFMLIDP 82 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 105 bits (262), Expect = 5e-22, Method: Composition-based stats. Identities = 26/79 (32%), Positives = 44/79 (55%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ ++ D R H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSQGKI 83 +GIP DT RV + Sbjct: 90 PHGIPSDDTFRRVFERIDP 108 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 105 bits (262), Expect = 5e-22, Method: Composition-based stats. Identities = 35/78 (44%), Positives = 52/78 (66%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++++E + D R A + H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARVVSQ 80 NGIP HDTI RV + Sbjct: 67 KLRNGIPGHDTIRRVSLR 84 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 105 bits (262), Expect = 5e-22, Method: Composition-based stats. Identities = 40/79 (50%), Positives = 53/79 (67%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E SI D RQ K+ H+L DIL L + AVI GAEGW+DIE+ G L++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSQGKI 83 + GIPV DTIAR++S Sbjct: 66 KKGIPVDDTIARIISSLNP 84 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 105 bits (262), Expect = 6e-22, Method: Composition-based stats. Identities = 31/78 (39%), Positives = 45/78 (57%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +E S PD R H +I+ L + +V++GA+ + +IEDF E H+D+LK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVHDTIARVVSQGKI 83 NGIP HDT +RV S Sbjct: 65 NGIPSHDTFSRVFSAINP 82 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 104 bits (259), Expect = 1e-21, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 49/81 (60%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSQGKI 83 + +NG P HDT+ RV+ Sbjct: 61 ELKNGPPSHDTLRRVMGMVSP 81 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 104 bits (259), Expect = 1e-21, Method: Composition-based stats. Identities = 28/78 (35%), Positives = 39/78 (50%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++ D R K VHK+ I+ ++I AVI GA+ W +IE+FG + F K Sbjct: 5 IIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPDL 64 Query: 66 NGIPVHDTIARVVSQGKI 83 IP HDT R S K Sbjct: 65 EFIPSHDTFNRFFSIIKP 82 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. Identities = 31/77 (40%), Positives = 44/77 (57%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 + + H S D RQ KV + L +ILLLT+CAV+SGA W I +G L FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSQG 81 +G P HD + + + Sbjct: 84 ADGTPSHDQLGNIFAAL 100 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. Identities = 31/83 (37%), Positives = 49/83 (59%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +H S D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 F +GIP DTIAR+VS Sbjct: 61 QKMFIDGIPADDTIARIVSMIDP 83 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 101 bits (253), Expect = 5e-21, Method: Composition-based stats. Identities = 31/80 (38%), Positives = 46/80 (57%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L +H+S+ D R H L D+L L + AV SG +GW +I+ FGE L++L+++ Sbjct: 1 MSLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRP 60 Query: 64 FENGIPVHDTIARVVSQGKI 83 F NGIP TIAR++ Sbjct: 61 FANGIPRRHTIARILKAVGP 80 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 101 bits (253), Expect = 6e-21, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 49/81 (60%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSQGKI 83 + +NG P HDT+ RV+ Sbjct: 61 ELKNGPPSHDTLRRVMGMVSP 81 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 101 bits (251), Expect = 9e-21, Method: Composition-based stats. Identities = 28/78 (35%), Positives = 39/78 (50%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++ D R K VHK+ I+ ++I AVI GA+ W +IE+FG + F K Sbjct: 5 IIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPSL 64 Query: 66 NGIPVHDTIARVVSQGKI 83 IP HDT R S K Sbjct: 65 EFIPSHDTFNRFFSMIKP 82 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 38/83 (45%), Positives = 53/83 (63%), Gaps = 4/83 (4%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S D R A+ H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 Y FE GIPV DTIARV+ + + Sbjct: 57 YRPFECGIPVDDTIARVIKRIEP 79 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 98.4 bits (244), Expect = 6e-20, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 40/78 (51%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 27 VLKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLP 86 Query: 66 NGIPVHDTIARVVSQGKI 83 GIP HDT RV+ + Sbjct: 87 KGIPSHDTFGRVLRILEP 104 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 97.6 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 1/77 (1%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ PD R+ + H+L ++LL IC VISGAE W + + + LD+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSQG 81 +GI HDT RV S Sbjct: 66 AHGIASHDTFGRVFSLL 82 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 97.6 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 31/73 (42%), Positives = 50/73 (68%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+EH+++ + R H L D++ L I A++SGAEGW DIE +G++ +D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVV 78 NGIP T+AR++ Sbjct: 69 NGIPRRHTVARIL 81 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 97.2 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 44/80 (55%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + +E ++ D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSQGKI 83 F +G P HDT +V+S+ Sbjct: 63 FRHGPPSHDTFGKVLSRLDP 82 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 96.4 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 26/73 (35%), Positives = 40/73 (54%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H+ D R H + DI L + AVISGA+ W +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVV 78 NGIP +I R+ Sbjct: 61 NGIPSQQSIGRIF 73 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 96.0 bits (238), Expect = 3e-19, Method: Composition-based stats. Identities = 33/81 (40%), Positives = 47/81 (58%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + ++ + D R+ WK+ H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSQGKI 83 ENGIP HDT+ RV + Sbjct: 66 QLENGIPSHDTLQRVFATLDP 86 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 95.7 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 24/81 (29%), Positives = 45/81 (55%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L ++E D+R A + H+LS++L + +CAV+SGA+ +E+I +G + +L+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFENGIPVHDTIARVVSQGKI 83 + G+ DT RV + Sbjct: 66 RLDYGVASPDTFERVFALLDP 86 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 95.7 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 29/74 (39%), Positives = 50/74 (67%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + +++H S D RQ+W+VV+ L +I LL +CA +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTI 74 + +E G+P HDT+ Sbjct: 77 FLPYERGLPAHDTL 90 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 95.7 bits (237), Expect = 5e-19, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 1/78 (1%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++ PD R H L +IL + + AV+ GA ++E F + LD L+Q+ E Sbjct: 3 FLDVFGEVPDPR-DLTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSQGKI 83 G P HDT +RV++ Sbjct: 62 RGAPSHDTFSRVLAALDP 79 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 94.1 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 26/80 (32%), Positives = 42/80 (52%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L D R + ++ L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSQGKI 83 G+P T ARV S + Sbjct: 74 MRCGVPSTLTFARVFSLIEP 93 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 93.4 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 31/76 (40%), Positives = 46/76 (60%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H+ I D R + H L D++ LT+ A++SGA GW+ IE FG LD+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSQG 81 +GIP IA ++ Sbjct: 63 HGIPRRHCIANIIKSL 78 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 93.0 bits (230), Expect = 3e-18, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 43/81 (53%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L + D RQA KV H++ ++L++ C+ + E + D+ DF ++ L +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSQGKI 83 ++G P HD V+ + Sbjct: 61 PLKHGAPSHDVFRNVLMAIQP 81 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 92.6 bits (229), Expect = 4e-18, Method: Composition-based stats. Identities = 29/72 (40%), Positives = 35/72 (48%) Query: 12 ITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVH 71 I D R H+ S I+L+ I AVI GA+ W IEDFG++ F NGIP H Sbjct: 22 ILIDNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSH 81 Query: 72 DTIARVVSQGKI 83 DT R S Sbjct: 82 DTFNRFFSALDP 93 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 92.2 bits (228), Expect = 5e-18, Method: Composition-based stats. Identities = 38/81 (46%), Positives = 52/81 (64%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S D R A+ H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSQG 81 Y FE GIPV DTIARV+ + Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 91.0 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 3/83 (3%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +I PD R +++ ++I+ + + AVI GA+ W +IE FG+TH + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 IP HDT++R S I Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDI 80 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 91.0 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 2/75 (2%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL--KQYGD 63 L+E S PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVV 78 E G P HDT + Sbjct: 63 LEGGTPSHDTFGDLF 77 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 88.3 bits (218), Expect = 7e-17, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 1/78 (1%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + + + PD R A V H L ++L++ +V+ G+ ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVV 78 + ++ IP HDT + V Sbjct: 69 FLKLKHAIPSHDTFSEVF 86 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 88.0 bits (217), Expect = 9e-17, Method: Composition-based stats. Identities = 28/71 (39%), Positives = 42/71 (59%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ H + D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPVH 71 G G+PV Sbjct: 72 KGILTEGVPVR 82 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 87.6 bits (216), Expect = 1e-16, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 36/80 (45%), Gaps = 1/80 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + E I D R+ KV + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSQGK 82 P HDT+ R K Sbjct: 77 PDLETTPSHDTLRRFFCIIK 96 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 86.4 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 23/67 (34%), Positives = 42/67 (62%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ L+EH S D R +++H L +ILLL +C ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIP 69 + +G+P Sbjct: 72 PYAHGVP 78 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 86.0 bits (212), Expect = 4e-16, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 35/81 (43%), Gaps = 1/81 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + E PD R A +H L++IL + + A + GA D+ F + Sbjct: 5 MDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDVL 63 Query: 63 DFENGIPVHDTIARVVSQGKI 83 +NG+P HDT +RV Sbjct: 64 VLKNGLPSHDTFSRVFRMLDP 84 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 84.9 bits (209), Expect = 8e-16, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 30/74 (40%) Query: 10 ISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIP 69 PD R H L D+L + + A I GAE D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSQGKI 83 HDT +RV Sbjct: 69 SHDTFSRVFRLLDP 82 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 84.1 bits (207), Expect = 1e-15, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 37/83 (44%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + ++ + + D R H+ DI+++ +C V+ G +G I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 + + NG+P D I + + Sbjct: 66 FLELPNGLPSRDCIRNWLMALQP 88 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 83.3 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 1/79 (1%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 ++ PD R H L ++L++ +V+ GA ++ FG + + Sbjct: 36 PILSAFEDVPDPRAE-NTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKL 94 Query: 65 ENGIPVHDTIARVVSQGKI 83 ++ +P HDT + V Sbjct: 95 KHAVPSHDTFSAVFRMIDP 113 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 81.4 bits (200), Expect = 9e-15, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 1/80 (1%) Query: 5 KLMEHISITPDYR-QAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 LM D R ++ +H ++L++ I AV+S + EDI +G D+L+Q+ Sbjct: 8 SLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLV 67 Query: 64 FENGIPVHDTIARVVSQGKI 83 NG+ +T R+ Sbjct: 68 LLNGVASEETFLRIFRALDP 87 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 81.0 bits (199), Expect = 1e-14, Method: Composition-based stats. Identities = 26/65 (40%), Positives = 37/65 (56%) Query: 15 DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQ+WK+ + LS IL L ++G E +++EDF E + Y D G P HDT+ Sbjct: 19 DSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPSHDTL 78 Query: 75 ARVVS 79 RV+S Sbjct: 79 ERVIS 83 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 80.6 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 37/77 (48%), Gaps = 1/77 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ + + D RQ KVVH+ I++ + V + + W ++ DF +DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVS 79 P HDT+ R Sbjct: 77 PDIQKAPSHDTLRRFFC 93 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 80.6 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 2/77 (2%) Query: 3 LKKLMEHISITPDYRQAWK--VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S PD+R+A K + HKL D+++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARV 77 NGIP T+ R+ Sbjct: 95 LDILANGIPSEATLCRM 111 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 80.6 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 2/77 (2%) Query: 3 LKKLMEHISITPDYRQAWK--VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S PD+R+A K + HKLSDI++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARV 77 NGIP T+ R+ Sbjct: 95 LDILVNGIPSEATLCRM 111 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 79.1 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 1/78 (1%) Query: 7 MEHISITPDYRQAWK-VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 M + D R+ +H +IL++ I AV+S + EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSQGKI 83 NGIP +T R++ Sbjct: 61 NGIPSEETFLRILRALDP 78 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 75.6 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 2/78 (2%) Query: 3 LKKLMEHISITPDYRQ--AWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K L E + PDYR+ +KL DILLL I + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVV 78 G +G+P T+ R+ Sbjct: 78 LGILLDGVPSEPTLCRIF 95 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 71.8 bits (175), Expect = 7e-12, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 26/63 (41%), Gaps = 1/63 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + PD R H L+ IL + I A++ GAE D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFE 65 Sbjct: 60 PLP 62 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 71.8 bits (175), Expect = 7e-12, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 25/53 (47%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGK 82 +L+T+ V + W DI DF DFL+++ P HDT+ R K Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIK 53 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 67.2 bits (163), Expect = 2e-10, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 33/83 (39%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ L+E + PD R+ V L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSQGKIT 84 P T RV+ T Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPT 113 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 66.0 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 18/42 (42%), Positives = 25/42 (59%) Query: 37 VISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVV 78 ++ AE WEDIE +G + +L+ + NGIP HDT RV Sbjct: 3 RVACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVF 44 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 65.6 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 1/73 (1%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +L ++S PD+R+A ++ L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 1 MQLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQ 60 Query: 64 F-ENGIPVHDTIA 75 P H +I Sbjct: 61 LHRKRAPAHTSIR 73 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 65.6 bits (159), Expect = 5e-10, Method: Composition-based stats. Identities = 19/62 (30%), Positives = 35/62 (56%) Query: 22 VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQG 81 V + L+++LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 KI 83 Sbjct: 62 DP 63 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 65.6 bits (159), Expect = 5e-10, Method: Composition-based stats. Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 7/81 (8%) Query: 10 ISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG-- 67 I++ D R ++ + L ILL+++ A ISG + WE IED+ H + L+ +G Sbjct: 9 IAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGKE 68 Query: 68 -----IPVHDTIARVVSQGKI 83 +P HDT V Sbjct: 69 LKVSRMPTHDTFNHVFQVIDP 89 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 64.5 bits (156), Expect = 9e-10, Method: Composition-based stats. Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 1/73 (1%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +L ++ PD+R+A ++ L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 1 MQLKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQ 60 Query: 64 FE-NGIPVHDTIA 75 PVH +I Sbjct: 61 LHWKRAPVHTSIR 73 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 62.2 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L D R+ H+L IL++ +CAVI+ AE +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFEN 66 Sbjct: 61 PLPC 64 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 61.8 bits (149), Expect = 7e-09, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 1/80 (1%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ PD R + L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSQGKI 83 NG +P +TIA ++ + Sbjct: 64 RNGNMPCPNTIAGLLRRLDP 83 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 61.8 bits (149), Expect = 8e-09, Method: Composition-based stats. Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 7/87 (8%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDF-GETHLDFLKQ 60 E+ L+E ++ PD R V H L+ +L LT CAV++GA + ++ E + L++ Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 61 YGD-----FENGI-PVHDTIARVVSQG 81 G F P TI RV+++ Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARI 124 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 61.0 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 1/82 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L+ + D R H L ++L L + A + GA+ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPR-DVNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSQGKIT 84 +G P HDT +RV T Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPT 81 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 58.7 bits (141), Expect = 5e-08, Method: Composition-based stats. Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 8/64 (12%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 EL++L ++ D R HKL +++L+ +CAVI+GA+G IE +L Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE--------WLAGR 70 Query: 62 GDFE 65 Sbjct: 71 LQLP 74 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 58.7 bits (141), Expect = 5e-08, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 2/77 (2%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+ S D R+ ++ L+ +LL T+ A+++GA + ++ F THLD L D Sbjct: 5 LLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFDLS 64 Query: 66 -NGIPVHDTIARVVSQG 81 P + T+ R + +G Sbjct: 65 LRRAPAYSTV-RFILRG 80 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 58.3 bits (140), Expect = 7e-08, Method: Composition-based stats. Identities = 39/65 (60%), Positives = 42/65 (64%), Gaps = 12/65 (18%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLMEHISI PDYRQAWKV HKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 LDFLK 59 LDFLK Sbjct: 55 LDFLK 59 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 57.5 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 38/77 (49%), Gaps = 1/77 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-KQY 61 + L+E + D+R+ H L +L++ I + G G+ ++ +F + + L +++ Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVV 78 +P + TI RV+ Sbjct: 61 NIIPERVPSYSTIRRVM 77 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 57.1 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 35/78 (44%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L ++ PD R + L IL + +CAV++GA + I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSQG 81 F + +P T+ R++ + Sbjct: 89 FTDRVPAATTVWRLLIRI 106 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 56.8 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 36/80 (45%), Gaps = 1/80 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-KQY 61 L+ + PD R+A + L +L+ T+ A++SGA + I F E + L + Sbjct: 13 FSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTHHF 72 Query: 62 GDFENGIPVHDTIARVVSQG 81 G PV +T+ V+ Sbjct: 73 GVDLKRAPVVNTLRTVLQSL 92 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 55.6 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 33/75 (44%), Gaps = 1/75 (1%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ PD+R A + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 -ENGIPVHDTIARVV 78 P T RV+ Sbjct: 64 PPTRFPSDSTFRRVM 78 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 55.2 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 35/87 (40%), Gaps = 5/87 (5%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGE--THLDFLK 59 +++ L ++ D R+ H++S +L + A + G +G++ I + + Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 60 QYGDFENG---IPVHDTIARVVSQGKI 83 ENG IP I V+ + Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADP 300 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 55.2 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 1/77 (1%) Query: 7 MEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFEN 66 + + D R H L+++L L + A + GA+ +I +F E LK+ + Sbjct: 5 LSILREIHDPR-DINARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVHDTIARVVSQGKI 83 G P HDT +R+ Sbjct: 64 GCPSHDTFSRIFRLIDP 80 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 55.2 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 16/66 (24%), Positives = 37/66 (56%), Gaps = 4/66 (6%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDFLKQY 61 L++ SI PD R + L +++++T+ AV+ GA+ W D+ + +G++ + +++ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDVPVGSKKYGDSCMQVVREK 61 Query: 62 GDFENG 67 +G Sbjct: 62 CCLTSG 67 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 54.4 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 38/90 (42%), Gaps = 7/90 (7%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFG----ETHLDF 57 +++ L + PD R H L IL + + AV++ A+ + + ++ + L Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 58 LK-QYGDFENG--IPVHDTIARVVSQGKIT 84 ++ ++ P T+ RV+ +T Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVT 308 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 54.1 bits (129), Expect = 2e-06, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 35/84 (41%), Gaps = 1/84 (1%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + L E +S PD R ++H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSQGKI 83 F G P T++R + + Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDP 84 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 52.9 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 36/87 (41%), Gaps = 5/87 (5%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHLD 56 +++ L ++ + PD R+A H+L + LT A + G +G++ + ++ Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 57 FLKQYGDFENGIPVHDTIARVVSQGKI 83 F + + +P I + + Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGP 145 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 52.1 bits (124), Expect = 6e-06, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 35/87 (40%), Gaps = 5/87 (5%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHLD 56 +++ L ++ + PD +A H+L +L L A + G +G++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 57 FLKQYGDFENGIPVHDTIARVVSQGKI 83 F + + +P I + + Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGP 93 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 51.8 bits (123), Expect = 7e-06, Method: Composition-based stats. Identities = 17/85 (20%), Positives = 35/85 (41%), Gaps = 17/85 (20%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFG----ETHLDFL-------- 58 + D R+A + H +LL+ + V++G +E I + ++ L L Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 59 KQYGDFENGIPVHDTIARVVSQGKI 83 +++ P TI R++S+ Sbjct: 289 ERFLP-----PSEPTIRRILSKADP 308 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 50.6 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 34/83 (40%), Gaps = 7/83 (8%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFG----ETHLDFL 58 + L E ++ D R+ H +LL+ AV++GA + I ++ + L L Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 59 --KQYGDFENGI-PVHDTIARVV 78 + I P TI RV+ Sbjct: 61 GARTATALAVRIPPSGVTIRRVI 83 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 50.2 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 42/86 (48%), Gaps = 6/86 (6%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICA----VISGAEGWEDIEDFGETHLDF 57 +LKKL+E S PD R+A V H+L+ +LL + + + S E D+ L Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSR--PAFLQA 136 Query: 58 LKQYGDFENGIPVHDTIARVVSQGKI 83 L+ +P DT+ARV+ + + Sbjct: 137 LQGLFPELETLPHGDTLARVLERIEP 162 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 50.2 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 7/80 (8%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L EH++ PD R + H L IL + + A+ SGAE + + ++ T L Q + Sbjct: 16 LWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGCQ 75 Query: 66 NG-------IPVHDTIARVV 78 P T+ RV+ Sbjct: 76 ESPSRQCFVPPSWTTLHRVI 95 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 49.8 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 41/85 (48%), Gaps = 6/85 (7%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L E + + P R K ++ L +LL+ + +SG W +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSQGKI 83 G +P HDT+ R +S + Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDV 88 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 49.8 bits (118), Expect = 3e-05, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 8/84 (9%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVIS-GAEGWEDIEDFGE-------THLDF 57 L++ ++ D R H L+ IL + CA ++ G + IE + + L Sbjct: 30 LIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLHI 89 Query: 58 LKQYGDFENGIPVHDTIARVVSQG 81 + + P TI RV++ Sbjct: 90 WRDPFTGLHRPPSERTIRRVLAAL 113 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 49.4 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 34/79 (43%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + PD R ++L ++ + +CAV +GA + I D+ + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSQ 80 +P TI +V + Sbjct: 102 CGIRFRVPSEATIRQVFGR 120 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 49.4 bits (117), Expect = 4e-05, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 38/85 (44%), Gaps = 7/85 (8%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETH----LDF 57 E++ L + ++ PD R + H+L IL L+ AV +G + E+I + L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 58 LKQYGDFENG---IPVHDTIARVVS 79 L G P DT+ RV+S Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLS 122 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 48.7 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 27/76 (35%), Gaps = 1/76 (1%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-KQYGDF 64 L+ H+ PD R V +LL+ + ++S E D+E F H L + G Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 65 ENGIPVHDTIARVVSQ 80 P Q Sbjct: 73 LKRPPSDSAFRYFFLQ 88 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 48.3 bits (114), Expect = 7e-05, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 32/55 (58%), Gaps = 1/55 (1%) Query: 8 EHISITPDYRQ-AWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 EH PD R+ + HK DIL++ ICA+I GA+ W + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 47.5 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 38/79 (48%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 + L+E ++ PD R+ V ++ + +L + +CA++SGA + I ++ + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSQ 80 +P TI RV+ + Sbjct: 107 LGLTGRVPGPVTIWRVLVR 125 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 47.1 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 29/58 (50%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + L + + PD+R+A + L +LL +I A++SGA + I F TH L Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNA 58 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 46.4 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 7/84 (8%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ LM+ +S T D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENG-------IPVHDTIARVV 78 F P T+ R + Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTL 304 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 46.0 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 11/60 (18%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +L + D RQ K H L +L++TI +I + LD+L+QY Sbjct: 34 RLADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 45.6 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 36/82 (43%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + PD+R V ++L+ +L L + I+G + + ++ + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSQGKI 83 F +P TI R+V +G Sbjct: 84 LGFPRRVPSERTIRRIVEEGPP 105 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 44.8 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 25/74 (33%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 PD R V H+ S IL + A +GA + I ++ +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSQGKIT 84 T R ++ T Sbjct: 109 ESTSRRFLAGVDAT 122 >UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TLQ7_SHEHH Length = 74 Score = 44.8 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 17/44 (38%), Positives = 23/44 (52%) Query: 7 MEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDF 50 EH+SI R + H DI+ L A+ S EGW DI++F Sbjct: 4 FEHLSIIKAPRSSINHEHDPVDIMFLVNSAIASDCEGWLDIDEF 47 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 44.8 bits (105), Expect = 9e-04, Method: Composition-based stats. Identities = 13/46 (28%), Positives = 19/46 (41%), Gaps = 1/46 (2%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDF 50 + E PD R V H+L +L L AV+ G G + + Sbjct: 67 PVAECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAW 111 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 44.8 bits (105), Expect = 0.001, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 5/77 (6%) Query: 7 MEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-----QY 61 + +++ PD R+ K H+ D+LL+ + AV SG + + + FL + Sbjct: 10 LPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDEVHIRT 69 Query: 62 GDFENGIPVHDTIARVV 78 E +P T+ R+ Sbjct: 70 RRGERKLPGQATLYRLF 86 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 44.4 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 35/90 (38%), Gaps = 8/90 (8%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVI-SGAEGWEDIEDFG-ETHLDFLK 59 +++ L+ D R A V +++S +L L +CA+ +G + ++ + L Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 60 QY------GDFENGIPVHDTIARVVSQGKI 83 + IP T+ V+ + Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDP 119 >UniRef50_D1RJD3 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RJD3_LEGLO Length = 61 Score = 44.4 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 1/55 (1%) Query: 22 VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIAR 76 + L I+ L + + I DFG +++LKQ+ ++NG+PV DT+ R Sbjct: 2 KRYLLIKIMFLLLVLQFMDVKAGT-IRDFGLLKIEWLKQFLTYKNGMPVDDTMTR 55 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 44.0 bits (103), Expect = 0.002, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 37/89 (41%), Gaps = 7/89 (7%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDF 57 ++ L+ + D R+A ++ LS +L + A ++GA G +I DFG+ L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQYGDFENG---IPVHDTIARVVSQGKI 83 L D G P I + + + Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDV 109 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 42.9 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 30/78 (38%), Gaps = 3/78 (3%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L E +S PD R A + L +L L + A +S + +E F + L + Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARAN-PHLLPHLGLR 60 Query: 66 NGIPVHDTIARVVSQGKI 83 P H + ++ + Sbjct: 61 K-PPGHTILTLLLHRLDP 77 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 41.4 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 30/78 (38%), Gaps = 3/78 (3%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L + +S PD R A + L +L L + A +S + +E F + L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSQGKI 83 P H I ++ + Sbjct: 60 RKAPGHTAITLLLHRLDP 77 >UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromonas gingivalis ATCC 33277 RepID=B2RI66_PORG3 Length = 87 Score = 41.0 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 27/61 (44%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + + R K V+ L + L+ + +SG W +IED+ E + + LK Sbjct: 7 MYSVTIGKSLYGLGPTRIESKEVYPLDFLFLIVFLSTLSGDTSWYEIEDYAEEYEEVLKS 66 Query: 61 Y 61 Sbjct: 67 R 67 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 39.4 bits (91), Expect = 0.034, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 29/73 (39%) Query: 10 ISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIP 69 +S PD R+ + L +L L + AV+ GA I F L++ + P Sbjct: 50 LSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGLASSTP 109 Query: 70 VHDTIARVVSQGK 82 T+ + + K Sbjct: 110 NASTLGGLRANLK 122 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 38.7 bits (89), Expect = 0.054, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 18/37 (48%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDI 47 + D R +V H L+DIL I A+ G E D+ Sbjct: 52 AAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDL 88 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 38.3 bits (88), Expect = 0.070, Method: Composition-based stats. Identities = 10/37 (27%), Positives = 21/37 (56%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVI 38 + L+E ++ PD R+ V H + +L + +CA++ Sbjct: 57 DQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAML 93 >UniRef50_C7S7P7 Transposase n=4 Tax=root RepID=C7S7P7_METEA Length = 404 Score = 38.3 bits (88), Expect = 0.072, Method: Composition-based stats. Identities = 14/37 (37%), Positives = 22/37 (59%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDI 47 ++ PD R ++VH L +ILL I A+ G E +D+ Sbjct: 9 AVIPDRRDPSRIVHPLPEILLARILAIACGYEDADDL 45 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 37.9 bits (87), Expect = 0.092, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 17/37 (45%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGKI 83 + F + + ++ D + G P DT+ RV + + Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEP 37 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 124 6e-28 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 123 2e-27 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 121 1e-26 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 119 4e-26 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 118 6e-26 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 118 7e-26 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 117 9e-26 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 117 1e-25 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 115 4e-25 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 114 9e-25 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 113 1e-24 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 113 1e-24 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 112 2e-24 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 112 3e-24 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 111 1e-23 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 109 2e-23 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 109 2e-23 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 109 3e-23 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 109 3e-23 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 108 4e-23 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 108 7e-23 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 104 7e-22 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 104 7e-22 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 104 1e-21 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 103 1e-21 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 103 1e-21 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 102 3e-21 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 102 3e-21 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 102 3e-21 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 102 4e-21 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 101 6e-21 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 101 7e-21 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 101 7e-21 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 101 7e-21 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 100 1e-20 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 100 1e-20 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 100 2e-20 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 100 2e-20 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 100 2e-20 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 100 2e-20 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 99 3e-20 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 99 4e-20 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 98 1e-19 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 98 1e-19 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 97 2e-19 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 96 2e-19 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 96 3e-19 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 96 4e-19 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 94 9e-19 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 94 1e-18 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 94 2e-18 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 93 2e-18 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 93 3e-18 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 93 3e-18 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 92 5e-18 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 92 5e-18 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 90 2e-17 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 90 2e-17 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 86 3e-16 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 86 4e-16 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 86 4e-16 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 85 6e-16 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 84 1e-15 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 83 3e-15 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 81 1e-14 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 81 1e-14 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 81 1e-14 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 81 1e-14 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 80 2e-14 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 80 2e-14 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 79 3e-14 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 79 4e-14 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 79 4e-14 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 79 5e-14 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 79 6e-14 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 78 7e-14 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 78 8e-14 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 77 2e-13 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 76 3e-13 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 76 4e-13 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 76 4e-13 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 76 5e-13 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 75 8e-13 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 75 9e-13 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 74 2e-12 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 73 2e-12 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 73 2e-12 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 73 2e-12 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 73 3e-12 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 73 4e-12 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 72 6e-12 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 71 8e-12 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 71 8e-12 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 71 1e-11 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 70 2e-11 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 69 4e-11 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 69 6e-11 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 68 7e-11 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 68 1e-10 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 67 2e-10 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 66 5e-10 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 63 3e-09 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 62 5e-09 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 61 1e-08 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 60 2e-08 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 58 8e-08 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 57 2e-07 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 56 5e-07 UniRef50_D1RJD3 Putative uncharacterized protein n=1 Tax=Legione... 51 8e-06 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 51 1e-05 UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewane... 48 8e-05 Sequences not found previously or not previously below threshold: UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 62 5e-09 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 61 1e-08 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 56 5e-07 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 56 5e-07 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 52 5e-06 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 51 9e-06 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 49 4e-05 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 49 4e-05 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 49 6e-05 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 48 1e-04 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 47 1e-04 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 46 3e-04 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 46 3e-04 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 44 0.001 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 43 0.003 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 43 0.003 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 42 0.004 UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromon... 42 0.007 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 41 0.010 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 41 0.011 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 41 0.016 UniRef50_C6VL62 Transposase n=25 Tax=Bacilli RepID=C6VL62_LACPJ 40 0.020 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 38 0.087 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 124 bits (313), Expect = 6e-28, Method: Composition-based stats. Identities = 76/83 (91%), Positives = 76/83 (91%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 MELKKLMEHISI PDYRQ WKV HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 YGDFENGIPVHDTIARVVS Sbjct: 61 YGDFENGIPVHDTIARVVSCISP 83 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 30/78 (38%), Positives = 48/78 (61%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H D R HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSQGKI 83 NGIP HDT ARV ++ Sbjct: 69 NGIPSHDTFARVFARIDP 86 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 121 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 33/76 (43%), Positives = 43/76 (56%) Query: 8 EHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 ++ D R HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSQGKI 83 IP HDT RV S Sbjct: 71 IPSHDTFGRVFSLLNP 86 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 47/82 (57%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +L + E PD R + H LS++L + +CAV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSQGKI 83 + G+P HDT RV++ Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDP 86 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 118 bits (296), Expect = 6e-26, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 43/83 (51%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M K L++++ PD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 + GIP HDT R+ + Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPP 83 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 118 bits (296), Expect = 7e-26, Method: Composition-based stats. Identities = 48/83 (57%), Positives = 59/83 (71%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L L +H + D RQA KV +KL D+L L + AVISGAEGWE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 YGDF +GIPVHDTIAR+V + Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDP 83 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 117 bits (295), Expect = 9e-26, Method: Composition-based stats. Identities = 40/83 (48%), Positives = 50/83 (60%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ +SI D RQ KV H L D+L L I AVISG EGWE+I+DFG LD+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 Y F GIP DTI+R+ Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDP 83 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 117 bits (293), Expect = 1e-25, Method: Composition-based stats. Identities = 26/78 (33%), Positives = 42/78 (53%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L ++ D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSQGKI 83 NGIP HDT ARV ++ Sbjct: 74 NGIPSHDTFARVFARLDP 91 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. Identities = 25/80 (31%), Positives = 37/80 (46%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L E D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSQGKI 83 NGIP HDT +V S Sbjct: 74 LPNGIPSHDTFRKVFSLLDP 93 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 114 bits (286), Expect = 9e-25, Method: Composition-based stats. Identities = 26/79 (32%), Positives = 44/79 (55%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ ++ D R H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSQGKI 83 +GIP DT RV + Sbjct: 90 PHGIPSDDTFRRVFERIDP 108 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 113 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 1/83 (1%) Query: 2 ELKKLMEHISITPDYRQA-WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K E+ D R+ H DIL++ +CA+ISGA + +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 + NGIP HDT V+++ Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSP 87 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 113 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 28/82 (34%), Positives = 49/82 (59%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++EH S D R A ++ + L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSQGKI 83 NG+P HDT V ++ K Sbjct: 65 IALPNGVPSHDTFEWVFARLKP 86 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 112 bits (282), Expect = 2e-24, Method: Composition-based stats. Identities = 27/80 (33%), Positives = 44/80 (55%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L+EH D R + H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSQGKI 83 +GIP HDT RV + K Sbjct: 67 LRHGIPKHDTFNRVFAALKP 86 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 112 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 30/79 (37%), Positives = 47/79 (59%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +KL PD+R++ K ++ L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSQGK 82 NGIP HDT RV S Sbjct: 65 LPNGIPSHDTFNRVFSNID 83 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 32/76 (42%), Positives = 44/76 (57%), Gaps = 1/76 (1%) Query: 9 HISITPDYRQA-WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 + PD R+A +H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVHDTIARVVSQGKI 83 IP HDT RV S Sbjct: 77 IPSHDTFGRVFSLIDP 92 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 109 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 29/81 (35%), Positives = 44/81 (54%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L+E + D R K+ H+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSQGKI 83 D GIP HDT RV Sbjct: 62 DLPGGIPSHDTFRRVFMLIDP 82 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 109 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 33/84 (39%), Positives = 49/84 (58%), Gaps = 2/84 (2%) Query: 1 MELKKLMEHISITPDYRQA-WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK 59 M L L + PD R +H L+DIL + CAVI+GAEGWEDI ++G + F + Sbjct: 1 MAL-PLTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFR 59 Query: 60 QYGDFENGIPVHDTIARVVSQGKI 83 ++ + +NG+P HDT RV ++ Sbjct: 60 RFLELKNGVPSHDTFYRVFTKLDP 83 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 28/83 (33%), Positives = 51/83 (61%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ ++E+ + D R+ H L D+L++ + AVI+GA+G I + E H+++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSQGKIT 84 + +G+P HDTI R+++Q K T Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPT 91 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 32/82 (39%), Positives = 47/82 (57%), Gaps = 1/82 (1%) Query: 3 LKKLM-EHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L E S PD R H +I+ L + +V++GA+ + +IEDF E H+D+LK Y Sbjct: 1 MEGLFVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTY 60 Query: 62 GDFENGIPVHDTIARVVSQGKI 83 + NGIP HDT +RV S Sbjct: 61 FNLPNGIPSHDTFSRVFSAINP 82 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 108 bits (271), Expect = 4e-23, Method: Composition-based stats. Identities = 32/80 (40%), Positives = 51/80 (63%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + H D R H L D++ LT+ A++SGAEGW+DI+ FG++ LD+L+++ Sbjct: 1 MSFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRA 60 Query: 64 FENGIPVHDTIARVVSQGKI 83 F+ G+PV DTIAR++S + Sbjct: 61 FKEGVPVDDTIARIISSLEP 80 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 108 bits (270), Expect = 7e-23, Method: Composition-based stats. Identities = 40/79 (50%), Positives = 53/79 (67%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E SI D RQ K+ H+L DIL L + AVI GAEGW+DIE+ G L++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSQGKI 83 + GIPV DTIAR++S Sbjct: 66 KKGIPVDDTIARIISSLNP 84 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 104 bits (261), Expect = 7e-22, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 49/81 (60%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSQGKI 83 + +NG P HDT+ RV+ Sbjct: 61 ELKNGPPSHDTLRRVMGMVSP 81 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 104 bits (261), Expect = 7e-22, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 49/81 (60%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSQGKI 83 + +NG P HDT+ RV+ Sbjct: 61 ELKNGPPSHDTLRRVMGMVSP 81 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 104 bits (260), Expect = 1e-21, Method: Composition-based stats. Identities = 41/77 (53%), Positives = 53/77 (68%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H +T D R H L DI+LL I AV+SG+EGWEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSQGK 82 GIP HDTIARV+ + K Sbjct: 67 AGIPRHDTIARVICRLK 83 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 103 bits (258), Expect = 1e-21, Method: Composition-based stats. Identities = 37/77 (48%), Positives = 49/77 (63%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H D R H L +I+LL I AV+SG+EGWE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSQGK 82 GIP HDTIARV+ + K Sbjct: 67 AGIPRHDTIARVICRLK 83 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 103 bits (258), Expect = 1e-21, Method: Composition-based stats. Identities = 31/80 (38%), Positives = 46/80 (57%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L +H+S+ D R H L D+L L + AV SG +GW +I+ FGE L++L+++ Sbjct: 1 MSLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRP 60 Query: 64 FENGIPVHDTIARVVSQGKI 83 F NGIP TIAR++ Sbjct: 61 FANGIPRRHTIARILKAVGP 80 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 102 bits (256), Expect = 3e-21, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 1/78 (1%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++ PD R H L +IL + + AV+ GA ++E F + LD L+Q+ E Sbjct: 3 FLDVFGEVPDPR-DLTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSQGKI 83 G P HDT +RV++ Sbjct: 62 RGAPSHDTFSRVLAALDP 79 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 102 bits (256), Expect = 3e-21, Method: Composition-based stats. Identities = 24/81 (29%), Positives = 45/81 (55%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L ++E D+R A + H+LS++L + +CAV+SGA+ +E+I +G + +L+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFENGIPVHDTIARVVSQGKI 83 + G+ DT RV + Sbjct: 66 RLDYGVASPDTFERVFALLDP 86 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 43/81 (53%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L + D RQA KV H++ ++L++ C+ + E + D+ DF ++ L +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSQGKI 83 ++G P HD V+ + Sbjct: 61 PLKHGAPSHDVFRNVLMAIQP 81 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 102 bits (255), Expect = 4e-21, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 35/82 (42%), Gaps = 1/82 (1%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 + + E PD R A +H L++IL + + A + GA D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSQGKI 83 +NG+P HDT +RV Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDP 84 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 101 bits (253), Expect = 6e-21, Method: Composition-based stats. Identities = 26/80 (32%), Positives = 42/80 (52%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L D R + ++ L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSQGKI 83 G+P T ARV S + Sbjct: 74 MRCGVPSTLTFARVFSLIEP 93 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 101 bits (253), Expect = 7e-21, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 36/83 (43%), Gaps = 1/83 (1%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + + + PD R A V H L ++L++ +V+ G+ ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 + ++ IP HDT + V Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDP 91 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 101 bits (252), Expect = 7e-21, Method: Composition-based stats. Identities = 31/83 (37%), Positives = 49/83 (59%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +H S D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 F +GIP DTIAR+VS Sbjct: 61 QKMFIDGIPADDTIARIVSMIDP 83 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 101 bits (252), Expect = 7e-21, Method: Composition-based stats. Identities = 35/75 (46%), Positives = 51/75 (68%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++++E + D R A + H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 33/81 (40%), Positives = 47/81 (58%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + ++ + D R+ WK+ H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSQGKI 83 ENGIP HDT+ RV + Sbjct: 66 QLENGIPSHDTLQRVFATLDP 86 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 28/79 (35%), Positives = 39/79 (49%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +++ D R K VHK+ I+ ++I AVI GA+ W +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSQGKI 83 IP HDT R S K Sbjct: 64 LEFIPSHDTFNRFFSIIKP 82 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 1/80 (1%) Query: 5 KLMEHISITPDYR-QAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 LM D R ++ +H ++L++ I AV+S + EDI +G D+L+Q+ Sbjct: 8 SLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLV 67 Query: 64 FENGIPVHDTIARVVSQGKI 83 NG+ +T R+ Sbjct: 68 LLNGVASEETFLRIFRALDP 87 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 28/79 (35%), Positives = 39/79 (49%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +++ D R K VHK+ I+ ++I AVI GA+ W +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSQGKI 83 IP HDT R S K Sbjct: 64 LEFIPSHDTFNRFFSMIKP 82 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 35/80 (43%), Positives = 47/80 (58%), Gaps = 1/80 (1%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + D R A+ H DI+ L + AVISGA W +I+ FGE HLD+L++Y Sbjct: 1 MSVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRP 59 Query: 64 FENGIPVHDTIARVVSQGKI 83 FE GIPV DTIARV+ + + Sbjct: 60 FECGIPVDDTIARVIKRIEP 79 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 39/77 (50%) Query: 7 MEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFEN 66 ++H D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSQGKI 83 GIP HDT RV+ + Sbjct: 88 GIPSHDTFGRVLRILEP 104 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 99.5 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 31/78 (39%), Positives = 44/78 (56%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 + + H S D RQ KV + L +ILLLT+CAV+SGA W I +G L FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSQGK 82 +G P HD + + + Sbjct: 84 ADGTPSHDQLGNIFAALD 101 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 99.1 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 44/80 (55%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + +E ++ D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSQGKI 83 F +G P HDT +V+S+ Sbjct: 63 FRHGPPSHDTFGKVLSRLDP 82 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 97.5 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 37/83 (44%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + ++ + + D R H+ DI+++ +C V+ G +G I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSQGKI 83 + + NG+P D I + + Sbjct: 66 FLELPNGLPSRDCIRNWLMALQP 88 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 97.5 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 1/78 (1%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ PD R+ + H+L ++LL IC VISGAE W + + + LD+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSQGK 82 +GI HDT RV S Sbjct: 66 AHGIASHDTFGRVFSLLD 83 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 97.2 bits (241), Expect = 2e-19, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 33/79 (41%), Gaps = 1/79 (1%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 ++ PD R A H L ++L++ +V+ GA ++ FG + + Sbjct: 36 PILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKL 94 Query: 65 ENGIPVHDTIARVVSQGKI 83 ++ +P HDT + V Sbjct: 95 KHAVPSHDTFSAVFRMIDP 113 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 96.4 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 1/78 (1%) Query: 7 MEHISITPDYRQA-WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 M + D R+ +H +IL++ I AV+S + EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSQGKI 83 NGIP +T R++ Sbjct: 61 NGIPSEETFLRILRALDP 78 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 96.4 bits (239), Expect = 3e-19, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 40/81 (49%), Gaps = 2/81 (2%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL--KQY 61 L+E S PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 1 MTLLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFR 60 Query: 62 GDFENGIPVHDTIARVVSQGK 82 E G P HDT + Sbjct: 61 WPLEGGTPSHDTFGDLFRVLD 81 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 95.6 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 31/78 (39%), Positives = 50/78 (64%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L+EH+++ + R H L D++ L I A++SGAEGW DIE +G++ +D+L+Q+ Sbjct: 7 MTLIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRP 66 Query: 64 FENGIPVHDTIARVVSQG 81 F NGIP T+AR++ Sbjct: 67 FANGIPRRHTVARILRCI 84 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 94.5 bits (234), Expect = 9e-19, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 33/83 (39%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ L+E + PD R+ V L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSQGKIT 84 P T RV+ T Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPT 113 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 94.5 bits (234), Expect = 1e-18, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 30/74 (40%) Query: 10 ISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIP 69 PD R H L D+L + + A I GAE D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSQGKI 83 HDT +RV Sbjct: 69 SHDTFSRVFRLLDP 82 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 93.7 bits (232), Expect = 2e-18, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 40/74 (54%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H+ D R H + DI L + AVISGA+ W +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVS 79 NGIP +I R+ Sbjct: 61 NGIPSQQSIGRIFR 74 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 93.3 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 31/79 (39%), Positives = 46/79 (58%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L++H+ I D R + H L D++ LT+ A++SGA GW+ IE FG LD+L+ Y Sbjct: 1 MTLLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRP 60 Query: 64 FENGIPVHDTIARVVSQGK 82 FE+GIP IA ++ Sbjct: 61 FEHGIPRRHCIANIIKSLD 79 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 92.9 bits (230), Expect = 3e-18, Method: Composition-based stats. Identities = 31/85 (36%), Positives = 39/85 (45%), Gaps = 2/85 (2%) Query: 1 MELKKLMEHISITP--DYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL 58 +E+ L E D R H+ S I+L+ I AVI GA+ W IEDFG++ F Sbjct: 9 IEISNLHEFADSLILIDNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFF 68 Query: 59 KQYGDFENGIPVHDTIARVVSQGKI 83 NGIP HDT R S Sbjct: 69 AAKLSNFNGIPSHDTFNRFFSALDP 93 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 92.5 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 26/82 (31%), Positives = 43/82 (52%), Gaps = 3/82 (3%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +I PD R +++ ++I+ + + AVI GA+ W +IE FG+TH + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFENGIPVHDTIARVVSQGK 82 IP HDT++R S Sbjct: 58 RLPGLVSIPSHDTLSRFFSILD 79 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 92.2 bits (228), Expect = 5e-18, Method: Composition-based stats. Identities = 29/74 (39%), Positives = 50/74 (67%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + +++H S D RQ+W+VV+ L +I LL +CA +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTI 74 + +E G+P HDT+ Sbjct: 77 FLPYERGLPAHDTL 90 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 92.2 bits (228), Expect = 5e-18, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 36/80 (45%), Gaps = 1/80 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + E I D R+ KV + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSQGK 82 P HDT+ R K Sbjct: 77 PDLETTPSHDTLRRFFCIIK 96 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 90.2 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 47/81 (58%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ L+EH S D R +++H L +ILLL +C ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSQGKI 83 + +G+P + ++++ Sbjct: 72 PYAHGVPGERWLTILMNRIDP 92 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 89.8 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 35/78 (44%), Positives = 46/78 (58%), Gaps = 1/78 (1%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + D R A+ H DI+ L + AVISGA W +I+ FGE HLD+L++Y Sbjct: 1 MSVFRFFENLSDPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRKYRP 59 Query: 64 FENGIPVHDTIARVVSQG 81 FE GIPV DTIARV+ + Sbjct: 60 FECGIPVDDTIARVIKRI 77 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 86.0 bits (212), Expect = 3e-16, Method: Composition-based stats. Identities = 28/70 (40%), Positives = 42/70 (60%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ H + D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPV 70 G G+PV Sbjct: 72 KGILTEGVPV 81 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 86.0 bits (212), Expect = 4e-16, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 35/87 (40%), Gaps = 5/87 (5%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGE--THLDFLK 59 +++ L ++ D R+ H++S +L + A + G +G++ I + + Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 60 QYGDFENG---IPVHDTIARVVSQGKI 83 ENG IP I V+ + Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADP 300 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 85.6 bits (211), Expect = 4e-16, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 7/88 (7%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 E+ L+E ++ PD R V H L+ +L LT CAV++GA + ++ + L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GD------FENGI-PVHDTIARVVSQGK 82 F P TI RV+++ Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARID 125 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 85.2 bits (210), Expect = 6e-16, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 39/81 (48%), Gaps = 1/81 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-KQY 61 + L+E + D+R+ H L +L++ I + G G+ ++ +F + + L +++ Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSQGK 82 +P + TI RV+ + Sbjct: 61 NIIPERVPSYSTIRRVMMGVE 81 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 84.1 bits (207), Expect = 1e-15, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 1/80 (1%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ PD R + L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSQGKI 83 NG +P +TIA ++ + Sbjct: 64 RNGNMPCPNTIAGLLRRLDP 83 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 82.9 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 35/79 (44%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L ++ PD R + L IL + +CAV++GA + I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSQGK 82 F + +P T+ R++ + Sbjct: 89 FTDRVPAATTVWRLLIRID 107 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 81.4 bits (200), Expect = 1e-14, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 36/82 (43%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + PD+R V ++L+ +L L + I+G + + ++ + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSQGKI 83 F +P TI R+V +G Sbjct: 84 LGFPRRVPSERTIRRIVEEGPP 105 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 81.0 bits (199), Expect = 1e-14, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 1/81 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ + + D RQ KVVH+ I++ + V + + W ++ DF +DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSQGKI 83 P HDT+ R Sbjct: 77 PDIQKAPSHDTLRRFFCLVCP 97 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 80.6 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 35/87 (40%), Gaps = 5/87 (5%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHLD 56 +++ L ++ + PD +A H+L +L L A + G +G++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 57 FLKQYGDFENGIPVHDTIARVVSQGKI 83 F + + +P I + + Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGP 93 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 80.6 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 36/87 (41%), Gaps = 5/87 (5%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHLD 56 +++ L ++ + PD R+A H+L + LT A + G +G++ + ++ Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 57 FLKQYGDFENGIPVHDTIARVVSQGKI 83 F + + +P I + + Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGP 145 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 79.8 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 2/82 (2%) Query: 3 LKKLMEHISITPDYRQA--WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S PD+R+A + HKLSDI++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSQGK 82 NGIP T+ R+ Sbjct: 95 LDILVNGIPSEATLCRMEEGID 116 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 79.8 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 37/82 (45%), Gaps = 2/82 (2%) Query: 3 LKKLMEHISITPDYRQ--AWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K L E + PDYR+ +KL DILLL I + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSQGK 82 G +G+P T+ R+ Sbjct: 78 LGILLDGVPSEPTLCRIFKHID 99 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 79.4 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 2/82 (2%) Query: 3 LKKLMEHISITPDYRQA--WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S PD+R+A + HKL D+++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSQGK 82 NGIP T+ R+ Sbjct: 95 LDILANGIPSEATLCRMEEGID 116 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 79.1 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 6/86 (6%) Query: 3 LKKLMEHISITPD------YRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLD 56 + +++ I D RQ+WK+ + LS IL L ++G E +++EDF E + Sbjct: 1 MTTMIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEP 60 Query: 57 FLKQYGDFENGIPVHDTIARVVSQGK 82 Y D G P HDT+ RV+S Sbjct: 61 LFATYVDLSEGCPSHDTLERVISLVN 86 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 79.1 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 35/84 (41%), Gaps = 1/84 (1%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + L E +S PD R ++H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSQGKI 83 F G P T++R + + Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDP 84 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 79.1 bits (194), Expect = 5e-14, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 7/86 (8%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 ++ I++ D R ++ + L ILL+++ A ISG + WE IED+ H + L+ Sbjct: 4 EIWNAIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTK 63 Query: 65 ENG-------IPVHDTIARVVSQGKI 83 +G +P HDT V Sbjct: 64 LSGKELKVSRMPTHDTFNHVFQVIDP 89 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 78.7 bits (193), Expect = 6e-14, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 26/64 (40%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + PD R H L+ IL + I A++ GAE D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFEN 66 Sbjct: 60 PLPY 63 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 78.3 bits (192), Expect = 7e-14, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 37/83 (44%), Gaps = 1/83 (1%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-K 59 + L+ + PD R+A + L +L+ T+ A++SGA + I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 60 QYGDFENGIPVHDTIARVVSQGK 82 +G PV +T+ V+ Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLD 93 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 78.3 bits (192), Expect = 8e-14, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 1/79 (1%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ PD+R A + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 -ENGIPVHDTIARVVSQGK 82 P T RV+ Sbjct: 64 PPTRFPSDSTFRRVMMGID 82 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 76.7 bits (188), Expect = 2e-13, Method: Composition-based stats. Identities = 19/62 (30%), Positives = 35/62 (56%) Query: 22 VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQG 81 V + L+++LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 KI 83 Sbjct: 62 DP 63 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 76.4 bits (187), Expect = 3e-13, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 37/90 (41%), Gaps = 7/90 (7%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFG----ETHLDF 57 +++ L + PD R H L IL + + AV++ A+ + + ++ + L Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 58 LKQYGDF---ENGIPVHDTIARVVSQGKIT 84 ++ + P T+ RV+ +T Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVT 308 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 76.0 bits (186), Expect = 4e-13, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 35/90 (38%), Gaps = 8/90 (8%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVI-SGAEGWEDIEDFG-ETHLDFLK 59 +++ L+ D R A V +++S +L L +CA+ +G + ++ + L Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 60 QY------GDFENGIPVHDTIARVVSQGKI 83 + IP T+ V+ + Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDP 119 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 75.6 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 1/82 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L+ + D R H L ++L L + A + GA+ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPR-DVNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSQGKIT 84 +G P HDT +RV T Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPT 81 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 75.6 bits (185), Expect = 5e-13, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 33/85 (38%), Gaps = 8/85 (9%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVIS-GAEGWEDIEDFGE-------THLDF 57 L++ ++ D R H L+ IL + CA ++ G + IE + + L Sbjct: 30 LIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLHI 89 Query: 58 LKQYGDFENGIPVHDTIARVVSQGK 82 + + P TI RV++ Sbjct: 90 WRDPFTGLHRPPSERTIRRVLAALD 114 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 74.8 bits (183), Expect = 8e-13, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 40/80 (50%), Gaps = 1/80 (1%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +L ++S PD+R+A ++ L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 1 MQLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQ 60 Query: 64 F-ENGIPVHDTIARVVSQGK 82 P H +I + Sbjct: 61 LHRKRAPAHTSIRYALQGLD 80 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 74.8 bits (183), Expect = 9e-13, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 1/80 (1%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L+ S D R+ ++ L+ +LL T+ A+++GA + ++ F THLD L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 F-ENGIPVHDTIARVVSQGK 82 P + T+ ++ Sbjct: 63 LSLRRAPAYSTVRFILRGID 82 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 73.7 bits (180), Expect = 2e-12, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 35/84 (41%), Gaps = 7/84 (8%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L EH++ PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSQG 81 + P T+ RV+ Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTI 98 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 73.3 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 42/86 (48%), Gaps = 6/86 (6%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICA----VISGAEGWEDIEDFGETHLDF 57 +LKKL+E S PD R+A V H+L+ +LL + + + S E D+ L Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSR--PAFLQA 136 Query: 58 LKQYGDFENGIPVHDTIARVVSQGKI 83 L+ +P DT+ARV+ + + Sbjct: 137 LQGLFPELETLPHGDTLARVLERIEP 162 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 73.3 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 25/53 (47%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGK 82 +L+T+ V + W DI DF DFL+++ P HDT+ R K Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIK 53 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 73.3 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 34/81 (41%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + PD R ++L ++ + +CAV +GA + I D+ + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSQGK 82 +P TI +V + Sbjct: 102 CGIRFRVPSEATIRQVFGRVD 122 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 72.9 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 37/88 (42%), Gaps = 7/88 (7%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ LM+ +S T D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENG-------IPVHDTIARVVSQGK 82 F P T+ R + Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSID 308 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 72.5 bits (177), Expect = 4e-12, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 39/80 (48%), Gaps = 1/80 (1%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +L ++ PD+R+A ++ L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 1 MQLKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQ 60 Query: 64 FE-NGIPVHDTIARVVSQGK 82 PVH +I + Sbjct: 61 LHWKRAPVHTSIRYALQGLD 80 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 72.1 bits (176), Expect = 6e-12, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 36/88 (40%), Gaps = 7/88 (7%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 E++ L + ++ PD R + H+L IL L+ AV +G + E+I + + Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 62 GDFENG-------IPVHDTIARVVSQGK 82 P DT+ RV+S Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVD 125 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 71.4 bits (174), Expect = 8e-12, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 38/81 (46%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 + L+E ++ PD R+ V ++ + +L + +CA++SGA + I ++ + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSQGK 82 +P TI RV+ + Sbjct: 107 LGLTGRVPGPVTIWRVLVRVD 127 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 71.4 bits (174), Expect = 8e-12, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 25/74 (33%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 PD R V H+ S IL + A +GA + I ++ +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSQGKIT 84 T R ++ T Sbjct: 109 ESTSRRFLAGVDAT 122 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 71.0 bits (173), Expect = 1e-11, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 1/77 (1%) Query: 7 MEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFEN 66 + + D R H L+++L L + A + GA+ +I +F E LK+ + Sbjct: 5 LSILREIHDPR-DINARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVHDTIARVVSQGKI 83 G P HDT +R+ Sbjct: 64 GCPSHDTFSRIFRLIDP 80 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 69.8 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 31/80 (38%), Gaps = 7/80 (8%) Query: 11 SITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG-----DFE 65 + D R+A + H +LL+ + V++G +E I + + + + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 66 NG--IPVHDTIARVVSQGKI 83 P TI R++S+ Sbjct: 289 ERFLPPSEPTIRRILSKADP 308 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 69.0 bits (168), Expect = 4e-11, Method: Composition-based stats. Identities = 18/46 (39%), Positives = 25/46 (54%) Query: 37 VISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGK 82 ++ AE WEDIE +G + +L+ + NGIP HDT RV Sbjct: 3 RVACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLD 48 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 68.7 bits (167), Expect = 6e-11, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 7/83 (8%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L E ++ D R+ H +LL+ AV++GA + I ++ + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENG-------IPVHDTIARVV 78 P TI RV+ Sbjct: 61 GARTATALAVRIPPSGVTIRRVI 83 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 68.3 bits (166), Expect = 7e-11, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 27/78 (34%), Gaps = 1/78 (1%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+ H+ PD R V +LL+ + ++S E D+E F H L + E Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 66 -NGIPVHDTIARVVSQGK 82 P Q Sbjct: 73 LKRPPSDSAFRYFFLQVD 90 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 67.9 bits (165), Expect = 1e-10, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 35/88 (39%), Gaps = 7/88 (7%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDF 57 ++ L+ + D R+A ++ LS +L + A ++GA G +I DFG+ L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQ---YGDFENGIPVHDTIARVVSQGK 82 L + P I + + Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMD 108 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 66.7 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L D R+ H+L IL++ +CAVI+ AE +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFEN 66 Sbjct: 61 PLPC 64 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 65.6 bits (159), Expect = 5e-10, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 1/81 (1%) Query: 3 LKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L + + PD+R+A + L +LL +I A++SGA + I F TH L Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 63 DFE-NGIPVHDTIARVVSQGK 82 P + +I + Sbjct: 61 GCRWRRTPAYSSIRYALQGLD 81 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 62.9 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 6/84 (7%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L E + + P R K ++ L +LL+ + +SG W +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSQGK 82 G +P HDT+ R +S Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLD 87 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 62.1 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 3/80 (3%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L E +S PD R A + L +L L + A +S + +E F + L G Sbjct: 1 MTLREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG- 58 Query: 64 FENGIPVHDTIARVVSQGKI 83 P H + ++ + Sbjct: 59 -LRKPPGHTILTLLLHRLDP 77 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 62.1 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 35/85 (41%), Gaps = 5/85 (5%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-- 59 + + +++ PD R+ K H+ D+LL+ + AV SG + + + FL Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 60 ---QYGDFENGIPVHDTIARVVSQG 81 + E +P T+ R+ Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSL 89 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 61.0 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 3/80 (3%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L + +S PD R A + L +L L + A +S + +E F + L G Sbjct: 1 MTLRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG- 58 Query: 64 FENGIPVHDTIARVVSQGKI 83 P H I ++ + Sbjct: 59 -LRKAPGHTAITLLLHRLDP 77 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 61.0 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 8/64 (12%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 EL++L ++ D R HKL +++L+ +CAVI+GA+G IE +L Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE--------WLAGR 70 Query: 62 GDFE 65 Sbjct: 71 LQLP 74 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 60.2 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 16/66 (24%), Positives = 37/66 (56%), Gaps = 4/66 (6%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDFLKQY 61 L++ SI PD R + L +++++T+ AV+ GA+ W D+ + +G++ + +++ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDVPVGSKKYGDSCMQVVREK 61 Query: 62 GDFENG 67 +G Sbjct: 62 CCLTSG 67 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 58.3 bits (140), Expect = 8e-08, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 11/61 (18%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +L + D RQ K H L +L++TI +I + LD+L+QY Sbjct: 34 RLADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 Query: 65 E 65 Sbjct: 83 T 83 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 56.7 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 13/49 (26%), Positives = 19/49 (38%), Gaps = 1/49 (2%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGET 53 + E PD R V H+L +L L AV+ G G + + Sbjct: 67 PVAECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAA 114 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 55.6 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 25/65 (38%), Gaps = 1/65 (1%) Query: 20 WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGI-PVHDTIARVV 78 +H L +L L AV+ G + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SQGKI 83 + Sbjct: 62 RRIDP 66 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 55.6 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 12/58 (20%), Positives = 22/58 (37%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGKI 83 L+ +L L V++G + + + ++ L GIP T R+V Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDP 105 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 55.6 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 39/65 (60%), Positives = 42/65 (64%), Gaps = 12/65 (18%) Query: 1 MELKKLMEHISITPDYRQAWKVVHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLMEHISI PDYRQAWKV HKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 LDFLK 59 LDFLK Sbjct: 55 LDFLK 59 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 52.1 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 7/85 (8%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAV-ISGAEGWEDIEDFGETHLDFLKQYGD 63 + E ++ PD+R A +V+ L + + +CAV +G + + ++ + Sbjct: 23 GIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLR 82 Query: 64 FE------NGIPVHDTIARVVSQGK 82 + +P TI R ++ Sbjct: 83 LPWNPWDGHLLPDEATIRRFLNTVD 107 >UniRef50_D1RJD3 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RJD3_LEGLO Length = 61 Score = 51.3 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 1/55 (1%) Query: 22 VVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIAR 76 + L I+ L + + I DFG +++LKQ+ ++NG+PV DT+ R Sbjct: 2 KRYLLIKIMFLLLVLQFMDVKAGT-IRDFGLLKIEWLKQFLTYKNGMPVDDTMTR 55 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 51.3 bits (122), Expect = 9e-06, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 19/55 (34%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGKI 83 +L + + A +G G+ + T D + P T V+S+ Sbjct: 3 LLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDP 57 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 50.9 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 32/55 (58%), Gaps = 1/55 (1%) Query: 8 EHISITPDYRQA-WKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 EH PD R+ + HK DIL++ ICA+I GA+ W + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 49.4 bits (117), Expect = 4e-05, Method: Composition-based stats. Identities = 17/71 (23%), Positives = 27/71 (38%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L +S PD R+ + L +L L + AV+ GA I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIA 75 + P T+ Sbjct: 105 ASSTPNASTLG 115 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 49.4 bits (117), Expect = 4e-05, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 1/55 (1%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVVSQGK 82 +L L + AV++G E I FG L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLD 57 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 48.6 bits (115), Expect = 6e-05, Method: Composition-based stats. Identities = 9/55 (16%), Positives = 18/55 (32%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGKI 83 +L + + A + G+ + T D + P T V+S+ Sbjct: 3 LLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDP 57 >UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TLQ7_SHEHH Length = 74 Score = 48.2 bits (114), Expect = 8e-05, Method: Composition-based stats. Identities = 17/44 (38%), Positives = 23/44 (52%) Query: 7 MEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDF 50 EH+SI R + H DI+ L A+ S EGW DI++F Sbjct: 4 FEHLSIIKAPRSSINHEHDPVDIMFLVNSAIASDCEGWLDIDEF 47 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 47.9 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 27/57 (47%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGK 82 ++ +L +CAV++GA + D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVD 57 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 47.5 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 23/54 (42%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGK 82 +L+ + G + +THL+ L+++ + GI TI R++ Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGID 54 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 46.3 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 10/37 (27%), Positives = 21/37 (56%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVI 38 + L+E ++ PD R+ V H + +L + +CA++ Sbjct: 57 DQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAML 93 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 46.3 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 27/78 (34%), Gaps = 1/78 (1%) Query: 6 LMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 + S D R+A + L +L + +++SG+ ++ F E L L + Sbjct: 8 FGDVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTS 67 Query: 66 -NGIPVHDTIARVVSQGK 82 P I + Sbjct: 68 WRKAPCWVAIREFLLGLD 85 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 44.4 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 32/85 (37%), Gaps = 6/85 (7%) Query: 2 ELKKLMEHISITPDYRQAWKVVHK----LSDILLLTICAVISGAEGWEDIEDFGETHLDF 57 EL L+ + PD R K HK L LL+ + S E ++ L Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSQGK 82 L++ +P DT+ R++ Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDID 157 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 43.2 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 15/37 (40%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGKI 83 + + L+ + NG P DT RV+ + + Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEP 37 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 42.8 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 17/37 (45%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSQGKI 83 + F + + ++ D + G P DT+ RV + + Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEP 37 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 42.5 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 12/46 (26%), Positives = 21/46 (45%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDF 50 L + PD V+H+L+ +L+ ICAV + I ++ Sbjct: 14 GLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEW 59 >UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromonas gingivalis ATCC 33277 RepID=B2RI66_PORG3 Length = 87 Score = 41.7 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 14/45 (31%), Positives = 23/45 (51%) Query: 17 RQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 R K V+ L + L+ + +SG W +IED+ E + + LK Sbjct: 23 RIESKEVYPLDFLFLIVFLSTLSGDTSWYEIEDYAEEYEEVLKSR 67 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 41.3 bits (96), Expect = 0.010, Method: Composition-based stats. Identities = 9/86 (10%), Positives = 23/86 (26%), Gaps = 7/86 (8%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICA-VISGAEGWEDIEDFGETHLDFLKQYG 62 + + E + D R V+ + + +C+ +G + + + Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 63 DFE------NGIPVHDTIARVVSQGK 82 +P TI + Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVD 107 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 40.9 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 31/86 (36%), Gaps = 7/86 (8%) Query: 4 KKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDF------ 57 L+ ++ PD R VVH L +L + AV++GA + ++ Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 58 -LKQYGDFENGIPVHDTIARVVSQGK 82 + + P T R+++ Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVD 112 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 40.5 bits (94), Expect = 0.016, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 29/78 (37%), Gaps = 4/78 (5%) Query: 5 KLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L E I+ D R V +S I + + + + +E + K+ Sbjct: 18 HLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKALPK 73 Query: 65 ENGIPVHDTIARVVSQGK 82 + +P DTI RV+S Sbjct: 74 KTKMPRIDTIRRVLSNFD 91 >UniRef50_C6VL62 Transposase n=25 Tax=Bacilli RepID=C6VL62_LACPJ Length = 599 Score = 40.1 bits (93), Expect = 0.020, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 31/78 (39%), Gaps = 7/78 (8%) Query: 2 ELKKLMEHISITPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIEDFGETHLD--FLK 59 + +L+E + + DY+ + L +L L + A G IE F + +L Sbjct: 28 YINQLVESLKLKYDYQFGRPREYNLGAMLKLVLLAYSYGIFSSRKIERFARENKPAGWL- 86 Query: 60 QYGDFENGIPVHDTIARV 77 + IP + TI R Sbjct: 87 ----IADQIPSYRTICRF 100 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 38.2 bits (88), Expect = 0.087, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 1/46 (2%) Query: 5 KLMEHISI-TPDYRQAWKVVHKLSDILLLTICAVISGAEGWEDIED 49 KL E ++ D R +V H L+DIL I A+ G E D++ Sbjct: 45 KLAEKLAAAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDLDR 90 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.163 0.505 Lambda K H 0.267 0.0506 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 638,921,774 Number of Sequences: 3077464 Number of extensions: 25209964 Number of successful extensions: 68289 Number of sequences better than 1.0e-01: 139 Number of HSP's better than 0.1 without gapping: 311 Number of HSP's successfully gapped in prelim test: 21 Number of HSP's that attempted gapping in prelim test: 67889 Number of HSP's gapped (non-prelim): 332 length of query: 84 length of database: 1,040,396,356 effective HSP length: 54 effective length of query: 30 effective length of database: 874,213,300 effective search space: 26226399000 effective search space used: 26226399000 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 88 (38.2 bits)