BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (91 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1


                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo...   177   9e-44
UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone...   137   1e-31
UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI...   104   8e-22
UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter...    94   2e-18
UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax...    91   1e-17
UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr...    88   7e-17
UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl...    86   4e-16
UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin...    82   5e-15
UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep...    77   2e-13
UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter...    64   1e-09
UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling...    63   2e-09
UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX    60   3e-08
UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea...    54   1e-06
UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI...    54   1e-06
UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p...    53   4e-06
UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon...    52   5e-06
UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia...    52   8e-06
UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum...    47   2e-04
UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp....    46   4e-04
UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae...    45   5e-04
UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw...    45   6e-04
UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria...    45   0.001
UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ...    44   0.001
UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re...    44   0.001
UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI...    43   0.004
UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S...    42   0.007
UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid...    41   0.009
UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano...    41   0.014
UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale...    41   0.014
UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE      40   0.019
UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan...    40   0.029
UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur...    40   0.032
UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru...    38   0.093

>UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX
          Length = 91

 Score =  177 bits (449), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 83/91 (91%), Positives = 87/91 (95%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MAS+SI CPSCSAT+GVVRNGKSTAGHQRYLCS CRKTWQLQFTYTASQPG HQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 MNGVGCRA+ARIMGVGLNT+ RHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 137 bits (344), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 63/68 (92%), Positives = 66/68 (97%) Query: 17 VVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVG 76 +VRNGKSTAGHQRYLCS CRKTWQLQFTYTASQPG HQKIIDMAMNGVGCRA+ARIMGVG Sbjct: 1 MVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 60 Query: 77 LNTVLRHL 84 LNT+LRHL Sbjct: 61 LNTILRHL 68 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 104 bits (260), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 48/52 (92%), Positives = 50/52 (96%) Query: 40 QLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 QLQFTYTASQPG HQKIIDMAMNGVGCRA+ARIMGV LNT+LRHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 93.6 bits (231), Expect = 2e-18, Method: Compositional matrix adjust. 
Identities = 39/90 (43%), Positives = 58/90 (64%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA + ++CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 MN G R +AR + + +N V+R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 90.9 bits (224), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 41/90 (45%), Positives = 61/90 (67%), Gaps = 1/90 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MAS++I CP C + + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 NG G R +AR + +G+NTV+R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQS 89 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 88.2 bits (217), Expect = 7e-17, Method: Compositional matrix adjust. Identities = 40/87 (45%), Positives = 57/87 (65%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA+I ++C C+ TE V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 MN G R +A ++ V NTVL LKNS Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNS 87 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 85.5 bits (210), Expect = 4e-16, Method: Compositional matrix adjust. 
Identities = 42/66 (63%), Positives = 46/66 (69%), Gaps = 1/66 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQ-KIIDM 59 MASI + PSC+ TEGV RNGKSTAGHQ YLC CRK W L FTYT SQ HQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AMNGVG 65 + + Sbjct: 67 TIMALD 72 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 82.0 bits (201), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 40/82 (48%), Positives = 50/82 (60%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGV 64 I CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M +G Sbjct: 5 DIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMMNDGS 64 Query: 65 GCRASARIMGVGLNTVLRHLKN 86 R AR +GV L TVLRHLK+ Sbjct: 65 EQRDIARKLGVSLETVLRHLKD 86 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 77.0 bits (188), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 1/91 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA ++I CP C + + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 NG G R +AR + +G NTV+R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 64.3 bits (155), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 31/81 (38%), Positives = 47/81 (58%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + + C C ++ VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRASARIMGVGLNTVL 81 G RA++R + V NTVL Sbjct: 61 AQNHGKRATSRHLQVSYNTVL 81 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 63.2 bits (152), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M ++ C T+ + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 +NG G R +AR++GV NTV K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 59.7 bits (143), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 24/65 (36%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA++++ P C+ ++ V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 54.3 bits (129), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 27/77 (35%), Positives = 45/77 (58%), Gaps = 3/77 (3%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M+S++I CP C + + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRAS--ARIMGV 75 N G + AR+ G+ Sbjct: 60 FNEPGMMLARMARLHGI 76 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. 
CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 54.3 bits (129), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 4/87 (4%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLC---SPCRKTWQLQFTYTASQPGKHQKIIDMA 60 ++I CP C +T+ VV+NG S G QRY C S R+++ ++Y + ++I M Sbjct: 5 LAIECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMV 63 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 +NG G R +AR++ + TV LK S Sbjct: 64 VNGSGIRDTARVLEISPITVASELKKS 90 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 52.8 bits (125), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 6/78 (7%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCR 67 C C T+ V R+GK + G+QR+ CS C++T+QL++ Y A +H++ + G R Sbjct: 3 CRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVAD---RHER---YSPGNAGIR 56 Query: 68 ASARIMGVGLNTVLRHLK 85 +AR++ VG + R K Sbjct: 57 DTARVLKVGCMGLTRFRK 74 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 52.4 bits (124), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 25/64 (39%), Positives = 39/64 (60%), Gaps = 1/64 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MASI+I CP C+ ++ V R+GK+ AG+ RY C C +QL +TY A P ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust. 
Identities = 25/84 (29%), Positives = 45/84 (53%), Gaps = 1/84 (1%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 I CP C + V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKE-KKVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRASARIMGVGLNTVLRHLKNSGR 89 R++ARI+ + T+L+ + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGR 94 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 46.6 bits (109), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 18/47 (38%), Positives = 28/47 (59%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTA 47 M ++ ++C C TE V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRA 47 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 4/82 (4%) Query: 9 PSCSATEGVVRNGKSTAGHQRYLCSPC---RKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 PSC +++ VV+ + T G QRY C R T+ Q+ Y Q+I++M +NG G Sbjct: 9 PSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIVEMVVNGSG 67 Query: 66 CRASARIMGVGLNTVLRHLKNS 87 R AR++ + TV LK S Sbjct: 68 TRDPARVLKISRTTVTETLKKS 89 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 45.4 bits (106), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 20/41 (48%), Positives = 27/41 (65%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL 41 MA I + CP + T+ V+RNG +T+G Q Y C C KT+QL Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQL 41 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 45.4 bits (106), Expect = 6e-04, Method: Compositional matrix adjust. 
Identities = 21/51 (41%), Positives = 31/51 (60%) Query: 41 LQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 L Y A + ++II+MA G G R +A + +G+NTV+R LKNS +S Sbjct: 23 LTLAYEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 44.7 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 8/85 (9%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW----QLQFTYTASQPGKHQKIIDMA 60 + CP C+ ++ V+NGK+ HQRY+C C KT+ + T G K ID Sbjct: 45 ATHCPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCL 102 Query: 61 MNGVGCRASARIMGVGLNT--VLRH 83 +N R +A+I G+ L T V RH Sbjct: 103 VNKYPLRKTAKICGISLPTAFVWRH 127 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 44.3 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 8/77 (10%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQ-----KIIDMAMN 62 CP C +E + RNGK G QRY+C C+KT+ FT +A+ K K +N Sbjct: 54 CPLC-GSETISRNGKYN-GKQRYICKSCKKTFT-DFTNSATYKSKKTLDKWLKYAKCMIN 110 Query: 63 GVGCRASARIMGVGLNT 79 G R SA+I+ + + T Sbjct: 111 GYSIRKSAKIVEINIAT 127 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 2/87 (2%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGV 64 + CPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSC-GSHHVVKCGRPL-GRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRASARIMGVGLNTVLRHLKNSGRSR 91 RA +R++ V L TV +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. 
Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 3/85 (3%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNG 63 I CPSC ++ V++NG S+ G +Y C+ CR+T+ + S+ K ++I+ +N Sbjct: 67 IRPNCPSCK-SDKVIKNG-SSRGKTKYKCNVCRRTFYDANSRRMSREQK-ERILKEYLNR 123 Query: 64 VGCRASARIMGVGLNTVLRHLKNSG 88 + R A++ G L TV +K G Sbjct: 124 MSMRGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 5/78 (6%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKH----QKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + S K ++ ID MNG Sbjct: 52 CPLCGCIH-VVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRASARIMGVGLNTVL 81 + R +A G+ NT Sbjct: 111 LSIRKTAVACGIHRNTAF 128 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 41.2 bits (95), Expect = 0.009, Method: Compositional matrix adjust. Identities = 25/88 (28%), Positives = 40/88 (45%), Gaps = 10/88 (11%) Query: 5 SIRCPSC----SATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 ++ CP C S +G+VR G QRY C CR + + +K + + Sbjct: 3 TMNCPRCNNAHSCKDGIVR------GRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLY 56 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSG 88 + G+G RA RI+ + TV + +K G Sbjct: 57 LEGLGFRAIGRILNISYGTVYQWVKACG 84 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 40.8 bits (94), Expect = 0.014, Method: Compositional matrix adjust. 
Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 2/85 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C+++ +NG G Q Y C C + ++ TAS P ++ + + + G+G Sbjct: 1 MNCPRCNSSTHK-KNG-IVFGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYLEGLG 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 R+ R +GV +V + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQE 83 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 40.8 bits (94), Expect = 0.014, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 2/80 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C + + VV+NG G Q YLC C + ++ + + M++NG+G Sbjct: 1 MDCPYCQSHK-VVKNGHRQ-GKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMG 58 Query: 66 CRASARIMGVGLNTVLRHLK 85 RA R+ G+ NT+L ++ Sbjct: 59 FRAIERVTGISHNTILNWVR 78 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 40.4 bits (93), Expect = 0.019, Method: Composition-based stats. Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 10/83 (12%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFT---YTASQPG--KHQKIIDMAMN 62 CP C + V +NGKS G QRY+C CR ++ +FT ++ ++ G K K ++ + Sbjct: 53 CPKCQCKD-VNKNGKSN-GRQRYICKRCRTSFD-EFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRASARIMGVGLNT--VLRH 83 G+ R A +GVG+ T +RH Sbjct: 110 GLSIRKCAEEVGVGVKTSFYMRH 132 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 39.7 bits (91), Expect = 0.029, Method: Compositional matrix adjust. 
Identities = 22/85 (25%), Positives = 43/85 (50%), Gaps = 2/85 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C+++ +NG G QRY C C + ++ T+ P ++ + + + G+G Sbjct: 1 MNCPRCNSSTHK-KNG-IVFGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYLEGLG 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 R+ R +GV +V + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQE 83 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 39.7 bits (91), Expect = 0.032, Method: Composition-based stats. Identities = 28/79 (35%), Positives = 41/79 (51%), Gaps = 4/79 (5%) Query: 9 PSCSAT-EGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKI-IDMAMNGVGC 66 PSC G+V+NGK+ AG QR+LC C + +T+ +H KI ID ++G Sbjct: 7 PSCDMCGHGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDI--RHFKIFIDWILSGESA 64 Query: 67 RASARIMGVGLNTVLRHLK 85 A+ +GV T+ R K Sbjct: 65 DHLAKRLGVTRRTLTRWFK 83 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 38.1 bits (87), Expect = 0.093, Method: Compositional matrix adjust. Identities = 25/85 (29%), Positives = 38/85 (44%), Gaps = 1/85 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + C C ++ +V+NG S +G Q+Y C C L + +KI+ Sbjct: 1 MIKETYECRECGSS-NIVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLK 85 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLK 84 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo... 122 3e-27 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 117 2e-25 UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax... 115 4e-25 UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter... 110 1e-23 UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr... 
107 1e-22 UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling... 104 7e-22 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 100 3e-20 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 99 6e-20 UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin... 97 1e-19 UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea... 95 7e-19 UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter... 91 1e-17 UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI... 90 2e-17 UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp.... 85 4e-16 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 85 9e-16 UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria... 82 5e-15 UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI... 82 5e-15 UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p... 75 5e-13 UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon... 75 7e-13 UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX 75 8e-13 UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl... 71 9e-12 UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ... 69 6e-11 UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw... 68 8e-11 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 62 8e-09 UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae... 52 6e-06 Sequences not found previously or not previously below threshold: UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S... 70 3e-11 UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale... 69 6e-11 UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano... 67 1e-10 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 66 4e-10 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 64 2e-09 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 
63 3e-09 UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID... 62 5e-09 UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcti... 62 7e-09 UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ... 61 1e-08 UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 60 2e-08 UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani ... 60 2e-08 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 60 3e-08 UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 R... 59 4e-08 UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriale... 59 4e-08 UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru... 59 4e-08 UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp... 58 7e-08 UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid... 56 3e-07 UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum l... 56 4e-07 UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae... 55 5e-07 UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=H... 55 8e-07 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 54 1e-06 UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 53 2e-06 UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillu... 53 2e-06 UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavoba... 53 3e-06 UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes ... 53 3e-06 UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyric... 53 4e-06 UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyri... 52 5e-06 UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marin... 52 7e-06 UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellula... 52 8e-06 UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=enviro... 51 1e-05 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 
51 1e-05 UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE 51 1e-05 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 51 1e-05 UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Met... 50 2e-05 UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorick... 50 2e-05 UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitroso... 50 2e-05 UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodoba... 50 2e-05 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 50 3e-05 UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax... 49 3e-05 UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_V... 49 4e-05 UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus pl... 49 5e-05 UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=D... 49 6e-05 UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur... 48 7e-05 UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseud... 48 8e-05 UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD 48 9e-05 UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. ... 48 9e-05 UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria ... 48 1e-04 UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gamm... 47 1e-04 UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus ... 47 2e-04 UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU 47 2e-04 UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelan... 47 2e-04 UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia ... 47 2e-04 UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervido... 46 3e-04 UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultu... 45 5e-04 UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_... 45 8e-04 UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus... 45 8e-04 UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichod... 
45 8e-04 UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nod... 45 0.001 UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C... 44 0.001 UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12... 44 0.001 UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=... 44 0.001 UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryoc... 44 0.001 UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacte... 44 0.001 UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl... 44 0.002 UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobact... 44 0.002 UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoi... 44 0.002 UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=... 44 0.002 UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q... 43 0.002 UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidob... 43 0.002 UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_E... 43 0.003 UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 2... 43 0.003 UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax... 43 0.003 UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobact... 43 0.003 UniRef50_P04137 Uncharacterized protein in transposable element ... 43 0.004 UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryoc... 43 0.004 UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q... 42 0.005 UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium ... 42 0.006 UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae... 42 0.006 UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT... 42 0.007 UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax... 42 0.009 UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultu... 41 0.010 UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobi... 
41 0.010 UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus ... 41 0.010 UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracas... 41 0.011 UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanoth... 41 0.014 UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia R... 40 0.016 UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodoba... 40 0.017 UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepI... 40 0.017 UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostoca... 40 0.019 UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolo... 40 0.019 UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacteriu... 40 0.023 UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synecho... 40 0.027 UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychro... 40 0.030 UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryoc... 40 0.031 UniRef50_B0V2Z3 Novel zinc finger protein (Fragment) n=2 Tax=Dan... 40 0.031 UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacte... 39 0.043 UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderi... 39 0.043 UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methano... 39 0.052 UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=... 39 0.058 UniRef50_A5DPJ1 Putative uncharacterized protein n=2 Tax=Pichia ... 38 0.071 UniRef50_UPI0001793699 PREDICTED: similar to zinc-finger homeodo... 38 0.073 UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methano... 38 0.079 UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH... 38 0.080 UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax... 38 0.082 UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_... 38 0.092 UniRef50_Q9H5H4 Zinc finger protein 768 n=9 Tax=Theria RepID=ZN7... 
38 0.096 >UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX Length = 91 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 83/91 (91%), Positives = 87/91 (95%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MAS+SI CPSCSAT+GVVRNGKSTAGHQRYLCS CRKTWQLQFTYTASQPG HQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 MNGVGCRA+ARIMGVGLNT+ RHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 117 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 1/91 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA ++I CP C + + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 NG G R +AR + +G NTV+R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 115 bits (288), Expect = 4e-25, Method: Composition-based stats. Identities = 41/91 (45%), Positives = 61/91 (67%), Gaps = 1/91 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MAS++I CP C + + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 NG G R +AR + +G+NTV+R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQSE 90 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 110 bits (275), Expect = 1e-23, Method: Composition-based stats. 
Identities = 39/90 (43%), Positives = 58/90 (64%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA + ++CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 MN G R +AR + + +N V+R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 107 bits (266), Expect = 1e-22, Method: Composition-based stats. Identities = 40/91 (43%), Positives = 59/91 (64%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA+I ++C C+ TE V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 MN G R +A ++ V NTVL LKNS + + Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNSRQGK 91 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 104 bits (260), Expect = 7e-22, Method: Composition-based stats. Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M ++ C T+ + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 +NG G R +AR++GV NTV K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 99.7 bits (247), Expect = 3e-20, Method: Composition-based stats. 
Identities = 63/74 (85%), Positives = 66/74 (89%) Query: 17 VVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVG 76 +VRNGKSTAGHQRYLCS CRKTWQLQFTYTASQPG HQKIIDMAMNGVGCRA+ARIMGVG Sbjct: 1 MVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 60 Query: 77 LNTVLRHLKNSGRS 90 LNT+LRHL Sbjct: 61 LNTILRHLNKLRPQ 74 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 98.6 bits (244), Expect = 6e-20, Method: Composition-based stats. Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 2/87 (2%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGV 64 + CPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSCG-SHHVVKCGR-PLGRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRASARIMGVGLNTVLRHLKNSGRSR 91 RA +R++ V L TV +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 97.4 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 41/91 (45%), Positives = 53/91 (58%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M I CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M Sbjct: 1 MKMGDIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMM 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 +G R AR +GV L TVLRHLK+ ++ Sbjct: 61 NDGSEQRDIARKLGVSLETVLRHLKDLRLNK 91 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 95.1 bits (235), Expect = 7e-19, Method: Composition-based stats. 
Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 3/89 (3%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M+S++I CP C + + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRAS--ARIMGVGLNTVLRHLKNS 87 N G + AR+ G+ + + K Sbjct: 60 FNEPGMMLARMARLHGIQPCQLFKWKKQY 88 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 90.9 bits (224), Expect = 1e-17, Method: Composition-based stats. Identities = 31/82 (37%), Positives = 47/82 (57%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + + C C ++ VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRASARIMGVGLNTVLR 82 G RA++R + V NTVL Sbjct: 61 AQNHGKRATSRHLQVSYNTVLS 82 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 90.1 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 4/87 (4%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCS---PCRKTWQLQFTYTASQPGKHQKIIDMA 60 ++I CP C +T+ VV+NG S G QRY C R+++ ++Y + ++I M Sbjct: 5 LAIECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMV 63 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 +NG G R +AR++ + TV LK S Sbjct: 64 VNGSGIRDTARVLEISPITVASELKKS 90 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 85.5 bits (210), Expect = 4e-16, Method: Composition-based stats. 
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 4/94 (4%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPC---RKTWQLQFTYTASQPGKHQKII 57 M + PSC +++ VV+ + T G QRY C R T+ Q+ Y Q+I+ Sbjct: 1 MVLEPVLYPSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIV 59 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 +M +NG G R AR++ + TV LK S + Sbjct: 60 EMVVNGSGTRDPARVLKISRTTVTETLKKSSSAE 93 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 84.7 bits (208), Expect = 9e-16, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 1/85 (1%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 I CP C + V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKE-KKVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 R++ARI+ + T+L+ + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGRK 95 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 82.4 bits (202), Expect = 5e-15, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 8/85 (9%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ----LQFTYTASQPGKHQKIIDMA 60 + CP C+ ++ V+NGK+ HQRY+C C KT+ T G K ID Sbjct: 45 ATHCPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCL 102 Query: 61 MNGVGCRASARIMGVGLNT--VLRH 83 +N R +A+I G+ L T V RH Sbjct: 103 VNKYPLRKTAKICGISLPTAFVWRH 127 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 82.0 bits (201), Expect = 5e-15, Method: Composition-based stats. 
Identities = 48/52 (92%), Positives = 50/52 (96%) Query: 40 QLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 QLQFTYTASQPG HQKIIDMAMNGVGCRA+ARIMGV LNT+LRHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 75.4 bits (184), Expect = 5e-13, Method: Composition-based stats. Identities = 28/85 (32%), Positives = 48/85 (56%), Gaps = 6/85 (7%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + C C T+ V R+GK + G+QR+ CS C++T+QL++ Y A +H++ + G Sbjct: 1 MSCRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVA---DRHERY---SPGNAG 54 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 R +AR++ VG + R K + R Sbjct: 55 IRDTARVLKVGCMGLTRFRKLNPRQ 79 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 75.1 bits (183), Expect = 7e-13, Method: Composition-based stats. Identities = 25/64 (39%), Positives = 39/64 (60%), Gaps = 1/64 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MASI+I CP C+ ++ V R+GK+ AG+ RY C C +QL +TY A P ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 74.7 bits (182), Expect = 8e-13, Method: Composition-based stats. Identities = 24/65 (36%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA++++ P C+ ++ V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 71.2 bits (173), Expect = 9e-12, Method: Composition-based stats. 
Identities = 42/66 (63%), Positives = 46/66 (69%), Gaps = 1/66 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQ-KIIDM 59 MASI + PSC+ TEGV RNGKSTAGHQ YLC CRK W L FTYT SQ HQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AMNGVG 65 + + Sbjct: 67 TIMALD 72 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 69.7 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 36/84 (42%), Gaps = 5/84 (5%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + + T ++ ID MNG Sbjct: 52 CPLCGCI-HVVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRASARIMGVGLNTVLRHLKNS 87 + R +A G+ NT Sbjct: 111 LSIRKTAVACGIHRNTAFLWRHKI 134 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 68.5 bits (166), Expect = 6e-11, Method: Composition-based stats. Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 8/85 (9%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQ-----KIIDMAMN 62 CP C +E + RNGK G QRY+C C+KT+ FT +A+ K K +N Sbjct: 54 CPLCG-SETISRNGKY-NGKQRYICKSCKKTFT-DFTNSATYKSKKTLDKWLKYAKCMIN 110 Query: 63 GVGCRASARIMGVGLNTVLRHLKNS 87 G R SA+I+ + + T Sbjct: 111 GYSIRKSAKIVEINIATSFFWRHKI 135 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 68.5 bits (166), Expect = 6e-11, Method: Composition-based stats. 
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 2/78 (2%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCR 67 CP C + VV+NG G Q YLC C + ++ + + M++NG+G R Sbjct: 3 CPYC-QSHKVVKNGH-RQGKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMGFR 60 Query: 68 ASARIMGVGLNTVLRHLK 85 A R+ G+ NT+L ++ Sbjct: 61 AIERVTGISHNTILNWVR 78 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 68.1 bits (165), Expect = 8e-11, Method: Composition-based stats. Identities = 21/52 (40%), Positives = 31/52 (59%) Query: 40 QLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 L Y A + ++II+MA G G R +A + +G+NTV+R LKNS +S Sbjct: 22 LLTLAYEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 67.4 bits (163), Expect = 1e-10, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 43/84 (51%), Gaps = 2/84 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C+++ +NG G Q Y C C + ++ TAS P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYLEGLG 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGR 89 R+ R +GV +V + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 65.8 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 2/84 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C+++ +NG G QRY C C + ++ T+ P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYLEGLG 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGR 89 R+ R +GV +V + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 63.9 bits (154), Expect = 2e-09, Method: Composition-based stats. 
Identities = 23/86 (26%), Positives = 40/86 (46%), Gaps = 3/86 (3%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ--FTYTASQPGKHQKIIDMA 60 ++ I+CP+C ++ + +NG + G Q Y C C++ + TY K KI + Sbjct: 4 TLYIKCPAC-LSDNIKKNGFKSYGKQNYKCKDCKRQFIGDHALTYQGCHSQKDSKIRYLM 62 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKN 86 + G G + A + + VL LK Sbjct: 63 VRGSGIKDIACVERISKGKVLATLKK 88 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 62.7 bits (151), Expect = 3e-09, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 3/86 (3%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQF-TYTASQPGKHQKIIDMAMNGV 64 ++CP C ATE + +NGK G Q ++C+ C + + + Q+ ++M +NG+ Sbjct: 1 MQCPYCGATE-IRKNGK-RRGKQNHICTKCERQFIDVYDPPKGYSEELKQECLEMYLNGM 58 Query: 65 GCRASARIMGVGLNTVLRHLKNSGRS 90 G R R+ GV T++ +K G Sbjct: 59 GFRPIERVKGVHHTTIIFWVKQMGEK 84 >UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID=A9VV42_BACWK Length = 342 Score = 62.3 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 26/83 (31%), Positives = 34/83 (40%), Gaps = 5/83 (6%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW---QLQFTYTASQPGKHQKIIDMAMNG 63 CP C+ +E VVR GK QRY C C KT+ Y + + +D G Sbjct: 55 ECPHCA-SEHVVRFGK-HNNRQRYRCKCCSKTFTDTTNTVLYRTRKGNEWITFVDCMFKG 112 Query: 64 VGCRASARIMGVGLNTVLRHLKN 86 R SA I+GV T+ Sbjct: 113 YSLRKSAEIVGVTWVTLFYWRHK 135 >UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FRR6_PSYA2 Length = 108 Score = 61.6 bits (148), Expect = 7e-09, Method: Composition-based stats. 
Identities = 26/88 (29%), Positives = 36/88 (40%), Gaps = 3/88 (3%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ--FTYTASQPGKHQKIIDM 59 I I CP C + + +NG + G Q Y C C++ + TY +I M Sbjct: 3 TQIDISCPDCHSI-SLKKNGIKSYGKQNYQCKDCQRQFIGDHALTYQGCHSRIEDRIRLM 61 Query: 60 AMNGVGCRASARIMGVGLNTVLRHLKNS 87 G G R A I V + VL L +S Sbjct: 62 TARGCGIRDIAVITSVSIGKVLSTLGSS 89 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 61.6 bits (148), Expect = 8e-09, Method: Composition-based stats. Identities = 18/49 (36%), Positives = 28/49 (57%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQ 49 M ++ ++C C TE V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRACH 49 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 60.8 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 39/83 (46%), Gaps = 2/83 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 I+CP C + + + +NG G Q Y+C C + + + + + +NG+G Sbjct: 11 IQCPDC-SCQHIPKNGHQP-GKQNYICVACSHQFIKPYHPQEYSDNVKRLFLRIYVNGMG 68 Query: 66 CRASARIMGVGLNTVLRHLKNSG 88 R A + GV T++ +K++ Sbjct: 69 IRRIAWVKGVTYPTIINLIKHTR 91 >UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IZ3_CLOAB Length = 142 Score = 60.4 bits (145), Expect = 2e-08, Method: Composition-based stats. 
Identities = 24/85 (28%), Positives = 38/85 (44%), Gaps = 8/85 (9%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQ-----KIIDMAMN 62 CP C +E + RN K G Q Y+C C+K++ FT +A+ K K +N Sbjct: 54 CPICG-SETISRNSKY-NGKQGYICKSCKKSFT-DFTNSATYKSKKTLDKWLKYAKCMVN 110 Query: 63 GVGCRASARIMGVGLNTVLRHLKNS 87 G R SA+++ + + T Sbjct: 111 GYSIRKSAKVVEINIATSFFWRHKI 135 >UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani RepID=Q891N5_CLOTE Length = 279 Score = 60.0 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 37/87 (42%), Gaps = 6/87 (6%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ----LQFTYTASQPGKHQKIIDMA 60 C C +E +V+NGK QRY+C C KT+ +Y+ K + Sbjct: 56 DTICVHC-KSENIVKNGKYKE-KQRYICKDCHKTFTNYTNSPISYSKKNISKWIEYTKCM 113 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 + G R S++++G+ L+T Sbjct: 114 LAGYSLRKSSKLVGISLSTAFYWRHKI 140 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 59.7 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 3/85 (3%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNG 63 I CPSC ++ V++NG S+ G +Y C+ CR+T+ + ++I+ +N Sbjct: 67 IRPNCPSC-KSDKVIKNG-SSRGKTKYKCNVCRRTF-YDANSRRMSREQKERILKEYLNR 123 Query: 64 VGCRASARIMGVGLNTVLRHLKNSG 88 + R A++ G L TV +K G Sbjct: 124 MSMRGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 RepID=P73782_SYNY3 Length = 141 Score = 59.3 bits (142), Expect = 4e-08, Method: Composition-based stats. 
Identities = 20/84 (23%), Positives = 38/84 (45%), Gaps = 2/84 (2%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGC 66 CP C VV+NG G QR+ C C+ + + + + M G+ Sbjct: 6 HCPQCGHGN-VVKNGFV-KGKQRFKCKRCQYKFTNLSKERGKLLWMKLEAVLLYMGGMSM 63 Query: 67 RASARIMGVGLNTVLRHLKNSGRS 90 A+A+++GV ++L +++ G + Sbjct: 64 NATAKLLGVSTQSLLNWIRDFGEA 87 >UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriales RepID=Q116V8_TRIEI Length = 108 Score = 59.3 bits (142), Expect = 4e-08, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 2/81 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C + +V+NG G Q YLC C + ++ + + M ++G+G Sbjct: 1 MHCPYC-QSHKIVKNGH-RNGKQSYLCRKCGRQFRENPCPIGYSSEVKEACLKMFLSGMG 58 Query: 66 CRASARIMGVGLNTVLRHLKN 86 RA R G+ N+VL ++ Sbjct: 59 FRAIERATGISHNSVLNWVRR 79 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 58.9 bits (141), Expect = 4e-08, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + C C ++ +V+NG S +G Q+Y C C L + +KI+ Sbjct: 1 MIKETYECRECGSSN-IVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKN 86 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLKK 85 >UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JAS9_9ALTE Length = 181 Score = 58.5 bits (140), Expect = 7e-08, Method: Composition-based stats. 
Identities = 22/82 (26%), Positives = 35/82 (42%), Gaps = 4/82 (4%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW---QLQFTYTASQPGKHQKIIDMAMN 62 +CP C ++ +R G S QRY C C KT+ Y + + ++ Sbjct: 55 TQCPYC-QSKTFIRWGSSENERQRYRCKRCAKTFNALVGSPLYRMRKEELWLEYVETMRY 113 Query: 63 GVGCRASARIMGVGLNTVLRHL 84 G+ R +A++ GV L T R Sbjct: 114 GLSLRKAAKVTGVSLRTAFRWR 135 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 2/84 (2%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGV 64 ++ CP C+ ++G G QRY C CR + + +K + + + G+ Sbjct: 3 TMNCPRCNNAHSC-KDG-IVRGRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLYLEGL 60 Query: 65 GCRASARIMGVGLNTVLRHLKNSG 88 G RA RI+ + TV + +K G Sbjct: 61 GFRAIGRILNISYGTVYQWVKACG 84 >UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HSL0_PARL1 Length = 342 Score = 55.8 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 33/94 (35%), Gaps = 11/94 (11%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCS-----PCRKTW---QLQFTYTASQPGKHQKIIDM 59 CP C + +V++G+ G QR+ C C +T+ +P K M Sbjct: 55 CPHCGH-DDIVKHGRDRGGRQRFRCRRSGSSGCGQTFNALTGTAFTRMRKPEKWAAYARM 113 Query: 60 AMNGVGCRASARI--MGVGLNTVLRHLKNSGRSR 91 G + +G+ T R R++ Sbjct: 114 MATGFKSVDDVKTSGLGISRLTAWRWRHRLLRAQ 147 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 55.4 bits (132), Expect = 5e-07, Method: Composition-based stats. 
Identities = 18/80 (22%), Positives = 31/80 (38%), Gaps = 5/80 (6%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCR 67 CP+C + ++NG G + C C + + + T Q I + + G+ R Sbjct: 37 CPNCG-SHHTIKNGSIHNGKPKRQCKECGRQFVINPTNKTVSDETKQLIDKLLLEGISLR 95 Query: 68 ASARIMGVGLNTVLRHLKNS 87 AR+ G L+N Sbjct: 96 VIARVTGAS----WSWLQNY 111 >UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V2B8_9AQUI Length = 125 Score = 55.0 bits (131), Expect = 8e-07, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 2/84 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 I+CP C + + GK+T G QRY C+ C + + Y + M G+ Sbjct: 14 IKCPECG-SNWCKKFGKNT-GKQRYKCNECGRHFYEGAKYHKHPEKVKLLALKMYSKGMS 71 Query: 66 CRASARIMGVGLNTVLRHLKNSGR 89 A AR++ + TV R G+ Sbjct: 72 KSAIARVLNLPYRTVARWTYEVGK 95 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 54.3 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 2/85 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + C +C + R ++ G QRY C C ++Q + Y A +I + Sbjct: 1 MNCKNCDQAHCIKRGKRN--GIQRYYCKICFTSFQENYHYKAYDSSIDTLLISLLRECCS 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 AR++ + NTVL + + Sbjct: 59 VLGIARVLKISKNTVLSRMLKISKQ 83 >UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IP3_CLOAB Length = 171 Score = 53.5 bits (127), Expect = 2e-06, Method: Composition-based stats. 
Identities = 21/90 (23%), Positives = 33/90 (36%), Gaps = 11/90 (12%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTY-----TASQPGKHQKII 57 + + C E RNGK QRY+C C+KT+ FTY + K + Sbjct: 50 KVYLHCKL----EMFSRNGKHDE-KQRYVCKTCKKTFT-DFTYSPISSSKKPLDKWLQYA 103 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + G R A+ + + + T Sbjct: 104 KCMIVGYSIRKCAKTVNINIATSFFWRHKI 133 >UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillus RepID=A6CNB6_9BACI Length = 335 Score = 53.1 bits (126), Expect = 2e-06, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 34/89 (38%), Gaps = 7/89 (7%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYT----ASQPGKHQKIID 58 + C C + V RNGK QRYLC C K++ + + T GK K Sbjct: 49 KEGLGCIHCGSV-KVKRNGKYRE-RQRYLCRDCGKSF-NELSNTPIAGTRYLGKWAKYFH 105 Query: 59 MAMNGVGCRASARIMGVGLNTVLRHLKNS 87 M + G A+ + + ++T Sbjct: 106 MMVEGYTLPKIAKRLKIHISTAFYWRHKI 134 >UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X4X6_FLAB3 Length = 169 Score = 53.1 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 35/85 (41%), Gaps = 2/85 (2%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGC 66 CP C + VV++G QR+LC C + ++ K + + + G+ Sbjct: 35 TCPKCQQ-QNVVKSGIVKE-RQRFLCRSCNYYFTVKKLGKQIDDYYVTKALQLYLEGLSY 92 Query: 67 RASARIMGVGLNTVLRHLKNSGRSR 91 R RI+GV T+ ++ R Sbjct: 93 REIERILGVSHVTISSWVRKYNIKR 117 >UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes RepID=D2QCU0_9SPHI Length = 139 Score = 52.7 bits (125), Expect = 3e-06, Method: Composition-based stats. 
Identities = 21/87 (24%), Positives = 39/87 (44%), Gaps = 4/87 (4%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA++ +CP C++ + V RNG QR+ C C + + K + + Sbjct: 1 MATL--KCPKCNSVDAV-RNG-IVNQRQRFRCKKCNYNFTVGKVGKGISTYYVIKALQLY 56 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 + GV R R++G+ +V+ +K Sbjct: 57 IEGVSFREIERLLGISHVSVMNWVKKY 83 >UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyricum RepID=C4IIL3_CLOBU Length = 325 Score = 52.7 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 32/83 (38%), Gaps = 6/83 (7%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNG 63 CP C E + + GK G QRY C C+KT+ + Y P K K I++ Sbjct: 35 CPHCKNVEFI-KFGKYD-GIQRYRCKSCKKTFSYTTNSLWKYLKHPPEKWFKFIELLGEK 92 Query: 64 VGCRASARIMGVGLNTVLRHLKN 86 A+ + + + T Sbjct: 93 KTLEYCAKTLKISIVTAFNWRHK 115 >UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyricum RepID=B1QSI6_CLOBU Length = 336 Score = 52.3 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 34/87 (39%), Gaps = 8/87 (9%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSP--CRKTWQLQFT----YTASQPGKHQKIIDMA 60 CP C + ++ GK QRY C C KT+ Y QP K + I++ Sbjct: 34 SCPYCG-CKHFIKYGKY-QDIQRYKCKNEECGKTFSNTTFSVWKYLKYQPEKWIEFIELM 91 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 G+ +SARI+ + T Sbjct: 92 CEGMTLESSARILKITTTTAFYWRHKI 118 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 51.9 bits (123), Expect = 6e-06, Method: Composition-based stats. Identities = 20/41 (48%), Positives = 27/41 (65%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL 41 MA I + CP + T+ V+RNG +T+G Q Y C C KT+QL Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQL 41 >UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marina EX-H1 RepID=C0QU68_PERMH Length = 94 Score = 51.6 bits (122), Expect = 7e-06, Method: Composition-based stats. 
Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 2/87 (2%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M I CP C +E V+NGK+ G Q YLC C + + + ++ +++ Sbjct: 1 MGGKKISCPHC-ESERCVKNGKA-NGKQTYLCKECYYRFTINASKRKYPFKIRREAVNLY 58 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 G ++ + + + T+ +K Sbjct: 59 KEGYTLTEISKKLNIKVQTIHHWVKKY 85 >UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellular organisms RepID=B0ABB1_9CLOT Length = 454 Score = 51.6 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 36/81 (44%), Gaps = 6/81 (7%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIID 58 ++CP C + + + +NGK T QRY+C CR T+ + + T K Sbjct: 136 KNDLKCPKCGSFD-LNKNGK-TNQRQRYICKNCRTTFDERSFSPLSNTKLSLDTWLKYCQ 193 Query: 59 MAMNGVGCRASARIMGVGLNT 79 + G + A+ +GV + T Sbjct: 194 FMIEGGTIKYCAQKVGVSIPT 214 >UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=environmental samples RepID=Q64EP4_9ARCH Length = 164 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 4/77 (5%) Query: 17 VVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNGVGCRASARI 72 +VR G G QR+ C C K + F + I + + G RA RI Sbjct: 38 IVRYGHDKNGRQRFKCKTCGKVFVETKNTVFYNRKLSEDQIILICKLLVEKNGIRAIERI 97 Query: 73 MGVGLNTVLRHLKNSGR 89 M + +T+ +K+ R Sbjct: 98 MEIHRDTISDVVKDLAR 114 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. 
Identities = 28/85 (32%), Positives = 37/85 (43%), Gaps = 3/85 (3%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ--FTYTASQPGKHQKIIDMAMNG 63 I CP CS+ + + +NGK Q YLC C + + TY Q+I+ M + G Sbjct: 7 ISCPKCSSCQ-IKKNGKKPNNKQNYLCKCCGRQFIGDHALTYRGCHSKISQRILIMLVRG 65 Query: 64 VGCRASARIMGVGLNTVLRHLKNSG 88 G R A I V VL L N Sbjct: 66 CGIRDVAAIEKVSCTKVLSVLLNVR 90 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 6/77 (7%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW----QLQFTYTASQPGKHQKIIDMAMN 62 CP C + V +NGKS G QRY+C CR ++ F+ T K K ++ + Sbjct: 52 ECPKCQCKD-VNKNGKS-NGRQRYICKRCRTSFDEFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRASARIMGVGLNT 79 G+ R A +GVG+ T Sbjct: 110 GLSIRKCAEEVGVGVKT 126 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 2/83 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + C C + + NGK G QRY C C + K + I + + +G Sbjct: 1 MECKGC-KSNKTINNGKVR-GKQRYNCKSCGFNFVEVDERRGKNIDKQRMAIHLYLENMG 58 Query: 66 CRASARIMGVGLNTVLRHLKNSG 88 RA R++GV VL+ ++ +G Sbjct: 59 FRAIGRVLGVSNLAVLKWIRAAG 81 >UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Methanocaldococcus infernus ME RepID=C5U8R8_9EURY Length = 100 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. 
Identities = 22/91 (24%), Positives = 45/91 (49%), Gaps = 6/91 (6%) Query: 6 IRCPSCSATEGVVRNGKSTAG----HQRYLCSPCRKTWQLQFTYTASQPGKHQKIID-MA 60 IRC C+ ++ VV+ GK + Q YLC C++ + + +K++ + Sbjct: 5 IRCKYCN-SDKVVKAGKHKSEKYGVRQMYLCKKCKRRFVEESKAPRYSDSFKEKVVRSVV 63 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 G+G R + R+ + T+LR +K+ +++ Sbjct: 64 FEGLGIRQAGRVFKLSTTTILRWIKDFKKTK 94 >UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDS3_NEOSM Length = 134 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 3/79 (3%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C++ +++GK+ QRY C C + + A + + ++G+ Sbjct: 1 MHCPKCNSV-RFIKSGKAKE-KQRYKCLNCGCQFSRNEKHGAPLR-LKMHAVQLFLSGIS 57 Query: 66 CRASARIMGVGLNTVLRHL 84 + A+I V TV+R + Sbjct: 58 MNSIAKIFSVSPPTVMRWV 76 >UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitrosomonas europaea RepID=Q81ZP0_NITEU Length = 323 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 31/88 (35%), Gaps = 5/88 (5%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ---LQFTYTASQPGKHQKIID 58 +S CP C + R G AG QR+ C C+ T+ + Sbjct: 43 SSFEPICPVC-QSNHFYRWGY-QAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA 100 Query: 59 MAMNGVGCRASARIMGVGLNTVLRHLKN 86 + G+ RASAR + NT R Sbjct: 101 ALIEGLTVRASARQCRIDKNTSFRWRHR 128 >UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B4C9_9RHOB Length = 321 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. 
Identities = 27/84 (32%), Positives = 40/84 (47%), Gaps = 7/84 (8%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGK----HQKIIDMAMN 62 CP C A + R G++ AG QRY C C KT+ + + +Q K +Q + DM + Sbjct: 49 TCPHCGAVDR-QRWGRTRAGSQRYRCQGCLKTFNGRTGSSIAQLQKLDQFYQVLKDMFSD 107 Query: 63 GVG--CRASARIMGVGLNTVLRHL 84 G R AR + V +T+ R Sbjct: 108 GPPRSIRRLARQLDVNKDTIWRWR 131 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 49.6 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 1/90 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M RC C+ + ++ G ++ QRY C C+K + +++Y A Q + I + Sbjct: 1 MNKRRNRCIHCNYS-YCIKAGITSQNKQRYQCKKCKKKFIGKYSYRAYQKSTNHNIQQLI 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 GVG R +R++ V TVL+ + Sbjct: 60 KEGVGIRGISRLLNVSKTTVLKKILKIASK 89 >UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AFFE Length = 357 Score = 49.3 bits (116), Expect = 3e-05, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 5/78 (6%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ----LQFTYTASQPGKHQKIIDMAMNG 63 CP C + +NGK HQRY+C C K++ F ++ + I++ + Sbjct: 50 CPICGSV-HFKKNGKDKNRHQRYICLDCHKSFSDRTNTLFYWSHFTLDQWLHFIELELYK 108 Query: 64 VGCRASARIMGVGLNTVL 81 + A+++ T Sbjct: 109 MPLEGEAQVLETSKTTCF 126 >UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_VIBFM Length = 489 Score = 49.3 bits (116), Expect = 4e-05, Method: Composition-based stats. 
Identities = 18/92 (19%), Positives = 34/92 (36%), Gaps = 12/92 (13%) Query: 9 PSCSATE------GVVRNGK------STAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKI 56 PSC+ +E V+ + + + QRY C C T+ +++ + QK+ Sbjct: 79 PSCNNSECEHFGFDVLTHRELYHAFGYSGDRQRYRCKSCASTFVDKWSGENQKSLIQQKL 138 Query: 57 IDMAMNGVGCRASARIMGVGLNTVLRHLKNSG 88 + G R R + + T H+ Sbjct: 139 LGFLFTGYSVREICRRLHINPKTFYDHINQIA 170 >UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SD87_FERPL Length = 94 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 5/89 (5%) Query: 8 CPSCSATEGVVR---NGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAM--N 62 CP C + + V + KS QRY C C +T+ L + P + ++++ A+ Sbjct: 3 CPHCKSIKTVKMGCYHTKSGERRQRYKCKNCGRTFVLNPIKPRNYPEEFKEMVVKAVVRE 62 Query: 63 GVGCRASARIMGVGLNTVLRHLKNSGRSR 91 GVG R ++RI + NTV ++ + R Sbjct: 63 GVGVRQASRIFKLSPNTVTAWVREFSKKR 91 >UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W2G4_DYAFD Length = 388 Score = 48.9 bits (115), Expect = 6e-05, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 6/79 (7%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 I C C+ +G+++ G G QRYLC C + A + + ++ + Sbjct: 2 IECVKCAQVDGIMKAGYVR-GKQRYLCKWCNYYFT-----HAEKDDSIESLVKRKRHQTT 55 Query: 66 CRASARIMGVGLNTVLRHL 84 A+ +GV +TV R L Sbjct: 56 IIDIAKSLGVSNSTVSRAL 74 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 48.5 bits (114), Expect = 7e-05, Method: Composition-based stats. 
Identities = 24/80 (30%), Positives = 36/80 (45%), Gaps = 3/80 (3%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 C C G+V+NGK+ AG QR+LC C + +T+ ID ++G Sbjct: 7 PSCDMCG--HGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDIRHFK-IFIDWILSGES 63 Query: 66 CRASARIMGVGLNTVLRHLK 85 A+ +GV T+ R K Sbjct: 64 ADHLAKRLGVTRRTLTRWFK 83 >UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JSC1_YERPY Length = 53 Score = 48.1 bits (113), Expect = 8e-05, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 21/37 (56%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRK 37 MA I +CP C + V ++G +GHQRY C +K Sbjct: 1 MAKIDEKCPFCERKDLVKKHGYGKSGHQRYRCPHAKK 37 >UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD Length = 317 Score = 48.1 bits (113), Expect = 9e-05, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 31/89 (34%), Gaps = 5/89 (5%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKT---WQLQFTYTASQPGKHQKII 57 M + CP C ++E ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKN 86 + + R +A+ GV NT Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHR 124 >UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6ARX2_9BACT Length = 133 Score = 48.1 bits (113), Expect = 9e-05, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 36/88 (40%), Gaps = 7/88 (7%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTA----SQPGKHQKIID 58 S RCP C E V + G+ G QRY C CR+ + T T + K ++ Sbjct: 47 SEHPRCPHCQD-EHVAKWGRV-KGLQRYRCEACRRQFTP-LTNTPLSGLRKREKWGAYLE 103 Query: 59 MAMNGVGCRASARIMGVGLNTVLRHLKN 86 +G+ R +A+ +GV T Sbjct: 104 AMEDGLSVRKAAQRIGVNHKTTFLWRHR 131 >UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria RepID=B4WSN9_9SYNE Length = 83 Score = 47.7 bits (112), Expect = 1e-04, Method: Composition-based stats. 
Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 5/78 (6%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQK----IIDMAMNG 63 CP C ++GK++ G QRY C+ CR+T+ F + + I+ + G Sbjct: 3 CPFCDHPTP-HKHGKTSKGSQRYRCTACRRTFTETFDTLYDRRQVTSEQVKLILQTYVEG 61 Query: 64 VGCRASARIMGVGLNTVL 81 R +RI TV+ Sbjct: 62 SSLRGISRIGKRAYGTVV 79 >UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gammaproteobacteria RepID=A1SXI4_PSYIN Length = 319 Score = 47.3 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 32/85 (37%), Gaps = 7/85 (8%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW---QLQFTYTASQPGKHQKIIDMAM 61 S +CP C + GK+ + QRY C C KT+ + K + Sbjct: 52 SPQCPHCHCA-HFTKWGKAGS-VQRYKCFSCHKTFNNKTKTPLAKLHRCELWDKYAECMS 109 Query: 62 NGVGCRASARIMGVGLNT--VLRHL 84 + R +A + + L T + RH Sbjct: 110 LKLTLREAAAVCNINLKTSFLWRHR 134 >UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L9I6_MAGSM Length = 89 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 6/90 (6%) Query: 1 MAS--ISIRCPSCSATEGVVRNGKSTAGHQRYLCSP--CRKT-WQLQFTYTASQPGKHQK 55 MA+ + + CP C + + V++ GK G QR+ C+ C +T + + ++ Sbjct: 1 MATMEVHVHCPDCGSLD-VIKFGKDRHGRQRFRCNDHFCDRTIFMMDDPDWWRFEEVKKQ 59 Query: 56 IIDMAMNGVGCRASARIMGVGLNTVLRHLK 85 I ++G G +A +G+ V R K Sbjct: 60 IALHLLSGNGIHQTAHNLGLHPEFVNRMAK 89 >UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU Length = 507 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. 
Identities = 13/76 (17%), Positives = 31/76 (40%), Gaps = 1/76 (1%) Query: 14 TEGVVRNGKSTAG-HQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRASARI 72 T + + +G QRY C C+ T+ +++ + + ++ + G R R Sbjct: 113 THKHLYHAFGYSGDRQRYRCKSCQSTFVDKWSGANKKLQFQENLMGLLFTGYSVREICRK 172 Query: 73 MGVGLNTVLRHLKNSG 88 + + T H+++ Sbjct: 173 LAINPKTFYDHVEHIA 188 >UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelandii RepID=Q9AMR3_AZOVI Length = 214 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 31/89 (34%), Gaps = 5/89 (5%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKT---WQLQFTYTASQPGKHQKII 57 M + CP C ++E ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKN 86 + + R +A+ GV NT Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHR 124 >UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia RepID=B0K4X0_THEPX Length = 343 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 4/60 (6%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNG 63 + ++CP C+ T + GK G+Q+YLC C + P K K + G Sbjct: 5 VPLKCPKCNNTHLFYKYGKDKDGYQKYLCRKCYHQFAPD----KPSPKKTSKYPRCPVCG 60 >UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ99_FERNB Length = 261 Score = 46.2 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 26/48 (54%), Gaps = 1/48 (2%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTAS 48 M +I ++CP C ++ + +NG +Q + C C++ ++L FT Sbjct: 1 MTNIQLKCPHCGSSNFI-KNGHDKFKNQIFFCKDCKRYFKLSFTKKHK 47 >UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultured archaeon RepID=D1JAI8_9ARCH Length = 192 Score = 45.4 bits (106), Expect = 5e-04, Method: Composition-based stats. 
Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 4/71 (5%) Query: 20 NGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNGVGCRASARIMGV 75 GK Q C C K + + + G I + G G RA+ARIMG+ Sbjct: 36 YGKGEKRTQMLKCKVCGKRFSIHKGTPLFNLKADEGAFYGTIAHLVEGNGIRATARIMGI 95 Query: 76 GLNTVLRHLKN 86 +TV + LK Sbjct: 96 NKDTVSKWLKK 106 >UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_LACF3 Length = 428 Score = 45.0 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 20/99 (20%), Positives = 35/99 (35%), Gaps = 21/99 (21%) Query: 7 RCPSCSATEGVVRNGKSTA-----------------GHQRYLCSPCRKTWQLQFT----Y 45 RCP C + ++NG S QR C C+ ++ + Y Sbjct: 44 RCPHCGFADTFIKNGHSYQTIKYLSINESCPTMLRIDKQRLRCKNCQDSFMAKTNVVDKY 103 Query: 46 TASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 + K + M + V + ++ GV +T+ R L Sbjct: 104 CSIAKAVKHKALTMLESNVSQKDVSKFTGVSPSTIGRLL 142 >UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus communis RepID=B9TDK1_RICCO Length = 321 Score = 45.0 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 29/82 (35%), Gaps = 5/82 (6%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKT---WQLQFTYTASQPGKHQKIIDMAMNGV 64 CP C R G++ +G QR+ C C ++ + + + Sbjct: 52 CPHCGCARK-HRCGQA-SGLQRFRCLHCGRSHNALTKTPLARLRKKECWLPYLQCVLESR 109 Query: 65 GCRASARIMGVGLNTVLRHLKN 86 R +A+I+GV T R Sbjct: 110 TVRDAAQIVGVHRTTSFRWRHR 131 >UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZA4_TRIEI Length = 469 Score = 45.0 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 12/37 (32%), Positives = 22/37 (59%), Gaps = 2/37 (5%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ 42 ++CP+C +T + +NG+ QRY C C + + +Q Sbjct: 1 MKCPTCGST-SLRKNGR-PNNRQRYRCKDCGRQFMVQ 35 >UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ24_FERNB Length = 316 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. 
Identities = 13/66 (19%), Positives = 29/66 (43%), Gaps = 3/66 (4%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + ++ CP C +T + +NG G+Q++LC C +++ +++ + Sbjct: 1 MNNSTLSCPKCGST-SLYKNGHDKYGNQQFLCKLCHHSFK--LSHSQKRKNFPFPYPKCT 57 Query: 61 MNGVGC 66 G Sbjct: 58 SCGKSM 63 >UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C9BRL5_ENTFC Length = 433 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 30/101 (29%), Gaps = 23/101 (22%) Query: 7 RCPSCSATEG---VVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTA 47 RCP C +V+NGK + QRY C C + Sbjct: 46 RCPLCKQMNHEGMIVKNGKKKSLIQLNKCANQLTYLALAKQRYHCRGCHTYFTANTYIVD 105 Query: 48 SQ----PGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 KI++ A+ GV +TV R L Sbjct: 106 RNCFIAKQVRYKILEELTEKQAMTTIAKHCGVSWSTVSRTL 146 >UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5CBF Length = 184 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 37/106 (34%), Gaps = 21/106 (19%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAG----------------HQRYLCSPCRKTWQLQFT 44 + + CP C ++ V+NG T+ QR+LC C ++ L+ Sbjct: 42 LTKDTCACPHCH-SQTTVKNGFKTSKVRYLPFQNYPIIIALKKQRFLCKECHHSFTLETP 100 Query: 45 YTASQPGKHQKIIDMAMN----GVGCRASARIMGVGLNTVLRHLKN 86 Q + +N + A+ + + TV R LK Sbjct: 101 IVKKYASISQTLKLSVLNSLQENMSLSLIAKQHRISIPTVQRILKQ 146 >UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=Enterococcus faecium RepID=Q3Y3Y2_ENTFC Length = 401 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. 
Identities = 27/99 (27%), Positives = 41/99 (41%), Gaps = 21/99 (21%) Query: 7 RCPSCS-ATEGVVRNGK-------STAG---------HQRYLCSPCRKTWQ-LQFTYTAS 48 RCP C +T+ +V+NGK + +G QRYLC C+K + + T Sbjct: 46 RCPCCKDSTKQIVKNGKKISMILLNRSGNKRTYLRLKKQRYLCRACKKYFTARTYLVTPF 105 Query: 49 ---QPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 H KI++ +A + V + TV R L Sbjct: 106 CFISKQIHYKILEELTERQSIKAIGKHCDVSVTTVQRTL 144 >UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMX8_ACAM1 Length = 134 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 35/89 (39%), Gaps = 9/89 (10%) Query: 6 IRCPSCSATEGVVRNG----KSTAGHQRYLCSPCRKTW-QLQFTYTASQP---GKHQKII 57 + CP C +E +++ G + QRY C C + + + T A I Sbjct: 1 MECPYC-QSEKILKRGFDSLQDGTLVQRYQCKDCNRRFNERTGTPMARLRTASSVVSYAI 59 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKN 86 G+G R++ R G T++R K Sbjct: 60 KARTEGMGVRSAGRTFGKSHTTIMRWEKR 88 >UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacteriaceae RepID=A4W908_ENT38 Length = 414 Score = 43.9 bits (102), Expect = 0.001, Method: Composition-based stats. Identities = 11/33 (33%), Positives = 18/33 (54%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ 40 CP+C + ++RNG G QR+ C C ++ Sbjct: 68 CPTCGQGDALIRNGCGLRGAQRWRCRTCNSSFT 100 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 28/79 (35%), Gaps = 1/79 (1%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 +RC C ++ V +NG + Q + C C K W + + + V Sbjct: 1 MRCTHCG-SDLVKKNGYTRHEKQNFRCLECGKQWSENKEAKIINEQTKELVRKALLEKVS 59 Query: 66 CRASARIMGVGLNTVLRHL 84 RI V + +L + Sbjct: 60 LNGICRIFDVSMPWLLDFI 78 >UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MKY8_9DELT Length = 632 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 2/66 (3%) Query: 21 GKSTAGHQRYLCSPCRKTWQLQFT--YTASQPGKHQKIIDMAMNGVGCRASARIMGVGLN 78 G + AG QR+ C C KT+ + + GK ++ + V R AR VG Sbjct: 123 GHTKAGSQRFRCKICHKTFSIPLAANLRQRKKGKSTEVFRLLTCQVAIRKMARNARVGKE 182 Query: 79 TVLRHL 84 TV R++ Sbjct: 183 TVHRYI 188 >UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoides sp. BAV1 RepID=A5FST1_DEHSB Length = 319 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 38/96 (39%), Gaps = 17/96 (17%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKH---QKI---IDM 59 I C C + R G S A QR+LC+ C T+ T++QPG ++I + M Sbjct: 8 IECKYCG-SRHTRRYGHSRAQKQRWLCNDCCHTFVE----TSAQPGMRTPPEQIGAAVSM 62 Query: 60 AMNGVGCRASAR----IMGVGL--NTVLRHLKNSGR 89 G+ A R I + TV + + Sbjct: 63 FYEGLSLSAICRQMKQIHNISPSDGTVYGWITKYSK 98 >UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WF86_9ACTN Length = 243 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 27/86 (31%), Gaps = 5/86 (5%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW---QLQFTYTASQPGKH-QKIIDMA 60 + CP C + +GK+ G +RY C C + A P +I ++ Sbjct: 51 APVCPDCGSVRP-RLDGKAPNGARRYRCRECGCRFSALTGTIFADAKLPLHKIMRIAEVM 109 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKN 86 + R + V T Sbjct: 110 CHSASLRLMELVAEVSHGTAFLWRHK 135 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 1/42 (2%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ 42 M +I+CP C +E + + G +Q+Y C C + + Sbjct: 1 MNKTNIKCPRCH-SEKLYKFGFDKQANQKYQCKECGRQFAPD 41 >UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BSX6_9BIFI Length = 352 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. 
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 5/78 (6%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW----QLQFTYTASQPGKHQKIIDMAMNG 63 C C + ++R G+ G QR+ C C +T+ + + G + ++ ++ Sbjct: 55 CVRCGSI-RIIRKGRGRDGSQRWKCMNCNRTFGVRTNRVMGMSKLKAGVWMRFLECFVDC 113 Query: 64 VGCRASARIMGVGLNTVL 81 + R A+ GV L T Sbjct: 114 LSLRKCAQRCGVCLKTAF 131 >UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_ENTFA Length = 446 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 4/69 (5%) Query: 26 GHQRYLCSPCRKTWQLQFTYTASQ----PGKHQKIIDMAMNGVGCRASARIMGVGLNTVL 81 QR+ C C KT+ + + + + Q I+++ + AR+ + TV+ Sbjct: 85 NKQRFKCKHCGKTFLAEDSVSDRRCSIARRVKQAILELLSEPISMSLIARMKHISPTTVI 144 Query: 82 RHLKNSGRS 90 R L++ Sbjct: 145 RILRSLRPK 153 >UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 238 RepID=B5K5I7_9RHOB Length = 319 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 33/87 (37%), Gaps = 5/87 (5%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW---QLQFTYTASQPGKHQKIIDMAMNG 63 CP C+A V+R G+S G +RY C C KT+ + +G Sbjct: 50 NCPHCAAGGAVIR-GRS-NGLKRYFCKICSKTFNALTGTPLARLRHKDCWTEFAGSLSDG 107 Query: 64 VGCRASARIMGVGLNTVLRHLKNSGRS 90 + SA GV +T R R+ Sbjct: 108 DTVKTSAARCGVASSTAFRWRHRFLRA 134 >UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C348D8 Length = 467 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 3/87 (3%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M +I CPSC +TE + + G + G RY C C + L+ K+I+ Sbjct: 67 MKNIEKACPSCYSTENI-KYGTTAIGTVRYQCKNCNNVYSLKNLNKFDDVD--NKLIESL 123 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 + + + + + R L+N Sbjct: 124 LKNTKVSTIFKELKITPASFYRRLENI 150 >UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobacter sp. 
M18 RepID=C6MXF0_9DELT Length = 512 Score = 42.7 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 18/68 (26%), Positives = 30/68 (44%), Gaps = 2/68 (2%) Query: 19 RNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKH--QKIIDMAMNGVGCRASARIMGVG 76 R G++ AG +RY C C +T+ + TA Q H +KI +N + + Sbjct: 43 RFGETAAGARRYRCKLCSRTFSINGKPTARQRDTHKNKKIYMHLVNKSPFKRICEQAEIS 102 Query: 77 LNTVLRHL 84 T+ R + Sbjct: 103 PATLYRKI 110 >UniRef50_P04137 Uncharacterized protein in transposable element ISH50 n=11 Tax=Halobacteriaceae RepID=YIH50_HALSA Length = 294 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 37/90 (41%), Gaps = 7/90 (7%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAM 61 + CPSC E V+R G QRYLC C +T+ Q F ++A K + + Sbjct: 26 VYCPSC-RAESVIRYGSYRV-FQRYLCKDCDRTFNDQTGTVFEHSAVALRKWFLAVYTYI 83 Query: 62 N-GVGCRASARIMGVGLNTVLRHLKNSGRS 90 R + V TV R ++ R+ Sbjct: 84 RLNTSIRQLDAEIDVSYKTVYRRVQRFLRA 113 >UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZNT7_ACAM1 Length = 188 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 9/89 (10%) Query: 6 IRCPSCSATEGVVRNG----KSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAM 61 ++C C +E VV+NG K+ Q +LC C + + + ++ + I MA+ Sbjct: 1 MQCIHC-QSENVVKNGTKTLKTAQVVQYFLCKDCGRRFNERSGTPMARLRTPVETISMAI 59 Query: 62 N----GVGCRASARIMGVGLNTVLRHLKN 86 N G+G RA+ R++ N+++ K Sbjct: 60 NARTEGLGIRAAGRVLRKSPNSIILWEKR 88 >UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q8PSY9_METMA Length = 146 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. 
Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 4/79 (5%) Query: 17 VVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA----MNGVGCRASARI 72 +++ GK GHQRY C C K + + ++ I M + G R+ RI Sbjct: 26 IIKRGKYKTGHQRYYCKHCEKFFMDTIGTAIYRKHLSKEEIRMIYRLFLEKNGIRSIERI 85 Query: 73 MGVGLNTVLRHLKNSGRSR 91 G +T+ LK++ ++ Sbjct: 86 TGHHRDTISNLLKDTVKNE 104 >UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium jeikeium K411 RepID=Q4JT92_CORJK Length = 165 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 24/80 (30%), Gaps = 2/80 (2%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + CP C +NG ++ R+ C+ C ++ I A Sbjct: 1 MTTNRPSCPLCG--NNTKKNGTTSKSTTRWRCTHCGHSFTRNTQTHNKNTATMALFIQWA 58 Query: 61 MNGVGCRASARIMGVGLNTV 80 A GV T+ Sbjct: 59 TGTQSLTTFAAHHGVTRQTM 78 >UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae BGR1 RepID=C5A9A4_BURGB Length = 284 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 35/79 (44%), Gaps = 8/79 (10%) Query: 14 TEGVVRNGKSTAGH-----QRYLCSPCRKTW---QLQFTYTASQPGKHQKIIDMAMNGVG 65 + +NG H RY C C K + Q++ + +P + ++ MA++ VG Sbjct: 21 ADFYRKNGYRRTKHNGQPVPRYQCKACGKNFCATQVKPIHGQHRPDLNTQVFKMAVSRVG 80 Query: 66 CRASARIMGVGLNTVLRHL 84 R A ++ G T+ R + Sbjct: 81 IRRMATVLDCGRETIQRKI 99 >UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT4_9LACT Length = 426 Score = 41.9 bits (97), Expect = 0.007, Method: Composition-based stats. 
Identities = 18/101 (17%), Positives = 34/101 (33%), Gaps = 21/101 (20%) Query: 6 IRCPSCSATEGVVRNGKSTAGH----------------QRYLCSPCRKTWQLQFTYTASQ 49 CP C +++ V+++ QR++C CRKTW Sbjct: 45 TSCPYC-SSKNVIKHSPMEHKIRIPHLYGNKTLLELKVQRFICKDCRKTWVTDCPLVPKN 103 Query: 50 PGKHQ----KIIDMAMNGVGCRASARIMGVGLNTVLRHLKN 86 +I+ + A+++ + TV R +K Sbjct: 104 SNISYDLACQIMLYLKENFSRKTIAKLLSISDKTVERVMKK 144 >UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax=Enterococcus RepID=Q3Y1C3_ENTFC Length = 431 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 35/106 (33%), Gaps = 27/106 (25%) Query: 7 RCPSCSATEG-------VVRNGKSTA----------------GHQRYLCSPCRKTWQLQF 43 C +C +T VV+NGK QRY C CR W Q Sbjct: 44 TCRNCGSTVVDGNGKVIVVKNGKKETIVRFEQYNHMPLVMRLKKQRYTCKNCRTHWTTQS 103 Query: 44 TYTASQPGK----HQKIIDMAMNGVGCRASARIMGVGLNTVLRHLK 85 + + KI + V A+ V L TV+R LK Sbjct: 104 YFVQPRHSIANHVRYKIASLLTEKVSLSFIAKNCQVSLTTVIRTLK 149 >UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4F0_UNCMA Length = 141 Score = 41.2 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 36/89 (40%), Gaps = 6/89 (6%) Query: 7 RCPSCSATEG--VVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQK----IIDMA 60 C +EG VV+ G S AGHQ + C C + + ++ I + Sbjct: 17 SCEFYLKSEGSRVVKKGFSRAGHQVFQCRHCGRHFCETINTPMYGRRITREDVILIGKLL 76 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGR 89 G RA RI G +TV+R K+ R Sbjct: 77 NERNGIRAIERITGHHRDTVMRVAKDLAR 105 >UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobiales RepID=Q07NT9_RHOP5 Length = 577 Score = 41.2 bits (95), Expect = 0.010, Method: Composition-based stats. 
Identities = 18/85 (21%), Positives = 31/85 (36%), Gaps = 11/85 (12%) Query: 7 RCP--SCSATEGVV--------RNGKSTAGHQRYLCSPCRKTW-QLQFTYTASQPGKHQK 55 CP SC + R+G S G RY C CRKT+ + +++ Sbjct: 103 HCPDDSCENYNKLFDSHPKSYFRHGTSAIGAPRYRCKACRKTFSVRTGHSRHRKSHENKT 162 Query: 56 IIDMAMNGVGCRASARIMGVGLNTV 80 + + ++ V +I + V Sbjct: 163 VFQLLVSKVPITKIGQITDLSPAAV 187 >UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVZ4_9ACTO Length = 225 Score = 41.2 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 28/47 (59%), Gaps = 2/47 (4%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGK 52 ++CP+C+ + RNGK+++G QR+ C C ++ + +A + + Sbjct: 41 MKCPACNT--PLKRNGKTSSGSQRWRCKECGRSKVGKIDNSAKELNR 85 >UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracasei subsp. paracasei ATCC 25302 RepID=C2FEQ0_LACPA Length = 425 Score = 41.2 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 21/104 (20%), Positives = 30/104 (28%), Gaps = 20/104 (19%) Query: 7 RCPSCSATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTASQP 50 CP+C +VR G QR+ C CR +Q + Y + Sbjct: 48 HCPACGFASKLVRYGFERTCVLMPSYSYRPTYMKLSRQRFRCELCRSVFQSETDYVRPRS 107 Query: 51 GKHQKIIDM----AMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + M A + AR V TV R + Sbjct: 108 TISTPVRQMVLFEAFSNCSLTDIARRFHVADKTVQRIIDEEAAK 151 >UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7J3_CYAP7 Length = 354 Score = 40.8 bits (94), Expect = 0.014, Method: Composition-based stats. Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 2/37 (5%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ 40 I I+CP C ++ +NG + AG QRY C C + + Sbjct: 2 ILIQCPKC-KSKNYRKNG-TIAGKQRYQCKSCGRNFL 36 >UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia RepID=B2A0V7_NATTJ Length = 353 Score = 40.4 bits (93), Expect = 0.016, Method: Composition-based stats. 
Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 2/36 (5%) Query: 6 IRCPSCSA--TEGVVRNGKSTAGHQRYLCSPCRKTW 39 + CP C+ ++ + G GHQ+Y C C + Sbjct: 4 VVCPRCNNNCSDKFYKFGFDNHGHQKYQCQECFSQF 39 >UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WNK0_RHOS5 Length = 481 Score = 40.4 bits (93), Expect = 0.017, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 1/67 (1%) Query: 19 RNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKII-DMAMNGVGCRASARIMGVGL 77 R GK+ G R+ C C KT+ + + +++ DM N + +RI G+ Sbjct: 132 RFGKTKGGDARWRCKGCGKTFSVGKPARRHKRSDKNRLVLDMLCNDLSFAKMSRISGLAY 191 Query: 78 NTVLRHL 84 + R + Sbjct: 192 RDIYRRV 198 >UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepID=Q70JT0_MICAE Length = 112 Score = 40.4 bits (93), Expect = 0.017, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 18/48 (37%), Gaps = 1/48 (2%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQ 54 CPSC + ++NG G + C C + + + T P Sbjct: 34 TCPSCG-SHHTIKNGYLPKGKPKRHCQECGQPFVINPTNKTISPDTKT 80 >UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostocaceae RepID=B2J098_NOSP7 Length = 133 Score = 40.4 bits (93), Expect = 0.019, Method: Composition-based stats. Identities = 10/34 (29%), Positives = 20/34 (58%), Gaps = 1/34 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW 39 + CP C+ + + ++G+ G QRY+C C + + Sbjct: 34 MECPKCN-SHLLGKHGREPDGVQRYICKNCSRIF 66 >UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolobus islandicus RepID=D2PJ85_SULIS Length = 82 Score = 40.4 bits (93), Expect = 0.019, Method: Composition-based stats. Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 2/39 (5%) Query: 27 HQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 QRYLC C + + Y ++ + M NGV Sbjct: 5 RQRYLCRDCGRYFLGDAIY--HSRELREEALKMYSNGVS 41 >UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacterium sp. 
D12 RepID=UPI0001BC5E64 Length = 173 Score = 40.0 bits (92), Expect = 0.023, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 36/103 (34%), Gaps = 25/103 (24%) Query: 7 RCPSCSATEGVVRNGKSTAG----------------HQRYLCSPCRKT------WQLQFT 44 +CP C + ++RNG + QR+LC C KT + ++ Sbjct: 48 KCPFCGE-KHIIRNGTKLSKIKILDVSNTPSYLYLRKQRFLCKSCSKTFSASTNFVRKYC 106 Query: 45 YTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 A I + N + + A+ V +TV R L Sbjct: 107 NIADS--IKLSIALESKNIISEKDIAKRFRVSSSTVKRSLLQY 147 >UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WST6_9SYNE Length = 81 Score = 39.6 bits (91), Expect = 0.027, Method: Composition-based stats. Identities = 14/71 (19%), Positives = 26/71 (36%), Gaps = 1/71 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M P+C ++ VV+NGK G Q + C C + + I D+ Sbjct: 1 MLDHQPTRPACH-SKQVVKNGKIHNGKQNHRCKNCGRQFVKDPQQKRISDATKALIDDLL 59 Query: 61 MNGVGCRASAR 71 + + ++ Sbjct: 60 LERLSMNNPSK 70 >UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VPB4_9FLAO Length = 343 Score = 39.6 bits (91), Expect = 0.030, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 27/86 (31%), Gaps = 5/86 (5%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW---QLQFTYTASQPGKHQKIIDMAMNGV 64 CP C E VR G G QRY C C +++ + + + + + Sbjct: 51 CPHCLH-EKYVRFG-VDKGSQRYKCKSCNRSFTEYTGTWMAGLQRKDMISSYLSLMVQEK 108 Query: 65 GCRASARIMGVGLNTVLRHLKNSGRS 90 + +G+ T S Sbjct: 109 SLDKISSELGINKKTAFDWRHKILAS 134 >UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMW7_ACAM1 Length = 75 Score = 39.6 bits (91), Expect = 0.031, Method: Composition-based stats. 
Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIID 58 M+ + ++ P C + + GK++ G QRY C C++T+ F + ++I Sbjct: 1 MSYLLMQSPLCDHP-KIHKPGKTSKGSQRYRCLDCQQTFSETFDTLYYRLQISSEMIQ 57 >UniRef50_B0V2Z3 Novel zinc finger protein (Fragment) n=2 Tax=Danio rerio RepID=B0V2Z3_DANRE Length = 1404 Score = 39.6 bits (91), Expect = 0.031, Method: Composition-based stats. Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 11/63 (17%) Query: 5 SIRCPSCS----ATEGVVRNGKSTA----GHQRYLCSPCRKTWQLQFT---YTASQPGKH 53 I CP C +++ + R+ ++ A QR+ CS CR+T+ F+ + + Sbjct: 121 EIACPRCERRFTSSQDLDRHIQTHALSTYHTQRFKCSRCRRTFSTLFSRRRHEKRHENGN 180 Query: 54 QKI 56 +KI Sbjct: 181 KKI 183 >UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacteria RepID=Q5LYW0_STRT1 Length = 448 Score = 39.2 bits (90), Expect = 0.043, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 34/103 (33%), Gaps = 19/103 (18%) Query: 1 MASISIRCPSCSAT---EGVVRNGKSTAGHQ------------RYLCSPCRKTWQLQFTY 45 + +++ CP C +N K + Q R+ C CR+ + + Sbjct: 15 LITLAPSCPHCQGKMIKYDFQKNSKISLLEQAGTPTLLRLKKRRFQCKSCRRVTVAETSI 74 Query: 46 TASQPGK----HQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 QK+ + V AR + V +TV R L Sbjct: 75 VEKNCQISNLVRQKVTQLLTEKVSLTDIARRLRVSTSTVYRKL 117 >UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderiaceae RepID=B5S3H3_RALSO Length = 460 Score = 39.2 bits (90), Expect = 0.043, Method: Composition-based stats. 
Identities = 14/94 (14%), Positives = 33/94 (35%), Gaps = 9/94 (9%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKH---QKIID 58 ++ + CP C + + +G + C C + P + + Sbjct: 345 STHAASCPWCGSDQTKYHPAPRPSGLPGFRCRACLAYFTRVSNTPLVHPMARAYASRFVP 404 Query: 59 MA---MNGVGCRASARIMGVGLNTVLRHLKNSGR 89 M G G +AR +G+ + T+ +++ + Sbjct: 405 MLGWHETGAG---AARELGIAMGTLHTWVRSWRQ 435 >UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PWW0_METMA Length = 155 Score = 38.9 bits (89), Expect = 0.052, Method: Composition-based stats. Identities = 21/100 (21%), Positives = 40/100 (40%), Gaps = 14/100 (14%) Query: 5 SIRCPS--CS-----ATEGVVRNGKSTAGHQR---YLCSPCRKTW---QLQFTYTASQPG 51 + CP+ C E ++ NG ++R Y+C C + + F + Sbjct: 12 DVFCPNKDCKLYGITGKENIIGNGTYEIKNKRVRKYICRECGRVFNDRTGTFFDNVRKDE 71 Query: 52 KHQKI-IDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 K+ I MA+ G+ +A + ++ V TV L + + Sbjct: 72 SDIKLAIKMAIKGMSIQAISDVLEVQPATVSNWLFRAAKQ 111 >UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C34261 Length = 387 Score = 38.9 bits (89), Expect = 0.058, Method: Composition-based stats. Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 1/33 (3%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ 40 CP C E + R G++ G QR C C+K W Sbjct: 74 CPDCYQRETI-RYGRNPQGSQRVQCRACKKVWT 105 >UniRef50_A5DPJ1 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DPJ1_PICGU Length = 778 Score = 38.5 bits (88), Expect = 0.071, Method: Composition-based stats. 
Identities = 13/68 (19%), Positives = 25/68 (36%), Gaps = 6/68 (8%) Query: 9 PSCSAT----EGVVRNGKSTAGHQRYLCS--PCRKTWQLQFTYTASQPGKHQKIIDMAMN 62 P+C T + + R+ + R+ CS C KT+ Y +H+ + + + Sbjct: 22 PNCRKTFSRPDRLARHRLNHETVPRHRCSWPDCGKTFVRNDVYKKHYRRQHENKTEPSQD 81 Query: 63 GVGCRASA 70 A Sbjct: 82 SYKITKPA 89 >UniRef50_UPI0001793699 PREDICTED: similar to zinc-finger homeodomain protein 1, partial n=1 Tax=Acyrthosiphon pisum RepID=UPI0001793699 Length = 1011 Score = 38.5 bits (88), Expect = 0.073, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 32/69 (46%), Gaps = 4/69 (5%) Query: 7 RCPSCSAT----EGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMN 62 +CP C + + + +G + + CS C K + +Y++ K II++ M Sbjct: 246 KCPECEKAFKFKHHLKEHIRIHSGEKPFECSNCGKRFSHSGSYSSHMTSKKCLIINLKMG 305 Query: 63 GVGCRASAR 71 G G +A+ R Sbjct: 306 GRGSQANNR 314 >UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FRB5_METHJ Length = 138 Score = 38.1 bits (87), Expect = 0.079, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 37/85 (43%), Gaps = 6/85 (7%) Query: 5 SIRCPSCSATEG--VVRNGKSTAGHQRYLCSPCRKTWQLQFT---YTASQPGKHQKII-D 58 + C +G + +NG ++AG+Q+Y C CR+ + Y + P II Sbjct: 14 NPDCTYFQIEDGKNITKNGHNSAGNQQYYCHHCRRFFIETKNTPLYDSRLPRTAVLIIAK 73 Query: 59 MAMNGVGCRASARIMGVGLNTVLRH 83 + R +R+ G +T+ R+ Sbjct: 74 HSTEKTSIRGVSRVTGHHRDTISRY 98 >UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJK4_ACIJU Length = 460 Score = 38.1 bits (87), Expect = 0.080, Method: Composition-based stats. 
Identities = 17/99 (17%), Positives = 35/99 (35%), Gaps = 21/99 (21%) Query: 7 RCPSCSATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTASQP 50 +CP C ++ + ++G +RY C C+ T+ + T Sbjct: 33 KCPKCG-SDQLYKHGTKPVIYRDIPRHMKPTVINVEVKRYRCKSCKATFLQEVTGIYPDT 91 Query: 51 GKHQKIIDMAMN---GVGCRASARIMGVGLNTVLRHLKN 86 ++ + + +AR+MG T+ R + N Sbjct: 92 RMTERFVKKIQDICLDYTFSDTARMMGCDSKTI-RTITN 129 >UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CFC8 Length = 262 Score = 38.1 bits (87), Expect = 0.082, Method: Composition-based stats. Identities = 10/34 (29%), Positives = 15/34 (44%), Gaps = 1/34 (2%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ 40 +C C + + G G QRY+C C K + Sbjct: 97 QCLFCG-SHDFTKYGHKKDGTQRYICKGCGKRFT 129 >UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_FUSNN Length = 428 Score = 38.1 bits (87), Expect = 0.092, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 35/100 (35%), Gaps = 21/100 (21%) Query: 7 RCPSCSATEGVVRNGKSTAGH----------------QRYLCSPCRKTWQLQFT----YT 46 CP C +++ +V+NG QRY+C C+KT+ + Sbjct: 52 TCPHC-SSKNIVKNGSRHRKIKYIPIQNHNIELELTVQRYICKDCKKTFSPSTNIVSDNS 110 Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKN 86 + I + + A+ + + +V R + N Sbjct: 111 SISNNLKYAIALELQKNISLTSIAKRYNISIPSVQRIMDN 150 >UniRef50_Q9H5H4 Zinc finger protein 768 n=9 Tax=Theria RepID=ZN768_HUMAN Length = 540 Score = 38.1 bits (87), Expect = 0.096, Method: Composition-based stats. 
Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 11/78 (14%) Query: 7 RCPSCSA----TEGVVRNGKSTAGHQRYLCSPCRKTW-------QLQFTYTASQPGKHQK 55 +CP C + ++R+ ++ +G + Y C C K + + Q T++ +P + Sbjct: 318 KCPRCGKAFADSSYLLRHQRTHSGQKPYKCPHCGKAFGDSSYLLRHQRTHSHERPYSCTE 377 Query: 56 IIDMAMNGVGCRASARIM 73 R+ R+ Sbjct: 378 CGKCYSQNSSLRSHQRVH 395 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo... 109 4e-23 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 107 9e-23 UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax... 105 5e-22 UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling... 101 5e-21 UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter... 101 7e-21 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 100 1e-20 UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale... 98 1e-19 UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S... 97 1e-19 UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr... 97 2e-19 UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin... 96 3e-19 UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria... 94 1e-18 UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea... 92 5e-18 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 92 6e-18 UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp... 92 7e-18 UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano... 92 8e-18 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 91 1e-17 UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriale... 90 2e-17 UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI... 88 6e-17 UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillu... 
88 7e-17 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 88 1e-16 UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ... 88 1e-16 UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp.... 87 2e-16 UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID... 86 3e-16 UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ... 86 4e-16 UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru... 86 4e-16 UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyri... 85 7e-16 UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD 85 9e-16 UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitroso... 83 2e-15 UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. ... 83 2e-15 UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 R... 83 2e-15 UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyric... 83 2e-15 UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellula... 83 4e-15 UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes ... 82 4e-15 UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelan... 82 4e-15 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 82 6e-15 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 82 6e-15 UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani ... 82 6e-15 UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter... 82 7e-15 UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum l... 81 9e-15 UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid... 80 1e-14 UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 80 3e-14 UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax... 80 3e-14 UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae... 80 3e-14 UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavoba... 80 3e-14 UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 
79 5e-14 UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE 78 8e-14 UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorick... 77 1e-13 UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus... 77 1e-13 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 77 2e-13 UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gamm... 76 3e-13 UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryoc... 75 6e-13 UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl... 75 8e-13 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 75 8e-13 UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=H... 74 1e-12 UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=enviro... 74 1e-12 UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidob... 74 1e-12 UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodoba... 74 2e-12 UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=D... 73 2e-12 UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus ... 73 2e-12 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 73 2e-12 UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=... 73 4e-12 UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcti... 72 4e-12 UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU 72 6e-12 UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoi... 71 8e-12 UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p... 71 9e-12 UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus pl... 71 1e-11 UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_V... 71 1e-11 UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_... 70 2e-11 UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon... 69 4e-11 UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marin... 69 5e-11 UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Met... 
68 8e-11 UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur... 68 1e-10 UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI... 67 2e-10 UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobact... 67 2e-10 UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX 66 2e-10 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 66 2e-10 UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12... 64 1e-09 UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia ... 64 1e-09 UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C... 63 2e-09 UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria ... 63 2e-09 UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=... 61 7e-09 UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl... 60 2e-08 UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nod... 60 2e-08 UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultu... 60 2e-08 UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q... 59 4e-08 UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervido... 56 3e-07 UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichod... 56 3e-07 UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw... 56 3e-07 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 55 1e-06 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 53 2e-06 UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacte... 53 4e-06 UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae... 52 6e-06 UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseud... 49 6e-05 Sequences not found previously or not previously below threshold: UniRef50_C1I4B6 Putative uncharacterized protein n=2 Tax=Clostri... 78 1e-13 UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 2... 70 2e-11 UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychro... 
68 1e-10 UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryoc... 63 3e-09 UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus p... 60 3e-08 UniRef50_P04137 Uncharacterized protein in transposable element ... 58 8e-08 UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax... 58 1e-07 UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultu... 57 2e-07 UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=... 56 4e-07 UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium ... 56 4e-07 UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q... 56 4e-07 UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_E... 55 5e-07 UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synecho... 55 6e-07 UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacte... 55 6e-07 UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_... 55 6e-07 UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobact... 55 6e-07 UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracas... 54 2e-06 UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia R... 53 2e-06 UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobi... 53 2e-06 UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methano... 53 2e-06 UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax... 53 2e-06 UniRef50_A1VN28 Insertion element protein n=1 Tax=Polaromonas na... 53 3e-06 UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida... 53 3e-06 UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2... 52 5e-06 UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros... 52 6e-06 UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae... 52 7e-06 UniRef50_C7N1Y2 Putative uncharacterized protein n=1 Tax=Slackia... 51 8e-06 UniRef50_Q4JSN3 Transposase for IS3507b n=53 Tax=Actinobacterida... 
51 1e-05 UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodoba... 51 1e-05 UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellula... 50 2e-05 UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacteriu... 50 2e-05 UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax... 50 2e-05 UniRef50_Q7NH53 TetR family transcriptional regulatory protein n... 50 3e-05 UniRef50_B4WUH8 Putative uncharacterized protein n=1 Tax=Synecho... 50 3e-05 UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus ... 49 4e-05 UniRef50_B4VTL4 Putative uncharacterized protein n=1 Tax=Microco... 49 5e-05 UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryoc... 49 5e-05 UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methano... 48 6e-05 UniRef50_C9BRL4 Transposase n=30 Tax=Enterococcus RepID=C9BRL4_E... 48 6e-05 UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepI... 48 7e-05 UniRef50_D1QQX4 Putative uncharacterized protein n=15 Tax=Prevot... 48 8e-05 UniRef50_C0WLQ9 Transposase n=3 Tax=Lactobacillus RepID=C0WLQ9_L... 48 8e-05 UniRef50_Q03NU3 Transposase n=12 Tax=Lactobacillus RepID=Q03NU3_... 48 8e-05 UniRef50_C1DPZ8 Transposase n=4 Tax=Bacteria RepID=C1DPZ8_AZOVD 48 8e-05 UniRef50_Q2RQJ8 Putative uncharacterized protein n=1 Tax=Rhodosp... 48 1e-04 UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderi... 48 1e-04 UniRef50_B2J7N9 Putative uncharacterized protein n=1 Tax=Nostoc ... 48 1e-04 UniRef50_C5S2C5 Putative transposase n=1 Tax=Actinobacillus mino... 48 1e-04 UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanoth... 48 1e-04 UniRef50_A7JMB8 Predicted protein n=8 Tax=Francisella RepID=A7JM... 47 1e-04 UniRef50_Q9V1K2 Putative uncharacterized protein n=2 Tax=Pyrococ... 47 2e-04 UniRef50_Q93CQ1 Transposase TnpA n=1 Tax=Enterococcus faecium Re... 47 2e-04 UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT... 47 2e-04 UniRef50_Q035C5 Transposase n=27 Tax=Lactobacillales RepID=Q035C... 
47 2e-04 UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_M... 47 2e-04 UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichod... 47 2e-04 UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostoca... 47 2e-04 UniRef50_Q6MK35 Putative transposase n=1 Tax=Bdellovibrio bacter... 46 3e-04 UniRef50_C8SCF8 Putative uncharacterized protein n=1 Tax=Ferrogl... 46 3e-04 UniRef50_B0CG58 Transcriptional regulator, TetR family n=1 Tax=A... 46 3e-04 UniRef50_D1PSS1 Insertion element protein (Fragment) n=14 Tax=Pr... 46 3e-04 UniRef50_Q7VL05 Possible transposase n=4 Tax=Pasteurellaceae Rep... 46 4e-04 UniRef50_Q3Y3Y3 Transposase, IS204/IS1001/IS1096/IS1165 n=11 Tax... 46 4e-04 UniRef50_Q2J1M8 Putative uncharacterized protein n=1 Tax=Rhodops... 46 4e-04 UniRef50_A5FLG0 Putative uncharacterized protein n=1 Tax=Flavoba... 45 6e-04 UniRef50_B9Y9S5 Putative uncharacterized protein (Fragment) n=1 ... 45 6e-04 UniRef50_Q03IY7 Transposase n=198 Tax=Lactobacillales RepID=Q03I... 45 6e-04 UniRef50_C7RJT2 Conserved possible transposase n=21 Tax=Proteoba... 45 7e-04 UniRef50_Q7N9S9 Transposase TnpA, ISL3 family n=1 Tax=Photorhabd... 45 0.001 UniRef50_D1UAU0 Transposase, putative n=1 Tax=Desulfovibrio aesp... 45 0.001 UniRef50_B3GXU2 Transposase n=15 Tax=Pasteurellaceae RepID=B3GXU... 45 0.001 UniRef50_Q11ZU0 Putative uncharacterized protein n=1 Tax=Polarom... 44 0.001 UniRef50_C6HZQ4 Transposase n=2 Tax=Leptospirillum ferrodiazotro... 44 0.001 UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family pr... 44 0.001 UniRef50_C7P9K3 Transcriptional regulator, ArsR family n=2 Tax=M... 44 0.001 UniRef50_D0MDA7 Transposase-like protein n=7 Tax=Bacteria RepID=... 44 0.001 UniRef50_B7C761 Putative uncharacterized protein n=1 Tax=Eubacte... 44 0.001 UniRef50_C2H217 Possible transposase n=5 Tax=Enterococcaceae Rep... 44 0.001 UniRef50_UPI00016C448A hypothetical protein GobsU_12575 n=6 Tax=... 
44 0.002 UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolo... 44 0.002 UniRef50_UPI0001C31088 transcriptional regulator, TetR family n=... 44 0.002 UniRef50_Q8U293 Transposase n=53 Tax=Pyrococcus RepID=Q8U293_PYRFU 44 0.002 UniRef50_D0U1S9 Transposase n=1 Tax=Enterococcus faecium RepID=D... 44 0.002 UniRef50_A2V378 Putative uncharacterized protein n=1 Tax=Shewane... 44 0.002 UniRef50_A9BGL8 Transposase IS204/IS1001/IS1096/IS1165 family pr... 44 0.002 UniRef50_C6QEP3 ISSpo8, transposase n=4 Tax=Alphaproteobacteria ... 44 0.002 UniRef50_A8UDH0 Transposase n=5 Tax=Bacteria RepID=A8UDH0_9LACT 44 0.002 UniRef50_C1F2K1 Unclassified family transposase n=1 Tax=Acidobac... 43 0.002 UniRef50_Q5LW63 ISSpo8, transposase n=4 Tax=Rhodobacterales RepI... 43 0.002 UniRef50_Q1GHU2 Putative uncharacterized protein n=1 Tax=Ruegeri... 43 0.002 UniRef50_B1IC92 Transposase n=24 Tax=Lactobacillales RepID=B1IC9... 43 0.002 UniRef50_C0WEV9 Transposase (Fragment) n=1 Tax=Acidaminococcus s... 43 0.003 UniRef50_C5RB59 Possible transposase n=1 Tax=Weissella paramesen... 43 0.003 UniRef50_B8F7V2 ISRssp2, family IS1595 n=4 Tax=Pasteurellaceae R... 43 0.003 UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synecho... 43 0.003 UniRef50_C6HVY3 Probable transposase n=1 Tax=Leptospirillum ferr... 43 0.003 UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parviba... 43 0.003 UniRef50_C3MUP9 Resolvase helix-turn-helix domain protein n=40 T... 43 0.003 UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candida... 43 0.003 UniRef50_C9CRL2 Transposase n=3 Tax=Alphaproteobacteria RepID=C9... 43 0.003 UniRef50_Q8R819 Transposase n=2 Tax=Thermoanaerobacter tengconge... 43 0.003 UniRef50_B9JNY3 Transposase n=4 Tax=Alphaproteobacteria RepID=B9... 43 0.004 UniRef50_A7HMZ5 Transposase IS204/IS1001/IS1096/IS1165 family pr... 43 0.004 UniRef50_D2M0Z3 Two component transcriptional regulator, LuxR fa... 43 0.004 UniRef50_UPI00016C46F4 hypothetical protein GobsU_15563 n=2 Tax=... 
43 0.004 UniRef50_UPI0001793827 PREDICTED: similar to CG5669 CG5669-PA n=... 42 0.005 UniRef50_Q87RY6 Putative resolvase n=3 Tax=Vibrio parahaemolytic... 42 0.005 UniRef50_B9ZCS9 DNA topoisomerase type IA zn finger domain prote... 42 0.005 UniRef50_B2JMI5 Transposase n=2 Tax=Burkholderia RepID=B2JMI5_BURP8 42 0.005 UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH... 42 0.005 UniRef50_C9RDH8 Regulatory protein LacI n=1 Tax=Ammonifex degens... 42 0.005 UniRef50_B2TB85 Transposase IS3/IS911 family protein n=2 Tax=Bur... 42 0.006 UniRef50_B9JG85 Putative uncharacterized protein n=1 Tax=Agrobac... 42 0.006 UniRef50_A5VLK7 Transposase, IS204/IS1001/IS1096/IS1165 family p... 42 0.006 UniRef50_D2MKS9 ISXo5 transposase n=1 Tax=Candidatus Poribacteri... 42 0.006 UniRef50_Q5ZT03 Transposase (IS652) n=29 Tax=Gammaproteobacteria... 42 0.006 UniRef50_UPI000186E028 transcription factor Sp4, putative n=1 Ta... 42 0.006 UniRef50_Q9SVC5 Dof zinc finger protein DOF3.5 n=2 Tax=Arabidops... 42 0.006 UniRef50_C7XW38 Transposase ISLasa4v n=4 Tax=Lactobacillus RepID... 42 0.006 UniRef50_D2LK53 Putative uncharacterized protein n=1 Tax=Rhodomi... 42 0.007 UniRef50_C4W7G8 Transposase for ISSha1 n=2 Tax=Staphylococcus Re... 42 0.007 UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiat... 42 0.008 UniRef50_B8FWC8 Putative uncharacterized protein n=1 Tax=Desulfi... 41 0.008 UniRef50_A9IG79 ISSod11, transposase n=14 Tax=Proteobacteria Rep... 41 0.009 UniRef50_A8YX76 Transposase n=42 Tax=Lactobacillus RepID=A8YX76_... 41 0.009 UniRef50_A0Q207 Transcriptional regulator n=3 Tax=Clostridium Re... 41 0.009 UniRef50_Q894I5 Phage-related protein n=1 Tax=Clostridium tetani... 41 0.010 UniRef50_B2JXE0 Putative uncharacterized protein n=2 Tax=Burkhol... 41 0.010 UniRef50_C7PCU2 Two component transcriptional regulator, LuxR fa... 41 0.010 UniRef50_A7BQK2 Transposase n=3 Tax=Bacteria RepID=A7BQK2_9GAMM 41 0.010 UniRef50_Q3C030 Putative sigma-54-dependent transcriptional regu... 
41 0.011 UniRef50_C7YZZ3 Putative uncharacterized protein n=1 Tax=Nectria... 41 0.011 UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=... 41 0.011 UniRef50_B8HUB6 Putative uncharacterized protein n=1 Tax=Cyanoth... 41 0.011 UniRef50_Q6K537 Os02g0252400 protein n=6 Tax=Oryza sativa RepID=... 41 0.011 UniRef50_B2UM39 Putative uncharacterized protein n=1 Tax=Akkerma... 41 0.011 UniRef50_B8F7J2 Putative uncharacterized protein n=1 Tax=Haemoph... 41 0.012 UniRef50_A2A935 PR domain zinc finger protein 16 n=35 Tax=Eutele... 41 0.012 UniRef50_A7UZI1 Putative uncharacterized protein n=1 Tax=Bactero... 41 0.012 UniRef50_A3VEU0 ISSpo8, transposase n=1 Tax=Rhodobacterales bact... 41 0.012 UniRef50_Q5NZ47 Putative uncharacterized protein n=1 Tax=Aromato... 41 0.013 UniRef50_B5VVQ8 KWG Leptospira repeat protein n=8 Tax=Arthrospir... 41 0.013 UniRef50_Q9HAZ2 PR domain zinc finger protein 16 n=26 Tax=Eutele... 41 0.013 UniRef50_Q3D1N8 Transposase, ISL3 family n=13 Tax=Bacilli RepID=... 41 0.014 UniRef50_A3DPW4 Putative uncharacterized protein n=1 Tax=Staphyl... 41 0.014 UniRef50_C0W2A4 Transposase (Fragment) n=1 Tax=Actinomyces coleo... 41 0.015 UniRef50_C6P8Q1 Transposase IS3/IS911 family protein n=1 Tax=The... 41 0.015 UniRef50_A6Q3M3 Transposase n=1 Tax=Nitratiruptor sp. SB155-2 Re... 41 0.016 UniRef50_A5KRX5 ISSpo8, transposase n=2 Tax=candidate division T... 41 0.016 UniRef50_A8KYE3 Two component transcriptional regulator, LuxR fa... 41 0.016 UniRef50_Q3QZA8 Putative uncharacterized protein n=1 Tax=Xylella... 41 0.016 UniRef50_C3XYB0 Putative uncharacterized protein n=2 Tax=Chordat... 41 0.016 UniRef50_B2KBW2 Two component transcriptional regulator, LuxR fa... 41 0.017 UniRef50_UPI00015B4C26 PREDICTED: similar to HAMLET n=1 Tax=Naso... 41 0.017 UniRef50_Q2RLR5 Integrase, catalytic region n=5 Tax=Clostridia R... 40 0.017 UniRef50_D1W685 Putative uncharacterized protein n=2 Tax=Prevote... 
40 0.017 UniRef50_B6ARX3 Transposase n=15 Tax=Bacteria RepID=B6ARX3_9BACT 40 0.017 >UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX Length = 91 Score = 109 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 83/91 (91%), Positives = 87/91 (95%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MAS+SI CPSCSAT+GVVRNGKSTAGHQRYLCS CRKTWQLQFTYTASQPG HQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 MNGVGCRA+ARIMGVGLNT+ RHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 107 bits (268), Expect = 9e-23, Method: Composition-based stats. Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 1/91 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA ++I CP C + + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 NG G R +AR + +G NTV+R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 105 bits (262), Expect = 5e-22, Method: Composition-based stats. Identities = 41/91 (45%), Positives = 61/91 (67%), Gaps = 1/91 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MAS++I CP C + + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 NG G R +AR + +G+NTV+R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQSE 90 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 101 bits (253), Expect = 5e-21, Method: Composition-based stats. 
Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M ++ C T+ + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 +NG G R +AR++GV NTV K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 101 bits (252), Expect = 7e-21, Method: Composition-based stats. Identities = 39/90 (43%), Positives = 58/90 (64%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA + ++CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 MN G R +AR + + +N V+R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 2/87 (2%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGV 64 + CPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSCG-SHHVVKCGR-PLGRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRASARIMGVGLNTVLRHLKNSGRSR 91 RA +R++ V L TV +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 97.7 bits (242), Expect = 1e-19, Method: Composition-based stats. 
Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 2/80 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C + VV+NG G Q YLC C + ++ + + M++NG+G Sbjct: 1 MDCPYC-QSHKVVKNGH-RQGKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMG 58 Query: 66 CRASARIMGVGLNTVLRHLK 85 RA R+ G+ NT+L ++ Sbjct: 59 FRAIERVTGISHNTILNWVR 78 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 97.3 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 36/84 (42%), Gaps = 5/84 (5%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQF----TYTASQPGKHQKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + + T ++ ID MNG Sbjct: 52 CPLCGCI-HVVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRASARIMGVGLNTVLRHLKNS 87 + R +A G+ NT Sbjct: 111 LSIRKTAVACGIHRNTAFLWRHKI 134 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 96.5 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 40/91 (43%), Positives = 59/91 (64%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA+I ++C C+ TE V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 MN G R +A ++ V NTVL LKNS + + Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNSRQGK 91 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 96.1 bits (238), Expect = 3e-19, Method: Composition-based stats. 
Identities = 41/91 (45%), Positives = 53/91 (58%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M I CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M Sbjct: 1 MKMGDIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMM 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 +G R AR +GV L TVLRHLK+ ++ Sbjct: 61 NDGSEQRDIARKLGVSLETVLRHLKDLRLNK 91 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 94.2 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 27/85 (31%), Positives = 38/85 (44%), Gaps = 6/85 (7%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMN 62 CP C+ ++ V+NGK+ HQRY+C C KT+ T G K ID +N Sbjct: 47 HCPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCLVN 104 Query: 63 GVGCRASARIMGVGLNTVLRHLKNS 87 R +A+I G+ L T Sbjct: 105 KYPLRKTAKICGISLPTAFVWRHKI 129 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 92.3 bits (228), Expect = 5e-18, Method: Composition-based stats. Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 3/89 (3%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M+S++I CP C + + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MN--GVGCRASARIMGVGLNTVLRHLKNS 87 N G+ AR+ G+ + + K Sbjct: 60 FNEPGMMLARMARLHGIQPCQLFKWKKQY 88 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 91.9 bits (227), Expect = 6e-18, Method: Composition-based stats. 
Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 3/86 (3%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQF-TYTASQPGKHQKIIDMAMNGV 64 ++CP C ATE + +NGK G Q ++C+ C + + + Q+ ++M +NG+ Sbjct: 1 MQCPYCGATE-IRKNGK-RRGKQNHICTKCERQFIDVYDPPKGYSEELKQECLEMYLNGM 58 Query: 65 GCRASARIMGVGLNTVLRHLKNSGRS 90 G R R+ GV T++ +K G Sbjct: 59 GFRPIERVKGVHHTTIIFWVKQMGEK 84 >UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JAS9_9ALTE Length = 181 Score = 91.5 bits (226), Expect = 7e-18, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 4/87 (4%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ---LQFTYTASQPGKHQKIIDMAMNG 63 +CP C ++ +R G S QRY C C KT+ Y + + ++ G Sbjct: 56 QCPYC-QSKTFIRWGSSENERQRYRCKRCAKTFNALVGSPLYRMRKEELWLEYVETMRYG 114 Query: 64 VGCRASARIMGVGLNTVLRHLKNSGRS 90 + R +A++ GV L T R S Sbjct: 115 LSLRKAAKVTGVSLRTAFRWRHAFLSS 141 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 91.5 bits (226), Expect = 8e-18, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 43/84 (51%), Gaps = 2/84 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C+++ +NG G Q Y C C + ++ TAS P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYLEGLG 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGR 89 R+ R +GV +V + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 90.7 bits (224), Expect = 1e-17, Method: Composition-based stats. 
Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 2/84 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C+++ +NG G QRY C C + ++ T+ P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYLEGLG 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGR 89 R+ R +GV +V + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriales RepID=Q116V8_TRIEI Length = 108 Score = 90.0 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 2/81 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C + +V+NG G Q YLC C + ++ + + M ++G+G Sbjct: 1 MHCPYC-QSHKIVKNGH-RNGKQSYLCRKCGRQFRENPCPIGYSSEVKEACLKMFLSGMG 58 Query: 66 CRASARIMGVGLNTVLRHLKN 86 RA R G+ N+VL ++ Sbjct: 59 FRAIERATGISHNSVLNWVRR 79 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 88.4 bits (218), Expect = 6e-17, Method: Composition-based stats. Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 4/87 (4%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLC--SPC-RKTWQLQFTYTASQPGKHQKIIDMA 60 ++I CP C +T+ VV+NG S G QRY C C R+++ ++Y + ++I M Sbjct: 5 LAIECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMV 63 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 +NG G R +AR++ + TV LK S Sbjct: 64 VNGSGIRDTARVLEISPITVASELKKS 90 >UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillus RepID=A6CNB6_9BACI Length = 335 Score = 88.4 bits (218), Expect = 7e-17, Method: Composition-based stats. 
Identities = 21/88 (23%), Positives = 31/88 (35%), Gaps = 5/88 (5%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL---QFTYTASQPGKHQKIIDM 59 + C C + V RNGK QRYLC C K++ GK K M Sbjct: 49 KEGLGCIHCGSV-KVKRNGKYRE-RQRYLCRDCGKSFNELSNTPIAGTRYLGKWAKYFHM 106 Query: 60 AMNGVGCRASARIMGVGLNTVLRHLKNS 87 + G A+ + + ++T Sbjct: 107 MVEGYTLPKIAKRLKIHISTAFYWRHKI 134 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 87.7 bits (216), Expect = 1e-16, Method: Composition-based stats. Identities = 63/73 (86%), Positives = 65/73 (89%) Query: 18 VRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGL 77 VRNGKSTAGHQRYLCS CRKTWQLQFTYTASQPG HQKIIDMAMNGVGCRA+ARIMGVGL Sbjct: 2 VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGL 61 Query: 78 NTVLRHLKNSGRS 90 NT+LRHL Sbjct: 62 NTILRHLNKLRPQ 74 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 87.7 bits (216), Expect = 1e-16, Method: Composition-based stats. Identities = 25/84 (29%), Positives = 36/84 (42%), Gaps = 6/84 (7%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDMAMNG 63 CP C +E + RNGK G QRY+C C+KT+ + K K +NG Sbjct: 54 CPLCG-SETISRNGKY-NGKQRYICKSCKKTFTDFTNSATYKSKKTLDKWLKYAKCMING 111 Query: 64 VGCRASARIMGVGLNTVLRHLKNS 87 R SA+I+ + + T Sbjct: 112 YSIRKSAKIVEINIATSFFWRHKI 135 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 87.3 bits (215), Expect = 2e-16, Method: Composition-based stats. 
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 4/94 (4%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPC---RKTWQLQFTYTASQPGKHQKII 57 M + PSC +++ VV+ + T G QRY C R T+ Q+ Y Q+I+ Sbjct: 1 MVLEPVLYPSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIV 59 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 +M +NG G R AR++ + TV LK S + Sbjct: 60 EMVVNGSGTRDPARVLKISRTTVTETLKKSSSAE 93 >UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID=A9VV42_BACWK Length = 342 Score = 86.1 bits (212), Expect = 3e-16, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 35/91 (38%), Gaps = 5/91 (5%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFT---YTASQPGKHQKIIDM 59 CP C A+E VVR GK QRY C C KT+ Y + + +D Sbjct: 51 KEGFECPHC-ASEHVVRFGK-HNNRQRYRCKCCSKTFTDTTNTVLYRTRKGNEWITFVDC 108 Query: 60 AMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 G R SA I+GV T+ + Sbjct: 109 MFKGYSLRKSAEIVGVTWVTLFYWRHKLLSA 139 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 85.7 bits (211), Expect = 4e-16, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 2/85 (2%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGV 64 I+CP C + + + +NG G Q Y+C C + + + + + +NG+ Sbjct: 10 PIQCPDC-SCQHIPKNGHQP-GKQNYICVACSHQFIKPYHPQEYSDNVKRLFLRIYVNGM 67 Query: 65 GCRASARIMGVGLNTVLRHLKNSGR 89 G R A + GV T++ +K++ Sbjct: 68 GIRRIAWVKGVTYPTIINLIKHTRE 92 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 85.7 bits (211), Expect = 4e-16, Method: Composition-based stats. 
Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + C C ++ +V+NG S +G Q+Y C C L + +KI+ Sbjct: 1 MIKETYECRECGSSN-IVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKN 86 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLKK 85 >UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyricum RepID=B1QSI6_CLOBU Length = 336 Score = 85.0 bits (209), Expect = 7e-16, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 35/92 (38%), Gaps = 8/92 (8%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSP--CRKTWQLQ----FTYTASQPGKHQK 55 CP C + ++ GK QRY C C KT+ + Y QP K + Sbjct: 29 IKEYESCPYCGC-KHFIKYGKY-QDIQRYKCKNEECGKTFSNTTFSVWKYLKYQPEKWIE 86 Query: 56 IIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 I++ G+ +SARI+ + T Sbjct: 87 FIELMCEGMTLESSARILKITTTTAFYWRHKI 118 >UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD Length = 317 Score = 84.6 bits (208), Expect = 9e-16, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 31/90 (34%), Gaps = 5/90 (5%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKT---WQLQFTYTASQPGKHQKII 57 M + CP C ++E ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + + R +A+ GV NT Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHRF 125 >UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitrosomonas europaea RepID=Q81ZP0_NITEU Length = 323 Score = 83.4 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 31/89 (34%), Gaps = 5/89 (5%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ---LQFTYTASQPGKHQKIID 58 +S CP C + R G AG QR+ C C+ T+ + Sbjct: 43 SSFEPICPVC-QSNHFYRWGY-QAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA 100 Query: 59 MAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + G+ RASAR + NT R Sbjct: 101 ALIEGLTVRASARQCRIDKNTSFRWRHRF 129 >UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. 
Group II '5-way CG' RepID=B6ARX2_9BACT Length = 133 Score = 83.4 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 34/89 (38%), Gaps = 5/89 (5%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL---QFTYTASQPGKHQKIIDM 59 S RCP C E V + G+ G QRY C CR+ + + K ++ Sbjct: 47 SEHPRCPHCQD-EHVAKWGRVK-GLQRYRCEACRRQFTPLTNTPLSGLRKREKWGAYLEA 104 Query: 60 AMNGVGCRASARIMGVGLNTVLRHLKNSG 88 +G+ R +A+ +GV T Sbjct: 105 MEDGLSVRKAAQRIGVNHKTTFLWRHRFS 133 >UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 RepID=P73782_SYNY3 Length = 141 Score = 83.4 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 2/88 (2%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMN 62 S CP C VV+NG G QR+ C C+ + + + + M Sbjct: 2 STHCHCPQCGHGN-VVKNGFVK-GKQRFKCKRCQYKFTNLSKERGKLLWMKLEAVLLYMG 59 Query: 63 GVGCRASARIMGVGLNTVLRHLKNSGRS 90 G+ A+A+++GV ++L +++ G + Sbjct: 60 GMSMNATAKLLGVSTQSLLNWIRDFGEA 87 >UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyricum RepID=C4IIL3_CLOBU Length = 325 Score = 83.4 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 32/89 (35%), Gaps = 6/89 (6%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKII 57 CP C E ++ GK G QRY C C+KT+ + Y P K K I Sbjct: 29 IKEYSCCPHCKNVE-FIKFGKY-DGIQRYRCKSCKKTFSYTTNSLWKYLKHPPEKWFKFI 86 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKN 86 ++ A+ + + + T Sbjct: 87 ELLGEKKTLEYCAKTLKISIVTAFNWRHK 115 >UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellular organisms RepID=B0ABB1_9CLOT Length = 454 Score = 82.7 bits (203), Expect = 4e-15, Method: Composition-based stats. 
Identities = 20/89 (22%), Positives = 35/89 (39%), Gaps = 6/89 (6%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIID 58 ++CP C + + + +NGK+ QRY+C CR T+ + T K Sbjct: 136 KNDLKCPKCGSFD-LNKNGKT-NQRQRYICKNCRTTFDERSFSPLSNTKLSLDTWLKYCQ 193 Query: 59 MAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + G + A+ +GV + T Sbjct: 194 FMIEGGTIKYCAQKVGVSIPTSFFMRHRI 222 >UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes RepID=D2QCU0_9SPHI Length = 139 Score = 82.3 bits (202), Expect = 4e-15, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 2/86 (2%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGV 64 +++CP C++ + V RNG QR+ C C + + K + + + GV Sbjct: 3 TLKCPKCNSVDAV-RNG-IVNQRQRFRCKKCNYNFTVGKVGKGISTYYVIKALQLYIEGV 60 Query: 65 GCRASARIMGVGLNTVLRHLKNSGRS 90 R R++G+ +V+ +K Sbjct: 61 SFREIERLLGISHVSVMNWVKKYQIK 86 >UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelandii RepID=Q9AMR3_AZOVI Length = 214 Score = 82.3 bits (202), Expect = 4e-15, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 31/90 (34%), Gaps = 5/90 (5%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKT---WQLQFTYTASQPGKHQKII 57 M + CP C ++E ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + + R +A+ GV NT Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHRF 125 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 81.9 bits (201), Expect = 6e-15, Method: Composition-based stats. 
Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 3/85 (3%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNG 63 I CPSC ++ V++NG S+ G +Y C+ CR+T+ + ++I+ +N Sbjct: 67 IRPNCPSC-KSDKVIKNG-SSRGKTKYKCNVCRRTFY-DANSRRMSREQKERILKEYLNR 123 Query: 64 VGCRASARIMGVGLNTVLRHLKNSG 88 + R A++ G L TV +K G Sbjct: 124 MSMRGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 81.9 bits (201), Expect = 6e-15, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 1/85 (1%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 I CP C + V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKE-KKVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 R++ARI+ + T+L+ + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGRK 95 >UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani RepID=Q891N5_CLOTE Length = 279 Score = 81.9 bits (201), Expect = 6e-15, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 37/88 (42%), Gaps = 6/88 (6%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDM 59 C C +E +V+NGK QRY+C C KT+ +Y+ K + Sbjct: 55 TDTICVHC-KSENIVKNGKYKE-KQRYICKDCHKTFTNYTNSPISYSKKNISKWIEYTKC 112 Query: 60 AMNGVGCRASARIMGVGLNTVLRHLKNS 87 + G R S++++G+ L+T Sbjct: 113 MLAGYSLRKSSKLVGISLSTAFYWRHKI 140 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 81.9 bits (201), Expect = 7e-15, Method: Composition-based stats. 
Identities = 31/86 (36%), Positives = 47/86 (54%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + + C C ++ VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKN 86 G RA++R + V NTVL Sbjct: 61 AQNHGKRATSRHLQVSYNTVLSACHR 86 >UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HSL0_PARL1 Length = 342 Score = 81.1 bits (199), Expect = 9e-15, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 33/94 (35%), Gaps = 11/94 (11%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSP-----CRKTWQ---LQFTYTASQPGKHQKIIDM 59 CP C + +V++G+ G QR+ C C +T+ +P K M Sbjct: 55 CPHCGH-DDIVKHGRDRGGRQRFRCRRSGSSGCGQTFNALTGTAFTRMRKPEKWAAYARM 113 Query: 60 AMNGVGCRASARI--MGVGLNTVLRHLKNSGRSR 91 G + +G+ T R R++ Sbjct: 114 MATGFKSVDDVKTSGLGISRLTAWRWRHRLLRAQ 147 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 80.3 bits (197), Expect = 1e-14, Method: Composition-based stats. Identities = 22/86 (25%), Positives = 38/86 (44%), Gaps = 2/86 (2%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGV 64 ++ CP C+ ++G G QRY C CR + + +K + + + G+ Sbjct: 3 TMNCPRCNNAHSC-KDGIVR-GRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLYLEGL 60 Query: 65 GCRASARIMGVGLNTVLRHLKNSGRS 90 G RA RI+ + TV + +K G Sbjct: 61 GFRAIGRILNISYGTVYQWVKACGDQ 86 >UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IP3_CLOAB Length = 171 Score = 79.6 bits (195), Expect = 3e-14, Method: Composition-based stats. 
Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 9/92 (9%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW----QLQFTYTASQPGKHQKIID 58 + + C E RNGK QRY+C C+KT+ + + K + Sbjct: 50 KVYLHCKL----EMFSRNGKHDE-KQRYVCKTCKKTFTDFTYSPISSSKKPLDKWLQYAK 104 Query: 59 MAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + G R A+ + + + T + Sbjct: 105 CMIVGYSIRKCAKTVNINIATSFFWRHKILEA 136 >UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AFFE Length = 357 Score = 79.6 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 33/84 (39%), Gaps = 5/84 (5%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNG 63 CP C + +NGK HQRY+C C K++ + F ++ + I++ + Sbjct: 50 CPICGSV-HFKKNGKDKNRHQRYICLDCHKSFSDRTNTLFYWSHFTLDQWLHFIELELYK 108 Query: 64 VGCRASARIMGVGLNTVLRHLKNS 87 + A+++ T Sbjct: 109 MPLEGEAQVLETSKTTCFYMRHKL 132 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 79.6 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 5/84 (5%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCR 67 CP+C + ++NG G + C C + + + T Q I + + G+ R Sbjct: 37 CPNCG-SHHTIKNGSIHNGKPKRQCKECGRQFVINPTNKTVSDETKQLIDKLLLEGISLR 95 Query: 68 ASARIMGVGLNTVLRHLKNSGRSR 91 AR+ G L+N ++ Sbjct: 96 VIARVTGAS----WSWLQNYVNNK 115 >UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X4X6_FLAB3 Length = 169 Score = 79.6 bits (195), Expect = 3e-14, Method: Composition-based stats. 
Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 2/84 (2%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCR 67 CP C + VV++G QR+LC C + ++ K + + + G+ R Sbjct: 36 CPKCQQ-QNVVKSGIVKE-RQRFLCRSCNYYFTVKKLGKQIDDYYVTKALQLYLEGLSYR 93 Query: 68 ASARIMGVGLNTVLRHLKNSGRSR 91 RI+GV T+ ++ R Sbjct: 94 EIERILGVSHVTISSWVRKYNIKR 117 >UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IZ3_CLOAB Length = 142 Score = 78.8 bits (193), Expect = 5e-14, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 34/84 (40%), Gaps = 6/84 (7%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDMAMNG 63 CP C +E + RN K G Q Y+C C+K++ + K K +NG Sbjct: 54 CPICG-SETISRNSKY-NGKQGYICKSCKKSFTDFTNSATYKSKKTLDKWLKYAKCMVNG 111 Query: 64 VGCRASARIMGVGLNTVLRHLKNS 87 R SA+++ + + T Sbjct: 112 YSIRKSAKVVEINIATSFFWRHKI 135 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 78.0 bits (191), Expect = 8e-14, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 6/85 (7%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDMAMN 62 CP C + V +NGKS G QRY+C CR ++ F+ T K K ++ + Sbjct: 52 ECPKCQCKD-VNKNGKS-NGRQRYICKRCRTSFDEFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRASARIMGVGLNTVLRHLKNS 87 G+ R A +GVG+ T Sbjct: 110 GLSIRKCAEEVGVGVKTSFYMRHRI 134 >UniRef50_C1I4B6 Putative uncharacterized protein n=2 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4B6_9CLOT Length = 361 Score = 77.6 bits (190), Expect = 1e-13, Method: Composition-based stats. 
Identities = 20/86 (23%), Positives = 35/86 (40%), Gaps = 8/86 (9%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLC--SPCRKTWQLQ----FTYTASQPGKHQKIIDMAM 61 CP+C+ + ++ GK G QR+ C C KT+ + F+ + K + + Sbjct: 57 CPNCN-SNNFIKYGKYR-GLQRFKCLNKDCCKTFSQKTNSIFSNSKKPLELWLKYLILMN 114 Query: 62 NGVGCRASARIMGVGLNTVLRHLKNS 87 N R + I+G+ L T Sbjct: 115 NKFSLRKCSSILGINLATSFYWRHKF 140 >UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDS3_NEOSM Length = 134 Score = 77.3 bits (189), Expect = 1e-13, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 36/83 (43%), Gaps = 3/83 (3%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + CP C++ +++GK+ QRY C C + + A + + ++G+ Sbjct: 1 MHCPKCNSV-RFIKSGKAKE-KQRYKCLNCGCQFSRNEKHGAPLR-LKMHAVQLFLSGIS 57 Query: 66 CRASARIMGVGLNTVLRHLKNSG 88 + A+I V TV+R + Sbjct: 58 MNSIAKIFSVSPPTVMRWVNQFS 80 >UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus communis RepID=B9TDK1_RICCO Length = 321 Score = 77.3 bits (189), Expect = 1e-13, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 29/83 (34%), Gaps = 5/83 (6%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKT---WQLQFTYTASQPGKHQKIIDMAMNGV 64 CP C R G++ +G QR+ C C ++ + + + Sbjct: 52 CPHCGCARKH-RCGQA-SGLQRFRCLHCGRSHNALTKTPLARLRKKECWLPYLQCVLESR 109 Query: 65 GCRASARIMGVGLNTVLRHLKNS 87 R +A+I+GV T R Sbjct: 110 TVRDAAQIVGVHRTTSFRWRHRF 132 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 76.9 bits (188), Expect = 2e-13, Method: Composition-based stats. 
Identities = 23/87 (26%), Positives = 40/87 (45%), Gaps = 3/87 (3%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ--FTYTASQPGKHQKIIDM 59 ++ I+CP+C ++ + +NG + G Q Y C C++ + TY K KI + Sbjct: 3 ITLYIKCPAC-LSDNIKKNGFKSYGKQNYKCKDCKRQFIGDHALTYQGCHSQKDSKIRYL 61 Query: 60 AMNGVGCRASARIMGVGLNTVLRHLKN 86 + G G + A + + VL LK Sbjct: 62 MVRGSGIKDIACVERISKGKVLATLKK 88 >UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gammaproteobacteria RepID=A1SXI4_PSYIN Length = 319 Score = 76.1 bits (186), Expect = 3e-13, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 31/90 (34%), Gaps = 5/90 (5%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL---QFTYTASQPGKHQKIIDMAM 61 S +CP C + GK+ + QRY C C KT+ + K + Sbjct: 52 SPQCPHCHCA-HFTKWGKAGS-VQRYKCFSCHKTFNNKTKTPLAKLHRCELWDKYAECMS 109 Query: 62 NGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 + R +A + + L T ++ Sbjct: 110 LKLTLREAAAVCNINLKTSFLWRHRFLMAQ 139 >UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMX8_ACAM1 Length = 134 Score = 75.3 bits (184), Expect = 6e-13, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 35/93 (37%), Gaps = 9/93 (9%) Query: 6 IRCPSCSATEGVVRNGKSTAG----HQRYLCSPCRKTWQLQF-TYTASQP---GKHQKII 57 + CP C +E +++ G + QRY C C + + + T A I Sbjct: 1 MECPYC-QSEKILKRGFDSLQDGTLVQRYQCKDCNRRFNERTGTPMARLRTASSVVSYAI 59 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 G+G R++ R G T++R K Sbjct: 60 KARTEGMGVRSAGRTFGKSHTTIMRWEKRLADQ 92 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 74.9 bits (183), Expect = 8e-13, Method: Composition-based stats. 
Identities = 16/79 (20%), Positives = 28/79 (35%), Gaps = 1/79 (1%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 +RC C ++ V +NG + Q + C C K W + + + V Sbjct: 1 MRCTHCG-SDLVKKNGYTRHEKQNFRCLECGKQWSENKEAKIINEQTKELVRKALLEKVS 59 Query: 66 CRASARIMGVGLNTVLRHL 84 RI V + +L + Sbjct: 60 LNGICRIFDVSMPWLLDFI 78 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 74.9 bits (183), Expect = 8e-13, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 36/84 (42%), Gaps = 2/84 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + C C + + NGK G QRY C C + K + I + + +G Sbjct: 1 MECKGC-KSNKTINNGKVR-GKQRYNCKSCGFNFVEVDERRGKNIDKQRMAIHLYLENMG 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGR 89 RA R++GV VL+ ++ +G Sbjct: 59 FRAIGRVLGVSNLAVLKWIRAAGE 82 >UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V2B8_9AQUI Length = 125 Score = 74.2 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 2/87 (2%) Query: 3 SISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMN 62 I+CP C + + GK+T G QRY C+ C + + Y + M Sbjct: 11 QEHIKCPECG-SNWCKKFGKNT-GKQRYKCNECGRHFYEGAKYHKHPEKVKLLALKMYSK 68 Query: 63 GVGCRASARIMGVGLNTVLRHLKNSGR 89 G+ A AR++ + TV R G+ Sbjct: 69 GMSKSAIARVLNLPYRTVARWTYEVGK 95 >UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=environmental samples RepID=Q64EP4_9ARCH Length = 164 Score = 74.2 bits (181), Expect = 1e-12, Method: Composition-based stats. 
Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 4/77 (5%) Query: 17 VVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNGVGCRASARI 72 +VR G G QR+ C C K + F + I + + G RA RI Sbjct: 38 IVRYGHDKNGRQRFKCKTCGKVFVETKNTVFYNRKLSEDQIILICKLLVEKNGIRAIERI 97 Query: 73 MGVGLNTVLRHLKNSGR 89 M + +T+ +K+ R Sbjct: 98 MEIHRDTISDVVKDLAR 114 >UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BSX6_9BIFI Length = 352 Score = 74.2 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 5/84 (5%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTW----QLQFTYTASQPGKHQKIIDMAMNG 63 C C + ++R G+ G QR+ C C +T+ + + G + ++ ++ Sbjct: 55 CVRCGSI-RIIRKGRGRDGSQRWKCMNCNRTFGVRTNRVMGMSKLKAGVWMRFLECFVDC 113 Query: 64 VGCRASARIMGVGLNTVLRHLKNS 87 + R A+ GV L T + Sbjct: 114 LSLRKCAQRCGVCLKTAFLMRQRV 137 >UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B4C9_9RHOB Length = 321 Score = 73.8 bits (180), Expect = 2e-12, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 37/83 (44%), Gaps = 7/83 (8%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYT----ASQPGKHQKIIDMAMNG 63 CP C A + R G++ AG QRY C C KT+ + + +Q + DM +G Sbjct: 50 CPHCGAVDR-QRWGRTRAGSQRYRCQGCLKTFNGRTGSSIAQLQKLDQFYQVLKDMFSDG 108 Query: 64 V--GCRASARIMGVGLNTVLRHL 84 R AR + V +T+ R Sbjct: 109 PPRSIRRLARQLDVNKDTIWRWR 131 >UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W2G4_DYAFD Length = 388 Score = 73.4 bits (179), Expect = 2e-12, Method: Composition-based stats. 
Identities = 20/80 (25%), Positives = 34/80 (42%), Gaps = 6/80 (7%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 I C C+ +G+++ G G QRYLC C + A + + ++ + Sbjct: 2 IECVKCAQVDGIMKAGYVR-GKQRYLCKWCNYYFT-----HAEKDDSIESLVKRKRHQTT 55 Query: 66 CRASARIMGVGLNTVLRHLK 85 A+ +GV +TV R L Sbjct: 56 IIDIAKSLGVSNSTVSRALH 75 >UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L9I6_MAGSM Length = 89 Score = 73.4 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 6/90 (6%) Query: 1 MAS--ISIRCPSCSATEGVVRNGKSTAGHQRYLCSP--CRKT-WQLQFTYTASQPGKHQK 55 MA+ + + CP C + + V++ GK G QR+ C+ C +T + + ++ Sbjct: 1 MATMEVHVHCPDCGSLD-VIKFGKDRHGRQRFRCNDHFCDRTIFMMDDPDWWRFEEVKKQ 59 Query: 56 IIDMAMNGVGCRASARIMGVGLNTVLRHLK 85 I ++G G +A +G+ V R K Sbjct: 60 IALHLLSGNGIHQTAHNLGLHPEFVNRMAK 89 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 73.4 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 2/85 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + C +C + R GK G QRY C C ++Q + Y A +I + Sbjct: 1 MNCKNCDQAHCIKR-GK-RNGIQRYYCKICFTSFQENYHYKAYDSSIDTLLISLLRECCS 58 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 AR++ + NTVL + + Sbjct: 59 VLGIARVLKISKNTVLSRMLKISKQ 83 >UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WF86_9ACTN Length = 243 Score = 72.6 bits (177), Expect = 4e-12, Method: Composition-based stats. 
Identities = 16/87 (18%), Positives = 27/87 (31%), Gaps = 5/87 (5%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ----LQFTYTASQPGKHQKIIDMA 60 + CP C + +GK+ G +RY C C + F K +I ++ Sbjct: 51 APVCPDCGSVRP-RLDGKAPNGARRYRCRECGCRFSALTGTIFADAKLPLHKIMRIAEVM 109 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 + R + V T Sbjct: 110 CHSASLRLMELVAEVSHGTAFLWRHKV 136 >UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FRR6_PSYA2 Length = 108 Score = 72.3 bits (176), Expect = 4e-12, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 34/85 (40%), Gaps = 3/85 (3%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ--FTYTASQPGKHQKIIDM 59 I I CP C + + +NG + G Q Y C C++ + TY +I M Sbjct: 3 TQIDISCPDCHSI-SLKKNGIKSYGKQNYQCKDCQRQFIGDHALTYQGCHSRIEDRIRLM 61 Query: 60 AMNGVGCRASARIMGVGLNTVLRHL 84 G G R A I V + VL L Sbjct: 62 TARGCGIRDIAVITSVSIGKVLSTL 86 >UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU Length = 507 Score = 71.9 bits (175), Expect = 6e-12, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 31/79 (39%), Gaps = 1/79 (1%) Query: 13 ATEGVVRNGKSTAG-HQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRASAR 71 T + + +G QRY C C+ T+ +++ + + ++ + G R R Sbjct: 112 HTHKHLYHAFGYSGDRQRYRCKSCQSTFVDKWSGANKKLQFQENLMGLLFTGYSVREICR 171 Query: 72 IMGVGLNTVLRHLKNSGRS 90 + + T H+++ Sbjct: 172 KLAINPKTFYDHVEHIASR 190 >UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoides sp. BAV1 RepID=A5FST1_DEHSB Length = 319 Score = 71.5 bits (174), Expect = 8e-12, Method: Composition-based stats. 
Identities = 21/94 (22%), Positives = 33/94 (35%), Gaps = 9/94 (9%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQF--TYTASQPGKHQKIIDMAM 61 + I C C + R G S A QR+LC+ C T+ + P + + M Sbjct: 6 LPIECKYCG-SRHTRRYGHSRAQKQRWLCNDCCHTFVETSAQPGMRTPPEQIGAAVSMFY 64 Query: 62 NGVGCRASAR----IMGVGL--NTVLRHLKNSGR 89 G+ A R I + TV + + Sbjct: 65 EGLSLSAICRQMKQIHNISPSDGTVYGWITKYSK 98 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 71.1 bits (173), Expect = 9e-12, Method: Composition-based stats. Identities = 28/85 (32%), Positives = 48/85 (56%), Gaps = 6/85 (7%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 + C C T+ V R+GK + G+QR+ CS C++T+QL++ Y A +H++ + G Sbjct: 1 MSCRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVA---DRHERY---SPGNAG 54 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 R +AR++ VG + R K + R Sbjct: 55 IRDTARVLKVGCMGLTRFRKLNPRQ 79 >UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SD87_FERPL Length = 94 Score = 70.7 bits (172), Expect = 1e-11, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 5/91 (5%) Query: 6 IRCPSCSATEGVVR---NGKSTAGHQRYLCSPCRKTWQLQF-TYTASQPGKHQKIIDMAM 61 + CP C + + V + KS QRY C C +T+ L + ++ + Sbjct: 1 MMCPHCKSIKTVKMGCYHTKSGERRQRYKCKNCGRTFVLNPIKPRNYPEEFKEMVVKAVV 60 Query: 62 -NGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 GVG R ++RI + NTV ++ + R Sbjct: 61 REGVGVRQASRIFKLSPNTVTAWVREFSKKR 91 >UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_VIBFM Length = 489 Score = 70.7 bits (172), Expect = 1e-11, Method: Composition-based stats. 
Identities = 17/94 (18%), Positives = 32/94 (34%), Gaps = 12/94 (12%) Query: 9 PSCSAT-----------EGVVRNGKSTAG-HQRYLCSPCRKTWQLQFTYTASQPGKHQKI 56 PSC+ + + + +G QRY C C T+ +++ + QK+ Sbjct: 79 PSCNNSECEHFGFDVLTHRELYHAFGYSGDRQRYRCKSCASTFVDKWSGENQKSLIQQKL 138 Query: 57 IDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + G R R + + T H+ Sbjct: 139 LGFLFTGYSVREICRRLHINPKTFYDHINQIASR 172 >UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_LACF3 Length = 428 Score = 70.3 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 20/99 (20%), Positives = 35/99 (35%), Gaps = 21/99 (21%) Query: 7 RCPSCSATEGVVRNGKSTA-----------------GHQRYLCSPCRKTWQLQF----TY 45 RCP C + ++NG S QR C C+ ++ + Y Sbjct: 44 RCPHCGFADTFIKNGHSYQTIKYLSINESCPTMLRIDKQRLRCKNCQDSFMAKTNVVDKY 103 Query: 46 TASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 + K + M + V + ++ GV +T+ R L Sbjct: 104 CSIAKAVKHKALTMLESNVSQKDVSKFTGVSPSTIGRLL 142 >UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 238 RepID=B5K5I7_9RHOB Length = 319 Score = 69.9 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 33/87 (37%), Gaps = 5/87 (5%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ---LQFTYTASQPGKHQKIIDMAMNG 63 CP C+A V+R G+S G +RY C C KT+ + +G Sbjct: 50 NCPHCAAGGAVIR-GRS-NGLKRYFCKICSKTFNALTGTPLARLRHKDCWTEFAGSLSDG 107 Query: 64 VGCRASARIMGVGLNTVLRHLKNSGRS 90 + SA GV +T R R+ Sbjct: 108 DTVKTSAARCGVASSTAFRWRHRFLRA 134 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 69.2 bits (168), Expect = 4e-11, Method: Composition-based stats. 
Identities = 25/64 (39%), Positives = 39/64 (60%), Gaps = 1/64 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MASI+I CP C+ ++ V R+GK+ AG+ RY C C +QL +TY A P ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marina EX-H1 RepID=C0QU68_PERMH Length = 94 Score = 68.8 bits (167), Expect = 5e-11, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 2/87 (2%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M I CP C +E V+NGK+ G Q YLC C + + + ++ +++ Sbjct: 1 MGGKKISCPHC-ESERCVKNGKA-NGKQTYLCKECYYRFTINASKRKYPFKIRREAVNLY 58 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNS 87 G ++ + + + T+ +K Sbjct: 59 KEGYTLTEISKKLNIKVQTIHHWVKKY 85 >UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Methanocaldococcus infernus ME RepID=C5U8R8_9EURY Length = 100 Score = 68.0 bits (165), Expect = 8e-11, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 45/91 (49%), Gaps = 6/91 (6%) Query: 6 IRCPSCSATEGVVRNGKSTAGH----QRYLCSPCRKTWQLQFTYTASQPGKHQKIID-MA 60 IRC C+ ++ VV+ GK + Q YLC C++ + + +K++ + Sbjct: 5 IRCKYCN-SDKVVKAGKHKSEKYGVRQMYLCKKCKRRFVEESKAPRYSDSFKEKVVRSVV 63 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 G+G R + R+ + T+LR +K+ +++ Sbjct: 64 FEGLGIRQAGRVFKLSTTTILRWIKDFKKTK 94 >UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VPB4_9FLAO Length = 343 Score = 68.0 bits (165), Expect = 1e-10, Method: Composition-based stats. 
Identities = 15/83 (18%), Positives = 26/83 (31%), Gaps = 5/83 (6%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL---QFTYTASQPGKHQKIIDMAMNGV 64 CP C E VR G G QRY C C +++ + + + + + Sbjct: 51 CPHCLH-EKYVRFG-VDKGSQRYKCKSCNRSFTEYTGTWMAGLQRKDMISSYLSLMVQEK 108 Query: 65 GCRASARIMGVGLNTVLRHLKNS 87 + +G+ T Sbjct: 109 SLDKISSELGINKKTAFDWRHKI 131 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 67.6 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 40/86 (46%), Gaps = 4/86 (4%) Query: 1 MASIS-IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDM 59 MA+ + C C G+V+NGK+ AG QR+LC C + +T+ ID Sbjct: 1 MANRNRPSCDMCG--HGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDIRHFKI-FIDW 57 Query: 60 AMNGVGCRASARIMGVGLNTVLRHLK 85 ++G A+ +GV T+ R K Sbjct: 58 ILSGESADHLAKRLGVTRRTLTRWFK 83 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 66.9 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 48/52 (92%), Positives = 50/52 (96%) Query: 40 QLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 QLQFTYTASQPG HQKIIDMAMNGVGCRA+ARIMGV LNT+LRHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MKY8_9DELT Length = 632 Score = 66.9 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 31/72 (43%), Gaps = 2/72 (2%) Query: 21 GKSTAGHQRYLCSPCRKTWQLQFTY--TASQPGKHQKIIDMAMNGVGCRASARIMGVGLN 78 G + AG QR+ C C KT+ + + GK ++ + V R AR VG Sbjct: 123 GHTKAGSQRFRCKICHKTFSIPLAANLRQRKKGKSTEVFRLLTCQVAIRKMARNARVGKE 182 Query: 79 TVLRHLKNSGRS 90 TV R++ R Sbjct: 183 TVHRYIHLIHRQ 194 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 66.5 bits (161), Expect = 2e-10, Method: Composition-based stats. 
Identities = 24/65 (36%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 MA++++ P C+ ++ V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 66.5 bits (161), Expect = 2e-10, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 1/90 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M RC C+ + ++ G ++ QRY C C+K + +++Y A Q + I + Sbjct: 1 MNKRRNRCIHCNYS-YCIKAGITSQNKQRYQCKKCKKKFIGKYSYRAYQKSTNHNIQQLI 59 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 GVG R +R++ V TVL+ + Sbjct: 60 KEGVGIRGISRLLNVSKTTVLKKILKIASK 89 >UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5CBF Length = 184 Score = 64.2 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 36/106 (33%), Gaps = 21/106 (19%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAG----------------HQRYLCSPCRKTWQLQFT 44 + + CP C ++ V+NG T+ QR+LC C ++ L+ Sbjct: 42 LTKDTCACPHCH-SQTTVKNGFKTSKVRYLPFQNYPIIIALKKQRFLCKECHHSFTLETP 100 Query: 45 YTASQPGK----HQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKN 86 +++ + A+ + + TV R LK Sbjct: 101 IVKKYASISQTLKLSVLNSLQENMSLSLIAKQHRISIPTVQRILKQ 146 >UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia RepID=B0K4X0_THEPX Length = 343 Score = 64.2 bits (155), Expect = 1e-09, Method: Composition-based stats. 
Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 4/62 (6%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNG 63 + ++CP C+ T + GK G+Q+YLC C + P K K + G Sbjct: 5 VPLKCPKCNNTHLFYKYGKDKDGYQKYLCRKCYHQFAPD----KPSPKKTSKYPRCPVCG 60 Query: 64 VG 65 Sbjct: 61 KS 62 >UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C9BRL5_ENTFC Length = 433 Score = 63.4 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 30/101 (29%), Gaps = 23/101 (22%) Query: 7 RCPSCSATEG---VVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTA 47 RCP C +V+NGK + QRY C C + Sbjct: 46 RCPLCKQMNHEGMIVKNGKKKSLIQLNKCANQLTYLALAKQRYHCRGCHTYFTANTYIVD 105 Query: 48 SQ----PGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 KI++ A+ GV +TV R L Sbjct: 106 RNCFIAKQVRYKILEELTEKQAMTTIAKHCGVSWSTVSRTL 146 >UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria RepID=B4WSN9_9SYNE Length = 83 Score = 63.4 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 5/84 (5%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQK----IIDMAM 61 + CP C ++GK++ G QRY C+ CR+T+ F + + I+ + Sbjct: 1 MDCPFCDHPTPH-KHGKTSKGSQRYRCTACRRTFTETFDTLYDRRQVTSEQVKLILQTYV 59 Query: 62 NGVGCRASARIMGVGLNTVLRHLK 85 G R +RI TV+ ++ Sbjct: 60 EGSSLRGISRIGKRAYGTVVDIVR 83 >UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZNT7_ACAM1 Length = 188 Score = 63.0 bits (152), Expect = 3e-09, Method: Composition-based stats. 
Identities = 21/93 (22%), Positives = 37/93 (39%), Gaps = 9/93 (9%) Query: 6 IRCPSCSATEGVVRNG----KSTAGHQRYLCSPCRKTWQL---QFTYTASQP-GKHQKII 57 ++C C +E VV+NG K+ Q +LC C + + P I Sbjct: 1 MQCIHC-QSENVVKNGTKTLKTAQVVQYFLCKDCGRRFNERSGTPMARLRTPVETISMAI 59 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + G+G RA+ R++ N+++ K Sbjct: 60 NARTEGLGIRAAGRVLRKSPNSIILWEKRLSAQ 92 >UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=Enterococcus faecium RepID=Q3Y3Y2_ENTFC Length = 401 Score = 61.5 bits (148), Expect = 7e-09, Method: Composition-based stats. Identities = 26/105 (24%), Positives = 41/105 (39%), Gaps = 21/105 (20%) Query: 1 MASISIRCPSCS-ATEGVVRNGK-------STAG---------HQRYLCSPCRKTWQLQF 43 + RCP C +T+ +V+NGK + +G QRYLC C+K + + Sbjct: 40 LIRTYRRCPCCKDSTKQIVKNGKKISMILLNRSGNKRTYLRLKKQRYLCRACKKYFTART 99 Query: 44 TYTAS----QPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 H KI++ +A + V + TV R L Sbjct: 100 YLVTPFCFISKQIHYKILEELTERQSIKAIGKHCDVSVTTVQRTL 144 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 60.3 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 42/62 (67%), Positives = 45/62 (72%), Gaps = 1/62 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQ-KIIDM 59 MASI + PSC+ TEGV RNGKSTAGHQ YLC CRK W L FTYT SQ HQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AM 61 + Sbjct: 67 TI 68 >UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ24_FERNB Length = 316 Score = 60.3 bits (145), Expect = 2e-08, Method: Composition-based stats. 
Identities = 13/66 (19%), Positives = 29/66 (43%), Gaps = 3/66 (4%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + ++ CP C +T + +NG G+Q++LC C +++ +++ + Sbjct: 1 MNNSTLSCPKCGST-SLYKNGHDKYGNQQFLCKLCHHSFK--LSHSQKRKNFPFPYPKCT 57 Query: 61 MNGVGC 66 G Sbjct: 58 SCGKSM 63 >UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultured archaeon RepID=D1JAI8_9ARCH Length = 192 Score = 59.9 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 21/73 (28%), Positives = 30/73 (41%), Gaps = 4/73 (5%) Query: 20 NGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDMAMNGVGCRASARIMGV 75 GK Q C C K + + + G I + G G RA+ARIMG+ Sbjct: 36 YGKGEKRTQMLKCKVCGKRFSIHKGTPLFNLKADEGAFYGTIAHLVEGNGIRATARIMGI 95 Query: 76 GLNTVLRHLKNSG 88 +TV + LK + Sbjct: 96 NKDTVSKWLKKAS 108 >UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF9_FERPL Length = 357 Score = 59.5 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 35/98 (35%), Gaps = 11/98 (11%) Query: 3 SISIRCPSCSATEGVVRNG--KSTAG-HQRYLCSPCRKTWQLQ--FTYTASQPGKHQKII 57 C +C + V++ G + +G Q Y C C K + + F + + Sbjct: 81 KEERTCKNCGRDDEVIKKGIRYNKSGPVQMYYCKRCGKKFSARTGFGGMKKRAEAIVAAL 140 Query: 58 DMAMNGVGCRASARIMG------VGLNTVLRHLKNSGR 89 D+ G+ R A+ + V TV +K + Sbjct: 141 DLYFRGLSLRQVAQHLKASYNVEVCHKTVHNWIKRYVK 178 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 59.2 bits (142), Expect = 4e-08, Method: Composition-based stats. 
Identities = 12/63 (19%), Positives = 22/63 (34%), Gaps = 5/63 (7%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M +I+CP C +E + + G +Q+Y C C + + + K Sbjct: 1 MNKTNIKCPRCH-SEKLYKFGFDKQANQKYQCKECGRQFAPDSVSSRP----KSKYPRCP 55 Query: 61 MNG 63 Sbjct: 56 KCN 58 >UniRef50_P04137 Uncharacterized protein in transposable element ISH50 n=11 Tax=Halobacteriaceae RepID=YIH50_HALSA Length = 294 Score = 58.0 bits (139), Expect = 8e-08, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 37/90 (41%), Gaps = 7/90 (7%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAM 61 + CPSC E V+R G QRYLC C +T+ Q F ++A K + + Sbjct: 26 VYCPSC-RAESVIRYGSYRV-FQRYLCKDCDRTFNDQTGTVFEHSAVALRKWFLAVYTYI 83 Query: 62 N-GVGCRASARIMGVGLNTVLRHLKNSGRS 90 R + V TV R ++ R+ Sbjct: 84 RLNTSIRQLDAEIDVSYKTVYRRVQRFLRA 113 >UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CFC8 Length = 262 Score = 58.0 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 24/82 (29%), Gaps = 5/82 (6%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDMAMN 62 +C C + + G G QRY+C C K + F + + + Sbjct: 97 QCLFCG-SHDFTKYGHKKDGTQRYICKGCGKRFTPLTNTIFDSKKIPISEWIEYLLHLFE 155 Query: 63 GVGCRASARIMGVGLNTVLRHL 84 ++A T L Sbjct: 156 FHSINSTAYDNRNSPTTGKYWL 177 >UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4F0_UNCMA Length = 141 Score = 56.8 bits (136), Expect = 2e-07, Method: Composition-based stats. 
Identities = 22/78 (28%), Positives = 30/78 (38%), Gaps = 4/78 (5%) Query: 16 GVVRNGKSTAGHQRYLCSPCRKTW----QLQFTYTASQPGKHQKIIDMAMNGVGCRASAR 71 VV+ G S AGHQ + C C + + I + G RA R Sbjct: 28 RVVKKGFSRAGHQVFQCRHCGRHFCETINTPMYGRRITREDVILIGKLLNERNGIRAIER 87 Query: 72 IMGVGLNTVLRHLKNSGR 89 I G +TV+R K+ R Sbjct: 88 ITGHHRDTVMRVAKDLAR 105 >UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ99_FERNB Length = 261 Score = 56.5 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 26/48 (54%), Gaps = 1/48 (2%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTAS 48 M +I ++CP C ++ ++NG +Q + C C++ ++L FT Sbjct: 1 MTNIQLKCPHCGSSN-FIKNGHDKFKNQIFFCKDCKRYFKLSFTKKHK 47 >UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZA4_TRIEI Length = 469 Score = 56.5 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 31/67 (46%), Gaps = 8/67 (11%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTAS------QPGKHQKIIDM 59 ++CP+C +T + +NG+ QRY C C + + +Q + P + + + + Sbjct: 1 MKCPTCGST-SLRKNGR-PNNRQRYRCKDCGRQFMVQSPTSNIEQKISVNPSESKALAEA 58 Query: 60 AMNGVGC 66 +G+ Sbjct: 59 PKSGMAI 65 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 56.1 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 20/50 (40%), Positives = 30/50 (60%) Query: 42 QFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 Y A + ++II+MA G G R +A + +G+NTV+R LKNS +S Sbjct: 24 TLAYEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 56.1 bits (134), Expect = 4e-07, Method: Composition-based stats. 
Identities = 22/97 (22%), Positives = 37/97 (38%), Gaps = 15/97 (15%) Query: 4 ISIRCPSCSATEGVV--RNGKSTAGHQRYLCSPCRKTWQLQFTY---TASQPGKHQKIID 58 I CP CS+T V RNG G Q + C CR + + T ++ Sbjct: 71 IVPECPKCSSTVRVKAGRNG----GRQMFQCKQCRTRYVSRGPGARKTRYSQDIISAALN 126 Query: 59 MAMNGVGCRASARIMG------VGLNTVLRHLKNSGR 89 M+G+ R +A + + NT++ + + Sbjct: 127 KVMSGMSYRKTAEEVNTAHGRDLSPNTIMFWTRKYTQ 163 >UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium jeikeium K411 RepID=Q4JT92_CORJK Length = 165 Score = 55.7 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 25/86 (29%), Gaps = 5/86 (5%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M + CP C +NG ++ R+ C+ C ++ I A Sbjct: 1 MTTNRPSCPLCG--NNTKKNGTTSKSTTRWRCTHCGHSFTRNTQTHNKNTATMALFIQWA 58 Query: 61 MNGVGCRASARIMGVGLNTV---LRH 83 A GV T+ R Sbjct: 59 TGTQSLTTFAAHHGVTRQTMHHRFRW 84 >UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q8PSY9_METMA Length = 146 Score = 55.7 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 21/100 (21%), Positives = 40/100 (40%), Gaps = 11/100 (11%) Query: 3 SISIRCPSCSAT-------EGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPG 51 + + CP+ + +++ GK GHQRY C C K + Sbjct: 5 TDEVVCPNPKCSYYLKAEGRAIIKRGKYKTGHQRYYCKHCEKFFMDTIGTAIYRKHLSKE 64 Query: 52 KHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 + + I + + G R+ RI G +T+ LK++ ++ Sbjct: 65 EIRMIYRLFLEKNGIRSIERITGHHRDTISNLLKDTVKNE 104 >UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_ENTFA Length = 446 Score = 55.3 bits (132), Expect = 5e-07, Method: Composition-based stats. 
Identities = 17/106 (16%), Positives = 36/106 (33%), Gaps = 22/106 (20%) Query: 7 RCPSCS--ATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTAS 48 C C + +++ G QR+ C C KT+ + + + Sbjct: 48 ECFHCHYQNKQTIIKWGWKKVSILLNDVSNYKTILRINKQRFKCKHCGKTFLAEDSVSDR 107 Query: 49 Q----PGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + Q I+++ + AR+ + TV+R L++ Sbjct: 108 RCSIARRVKQAILELLSEPISMSLIARMKHISPTTVIRILRSLRPK 153 >UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WST6_9SYNE Length = 81 Score = 55.3 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 14/71 (19%), Positives = 26/71 (36%), Gaps = 1/71 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M P+C + + VV+NGK G Q + C C + + I D+ Sbjct: 1 MLDHQPTRPACHSKQ-VVKNGKIHNGKQNHRCKNCGRQFVKDPQQKRISDATKALIDDLL 59 Query: 61 MNGVGCRASAR 71 + + ++ Sbjct: 60 LERLSMNNPSK 70 >UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacteria RepID=Q5LYW0_STRT1 Length = 448 Score = 55.3 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 34/110 (30%), Gaps = 19/110 (17%) Query: 1 MASISIRCPSCSAT---EGVVRNGKSTAGHQ------------RYLCSPCRKTWQLQFTY 45 + +++ CP C +N K + Q R+ C CR+ + + Sbjct: 15 LITLAPSCPHCQGKMIKYDFQKNSKISLLEQAGTPTLLRLKKRRFQCKSCRRVTVAETSI 74 Query: 46 TASQPG----KHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 QK+ + V AR + V +TV R L Sbjct: 75 VEKNCQISNLVRQKVTQLLTEKVSLTDIARRLRVSTSTVYRKLYQFTFKE 124 >UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_FUSNN Length = 428 Score = 55.3 bits (132), Expect = 6e-07, Method: Composition-based stats. 
Identities = 18/102 (17%), Positives = 35/102 (34%), Gaps = 21/102 (20%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGH----------------QRYLCSPCRKTWQLQFT 44 + S CP C +++ +V+NG QRY+C C+KT+ Sbjct: 46 LKSDYCTCPHC-SSKNIVKNGSRHRKIKYIPIQNHNIELELTVQRYICKDCKKTFSPSTN 104 Query: 45 ----YTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLR 82 ++ I + + A+ + + +V R Sbjct: 105 IVSDNSSISNNLKYAIALELQKNISLTSIAKRYNISIPSVQR 146 >UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MXF0_9DELT Length = 512 Score = 55.3 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 2/66 (3%) Query: 19 RNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKH--QKIIDMAMNGVGCRASARIMGVG 76 R G++ AG +RY C C +T+ + TA Q H +KI +N + + Sbjct: 43 RFGETAAGARRYRCKLCSRTFSINGKPTARQRDTHKNKKIYMHLVNKSPFKRICEQAEIS 102 Query: 77 LNTVLR 82 T+ R Sbjct: 103 PATLYR 108 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 54.5 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 28/87 (32%), Positives = 37/87 (42%), Gaps = 3/87 (3%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ--FTYTASQPGKHQKIIDMAM 61 I CP CS+ + + +NGK Q YLC C + + TY Q+I+ M + Sbjct: 5 TPISCPKCSSCQ-IKKNGKKPNNKQNYLCKCCGRQFIGDHALTYRGCHSKISQRILIMLV 63 Query: 62 NGVGCRASARIMGVGLNTVLRHLKNSG 88 G G R A I V VL L N Sbjct: 64 RGCGIRDVAAIEKVSCTKVLSVLLNVR 90 >UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracasei subsp. paracasei ATCC 25302 RepID=C2FEQ0_LACPA Length = 425 Score = 53.8 bits (128), Expect = 2e-06, Method: Composition-based stats. 
Identities = 20/98 (20%), Positives = 30/98 (30%), Gaps = 20/98 (20%) Query: 7 RCPSCSATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTASQP 50 CP+C +VR G QR+ C CR +Q + Y + Sbjct: 48 HCPACGFASKLVRYGFERTCVLMPSYSYRPTYMKLSRQRFRCELCRSVFQSETDYVRPRS 107 Query: 51 GKHQKIIDMAM----NGVGCRASARIMGVGLNTVLRHL 84 + M + + AR V TV R + Sbjct: 108 TISTPVRQMVLFEAFSNCSLTDIARRFHVADKTVQRII 145 >UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia RepID=B2A0V7_NATTJ Length = 353 Score = 53.4 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 14/68 (20%), Positives = 23/68 (33%), Gaps = 7/68 (10%) Query: 1 MASISIRCPSCS--ATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYT---ASQPGKHQK 55 M + CP C+ ++ + G GHQ+Y C C + + P +K Sbjct: 1 MTK--VVCPRCNNNCSDKFYKFGFDNHGHQKYQCQECFSQFAPKTLSKGGDKRGPNMPRK 58 Query: 56 IIDMAMNG 63 G Sbjct: 59 YPSCPKCG 66 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 13/115 (11%), Positives = 26/115 (22%), Gaps = 30/115 (26%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLC--SPCRKTWQ------------------ 40 M CP C + + C C ++ Sbjct: 55 MPRKYPSCPKCGKATFLH---HDYEFYSNLRCCDKSCNHSFYVPKPQSIPEPSQLDINGK 111 Query: 41 LQFTYTASQPGKHQKIIDMAM-NGVGCRASARIM------GVGLNTVLRHLKNSG 88 + F+ + + + NG R ++ + V T+ K Sbjct: 112 VDFSNMRHPLHTIIRALYLYFINGSSTRGVSQFLLDCEGIKVSHVTIADWTKKFA 166 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 53.4 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 18/51 (35%), Positives = 28/51 (54%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPG 51 M ++ ++C C TE V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRACHCS 51 >UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobiales RepID=Q07NT9_RHOP5 Length = 577 Score = 53.4 bits (127), Expect = 2e-06, Method: Composition-based stats. 
Identities = 18/87 (20%), Positives = 34/87 (39%), Gaps = 11/87 (12%) Query: 7 RCP--SCSATEGVV--------RNGKSTAGHQRYLCSPCRKTWQLQFTY-TASQPGKHQK 55 CP SC + R+G S G RY C CRKT+ ++ + + +++ Sbjct: 103 HCPDDSCENYNKLFDSHPKSYFRHGTSAIGAPRYRCKACRKTFSVRTGHSRHRKSHENKT 162 Query: 56 IIDMAMNGVGCRASARIMGVGLNTVLR 82 + + ++ V +I + V Sbjct: 163 VFQLLVSKVPITKIGQITDLSPAAVYD 189 >UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FRB5_METHJ Length = 138 Score = 53.4 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 4/79 (5%) Query: 15 EGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNGVGCRASA 70 + + +NG ++AG+Q+Y C CR+ + + I + R + Sbjct: 26 KNITKNGHNSAGNQQYYCHHCRRFFIETKNTPLYDSRLPRTAVLIIAKHSTEKTSIRGVS 85 Query: 71 RIMGVGLNTVLRHLKNSGR 89 R+ G +T+ R+ G Sbjct: 86 RVTGHHRDTISRYYHLIGE 104 >UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax=Enterococcus RepID=Q3Y1C3_ENTFC Length = 431 Score = 53.4 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 34/107 (31%), Gaps = 27/107 (25%) Query: 8 CPSCSATEG-------VVRNGKSTA----------------GHQRYLCSPCRKTWQLQFT 44 C +C +T VV+NGK QRY C CR W Q Sbjct: 45 CRNCGSTVVDGNGKVIVVKNGKKETIVRFEQYNHMPLVMRLKKQRYTCKNCRTHWTTQSY 104 Query: 45 ----YTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + KI + V A+ V L TV+R LK Sbjct: 105 FVQPRHSIANHVRYKIASLLTEKVSLSFIAKNCQVSLTTVIRTLKEF 151 >UniRef50_A1VN28 Insertion element protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VN28_POLNA Length = 324 Score = 53.0 bits (126), Expect = 3e-06, Method: Composition-based stats. 
Identities = 26/91 (28%), Positives = 37/91 (40%), Gaps = 5/91 (5%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ---LQFTYTASQPGKHQKIID 58 A+ CP C TE + R+G +G QRY C CR+T+ + GK + Sbjct: 47 ATEPRCCPHCQGTE-LYRHGHV-SGLQRYRCRTCRRTFNALTGTALARLRKKGKWFGFSE 104 Query: 59 MAMNGVGCRASARIMGVGLNTVLRHLKNSGR 89 + R +A + V NT LR R Sbjct: 105 ALAASLTLRRAATALQVHRNTALRWRHRFLR 135 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 53.0 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 9/71 (12%), Positives = 21/71 (29%) Query: 19 RNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLN 78 + G G Q+Y C C + + L + + + + V + Sbjct: 11 KKGHIHNGKQKYQCLACGRQFVLNPSQKIIDERTRLLTKKTLLECIALEGVCWVFDVSMP 70 Query: 79 TVLRHLKNSGR 89 +L + + Sbjct: 71 WLLEFIGELTK 81 >UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacteriaceae RepID=A4W908_ENT38 Length = 414 Score = 52.6 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 12/37 (32%), Positives = 20/37 (54%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFT 44 CP+C + ++RNG G QR+ C C ++ + T Sbjct: 68 CPTCGQGDALIRNGCGLRGAQRWRCRTCNSSFTDKST 104 >UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2CJK1_9FIRM Length = 422 Score = 52.2 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 14/100 (14%), Positives = 29/100 (29%), Gaps = 20/100 (20%) Query: 8 CPSCSATEGVVRNGKSTAG----------------HQRYLCSPCRKTWQLQF----TYTA 47 CP C + +++ G ++ Q+ C C K + L+ + Sbjct: 48 CPHCGSNHNLIKYGFKSSNVRCSRAGDYPVIIDLKKQKMFCKSCNKYFLLETKIVDKHCN 107 Query: 48 SQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + I+ + + V TV R + Sbjct: 108 ISNQIKRHILASLTKKLSMKDIGSNNYVSTTTVARFMAKL 147 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 52.2 bits (124), Expect = 6e-06, Method: Composition-based stats. 
Identities = 19/40 (47%), Positives = 26/40 (65%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ 40 MA I + CP + T+ V+RNG +T+G Q Y C C KT+Q Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQ 40 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 51.8 bits (123), Expect = 6e-06, Method: Composition-based stats. Identities = 11/36 (30%), Positives = 21/36 (58%) Query: 50 PGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLK 85 + + M++NG+G RA R+ G+ NT+L ++ Sbjct: 19 SDVKELCVKMSLNGMGFRAIERVTGISHNTILNWVR 54 >UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae BGR1 RepID=C5A9A4_BURGB Length = 284 Score = 51.8 bits (123), Expect = 7e-06, Method: Composition-based stats. Identities = 22/97 (22%), Positives = 38/97 (39%), Gaps = 15/97 (15%) Query: 1 MASISIRCPSCSATEGV-------VRNGKSTAGH-----QRYLCSPCRKTW---QLQFTY 45 M + CP+ +NG H RY C C K + Q++ + Sbjct: 1 MRNPRPVCPNPDCVHHTNPPADFYRKNGYRRTKHNGQPVPRYQCKACGKNFCATQVKPIH 60 Query: 46 TASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLR 82 +P + ++ MA++ VG R A ++ G T+ R Sbjct: 61 GQHRPDLNTQVFKMAVSRVGIRRMATVLDCGRETIQR 97 >UniRef50_C7N1Y2 Putative uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1Y2_SLAHD Length = 332 Score = 51.4 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 15/84 (17%), Positives = 24/84 (28%), Gaps = 5/84 (5%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDMAMNG 63 CP C + E V R + C C + + F + I + Sbjct: 54 CPRCGSGETVGRGRTGAGRR-FWECRDCGRKYTSLAGTIFESSKKPLSAWVLFIRLMCYN 112 Query: 64 VGCRASARIMGVGLNTVLRHLKNS 87 V A+A + G+ T Sbjct: 113 VQLDAAAELCGMSHQTAWEWRHRV 136 >UniRef50_Q4JSN3 Transposase for IS3507b n=53 Tax=Actinobacteridae RepID=Q4JSN3_CORJK Length = 422 Score = 50.7 bits (120), Expect = 1e-05, Method: Composition-based stats. 
Identities = 19/82 (23%), Positives = 29/82 (35%), Gaps = 4/82 (4%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M+ RC C + RNG ++ G R+ C C + + + G ID Sbjct: 31 MSKNQPRC-HCGG--EMKRNGTTSKGTTRWRCKHCGASSVKRRIDITNSTGF-TAFIDHL 86 Query: 61 MNGVGCRASARIMGVGLNTVLR 82 G A +G T+ R Sbjct: 87 TTGASLDTIASRVGCSPRTLQR 108 >UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WNK0_RHOS5 Length = 481 Score = 50.7 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 1/67 (1%) Query: 19 RNGKSTAGHQRYLCSPCRKTWQLQFTYTASQP-GKHQKIIDMAMNGVGCRASARIMGVGL 77 R GK+ G R+ C C KT+ + + K++ ++DM N + +RI G+ Sbjct: 132 RFGKTKGGDARWRCKGCGKTFSVGKPARRHKRSDKNRLVLDMLCNDLSFAKMSRISGLAY 191 Query: 78 NTVLRHL 84 + R + Sbjct: 192 RDIYRRV 198 >UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellular organisms RepID=Q64DF0_9ARCH Length = 337 Score = 50.3 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 32/93 (34%), Gaps = 9/93 (9%) Query: 5 SIRCPSCSATEGV---VRNGKSTAG-HQRYLCSPCRKTWQLQFT----YTASQPGKHQKI 56 +CP C ++E V + + G Q LC C ++ + K+ Sbjct: 7 PCKCPKC-SSENVRFDYKYDTISNGSRQMLLCRGCGASFSETKNTFLQNIRTPVSTIWKV 65 Query: 57 IDMAMNGVGCRASARIMGVGLNTVLRHLKNSGR 89 + G A+ R+ + NT+L + Sbjct: 66 LKSRTEGTSLNATCRVFDIAKNTLLAWERKFSS 98 >UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5E64 Length = 173 Score = 50.3 bits (119), Expect = 2e-05, Method: Composition-based stats. 
Identities = 22/101 (21%), Positives = 34/101 (33%), Gaps = 21/101 (20%) Query: 7 RCPSCSATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQF----TYT 46 +CP C + ++RNG + QR+LC C KT+ Y Sbjct: 48 KCPFCGE-KHIIRNGTKLSKIKILDVSNTPSYLYLRKQRFLCKSCSKTFSASTNFVRKYC 106 Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 I + N + + A+ V +TV R L Sbjct: 107 NIADSIKLSIALESKNIISEKDIAKRFRVSSSTVKRSLLQY 147 >UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C348D8 Length = 467 Score = 49.9 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 33/89 (37%), Gaps = 3/89 (3%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M +I CPSC +TE + + G + G RY C C + K+I+ Sbjct: 67 MKNIEKACPSCYSTENI-KYGTTAIGTVRYQCKNCNNVYS--LKNLNKFDDVDNKLIESL 123 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGR 89 + + + + + R L+N Sbjct: 124 LKNTKVSTIFKELKITPASFYRRLENINE 152 >UniRef50_Q7NH53 TetR family transcriptional regulatory protein n=1 Tax=Gloeobacter violaceus RepID=Q7NH53_GLOVI Length = 227 Score = 49.5 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ 42 ++CP C +E + RNG QR LC C + + L Sbjct: 188 MKCPRCG-SERLSRNGH-RHDRQRLLCKDCSRQFLLP 222 >UniRef50_B4WUH8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WUH8_9SYNE Length = 76 Score = 49.5 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 27/56 (48%), Gaps = 3/56 (5%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFT-YTASQPGKHQKIIDMA 60 + CP C +E + +NG G Q Y+C+ CR+ + ++ + ++ + M Sbjct: 1 MACPEC-QSEHIRKNGH-KRGKQNYICADCRRQFVENPKEHSGYSDEERKQCLSMY 54 >UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVZ4_9ACTO Length = 225 Score = 49.1 bits (116), Expect = 4e-05, Method: Composition-based stats. 
Identities = 13/43 (30%), Positives = 26/43 (60%), Gaps = 2/43 (4%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTAS 48 ++CP+C+ + RNGK+++G QR+ C C ++ + +A Sbjct: 41 MKCPACNT--PLKRNGKTSSGSQRWRCKECGRSKVGKIDNSAK 81 >UniRef50_B4VTL4 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VTL4_9CYAN Length = 124 Score = 49.1 bits (116), Expect = 5e-05, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 12/93 (12%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMA 60 +CP C +T + + +RY C+ C ++ + F T K I + Sbjct: 20 YPQCPYCQST-----HSRRLKKERRYQCNECFTSYSVTVGTLFHKTHVDLEKWVLAIYLV 74 Query: 61 M---NGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + + R A+ +GV NT + ++ Sbjct: 75 LNPPERISVRQLAKKIGVNKNTASYMIARIRQA 107 >UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMW7_ACAM1 Length = 75 Score = 48.8 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 29/61 (47%), Gaps = 1/61 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA 60 M+ + ++ P C + + GK++ G QRY C C++T+ F + ++I Sbjct: 1 MSYLLMQSPLCDHP-KIHKPGKTSKGSQRYRCLDCQQTFSETFDTLYYRLQISSEMIQAI 59 Query: 61 M 61 + Sbjct: 60 L 60 >UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JSC1_YERPY Length = 53 Score = 48.8 bits (115), Expect = 6e-05, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 21/37 (56%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRK 37 MA I +CP C + V ++G +GHQRY C +K Sbjct: 1 MAKIDEKCPFCERKDLVKKHGYGKSGHQRYRCPHAKK 37 >UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PWW0_METMA Length = 155 Score = 48.4 bits (114), Expect = 6e-05, Method: Composition-based stats. 
Identities = 17/83 (20%), Positives = 34/83 (40%), Gaps = 7/83 (8%) Query: 15 EGVVRNGKSTAGHQR---YLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNGVGCR 67 E ++ NG ++R Y+C C + + + F + I MA+ G+ + Sbjct: 29 ENIIGNGTYEIKNKRVRKYICRECGRVFNDRTGTFFDNVRKDESDIKLAIKMAIKGMSIQ 88 Query: 68 ASARIMGVGLNTVLRHLKNSGRS 90 A + ++ V TV L + + Sbjct: 89 AISDVLEVQPATVSNWLFRAAKQ 111 >UniRef50_C9BRL4 Transposase n=30 Tax=Enterococcus RepID=C9BRL4_ENTFC Length = 431 Score = 48.4 bits (114), Expect = 6e-05, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 24/67 (35%), Gaps = 4/67 (5%) Query: 27 HQRYLCSPCRKTWQLQFTYTASQPGKHQK----IIDMAMNGVGCRASARIMGVGLNTVLR 82 QR+ C C++T+ + QK +I AR + ++V R Sbjct: 86 KQRFKCKSCQRTFVADTSVAEKHCFISQKVRWSVIARLKENTSMTEIARQKNISTSSVYR 145 Query: 83 HLKNSGR 89 +K R Sbjct: 146 VMKRFYR 152 >UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepID=Q70JT0_MICAE Length = 112 Score = 48.4 bits (114), Expect = 7e-05, Method: Composition-based stats. Identities = 11/46 (23%), Positives = 18/46 (39%), Gaps = 1/46 (2%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKH 53 CPSC + ++NG G + C C + + + T P Sbjct: 35 CPSCG-SHHTIKNGYLPKGKPKRHCQECGQPFVINPTNKTISPDTK 79 >UniRef50_D1QQX4 Putative uncharacterized protein n=15 Tax=Prevotella RepID=D1QQX4_9BACT Length = 318 Score = 48.4 bits (114), Expect = 8e-05, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 27/82 (32%), Gaps = 7/82 (8%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 +RC C +T +NG G Q Y C C ++ S Sbjct: 4 MRCCVCGSTHT-KKNG-VRKGLQLYKCQDCGYQFRSG--SQVSNDELWTAYQQ---QKQT 56 Query: 66 CRASARIMGVGLNTVLRHLKNS 87 + + + ++TV R L + Sbjct: 57 IKELSVRFKISVSTVKRRLHDI 78 >UniRef50_C0WLQ9 Transposase n=3 Tax=Lactobacillus RepID=C0WLQ9_LACBU Length = 418 Score = 48.4 bits (114), Expect = 8e-05, Method: Composition-based stats. 
Identities = 19/96 (19%), Positives = 30/96 (31%), Gaps = 20/96 (20%) Query: 7 RCPSCSATEGVVRNGKSTAG----------------HQRYLCSPCR----KTWQLQFTYT 46 RCP+C + +V++G T QR+LC C Sbjct: 49 RCPNCGFADCLVKDGHKTVNLKLSPQRFHLLILRLAKQRFLCKHCGSIITSQTDAVKPNC 108 Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLR 82 Q ++ + + A+ GV TV R Sbjct: 109 QISKNVWQSVVMDFHDNMAATLIAKQNGVSAGTVNR 144 >UniRef50_Q03NU3 Transposase n=12 Tax=Lactobacillus RepID=Q03NU3_LACBA Length = 423 Score = 48.4 bits (114), Expect = 8e-05, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 30/103 (29%), Gaps = 21/103 (20%) Query: 8 CPSCSATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTAS--- 48 CP C+ VV NG T QR+ C C KT Q Sbjct: 47 CPYCAQ-RQVVCNGHKTVYVRLPNVSERTVILILRKQRFRCKACGKTSIAQTPVVRRQHQ 105 Query: 49 -QPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 I + R+ A V N+V R + G+ Sbjct: 106 ISENTRHAIDKTLIEDRTMRSIADQYNVSTNSVSRRILALGKQ 148 >UniRef50_C1DPZ8 Transposase n=4 Tax=Bacteria RepID=C1DPZ8_AZOVD Length = 236 Score = 48.0 bits (113), Expect = 8e-05, Method: Composition-based stats. Identities = 7/42 (16%), Positives = 12/42 (28%) Query: 46 TASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + Q + + R +A+ GV NT Sbjct: 3 RLRKRHLWQGYAEALTQSLTVRRAAKHCGVSKNTAFLWRHRF 44 >UniRef50_Q2RQJ8 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RQJ8_RHORT Length = 150 Score = 48.0 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 32/98 (32%), Gaps = 14/98 (14%) Query: 2 ASISIRCPSCSATEGVVRNGKSTAGHQR--YLCSPCRKTWQLQFTYTASQPGK----HQK 55 A CP C R+ G QR + C+ CRK + + + Sbjct: 41 ARSRPVCPHCG-----FRHAYRLEGSQRVRFKCARCRKQYSARRGTVMERSNVPTAGWLT 95 Query: 56 IIDMAMN--GVGCRA-SARIMGVGLNTVLRHLKNSGRS 90 + + ++ G G A R GV T ++ + Sbjct: 96 ALRLFISAPGAGLPARIERATGVSYKTAWSMVQRMRAA 133 >UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderiaceae RepID=B5S3H3_RALSO Length = 460 Score = 47.6 bits (112), Expect = 1e-04, Method: Composition-based stats. 
Identities = 15/88 (17%), Positives = 30/88 (34%), Gaps = 6/88 (6%) Query: 6 IRCPSCSATEGVVRNGKSTAGH-QRYLCSPCRKTWQL---QFTYTASQPGKHQKIIDMAM 61 RCP C +R ++ G+ + C C++++ K +I + Sbjct: 103 PRCPHCDGLR--IRPDRNKGGNLPSFFCHGCKRSFNRLTGTPFSHLVNRAKGAAMIPLLS 160 Query: 62 NGVGCRASARIMGVGLNTVLRHLKNSGR 89 + + + +G VL L R Sbjct: 161 RQMSLDQAGKRLGRTKKAVLSWLLAFRR 188 Score = 44.5 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 13/89 (14%), Positives = 29/89 (32%), Gaps = 9/89 (10%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL---QFTYTASQPGKHQKIIDMA--- 60 CP C + + +G + C C + + + M Sbjct: 350 SCPWCGSDQTKYHPAPRPSGLPGFRCRACLAYFTRVSNTPLVHPMARAYASRFVPMLGWH 409 Query: 61 MNGVGCRASARIMGVGLNTVLRHLKNSGR 89 G G +AR +G+ + T+ +++ + Sbjct: 410 ETGAG---AARELGIAMGTLHTWVRSWRQ 435 >UniRef50_B2J7N9 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J7N9_NOSP7 Length = 428 Score = 47.6 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 11/44 (25%), Positives = 19/44 (43%), Gaps = 2/44 (4%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQ 49 ++CP C +T +NG Q YLC C + + + + Sbjct: 1 MKCPRCEST-SCRQNGC-RNDKQNYLCKNCGQQFLEPVFPHSLK 42 >UniRef50_C5S2C5 Putative transposase n=1 Tax=Actinobacillus minor NM305 RepID=C5S2C5_9PAST Length = 394 Score = 47.6 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 17/94 (18%), Positives = 29/94 (30%), Gaps = 12/94 (12%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDM 59 I CP C T+ Q++ C C++ + L F + + Sbjct: 52 EKICCPHCQRTQP-----YFIKSRQKWRCRGCKREFSLTSGTLFASHKLPLRTYLLALVF 106 Query: 60 AMN---GVGCRASARIMGVGLNTVLRHLKNSGRS 90 +N G+ + AR + V T S Sbjct: 107 YINAKQGITSKRLARELAVNYRTAFMLSHKIRES 140 >UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7J3_CYAP7 Length = 354 Score = 47.6 bits (112), Expect = 1e-04, Method: Composition-based stats. 
Identities = 14/61 (22%), Positives = 23/61 (37%), Gaps = 11/61 (18%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 I+CP C ++ +NG + AG QRY C C + + + G+ Sbjct: 4 IQCPKC-KSKNYRKNG-TIAGKQRYQCKSCGRNFLAVSLSQSLP---------CFEQGMS 52 Query: 66 C 66 Sbjct: 53 I 53 >UniRef50_A7JMB8 Predicted protein n=8 Tax=Francisella RepID=A7JMB8_FRANO Length = 82 Score = 47.2 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 4/83 (4%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA--MNG 63 I+C C +EG+ + G QRY C C + L + + + Sbjct: 2 IKCNRCH-SEGIHKTGVVRN-KQRYKCKSCGYNFVLSDGRIKPDIAIKLALTVIMYSLGK 59 Query: 64 VGCRASARIMGVGLNTVLRHLKN 86 A++ GV + T+ L+ Sbjct: 60 YSYGFIAKLFGVRMTTIQNWLEQ 82 >UniRef50_Q9V1K2 Putative uncharacterized protein n=2 Tax=Pyrococcus RepID=Q9V1K2_PYRAB Length = 141 Score = 47.2 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 31/93 (33%), Gaps = 9/93 (9%) Query: 3 SISIRCPSCSATEGVVRNGKS-TAGH---QRYLCSPCRKTWQL----QFTYTASQPGKHQ 54 I CP C + +V+ G +G+ QRY C C +T+ S Sbjct: 31 KGRITCPYCKSPN-IVKIGYIMRSGNFKIQRYKCKNCNRTFTELDGTPLKGAHSLKDIVI 89 Query: 55 KIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + + A+I+ + + R K Sbjct: 90 VAYLTLDLKLPPSSIAKILPINRPKLYRAYKRV 122 >UniRef50_Q93CQ1 Transposase TnpA n=1 Tax=Enterococcus faecium RepID=Q93CQ1_ENTFC Length = 446 Score = 47.2 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 27/95 (28%), Gaps = 19/95 (20%) Query: 5 SIRCPSCSATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTAS 48 RC C + ++GK QRY C C +T+ + Sbjct: 34 PTRCQKCGTIANLYKHGKKRQLFFDLPMHAKRVGIYLKRQRYKCRDCNETFFEKLPDLDD 93 Query: 49 QPGKHQK---IIDMAMNGVGCRASARIMGVGLNTV 80 ++ I + A +GV TV Sbjct: 94 ARSVTKRLNNFIQEVSLEKTFTSVAEEIGVDEKTV 128 >UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT4_9LACT Length = 426 Score = 47.2 bits (111), Expect = 2e-04, Method: Composition-based stats. 
Identities = 18/104 (17%), Positives = 35/104 (33%), Gaps = 21/104 (20%) Query: 7 RCPSCSATEGVVRNGKSTAGH----------------QRYLCSPCRKTWQLQ----FTYT 46 CP C +++ V+++ QR++C CRKTW + Sbjct: 46 SCPYC-SSKNVIKHSPMEHKIRIPHLYGNKTLLELKVQRFICKDCRKTWVTDCPLVPKNS 104 Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 +I+ + A+++ + TV R +K Sbjct: 105 NISYDLACQIMLYLKENFSRKTIAKLLSISDKTVERVMKKFKIK 148 >UniRef50_Q035C5 Transposase n=27 Tax=Lactobacillales RepID=Q035C5_LACC3 Length = 414 Score = 47.2 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 23/105 (21%), Positives = 39/105 (37%), Gaps = 21/105 (20%) Query: 7 RCPSCSATEGVVRNGKST------AG----------HQRYLCSPCRKTWQLQFT----YT 46 RCP C E + NG T G QR+ C C T + Sbjct: 50 RCPLCG-FEALHPNGFYTAHVRVLNGVEIPTVIDLHKQRWRCHNCYHTVSAKTPLVQPNH 108 Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 ++I+ +A + + ARI+G+ ++V R + + + R Sbjct: 109 TIAAHMTERIMKLAHERLPVKTIARIIGISASSVQRIIDQNLKLR 153 >UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_METMA Length = 148 Score = 46.8 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 26/77 (33%), Gaps = 4/77 (5%) Query: 12 SATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNGVGCR 67 + V++ H + C C+K + F + + + I M R Sbjct: 34 NQGNIVLKERYGKNNHALFKCKTCKKCFSETKGTIFFELNTPDEEVLRTIAMLPEKGSIR 93 Query: 68 ASARIMGVGLNTVLRHL 84 AR G +T+ R L Sbjct: 94 GVARATGHSKDTICRWL 110 >UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VF2_TRIEI Length = 59 Score = 46.8 bits (110), Expect = 2e-04, Method: Composition-based stats. 
Identities = 13/59 (22%), Positives = 24/59 (40%), Gaps = 1/59 (1%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDM 59 M+ + CPSC + +V+NG Q+Y C C++ + + I + Sbjct: 1 MSIHKLICPSCG-SNHIVKNGTIHNKKQKYQCQNCQRQFVENSQRDYISNETKELIDKL 58 >UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostocaceae RepID=B2J098_NOSP7 Length = 133 Score = 46.8 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 22/38 (57%), Gaps = 1/38 (2%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQF 43 + CP C+ + + ++G+ G QRY+C C + ++ + Sbjct: 34 MECPKCN-SHLLGKHGREPDGVQRYICKNCSRIFRARP 70 >UniRef50_Q6MK35 Putative transposase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MK35_BDEBA Length = 300 Score = 46.4 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 34/93 (36%), Gaps = 15/93 (16%) Query: 5 SIRCPSCS-------ATEGVVRNGKSTAGHQ-----RYLCSPCRKTWQLQFTYT---ASQ 49 +++CP C A + R G+ R C C K++ + Sbjct: 8 NLKCPYCHLQRDPKDANRTIRRLGRYYRKSDGQTLTRLWCVRCGKSFSAATQSRLKGQKK 67 Query: 50 PGKHQKIIDMAMNGVGCRASARIMGVGLNTVLR 82 ++ I D+ + R AR++ + TV+R Sbjct: 68 RHLNKLIRDLLTGEMSQREIARVLKINRKTVVR 100 >UniRef50_C8SCF8 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF8_FERPL Length = 317 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 14/74 (18%), Positives = 25/74 (33%), Gaps = 5/74 (6%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAGH---QRYLCSPCRKTWQLQ--FTYTASQPGKHQK 55 + + CP C++ + + ++YLC C T+ F +T P Sbjct: 28 LNKWNPSCPHCNSYHIIKKTDIKRERKGYAKKYLCRDCNSTFTFDNCFEWTHYPPRVVGD 87 Query: 56 IIDMAMNGVGCRAS 69 I + G R Sbjct: 88 IFHLIAKGESYRDI 101 >UniRef50_B0CG58 Transcriptional regulator, TetR family n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CG58_ACAM1 Length = 260 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. 
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 2/36 (5%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQL 41 + CP C ++ + +NGK Q Y+C CRK + Sbjct: 221 MICPHC-QSDRLSKNGKRRNQ-QCYVCKDCRKQFVE 254 >UniRef50_D1PSS1 Insertion element protein (Fragment) n=14 Tax=Prevotella RepID=D1PSS1_9BACT Length = 113 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 28/78 (35%), Gaps = 7/78 (8%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCR 67 C C ++ VRNG G Q Y+C C ++ T S+ + Sbjct: 1 CSVC-KSKHTVRNG-VRQGKQLYMCKECHSQFR--AGNTVSEDELWRSYQQ---EKQTIA 53 Query: 68 ASARIMGVGLNTVLRHLK 85 + G+ L TV R L Sbjct: 54 ELSSRFGISLATVKRRLH 71 >UniRef50_Q7VL05 Possible transposase n=4 Tax=Pasteurellaceae RepID=Q7VL05_HAEDU Length = 363 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 12/93 (12%), Positives = 24/93 (25%), Gaps = 11/93 (11%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTY-------TASQPGKHQKII 57 I CP C V Q++ C C + + + + K + Sbjct: 50 DIECPHC----HVRHEAYFIKTRQQWQCKHCCYRFSITAGTIFHLAKLSLRKILKALRYF 105 Query: 58 DMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + G+ + + V T + Sbjct: 106 ALKSKGLSAIELSHEINVQYKTAWGLRHKFREA 138 >UniRef50_Q3Y3Y3 Transposase, IS204/IS1001/IS1096/IS1165 n=11 Tax=Lactobacillales RepID=Q3Y3Y3_ENTFC Length = 424 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. 
Identities = 20/105 (19%), Positives = 34/105 (32%), Gaps = 21/105 (20%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQ----------------RYLCSPCRKTWQLQF----T 44 C C + ++RNG T Q R+LC C +T+ + Sbjct: 44 PSHCEHCGSI-RIIRNGSYTTRTQILKVKEKLTILELKRTRFLCYDCGQTFSAKTDLVDE 102 Query: 45 YTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGR 89 + Q I+ + A+ V TV R L+ + + Sbjct: 103 HHQLTKELKQAILMELYENQSRKLIAKKYFVSDGTVTRILREATK 147 >UniRef50_Q2J1M8 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris HaA2 RepID=Q2J1M8_RHOP2 Length = 204 Score = 45.7 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 30/91 (32%), Gaps = 11/91 (12%) Query: 6 IRCPSCS--ATEGVVRNGKSTA-GHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIID 58 CP C + V GKS G YLCS CR+ + + T Sbjct: 19 PECPHCGVGSPSVVAIAGKSHRPGL--YLCSACRRQFTVTVGTPLEGTKLPLKLWIGAAH 76 Query: 59 MAMNGVGC--RASARIMGVGLNTVLRHLKNS 87 + + R R +GV T + ++ Sbjct: 77 LLNSHQPIAVREIERALGVTYKTAWKVVQRL 107 >UniRef50_A5FLG0 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FLG0_FLAJ1 Length = 311 Score = 45.3 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 11/92 (11%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQ---KIID 58 CP C + + V NG + R C C+K + ++ F T K II Sbjct: 39 PTCPYCESEKVKVLNGTTK----RLKCYGCKKQFGVKVGTIFHDTKISLRKWFIAVYIIT 94 Query: 59 MAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 G+ +R + V T L + Sbjct: 95 AHKKGISSHQLSRDLKVTQKTAWFMLHRVREA 126 >UniRef50_B9Y9S5 Putative uncharacterized protein (Fragment) n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y9S5_9FIRM Length = 238 Score = 45.3 bits (106), Expect = 6e-04, Method: Composition-based stats. 
Identities = 7/40 (17%), Positives = 15/40 (37%) Query: 51 GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + Q+ ++ + G AR+ + NT R + Sbjct: 3 DQWQRFLECYLRGESLDVCARVAQIHRNTAFRWRHKVNDA 42 >UniRef50_Q03IY7 Transposase n=198 Tax=Lactobacillales RepID=Q03IY7_STRTD Length = 442 Score = 45.3 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 23/65 (35%), Gaps = 4/65 (6%) Query: 27 HQRYLCSPCRKTWQLQFTYTASQPG----KHQKIIDMAMNGVGCRASARIMGVGLNTVLR 82 +R+ C C K + + +QKI + + A+ + V +TV R Sbjct: 101 KRRFKCKECGKMAVAETSLVKKNHQIATVVYQKIAQLLIEKQSMTDIAKRLAVSTSTVSR 160 Query: 83 HLKNS 87 L Sbjct: 161 KLNEF 165 >UniRef50_C7RJT2 Conserved possible transposase n=21 Tax=Proteobacteria RepID=C7RJT2_9PROT Length = 342 Score = 44.9 bits (105), Expect = 7e-04, Method: Composition-based stats. Identities = 14/94 (14%), Positives = 28/94 (29%), Gaps = 11/94 (11%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDM 59 + CP C + + C+ C++ + + F + + + + Sbjct: 40 EEVVCPHCGMAHRHY----FRPARKIWRCAGCQEDFSVTSGTIFAFHKLPLRLYLAAVIL 95 Query: 60 AMN---GVGCRASARIMGVGLNTVLRHLKNSGRS 90 N G+ R +GV T L S Sbjct: 96 FTNAVKGISALQVGRDLGVSHKTAYVLLHKIRES 129 >UniRef50_Q7N9S9 Transposase TnpA, ISL3 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N9S9_PHOLL Length = 429 Score = 44.5 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 35/105 (33%), Gaps = 20/105 (19%) Query: 2 ASISIRCPSCSATEGV---VR---------NGKSTA---GHQRYLCSPCRKTWQLQFTYT 46 A+ C C E V R NGK T +RY C C KT+ + Sbjct: 29 ANPPTHCIHCKHPEIVGFGRRDEVIMDTPVNGKRTGIILNRRRYRCQICCKTFMEPVPHK 88 Query: 47 ASQPGKHQKIIDMAMNGVGCRAS----ARIMGVGLNTVLRHLKNS 87 + ++I + R + A +GV TV +S Sbjct: 89 DGKRQMTHRLIQ-YIERESLRRTFSSVAEDVGVDEKTVRNIFHDS 132 >UniRef50_D1UAU0 Transposase, putative n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1UAU0_9DELT Length = 320 Score = 44.5 bits (104), Expect = 0.001, Method: Composition-based stats. 
Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 12/86 (13%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPG-----KHQKIIDMAMN 62 CP C R +G +R C+ C+ T+ F+ G + +I + ++ Sbjct: 65 CPRCGH-----RKVYDLSG-ERLRCADCKYTFH-PFSGRWINNGALTSLEWLNLITLFVD 117 Query: 63 GVGCRASARIMGVGLNTVLRHLKNSG 88 + +G+ NTV + L Sbjct: 118 ECSVHQMKQRLGLSYNTVYKALTAIR 143 >UniRef50_B3GXU2 Transposase n=15 Tax=Pasteurellaceae RepID=B3GXU2_ACTP7 Length = 373 Score = 44.5 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 10/93 (10%), Positives = 26/93 (27%), Gaps = 11/93 (11%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYT---ASQPGKHQKIIDMA- 60 + CP C + + +R+ C C++ + + P + Sbjct: 43 DVCCPHCG----IRHHAYFLQSRKRWCCKHCQRHFYITTNTAFAFHKLPFVDILAATLLF 98 Query: 61 ---MNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + G+ +R + + T + Sbjct: 99 ANEVKGISAITMSRHLNISYKTAFVLCHKLREA 131 >UniRef50_Q11ZU0 Putative uncharacterized protein n=1 Tax=Polaromonas sp. JS666 RepID=Q11ZU0_POLSJ Length = 590 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 17/85 (20%), Positives = 28/85 (32%), Gaps = 14/85 (16%) Query: 8 CPSCSATEGVV---------RNGKSTAGHQRYLCSPCRKTWQLQFTY-----TASQPGKH 53 CP C + +V G +TAG Y C C KT+ ++ + K+ Sbjct: 104 CPDCMCSNHLVPITQPKAYHSFGLTTAGSHCYRCKVCSKTFSVKPKGINPIARQLRSDKN 163 Query: 54 QKIIDMAMNGVGCRASARIMGVGLN 78 ++ M + R V Sbjct: 164 PPVLRMLTGKMPLRRICEAADVAPK 188 >UniRef50_C6HZQ4 Transposase n=2 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZQ4_9BACT Length = 443 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. 
Identities = 17/93 (18%), Positives = 30/93 (32%), Gaps = 20/93 (21%) Query: 7 RCPSCSATEGV--------VR----NGKSTA---GHQRYLCSPCRKTWQLQFTYTASQPG 51 RC C + + V +R +GK +R+ C C +T+ + Sbjct: 35 RCVHCGSIDLVGFGRREQWIRDLPIHGKRVGIAVDTRRFRCKSCGRTFYEPLPAVDDKRL 94 Query: 52 KHQKIIDMAMNGVGCR----ASARIMGVGLNTV 80 + + + R A GVG T+ Sbjct: 95 MTTR-LKTWLEKKSLRPPFSQLAEETGVGALTI 126 >UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family protein n=1 Tax=Comamonas testosteroni KF-1 RepID=B7X577_COMTE Length = 471 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 29/96 (30%), Gaps = 21/96 (21%) Query: 5 SIRCPSCSATEGVVRNG----------------KSTAGHQRYLCSPCRKTWQLQFTYTAS 48 CP C + + R+G K A QRY C+ C++T+ Sbjct: 33 PTACPKCGTLDCIYRHGTKATTYVDIPMRGKPAKLRAKVQRYRCTSCKETFLQPLGGILE 92 Query: 49 QPGKHQKIIDMAMNGVGCRA----SARIMGVGLNTV 80 ++ + R A +G TV Sbjct: 93 GRRMTERCAT-YIKAHSLRDTFTRIAENVGCDDKTV 127 >UniRef50_C7P9K3 Transcriptional regulator, ArsR family n=2 Tax=Methanocaldococcus RepID=C7P9K3_METFA Length = 206 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 12/37 (32%), Positives = 19/37 (51%) Query: 51 GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + KI+ NG G R +A+ +G+ TV R +K Sbjct: 146 DRWVKILKSLYNGCGVRETAKNLGLSPATVSREVKKL 182 >UniRef50_D0MDA7 Transposase-like protein n=7 Tax=Bacteria RepID=D0MDA7_RHOM4 Length = 279 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. 
Identities = 20/88 (22%), Positives = 29/88 (32%), Gaps = 10/88 (11%) Query: 7 RCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMN 62 CP C + G+ Y C CR+ W + GK + + Sbjct: 31 HCPYCKSEHL----GRVRRRF--YKCYRCRREWSPRKGSLLEGLRLPLGKFLLALKLFEL 84 Query: 63 GVGCRASARIMGVGLNTVLRHLKNSGRS 90 V R +AR +G+ NTV R Sbjct: 85 EVSARRAARELGLAYNTVHRLFLLFRER 112 >UniRef50_B7C761 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C761_9FIRM Length = 74 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 12/62 (19%), Positives = 23/62 (37%), Gaps = 4/62 (6%) Query: 30 YLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLK 85 + C C+K + + Y+ K ++I +NGV + +A + V V Sbjct: 2 FRCKECKKRFVVDRGQLTFYSHHDQSKWNELILDTLNGVSLKETAAKINVNERNVFNMRH 61 Query: 86 NS 87 Sbjct: 62 KL 63 >UniRef50_C2H217 Possible transposase n=5 Tax=Enterococcaceae RepID=C2H217_ENTFA Length = 438 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 33/104 (31%), Gaps = 21/104 (20%) Query: 1 MASISIRCPSCSATEGVVRNGKSTAG----------------HQRYLCSPCRKTWQLQFT 44 + + RC C + V+R+ + QR+ C+ CR T+ + Sbjct: 53 LTGEAPRCEYCG-FDSVIRHSYQDSWIQLLPYQEVPTYLHLYKQRFRCTRCRHTFSAKTY 111 Query: 45 YTASQPGK----HQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 Y A I + + A+ V TV R L Sbjct: 112 YVAENCYISQALKFAIAVDLKKKISMKDIAQRYFVSTKTVERVL 155 >UniRef50_UPI00016C448A hypothetical protein GobsU_12575 n=6 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C448A Length = 234 Score = 44.1 bits (103), Expect = 0.002, Method: Composition-based stats. 
Identities = 21/97 (21%), Positives = 35/97 (36%), Gaps = 16/97 (16%) Query: 10 SCSATEGVVRNGKSTAGH----QRY--------LCSPCRKTWQL----QFTYTASQPGKH 53 C +GK G+ RY CS C+ + T Sbjct: 7 FCCRNADCPDHGKRGHGNLTVPARYGPNRTRVLRCSTCQARFSERKGTPLYGTRLSAQTV 66 Query: 54 QKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 ++ G G R +AR++GV +TV R+++ +G Sbjct: 67 TAVLAHVAEGAGTRKTARLVGVHRDTVTRYIRQAGHQ 103 >UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolobus islandicus RepID=D2PJ85_SULIS Length = 82 Score = 43.7 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 2/39 (5%) Query: 27 HQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 QRYLC C + + Y ++ + M NGV Sbjct: 5 RQRYLCRDCGRYFLGDAIY--HSRELREEALKMYSNGVS 41 >UniRef50_UPI0001C31088 transcriptional regulator, TetR family n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31088 Length = 349 Score = 43.7 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 25/88 (28%), Gaps = 10/88 (11%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMN- 62 CP C A + G RY C C + F + + G M Sbjct: 31 CPRCGADRPF----RLRRGDIRYACRVCEMALDPRAGTAFEGSRTPLGTWFVATAMLRED 86 Query: 63 -GVGCRASARIMGVGLNTVLRHLKNSGR 89 + A A GV T R L+ Sbjct: 87 PQLTPTALAAEAGVSYATSWRMLRRLRE 114 >UniRef50_Q8U293 Transposase n=53 Tax=Pyrococcus RepID=Q8U293_PYRFU Length = 314 Score = 43.7 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 9/44 (20%), Positives = 21/44 (47%) Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 P K + +++ + G+ R +ARI+ + TV ++ + Sbjct: 102 KIPPEKKIRGVELYLRGLSYRQTARILKISHVTVWEAVQKLAEA 145 >UniRef50_D0U1S9 Transposase n=1 Tax=Enterococcus faecium RepID=D0U1S9_ENTFC Length = 427 Score = 43.7 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 17/104 (16%), Positives = 32/104 (30%), Gaps = 23/104 (22%) Query: 4 ISIRCPSCSATE---GVVRNGKSTAG----------------HQRYLCSPCRKTWQLQFT 44 + C C A + +NG T+ QR++C C K++ + Sbjct: 39 VPKECAHCEAPNVGFSIYKNGTQTSRVTFPMAGILPTYLRIRKQRFMCKCCGKSFTARTP 98 Query: 45 YTAS----QPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 +I+ + + A+ V +V R L Sbjct: 99 VVERNCFISNYIKAQILTQSGETRSVKDIAKHTNVSEASVQRVL 142 >UniRef50_A2V378 Putative uncharacterized protein n=1 Tax=Shewanella putrefaciens 200 RepID=A2V378_SHEPU Length = 214 Score = 43.7 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 29/90 (32%), Gaps = 12/90 (13%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAM-- 61 CPSC E S Y C+ C + L T K I + Sbjct: 29 CPSCGGKEYCKLKRHSL-----YQCNTCHQQTSLTAGTILDNTKLPLTKWFLAIFLLTQV 83 Query: 62 -NGVGCRASARIMGVGLNTVLRHLKNSGRS 90 NG+ +R++ V NT R ++ Sbjct: 84 KNGISALELSRLIEVSYNTAWRMKHKLMQA 113 >UniRef50_A9BGL8 Transposase IS204/IS1001/IS1096/IS1165 family protein n=9 Tax=Petrotoga mobilis SJ95 RepID=A9BGL8_PETMO Length = 455 Score = 43.7 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 38/101 (37%), Gaps = 22/101 (21%) Query: 5 SIRCPSCS-ATEGVVRNGKSTAGH-----------------QRYLCSPCRKTWQLQ---F 43 +C C E +VRNGK+ QRY+C KT++ + + Sbjct: 72 PYKCKGCKDKREYIVRNGKAKERIIKAGKVGTQRIYLIHRPQRYMCKKTGKTFRDEKISY 131 Query: 44 TYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 + + + I+ + A+A+ GV + TV L Sbjct: 132 RWQRITRAETENIVKGLR-KMSISATAKEFGVSVRTVSNLL 171 >UniRef50_C6QEP3 ISSpo8, transposase n=4 Tax=Alphaproteobacteria RepID=C6QEP3_9RHIZ Length = 330 Score = 43.7 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 14/103 (13%), Positives = 34/103 (33%), Gaps = 18/103 (17%) Query: 6 IRCPSCSATEGVV-------RNGK-STAGHQRY---LCSPCRKTWQLQ----FTYTASQP 50 CP C A + + + K + G +R+ C CRK + ++ F + Sbjct: 28 PVCPHCGADKRIYDLKGVRSKPSKRNPKGVERHGLKKCGACRKQFTVRVGTVFESSHIPL 87 Query: 51 GKHQKIIDMAMN---GVGCRASARIMGVGLNTVLRHLKNSGRS 90 + + + + G+ R++ + + + Sbjct: 88 HLWLQAVHLMCSSKKGISSHQLHRVLEIKYQSAWFMSHRIREA 130 >UniRef50_A8UDH0 Transposase n=5 Tax=Bacteria RepID=A8UDH0_9LACT Length = 439 Score = 43.7 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 31/109 (28%), Gaps = 23/109 (21%) Query: 5 SIRCPSCSATEG---VVRNGKSTAG----------------HQRYLCSPCRKTWQLQFT- 44 C C +V+NG T+ QR+LC C T+ Q Sbjct: 46 PSHCECCGMKNHSYSIVKNGYLTSRVKWVSSTHYPTYIQLKKQRFLCRECGVTFVAQSPE 105 Query: 45 ---YTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 Q I + + ++ V T R LK +S Sbjct: 106 IEQGCFIAKRVKQSIAVELADTTSVKDLSKRHFVSPTTTDRVLKQLNQS 154 >UniRef50_C1F2K1 Unclassified family transposase n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F2K1_ACIC5 Length = 327 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 29/92 (31%), Gaps = 12/92 (13%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAM 61 CP C + R T + C CRK + ++ F + K + + + Sbjct: 39 PVCPHCGSA----RYSFLTTRR-IWKCKSCRKQYSVKSGTIFEDSPIPLDKWLMAVWLVV 93 Query: 62 ---NGVGCRASARIMGVGLNTVLRHLKNSGRS 90 NGV R + V + L + Sbjct: 94 NCKNGVSSYEIMRAVKVTQKSAWFMLHRIRLA 125 >UniRef50_Q5LW63 ISSpo8, transposase n=4 Tax=Rhodobacterales RepID=Q5LW63_SILPO Length = 355 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. 
Identities = 17/93 (18%), Positives = 28/93 (30%), Gaps = 12/93 (12%) Query: 7 RCPSCSA-TEGVVRNGKSTAGHQRYLC--SPCRKTWQLQFT----YTASQPGKHQKIIDM 59 CP C + + +R + G Y C C + + T I + Sbjct: 39 HCPHCGSLSSTPIRGRTARPGL--YQCAERECCLQFTVTTKTPMHATKLDLRIWIAAIFL 96 Query: 60 AMN---GVGCRASARIMGVGLNTVLRHLKNSGR 89 + G+ ARI+GV T + Sbjct: 97 MLTSSKGISSVVMARILGVNQKTAWKLGHAIRE 129 >UniRef50_Q1GHU2 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GHU2_SILST Length = 124 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 15/67 (22%), Positives = 26/67 (38%), Gaps = 6/67 (8%) Query: 24 TAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNG------VGCRASARIMGVGL 77 QRY C C+KT+ + ++ + D+ + R AR +G+ Sbjct: 2 RTNVQRYRCGSCKKTFSGRTGTRIARIHRPGLFFDVLKDMPGPRPLSSVRVLARCLGLNK 61 Query: 78 NTVLRHL 84 +TV R Sbjct: 62 HTVWRWR 68 >UniRef50_B1IC92 Transposase n=24 Tax=Lactobacillales RepID=B1IC92_STRPI Length = 415 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 33/97 (34%), Gaps = 19/97 (19%) Query: 7 RCPSCS----ATEGVVR--------NGKSTA---GHQRYLCSPCRKTW----QLQFTYTA 47 RC +C +G + NG+ QRY C C T+ L Sbjct: 50 RCRNCGFPTVNKDGFRKTHVRLASLNGRRYELELRKQRYKCKSCHTTFGAITNLTKENQT 109 Query: 48 SQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHL 84 +I+ +A G+ + A + ++V R + Sbjct: 110 LSSDLKNQIMLLARKGLSGQLIAEMCHCSPSSVRRTI 146 >UniRef50_C0WEV9 Transposase (Fragment) n=1 Tax=Acidaminococcus sp. D21 RepID=C0WEV9_9FIRM Length = 358 Score = 43.4 bits (101), Expect = 0.003, Method: Composition-based stats. 
Identities = 17/101 (16%), Positives = 31/101 (30%), Gaps = 19/101 (18%) Query: 6 IRCPSCS-ATEGV--VRNGKSTAG------------HQRYLCSPCRKTWQLQFT----YT 46 ++CP+C T+ + R + G +RY+C C +T+ Y Sbjct: 45 VQCPNCHAKTDRIKDYRWQRIAIGSILHQQAFVRLHKRRYVCPCCGRTFFETVPFLQRYQ 104 Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 +I+ A TV+R+ Sbjct: 105 RKSKDLQMQIMVSCFQKRSFTDIAADFHTSTTTVIRYFDRL 145 >UniRef50_C5RB59 Possible transposase n=1 Tax=Weissella paramesenteroides ATCC 33313 RepID=C5RB59_WEIPA Length = 228 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 30/95 (31%), Gaps = 22/95 (23%) Query: 8 CPSCSATEGVVRNG----------------KSTAGHQRYLCSPCRKT----WQLQFTYTA 47 CP C+ + +NG + Q+Y+C C +T + Sbjct: 30 CPQCAC--LMNKNGTKLVQHIASRAANIFNQLAIRKQKYICPQCHQTALAEFTDIKAGDH 87 Query: 48 SQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLR 82 Q + V + A+ + +TV+R Sbjct: 88 IIANVKQAAAMELSDNVSQKHIAQAYNISPHTVMR 122 >UniRef50_B8F7V2 ISRssp2, family IS1595 n=4 Tax=Pasteurellaceae RepID=B8F7V2_HAEPS Length = 378 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 13/92 (14%), Positives = 28/92 (30%), Gaps = 11/92 (11%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMA 60 + CP C + + +R+ C C++ + + F + I + Sbjct: 44 DVCCPFCG----IRHHAYFLQSRKRWTCKHCKRNFYITTNTAFAFHKLPLVDILLAISLF 99 Query: 61 MN---GVGCRASARIMGVGLNTVLRHLKNSGR 89 +N G+ +R + V T Sbjct: 100 VNEVKGISAITMSRHLNVNYKTAFVLCHKLRE 131 >UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synechococcus sp. PCC 7335 RepID=B4WVD1_9SYNE Length = 298 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. 
Identities = 16/88 (18%), Positives = 29/88 (32%), Gaps = 10/88 (11%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKII-DMAMN-- 62 + CP C + + K+ G+ + C CR+T+ + + II + + Sbjct: 27 MDCPHCQSPRVSLLQRKTNLGYDMFRCKRCRRTFNERTGTPFNFIEVPTDIIFQVLLCRV 86 Query: 63 --GVGCRASA-----RIMGVGLNTVLRH 83 + R A R TV Sbjct: 87 RYKLSYRDVAEFFLLRGFQFTHETVRDW 114 >UniRef50_C6HVY3 Probable transposase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HVY3_9BACT Length = 146 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 4/48 (8%), Positives = 12/48 (25%) Query: 40 QLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + + + ++ G R +A G+ + Sbjct: 28 DEEPFGELRHKDRWRYYMEGMNEGESVRKAAWRCGINREAAFNWRQRF 75 >UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVK5_PARL1 Length = 608 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 11/62 (17%), Positives = 23/62 (37%), Gaps = 2/62 (3%) Query: 23 STAGHQRYLCSPCRKTWQLQFTY--TASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTV 80 S G QR+ C C++T+ + P ++ + ++ R + G+ V Sbjct: 133 SRGGAQRFRCKACQRTFSVALKSTVRQRAPHLNRTVFAEVVSKKPLRGIMEVTGLSAAAV 192 Query: 81 LR 82 Sbjct: 193 YD 194 >UniRef50_C3MUP9 Resolvase helix-turn-helix domain protein n=40 Tax=Sulfolobus RepID=C3MUP9_SULIM Length = 369 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 9/39 (23%), Positives = 18/39 (46%) Query: 51 GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGR 89 + I + + G+ A+I+ V +TV R +K + Sbjct: 22 ETKARAILLHLEGMKISQIAKILQVHKSTVYRWVKEFEK 60 >UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L491_AMOA5 Length = 119 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. 
Identities = 9/53 (16%), Positives = 21/53 (39%) Query: 38 TWQLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 + Q + + + + G+G RA ++ + TV + ++ SG Sbjct: 38 SLYSQPKSGVKPIQTKRLALQLYLEGLGFRAIGNLLQISYGTVYQWIEASGEQ 90 >UniRef50_C9CRL2 Transposase n=3 Tax=Alphaproteobacteria RepID=C9CRL2_9RHOB Length = 432 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 28/95 (29%), Gaps = 16/95 (16%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMA---- 60 CP+C + +R+ C+ C + + + T + +D+ Sbjct: 42 EPICPACGCVDV-----YDLTTRRRFKCAACHRQFSV--TSGTIFASRKLAFVDLLGAIC 94 Query: 61 -----MNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 G+ +R + V T + + Sbjct: 95 LFVNAAKGLSAVQMSRDLDVQHKTAFVLMHKLREA 129 >UniRef50_Q8R819 Transposase n=2 Tax=Thermoanaerobacter tengcongensis RepID=Q8R819_THETN Length = 455 Score = 42.6 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 11/45 (24%), Positives = 15/45 (33%), Gaps = 1/45 (2%) Query: 7 RCPSCSAT-EGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQP 50 CP C A + + GK G Q+ C C+ W Sbjct: 95 NCPVCGAPPDYLYSFGKDPDGFQKLQCKVCKHQWAPGKPAPKKSR 139 >UniRef50_B9JNY3 Transposase n=4 Tax=Alphaproteobacteria RepID=B9JNY3_AGRRK Length = 365 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 30/95 (31%), Gaps = 15/95 (15%) Query: 8 CPSCSATEGVVRNGKSTAGHQR-----YLCS--PCRKTWQLQFT----YTASQPGKHQKI 56 CP+C + G+ G +R Y CS CR + + T K Sbjct: 43 CPACGYKRSIAIAGRDM-GKRRARPGLYQCSSGDCRFQFTVTTHTPLHATKLPLRTWLKA 101 Query: 57 IDMAMN---GVGCRASARIMGVGLNTVLRHLKNSG 88 + + + G+ A +GV T R Sbjct: 102 MWLLLQSDKGLSSVRLAETLGVSQPTAWRIGHALR 136 >UniRef50_A7HMZ5 Transposase IS204/IS1001/IS1096/IS1165 family protein n=14 Tax=Bacteria RepID=A7HMZ5_FERNB Length = 395 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. 
Identities = 15/104 (14%), Positives = 33/104 (31%), Gaps = 19/104 (18%) Query: 7 RCPSCS---------ATEGVV------RNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPG 51 +CP C T+ V + +RY+C C K + ++ Sbjct: 40 KCPKCGNITSKVHDYHTQKVKDVPIMGKKTYLIIRKRRYVCKACGKKFFEHISFLGKSQR 99 Query: 52 KHQKIIDMAMNGV----GCRASARIMGVGLNTVLRHLKNSGRSR 91 ++ ++ + + A+ V + TV+R + Sbjct: 100 MTNRLAAYIISQLGSLTSMKEIAKHTNVSVTTVMRLFDKVNPGQ 143 >UniRef50_D2M0Z3 Two component transcriptional regulator, LuxR family n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2M0Z3_BACS4 Length = 204 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 29/69 (42%) Query: 22 KSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVL 81 K G Q +L + + ++ + +++ M + G + +A+I+ + TV Sbjct: 117 KVLNGEQAFLNNGSPRNYREIDEEFHELSKREEEVFYMKLRGYTVKDTAQILNISPKTVE 176 Query: 82 RHLKNSGRS 90 H +N + Sbjct: 177 NHRRNIRKK 185 >UniRef50_UPI00016C46F4 hypothetical protein GobsU_15563 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C46F4 Length = 139 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 26/73 (35%), Gaps = 4/73 (5%) Query: 20 NGKSTAGHQRYLCSPCRKTWQL----QFTYTASQPGKHQKIIDMAMNGVGCRASARIMGV 75 G + C+ C K + K I + G G R + R+ G Sbjct: 32 WSSKPRGIRCLRCTACGKNFSERKGTPLFGLHMSDEKALDIAHHLVEGNGMRPTGRLCGG 91 Query: 76 GLNTVLRHLKNSG 88 LNTVLR + +G Sbjct: 92 TLNTVLRFARKAG 104 >UniRef50_UPI0001793827 PREDICTED: similar to CG5669 CG5669-PA n=1 Tax=Acyrthosiphon pisum RepID=UPI0001793827 Length = 640 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. 
Identities = 10/61 (16%), Positives = 21/61 (34%), Gaps = 7/61 (11%) Query: 10 SCSA----TEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 C ++ + R+ ++ G +R+ C+ C K + Q + M G Sbjct: 536 HCGKRFTRSDELQRHNRTHTGEKRFQCNECPKRFMRSD---HLQKHVRTHLKQKLMEGNS 592 Query: 66 C 66 Sbjct: 593 V 593 >UniRef50_Q87RY6 Putative resolvase n=3 Tax=Vibrio parahaemolyticus RepID=Q87RY6_VIBPA Length = 216 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 24/48 (50%) Query: 39 WQLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKN 86 W +F + +HQ+II++ + G R A+ +G +TV R K Sbjct: 161 WAEKFQGRKANTKQHQRIIELLLEGKSIRGVAQELGCNASTVQRVKKK 208 >UniRef50_B9ZCS9 DNA topoisomerase type IA zn finger domain protein n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZCS9_NATMA Length = 244 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 35/91 (38%), Gaps = 10/91 (10%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAM 61 + CP C ++ V+NG Q YLC C +T+ + F ++ K I + Sbjct: 26 VTCPRC-RSDLTVKNGSYGH-FQHYLCKNCDRTFNDKTGTIFAHSKVALRKWLFSIYAFL 83 Query: 62 N-GVGCRASARIMGV-GLNTVLRHLKNSGRS 90 + + T+ +H++ ++ Sbjct: 84 RFNTSLHQL--QLEIDQYKTIYQHIERFTKA 112 >UniRef50_B2JMI5 Transposase n=2 Tax=Burkholderia RepID=B2JMI5_BURP8 Length = 75 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 9/39 (23%), Positives = 17/39 (43%) Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLK 85 + + M NG A+A+I+G+ T+ +K Sbjct: 7 RYTLEFKIEAVRMVRNGQSQAATAKILGISTQTLNAWIK 45 >UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJK4_ACIJU Length = 460 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. 
Identities = 15/93 (16%), Positives = 32/93 (34%), Gaps = 20/93 (21%) Query: 7 RCPSCSATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTASQP 50 +CP C ++ + ++G +RY C C+ T+ + T Sbjct: 33 KCPKCG-SDQLYKHGTKPVIYRDIPRHMKPTVINVEVKRYRCKSCKATFLQEVTGIYPDT 91 Query: 51 GKHQKIIDMAMN---GVGCRASARIMGVGLNTV 80 ++ + + +AR+MG T+ Sbjct: 92 RMTERFVKKIQDICLDYTFSDTARMMGCDSKTI 124 >UniRef50_C9RDH8 Regulatory protein LacI n=1 Tax=Ammonifex degensii KC4 RepID=C9RDH8_AMMDK Length = 435 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 12/34 (35%), Positives = 17/34 (50%) Query: 55 KIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSG 88 +++ + G R AR +GV NTV R L N Sbjct: 398 RVVRLRAEGRSLREIAREVGVSKNTVARWLNNLS 431 >UniRef50_B2TB85 Transposase IS3/IS911 family protein n=2 Tax=Burkholderia RepID=B2TB85_BURPP Length = 145 Score = 42.2 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 11/49 (22%), Positives = 18/49 (36%), Gaps = 1/49 (2%) Query: 44 TYTASQPGKHQKIIDMAMNGV-GCRASARIMGVGLNTVLRHLKNSGRSR 91 Y P + ++I++ ++G AR V N V K R Sbjct: 29 KYRRRTPDEKRQIVEETLSGGGSVAEIARSHKVSANQVFDWRKQYLDGR 77 >UniRef50_B9JG85 Putative uncharacterized protein n=1 Tax=Agrobacterium radiobacter K84 RepID=B9JG85_AGRRK Length = 191 Score = 42.2 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 3/78 (3%) Query: 15 EGVVR-NGKSTAGHQRYLCSPCRKTWQLQF--TYTASQPGKHQKIIDMAMNGVGCRASAR 71 +GVVR G S AG + C C ++ + + K + + + +A Sbjct: 9 DGVVRARGPSEAGLPVFRCLACDVHFRRTTGTPPSGLKFRKLELFVRLLSQQRPITDAAE 68 Query: 72 IMGVGLNTVLRHLKNSGR 89 ++ V + TV+R +K + Sbjct: 69 MIDVKVVTVIRWVKRMRQ 86 >UniRef50_A5VLK7 Transposase, IS204/IS1001/IS1096/IS1165 family protein n=19 Tax=Lactobacillus reuteri RepID=A5VLK7_LACRD Length = 343 Score = 42.2 bits (98), Expect = 0.006, Method: Composition-based stats. 
Identities = 16/105 (15%), Positives = 30/105 (28%), Gaps = 22/105 (20%) Query: 8 CPSCSATEG--VVRNGKSTAGH----------------QRYLCSPCRKTWQLQFT----Y 45 CP+C +++ G A H QR+ C C T+ Sbjct: 47 CPNCGVINRGQILKYGFYQAKHKYGQFRTQPLVLLVKTQRFQCPDCHTTFNATSYLFEKQ 106 Query: 46 TASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 +++I + A + + +V R L + Sbjct: 107 RTISRDLRREVILRLTRIQTIKDIAHDLFISEASVQRILLDLADQ 151 >UniRef50_D2MKS9 ISXo5 transposase n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MKS9_9BACT Length = 293 Score = 42.2 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 28/94 (29%), Gaps = 9/94 (9%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQ----LQFTYTASQPGKHQKIIDM 59 CP C + + + G R+ C CR ++ F T K I + Sbjct: 27 EEPTCPHCESPHVARKADGTRQG--RWNCHGCRSSFTVLSGTIFEKTRIPLQKWFLAIGL 84 Query: 60 AMN---GVGCRASARIMGVGLNTVLRHLKNSGRS 90 +N + AR + + T + Sbjct: 85 IVNAKKSLSSCQLARDLSLTQPTAWYIQARIRSA 118 >UniRef50_Q5ZT03 Transposase (IS652) n=29 Tax=Gammaproteobacteria RepID=Q5ZT03_LEGPH Length = 399 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 19/99 (19%), Positives = 35/99 (35%), Gaps = 19/99 (19%) Query: 6 IRCPSCSATEGVVRNGKSTA------GHQR---------YLCSPCRKTWQLQF----TYT 46 +RC C + V++ G +R Y C C + + +F Y Sbjct: 42 VRCIHCGNKKLRVKDSFIRRIRHESIGLRRSYLCLKAHKYYCPSCGRYFNQRFPGIGKYQ 101 Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLK 85 + +++ GV + AR + +G +TV R Sbjct: 102 RASESLRKQVFHYHSKGVSQKDLARDLKLGKSTVERWYH 140 >UniRef50_UPI000186E028 transcription factor Sp4, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E028 Length = 749 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. 
Identities = 10/62 (16%), Positives = 27/62 (43%), Gaps = 5/62 (8%) Query: 10 SCSA----TEGVVRNGKSTAGHQRYLCSPCRKTWQL-QFTYTASQPGKHQKIIDMAMNGV 64 C ++ + R+ ++ G +R+ C C+K + + + QK+++ A + + Sbjct: 656 YCGKRFTRSDELQRHRRTHTGEKRFQCPDCQKKFMRSDHLSKHIKTHQKQKLMEAATSTI 715 Query: 65 GC 66 Sbjct: 716 SL 717 >UniRef50_Q9SVC5 Dof zinc finger protein DOF3.5 n=2 Tax=Arabidopsis thaliana RepID=DOF35_ARATH Length = 247 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 13/41 (31%), Positives = 17/41 (41%), Gaps = 3/41 (7%) Query: 2 ASISIRCPSCSATEG--VVRNGKSTAGHQRYLCSPCRKTWQ 40 A I+ CP C ++ N S RY C CR+ W Sbjct: 21 AEITPSCPRCGSSNTKFCYYNNYSLTQ-PRYFCKGCRRYWT 60 >UniRef50_C7XW38 Transposase ISLasa4v n=4 Tax=Lactobacillus RepID=C7XW38_9LACO Length = 436 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 29/113 (25%), Gaps = 30/113 (26%) Query: 3 SISIRCPSCSATEGVVRNGKSTAG-------------------HQRYLCS---PCRKTWQ 40 S+ + CP C + RNG Q+YLC C Sbjct: 37 SLPMHCPVCGQ--LMQRNGWLRRRPVKIKILSIAGQPTVLSIIKQQYLCKPSASCPHPVT 94 Query: 41 LQFTYTASQPG------KHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 Q G Q I + AR V NTV R L Sbjct: 95 CVAPIQGIQKGCRIANLVKQHITLELTQNISQTTIARQHNVSTNTVSRVLTQM 147 >UniRef50_D2LK53 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LK53_RHOVA Length = 249 Score = 41.8 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 23/93 (24%), Gaps = 12/93 (12%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQK----IIDMA 60 + CP C T + C C K + L + I + Sbjct: 19 NPVCPECGGTNH-----YDLKSRPVWKCKACSKQFSLTSGTIFHSRKLRIRDILGAIAIF 73 Query: 61 MN---GVGCRASARIMGVGLNTVLRHLKNSGRS 90 N G +R +G T L S Sbjct: 74 TNGAKGYSALQLSRDLGCDYKTCFVLLHKLRES 106 >UniRef50_C4W7G8 Transposase for ISSha1 n=2 Tax=Staphylococcus RepID=C4W7G8_STAWA Length = 434 Score = 41.8 bits (97), Expect = 0.007, Method: Composition-based stats. 
Identities = 17/110 (15%), Positives = 31/110 (28%), Gaps = 23/110 (20%) Query: 4 ISIRCPSCS---ATEGVVRNGKSTAG----------------HQRYLCSPCRKTWQLQFT 44 I + C C ++++G Q + C C +T+ Q Sbjct: 42 IPMGCECCGIKNDNHLIIKHGFRETKVYMGLILERPAYLQLKKQSFYCKECGQTFTAQTP 101 Query: 45 Y----TASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 Y ++ + A + +TV R+LK S Sbjct: 102 YIEPRCRISKDVKLMMMKKLAKVSSEKDVANSLFHSPSTVHRYLKEVSSS 151 >UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C135_9GAMM Length = 372 Score = 41.8 bits (97), Expect = 0.008, Method: Composition-based stats. Identities = 11/37 (29%), Positives = 16/37 (43%) Query: 53 HQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGR 89 + I G RA+ARI+ V +T L + R Sbjct: 37 FEIAIRAFAEGNSIRATARILQVDKDTACDWLHRAAR 73 >UniRef50_B8FWC8 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FWC8_DESHD Length = 60 Score = 41.4 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 10/33 (30%), Positives = 17/33 (51%) Query: 56 IIDMAMNGVGCRASARIMGVGLNTVLRHLKNSG 88 I+D+ G R A+ +GV TV+ ++ G Sbjct: 13 ILDLYKQGYTSREIAKQVGVSPTTVMNRIRKYG 45 >UniRef50_A9IG79 ISSod11, transposase n=14 Tax=Proteobacteria RepID=A9IG79_BORPD Length = 223 Score = 41.4 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 27/91 (29%), Gaps = 13/91 (14%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAMN- 62 CP C V R A R +C C+ + F T + N Sbjct: 46 CPRCGNAGDVYR-----ASRTRLMCRSCQYQGTVTSGTIFDKTRTPLRVWLAAAWYLTNQ 100 Query: 63 --GVGCRASARIMGV-GLNTVLRHLKNSGRS 90 GV R++G+ T L R+ Sbjct: 101 KQGVSALGLQRVLGLGSYQTAWTMLHRFRRA 131 >UniRef50_A8YX76 Transposase n=42 Tax=Lactobacillus RepID=A8YX76_LACH4 Length = 426 Score = 41.4 bits (96), Expect = 0.009, Method: Composition-based stats. 
Identities = 21/106 (19%), Positives = 31/106 (29%), Gaps = 22/106 (20%) Query: 4 ISIRCPSCSATEGVVRNGK-----------------STAGHQRYLCSPCRK----TWQLQ 42 I C C + + + NG QR C C + +L Sbjct: 43 IQPACLFCGSLDLLH-NGHLITNIHYPTANASLPVIIRLAKQRVKCRDCERWSMAQSELV 101 Query: 43 FTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSG 88 Y + K++ + AR V +NTV R L N Sbjct: 102 NKYCSISNASKLKVLSALTEDRSMTSIARENNVSINTVQRVLGNCS 147 >UniRef50_A0Q207 Transcriptional regulator n=3 Tax=Clostridium RepID=A0Q207_CLONN Length = 520 Score = 41.4 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 8/35 (22%), Positives = 16/35 (45%) Query: 53 HQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 + II R +A+++GV T++ +K Sbjct: 481 KEAIIKALKKNKTFRKTAKVLGVSHTTIINKIKKY 515 >UniRef50_Q894I5 Phage-related protein n=1 Tax=Clostridium tetani RepID=Q894I5_CLOTE Length = 142 Score = 41.4 bits (96), Expect = 0.010, Method: Composition-based stats. Identities = 9/34 (26%), Positives = 19/34 (55%) Query: 53 HQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKN 86 K +++ +NG +A+I+GV T+ R ++ Sbjct: 6 KIKAMELLLNGETITDTAKIVGVERKTIYRWMEK 39 >UniRef50_B2JXE0 Putative uncharacterized protein n=2 Tax=Burkholderiaceae RepID=B2JXE0_BURP8 Length = 358 Score = 41.4 bits (96), Expect = 0.010, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 34/88 (38%), Gaps = 7/88 (7%) Query: 8 CPSCSATEGVVRNGKSTAGH---QRYLCSPCRKTWQLQFTYTASQPGKHQKI---IDMAM 61 CP C T +++ G + Y C C + S+ Q+ I + Sbjct: 59 CPRCRGT-RILKKGYARLRTGPLPTYRCEQCGHCFSRLSGTPLSKRPVRQQAGELIALLP 117 Query: 62 NGVGCRASARIMGVGLNTVLRHLKNSGR 89 + C +AR +GV +TVL ++ R Sbjct: 118 QEISCAEAARQLGVMEHTVLETVRLVRR 145 >UniRef50_C7PCU2 Two component transcriptional regulator, LuxR family n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PCU2_CHIPD Length = 220 Score = 41.0 bits (95), Expect = 0.010, Method: Composition-based stats. 
Identities = 7/45 (15%), Positives = 17/45 (37%) Query: 46 TASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRS 90 +++ M G+ + A+ + + + TV H N + Sbjct: 151 HHKLSKTEFRVMQMIAEGMSTKEIAQSLNISIKTVENHRHNISKK 195 >UniRef50_A7BQK2 Transposase n=3 Tax=Bacteria RepID=A7BQK2_9GAMM Length = 391 Score = 41.0 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 11/35 (31%), Positives = 19/35 (54%) Query: 52 KHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKN 86 +II ++ G G R +R++G+ +TV R K Sbjct: 28 VRSRIILLSDEGFGSRKVSRMLGISRDTVQRWRKR 62 >UniRef50_Q3C030 Putative sigma-54-dependent transcriptional regulator n=1 Tax=Xanthomonas campestris pv. vesicatoria str. 85-10 RepID=Q3C030_XANC5 Length = 363 Score = 41.0 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 10/33 (30%), Positives = 18/33 (54%) Query: 54 QKIIDMAMNGVGCRASARIMGVGLNTVLRHLKN 86 +++ + +G+ RA A+ +GV TV R L Sbjct: 331 LQVMRLHADGLSMRAIAKHVGVSAATVSRWLNK 363 >UniRef50_C7YZZ3 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YZZ3_NECH7 Length = 917 Score = 41.0 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 19/73 (26%), Positives = 28/73 (38%), Gaps = 6/73 (8%) Query: 1 MASISIRCPSCSAT----EGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKI 56 M I+C C+ T E + R+ +S + Y C C K++ Q Sbjct: 1 MVETQIKCDICNCTFARQEHLTRHSRSHTREKPYQCLQCSKSFSRLDVLQRHISSHEQSA 60 Query: 57 IDMAMNGVGCRAS 69 D+A GV RA Sbjct: 61 SDLA--GVSTRAC 71 >UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C34261 Length = 387 Score = 41.0 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 13/35 (37%), Positives = 18/35 (51%), Gaps = 1/35 (2%) Query: 8 CPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ 42 CP C E + R G++ G QR C C+K W + Sbjct: 74 CPDCYQRETI-RYGRNPQGSQRVQCRACKKVWTPK 107 >UniRef50_B8HUB6 Putative uncharacterized protein n=1 Tax=Cyanothece sp. 
PCC 7425 RepID=B8HUB6_CYAP4 Length = 144 Score = 41.0 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 13/38 (34%), Positives = 18/38 (47%), Gaps = 2/38 (5%) Query: 5 SIRCPSCSATEGVVRN--GKSTAGHQRYLCSPCRKTWQ 40 S++CP C + G G QR+ C CRKT+ Sbjct: 106 SLQCPYCEGRYLTKKGFTGCYQTGRQRWFCKDCRKTFS 143 >UniRef50_Q6K537 Os02g0252400 protein n=6 Tax=Oryza sativa RepID=Q6K537_ORYSJ Length = 373 Score = 41.0 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 15/43 (34%), Positives = 19/43 (44%), Gaps = 3/43 (6%) Query: 1 MASISIRCPSCSA--TEGVVRNGKSTAGHQRYLCSPCRKTWQL 41 M S CP C++ T+ N S A RY C CR+ W Sbjct: 40 MKKSSPCCPRCNSIKTKFCYYNNYSMAQ-PRYFCRECRRYWTQ 81 >UniRef50_B2UM39 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UM39_AKKM8 Length = 313 Score = 41.0 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 18/95 (18%), Positives = 33/95 (34%), Gaps = 14/95 (14%) Query: 6 IRCPSCSATEG---VVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIID 58 + CP C +E V RNG + C C K + ++ F + K Sbjct: 35 VVCPFCGKSEKQYRVKRNGVEGY----FECGECGKVYTVRTGTIFERSHVPLHKWIFAFY 90 Query: 59 MAMN---GVGCRASARIMGVGLNTVLRHLKNSGRS 90 + + G+ ++ +GV T L+ + Sbjct: 91 LVVTSRKGISSMQLSKEIGVTQKTAWFMLQRIREA 125 >UniRef50_B8F7J2 Putative uncharacterized protein n=1 Tax=Haemophilus parasuis SH0165 RepID=B8F7J2_HAEPS Length = 233 Score = 41.0 bits (95), Expect = 0.012, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 2/41 (4%) Query: 4 ISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFT 44 +C C +++ + ++G QRY C+ C KT+ L+ Sbjct: 35 EPKKCHFCHSSD-IRKHG-IRNNIQRYKCNACNKTFTLKKK 73 >UniRef50_A2A935 PR domain zinc finger protein 16 n=35 Tax=Euteleostomi RepID=PRD16_MOUSE Length = 1275 Score = 41.0 bits (95), Expect = 0.012, Method: Composition-based stats. 
Identities = 10/57 (17%), Positives = 19/57 (33%), Gaps = 4/57 (7%) Query: 3 SISIRCPSCSA----TEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQK 55 C C + + R+ ++ G Q Y C C +++ + H K Sbjct: 948 KERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 1004 >UniRef50_A7UZI1 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7UZI1_BACUN Length = 321 Score = 41.0 bits (95), Expect = 0.012, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 30/92 (32%), Gaps = 8/92 (8%) Query: 5 SIRCPSCS-ATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDM 59 CP C+ T+ + + Y C C+K + + F + K + + Sbjct: 49 EPICPHCNCQTKEHYKLKSNGEFKGLYKCKRCKKRFTVTIGTMFEGSHVSLKKWFYALYI 108 Query: 60 AM---NGVGCRASARIMGVGLNTVLRHLKNSG 88 + G+ ++ + V T L+ Sbjct: 109 FLAHKKGISSIQLSKDIDVTQKTAWFMLERIR 140 >UniRef50_A3VEU0 ISSpo8, transposase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VEU0_9RHOB Length = 308 Score = 41.0 bits (95), Expect = 0.012, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 27/95 (28%), Gaps = 14/95 (14%) Query: 5 SIRCPSCSAT--EGVVRNGKSTAGHQRYLCSPCRKTWQLQFTY----TASQPGKHQKIID 58 C C + + Y C CRK + ++ + K I Sbjct: 34 EPVCGHCGSVSVTECKDHKPMP-----YRCKDCRKHFSVRTGTVLAESRLPLQKWLLAIF 88 Query: 59 MAMN---GVGCRASARIMGVGLNTVLRHLKNSGRS 90 M + G+ AR +GV T + + Sbjct: 89 MLTSARKGIPSTQMARELGVTQKTAWFLAQRIRET 123 >UniRef50_Q5NZ47 Putative uncharacterized protein n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5NZ47_AZOSE Length = 266 Score = 41.0 bits (95), Expect = 0.013, Method: Composition-based stats. 
Identities = 15/81 (18%), Positives = 27/81 (33%), Gaps = 9/81 (11%) Query: 5 SIRCPSCSATEGVVRNGKSTAGHQR------YLCSPCRKTWQLQFTYTASQPGKHQKIID 58 ++ CPSC + + +S Q Y C C + + ++ Sbjct: 6 TVTCPSCGSADCRKSKWQSEGERQVQTGKRPYRCRACTHRFHAPEHKRPRWRDRASFMVP 65 Query: 59 MAMNGVGCRASARIMGVGLNT 79 + G A A ++ VG T Sbjct: 66 ALLMGA---AIAAVIVVGART 83 >UniRef50_B5VVQ8 KWG Leptospira repeat protein n=8 Tax=Arthrospira RepID=B5VVQ8_SPIMA Length = 267 Score = 41.0 bits (95), Expect = 0.013, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 18/50 (36%), Gaps = 1/50 (2%) Query: 40 QLQFTYTASQPGKHQKIIDMAMNG-VGCRASARIMGVGLNTVLRHLKNSG 88 T + +KI++ G R A+ V +TV R +K Sbjct: 201 SQPTTMSRYSLDFRKKIVEAYEKGDTSIRKVAKRFLVSPDTVRRLVKQYR 250 >UniRef50_Q9HAZ2 PR domain zinc finger protein 16 n=26 Tax=Euteleostomi RepID=PRD16_HUMAN Length = 1276 Score = 40.7 bits (94), Expect = 0.013, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 19/57 (33%), Gaps = 4/57 (7%) Query: 3 SISIRCPSCSA----TEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQK 55 C C + + R+ ++ G Q Y C C +++ + H K Sbjct: 948 KERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 1004 >UniRef50_Q3D1N8 Transposase, ISL3 family n=13 Tax=Bacilli RepID=Q3D1N8_STRAG Length = 440 Score = 40.7 bits (94), Expect = 0.014, Method: Composition-based stats. Identities = 14/100 (14%), Positives = 29/100 (29%), Gaps = 20/100 (20%) Query: 7 RCPSCSATEGVVRNGKSTA----------------GHQRYLCSPCRKTWQLQFTYTASQP 50 RCP C + + ++ +RY C C T+ + + Sbjct: 35 RCPECG-FDKLYKHSSRNQLIMDLPIRLKRVGLHLNRRRYKCRECGSTFWERLISVDEKR 93 Query: 51 GKHQKIIDMAMNGV---GCRASARIMGVGLNTVLRHLKNS 87 ++++ A +GV T+ K+ Sbjct: 94 SMTKRLLKSIQEQSMSKTFVEVAESVGVDEKTIRNVFKDY 133 >UniRef50_A3DPW4 Putative uncharacterized protein n=1 Tax=Staphylothermus marinus F1 RepID=A3DPW4_STAMF Length = 240 Score = 40.7 bits (94), Expect = 0.014, Method: Composition-based stats. 
Identities = 10/44 (22%), Positives = 22/44 (50%) Query: 45 YTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSG 88 + +KI+ + G+ + ++++G+ TVL HL +G Sbjct: 51 PSQRSKELREKIVSLYKQGLSGKQISKMLGINYQTVLYHLHKAG 94 >UniRef50_C0W2A4 Transposase (Fragment) n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W2A4_9ACTO Length = 195 Score = 40.7 bits (94), Expect = 0.015, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 20/50 (40%), Gaps = 3/50 (6%) Query: 20 NGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRAS 69 +GK+ AG QR+ C C T A + + ++G + + Sbjct: 1 HGKTKAGRQRWRCKSCSITNLNPINTDAKNL---ELFLSWLLSGKTLKDT 47 >UniRef50_C6P8Q1 Transposase IS3/IS911 family protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6P8Q1_CLOTS Length = 91 Score = 40.7 bits (94), Expect = 0.015, Method: Composition-based stats. Identities = 8/45 (17%), Positives = 17/45 (37%) Query: 47 ASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 Q+I ++ +G +R GV T+ + +K + Sbjct: 7 RYSEEFKQQIAELYQSGQSVLDLSREYGVTTVTIYKWIKQLSPVK 51 >UniRef50_A6Q3M3 Transposase n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q3M3_NITSB Length = 211 Score = 40.7 bits (94), Expect = 0.016, Method: Composition-based stats. Identities = 13/85 (15%), Positives = 28/85 (32%), Gaps = 13/85 (15%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVG 65 +RC C+A ++ NG C C++ + + +KI+ + Sbjct: 1 MRCIYCNAPTYLLGNG-------NRKCKRCKRKFSPEKIAR------KEKIVKCFCENLS 47 Query: 66 CRASARIMGVGLNTVLRHLKNSGRS 90 R G T+ + + + Sbjct: 48 VNECMRQSGYNYVTIKNYYEMFRKK 72 >UniRef50_A5KRX5 ISSpo8, transposase n=2 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KRX5_9BACT Length = 275 Score = 40.7 bits (94), Expect = 0.016, Method: Composition-based stats. 
Identities = 19/90 (21%), Positives = 33/90 (36%), Gaps = 11/90 (12%) Query: 6 IRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDM-- 59 + CP C + V+ K +G R C+ CR + ++ F + K I + Sbjct: 32 VVCPKCGEID--VKYYKLASG--RMKCASCRSPFTVRMGSIFEESPVPLQKWFLAIYLCT 87 Query: 60 -AMNGVGCRASARIMGVGLNTVLRHLKNSG 88 GV ++ +GV T L+ Sbjct: 88 SLKKGVSSIQLSKYIGVTQKTAWFMLQRIR 117 >UniRef50_A8KYE3 Two component transcriptional regulator, LuxR family n=36 Tax=Actinomycetales RepID=A8KYE3_FRASN Length = 249 Score = 40.7 bits (94), Expect = 0.016, Method: Composition-based stats. Identities = 10/33 (30%), Positives = 18/33 (54%) Query: 55 KIIDMAMNGVGCRASARIMGVGLNTVLRHLKNS 87 +++ + G+ R +AR +GV TV H+ N Sbjct: 192 EVLRLVAKGLSARDAARQLGVSHRTVQNHVHNV 224 >UniRef50_Q3QZA8 Putative uncharacterized protein n=1 Tax=Xylella fastidiosa subsp. sandyi Ann-1 RepID=Q3QZA8_XYLFA Length = 243 Score = 40.7 bits (94), Expect = 0.016, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 21/61 (34%) Query: 24 TAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNGVGCRASARIMGVGLNTVLRH 83 T G+QRY R L ++ RA A+ +GV TVL Sbjct: 71 TTGNQRYHIKEIRCQQLLPPKRKKMDVKTLTELALKKSGDTSIRAFAKRLGVSHVTVLAW 130 Query: 84 L 84 L Sbjct: 131 L 131 >UniRef50_C3XYB0 Putative uncharacterized protein n=2 Tax=Chordata RepID=C3XYB0_BRAFL Length = 1482 Score = 40.7 bits (94), Expect = 0.016, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 19/57 (33%), Gaps = 4/57 (7%) Query: 3 SISIRCPSCSA----TEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQK 55 C C + + R+ ++ G Q Y C C +++ + H K Sbjct: 786 KDRYTCRYCGKLFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 842 >UniRef50_B2KBW2 Two component transcriptional regulator, LuxR family n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KBW2_ELUMP Length = 215 Score = 40.7 bits (94), Expect = 0.017, Method: Composition-based stats. 
Identities = 18/69 (26%), Positives = 29/69 (42%), Gaps = 8/69 (11%) Query: 30 YLCSPCRKT----WQLQFTYTASQPG----KHQKIIDMAMNGVGCRASARIMGVGLNTVL 81 YLCS K + + + + P K ++I+ + G + A+ G+ LNTV Sbjct: 126 YLCSKVSKFVVQGFLGKASPSKKDPSGLTPKEKQILQLIAEGFSSKEIAKEFGLSLNTVH 185 Query: 82 RHLKNSGRS 90 H N R Sbjct: 186 VHRNNIMRK 194 >UniRef50_UPI00015B4C26 PREDICTED: similar to HAMLET n=1 Tax=Nasonia vitripennis RepID=UPI00015B4C26 Length = 1136 Score = 40.7 bits (94), Expect = 0.017, Method: Composition-based stats. Identities = 10/58 (17%), Positives = 19/58 (32%), Gaps = 4/58 (6%) Query: 2 ASISIRCPSCSA----TEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQK 55 C C + + R+ ++ G Q Y C C +++ + H K Sbjct: 918 IKDRYSCKFCGKNFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHDK 975 >UniRef50_Q2RLR5 Integrase, catalytic region n=5 Tax=Clostridia RepID=Q2RLR5_MOOTA Length = 495 Score = 40.3 bits (93), Expect = 0.017, Method: Composition-based stats. Identities = 17/40 (42%), Positives = 24/40 (60%) Query: 52 KHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR 91 K Q+I + GVG R AR +GV NTV ++LK +G + Sbjct: 3 KWQRIKALHAQGVGIRQIARDVGVSRNTVRKYLKEAGPPQ 42 >UniRef50_D1W685 Putative uncharacterized protein n=2 Tax=Prevotella RepID=D1W685_9BACT Length = 298 Score = 40.3 bits (93), Expect = 0.017, Method: Composition-based stats. Identities = 15/75 (20%), Positives = 26/75 (34%), Gaps = 8/75 (10%) Query: 17 VVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQPGKHQKIIDMAMNG-VGCRASARIMGV 75 VV+ G QR+ C C +T+ + + + G + A V Sbjct: 2 VVKRGFHKN-RQRWYCKSCGRTFV------GHKRLTEETVNTRYSKGNLTVEDLATEYAV 54 Query: 76 GLNTVLRHLKNSGRS 90 TV R L + ++ Sbjct: 55 STRTVYRRLSKTYKA 69 >UniRef50_B6ARX3 Transposase n=15 Tax=Bacteria RepID=B6ARX3_9BACT Length = 314 Score = 40.3 bits (93), Expect = 0.017, Method: Composition-based stats. 
Identities = 23/91 (25%), Positives = 29/91 (31%), Gaps = 15/91 (16%)

Query: 7  RCPSC-SATEGVVRNGKSTAGHQRYLCSPCRKTWQLQ----FTYTASQPGKHQKIIDMAM 61
          RCPSC      V+ NG       RY C+ CR    L     F  T          I +
Sbjct: 41 RCPSCEGDRFCVLSNG-------RYQCNSCRHQASLTSGTLFAGTKLPLTVWFLAIYLLT 93

Query: 62 ---NGVGCRASARIMGVGLNTVLRHLKNSGR 89
             N +      R +GV  NT         +
Sbjct: 94 QSKNAMSALELKRQLGVCYNTAWLLKHKVLQ 124

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.314    0.149    0.527

Lambda     K      H
   0.267   0.0461    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 573,812,454
Number of Sequences: 3077464
Number of extensions: 19887569
Number of successful extensions: 171670
Number of sequences better than 1.0e-01: 250
Number of HSP's better than 0.1 without gapping: 270
Number of HSP's successfully gapped in prelim test: 698
Number of HSP's that attempted gapping in prelim test: 168313
Number of HSP's gapped (non-prelim): 3569
length of query: 91
length of database: 1,040,396,356
effective HSP length: 61
effective length of query: 30
effective length of database: 852,671,052
effective search space: 25580131560
effective search space used: 25580131560
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 87 (38.0 bits)
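Note: the statistics footer above can be cross-checked by hand. The following is a minimal Python sketch, not part of the BLAST output, that recomputes the effective lengths, the effective search space, and the bit-score conversions from the printed values. It assumes the usual Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, and E = search space * 2^(-bit score)) and assumes that the first Lambda/K row holds the ungapped parameters and the second row the gapped ones, since the footer does not label them. All function and variable names are illustrative.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 91
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 61  # "effective HSP length"

# Assumption: first Lambda/K row is ungapped, second row is gapped.
UNGAPPED = (0.314, 0.149)
GAPPED = (0.267, 0.0461)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits at or above the given bit score."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_query, eff_db, space)            # 30 852671052 25580131560
print(round(bit_score(41, *UNGAPPED), 1))  # S1: 21.3
print(round(bit_score(87, *GAPPED), 1))    # S2: 38.0
print(expect(41.8, space))                 # roughly 0.0067
```

The recomputed search space matches the "effective search space" line exactly, and the S1 and S2 conversions match the bit values printed next to them. The estimate for a 41.8-bit hit lands inside the 0.006 to 0.008 range reported for such hits above; small differences are expected because the printed bit scores are rounded and composition-based statistics may rescale individual alignments.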