BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (91 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo...   190   1e-47
UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone...   142   3e-33
UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI...   107   1e-22
UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter...    94   2e-18
UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax...    91   8e-18
UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr...    87   2e-16
UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl...    86   3e-16
UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin...    83   3e-15
UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep...    78   9e-14
UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter...    65   6e-10
UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling...    65   9e-10
UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX    62   4e-09
UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI...    58   9e-08
UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p...    57   2e-07
UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea...    56   4e-07
UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon...    54   1e-06
UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia...    53   3e-06
UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp....    50   2e-05
UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re...    49   4e-05
UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria...    49   5e-05
UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid...    48   9e-05
UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw...    46   4e-04
UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI...    45   6e-04
UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae...    44   0.001
UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE      44   0.001
UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum...    44   0.002
UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S...    43   0.003
UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano...    43   0.004
UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale...    42   0.005
UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ...    42   0.007
UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan...    41   0.011
UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseud...    40   0.018
UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru...    40   0.029
UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur...
39 0.056 >UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX Length = 91 Score = 190 bits (482), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 91/91 (100%), Positives = 91/91 (100%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 142 bits (359), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 66/68 (97%), Positives = 67/68 (98%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 +VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG Sbjct: 1 MVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 60 Query: 77 LNTIFRHL 84 LNTI RHL Sbjct: 61 LNTILRHL 68 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 107 bits (266), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 50/52 (96%), Positives = 50/52 (96%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGV LNTI RHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 93.6 bits (231), Expect = 2e-18, Method: Compositional matrix adjust. 
Identities = 40/90 (44%), Positives = 56/90 (62%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V + CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 MN G R TAR + + +N + R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 91.3 bits (225), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 42/90 (46%), Positives = 59/90 (65%), Gaps = 1/90 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASV+I CP C + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 NG G R TAR + +G+NT+ R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQS 89 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 87.0 bits (214), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 37/87 (42%), Positives = 55/87 (63%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA++ + C C+ T+ V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNS 87 MN G R TA ++ V NT+ LKNS Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNS 87 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 86.3 bits (212), Expect = 3e-16, Method: Compositional matrix adjust. 
Identities = 41/65 (63%), Positives = 47/65 (72%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ-KIIDM 59 MAS+ + PSC+ T+GV RNGKSTAGHQ YLC CRK W L FTYT SQ THQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AMNGV 64 + + Sbjct: 67 TIMAL 71 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 82.8 bits (203), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 38/82 (46%), Positives = 50/82 (60%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 I+CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M +G Sbjct: 5 DIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMMNDGS 64 Query: 65 GCRATARIMGVGLNTIFRHLKN 86 R AR +GV L T+ RHLK+ Sbjct: 65 EQRDIARKLGVSLETVLRHLKD 86 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 77.8 bits (190), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 38/91 (41%), Positives = 52/91 (57%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V+I CP C + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 NG G R TAR + +G NT+ R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 65.1 bits (157), Expect = 6e-10, Method: Compositional matrix adjust. 
Identities = 31/81 (38%), Positives = 46/81 (56%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + + C C +D VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRATARIMGVGLNTIF 81 G RAT+R + V NT+ Sbjct: 61 AQNHGKRATSRHLQVSYNTVL 81 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 64.7 bits (156), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M +++C T + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 +NG G R TAR++GV NT+ K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 62.4 bits (150), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA+V++ P C+ +D V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 58.2 bits (139), Expect = 9e-08, Method: Compositional matrix adjust. 
Identities = 32/87 (36%), Positives = 50/87 (57%), Gaps = 4/87 (4%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSH--C-RKTWQLQFTYTASQPGTHQKIIDMA 60 ++I CP C +TD VV+NG S G QRY C + C R+++ ++Y + ++I M Sbjct: 5 LAIECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMV 63 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNS 87 +NG G R TAR++ + T+ LK S Sbjct: 64 VNGSGIRDTARVLEISPITVASELKKS 90 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 31/84 (36%), Positives = 49/84 (58%), Gaps = 6/84 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 +SC C TD V R+GK + G+QR+ CS C++T+QL++ Y A + H++ + G Sbjct: 1 MSCRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVADR---HER---YSPGNAG 54 Query: 66 CRATARIMGVGLNTIFRHLKNSGR 89 R TAR++ VG + R K + R Sbjct: 55 IRDTARVLKVGCMGLTRFRKLNPR 78 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 55.8 bits (133), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 3/84 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+SV+I CP C + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MN--GVGCRATARIMGVGLNTIFR 82 N G+ AR+ G+ +F+ Sbjct: 60 FNEPGMMLARMARLHGIQPCQLFK 83 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 53.9 bits (128), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 1/64 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MAS++I CP C+ +D V R+GK+ AG+ RY C C +QL +TY A P + ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 53.1 bits (126), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 25/84 (29%), Positives = 44/84 (52%), Gaps = 1/84 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I+CP C V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKE-KKVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRATARIMGVGLNTIFRHLKNSGR 89 R+TARI+ + T+ + + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGR 94 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 4/82 (4%) Query: 9 PSCSATDGVVRNGKSTAGHQRYLCSHC---RKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 PSC ++D VV+ + T G QRY C + R T+ Q+ Y Q+I++M +NG G Sbjct: 9 PSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIVEMVVNGSG 67 Query: 66 CRATARIMGVGLNTIFRHLKNS 87 R AR++ + T+ LK S Sbjct: 68 TRDPARVLKISRTTVTETLKKS 89 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 2/87 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 +SCPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSC-GSHHVVKCGRPL-GRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRATARIMGVGLNTIFRHLKNSGRSR 91 RA +R++ V L T+F +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 32/82 (39%), Positives = 42/82 (51%), Gaps = 8/82 (9%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIIDMAMNG 63 CP C+ +D V+NGK+ HQRY+C C KT+ + T G K ID +N Sbjct: 48 CPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCLVNK 105 Query: 64 VGCRATARIMGVGLNTIF--RH 83 R TA+I G+ L T F RH Sbjct: 106 YPLRKTAKICGISLPTAFVWRH 127 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 48.1 bits (113), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 10/88 (11%) Query: 5 SISCPSC----SATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 +++CP C S DG+VR G QRY C CR + + T +K + + Sbjct: 3 TMNCPRCNNAHSCKDGIVR------GRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLY 56 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSG 88 + G+G RA RI+ + T+++ +K G Sbjct: 57 LEGLGFRAIGRILNISYGTVYQWVKACG 84 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 46.2 bits (108), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 21/51 (41%), Positives = 30/51 (58%) Query: 41 LQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 L Y A + ++II+MA G G R TA + +G+NT+ R LKNS +S Sbjct: 23 LTLAYEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 46/82 (56%), Gaps = 3/82 (3%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC 66 +CPSC +D V++NG S+ G +Y C+ CR+T+ + S+ ++I+ +N + Sbjct: 70 NCPSCK-SDKVIKNG-SSRGKTKYKCNVCRRTFYDANSRRMSRE-QKERILKEYLNRMSM 126 Query: 67 RATARIMGVGLNTIFRHLKNSG 88 R A++ G L T++ +K G Sbjct: 127 RGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust. Identities = 19/41 (46%), Positives = 26/41 (63%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL 41 MA + + CP + T V+RNG +T+G Q Y C C KT+QL Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQL 41 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 43.9 bits (102), Expect = 0.001, Method: Composition-based stats. Identities = 30/83 (36%), Positives = 46/83 (55%), Gaps = 10/83 (12%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT---YTASQPGTHQ--KIIDMAMN 62 CP C D V +NGKS G QRY+C CR ++ +FT ++ ++ G + K ++ + Sbjct: 53 CPKCQCKD-VNKNGKSN-GRQRYICKRCRTSFD-EFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRATARIMGVGLNTIF--RH 83 G+ R A +GVG+ T F RH Sbjct: 110 GLSIRKCAEEVGVGVKTSFYMRH 132 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 17/47 (36%), Positives = 27/47 (57%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA 47 M ++ + C C T+ V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRA 47 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust. Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 5/78 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + + T ++ ID MNG Sbjct: 52 CPLCGCIH-VVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRATARIMGVGLNTIF 81 + R TA G+ NT F Sbjct: 111 LSIRKTAVACGIHRNTAF 128 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 22/89 (24%), Positives = 45/89 (50%), Gaps = 10/89 (11%) Query: 6 ISCPSCSAT----DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 ++CP C+++ +G+V G Q Y C C + ++ TAS P ++ + + + Sbjct: 1 MNCPRCNSSTHKKNGIV------FGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYL 54 Query: 62 NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 G+G R+ R +GV ++ + +K G+ Sbjct: 55 EGLGFRSIGRFLGVSHVSVQKWIKKFGQE 83 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 42.4 bits (98), Expect = 0.005, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 2/80 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + VV+NG G Q YLC C + ++ + + M++NG+G Sbjct: 1 MDCPYCQSHK-VVKNGHRQ-GKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMG 58 Query: 66 CRATARIMGVGLNTIFRHLK 85 RA R+ G+ NTI ++ Sbjct: 59 FRAIERVTGISHNTILNWVR 78 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 41.6 bits (96), Expect = 0.007, Method: Composition-based stats. 
Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 10/83 (12%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS--QPGTHQKIIDMA---MN 62 CP C ++ + RNGK G QRY+C C+KT+ FT +A+ T K + A +N Sbjct: 54 CPLC-GSETISRNGKYN-GKQRYICKSCKKTFT-DFTNSATYKSKKTLDKWLKYAKCMIN 110 Query: 63 GVGCRATARIMGVGLNTIF--RH 83 G R +A+I+ + + T F RH Sbjct: 111 GYSIRKSAKIVEINIATSFFWRH 133 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 41.2 bits (95), Expect = 0.011, Method: Compositional matrix adjust. Identities = 21/89 (23%), Positives = 45/89 (50%), Gaps = 10/89 (11%) Query: 6 ISCPSCSAT----DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 ++CP C+++ +G+V G QRY C C + ++ T+ P ++ + + + Sbjct: 1 MNCPRCNSSTHKKNGIV------FGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYL 54 Query: 62 NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 G+G R+ R +GV ++ + +K G+ Sbjct: 55 EGLGFRSIGRFLGVSHVSVQKWIKKFGQE 83 >UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JSC1_YERPY Length = 53 Score = 40.4 bits (93), Expect = 0.018, Method: Compositional matrix adjust. Identities = 16/37 (43%), Positives = 21/37 (56%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRK 37 MA + CP C D V ++G +GHQRY C H +K Sbjct: 1 MAKIDEKCPFCERKDLVKKHGYGKSGHQRYRCPHAKK 37 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 39.7 bits (91), Expect = 0.029, Method: Compositional matrix adjust. 
Identities = 25/85 (29%), Positives = 38/85 (44%), Gaps = 1/85 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + C C +++ +V+NG S +G Q+Y C C L +KI+ Sbjct: 1 MIKETYECRECGSSN-IVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLK 85 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLK 84 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 38.9 bits (89), Expect = 0.056, Method: Composition-based stats. Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 4/79 (5%) Query: 9 PSCSAT-DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKI-IDMAMNGVGC 66 PSC G+V+NGK+ AG QR+LC C + +T+ H KI ID ++G Sbjct: 7 PSCDMCGHGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDI--RHFKIFIDWILSGESA 64 Query: 67 RATARIMGVGLNTIFRHLK 85 A+ +GV T+ R K Sbjct: 65 DHLAKRLGVTRRTLTRWFK 83 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo... 122 3e-27 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 115 5e-25 UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax... 112 3e-24 UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter... 107 1e-22 UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr... 102 3e-21 UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling... 101 7e-21 UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea... 100 1e-20 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 100 2e-20 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 97 2e-19 UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin... 95 6e-19 UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI... 88 7e-17 UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter... 
88 9e-17
UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia...    86   3e-16
UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp....    84   1e-15
UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI...    82   6e-15
UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria...    81   1e-14
UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI...    80   2e-14
UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid...    78   7e-14
UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p...    76   3e-13
UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon...    75   5e-13
UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX    74   1e-12
UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl...    71   9e-12
UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw...    66   4e-10
UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE      65   8e-10
UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae...    52   5e-06

Sequences not found previously or not previously below threshold:

UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano...    77   2e-13
UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan...    75   7e-13
UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S...    72   4e-12
UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale...    71   1e-11
UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J...    69   4e-11
UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W...    66   3e-10
UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ...    66   4e-10
UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 R...    64   2e-09
UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp...    63   4e-09
UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriale...    62   5e-09
UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ...    62   6e-09
UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcti...    62   6e-09
UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru...    61   8e-09
UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID...    61   9e-09
UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellula...    60   2e-08
UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida...    60   3e-08
UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=H...    59   5e-08
UniRef50_C1I4B6 Putative uncharacterized protein n=2 Tax=Clostri...    57   1e-07
UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani ...    57   1e-07
UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes ...    56   3e-07
UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae...    56   3e-07
UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavoba...    56   3e-07
UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen...    56   4e-07
UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorick...    55   5e-07
UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyric...    54   1e-06
UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyri...    54   1e-06
UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marin...    54   2e-06
UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium...    54   2e-06
UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_V...    53   3e-06
UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=enviro...    53   3e-06
UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium...    53   3e-06
UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum l...    53   3e-06
UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum...    53   3e-06
UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU    52   4e-06
UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillu...    52   9e-06
UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax...    51   9e-06
UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria ...    51   1e-05
UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gamm...    51   1e-05
UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Met...    51   1e-05
UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitroso...    51   1e-05
UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A...    51   1e-05
UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryoc...    50   2e-05
UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur...    50   2e-05
UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD        50   2e-05
UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202...    50   2e-05
UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodoba...    49   4e-05
UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=D...    49   4e-05
UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus ...    49   5e-05
UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus pl...    49   5e-05
UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelan...    49   6e-05
UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus...    48   8e-05
UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. ...    48   9e-05
UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidob...    48   1e-04
UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryoc...    47   1e-04
UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseud...    47   2e-04
UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoi...    47   2e-04
UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl...    47   3e-04
UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax...    47   3e-04
UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=...    46   3e-04
UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nod...    46   3e-04
UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobact...    46   3e-04
UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_...    46   3e-04
UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichod...    46   4e-04
UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervido...    46   4e-04
UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia ...    46   4e-04
UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C...    46   5e-04
UniRef50_P04137 Uncharacterized protein in transposable element ...    45   5e-04
UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacte...    45   6e-04
UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=...    45   6e-04
UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q...    45   6e-04
UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12...    45   7e-04
UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodoba...    45   8e-04
UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobi...    45   9e-04
UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_E...    45   9e-04
UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia R...    45   0.001
UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium ...    44   0.001
UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultu...    44   0.002
UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultu...    43   0.002
UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax...    43   0.002
UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q...    43   0.002
UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobact...    43   0.002
UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracas...    43   0.003
UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychro...    43   0.003
UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 2...    42   0.004
UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacte...    42   0.004
UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus ...    42   0.005
UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae...    42   0.006
UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synecho...    42   0.006
UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=...    42   0.006
UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanoth...    42   0.007
UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT...    42   0.008
UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methano...    42   0.008
UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus p...    42   0.008
UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostoca...    41   0.011
UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryoc...    41   0.014
UniRef50_B0V2Z3 Novel zinc finger protein (Fragment) n=2 Tax=Dan...    41   0.015
UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros...    40   0.018
UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methano...    40   0.019
UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolo...    40   0.023
UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candida...    40   0.024
UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_...    40   0.025
UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepI...    40   0.026
UniRef50_D2LYX8 Tn5468, transposition protein D n=1 Tax=Bacillus...    40   0.029
UniRef50_UPI000186E028 transcription factor Sp4, putative n=1 Ta...    40   0.032
UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parviba...    39   0.035
UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderi...    39   0.037
UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichod...    39   0.038
UniRef50_UPI0001793827 PREDICTED: similar to CG5669 CG5669-PA n=...    39   0.041
UniRef50_B8X8Z3 Resolvase n=1 Tax=Pectobacterium atrosepticum Re...    39   0.043
UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax...    39   0.043
UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacteriu...    39   0.052
UniRef50_Q4JSN3 Transposase for IS3507b n=53 Tax=Actinobacterida...    38   0.063
UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=...
38 0.069 UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2... 38 0.072 UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family pr... 38 0.074 UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH... 38 0.078 UniRef50_D1W685 Putative uncharacterized protein n=2 Tax=Prevote... 38 0.093 UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_M... 38 0.093 UniRef50_UPI000186EB06 zinc finger protein 705A, putative n=1 Ta... 38 0.099 >UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX Length = 91 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 91/91 (100%), Positives = 91/91 (100%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 115 bits (287), Expect = 5e-25, Method: Composition-based stats. Identities = 38/91 (41%), Positives = 52/91 (57%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V+I CP C + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 NG G R TAR + +G NT+ R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 112 bits (280), Expect = 3e-24, Method: Composition-based stats. 
Identities = 42/91 (46%), Positives = 59/91 (64%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASV+I CP C + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 NG G R TAR + +G+NT+ R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQSE 90 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 107 bits (267), Expect = 1e-22, Method: Composition-based stats. Identities = 40/90 (44%), Positives = 56/90 (62%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V + CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 MN G R TAR + + +N + R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. Identities = 37/91 (40%), Positives = 57/91 (62%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA++ + C C+ T+ V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 MN G R TA ++ V NT+ LKNS + + Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNSRQGK 91 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 101 bits (252), Expect = 7e-21, Method: Composition-based stats. 
Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M +++C T + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 +NG G R TAR++GV NT+ K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 3/89 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+SV+I CP C + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MN--GVGCRATARIMGVGLNTIFRHLKNS 87 N G+ AR+ G+ +F+ K Sbjct: 60 FNEPGMMLARMARLHGIQPCQLFKWKKQY 88 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 99.7 bits (247), Expect = 2e-20, Method: Composition-based stats. Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 2/87 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 +SCPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSCG-SHHVVKCGR-PLGRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRATARIMGVGLNTIFRHLKNSGRSR 91 RA +R++ V L T+F +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 97.0 bits (240), Expect = 2e-19, Method: Composition-based stats. 
Identities = 66/74 (89%), Positives = 67/74 (90%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 +VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG Sbjct: 1 MVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 60 Query: 77 LNTIFRHLKNSGRS 90 LNTI RHL Sbjct: 61 LNTILRHLNKLRPQ 74 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 95.1 bits (235), Expect = 6e-19, Method: Composition-based stats. Identities = 39/91 (42%), Positives = 53/91 (58%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M I+CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M Sbjct: 1 MKMGDIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMM 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 +G R AR +GV L T+ RHLK+ ++ Sbjct: 61 NDGSEQRDIARKLGVSLETVLRHLKDLRLNK 91 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 88.2 bits (217), Expect = 7e-17, Method: Composition-based stats. Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 4/87 (4%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLC--SHC-RKTWQLQFTYTASQPGTHQKIIDMA 60 ++I CP C +TD VV+NG S G QRY C C R+++ ++Y + ++I M Sbjct: 5 LAIECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMV 63 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNS 87 +NG G R TAR++ + T+ LK S Sbjct: 64 VNGSGIRDTARVLEISPITVASELKKS 90 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 87.8 bits (216), Expect = 9e-17, Method: Composition-based stats. 
Identities = 31/82 (37%), Positives = 46/82 (56%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + + C C +D VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRATARIMGVGLNTIFR 82 G RAT+R + V NT+ Sbjct: 61 AQNHGKRATSRHLQVSYNTVLS 82 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 86.3 bits (212), Expect = 3e-16, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 1/85 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I+CP C V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKEK-KVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRATARIMGVGLNTIFRHLKNSGRS 90 R+TARI+ + T+ + + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGRK 95 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 84.3 bits (207), Expect = 1e-15, Method: Composition-based stats. Identities = 30/94 (31%), Positives = 47/94 (50%), Gaps = 4/94 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHC---RKTWQLQFTYTASQPGTHQKII 57 M + PSC ++D VV+ + T G QRY C + R T+ Q+ Y Q+I+ Sbjct: 1 MVLEPVLYPSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIV 59 Query: 58 DMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 +M +NG G R AR++ + T+ LK S + Sbjct: 60 EMVVNGSGTRDPARVLKISRTTVTETLKKSSSAE 93 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 81.6 bits (200), Expect = 6e-15, Method: Composition-based stats. 
Identities = 25/85 (29%), Positives = 47/85 (55%), Gaps = 3/85 (3%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 + +CPSC +D V++NG S+ G +Y C+ CR+T+ + S+ ++I+ +N Sbjct: 67 IRPNCPSC-KSDKVIKNG-SSRGKTKYKCNVCRRTFYDANSRRMSR-EQKERILKEYLNR 123 Query: 64 VGCRATARIMGVGLNTIFRHLKNSG 88 + R A++ G L T++ +K G Sbjct: 124 MSMRGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 80.9 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 6/84 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF----TYTASQPGTHQKIIDMAMN 62 CP C+ +D V+NGK+ HQRY+C C KT+ T G K ID +N Sbjct: 47 HCPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCLVN 104 Query: 63 GVGCRATARIMGVGLNTIFRHLKN 86 R TA+I G+ L T F Sbjct: 105 KYPLRKTAKICGISLPTAFVWRHK 128 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 80.1 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 50/52 (96%), Positives = 50/52 (96%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGV LNTI RHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 78.2 bits (191), Expect = 7e-14, Method: Composition-based stats. 
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 10/88 (11%) Query: 5 SISCPSCS----ATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 +++CP C+ DG+VR G QRY C CR + + T +K + + Sbjct: 3 TMNCPRCNNAHSCKDGIVR------GRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLY 56 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSG 88 + G+G RA RI+ + T+++ +K G Sbjct: 57 LEGLGFRAIGRILNISYGTVYQWVKACG 84 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 77.0 bits (188), Expect = 2e-13, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++CP C+++ +NG G Q Y C C + ++ TAS P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYLEGLG 58 Query: 66 CRATARIMGVGLNTIFRHLKNSGR 89 R+ R +GV ++ + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 76.2 bits (186), Expect = 3e-13, Method: Composition-based stats. Identities = 31/85 (36%), Positives = 49/85 (57%), Gaps = 6/85 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 +SC C TD V R+GK + G+QR+ CS C++T+QL++ Y A + H++ + G Sbjct: 1 MSCRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVADR---HERY---SPGNAG 54 Query: 66 CRATARIMGVGLNTIFRHLKNSGRS 90 R TAR++ VG + R K + R Sbjct: 55 IRDTARVLKVGCMGLTRFRKLNPRQ 79 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 75.5 bits (184), Expect = 5e-13, Method: Composition-based stats. 
Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 1/64 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MAS++I CP C+ +D V R+GK+ AG+ RY C C +QL +TY A P + ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 75.1 bits (183), Expect = 7e-13, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++CP C+++ +NG G QRY C C + ++ T+ P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYLEGLG 58 Query: 66 CRATARIMGVGLNTIFRHLKNSGR 89 R+ R +GV ++ + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA+V++ P C+ +D V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 72.4 bits (176), Expect = 4e-12, Method: Composition-based stats. Identities = 28/83 (33%), Positives = 37/83 (44%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + + T ++ ID MNG Sbjct: 52 CPLCGCI-HVVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 + R TA G+ NT F Sbjct: 111 LSIRKTAVACGIHRNTAFLWRHK 133 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. 
rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 71.2 bits (173), Expect = 9e-12, Method: Composition-based stats. Identities = 41/65 (63%), Positives = 47/65 (72%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ-KIIDM 59 MAS+ + PSC+ T+GV RNGKSTAGHQ YLC CRK W L FTYT SQ THQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AMNGV 64 + + Sbjct: 67 TIMAL 71 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 71.2 bits (173), Expect = 1e-11, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 2/80 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + VV+NG G Q YLC C + ++ + + M++NG+G Sbjct: 1 MDCPYCQ-SHKVVKNGH-RQGKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMG 58 Query: 66 CRATARIMGVGLNTIFRHLK 85 RA R+ G+ NTI ++ Sbjct: 59 FRAIERVTGISHNTILNWVR 78 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 69.3 bits (168), Expect = 4e-11, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 3/86 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF-TYTASQPGTHQKIIDMAMNGV 64 + CP C AT+ + +NGK G Q ++C+ C + + + Q+ ++M +NG+ Sbjct: 1 MQCPYCGATE-IRKNGK-RRGKQNHICTKCERQFIDVYDPPKGYSEELKQECLEMYLNGM 58 Query: 65 GCRATARIMGVGLNTIFRHLKNSGRS 90 G R R+ GV TI +K G Sbjct: 59 GFRPIERVKGVHHTTIIFWVKQMGEK 84 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 66.2 bits (160), Expect = 3e-10, Method: Composition-based stats. 
Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 3/87 (3%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDMA 60 ++ I CP+C +D + +NG + G Q Y C C++ + TY KI + Sbjct: 4 TLYIKCPAC-LSDNIKKNGFKSYGKQNYKCKDCKRQFIGDHALTYQGCHSQKDSKIRYLM 62 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNS 87 + G G + A + + + LK Sbjct: 63 VRGSGIKDIACVERISKGKVLATLKKC 89 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 65.8 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 2/83 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I CP CS + +NG G Q Y+C C + + + + + +NG+G Sbjct: 11 IQCPDCSC-QHIPKNGHQP-GKQNYICVACSHQFIKPYHPQEYSDNVKRLFLRIYVNGMG 68 Query: 66 CRATARIMGVGLNTIFRHLKNSG 88 R A + GV TI +K++ Sbjct: 69 IRRIAWVKGVTYPTIINLIKHTR 91 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 65.8 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 21/52 (40%), Positives = 30/52 (57%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 L Y A + ++II+MA G G R TA + +G+NT+ R LKNS +S Sbjct: 22 LLTLAYEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 64.7 bits (156), Expect = 8e-10, Method: Composition-based stats. Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 6/79 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMN 62 CP C D V +NGKS G QRY+C CR ++ F+ T K ++ + Sbjct: 52 ECPKCQCKD-VNKNGKS-NGRQRYICKRCRTSFDEFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRATARIMGVGLNTIF 81 G+ R A +GVG+ T F Sbjct: 110 GLSIRKCAEEVGVGVKTSF 128 >UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 RepID=P73782_SYNY3 Length = 141 Score = 63.5 bits (153), Expect = 2e-09, Method: Composition-based stats. 
Identities = 20/84 (23%), Positives = 38/84 (45%), Gaps = 2/84 (2%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC 66 CP C + VV+NG G QR+ C C+ + + + + M G+ Sbjct: 6 HCPQCGHGN-VVKNGFV-KGKQRFKCKRCQYKFTNLSKERGKLLWMKLEAVLLYMGGMSM 63 Query: 67 RATARIMGVGLNTIFRHLKNSGRS 90 ATA+++GV ++ +++ G + Sbjct: 64 NATAKLLGVSTQSLLNWIRDFGEA 87 >UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JAS9_9ALTE Length = 181 Score = 62.8 bits (151), Expect = 4e-09, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 33/81 (40%), Gaps = 4/81 (4%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNG 63 CP C + +R G S QRY C C KT+ Y + + ++ G Sbjct: 56 QCPYCQ-SKTFIRWGSSENERQRYRCKRCAKTFNALVGSPLYRMRKEELWLEYVETMRYG 114 Query: 64 VGCRATARIMGVGLNTIFRHL 84 + R A++ GV L T FR Sbjct: 115 LSLRKAAKVTGVSLRTAFRWR 135 >UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriales RepID=Q116V8_TRIEI Length = 108 Score = 62.0 bits (149), Expect = 5e-09, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 2/81 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + +V+NG G Q YLC C + ++ + + M ++G+G Sbjct: 1 MHCPYCQ-SHKIVKNGH-RNGKQSYLCRKCGRQFRENPCPIGYSSEVKEACLKMFLSGMG 58 Query: 66 CRATARIMGVGLNTIFRHLKN 86 RA R G+ N++ ++ Sbjct: 59 FRAIERATGISHNSVLNWVRR 79 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 62.0 bits (149), Expect = 6e-09, Method: Composition-based stats. 
Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 6/83 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQ-PGTHQKIIDMAMNG 63 CP C ++ + RNGK G QRY+C C+KT+ TY + + K +NG Sbjct: 54 CPLCG-SETISRNGK-YNGKQRYICKSCKKTFTDFTNSATYKSKKTLDKWLKYAKCMING 111 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 R +A+I+ + + T F Sbjct: 112 YSIRKSAKIVEINIATSFFWRHK 134 >UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FRR6_PSYA2 Length = 108 Score = 62.0 bits (149), Expect = 6e-09, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 36/88 (40%), Gaps = 3/88 (3%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDM 59 + ISCP C + + +NG + G Q Y C C++ + TY +I M Sbjct: 3 TQIDISCPDCHSI-SLKKNGIKSYGKQNYQCKDCQRQFIGDHALTYQGCHSRIEDRIRLM 61 Query: 60 AMNGVGCRATARIMGVGLNTIFRHLKNS 87 G G R A I V + + L +S Sbjct: 62 TARGCGIRDIAVITSVSIGKVLSTLGSS 89 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 61.2 bits (147), Expect = 8e-09, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + C C +++ +V+NG S +G Q+Y C C L +KI+ Sbjct: 1 MIKETYECRECGSSN-IVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKN 86 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLKK 85 >UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID=A9VV42_BACWK Length = 342 Score = 61.2 bits (147), Expect = 9e-09, Method: Composition-based stats. 
Identities = 25/83 (30%), Positives = 34/83 (40%), Gaps = 5/83 (6%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT---YTASQPGTHQKIIDMAMNG 63 CP C+ ++ VVR GK QRY C C KT+ Y + +D G Sbjct: 55 ECPHCA-SEHVVRFGK-HNNRQRYRCKCCSKTFTDTTNTVLYRTRKGNEWITFVDCMFKG 112 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 R +A I+GV T+F Sbjct: 113 YSLRKSAEIVGVTWVTLFYWRHK 135 >UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellular organisms RepID=B0ABB1_9CLOT Length = 454 Score = 60.1 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 6/83 (7%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIID 58 + CP C + D + +NGK T QRY+C +CR T+ + + T T K Sbjct: 136 KNDLKCPKCGSFD-LNKNGK-TNQRQRYICKNCRTTFDERSFSPLSNTKLSLDTWLKYCQ 193 Query: 59 MAMNGVGCRATARIMGVGLNTIF 81 + G + A+ +GV + T F Sbjct: 194 FMIEGGTIKYCAQKVGVSIPTSF 216 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 59.7 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 35/83 (42%), Gaps = 2/83 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C ++ + NGK G QRY C C + + I + + +G Sbjct: 1 MECKGC-KSNKTINNGKV-RGKQRYNCKSCGFNFVEVDERRGKNIDKQRMAIHLYLENMG 58 Query: 66 CRATARIMGVGLNTIFRHLKNSG 88 RA R++GV + + ++ +G Sbjct: 59 FRAIGRVLGVSNLAVLKWIRAAG 81 >UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V2B8_9AQUI Length = 125 Score = 58.9 bits (141), Expect = 5e-08, Method: Composition-based stats. 
Identities = 22/84 (26%), Positives = 36/84 (42%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I CP C ++ + GK+T G QRY C+ C + + Y + M G+ Sbjct: 14 IKCPECG-SNWCKKFGKNT-GKQRYKCNECGRHFYEGAKYHKHPEKVKLLALKMYSKGMS 71 Query: 66 CRATARIMGVGLNTIFRHLKNSGR 89 A AR++ + T+ R G+ Sbjct: 72 KSAIARVLNLPYRTVARWTYEVGK 95 >UniRef50_C1I4B6 Putative uncharacterized protein n=2 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4B6_9CLOT Length = 361 Score = 57.4 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 8/85 (9%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLC--SHCRKTWQL----QFTYTASQPGTHQKIIDMAM 61 CP+C+ ++ ++ GK G QR+ C C KT+ F+ + K + + Sbjct: 57 CPNCN-SNNFIKYGK-YRGLQRFKCLNKDCCKTFSQKTNSIFSNSKKPLELWLKYLILMN 114 Query: 62 NGVGCRATARIMGVGLNTIFRHLKN 86 N R + I+G+ L T F Sbjct: 115 NKFSLRKCSSILGINLATSFYWRHK 139 >UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani RepID=Q891N5_CLOTE Length = 279 Score = 57.4 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 37/83 (44%), Gaps = 6/83 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAMNG 63 C C ++ +V+NGK QRY+C C KT+ +Y+ + + G Sbjct: 59 CVHC-KSENIVKNGKYKE-KQRYICKDCHKTFTNYTNSPISYSKKNISKWIEYTKCMLAG 116 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 R +++++G+ L+T F Sbjct: 117 YSLRKSSKLVGISLSTAFYWRHK 139 >UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes RepID=D2QCU0_9SPHI Length = 139 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. 
Identities = 19/83 (22%), Positives = 35/83 (42%), Gaps = 2/83 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 ++ CP C++ D V RNG QR+ C C + + K + + + GV Sbjct: 3 TLKCPKCNSVDAV-RNG-IVNQRQRFRCKKCNYNFTVGKVGKGISTYYVIKALQLYIEGV 60 Query: 65 GCRATARIMGVGLNTIFRHLKNS 87 R R++G+ ++ +K Sbjct: 61 SFREIERLLGISHVSVMNWVKKY 83 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 1/69 (1%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCR 67 CP+C + ++NG G + C C + + + T T Q I + + G+ R Sbjct: 37 CPNCG-SHHTIKNGSIHNGKPKRQCKECGRQFVINPTNKTVSDETKQLIDKLLLEGISLR 95 Query: 68 ATARIMGVG 76 AR+ G Sbjct: 96 VIARVTGAS 104 >UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X4X6_FLAB3 Length = 169 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 2/85 (2%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC 66 +CP C VV++G QR+LC C + ++ K + + + G+ Sbjct: 35 TCPKCQQ-QNVVKSGIVKE-RQRFLCRSCNYYFTVKKLGKQIDDYYVTKALQLYLEGLSY 92 Query: 67 RATARIMGVGLNTIFRHLKNSGRSR 91 R RI+GV TI ++ R Sbjct: 93 REIERILGVSHVTISSWVRKYNIKR 117 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 55.8 bits (133), Expect = 4e-07, Method: Composition-based stats. 
Identities = 17/85 (20%), Positives = 34/85 (40%), Gaps = 2/85 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++C +C + R ++ G QRY C C ++Q + Y A +I + Sbjct: 1 MNCKNCDQAHCIKRGKRN--GIQRYYCKICFTSFQENYHYKAYDSSIDTLLISLLRECCS 58 Query: 66 CRATARIMGVGLNTIFRHLKNSGRS 90 AR++ + NT+ + + Sbjct: 59 VLGIARVLKISKNTVLSRMLKISKQ 83 >UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDS3_NEOSM Length = 134 Score = 55.4 bits (132), Expect = 5e-07, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 36/79 (45%), Gaps = 3/79 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C++ +++GK+ QRY C +C + + A + + ++G+ Sbjct: 1 MHCPKCNSV-RFIKSGKAKE-KQRYKCLNCGCQFSRNEKHGA-PLRLKMHAVQLFLSGIS 57 Query: 66 CRATARIMGVGLNTIFRHL 84 + A+I V T+ R + Sbjct: 58 MNSIAKIFSVSPPTVMRWV 76 >UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyricum RepID=C4IIL3_CLOBU Length = 325 Score = 54.3 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 32/83 (38%), Gaps = 6/83 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNG 63 CP C + ++ GK G QRY C C+KT+ + Y P K I++ Sbjct: 35 CPHCKNVE-FIKFGK-YDGIQRYRCKSCKKTFSYTTNSLWKYLKHPPEKWFKFIELLGEK 92 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 A+ + + + T F Sbjct: 93 KTLEYCAKTLKISIVTAFNWRHK 115 >UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyricum RepID=B1QSI6_CLOBU Length = 336 Score = 54.3 bits (129), Expect = 1e-06, Method: Composition-based stats. 
Identities = 23/86 (26%), Positives = 36/86 (41%), Gaps = 8/86 (9%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSH--CRKTWQLQ----FTYTASQPGTHQKIIDMA 60 SCP C ++ GK QRY C + C KT+ + Y QP + I++ Sbjct: 34 SCPYCGCK-HFIKYGKYQD-IQRYKCKNEECGKTFSNTTFSVWKYLKYQPEKWIEFIELM 91 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKN 86 G+ ++ARI+ + T F Sbjct: 92 CEGMTLESSARILKITTTTAFYWRHK 117 >UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marina EX-H1 RepID=C0QU68_PERMH Length = 94 Score = 53.5 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 38/87 (43%), Gaps = 2/87 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M ISCP C ++ V+NGK+ G Q YLC C + + + ++ +++ Sbjct: 1 MGGKKISCPHC-ESERCVKNGKA-NGKQTYLCKECYYRFTINASKRKYPFKIRREAVNLY 58 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNS 87 G ++ + + + TI +K Sbjct: 59 KEGYTLTEISKKLNIKVQTIHHWVKKY 85 >UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IZ3_CLOAB Length = 142 Score = 53.5 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 37/83 (44%), Gaps = 6/83 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQ-PGTHQKIIDMAMNG 63 CP C ++ + RN K G Q Y+C C+K++ TY + + K +NG Sbjct: 54 CPICG-SETISRNSK-YNGKQGYICKSCKKSFTDFTNSATYKSKKTLDKWLKYAKCMVNG 111 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 R +A+++ + + T F Sbjct: 112 YSIRKSAKVVEINIATSFFWRHK 134 >UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_VIBFM Length = 489 Score = 53.1 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 24/58 (41%) Query: 27 HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 QRY C C T+ +++ + QK++ G R R + + T + H+ Sbjct: 109 RQRYRCKSCASTFVDKWSGENQKSLIQQKLLGFLFTGYSVREICRRLHINPKTFYDHI 166 >UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=environmental samples RepID=Q64EP4_9ARCH Length = 164 Score = 53.1 bits (126), Expect = 3e-06, Method: Composition-based stats. 
Identities = 21/77 (27%), Positives = 30/77 (38%), Gaps = 4/77 (5%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNGVGCRATARI 72 +VR G G QR+ C C K + F I + + G RA RI Sbjct: 38 IVRYGHDKNGRQRFKCKTCGKVFVETKNTVFYNRKLSEDQIILICKLLVEKNGIRAIERI 97 Query: 73 MGVGLNTIFRHLKNSGR 89 M + +TI +K+ R Sbjct: 98 MEIHRDTISDVVKDLAR 114 >UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IP3_CLOAB Length = 171 Score = 53.1 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 33/89 (37%), Gaps = 11/89 (12%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYT-----ASQPGTHQKII 57 V + C + RNGK QRY+C C+KT+ FTY+ + Sbjct: 50 KVYLHC----KLEMFSRNGKHDE-KQRYVCKTCKKTF-TDFTYSPISSSKKPLDKWLQYA 103 Query: 58 DMAMNGVGCRATARIMGVGLNTIFRHLKN 86 + G R A+ + + + T F Sbjct: 104 KCMIVGYSIRKCAKTVNINIATSFFWRHK 132 >UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HSL0_PARL1 Length = 342 Score = 52.7 bits (125), Expect = 3e-06, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 33/95 (34%), Gaps = 11/95 (11%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSH-----CRKTW---QLQFTYTASQPGTHQKIID 58 CP C D +V++G+ G QR+ C C +T+ +P Sbjct: 54 GCPHCGH-DDIVKHGRDRGGRQRFRCRRSGSSGCGQTFNALTGTAFTRMRKPEKWAAYAR 112 Query: 59 MAMNGVGCRATARI--MGVGLNTIFRHLKNSGRSR 91 M G + +G+ T +R R++ Sbjct: 113 MMATGFKSVDDVKTSGLGISRLTAWRWRHRLLRAQ 147 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 52.7 bits (125), Expect = 3e-06, Method: Composition-based stats. 
Identities = 17/49 (34%), Positives = 27/49 (55%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ 49 M ++ + C C T+ V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRACH 49 >UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU Length = 507 Score = 52.4 bits (124), Expect = 4e-06, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 33/91 (36%), Gaps = 8/91 (8%) Query: 8 CPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 C + + ++ G QRY C C+ T+ +++ + + ++ + G Sbjct: 104 CANFGLSVHTHKHLYHAFGYSGDRQRYRCKSCQSTFVDKWSGANKKLQFQENLMGLLFTG 163 Query: 64 VGCRATARIMGVGLNTIFRHLK----NSGRS 90 R R + + T + H++ R Sbjct: 164 YSVREICRKLAINPKTFYDHVEHIASRCRRK 194 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 52.4 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 19/42 (45%), Positives = 26/42 (61%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 MA + + CP + T V+RNG +T+G Q Y C C KT+QL Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQLS 42 >UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillus RepID=A6CNB6_9BACI Length = 335 Score = 51.6 bits (122), Expect = 9e-06, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 34/88 (38%), Gaps = 7/88 (7%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYT----ASQPGTHQKIID 58 + C C + V RNGK QRYLC C K++ + + T G K Sbjct: 49 KEGLGCIHCGSV-KVKRNGKYRE-RQRYLCRDCGKSF-NELSNTPIAGTRYLGKWAKYFH 105 Query: 59 MAMNGVGCRATARIMGVGLNTIFRHLKN 86 M + G A+ + + ++T F Sbjct: 106 MMVEGYTLPKIAKRLKIHISTAFYWRHK 133 >UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AFFE Length = 357 Score = 51.2 bits (121), Expect = 9e-06, Method: Composition-based stats. 
Identities = 18/78 (23%), Positives = 33/78 (42%), Gaps = 5/78 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNG 63 CP C + +NGK HQRY+C C K++ + F ++ I++ + Sbjct: 50 CPICGSV-HFKKNGKDKNRHQRYICLDCHKSFSDRTNTLFYWSHFTLDQWLHFIELELYK 108 Query: 64 VGCRATARIMGVGLNTIF 81 + A+++ T F Sbjct: 109 MPLEGEAQVLETSKTTCF 126 >UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria RepID=B4WSN9_9SYNE Length = 83 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 5/79 (6%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH----QKIIDMAM 61 + CP C ++GK++ G QRY C+ CR+T+ F + + I+ + Sbjct: 1 MDCPFCDHPTP-HKHGKTSKGSQRYRCTACRRTFTETFDTLYDRRQVTSEQVKLILQTYV 59 Query: 62 NGVGCRATARIMGVGLNTI 80 G R +RI T+ Sbjct: 60 EGSSLRGISRIGKRAYGTV 78 >UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gammaproteobacteria RepID=A1SXI4_PSYIN Length = 319 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 28/85 (32%), Gaps = 5/85 (5%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAM 61 S CP C + GK+ + QRY C C KT+ + K + Sbjct: 52 SPQCPHCHCA-HFTKWGKAGS-VQRYKCFSCHKTFNNKTKTPLAKLHRCELWDKYAECMS 109 Query: 62 NGVGCRATARIMGVGLNTIFRHLKN 86 + R A + + L T F Sbjct: 110 LKLTLREAAAVCNINLKTSFLWRHR 134 >UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Methanocaldococcus infernus ME RepID=C5U8R8_9EURY Length = 100 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. 
Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 6/91 (6%) Query: 6 ISCPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGTHQKIID-MA 60 I C C+ +D VV+ GK + Q YLC C++ + + +K++ + Sbjct: 5 IRCKYCN-SDKVVKAGKHKSEKYGVRQMYLCKKCKRRFVEESKAPRYSDSFKEKVVRSVV 63 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 G+G R R+ + TI R +K+ +++ Sbjct: 64 FEGLGIRQAGRVFKLSTTTILRWIKDFKKTK 94 >UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitrosomonas europaea RepID=Q81ZP0_NITEU Length = 323 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 5/88 (5%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA- 60 +S CP C ++ R G AG QR+ C C+ T+ ++ + ++ + Sbjct: 43 SSFEPICPVCQ-SNHFYRWGY-QAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA 100 Query: 61 --MNGVGCRATARIMGVGLNTIFRHLKN 86 + G+ RA+AR + NT FR Sbjct: 101 ALIEGLTVRASARQCRIDKNTSFRWRHR 128 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 40/83 (48%), Gaps = 1/83 (1%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCR 67 C C+ + ++ G ++ QRY C C+K + +++Y A Q T+ I + GVG R Sbjct: 8 CIHCNYS-YCIKAGITSQNKQRYQCKKCKKKFIGKYSYRAYQKSTNHNIQQLIKEGVGIR 66 Query: 68 ATARIMGVGLNTIFRHLKNSGRS 90 +R++ V T+ + + Sbjct: 67 GISRLLNVSKTTVLKKILKIASK 89 >UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMX8_ACAM1 Length = 134 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. 
Identities = 20/89 (22%), Positives = 37/89 (41%), Gaps = 9/89 (10%) Query: 6 ISCPSCSATDGVVRNG----KSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + CP C ++ +++ G + QRY C C + + + ++ T ++ A+ Sbjct: 1 MECPYCQ-SEKILKRGFDSLQDGTLVQRYQCKDCNRRFNERTGTPMARLRTASSVVSYAI 59 Query: 62 ----NGVGCRATARIMGVGLNTIFRHLKN 86 G+G R+ R G TI R K Sbjct: 60 KARTEGMGVRSAGRTFGKSHTTIMRWEKR 88 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 2/80 (2%) Query: 7 SCPSCS-ATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + PSC G+V+NGK+ AG QR+LC C + +T+ ID ++G Sbjct: 5 NRPSCDMCGHGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDIRHFK-IFIDWILSGES 63 Query: 66 CRATARIMGVGLNTIFRHLK 85 A+ +GV T+ R K Sbjct: 64 ADHLAKRLGVTRRTLTRWFK 83 >UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD Length = 317 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 32/89 (35%), Gaps = 5/89 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKII 57 M + SCP C +++ ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRATARIMGVGLNTIFRHLKN 86 + + R A+ GV NT F Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHR 124 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. 
Identities = 27/85 (31%), Positives = 36/85 (42%), Gaps = 3/85 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDMAMNG 63 ISCP CS+ + +NGK Q YLC C + + TY Q+I+ M + G Sbjct: 7 ISCPKCSSCQ-IKKNGKKPNNKQNYLCKCCGRQFIGDHALTYRGCHSKISQRILIMLVRG 65 Query: 64 VGCRATARIMGVGLNTIFRHLKNSG 88 G R A I V + L N Sbjct: 66 CGIRDVAAIEKVSCTKVLSVLLNVR 90 >UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B4C9_9RHOB Length = 321 Score = 49.3 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 7/84 (8%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKII-DMAMN 62 +CP C A D R G++ AG QRY C C KT+ + +++ DM + Sbjct: 49 TCPHCGAVDR-QRWGRTRAGSQRYRCQGCLKTFNGRTGSSIAQLQKLDQFYQVLKDMFSD 107 Query: 63 G--VGCRATARIMGVGLNTIFRHL 84 G R AR + V +TI+R Sbjct: 108 GPPRSIRRLARQLDVNKDTIWRWR 131 >UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W2G4_DYAFD Length = 388 Score = 49.3 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 6/79 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I C C+ DG+++ G G QRYLC C + A + + + ++ + Sbjct: 2 IECVKCAQVDGIMKAGYV-RGKQRYLCKWCNYYFT-----HAEKDDSIESLVKRKRHQTT 55 Query: 66 CRATARIMGVGLNTIFRHL 84 A+ +GV +T+ R L Sbjct: 56 IIDIAKSLGVSNSTVSRAL 74 >UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L9I6_MAGSM Length = 89 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. 
Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 6/90 (6%) Query: 1 MAS--VSISCPSCSATDGVVRNGKSTAGHQRYLCSH--CRKT-WQLQFTYTASQPGTHQK 55 MA+ V + CP C + D V++ GK G QR+ C+ C +T + + ++ Sbjct: 1 MATMEVHVHCPDCGSLD-VIKFGKDRHGRQRFRCNDHFCDRTIFMMDDPDWWRFEEVKKQ 59 Query: 56 IIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 I ++G G TA +G+ + R K Sbjct: 60 IALHLLSGNGIHQTAHNLGLHPEFVNRMAK 89 >UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SD87_FERPL Length = 94 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 38/91 (41%), Gaps = 5/91 (5%) Query: 6 ISCPSCSATDGVVR---NGKSTAGHQRYLCSHCRKTWQLQ-FTYTASQPGTHQKIIDMAM 61 + CP C + V + KS QRY C +C +T+ L + ++ + Sbjct: 1 MMCPHCKSIKTVKMGCYHTKSGERRQRYKCKNCGRTFVLNPIKPRNYPEEFKEMVVKAVV 60 Query: 62 -NGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 GVG R +RI + NT+ ++ + R Sbjct: 61 REGVGVRQASRIFKLSPNTVTAWVREFSKKR 91 >UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelandii RepID=Q9AMR3_AZOVI Length = 214 Score = 48.5 bits (114), Expect = 6e-05, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 32/89 (35%), Gaps = 5/89 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKII 57 M + SCP C +++ ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRATARIMGVGLNTIFRHLKN 86 + + R A+ GV NT F Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHR 124 >UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus communis RepID=B9TDK1_RICCO Length = 321 Score = 48.1 bits (113), Expect = 8e-05, Method: Composition-based stats. 
Identities = 19/82 (23%), Positives = 30/82 (36%), Gaps = 5/82 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKIIDMAMNGV 64 CP C R G++ +G QR+ C HC ++ + + + Sbjct: 52 CPHCGCARK-HRCGQA-SGLQRFRCLHCGRSHNALTKTPLARLRKKECWLPYLQCVLESR 109 Query: 65 GCRATARIMGVGLNTIFRHLKN 86 R A+I+GV T FR Sbjct: 110 TVRDAAQIVGVHRTTSFRWRHR 131 >UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6ARX2_9BACT Length = 133 Score = 48.1 bits (113), Expect = 9e-05, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 32/87 (36%), Gaps = 5/87 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDM 59 S CP C + V + G+ G QRY C CR+ + + ++ Sbjct: 47 SEHPRCPHCQD-EHVAKWGRV-KGLQRYRCEACRRQFTPLTNTPLSGLRKREKWGAYLEA 104 Query: 60 AMNGVGCRATARIMGVGLNTIFRHLKN 86 +G+ R A+ +GV T F Sbjct: 105 MEDGLSVRKAAQRIGVNHKTTFLWRHR 131 >UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BSX6_9BIFI Length = 352 Score = 47.7 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 36/78 (46%), Gaps = 5/78 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS----QPGTHQKIIDMAMNG 63 C C + ++R G+ G QR+ C +C +T+ ++ + G + ++ ++ Sbjct: 55 CVRCGSI-RIIRKGRGRDGSQRWKCMNCNRTFGVRTNRVMGMSKLKAGVWMRFLECFVDC 113 Query: 64 VGCRATARIMGVGLNTIF 81 + R A+ GV L T F Sbjct: 114 LSLRKCAQRCGVCLKTAF 131 >UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZNT7_ACAM1 Length = 188 Score = 47.3 bits (111), Expect = 1e-04, Method: Composition-based stats. 
Identities = 24/89 (26%), Positives = 41/89 (46%), Gaps = 9/89 (10%) Query: 6 ISCPSCSATDGVVRNG----KSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + C C ++ VV+NG K+ Q +LC C + + + ++ T + I MA+ Sbjct: 1 MQCIHCQ-SENVVKNGTKTLKTAQVVQYFLCKDCGRRFNERSGTPMARLRTPVETISMAI 59 Query: 62 N----GVGCRATARIMGVGLNTIFRHLKN 86 N G+G RA R++ N+I K Sbjct: 60 NARTEGLGIRAAGRVLRKSPNSIILWEKR 88 >UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JSC1_YERPY Length = 53 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 16/37 (43%), Positives = 21/37 (56%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRK 37 MA + CP C D V ++G +GHQRY C H +K Sbjct: 1 MAKIDEKCPFCERKDLVKKHGYGKSGHQRYRCPHAKK 37 >UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoides sp. BAV1 RepID=A5FST1_DEHSB Length = 319 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 33/92 (35%), Gaps = 9/92 (9%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA--SQPGTHQKIIDMAMNG 63 I C C + R G S A QR+LC+ C T+ + P + M G Sbjct: 8 IECKYCG-SRHTRRYGHSRAQKQRWLCNDCCHTFVETSAQPGMRTPPEQIGAAVSMFYEG 66 Query: 64 VG----CRATARIMGVGL--NTIFRHLKNSGR 89 + CR +I + T++ + + Sbjct: 67 LSLSAICRQMKQIHNISPSDGTVYGWITKYSK 98 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 27/79 (34%), Gaps = 1/79 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C +D V +NG + Q + C C K W T + + + V Sbjct: 1 MRCTHCG-SDLVKKNGYTRHEKQNFRCLECGKQWSENKEAKIINEQTKELVRKALLEKVS 59 Query: 66 CRATARIMGVGLNTIFRHL 84 RI V + + + Sbjct: 60 LNGICRIFDVSMPWLLDFI 78 >UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C348D8 Length = 467 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. 
Identities = 18/86 (20%), Positives = 38/86 (44%), Gaps = 3/86 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M ++ +CPSC +T+ + + G + G RY C +C + L+ K+I+ Sbjct: 67 MKNIEKACPSCYSTENI-KYGTTAIGTVRYQCKNCNNVYSLKNLNKFDD--VDNKLIESL 123 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKN 86 + + + + + +R L+N Sbjct: 124 LKNTKVSTIFKELKITPASFYRRLEN 149 >UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=Enterococcus faecium RepID=Q3Y3Y2_ENTFC Length = 401 Score = 46.2 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 24/98 (24%), Positives = 39/98 (39%), Gaps = 21/98 (21%) Query: 8 CPSCS-ATDGVVRNGK-------STAG---------HQRYLCSHCRKTWQ-LQF---TYT 46 CP C +T +V+NGK + +G QRYLC C+K + + + Sbjct: 47 CPCCKDSTKQIVKNGKKISMILLNRSGNKRTYLRLKKQRYLCRACKKYFTARTYLVTPFC 106 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 H KI++ +A + V + T+ R L Sbjct: 107 FISKQIHYKILEELTERQSIKAIGKHCDVSVTTVQRTL 144 >UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ24_FERNB Length = 316 Score = 46.2 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 14/44 (31%), Positives = 27/44 (61%), Gaps = 1/44 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT 44 M + ++SCP C +T + +NG G+Q++LC C +++L + Sbjct: 1 MNNSTLSCPKCGST-SLYKNGHDKYGNQQFLCKLCHHSFKLSHS 43 >UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MXF0_9DELT Length = 512 Score = 46.2 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 32/68 (47%), Gaps = 2/68 (2%) Query: 19 RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH--QKIIDMAMNGVGCRATARIMGVG 76 R G++ AG +RY C C +T+ + TA Q TH +KI +N + + Sbjct: 43 RFGETAAGARRYRCKLCSRTFSINGKPTARQRDTHKNKKIYMHLVNKSPFKRICEQAEIS 102 Query: 77 LNTIFRHL 84 T++R + Sbjct: 103 PATLYRKI 110 >UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_LACF3 Length = 428 Score = 46.2 bits (108), Expect = 3e-04, Method: Composition-based stats. 
Identities = 21/100 (21%), Positives = 35/100 (35%), Gaps = 25/100 (25%) Query: 8 CPSCSATDGVVRNGKSTA-----------------GHQRYLCSHCRKTW------QLQFT 44 CP C D ++NG S QR C +C+ ++ ++ Sbjct: 45 CPHCGFADTFIKNGHSYQTIKYLSINESCPTMLRIDKQRLRCKNCQDSFMAKTNVVDKYC 104 Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 A K + M + V + ++ GV +TI R L Sbjct: 105 SIAK--AVKHKALTMLESNVSQKDVSKFTGVSPSTIGRLL 142 >UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZA4_TRIEI Length = 469 Score = 46.2 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 2/37 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 + CP+C +T + +NG+ QRY C C + + +Q Sbjct: 1 MKCPTCGST-SLRKNGR-PNNRQRYRCKDCGRQFMVQ 35 >UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ99_FERNB Length = 261 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 1/48 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS 48 M ++ + CP C +++ ++NG +Q + C C++ ++L FT Sbjct: 1 MTNIQLKCPHCGSSN-FIKNGHDKFKNQIFFCKDCKRYFKLSFTKKHK 47 >UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia RepID=B0K4X0_THEPX Length = 343 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 13/36 (36%), Positives = 19/36 (52%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW 39 V + CP C+ T + GK G+Q+YLC C + Sbjct: 5 VPLKCPKCNNTHLFYKYGKDKDGYQKYLCRKCYHQF 40 >UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C9BRL5_ENTFC Length = 433 Score = 45.8 bits (107), Expect = 5e-04, Method: Composition-based stats. 
Identities = 22/101 (21%), Positives = 33/101 (32%), Gaps = 25/101 (24%) Query: 8 CPSCSATDG---VVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTAS 48 CP C + +V+NGK + QRY C C + TY Sbjct: 47 CPLCKQMNHEGMIVKNGKKKSLIQLNKCANQLTYLALAKQRYHCRGCHTYF-TANTYIVD 105 Query: 49 Q-----PGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 + KI++ A+ GV +T+ R L Sbjct: 106 RNCFIAKQVRYKILEELTEKQAMTTIAKHCGVSWSTVSRTL 146 >UniRef50_P04137 Uncharacterized protein in transposable element ISH50 n=11 Tax=Halobacteriaceae RepID=YIH50_HALSA Length = 294 Score = 45.4 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 7/90 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ--FTYTASQPGTHQKIIDMAM-- 61 + CPSC + V+R G S QRYLC C +T+ Q + S + + + Sbjct: 26 VYCPSC-RAESVIRYG-SYRVFQRYLCKDCDRTFNDQTGTVFEHSAVALRKWFLAVYTYI 83 Query: 62 -NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 R + V T++R ++ R+ Sbjct: 84 RLNTSIRQLDAEIDVSYKTVYRRVQRFLRA 113 >UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacteriaceae RepID=A4W908_ENT38 Length = 414 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 14/41 (34%), Positives = 22/41 (53%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT 44 V + CP+C D ++RNG G QR+ C C ++ + T Sbjct: 64 VLLYCPTCGQGDALIRNGCGLRGAQRWRCRTCNSSFTDKST 104 >UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WF86_9ACTN Length = 243 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. 
Identities = 17/83 (20%), Positives = 27/83 (32%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTH-QKIIDMAMNG 63 CP C + +GK+ G +RY C C + A P +I ++ + Sbjct: 54 CPDCGSVRP-RLDGKAPNGARRYRCRECGCRFSALTGTIFADAKLPLHKIMRIAEVMCHS 112 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 R + V T F Sbjct: 113 ASLRLMELVAEVSHGTAFLWRHK 135 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 45.0 bits (105), Expect = 6e-04, Method: Composition-based stats. Identities = 12/54 (22%), Positives = 24/54 (44%), Gaps = 3/54 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ 54 M +I CP C ++ + + G +Q+Y C C + + +S+P + Sbjct: 1 MNKTNIKCPRCH-SEKLYKFGFDKQANQKYQCKECGRQFAPD--SVSSRPKSKY 51 >UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5CBF Length = 184 Score = 45.0 bits (105), Expect = 7e-04, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 38/105 (36%), Gaps = 21/105 (20%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFT 44 + + +CP C + V+NG T+ QR+LC C ++ L+ Sbjct: 42 LTKDTCACPHCH-SQTTVKNGFKTSKVRYLPFQNYPIIIALKKQRFLCKECHHSFTLETP 100 Query: 45 ----YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 Y + +++ + A+ + + T+ R LK Sbjct: 101 IVKKYASISQTLKLSVLNSLQENMSLSLIAKQHRISIPTVQRILK 145 >UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WNK0_RHOS5 Length = 481 Score = 45.0 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 1/67 (1%) Query: 19 RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK-IIDMAMNGVGCRATARIMGVGL 77 R GK+ G R+ C C KT+ + + + ++DM N + +RI G+ Sbjct: 132 RFGKTKGGDARWRCKGCGKTFSVGKPARRHKRSDKNRLVLDMLCNDLSFAKMSRISGLAY 191 Query: 78 NTIFRHL 84 I+R + Sbjct: 192 RDIYRRV 198 >UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobiales RepID=Q07NT9_RHOP5 Length = 577 Score = 44.7 bits (104), Expect = 9e-04, Method: Composition-based stats. 
Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 11/86 (12%) Query: 7 SCP--SCSATDGVV--------RNGKSTAGHQRYLCSHCRKTW-QLQFTYTASQPGTHQK 55 CP SC + + R+G S G RY C CRKT+ + ++ Sbjct: 103 HCPDDSCENYNKLFDSHPKSYFRHGTSAIGAPRYRCKACRKTFSVRTGHSRHRKSHENKT 162 Query: 56 IIDMAMNGVGCRATARIMGVGLNTIF 81 + + ++ V +I + ++ Sbjct: 163 VFQLLVSKVPITKIGQITDLSPAAVY 188 >UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_ENTFA Length = 446 Score = 44.7 bits (104), Expect = 9e-04, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 4/69 (5%) Query: 26 GHQRYLCSHCRKTWQLQFTYTASQ----PGTHQKIIDMAMNGVGCRATARIMGVGLNTIF 81 QR+ C HC KT+ + + + + Q I+++ + AR+ + T+ Sbjct: 85 NKQRFKCKHCGKTFLAEDSVSDRRCSIARRVKQAILELLSEPISMSLIARMKHISPTTVI 144 Query: 82 RHLKNSGRS 90 R L++ Sbjct: 145 RILRSLRPK 153 >UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia RepID=B2A0V7_NATTJ Length = 353 Score = 44.7 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 2/36 (5%) Query: 6 ISCPSCS--ATDGVVRNGKSTAGHQRYLCSHCRKTW 39 + CP C+ +D + G GHQ+Y C C + Sbjct: 4 VVCPRCNNNCSDKFYKFGFDNHGHQKYQCQECFSQF 39 >UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium jeikeium K411 RepID=Q4JT92_CORJK Length = 165 Score = 43.9 bits (102), Expect = 0.001, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 29/86 (33%), Gaps = 5/86 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + SCP C +NG ++ R+ C+HC ++ T I A Sbjct: 1 MTTNRPSCPLCGNNTK--KNGTTSKSTTRWRCTHCGHSFTRNTQTHNKNTATMALFIQWA 58 Query: 61 MNGVGCRATARIMGVGLNTI---FRH 83 A GV T+ FR Sbjct: 59 TGTQSLTTFAAHHGVTRQTMHHRFRW 84 >UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultured archaeon RepID=D1JAI8_9ARCH Length = 192 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 4/71 (5%) Query: 20 NGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNGVGCRATARIMGV 75 GK Q C C K + + + G I + G G RATARIMG+ Sbjct: 36 YGKGEKRTQMLKCKVCGKRFSIHKGTPLFNLKADEGAFYGTIAHLVEGNGIRATARIMGI 95 Query: 76 GLNTIFRHLKN 86 +T+ + LK Sbjct: 96 NKDTVSKWLKK 106 >UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4F0_UNCMA Length = 141 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 23/82 (28%), Positives = 33/82 (40%), Gaps = 4/82 (4%) Query: 12 SATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK----IIDMAMNGVGCR 67 S VV+ G S AGHQ + C HC + + ++ I + G R Sbjct: 24 SEGSRVVKKGFSRAGHQVFQCRHCGRHFCETINTPMYGRRITREDVILIGKLLNERNGIR 83 Query: 68 ATARIMGVGLNTIFRHLKNSGR 89 A RI G +T+ R K+ R Sbjct: 84 AIERITGHHRDTVMRVAKDLAR 105 >UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax=Enterococcus RepID=Q3Y1C3_ENTFC Length = 431 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 28/106 (26%), Positives = 38/106 (35%), Gaps = 27/106 (25%) Query: 7 SCPSCSAT--DG-----VVRNGKSTA----------------GHQRYLCSHCRKTWQLQF 43 +C +C +T DG VV+NGK QRY C +CR W Q Sbjct: 44 TCRNCGSTVVDGNGKVIVVKNGKKETIVRFEQYNHMPLVMRLKKQRYTCKNCRTHWTTQS 103 Query: 44 TYTASQPGT----HQKIIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 + + KI + V A+ V L T+ R LK Sbjct: 104 YFVQPRHSIANHVRYKIASLLTEKVSLSFIAKNCQVSLTTVIRTLK 149 >UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q8PSY9_METMA Length = 146 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. 
Identities = 24/100 (24%), Positives = 42/100 (42%), Gaps = 11/100 (11%) Query: 3 SVSISCPSCSAT-------DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 + + CP+ + +++ GK GHQRY C HC K + + ++ Sbjct: 5 TDEVVCPNPKCSYYLKAEGRAIIKRGKYKTGHQRYYCKHCEKFFMDTIGTAIYRKHLSKE 64 Query: 56 IIDM----AMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 I M + G R+ RI G +TI LK++ ++ Sbjct: 65 EIRMIYRLFLEKNGIRSIERITGHHRDTISNLLKDTVKNE 104 >UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MKY8_9DELT Length = 632 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 21 GKSTAGHQRYLCSHCRKTWQ--LQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLN 78 G + AG QR+ C C KT+ L + G ++ + V R AR VG Sbjct: 123 GHTKAGSQRFRCKICHKTFSIPLAANLRQRKKGKSTEVFRLLTCQVAIRKMARNARVGKE 182 Query: 79 TIFRHLKNSGRS 90 T+ R++ R Sbjct: 183 TVHRYIHLIHRQ 194 >UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracasei subsp. paracasei ATCC 25302 RepID=C2FEQ0_LACPA Length = 425 Score = 42.7 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 31/98 (31%), Gaps = 20/98 (20%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 CP+C +VR G QR+ C CR +Q + Y + Sbjct: 48 HCPACGFASKLVRYGFERTCVLMPSYSYRPTYMKLSRQRFRCELCRSVFQSETDYVRPRS 107 Query: 51 GT----HQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 Q ++ A + AR V T+ R + Sbjct: 108 TISTPVRQMVLFEAFSNCSLTDIARRFHVADKTVQRII 145 >UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VPB4_9FLAO Length = 343 Score = 42.7 bits (99), Expect = 0.003, Method: Composition-based stats. 
Identities = 15/83 (18%), Positives = 27/83 (32%), Gaps = 5/83 (6%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNG 63 CP C + VR G G QRY C C +++ + + + + + Sbjct: 50 GCPHC-LHEKYVRFGVD-KGSQRYKCKSCNRSFTEYTGTWMAGLQRKDMISSYLSLMVQE 107 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 + +G+ T F Sbjct: 108 KSLDKISSELGINKKTAFDWRHK 130 >UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 238 RepID=B5K5I7_9RHOB Length = 319 Score = 42.3 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 35/87 (40%), Gaps = 5/87 (5%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNG 63 +CP C+A V+R G+S G +RY C C KT+ + +G Sbjct: 50 NCPHCAAGGAVIR-GRS-NGLKRYFCKICSKTFNALTGTPLARLRHKDCWTEFAGSLSDG 107 Query: 64 VGCRATARIMGVGLNTIFRHLKNSGRS 90 + +A GV +T FR R+ Sbjct: 108 DTVKTSAARCGVASSTAFRWRHRFLRA 134 >UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacteria RepID=Q5LYW0_STRT1 Length = 448 Score = 42.3 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 36/103 (34%), Gaps = 19/103 (18%) Query: 1 MASVSISCPSCSAT---DGVVRNGKSTAGHQ------------RYLCSHCRKTWQLQFTY 45 + +++ SCP C +N K + Q R+ C CR+ + + Sbjct: 15 LITLAPSCPHCQGKMIKYDFQKNSKISLLEQAGTPTLLRLKKRRFQCKSCRRVTVAETSI 74 Query: 46 TASQPGT----HQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 QK+ + V AR + V +T++R L Sbjct: 75 VEKNCQISNLVRQKVTQLLTEKVSLTDIARRLRVSTSTVYRKL 117 >UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVZ4_9ACTO Length = 225 Score = 42.0 bits (97), Expect = 0.005, Method: Composition-based stats. 
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 9/72 (12%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPG-------THQKIID 58 + CP+C+ + RNGK+++G QR+ C C ++ + +A + + Q+ D Sbjct: 41 MKCPACNT--PLKRNGKTSSGSQRWRCKECGRSKVGKIDNSAKELNRFLSWLLSRQRQKD 98 Query: 59 MAMNGVGCRATA 70 M G R A Sbjct: 99 MPGAGRTFRRHA 110 >UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae BGR1 RepID=C5A9A4_BURGB Length = 284 Score = 42.0 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 32/79 (40%), Gaps = 8/79 (10%) Query: 14 TDGVVRNGKSTAGH-----QRYLCSHCRKTWQLQFT---YTASQPGTHQKIIDMAMNGVG 65 D +NG H RY C C K + + +P + ++ MA++ VG Sbjct: 21 ADFYRKNGYRRTKHNGQPVPRYQCKACGKNFCATQVKPIHGQHRPDLNTQVFKMAVSRVG 80 Query: 66 CRATARIMGVGLNTIFRHL 84 R A ++ G TI R + Sbjct: 81 IRRMATVLDCGRETIQRKI 99 >UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WST6_9SYNE Length = 81 Score = 42.0 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 28/71 (39%), Gaps = 1/71 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + P+C + VV+NGK G Q + C +C + + T I D+ Sbjct: 1 MLDHQPTRPACHSKQ-VVKNGKIHNGKQNHRCKNCGRQFVKDPQQKRISDATKALIDDLL 59 Query: 61 MNGVGCRATAR 71 + + ++ Sbjct: 60 LERLSMNNPSK 70 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 42.0 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 27/98 (27%), Positives = 39/98 (39%), Gaps = 17/98 (17%) Query: 4 VSISCPSCSATDGVV--RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + CP CS+T V RNG G Q + C CR + + A + Q II A+ Sbjct: 71 IVPECPKCSSTVRVKAGRNG----GRQMFQCKQCRTRYVSRGP-GARKTRYSQDIISAAL 125 Query: 62 N----GVGCRATARIMG------VGLNTIFRHLKNSGR 89 N G+ R TA + + NTI + + Sbjct: 126 NKVMSGMSYRKTAEEVNTAHGRDLSPNTIMFWTRKYTQ 163 >UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. 
PCC 7424 RepID=B7K7J3_CYAP7 Length = 354 Score = 42.0 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 + I CP C + +NG + AG QRY C C + + Sbjct: 2 ILIQCPKC-KSKNYRKNG-TIAGKQRYQCKSCGRNFL 36 >UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT4_9LACT Length = 426 Score = 41.6 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 34/100 (34%), Gaps = 21/100 (21%) Query: 7 SCPSCSATDGVVRNGKSTAGH----------------QRYLCSHCRKTWQLQFTYTASQP 50 SCP C ++ V+++ QR++C CRKTW Sbjct: 46 SCPYC-SSKNVIKHSPMEHKIRIPHLYGNKTLLELKVQRFICKDCRKTWVTDCPLVPKNS 104 Query: 51 GTHQ----KIIDMAMNGVGCRATARIMGVGLNTIFRHLKN 86 +I+ + A+++ + T+ R +K Sbjct: 105 NISYDLACQIMLYLKENFSRKTIAKLLSISDKTVERVMKK 144 >UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FRB5_METHJ Length = 138 Score = 41.6 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 6/90 (6%) Query: 5 SISCPSCSATDG--VVRNGKSTAGHQRYLCSHCRKTWQLQFT---YTASQPGTHQKII-D 58 + C DG + +NG ++AG+Q+Y C HCR+ + Y + P T II Sbjct: 14 NPDCTYFQIEDGKNITKNGHNSAGNQQYYCHHCRRFFIETKNTPLYDSRLPRTAVLIIAK 73 Query: 59 MAMNGVGCRATARIMGVGLNTIFRHLKNSG 88 + R +R+ G +TI R+ G Sbjct: 74 HSTEKTSIRGVSRVTGHHRDTISRYYHLIG 103 >UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF9_FERPL Length = 357 Score = 41.6 bits (96), Expect = 0.008, Method: Composition-based stats. 
Identities = 19/92 (20%), Positives = 35/92 (38%), Gaps = 11/92 (11%) Query: 7 SCPSCSATDGVVRNG--KSTAG-HQRYLCSHCRKTW--QLQFTYTASQPGTHQKIIDMAM 61 +C +C D V++ G + +G Q Y C C K + + F + +D+ Sbjct: 85 TCKNCGRDDEVIKKGIRYNKSGPVQMYYCKRCGKKFSARTGFGGMKKRAEAIVAALDLYF 144 Query: 62 NGVGCRATARIMGVGLN------TIFRHLKNS 87 G+ R A+ + N T+ +K Sbjct: 145 RGLSLRQVAQHLKASYNVEVCHKTVHNWIKRY 176 >UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostocaceae RepID=B2J098_NOSP7 Length = 133 Score = 41.2 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 10/34 (29%), Positives = 21/34 (61%), Gaps = 1/34 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW 39 + CP C+ + + ++G+ G QRY+C +C + + Sbjct: 34 MECPKCN-SHLLGKHGREPDGVQRYICKNCSRIF 66 >UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMW7_ACAM1 Length = 75 Score = 40.8 bits (94), Expect = 0.014, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 27/58 (46%), Gaps = 1/58 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIID 58 M+ + + P C + + GK++ G QRY C C++T+ F + ++I Sbjct: 1 MSYLLMQSPLCDHP-KIHKPGKTSKGSQRYRCLDCQQTFSETFDTLYYRLQISSEMIQ 57 >UniRef50_B0V2Z3 Novel zinc finger protein (Fragment) n=2 Tax=Danio rerio RepID=B0V2Z3_DANRE Length = 1404 Score = 40.8 bits (94), Expect = 0.015, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 30/62 (48%), Gaps = 11/62 (17%) Query: 6 ISCPSCS----ATDGVVRNGKSTA----GHQRYLCSHCRKTWQLQFT---YTASQPGTHQ 54 I+CP C ++ + R+ ++ A QR+ CS CR+T+ F+ + ++ Sbjct: 122 IACPRCERRFTSSQDLDRHIQTHALSTYHTQRFKCSRCRRTFSTLFSRRRHEKRHENGNK 181 Query: 55 KI 56 KI Sbjct: 182 KI 183 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 40.4 bits (93), Expect = 0.018, Method: Composition-based stats. 
Identities = 11/35 (31%), Positives = 20/35 (57%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 + + M++NG+G RA R+ G+ NTI ++ Sbjct: 20 DVKELCVKMSLNGMGFRAIERVTGISHNTILNWVR 54 >UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PWW0_METMA Length = 155 Score = 40.4 bits (93), Expect = 0.019, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 40/100 (40%), Gaps = 14/100 (14%) Query: 5 SISCPS--CS-----ATDGVVRNGKSTAGHQR---YLCSHCRKTW---QLQFTYTASQPG 51 + CP+ C + ++ NG ++R Y+C C + + F + Sbjct: 12 DVFCPNKDCKLYGITGKENIIGNGTYEIKNKRVRKYICRECGRVFNDRTGTFFDNVRKDE 71 Query: 52 TH-QKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + + I MA+ G+ +A + ++ V T+ L + + Sbjct: 72 SDIKLAIKMAIKGMSIQAISDVLEVQPATVSNWLFRAAKQ 111 >UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolobus islandicus RepID=D2PJ85_SULIS Length = 82 Score = 40.0 bits (92), Expect = 0.023, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 22/62 (35%), Gaps = 5/62 (8%) Query: 27 HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGV---GLNTIFRH 83 QRYLC C + + Y ++ + M NGV + + T H Sbjct: 5 RQRYLCRDCGRYFLGDAIY--HSRELREEALKMYSNGVSPSLMGEKVNFIIYNIMTTLSH 62 Query: 84 LK 85 ++ Sbjct: 63 IQ 64 >UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L491_AMOA5 Length = 119 Score = 40.0 bits (92), Expect = 0.024, Method: Composition-based stats. Identities = 8/43 (18%), Positives = 21/43 (48%) Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 T + + + + G+G RA ++ + T+++ ++ SG Sbjct: 48 KPIQTKRLALQLYLEGLGFRAIGNLLQISYGTVYQWIEASGEQ 90 >UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_FUSNN Length = 428 Score = 40.0 bits (92), Expect = 0.025, Method: Composition-based stats. 
Identities = 17/101 (16%), Positives = 35/101 (34%), Gaps = 21/101 (20%) Query: 7 SCPSCSATDGVVRNGKSTAGH----------------QRYLCSHCRKTWQLQFT----YT 46 +CP C ++ +V+NG QRY+C C+KT+ + Sbjct: 52 TCPHC-SSKNIVKNGSRHRKIKYIPIQNHNIELELTVQRYICKDCKKTFSPSTNIVSDNS 110 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNS 87 + I + + A+ + + ++ R + N Sbjct: 111 SISNNLKYAIALELQKNISLTSIAKRYNISIPSVQRIMDNC 151 >UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepID=Q70JT0_MICAE Length = 112 Score = 40.0 bits (92), Expect = 0.026, Method: Composition-based stats. Identities = 12/47 (25%), Positives = 20/47 (42%), Gaps = 1/47 (2%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH 53 +CPSC + ++NG G + C C + + + T P T Sbjct: 34 TCPSCG-SHHTIKNGYLPKGKPKRHCQECGQPFVINPTNKTISPDTK 79 >UniRef50_D2LYX8 Tn5468, transposition protein D n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LYX8_BACS4 Length = 609 Score = 39.6 bits (91), Expect = 0.029, Method: Composition-based stats. Identities = 20/100 (20%), Positives = 40/100 (40%), Gaps = 16/100 (16%) Query: 4 VSISCPSCSATD----GVVRNGKSTAGHQRYLCSHCRKTWQL-------QFTYTASQPG- 51 ++ +CP + R K+ R+ C C ++ Y S+ Sbjct: 322 LNATCPDYKKNAIPEITIRRCEKTKKLIGRFTCHTCDFSYTRKGMDPNKDDCYKFSRIMD 381 Query: 52 ---THQKIIDMAMN-GVGCRATARIMGVGLNTIFRHLKNS 87 ++ + + +N G+ R ARI+GV NT+ ++ K + Sbjct: 382 FGFLWKRELQLLLNKGLSYREVARILGVDTNTVIKYEKKN 421 >UniRef50_UPI000186E028 transcription factor Sp4, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E028 Length = 749 Score = 39.6 bits (91), Expect = 0.032, Method: Composition-based stats. 
Identities = 10/55 (18%), Positives = 25/55 (45%), Gaps = 2/55 (3%) Query: 6 ISC-PSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL-QFTYTASQPGTHQKIID 58 + C + +D + R+ ++ G +R+ C C+K + + QK+++ Sbjct: 655 MYCGKRFTRSDELQRHRRTHTGEKRFQCPDCQKKFMRSDHLSKHIKTHQKQKLME 709 >UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVK5_PARL1 Length = 608 Score = 39.3 bits (90), Expect = 0.035, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 27/64 (42%), Gaps = 2/64 (3%) Query: 23 STAGHQRYLCSHCRKTWQLQFTYTASQ--PGTHQKIIDMAMNGVGCRATARIMGVGLNTI 80 S G QR+ C C++T+ + T Q P ++ + ++ R + G+ + Sbjct: 133 SRGGAQRFRCKACQRTFSVALKSTVRQRAPHLNRTVFAEVVSKKPLRGIMEVTGLSAAAV 192 Query: 81 FRHL 84 + L Sbjct: 193 YDKL 196 >UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderiaceae RepID=B5S3H3_RALSO Length = 460 Score = 39.3 bits (90), Expect = 0.037, Method: Composition-based stats. Identities = 13/91 (14%), Positives = 30/91 (32%), Gaps = 3/91 (3%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH---QKIID 58 ++ + SCP C + +G + C C + P + + Sbjct: 345 STHAASCPWCGSDQTKYHPAPRPSGLPGFRCRACLAYFTRVSNTPLVHPMARAYASRFVP 404 Query: 59 MAMNGVGCRATARIMGVGLNTIFRHLKNSGR 89 M AR +G+ + T+ +++ + Sbjct: 405 MLGWHETGAGAARELGIAMGTLHTWVRSWRQ 435 >UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VF2_TRIEI Length = 59 Score = 39.3 bits (90), Expect = 0.038, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 1/59 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM 59 M+ + CPSC ++ +V+NG Q+Y C +C++ + T + I + Sbjct: 1 MSIHKLICPSCG-SNHIVKNGTIHNKKQKYQCQNCQRQFVENSQRDYISNETKELIDKL 58 >UniRef50_UPI0001793827 PREDICTED: similar to CG5669 CG5669-PA n=1 Tax=Acyrthosiphon pisum RepID=UPI0001793827 Length = 640 Score = 39.3 bits (90), Expect = 0.041, Method: Composition-based stats. 
Identities = 10/54 (18%), Positives = 24/54 (44%), Gaps = 2/54 (3%) Query: 7 SC-PSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL-QFTYTASQPGTHQKIID 58 C + +D + R+ ++ G +R+ C+ C K + + QK+++ Sbjct: 536 HCGKRFTRSDELQRHNRTHTGEKRFQCNECPKRFMRSDHLQKHVRTHLKQKLME 589 >UniRef50_B8X8Z3 Resolvase n=1 Tax=Pectobacterium atrosepticum RepID=B8X8Z3_ERWCT Length = 109 Score = 39.3 bits (90), Expect = 0.043, Method: Composition-based stats. Identities = 12/46 (26%), Positives = 23/46 (50%), Gaps = 1/46 (2%) Query: 44 TYTASQPGTHQKIID-MAMNGVGCRATARIMGVGLNTIFRHLKNSG 88 Y A P Q+I+ NG+ + + + GV +T++ +K+S Sbjct: 63 FYRAQPPEIQQQIMQNAYNNGMTVKDISEVTGVATSTVYSKIKSSR 108 >UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CFC8 Length = 262 Score = 39.3 bits (90), Expect = 0.043, Method: Composition-based stats. Identities = 15/82 (18%), Positives = 22/82 (26%), Gaps = 5/82 (6%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAMN 62 C C + + G G QRY+C C K + F + + Sbjct: 97 QCLFCG-SHDFTKYGHKKDGTQRYICKGCGKRFTPLTNTIFDSKKIPISEWIEYLLHLFE 155 Query: 63 GVGCRATARIMGVGLNTIFRHL 84 +TA T L Sbjct: 156 FHSINSTAYDNRNSPTTGKYWL 177 >UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5E64 Length = 173 Score = 38.9 bits (89), Expect = 0.052, Method: Composition-based stats. Identities = 21/100 (21%), Positives = 34/100 (34%), Gaps = 25/100 (25%) Query: 7 SCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKT------WQLQFT 44 CP C ++RNG + QR+LC C KT + ++ Sbjct: 48 KCPFCGEK-HIIRNGTKLSKIKILDVSNTPSYLYLRKQRFLCKSCSKTFSASTNFVRKYC 106 Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 A I + N + + A+ V +T+ R L Sbjct: 107 NIAD--SIKLSIALESKNIISEKDIAKRFRVSSSTVKRSL 144 >UniRef50_Q4JSN3 Transposase for IS3507b n=53 Tax=Actinobacteridae RepID=Q4JSN3_CORJK Length = 422 Score = 38.5 bits (88), Expect = 0.063, Method: Composition-based stats. 
Identities = 17/81 (20%), Positives = 25/81 (30%), Gaps = 11/81 (13%) Query: 9 PSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK-----IIDMAMNG 63 P C + RNG ++ G R+ C HC + + ID G Sbjct: 36 PRCHCGGEMKRNGTTSKGTTRWRCKHCG------ASSVKRRIDITNSTGFTAFIDHLTTG 89 Query: 64 VGCRATARIMGVGLNTIFRHL 84 A +G T+ R Sbjct: 90 ASLDTIASRVGCSPRTLQRRF 110 >UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C34261 Length = 387 Score = 38.5 bits (88), Expect = 0.069, Method: Composition-based stats. Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 1/33 (3%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 CP C + + R G++ G QR C C+K W Sbjct: 74 CPDCYQRETI-RYGRNPQGSQRVQCRACKKVWT 105 >UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2CJK1_9FIRM Length = 422 Score = 38.5 bits (88), Expect = 0.072, Method: Composition-based stats. Identities = 13/99 (13%), Positives = 28/99 (28%), Gaps = 20/99 (20%) Query: 8 CPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFTYTASQPG 51 CP C + +++ G ++ Q+ C C K + L+ Sbjct: 48 CPHCGSNHNLIKYGFKSSNVRCSRAGDYPVIIDLKKQKMFCKSCNKYFLLETKIVDKHCN 107 Query: 52 ----THQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKN 86 + I+ + + V T+ R + Sbjct: 108 ISNQIKRHILASLTKKLSMKDIGSNNYVSTTTVARFMAK 146 >UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family protein n=1 Tax=Comamonas testosteroni KF-1 RepID=B7X577_COMTE Length = 471 Score = 38.5 bits (88), Expect = 0.074, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 25/78 (32%), Gaps = 17/78 (21%) Query: 8 CPSCSATDGVVRNG----------------KSTAGHQRYLCSHCRKTWQLQFTYTASQPG 51 CP C D + R+G K A QRY C+ C++T+ Sbjct: 36 CPKCGTLDCIYRHGTKATTYVDIPMRGKPAKLRAKVQRYRCTSCKETFLQPLGGILEGRR 95 Query: 52 THQKIIDMAMNGVGCRAT 69 ++ + R T Sbjct: 96 MTER-CATYIKAHSLRDT 112 >UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJK4_ACIJU Length = 460 Score = 38.1 bits (87), Expect = 0.078, Method: Composition-based stats. 
Identities = 18/93 (19%), Positives = 31/93 (33%), Gaps = 20/93 (21%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 CP C +D + ++G +RY C C+ T+ + T Sbjct: 33 KCPKCG-SDQLYKHGTKPVIYRDIPRHMKPTVINVEVKRYRCKSCKATFLQEVTGIYPDT 91 Query: 51 GTHQKIIDMAMN---GVGCRATARIMGVGLNTI 80 ++ + + TAR+MG TI Sbjct: 92 RMTERFVKKIQDICLDYTFSDTARMMGCDSKTI 124 >UniRef50_D1W685 Putative uncharacterized protein n=2 Tax=Prevotella RepID=D1W685_9BACT Length = 298 Score = 38.1 bits (87), Expect = 0.093, Method: Composition-based stats. Identities = 13/70 (18%), Positives = 22/70 (31%), Gaps = 6/70 (8%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 VV+ G QR+ C C +T+ + + + A V Sbjct: 2 VVKRGF-HKNRQRWYCKSCGRTFVG-----HKRLTEETVNTRYSKGNLTVEDLATEYAVS 55 Query: 77 LNTIFRHLKN 86 T++R L Sbjct: 56 TRTVYRRLSK 65 >UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_METMA Length = 148 Score = 38.1 bits (87), Expect = 0.093, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 31/81 (38%), Gaps = 4/81 (4%) Query: 12 SATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQ-KIIDMAMNGVGCR 67 + + V++ H + C C+K + + + + P + I M R Sbjct: 34 NQGNIVLKERYGKNNHALFKCKTCKKCFSETKGTIFFELNTPDEEVLRTIAMLPEKGSIR 93 Query: 68 ATARIMGVGLNTIFRHLKNSG 88 AR G +TI R L+ +G Sbjct: 94 GVARATGHSKDTICRWLEIAG 114 >UniRef50_UPI000186EB06 zinc finger protein 705A, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186EB06 Length = 555 Score = 38.1 bits (87), Expect = 0.099, Method: Composition-based stats. 
Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 5/70 (7%) Query: 8 CPSCSAT---DGVVRNGKSTAGHQRY--LCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 CP C T +R + T GH ++ C+ C K + + +P QK + + Sbjct: 469 CPECGKTFADRSNLRAHQRTRGHHKWEWRCASCNKAFSQERYLDRHRPEACQKYLQYTVR 528 Query: 63 GVGCRATARI 72 G + + I Sbjct: 529 QHGVKKFSEI 538 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo... 107 1e-22 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 107 1e-22 UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax... 104 8e-22 UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling... 100 2e-20 UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter... 100 2e-20 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 100 3e-20 UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S... 97 2e-19 UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale... 96 3e-19 UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea... 96 4e-19 UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr... 95 4e-19 UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin... 95 8e-19 UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria... 94 1e-18 UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano... 93 2e-18 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 92 5e-18 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 92 5e-18 UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp... 92 6e-18 UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriale... 89 4e-17 UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID... 87 1e-16 UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ... 
87 2e-16 UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp.... 87 3e-16 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 86 3e-16 UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD 86 4e-16 UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ... 85 5e-16 UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru... 85 5e-16 UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillu... 85 5e-16 UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI... 85 6e-16 UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyri... 85 7e-16 UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. ... 84 1e-15 UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitroso... 84 1e-15 UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid... 84 1e-15 UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 R... 84 2e-15 UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyric... 84 2e-15 UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes ... 83 2e-15 UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelan... 83 2e-15 UniRef50_C1I4B6 Putative uncharacterized protein n=2 Tax=Clostri... 82 4e-15 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 82 7e-15 UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum l... 82 7e-15 UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter... 82 7e-15 UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 81 9e-15 UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellula... 81 1e-14 UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax... 81 1e-14 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 81 1e-14 UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani ... 80 3e-14 UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavoba... 80 3e-14 UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gamm... 
79 3e-14 UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus... 79 4e-14 UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorick... 78 8e-14 UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae... 78 8e-14 UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE 78 1e-13 UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 78 1e-13 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 77 2e-13 UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryoc... 77 2e-13 UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus ... 75 5e-13 UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=D... 75 6e-13 UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoi... 75 7e-13 UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=H... 75 8e-13 UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl... 75 9e-13 UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU 75 1e-12 UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidob... 74 1e-12 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 74 1e-12 UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryoc... 73 2e-12 UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=enviro... 73 2e-12 UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=... 73 3e-12 UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodoba... 72 6e-12 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 71 7e-12 UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcti... 71 9e-12 UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon... 71 1e-11 UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p... 70 2e-11 UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus pl... 70 2e-11 UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium ... 70 2e-11 UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marin... 
70 3e-11 UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur... 69 5e-11 UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_... 68 6e-11 UniRef50_P04137 Uncharacterized protein in transposable element ... 68 6e-11 UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12... 68 1e-10 UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX 67 1e-10 UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_V... 67 1e-10 UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Met... 67 2e-10 UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria ... 66 3e-10 UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax... 65 8e-10 UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI... 65 1e-09 UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_E... 63 2e-09 UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C... 63 3e-09 UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodoba... 63 4e-09 UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=... 62 5e-09 UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobact... 61 9e-09 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 61 1e-08 UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobi... 61 1e-08 UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl... 61 1e-08 UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultu... 59 4e-08 UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia ... 59 5e-08 UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q... 57 2e-07 UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichod... 57 2e-07 UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervido... 57 2e-07 UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw... 55 5e-07 UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacte... 
55 7e-07 UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nod... 55 1e-06 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 52 4e-06 UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae... 52 6e-06 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 51 8e-06 UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia R... 51 1e-05 UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseud... 50 3e-05 Sequences not found previously or not previously below threshold: UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 2... 71 7e-12 UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychro... 70 2e-11 UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus p... 60 2e-08 UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax... 59 4e-08 UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobact... 58 1e-07 UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax... 58 1e-07 UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_... 57 2e-07 UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultu... 57 2e-07 UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synecho... 57 2e-07 UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacte... 56 3e-07 UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=... 56 3e-07 UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q... 56 3e-07 UniRef50_Q4JSN3 Transposase for IS3507b n=53 Tax=Actinobacterida... 56 4e-07 UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methano... 55 1e-06 UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae... 54 2e-06 UniRef50_C7N1Y2 Putative uncharacterized protein n=1 Tax=Slackia... 54 2e-06 UniRef50_A1VN28 Insertion element protein n=1 Tax=Polaromonas na... 53 2e-06 UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracas... 53 2e-06 UniRef50_C9BRL4 Transposase n=30 Tax=Enterococcus RepID=C9BRL4_E... 
53 2e-06 UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida... 53 3e-06 UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellula... 53 3e-06 UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2... 52 5e-06 UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus ... 51 1e-05 UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methano... 51 1e-05 UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros... 51 1e-05 UniRef50_Q2RQJ8 Putative uncharacterized protein n=1 Tax=Rhodosp... 50 2e-05 UniRef50_C5S2C5 Putative transposase n=1 Tax=Actinobacillus mino... 50 2e-05 UniRef50_B4VTL4 Putative uncharacterized protein n=1 Tax=Microco... 50 2e-05 UniRef50_B4WUH8 Putative uncharacterized protein n=1 Tax=Synecho... 50 2e-05 UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT... 50 3e-05 UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacteriu... 50 3e-05 UniRef50_Q7NH53 TetR family transcriptional regulatory protein n... 50 3e-05 UniRef50_D1PSS1 Insertion element protein (Fragment) n=14 Tax=Pr... 49 4e-05 UniRef50_Q03NU3 Transposase n=12 Tax=Lactobacillus RepID=Q03NU3_... 49 4e-05 UniRef50_C1DPZ8 Transposase n=4 Tax=Bacteria RepID=C1DPZ8_AZOVD 49 4e-05 UniRef50_D1QQX4 Putative uncharacterized protein n=15 Tax=Prevot... 49 4e-05 UniRef50_Q9V1K2 Putative uncharacterized protein n=2 Tax=Pyrococ... 49 4e-05 UniRef50_Q11ZU0 Putative uncharacterized protein n=1 Tax=Polarom... 49 5e-05 UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryoc... 49 5e-05 UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichod... 49 6e-05 UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepI... 49 6e-05 UniRef50_Q7VL05 Possible transposase n=4 Tax=Pasteurellaceae Rep... 49 6e-05 UniRef50_C0WLQ9 Transposase n=3 Tax=Lactobacillus RepID=C0WLQ9_L... 49 6e-05 UniRef50_B8F7V2 ISRssp2, family IS1595 n=4 Tax=Pasteurellaceae R... 48 6e-05 UniRef50_C9CRL2 Transposase n=3 Tax=Alphaproteobacteria RepID=C9... 
48 6e-05 UniRef50_B3GXU2 Transposase n=15 Tax=Pasteurellaceae RepID=B3GXU... 48 7e-05 UniRef50_D0U1S9 Transposase n=1 Tax=Enterococcus faecium RepID=D... 48 8e-05 UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanoth... 48 9e-05 UniRef50_A7JMB8 Predicted protein n=8 Tax=Francisella RepID=A7JM... 48 9e-05 UniRef50_C7RJT2 Conserved possible transposase n=21 Tax=Proteoba... 48 1e-04 UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderi... 48 1e-04 UniRef50_Q6MK35 Putative transposase n=1 Tax=Bdellovibrio bacter... 48 1e-04 UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_M... 47 2e-04 UniRef50_D2LK53 Putative uncharacterized protein n=1 Tax=Rhodomi... 47 2e-04 UniRef50_B9JNY3 Transposase n=4 Tax=Alphaproteobacteria RepID=B9... 47 2e-04 UniRef50_Q035C5 Transposase n=27 Tax=Lactobacillales RepID=Q035C... 47 2e-04 UniRef50_A8UDH0 Transposase n=5 Tax=Bacteria RepID=A8UDH0_9LACT 47 2e-04 UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostoca... 47 2e-04 UniRef50_C5RB59 Possible transposase n=1 Tax=Weissella paramesen... 46 2e-04 UniRef50_C8SCF8 Putative uncharacterized protein n=1 Tax=Ferrogl... 46 2e-04 UniRef50_B2J7N9 Putative uncharacterized protein n=1 Tax=Nostoc ... 46 3e-04 UniRef50_C6QEP3 ISSpo8, transposase n=4 Tax=Alphaproteobacteria ... 46 3e-04 UniRef50_Q93CQ1 Transposase TnpA n=1 Tax=Enterococcus faecium Re... 46 3e-04 UniRef50_UPI0001C31088 transcriptional regulator, TetR family n=... 46 3e-04 UniRef50_B0CG58 Transcriptional regulator, TetR family n=1 Tax=A... 46 3e-04 UniRef50_B9ZCS9 DNA topoisomerase type IA zn finger domain prote... 46 3e-04 UniRef50_A9IG79 ISSod11, transposase n=14 Tax=Proteobacteria Rep... 46 4e-04 UniRef50_Q5LW63 ISSpo8, transposase n=4 Tax=Rhodobacterales RepI... 46 4e-04 UniRef50_Q2J1M8 Putative uncharacterized protein n=1 Tax=Rhodops... 46 4e-04 UniRef50_A8YX76 Transposase n=42 Tax=Lactobacillus RepID=A8YX76_... 46 4e-04 UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synecho... 
46 4e-04 UniRef50_Q2P6H2 ISXo5 transposase n=74 Tax=Xanthomonas RepID=Q2P... 46 5e-04 UniRef50_Q3Y3Y3 Transposase, IS204/IS1001/IS1096/IS1165 n=11 Tax... 46 5e-04 UniRef50_Q03IY7 Transposase n=198 Tax=Lactobacillales RepID=Q03I... 46 5e-04 UniRef50_A5FLG0 Putative uncharacterized protein n=1 Tax=Flavoba... 46 5e-04 UniRef50_A7HMZ5 Transposase IS204/IS1001/IS1096/IS1165 family pr... 46 5e-04 UniRef50_D1UAU0 Transposase, putative n=1 Tax=Desulfovibrio aesp... 45 6e-04 UniRef50_B1IC92 Transposase n=24 Tax=Lactobacillales RepID=B1IC9... 45 6e-04 UniRef50_B9Y9S5 Putative uncharacterized protein (Fragment) n=1 ... 45 6e-04 UniRef50_B2JXE0 Putative uncharacterized protein n=2 Tax=Burkhol... 45 8e-04 UniRef50_B2SIA3 ISXo5 transposase n=157 Tax=Proteobacteria RepID... 45 8e-04 UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parviba... 45 8e-04 UniRef50_B7C761 Putative uncharacterized protein n=1 Tax=Eubacte... 45 8e-04 UniRef50_Q7N9S9 Transposase TnpA, ISL3 family n=1 Tax=Photorhabd... 45 9e-04 UniRef50_UPI000186E028 transcription factor Sp4, putative n=1 Ta... 45 9e-04 UniRef50_C7P9K3 Transcriptional regulator, ArsR family n=2 Tax=M... 45 0.001 UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family pr... 45 0.001 UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candida... 45 0.001 UniRef50_B0SXP6 Putative transposase n=1 Tax=Caulobacter sp. K31... 44 0.001 UniRef50_B9JG85 Putative uncharacterized protein n=1 Tax=Agrobac... 44 0.001 UniRef50_C3MUP9 Resolvase helix-turn-helix domain protein n=40 T... 44 0.001 UniRef50_Q1GHU2 Putative uncharacterized protein n=1 Tax=Ruegeri... 44 0.002 UniRef50_A5VLK7 Transposase, IS204/IS1001/IS1096/IS1165 family p... 44 0.002 UniRef50_A2V378 Putative uncharacterized protein n=1 Tax=Shewane... 44 0.002 UniRef50_A7HYI5 Putative uncharacterized protein n=1 Tax=Parviba... 44 0.002 UniRef50_B2UM39 Putative uncharacterized protein n=1 Tax=Akkerma... 
44 0.002 UniRef50_C2H217 Possible transposase n=5 Tax=Enterococcaceae Rep... 43 0.002 UniRef50_D0MDA7 Transposase-like protein n=7 Tax=Bacteria RepID=... 43 0.002 UniRef50_Q8R819 Transposase n=2 Tax=Thermoanaerobacter tengconge... 43 0.002 UniRef50_C6HZQ4 Transposase n=2 Tax=Leptospirillum ferrodiazotro... 43 0.002 UniRef50_C0WEV9 Transposase (Fragment) n=1 Tax=Acidaminococcus s... 43 0.002 UniRef50_D1W685 Putative uncharacterized protein n=2 Tax=Prevote... 43 0.003 UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH... 43 0.003 UniRef50_A8A9S5 Transcriptional regulator, AsnC family n=1 Tax=I... 43 0.003 UniRef50_C4RAC5 Transposase n=2 Tax=magnetite-containing magneti... 43 0.003 UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=... 43 0.003 UniRef50_C0W2A4 Transposase (Fragment) n=1 Tax=Actinomyces coleo... 43 0.003 UniRef50_Q894I5 Phage-related protein n=1 Tax=Clostridium tetani... 43 0.004 UniRef50_B8FWC8 Putative uncharacterized protein n=1 Tax=Desulfi... 43 0.004 UniRef50_Q6V7R1 Bcep22gp32 n=1 Tax=Burkholderia phage Bcep22 Rep... 43 0.004 UniRef50_UPI00016C448A hypothetical protein GobsU_12575 n=6 Tax=... 43 0.004 UniRef50_Q54X15 Type-2 histone deacetylase 1 n=1 Tax=Dictyosteli... 43 0.004 UniRef50_UPI00016C46F4 hypothetical protein GobsU_15563 n=2 Tax=... 43 0.004 UniRef50_A2A935 PR domain zinc finger protein 16 n=35 Tax=Eutele... 43 0.004 UniRef50_UPI00016C5DA1 ISSpo8, transposase n=1 Tax=Clostridium d... 43 0.004 UniRef50_Q8U293 Transposase n=53 Tax=Pyrococcus RepID=Q8U293_PYRFU 43 0.004 UniRef50_Q9HAZ2 PR domain zinc finger protein 16 n=26 Tax=Eutele... 42 0.004 UniRef50_C1F2K1 Unclassified family transposase n=1 Tax=Acidobac... 42 0.005 UniRef50_Q87RY6 Putative resolvase n=3 Tax=Vibrio parahaemolytic... 42 0.005 UniRef50_Q12Y80 Transposase n=1 Tax=Methanococcoides burtonii DS... 42 0.005 UniRef50_A9BGL8 Transposase IS204/IS1001/IS1096/IS1165 family pr... 
42 0.005 UniRef50_D2EIL2 Transposase n=1 Tax=Pediococcus acidilactici 7_4... 42 0.005 UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolo... 42 0.006 UniRef50_D2MKS9 ISXo5 transposase n=1 Tax=Candidatus Poribacteri... 42 0.006 UniRef50_Q4S840 Chromosome 9 SCAF14710, whole genome shotgun seq... 42 0.006 UniRef50_Q4L7B5 Transposase for ISSha1 n=49 Tax=Staphylococcus R... 42 0.006 UniRef50_C3XYB0 Putative uncharacterized protein n=2 Tax=Chordat... 42 0.006 UniRef50_C6MCG8 Putative uncharacterized protein n=17 Tax=Proteo... 42 0.006 UniRef50_A0Q207 Transcriptional regulator n=3 Tax=Clostridium Re... 42 0.006 UniRef50_A3VEU0 ISSpo8, transposase n=1 Tax=Rhodobacterales bact... 42 0.007 UniRef50_C9RDH8 Regulatory protein LacI n=1 Tax=Ammonifex degens... 42 0.007 UniRef50_D1VTW0 Transposase (Fragment) n=1 Tax=Peptoniphilus lac... 42 0.007 UniRef50_Q03112 MDS1 and EVI1 complex locus protein EVI1 n=58 Ta... 42 0.007 UniRef50_C6P8Q1 Transposase IS3/IS911 family protein n=1 Tax=The... 42 0.007 UniRef50_A7BQK2 Transposase n=3 Tax=Bacteria RepID=A7BQK2_9GAMM 41 0.008 UniRef50_Q30XD0 Transposase n=4 Tax=Proteobacteria RepID=Q30XD0_... 41 0.008 UniRef50_C7XW38 Transposase ISLasa4v n=4 Tax=Lactobacillus RepID... 41 0.008 UniRef50_C1SJT0 Transposase family protein, COG3464 n=1 Tax=Deni... 41 0.008 UniRef50_Q5ZT03 Transposase (IS652) n=29 Tax=Gammaproteobacteria... 41 0.008 UniRef50_B2JMI5 Transposase n=2 Tax=Burkholderia RepID=B2JMI5_BURP8 41 0.009 UniRef50_B8F7J2 Putative uncharacterized protein n=1 Tax=Haemoph... 41 0.009 UniRef50_Q9SVC5 Dof zinc finger protein DOF3.5 n=2 Tax=Arabidops... 41 0.009 UniRef50_A5KRX5 ISSpo8, transposase n=2 Tax=candidate division T... 41 0.009 UniRef50_D2M0Z3 Two component transcriptional regulator, LuxR fa... 41 0.010 UniRef50_A6Q3M3 Transposase n=1 Tax=Nitratiruptor sp. SB155-2 Re... 41 0.010 UniRef50_B2SSB8 Transposase TnpA, ISL3 family n=6 Tax=Bacteria R... 41 0.010 UniRef50_B2TB85 Transposase IS3/IS911 family protein n=2 Tax=Bur... 
41 0.010 UniRef50_A9VK04 Transposase IS3/IS911 family protein n=12 Tax=Ba... 41 0.010 >UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX Length = 91 Score = 107 bits (267), Expect = 1e-22, Method: Composition-based stats. Identities = 91/91 (100%), Positives = 91/91 (100%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 107 bits (267), Expect = 1e-22, Method: Composition-based stats. Identities = 38/91 (41%), Positives = 52/91 (57%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V+I CP C + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 NG G R TAR + +G NT+ R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 104 bits (260), Expect = 8e-22, Method: Composition-based stats. Identities = 42/91 (46%), Positives = 59/91 (64%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASV+I CP C + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 NG G R TAR + +G+NT+ R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQSE 90 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. 
Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M +++C T + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 +NG G R TAR++GV NT+ K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 100 bits (248), Expect = 2e-20, Method: Composition-based stats. Identities = 40/90 (44%), Positives = 56/90 (62%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V + CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 MN G R TAR + + +N + R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 99.6 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 2/87 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 +SCPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSCG-SHHVVKCGR-PLGRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRATARIMGVGLNTIFRHLKNSGRSR 91 RA +R++ V L T+F +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 96.9 bits (240), Expect = 2e-19, Method: Composition-based stats. 
Identities = 28/84 (33%), Positives = 37/84 (44%), Gaps = 5/84 (5%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF----TYTASQPGTHQKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + + T ++ ID MNG Sbjct: 52 CPLCGCI-HVVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRATARIMGVGLNTIFRHLKNS 87 + R TA G+ NT F Sbjct: 111 LSIRKTAVACGIHRNTAFLWRHKI 134 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 96.1 bits (238), Expect = 3e-19, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 2/80 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + VV+NG G Q YLC C + ++ + + M++NG+G Sbjct: 1 MDCPYCQ-SHKVVKNGH-RQGKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMG 58 Query: 66 CRATARIMGVGLNTIFRHLK 85 RA R+ G+ NTI ++ Sbjct: 59 FRAIERVTGISHNTILNWVR 78 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 95.8 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 30/91 (32%), Positives = 48/91 (52%), Gaps = 3/91 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+SV+I CP C + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MN--GVGCRATARIMGVGLNTIFRHLKNSGR 89 N G+ AR+ G+ +F+ K Sbjct: 60 FNEPGMMLARMARLHGIQPCQLFKWKKQYLE 90 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 95.4 bits (236), Expect = 4e-19, Method: Composition-based stats. 
Identities = 37/91 (40%), Positives = 57/91 (62%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA++ + C C+ T+ V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 MN G R TA ++ V NT+ LKNS + + Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNSRQGK 91 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 94.6 bits (234), Expect = 8e-19, Method: Composition-based stats. Identities = 39/91 (42%), Positives = 53/91 (58%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M I+CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M Sbjct: 1 MKMGDIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMM 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 +G R AR +GV L T+ RHLK+ ++ Sbjct: 61 NDGSEQRDIARKLGVSLETVLRHLKDLRLNK 91 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 94.2 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 39/85 (45%), Gaps = 6/85 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT----YTASQPGTHQKIIDMAMN 62 CP C+ +D V+NGK+ HQRY+C C KT+ T G K ID +N Sbjct: 47 HCPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCLVN 104 Query: 63 GVGCRATARIMGVGLNTIFRHLKNS 87 R TA+I G+ L T F Sbjct: 105 KYPLRKTAKICGISLPTAFVWRHKI 129 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 93.1 bits (230), Expect = 2e-18, Method: Composition-based stats. 
Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++CP C+++ +NG G Q Y C C + ++ TAS P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYLEGLG 58 Query: 66 CRATARIMGVGLNTIFRHLKNSGR 89 R+ R +GV ++ + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 91.9 bits (227), Expect = 5e-18, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 3/86 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF-TYTASQPGTHQKIIDMAMNGV 64 + CP C AT+ + +NGK G Q ++C+ C + + + Q+ ++M +NG+ Sbjct: 1 MQCPYCGATE-IRKNGK-RRGKQNHICTKCERQFIDVYDPPKGYSEELKQECLEMYLNGM 58 Query: 65 GCRATARIMGVGLNTIFRHLKNSGRS 90 G R R+ GV TI +K G Sbjct: 59 GFRPIERVKGVHHTTIIFWVKQMGEK 84 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 91.9 bits (227), Expect = 5e-18, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++CP C+++ +NG G QRY C C + ++ T+ P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYLEGLG 58 Query: 66 CRATARIMGVGLNTIFRHLKNSGR 89 R+ R +GV ++ + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JAS9_9ALTE Length = 181 Score = 91.9 bits (227), Expect = 6e-18, Method: Composition-based stats. 
Identities = 24/87 (27%), Positives = 34/87 (39%), Gaps = 4/87 (4%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNG 63 CP C + +R G S QRY C C KT+ Y + + ++ G Sbjct: 56 QCPYCQ-SKTFIRWGSSENERQRYRCKRCAKTFNALVGSPLYRMRKEELWLEYVETMRYG 114 Query: 64 VGCRATARIMGVGLNTIFRHLKNSGRS 90 + R A++ GV L T FR S Sbjct: 115 LSLRKAAKVTGVSLRTAFRWRHAFLSS 141 >UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriales RepID=Q116V8_TRIEI Length = 108 Score = 89.2 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 2/81 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + +V+NG G Q YLC C + ++ + + M ++G+G Sbjct: 1 MHCPYCQ-SHKIVKNGH-RNGKQSYLCRKCGRQFRENPCPIGYSSEVKEACLKMFLSGMG 58 Query: 66 CRATARIMGVGLNTIFRHLKN 86 RA R G+ N++ ++ Sbjct: 59 FRAIERATGISHNSVLNWVRR 79 >UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID=A9VV42_BACWK Length = 342 Score = 87.3 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 35/91 (38%), Gaps = 5/91 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT---YTASQPGTHQKIIDM 59 CP C+ ++ VVR GK QRY C C KT+ Y + +D Sbjct: 51 KEGFECPHCA-SEHVVRFGK-HNNRQRYRCKCCSKTFTDTTNTVLYRTRKGNEWITFVDC 108 Query: 60 AMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 G R +A I+GV T+F + Sbjct: 109 MFKGYSLRKSAEIVGVTWVTLFYWRHKLLSA 139 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 86.9 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 6/84 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMNG 63 CP C ++ + RNGK G QRY+C C+KT+ + K +NG Sbjct: 54 CPLCG-SETISRNGK-YNGKQRYICKSCKKTFTDFTNSATYKSKKTLDKWLKYAKCMING 111 Query: 64 VGCRATARIMGVGLNTIFRHLKNS 87 R +A+I+ + + T F Sbjct: 112 YSIRKSAKIVEINIATSFFWRHKI 135 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. 
PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 86.5 bits (213), Expect = 3e-16, Method: Composition-based stats. Identities = 30/94 (31%), Positives = 47/94 (50%), Gaps = 4/94 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHC---RKTWQLQFTYTASQPGTHQKII 57 M + PSC ++D VV+ + T G QRY C + R T+ Q+ Y Q+I+ Sbjct: 1 MVLEPVLYPSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIV 59 Query: 58 DMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 +M +NG G R AR++ + T+ LK S + Sbjct: 60 EMVVNGSGTRDPARVLKISRTTVTETLKKSSSAE 93 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 86.1 bits (212), Expect = 3e-16, Method: Composition-based stats. Identities = 66/74 (89%), Positives = 67/74 (90%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 +VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG Sbjct: 1 MVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 60 Query: 77 LNTIFRHLKNSGRS 90 LNTI RHL Sbjct: 61 LNTILRHLNKLRPQ 74 >UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD Length = 317 Score = 85.7 bits (211), Expect = 4e-16, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 32/93 (34%), Gaps = 5/93 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKII 57 M + SCP C +++ ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + + R A+ GV NT F Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHRFLTQ 128 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 85.4 bits (210), Expect = 5e-16, Method: Composition-based stats. 
Identities = 22/85 (25%), Positives = 36/85 (42%), Gaps = 2/85 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 I CP CS + +NG G Q Y+C C + + + + + +NG+ Sbjct: 10 PIQCPDCSC-QHIPKNGHQP-GKQNYICVACSHQFIKPYHPQEYSDNVKRLFLRIYVNGM 67 Query: 65 GCRATARIMGVGLNTIFRHLKNSGR 89 G R A + GV TI +K++ Sbjct: 68 GIRRIAWVKGVTYPTIINLIKHTRE 92 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 85.4 bits (210), Expect = 5e-16, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + C C +++ +V+NG S +G Q+Y C C L +KI+ Sbjct: 1 MIKETYECRECGSSN-IVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKN 86 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLKK 85 >UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillus RepID=A6CNB6_9BACI Length = 335 Score = 85.4 bits (210), Expect = 5e-16, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 31/88 (35%), Gaps = 5/88 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQPGTHQKIIDM 59 + C C + V RNGK QRYLC C K++ G K M Sbjct: 49 KEGLGCIHCGSV-KVKRNGKYRE-RQRYLCRDCGKSFNELSNTPIAGTRYLGKWAKYFHM 106 Query: 60 AMNGVGCRATARIMGVGLNTIFRHLKNS 87 + G A+ + + ++T F Sbjct: 107 MVEGYTLPKIAKRLKIHISTAFYWRHKI 134 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 85.0 bits (209), Expect = 6e-16, Method: Composition-based stats. 
Identities = 32/85 (37%), Positives = 47/85 (55%), Gaps = 4/85 (4%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLC--SHC-RKTWQLQFTYTASQPGTHQKIIDMAMN 62 I CP C +TD VV+NG S G QRY C C R+++ ++Y + ++I M +N Sbjct: 7 IECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMVVN 65 Query: 63 GVGCRATARIMGVGLNTIFRHLKNS 87 G G R TAR++ + T+ LK S Sbjct: 66 GSGIRDTARVLEISPITVASELKKS 90 >UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyricum RepID=B1QSI6_CLOBU Length = 336 Score = 85.0 bits (209), Expect = 7e-16, Method: Composition-based stats. Identities = 23/95 (24%), Positives = 37/95 (38%), Gaps = 8/95 (8%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSH--CRKTWQLQ----FTYTASQPGTHQK 55 SCP C ++ GK QRY C + C KT+ + Y QP + Sbjct: 29 IKEYESCPYCGC-KHFIKYGK-YQDIQRYKCKNEECGKTFSNTTFSVWKYLKYQPEKWIE 86 Query: 56 IIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 I++ G+ ++ARI+ + T F + Sbjct: 87 FIELMCEGMTLESSARILKITTTTAFYWRHKILHA 121 >UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6ARX2_9BACT Length = 133 Score = 84.2 bits (207), Expect = 1e-15, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 32/89 (35%), Gaps = 5/89 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ---LQFTYTASQPGTHQKIIDM 59 S CP C + V + G+ G QRY C CR+ + + ++ Sbjct: 47 SEHPRCPHCQD-EHVAKWGRVK-GLQRYRCEACRRQFTPLTNTPLSGLRKREKWGAYLEA 104 Query: 60 AMNGVGCRATARIMGVGLNTIFRHLKNSG 88 +G+ R A+ +GV T F Sbjct: 105 MEDGLSVRKAAQRIGVNHKTTFLWRHRFS 133 >UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitrosomonas europaea RepID=Q81ZP0_NITEU Length = 323 Score = 83.8 bits (206), Expect = 1e-15, Method: Composition-based stats. 
Identities = 22/89 (24%), Positives = 32/89 (35%), Gaps = 5/89 (5%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ---LQFTYTASQPGTHQKIID 58 +S CP C ++ R G AG QR+ C C+ T+ Sbjct: 43 SSFEPICPVCQ-SNHFYRWGY-QAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA 100 Query: 59 MAMNGVGCRATARIMGVGLNTIFRHLKNS 87 + G+ RA+AR + NT FR Sbjct: 101 ALIEGLTVRASARQCRIDKNTSFRWRHRF 129 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 83.8 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 22/86 (25%), Positives = 41/86 (47%), Gaps = 2/86 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 +++CP C+ ++G G QRY C CR + + T +K + + + G+ Sbjct: 3 TMNCPRCNNAHSC-KDGIVR-GRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLYLEGL 60 Query: 65 GCRATARIMGVGLNTIFRHLKNSGRS 90 G RA RI+ + T+++ +K G Sbjct: 61 GFRAIGRILNISYGTVYQWVKACGDQ 86 >UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 RepID=P73782_SYNY3 Length = 141 Score = 83.8 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 2/88 (2%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 S CP C + VV+NG G QR+ C C+ + + + + M Sbjct: 2 STHCHCPQCGHGN-VVKNGFVK-GKQRFKCKRCQYKFTNLSKERGKLLWMKLEAVLLYMG 59 Query: 63 GVGCRATARIMGVGLNTIFRHLKNSGRS 90 G+ ATA+++GV ++ +++ G + Sbjct: 60 GMSMNATAKLLGVSTQSLLNWIRDFGEA 87 >UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyricum RepID=C4IIL3_CLOBU Length = 325 Score = 83.8 bits (206), Expect = 2e-15, Method: Composition-based stats. 
Identities = 20/89 (22%), Positives = 32/89 (35%), Gaps = 6/89 (6%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKII 57 CP C + ++ GK G QRY C C+KT+ + Y P K I Sbjct: 29 IKEYSCCPHCKNVE-FIKFGK-YDGIQRYRCKSCKKTFSYTTNSLWKYLKHPPEKWFKFI 86 Query: 58 DMAMNGVGCRATARIMGVGLNTIFRHLKN 86 ++ A+ + + + T F Sbjct: 87 ELLGEKKTLEYCAKTLKISIVTAFNWRHK 115 >UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes RepID=D2QCU0_9SPHI Length = 139 Score = 83.4 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 35/86 (40%), Gaps = 2/86 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 ++ CP C++ D V RNG QR+ C C + + K + + + GV Sbjct: 3 TLKCPKCNSVDAV-RNG-IVNQRQRFRCKKCNYNFTVGKVGKGISTYYVIKALQLYIEGV 60 Query: 65 GCRATARIMGVGLNTIFRHLKNSGRS 90 R R++G+ ++ +K Sbjct: 61 SFREIERLLGISHVSVMNWVKKYQIK 86 >UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelandii RepID=Q9AMR3_AZOVI Length = 214 Score = 83.4 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 32/93 (34%), Gaps = 5/93 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKII 57 M + SCP C +++ ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + + R A+ GV NT F Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHRFLTQ 128 >UniRef50_C1I4B6 Putative uncharacterized protein n=2 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4B6_9CLOT Length = 361 Score = 82.3 bits (202), Expect = 4e-15, Method: Composition-based stats. 
Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 8/86 (9%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLC--SHCRKTWQLQ----FTYTASQPGTHQKIIDMAM 61 CP+C+ ++ ++ GK G QR+ C C KT+ + F+ + K + + Sbjct: 57 CPNCN-SNNFIKYGK-YRGLQRFKCLNKDCCKTFSQKTNSIFSNSKKPLELWLKYLILMN 114 Query: 62 NGVGCRATARIMGVGLNTIFRHLKNS 87 N R + I+G+ L T F Sbjct: 115 NKFSLRKCSSILGINLATSFYWRHKF 140 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 81.5 bits (200), Expect = 7e-15, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 47/85 (55%), Gaps = 3/85 (3%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 + +CPSC +D V++NG S+ G +Y C+ CR+T+ + S+ ++I+ +N Sbjct: 67 IRPNCPSC-KSDKVIKNG-SSRGKTKYKCNVCRRTFYDANSRRMSR-EQKERILKEYLNR 123 Query: 64 VGCRATARIMGVGLNTIFRHLKNSG 88 + R A++ G L T++ +K G Sbjct: 124 MSMRGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HSL0_PARL1 Length = 342 Score = 81.5 bits (200), Expect = 7e-15, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 33/94 (35%), Gaps = 11/94 (11%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCS-----HCRKTW---QLQFTYTASQPGTHQKIIDM 59 CP C D +V++G+ G QR+ C C +T+ +P M Sbjct: 55 CPHCGH-DDIVKHGRDRGGRQRFRCRRSGSSGCGQTFNALTGTAFTRMRKPEKWAAYARM 113 Query: 60 AMNGVGCRATARI--MGVGLNTIFRHLKNSGRSR 91 G + +G+ T +R R++ Sbjct: 114 MATGFKSVDDVKTSGLGISRLTAWRWRHRLLRAQ 147 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 81.5 bits (200), Expect = 7e-15, Method: Composition-based stats. 
Identities = 31/86 (36%), Positives = 46/86 (53%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + + C C +D VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKN 86 G RAT+R + V NT+ Sbjct: 61 AQNHGKRATSRHLQVSYNTVLSACHR 86 >UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IP3_CLOAB Length = 171 Score = 81.1 bits (199), Expect = 9e-15, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 9/92 (9%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIID 58 V + C + RNGK QRY+C C+KT+ + + + Sbjct: 50 KVYLHC----KLEMFSRNGKHDE-KQRYVCKTCKKTFTDFTYSPISSSKKPLDKWLQYAK 104 Query: 59 MAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + G R A+ + + + T F + Sbjct: 105 CMIVGYSIRKCAKTVNINIATSFFWRHKILEA 136 >UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellular organisms RepID=B0ABB1_9CLOT Length = 454 Score = 81.1 bits (199), Expect = 1e-14, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 6/89 (6%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF----TYTASQPGTHQKIID 58 + CP C + D + +NGK+ QRY+C +CR T+ + + T T K Sbjct: 136 KNDLKCPKCGSFD-LNKNGKT-NQRQRYICKNCRTTFDERSFSPLSNTKLSLDTWLKYCQ 193 Query: 59 MAMNGVGCRATARIMGVGLNTIFRHLKNS 87 + G + A+ +GV + T F Sbjct: 194 FMIEGGTIKYCAQKVGVSIPTSFFMRHRI 222 >UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AFFE Length = 357 Score = 80.7 bits (198), Expect = 1e-14, Method: Composition-based stats. 
Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 5/84 (5%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNG 63 CP C + +NGK HQRY+C C K++ + F ++ I++ + Sbjct: 50 CPICGSV-HFKKNGKDKNRHQRYICLDCHKSFSDRTNTLFYWSHFTLDQWLHFIELELYK 108 Query: 64 VGCRATARIMGVGLNTIFRHLKNS 87 + A+++ T F Sbjct: 109 MPLEGEAQVLETSKTTCFYMRHKL 132 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 80.7 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 1/85 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I+CP C V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKE-KKVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRATARIMGVGLNTIFRHLKNSGRS 90 R+TARI+ + T+ + + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGRK 95 >UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani RepID=Q891N5_CLOTE Length = 279 Score = 79.6 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 6/84 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMNG 63 C C ++ +V+NGK QRY+C C KT+ +Y+ + + G Sbjct: 59 CVHC-KSENIVKNGKYKE-KQRYICKDCHKTFTNYTNSPISYSKKNISKWIEYTKCMLAG 116 Query: 64 VGCRATARIMGVGLNTIFRHLKNS 87 R +++++G+ L+T F Sbjct: 117 YSLRKSSKLVGISLSTAFYWRHKI 140 >UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X4X6_FLAB3 Length = 169 Score = 79.6 bits (195), Expect = 3e-14, Method: Composition-based stats. 
Identities = 21/84 (25%), Positives = 34/84 (40%), Gaps = 2/84 (2%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCR 67 CP C VV++G QR+LC C + ++ K + + + G+ R Sbjct: 36 CPKCQQ-QNVVKSGIVKE-RQRFLCRSCNYYFTVKKLGKQIDDYYVTKALQLYLEGLSYR 93 Query: 68 ATARIMGVGLNTIFRHLKNSGRSR 91 RI+GV TI ++ R Sbjct: 94 EIERILGVSHVTISSWVRKYNIKR 117 >UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gammaproteobacteria RepID=A1SXI4_PSYIN Length = 319 Score = 79.2 bits (194), Expect = 3e-14, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 31/92 (33%), Gaps = 5/92 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDM 59 + S CP C + GK+ + QRY C C KT+ + K + Sbjct: 50 NSSPQCPHCHCA-HFTKWGKAGS-VQRYKCFSCHKTFNNKTKTPLAKLHRCELWDKYAEC 107 Query: 60 AMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 + R A + + L T F ++ Sbjct: 108 MSLKLTLREAAAVCNINLKTSFLWRHRFLMAQ 139 >UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus communis RepID=B9TDK1_RICCO Length = 321 Score = 79.2 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 29/83 (34%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKIIDMAMNGV 64 CP C R G+ +G QR+ C HC ++ + + + Sbjct: 52 CPHCGCARK-HRCGQ-ASGLQRFRCLHCGRSHNALTKTPLARLRKKECWLPYLQCVLESR 109 Query: 65 GCRATARIMGVGLNTIFRHLKNS 87 R A+I+GV T FR Sbjct: 110 TVRDAAQIVGVHRTTSFRWRHRF 132 >UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDS3_NEOSM Length = 134 Score = 78.0 bits (191), Expect = 8e-14, Method: Composition-based stats. 
Identities = 16/83 (19%), Positives = 35/83 (42%), Gaps = 3/83 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C++ +++GK+ QRY C +C + + + + ++G+ Sbjct: 1 MHCPKCNSV-RFIKSGKAKE-KQRYKCLNCGCQFSRNEK-HGAPLRLKMHAVQLFLSGIS 57 Query: 66 CRATARIMGVGLNTIFRHLKNSG 88 + A+I V T+ R + Sbjct: 58 MNSIAKIFSVSPPTVMRWVNQFS 80 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 78.0 bits (191), Expect = 8e-14, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 5/80 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCR 67 CP+C + ++NG G + C C + + + T T Q I + + G+ R Sbjct: 37 CPNCG-SHHTIKNGSIHNGKPKRQCKECGRQFVINPTNKTVSDETKQLIDKLLLEGISLR 95 Query: 68 ATARIMGVGLNTIFRHLKNS 87 AR+ G + L+N Sbjct: 96 VIARVTGAS----WSWLQNY 111 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 77.7 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 27/85 (31%), Positives = 38/85 (44%), Gaps = 6/85 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMN 62 CP C D V +NGKS G QRY+C CR ++ F+ T K ++ + Sbjct: 52 ECPKCQCKD-VNKNGKS-NGRQRYICKRCRTSFDEFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRATARIMGVGLNTIFRHLKNS 87 G+ R A +GVG+ T F Sbjct: 110 GLSIRKCAEEVGVGVKTSFYMRHRI 134 >UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IZ3_CLOAB Length = 142 Score = 77.7 bits (190), Expect = 1e-13, Method: Composition-based stats. 
Identities = 19/84 (22%), Positives = 34/84 (40%), Gaps = 6/84 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMNG 63 CP C ++ + RN K G Q Y+C C+K++ + K +NG Sbjct: 54 CPICG-SETISRNSK-YNGKQGYICKSCKKSFTDFTNSATYKSKKTLDKWLKYAKCMVNG 111 Query: 64 VGCRATARIMGVGLNTIFRHLKNS 87 R +A+++ + + T F Sbjct: 112 YSIRKSAKVVEINIATSFFWRHKI 135 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 76.9 bits (188), Expect = 2e-13, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 37/88 (42%), Gaps = 3/88 (3%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDM 59 ++ I CP+C +D + +NG + G Q Y C C++ + TY KI + Sbjct: 3 ITLYIKCPAC-LSDNIKKNGFKSYGKQNYKCKDCKRQFIGDHALTYQGCHSQKDSKIRYL 61 Query: 60 AMNGVGCRATARIMGVGLNTIFRHLKNS 87 + G G + A + + + LK Sbjct: 62 MVRGSGIKDIACVERISKGKVLATLKKC 89 >UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMX8_ACAM1 Length = 134 Score = 76.5 bits (187), Expect = 2e-13, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 37/93 (39%), Gaps = 9/93 (9%) Query: 6 ISCPSCSATDGVVRNGKSTAGH----QRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + CP C ++ +++ G + QRY C C + + + ++ T ++ A+ Sbjct: 1 MECPYCQ-SEKILKRGFDSLQDGTLVQRYQCKDCNRRFNERTGTPMARLRTASSVVSYAI 59 Query: 62 ----NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 G+G R+ R G TI R K Sbjct: 60 KARTEGMGVRSAGRTFGKSHTTIMRWEKRLADQ 92 >UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L9I6_MAGSM Length = 89 Score = 75.3 bits (184), Expect = 5e-13, Method: Composition-based stats. 
Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 6/90 (6%) Query: 1 MAS--VSISCPSCSATDGVVRNGKSTAGHQRYLCSH--CRKT-WQLQFTYTASQPGTHQK 55 MA+ V + CP C + D V++ GK G QR+ C+ C +T + + ++ Sbjct: 1 MATMEVHVHCPDCGSLD-VIKFGKDRHGRQRFRCNDHFCDRTIFMMDDPDWWRFEEVKKQ 59 Query: 56 IIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 I ++G G TA +G+ + R K Sbjct: 60 IALHLLSGNGIHQTAHNLGLHPEFVNRMAK 89 >UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W2G4_DYAFD Length = 388 Score = 75.3 bits (184), Expect = 6e-13, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 6/80 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I C C+ DG+++ G G QRYLC C + A + + + ++ + Sbjct: 2 IECVKCAQVDGIMKAGYVR-GKQRYLCKWCNYYFT-----HAEKDDSIESLVKRKRHQTT 55 Query: 66 CRATARIMGVGLNTIFRHLK 85 A+ +GV +T+ R L Sbjct: 56 IIDIAKSLGVSNSTVSRALH 75 >UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoides sp. BAV1 RepID=A5FST1_DEHSB Length = 319 Score = 75.0 bits (183), Expect = 7e-13, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 34/94 (36%), Gaps = 9/94 (9%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA--SQPGTHQKIIDMAM 61 + I C C + R G S A QR+LC+ C T+ + P + M Sbjct: 6 LPIECKYCG-SRHTRRYGHSRAQKQRWLCNDCCHTFVETSAQPGMRTPPEQIGAAVSMFY 64 Query: 62 NGVGCRATAR----IMGVGLN--TIFRHLKNSGR 89 G+ A R I + + T++ + + Sbjct: 65 EGLSLSAICRQMKQIHNISPSDGTVYGWITKYSK 98 >UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V2B8_9AQUI Length = 125 Score = 75.0 bits (183), Expect = 8e-13, Method: Composition-based stats. 
Identities = 22/87 (25%), Positives = 36/87 (41%), Gaps = 2/87 (2%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 I CP C ++ + GK+T G QRY C+ C + + Y + M Sbjct: 11 QEHIKCPECG-SNWCKKFGKNT-GKQRYKCNECGRHFYEGAKYHKHPEKVKLLALKMYSK 68 Query: 63 GVGCRATARIMGVGLNTIFRHLKNSGR 89 G+ A AR++ + T+ R G+ Sbjct: 69 GMSKSAIARVLNLPYRTVARWTYEVGK 95 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 74.6 bits (182), Expect = 9e-13, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 27/79 (34%), Gaps = 1/79 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C +D V +NG + Q + C C K W T + + + V Sbjct: 1 MRCTHCG-SDLVKKNGYTRHEKQNFRCLECGKQWSENKEAKIINEQTKELVRKALLEKVS 59 Query: 66 CRATARIMGVGLNTIFRHL 84 RI V + + + Sbjct: 60 LNGICRIFDVSMPWLLDFI 78 >UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU Length = 507 Score = 74.6 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 33/91 (36%), Gaps = 8/91 (8%) Query: 8 CPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 C + + ++ G QRY C C+ T+ +++ + + ++ + G Sbjct: 104 CANFGLSVHTHKHLYHAFGYSGDRQRYRCKSCQSTFVDKWSGANKKLQFQENLMGLLFTG 163 Query: 64 VGCRATARIMGVGLNTIFRHLK----NSGRS 90 R R + + T + H++ R Sbjct: 164 YSVREICRKLAINPKTFYDHVEHIASRCRRK 194 >UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BSX6_9BIFI Length = 352 Score = 74.2 bits (181), Expect = 1e-12, Method: Composition-based stats. 
Identities = 18/83 (21%), Positives = 37/83 (44%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS----QPGTHQKIIDMAMNG 63 C C + ++R G+ G QR+ C +C +T+ ++ + G + ++ ++ Sbjct: 55 CVRCGSI-RIIRKGRGRDGSQRWKCMNCNRTFGVRTNRVMGMSKLKAGVWMRFLECFVDC 113 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 + R A+ GV L T F + Sbjct: 114 LSLRKCAQRCGVCLKTAFLMRQR 136 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 74.2 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C ++ + NGK G QRY C C + + I + + +G Sbjct: 1 MECKGC-KSNKTINNGKVR-GKQRYNCKSCGFNFVEVDERRGKNIDKQRMAIHLYLENMG 58 Query: 66 CRATARIMGVGLNTIFRHLKNSGR 89 RA R++GV + + ++ +G Sbjct: 59 FRAIGRVLGVSNLAVLKWIRAAGE 82 >UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZNT7_ACAM1 Length = 188 Score = 73.4 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 40/93 (43%), Gaps = 9/93 (9%) Query: 6 ISCPSCSATDGVVRNG----KSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + C C ++ VV+NG K+ Q +LC C + + + ++ T + I MA+ Sbjct: 1 MQCIHCQ-SENVVKNGTKTLKTAQVVQYFLCKDCGRRFNERSGTPMARLRTPVETISMAI 59 Query: 62 ----NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 G+G RA R++ N+I K Sbjct: 60 NARTEGLGIRAAGRVLRKSPNSIILWEKRLSAQ 92 >UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=environmental samples RepID=Q64EP4_9ARCH Length = 164 Score = 73.0 bits (178), Expect = 2e-12, Method: Composition-based stats. 
Identities = 21/78 (26%), Positives = 30/78 (38%), Gaps = 4/78 (5%) Query: 16 GVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNGVGCRATAR 71 +VR G G QR+ C C K + F I + + G RA R Sbjct: 37 NIVRYGHDKNGRQRFKCKTCGKVFVETKNTVFYNRKLSEDQIILICKLLVEKNGIRAIER 96 Query: 72 IMGVGLNTIFRHLKNSGR 89 IM + +TI +K+ R Sbjct: 97 IMEIHRDTISDVVKDLAR 114 >UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WF86_9ACTN Length = 243 Score = 73.0 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 26/85 (30%), Gaps = 5/85 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAM 61 CP C + +GK+ G +RY C C + F +I ++ Sbjct: 52 PVCPDCGSVRP-RLDGKAPNGARRYRCRECGCRFSALTGTIFADAKLPLHKIMRIAEVMC 110 Query: 62 NGVGCRATARIMGVGLNTIFRHLKN 86 + R + V T F Sbjct: 111 HSASLRLMELVAEVSHGTAFLWRHK 135 >UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B4C9_9RHOB Length = 321 Score = 71.9 bits (175), Expect = 6e-12, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 36/83 (43%), Gaps = 7/83 (8%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIID-MAMNG 63 CP C A D R G++ AG QRY C C KT+ + +++ M +G Sbjct: 50 CPHCGAVDR-QRWGRTRAGSQRYRCQGCLKTFNGRTGSSIAQLQKLDQFYQVLKDMFSDG 108 Query: 64 --VGCRATARIMGVGLNTIFRHL 84 R AR + V +TI+R Sbjct: 109 PPRSIRRLARQLDVNKDTIWRWR 131 >UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 238 RepID=B5K5I7_9RHOB Length = 319 Score = 71.5 bits (174), Expect = 7e-12, Method: Composition-based stats. 
Identities = 23/87 (26%), Positives = 35/87 (40%), Gaps = 5/87 (5%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNG 63 +CP C+A V+R G+S G +RY C C KT+ + +G Sbjct: 50 NCPHCAAGGAVIR-GRS-NGLKRYFCKICSKTFNALTGTPLARLRHKDCWTEFAGSLSDG 107 Query: 64 VGCRATARIMGVGLNTIFRHLKNSGRS 90 + +A GV +T FR R+ Sbjct: 108 DTVKTSAARCGVASSTAFRWRHRFLRA 134 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 71.5 bits (174), Expect = 7e-12, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 2/85 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++C +C + R GK G QRY C C ++Q + Y A +I + Sbjct: 1 MNCKNCDQAHCIKR-GK-RNGIQRYYCKICFTSFQENYHYKAYDSSIDTLLISLLRECCS 58 Query: 66 CRATARIMGVGLNTIFRHLKNSGRS 90 AR++ + NT+ + + Sbjct: 59 VLGIARVLKISKNTVLSRMLKISKQ 83 >UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FRR6_PSYA2 Length = 108 Score = 71.5 bits (174), Expect = 9e-12, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 34/85 (40%), Gaps = 3/85 (3%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDM 59 + ISCP C + + +NG + G Q Y C C++ + TY +I M Sbjct: 3 TQIDISCPDCHSI-SLKKNGIKSYGKQNYQCKDCQRQFIGDHALTYQGCHSRIEDRIRLM 61 Query: 60 AMNGVGCRATARIMGVGLNTIFRHL 84 G G R A I V + + L Sbjct: 62 TARGCGIRDIAVITSVSIGKVLSTL 86 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 71.1 bits (173), Expect = 1e-11, Method: Composition-based stats. 
Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 1/64 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MAS++I CP C+ +D V R+GK+ AG+ RY C C +QL +TY A P + ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 70.3 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 6/85 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 +SC C TD V R+GK + G+QR+ CS C++T+QL++ Y A + + G Sbjct: 1 MSCRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVADRHE------RYSPGNAG 54 Query: 66 CRATARIMGVGLNTIFRHLKNSGRS 90 R TAR++ VG + R K + R Sbjct: 55 IRDTARVLKVGCMGLTRFRKLNPRQ 79 >UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SD87_FERPL Length = 94 Score = 70.3 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 38/91 (41%), Gaps = 5/91 (5%) Query: 6 ISCPSCSATDGVVR---NGKSTAGHQRYLCSHCRKTWQLQF-TYTASQPGTHQKIIDMAM 61 + CP C + V + KS QRY C +C +T+ L + ++ + Sbjct: 1 MMCPHCKSIKTVKMGCYHTKSGERRQRYKCKNCGRTFVLNPIKPRNYPEEFKEMVVKAVV 60 Query: 62 -NGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 GVG R +RI + NT+ ++ + R Sbjct: 61 REGVGVRQASRIFKLSPNTVTAWVREFSKKR 91 >UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VPB4_9FLAO Length = 343 Score = 70.3 bits (171), Expect = 2e-11, Method: Composition-based stats. 
Identities = 15/83 (18%), Positives = 26/83 (31%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA---SQPGTHQKIIDMAMNGV 64 CP C + VR G G QRY C C +++ + + + + Sbjct: 51 CPHCLH-EKYVRFGVDK-GSQRYKCKSCNRSFTEYTGTWMAGLQRKDMISSYLSLMVQEK 108 Query: 65 GCRATARIMGVGLNTIFRHLKNS 87 + +G+ T F Sbjct: 109 SLDKISSELGINKKTAFDWRHKI 131 >UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium jeikeium K411 RepID=Q4JT92_CORJK Length = 165 Score = 70.3 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 30/86 (34%), Gaps = 5/86 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + SCP C + +NG ++ R+ C+HC ++ T I A Sbjct: 1 MTTNRPSCPLCG--NNTKKNGTTSKSTTRWRCTHCGHSFTRNTQTHNKNTATMALFIQWA 58 Query: 61 MNGVGCRATARIMGVGLNTI---FRH 83 A GV T+ FR Sbjct: 59 TGTQSLTTFAAHHGVTRQTMHHRFRW 84 >UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marina EX-H1 RepID=C0QU68_PERMH Length = 94 Score = 69.6 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 2/87 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M ISCP C ++ V+NGK G Q YLC C + + + ++ +++ Sbjct: 1 MGGKKISCPHCE-SERCVKNGK-ANGKQTYLCKECYYRFTINASKRKYPFKIRREAVNLY 58 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNS 87 G ++ + + + TI +K Sbjct: 59 KEGYTLTEISKKLNIKVQTIHHWVKKY 85 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 68.8 bits (167), Expect = 5e-11, Method: Composition-based stats. 
Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 4/86 (4%) Query: 1 MAS-VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM 59 MA+ SC C G+V+NGK+ AG QR+LC C + +T+ ID Sbjct: 1 MANRNRPSCDMCG--HGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDIRHFKI-FIDW 57 Query: 60 AMNGVGCRATARIMGVGLNTIFRHLK 85 ++G A+ +GV T+ R K Sbjct: 58 ILSGESADHLAKRLGVTRRTLTRWFK 83 >UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_LACF3 Length = 428 Score = 68.4 bits (166), Expect = 6e-11, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 35/99 (35%), Gaps = 21/99 (21%) Query: 7 SCPSCSATDGVVRNGKSTA-----------------GHQRYLCSHCRKTWQLQF----TY 45 CP C D ++NG S QR C +C+ ++ + Y Sbjct: 44 RCPHCGFADTFIKNGHSYQTIKYLSINESCPTMLRIDKQRLRCKNCQDSFMAKTNVVDKY 103 Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 + K + M + V + ++ GV +TI R L Sbjct: 104 CSIAKAVKHKALTMLESNVSQKDVSKFTGVSPSTIGRLL 142 >UniRef50_P04137 Uncharacterized protein in transposable element ISH50 n=11 Tax=Halobacteriaceae RepID=YIH50_HALSA Length = 294 Score = 68.4 bits (166), Expect = 6e-11, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 7/90 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAM 61 + CPSC + V+R G S QRYLC C +T+ Q F ++A + + Sbjct: 26 VYCPSC-RAESVIRYG-SYRVFQRYLCKDCDRTFNDQTGTVFEHSAVALRKWFLAVYTYI 83 Query: 62 N-GVGCRATARIMGVGLNTIFRHLKNSGRS 90 R + V T++R ++ R+ Sbjct: 84 RLNTSIRQLDAEIDVSYKTVYRRVQRFLRA 113 >UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5CBF Length = 184 Score = 68.0 bits (165), Expect = 1e-10, Method: Composition-based stats. 
Identities = 19/106 (17%), Positives = 38/106 (35%), Gaps = 21/106 (19%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQF- 43 + + +CP C + V+NG T+ QR+LC C ++ L+ Sbjct: 42 LTKDTCACPHCH-SQTTVKNGFKTSKVRYLPFQNYPIIIALKKQRFLCKECHHSFTLETP 100 Query: 44 ---TYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKN 86 Y + +++ + A+ + + T+ R LK Sbjct: 101 IVKKYASISQTLKLSVLNSLQENMSLSLIAKQHRISIPTVQRILKQ 146 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 67.3 bits (163), Expect = 1e-10, Method: Composition-based stats. Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA+V++ P C+ +D V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_VIBFM Length = 489 Score = 67.3 bits (163), Expect = 1e-10, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 32/88 (36%), Gaps = 12/88 (13%) Query: 9 PSCSATDG--------VVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGTHQKI 56 PSC+ ++ R G QRY C C T+ +++ + QK+ Sbjct: 79 PSCNNSECEHFGFDVLTHRELYHAFGYSGDRQRYRCKSCASTFVDKWSGENQKSLIQQKL 138 Query: 57 IDMAMNGVGCRATARIMGVGLNTIFRHL 84 + G R R + + T + H+ Sbjct: 139 LGFLFTGYSVREICRRLHINPKTFYDHI 166 >UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Methanocaldococcus infernus ME RepID=C5U8R8_9EURY Length = 100 Score = 66.9 bits (162), Expect = 2e-10, Method: Composition-based stats. 
Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 6/91 (6%) Query: 6 ISCPSCSATDGVVRNGKSTAGH----QRYLCSHCRKTWQLQFTYTASQPGTHQKIID-MA 60 I C C+ +D VV+ GK + Q YLC C++ + + +K++ + Sbjct: 5 IRCKYCN-SDKVVKAGKHKSEKYGVRQMYLCKKCKRRFVEESKAPRYSDSFKEKVVRSVV 63 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 G+G R R+ + TI R +K+ +++ Sbjct: 64 FEGLGIRQAGRVFKLSTTTILRWIKDFKKTK 94 >UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria RepID=B4WSN9_9SYNE Length = 83 Score = 66.5 bits (161), Expect = 3e-10, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 36/84 (42%), Gaps = 5/84 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT----HQKIIDMAM 61 + CP C ++GK++ G QRY C+ CR+T+ F + + I+ + Sbjct: 1 MDCPFCDHPTP-HKHGKTSKGSQRYRCTACRRTFTETFDTLYDRRQVTSEQVKLILQTYV 59 Query: 62 NGVGCRATARIMGVGLNTIFRHLK 85 G R +RI T+ ++ Sbjct: 60 EGSSLRGISRIGKRAYGTVVDIVR 83 >UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C348D8 Length = 467 Score = 64.9 bits (157), Expect = 8e-10, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 37/89 (41%), Gaps = 3/89 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M ++ +CPSC +T+ + + G + G RY C +C + L K+I+ Sbjct: 67 MKNIEKACPSCYSTENI-KYGTTAIGTVRYQCKNCNNVYSL--KNLNKFDDVDNKLIESL 123 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGR 89 + + + + + +R L+N Sbjct: 124 LKNTKVSTIFKELKITPASFYRRLENINE 152 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 64.6 bits (156), Expect = 1e-09, Method: Composition-based stats. 
Identities = 50/52 (96%), Positives = 50/52 (96%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGV LNTI RHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_ENTFA Length = 446 Score = 63.4 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 17/106 (16%), Positives = 35/106 (33%), Gaps = 22/106 (20%) Query: 7 SCPSC--SATDGVVRNGKST----------------AGHQRYLCSHCRKTWQLQFTYTAS 48 C C +++ G QR+ C HC KT+ + + + Sbjct: 48 ECFHCHYQNKQTIIKWGWKKVSILLNDVSNYKTILRINKQRFKCKHCGKTFLAEDSVSDR 107 Query: 49 Q----PGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + Q I+++ + AR+ + T+ R L++ Sbjct: 108 RCSIARRVKQAILELLSEPISMSLIARMKHISPTTVIRILRSLRPK 153 >UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C9BRL5_ENTFC Length = 433 Score = 63.0 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 30/101 (29%), Gaps = 23/101 (22%) Query: 7 SCPSCSATDG---VVRNGKSTA----------------GHQRYLCSHCRKTWQLQFT--- 44 CP C + +V+NGK + QRY C C + Sbjct: 46 RCPLCKQMNHEGMIVKNGKKKSLIQLNKCANQLTYLALAKQRYHCRGCHTYFTANTYIVD 105 Query: 45 -YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 KI++ A+ GV +T+ R L Sbjct: 106 RNCFIAKQVRYKILEELTEKQAMTTIAKHCGVSWSTVSRTL 146 >UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WNK0_RHOS5 Length = 481 Score = 62.6 bits (151), Expect = 4e-09, Method: Composition-based stats. 
Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 1/67 (1%) Query: 19 RNGKSTAGHQRYLCSHCRKTWQL-QFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGL 77 R GK+ G R+ C C KT+ + + + ++ ++DM N + +RI G+ Sbjct: 132 RFGKTKGGDARWRCKGCGKTFSVGKPARRHKRSDKNRLVLDMLCNDLSFAKMSRISGLAY 191 Query: 78 NTIFRHL 84 I+R + Sbjct: 192 RDIYRRV 198 >UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=Enterococcus faecium RepID=Q3Y3Y2_ENTFC Length = 401 Score = 62.3 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 24/99 (24%), Positives = 39/99 (39%), Gaps = 21/99 (21%) Query: 7 SCPSC-SATDGVVRNGK-------STAG---------HQRYLCSHCRKTWQLQF----TY 45 CP C +T +V+NGK + +G QRYLC C+K + + + Sbjct: 46 RCPCCKDSTKQIVKNGKKISMILLNRSGNKRTYLRLKKQRYLCRACKKYFTARTYLVTPF 105 Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 H KI++ +A + V + T+ R L Sbjct: 106 CFISKQIHYKILEELTERQSIKAIGKHCDVSVTTVQRTL 144 >UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MXF0_9DELT Length = 512 Score = 61.5 bits (148), Expect = 9e-09, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 2/69 (2%) Query: 18 VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH--QKIIDMAMNGVGCRATARIMGV 75 R G++ AG +RY C C +T+ + TA Q TH +KI +N + + Sbjct: 42 HRFGETAAGARRYRCKLCSRTFSINGKPTARQRDTHKNKKIYMHLVNKSPFKRICEQAEI 101 Query: 76 GLNTIFRHL 84 T++R + Sbjct: 102 SPATLYRKI 110 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 61.1 bits (147), Expect = 1e-08, Method: Composition-based stats. 
Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 1/90 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M C C+ + ++ G ++ QRY C C+K + +++Y A Q T+ I + Sbjct: 1 MNKRRNRCIHCNYS-YCIKAGITSQNKQRYQCKKCKKKFIGKYSYRAYQKSTNHNIQQLI 59 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 GVG R +R++ V T+ + + Sbjct: 60 KEGVGIRGISRLLNVSKTTVLKKILKIASK 89 >UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobiales RepID=Q07NT9_RHOP5 Length = 577 Score = 61.1 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 17/89 (19%), Positives = 36/89 (40%), Gaps = 11/89 (12%) Query: 7 SCP--SCSATDGVV--------RNGKSTAGHQRYLCSHCRKTWQLQFTY-TASQPGTHQK 55 CP SC + + R+G S G RY C CRKT+ ++ + + ++ Sbjct: 103 HCPDDSCENYNKLFDSHPKSYFRHGTSAIGAPRYRCKACRKTFSVRTGHSRHRKSHENKT 162 Query: 56 IIDMAMNGVGCRATARIMGVGLNTIFRHL 84 + + ++ V +I + ++ + Sbjct: 163 VFQLLVSKVPITKIGQITDLSPAAVYDKI 191 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 60.7 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 41/62 (66%), Positives = 46/62 (74%), Gaps = 1/62 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ-KIIDM 59 MAS+ + PSC+ T+GV RNGKSTAGHQ YLC CRK W L FTYT SQ THQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AM 61 + Sbjct: 67 TI 68 >UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF9_FERPL Length = 357 Score = 60.3 bits (145), Expect = 2e-08, Method: Composition-based stats. 
Identities = 18/96 (18%), Positives = 35/96 (36%), Gaps = 11/96 (11%) Query: 3 SVSISCPSCSATDGVVRNG--KSTAG-HQRYLCSHCRKTWQLQFTYTA--SQPGTHQKII 57 +C +C D V++ G + +G Q Y C C K + + + + + Sbjct: 81 KEERTCKNCGRDDEVIKKGIRYNKSGPVQMYYCKRCGKKFSARTGFGGMKKRAEAIVAAL 140 Query: 58 DMAMNGVGCRATARIMG------VGLNTIFRHLKNS 87 D+ G+ R A+ + V T+ +K Sbjct: 141 DLYFRGLSLRQVAQHLKASYNVEVCHKTVHNWIKRY 176 >UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultured archaeon RepID=D1JAI8_9ARCH Length = 192 Score = 59.2 bits (142), Expect = 4e-08, Method: Composition-based stats. Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 4/71 (5%) Query: 20 NGKSTAGHQRYLCSHCRKTWQLQFTYT----ASQPGTHQKIIDMAMNGVGCRATARIMGV 75 GK Q C C K + + + G I + G G RATARIMG+ Sbjct: 36 YGKGEKRTQMLKCKVCGKRFSIHKGTPLFNLKADEGAFYGTIAHLVEGNGIRATARIMGI 95 Query: 76 GLNTIFRHLKN 86 +T+ + LK Sbjct: 96 NKDTVSKWLKK 106 >UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CFC8 Length = 262 Score = 59.2 bits (142), Expect = 4e-08, Method: Composition-based stats. Identities = 15/84 (17%), Positives = 22/84 (26%), Gaps = 5/84 (5%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMN 62 C C + + G G QRY+C C K + F + + Sbjct: 97 QCLFCG-SHDFTKYGHKKDGTQRYICKGCGKRFTPLTNTIFDSKKIPISEWIEYLLHLFE 155 Query: 63 GVGCRATARIMGVGLNTIFRHLKN 86 +TA T L Sbjct: 156 FHSINSTAYDNRNSPTTGKYWLIK 179 >UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia RepID=B0K4X0_THEPX Length = 343 Score = 58.8 bits (141), Expect = 5e-08, Method: Composition-based stats. Identities = 13/45 (28%), Positives = 19/45 (42%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS 48 V + CP C+ T + GK G+Q+YLC C + Sbjct: 5 VPLKCPKCNNTHLFYKYGKDKDGYQKYLCRKCYHQFAPDKPSPKK 49 >UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MKY8_9DELT Length = 632 Score = 58.0 bits (139), Expect = 1e-07, Method: Composition-based stats. 
Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 21 GKSTAGHQRYLCSHCRKTWQLQFTY--TASQPGTHQKIIDMAMNGVGCRATARIMGVGLN 78 G + AG QR+ C C KT+ + + G ++ + V R AR VG Sbjct: 123 GHTKAGSQRFRCKICHKTFSIPLAANLRQRKKGKSTEVFRLLTCQVAIRKMARNARVGKE 182 Query: 79 TIFRHLKNSGRS 90 T+ R++ R Sbjct: 183 TVHRYIHLIHRQ 194 >UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax=Enterococcus RepID=Q3Y1C3_ENTFC Length = 431 Score = 57.6 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 26/107 (24%), Positives = 34/107 (31%), Gaps = 27/107 (25%) Query: 8 CPSCSATDG-------VVRNGKSTA----------------GHQRYLCSHCRKTWQLQF- 43 C +C +T VV+NGK QRY C +CR W Q Sbjct: 45 CRNCGSTVVDGNGKVIVVKNGKKETIVRFEQYNHMPLVMRLKKQRYTCKNCRTHWTTQSY 104 Query: 44 ---TYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNS 87 + KI + V A+ V L T+ R LK Sbjct: 105 FVQPRHSIANHVRYKIASLLTEKVSLSFIAKNCQVSLTTVIRTLKEF 151 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 10/43 (23%), Positives = 19/43 (44%), Gaps = 1/43 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF 43 M +I CP C ++ + + G +Q+Y C C + + Sbjct: 1 MNKTNIKCPRCH-SEKLYKFGFDKQANQKYQCKECGRQFAPDS 42 >UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_FUSNN Length = 428 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. 
Identities = 18/107 (16%), Positives = 37/107 (34%), Gaps = 21/107 (19%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGH----------------QRYLCSHCRKTWQLQFT 44 + S +CP C ++ +V+NG QRY+C C+KT+ Sbjct: 46 LKSDYCTCPHC-SSKNIVKNGSRHRKIKYIPIQNHNIELELTVQRYICKDCKKTFSPSTN 104 Query: 45 ----YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNS 87 ++ I + + A+ + + ++ R + N Sbjct: 105 IVSDNSSISNNLKYAIALELQKNISLTSIAKRYNISIPSVQRIMDNC 151 >UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZA4_TRIEI Length = 469 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 12/41 (29%), Positives = 22/41 (53%), Gaps = 2/41 (4%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYT 46 + CP+C +T + +NG+ QRY C C + + +Q + Sbjct: 1 MKCPTCGST-SLRKNGR-PNNRQRYRCKDCGRQFMVQSPTS 39 >UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4F0_UNCMA Length = 141 Score = 56.9 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 4/78 (5%) Query: 16 GVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK----IIDMAMNGVGCRATAR 71 VV+ G S AGHQ + C HC + + ++ I + G RA R Sbjct: 28 RVVKKGFSRAGHQVFQCRHCGRHFCETINTPMYGRRITREDVILIGKLLNERNGIRAIER 87 Query: 72 IMGVGLNTIFRHLKNSGR 89 I G +T+ R K+ R Sbjct: 88 ITGHHRDTVMRVAKDLAR 105 >UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ99_FERNB Length = 261 Score = 56.9 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 1/48 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS 48 M ++ + CP C +++ ++NG +Q + C C++ ++L FT Sbjct: 1 MTNIQLKCPHCGSSN-FIKNGHDKFKNQIFFCKDCKRYFKLSFTKKHK 47 >UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WST6_9SYNE Length = 81 Score = 56.9 bits (136), Expect = 2e-07, Method: Composition-based stats. 
Identities = 15/71 (21%), Positives = 28/71 (39%), Gaps = 1/71 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + P+C + VV+NGK G Q + C +C + + T I D+ Sbjct: 1 MLDHQPTRPACHSKQ-VVKNGKIHNGKQNHRCKNCGRQFVKDPQQKRISDATKALIDDLL 59 Query: 61 MNGVGCRATAR 71 + + ++ Sbjct: 60 LERLSMNNPSK 70 >UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacteria RepID=Q5LYW0_STRT1 Length = 448 Score = 56.5 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 36/110 (32%), Gaps = 19/110 (17%) Query: 1 MASVSISCPSCSAT---DGVVRNGKSTAGHQ------------RYLCSHCRKTWQLQFTY 45 + +++ SCP C +N K + Q R+ C CR+ + + Sbjct: 15 LITLAPSCPHCQGKMIKYDFQKNSKISLLEQAGTPTLLRLKKRRFQCKSCRRVTVAETSI 74 Query: 46 TASQPG----THQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 QK+ + V AR + V +T++R L Sbjct: 75 VEKNCQISNLVRQKVTQLLTEKVSLTDIARRLRVSTSTVYRKLYQFTFKE 124 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 56.1 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 23/97 (23%), Positives = 36/97 (37%), Gaps = 15/97 (15%) Query: 4 VSISCPSCSATDGVV--RNGKSTAGHQRYLCSHCRKTWQLQFTY---TASQPGTHQKIID 58 + CP CS+T V RNG G Q + C CR + + T ++ Sbjct: 71 IVPECPKCSSTVRVKAGRNG----GRQMFQCKQCRTRYVSRGPGARKTRYSQDIISAALN 126 Query: 59 MAMNGVGCRATARIMG------VGLNTIFRHLKNSGR 89 M+G+ R TA + + NTI + + Sbjct: 127 KVMSGMSYRKTAEEVNTAHGRDLSPNTIMFWTRKYTQ 163 >UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q8PSY9_METMA Length = 146 Score = 56.1 bits (134), Expect = 3e-07, Method: Composition-based stats. 
Identities = 23/101 (22%), Positives = 40/101 (39%), Gaps = 11/101 (10%) Query: 2 ASVSISCPSCSAT-------DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTY----TASQP 50 + + CP+ + +++ GK GHQRY C HC K + Sbjct: 4 KTDEVVCPNPKCSYYLKAEGRAIIKRGKYKTGHQRYYCKHCEKFFMDTIGTAIYRKHLSK 63 Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 + I + + G R+ RI G +TI LK++ ++ Sbjct: 64 EEIRMIYRLFLEKNGIRSIERITGHHRDTISNLLKDTVKNE 104 >UniRef50_Q4JSN3 Transposase for IS3507b n=53 Tax=Actinobacteridae RepID=Q4JSN3_CORJK Length = 422 Score = 55.7 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 29/83 (34%), Gaps = 4/83 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+ P C + RNG ++ G R+ C HC + + + G ID Sbjct: 31 MSKNQ---PRCHCGGEMKRNGTTSKGTTRWRCKHCGASSVKRRIDITNSTGF-TAFIDHL 86 Query: 61 MNGVGCRATARIMGVGLNTIFRH 83 G A +G T+ R Sbjct: 87 TTGASLDTIASRVGCSPRTLQRR 109 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 55.3 bits (132), Expect = 5e-07, Method: Composition-based stats. Identities = 21/51 (41%), Positives = 30/51 (58%) Query: 41 LQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 L Y A + ++II+MA G G R TA + +G+NT+ R LKNS +S Sbjct: 23 LTLAYEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacteriaceae RepID=A4W908_ENT38 Length = 414 Score = 54.9 bits (131), Expect = 7e-07, Method: Composition-based stats. Identities = 14/42 (33%), Positives = 22/42 (52%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT 44 V + CP+C D ++RNG G QR+ C C ++ + T Sbjct: 63 QVLLYCPTCGQGDALIRNGCGLRGAQRWRCRTCNSSFTDKST 104 >UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FRB5_METHJ Length = 138 Score = 54.5 bits (130), Expect = 1e-06, Method: Composition-based stats. 
Identities = 22/97 (22%), Positives = 38/97 (39%), Gaps = 11/97 (11%) Query: 4 VSISC--PSC-----SATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK- 55 + I+C P C + +NG ++AG+Q+Y C HCR+ + + Sbjct: 8 ILITCQNPDCTYFQIEDGKNITKNGHNSAGNQQYYCHHCRRFFIETKNTPLYDSRLPRTA 67 Query: 56 ---IIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGR 89 I + R +R+ G +TI R+ G Sbjct: 68 VLIIAKHSTEKTSIRGVSRVTGHHRDTISRYYHLIGE 104 >UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ24_FERNB Length = 316 Score = 54.5 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 14/49 (28%), Positives = 27/49 (55%), Gaps = 1/49 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ 49 M + ++SCP C +T + +NG G+Q++LC C +++L + Sbjct: 1 MNNSTLSCPKCGST-SLYKNGHDKYGNQQFLCKLCHHSFKLSHSQKRKN 48 Score = 40.7 bits (94), Expect = 0.013, Method: Composition-based stats. Identities = 13/99 (13%), Positives = 26/99 (26%), Gaps = 19/99 (19%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT----WQLQFTYTASQPGTHQ------ 54 C SC + + + + C CR + L T Sbjct: 53 YPKCTSCGKSMQIYK---VRRSFVVFRCRACRTKDRVPFNLPEPVTLIPEKFKYFRFPIF 109 Query: 55 ----KIIDMAMNGVGCRATARIM--GVGLNTIFRHLKNS 87 + + + R+ A + V TI++ + Sbjct: 110 FVLKAFVLYMKHNMSYRSLAHSLNIKVSHVTIYKWVIKL 148 >UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae BGR1 RepID=C5A9A4_BURGB Length = 284 Score = 53.8 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 23/99 (23%), Positives = 39/99 (39%), Gaps = 15/99 (15%) Query: 1 MASVSISCPSCSATDGV-------VRNGKSTAGH-----QRYLCSHCRKTW---QLQFTY 45 M + CP+ +NG H RY C C K + Q++ + Sbjct: 1 MRNPRPVCPNPDCVHHTNPPADFYRKNGYRRTKHNGQPVPRYQCKACGKNFCATQVKPIH 60 Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 +P + ++ MA++ VG R A ++ G TI R + Sbjct: 61 GQHRPDLNTQVFKMAVSRVGIRRMATVLDCGRETIQRKI 99 >UniRef50_C7N1Y2 Putative uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1Y2_SLAHD Length = 332 Score = 53.8 bits (128), Expect = 2e-06, Method: Composition-based stats. 
Identities = 16/83 (19%), Positives = 29/83 (34%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAMNG 63 CP C + + V R ++ AG + + C C + + F + I + Sbjct: 54 CPRCGSGETVGRG-RTGAGRRFWECRDCGRKYTSLAGTIFESSKKPLSAWVLFIRLMCYN 112 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 V A A + G+ T + Sbjct: 113 VQLDAAAELCGMSHQTAWEWRHR 135 >UniRef50_A1VN28 Insertion element protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VN28_POLNA Length = 324 Score = 53.4 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 34/91 (37%), Gaps = 5/91 (5%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIID 58 A+ CP C T+ + R+G +G QRY C CR+T+ + G + Sbjct: 47 ATEPRCCPHCQGTE-LYRHGHV-SGLQRYRCRTCRRTFNALTGTALARLRKKGKWFGFSE 104 Query: 59 MAMNGVGCRATARIMGVGLNTIFRHLKNSGR 89 + R A + V NT R R Sbjct: 105 ALAASLTLRRAATALQVHRNTALRWRHRFLR 135 >UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracasei subsp. paracasei ATCC 25302 RepID=C2FEQ0_LACPA Length = 425 Score = 53.4 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 30/98 (30%), Gaps = 20/98 (20%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 CP+C +VR G QR+ C CR +Q + Y + Sbjct: 48 HCPACGFASKLVRYGFERTCVLMPSYSYRPTYMKLSRQRFRCELCRSVFQSETDYVRPRS 107 Query: 51 GTHQKIIDMAM----NGVGCRATARIMGVGLNTIFRHL 84 + M + + AR V T+ R + Sbjct: 108 TISTPVRQMVLFEAFSNCSLTDIARRFHVADKTVQRII 145 >UniRef50_C9BRL4 Transposase n=30 Tax=Enterococcus RepID=C9BRL4_ENTFC Length = 431 Score = 53.4 bits (127), Expect = 2e-06, Method: Composition-based stats. 
Identities = 16/105 (15%), Positives = 32/105 (30%), Gaps = 22/105 (20%) Query: 7 SCPSCSAT--DGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFT---- 44 C C ++R G +T QR+ C C++T+ + Sbjct: 48 ECSHCLCVVPSRIIRWGTTTVRLLLNDVSEYRTYLELKKQRFKCKSCQRTFVADTSVAEK 107 Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGR 89 + +I AR + ++++R +K R Sbjct: 108 HCFISQKVRWSVIARLKENTSMTEIARQKNISTSSVYRVMKRFYR 152 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 53.0 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 9/66 (13%), Positives = 20/66 (30%) Query: 19 RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLN 78 + G G Q+Y C C + + L + T + + + V + Sbjct: 11 KKGHIHNGKQKYQCLACGRQFVLNPSQKIIDERTRLLTKKTLLECIALEGVCWVFDVSMP 70 Query: 79 TIFRHL 84 + + Sbjct: 71 WLLEFI 76 >UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellular organisms RepID=Q64DF0_9ARCH Length = 337 Score = 53.0 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 31/92 (33%), Gaps = 9/92 (9%) Query: 5 SISCPSCSATDGV---VRNGKSTAG-HQRYLCSHCRKTWQLQFT----YTASQPGTHQKI 56 CP C +++ V + + G Q LC C ++ + T K+ Sbjct: 7 PCKCPKC-SSENVRFDYKYDTISNGSRQMLLCRGCGASFSETKNTFLQNIRTPVSTIWKV 65 Query: 57 IDMAMNGVGCRATARIMGVGLNTIFRHLKNSG 88 + G AT R+ + NT+ + Sbjct: 66 LKSRTEGTSLNATCRVFDIAKNTLLAWERKFS 97 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 52.2 bits (124), Expect = 4e-06, Method: Composition-based stats. 
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 3/87 (3%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDMAM 61 ISCP CS+ + +NGK Q YLC C + + TY Q+I+ M + Sbjct: 5 TPISCPKCSSCQ-IKKNGKKPNNKQNYLCKCCGRQFIGDHALTYRGCHSKISQRILIMLV 63 Query: 62 NGVGCRATARIMGVGLNTIFRHLKNSG 88 G G R A I V + L N Sbjct: 64 RGCGIRDVAAIEKVSCTKVLSVLLNVR 90 >UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2CJK1_9FIRM Length = 422 Score = 52.2 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 13/100 (13%), Positives = 29/100 (29%), Gaps = 20/100 (20%) Query: 8 CPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQF----TYTA 47 CP C + +++ G ++ Q+ C C K + L+ + Sbjct: 48 CPHCGSNHNLIKYGFKSSNVRCSRAGDYPVIIDLKKQKMFCKSCNKYFLLETKIVDKHCN 107 Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNS 87 + I+ + + V T+ R + Sbjct: 108 ISNQIKRHILASLTKKLSMKDIGSNNYVSTTTVARFMAKL 147 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 52.2 bits (124), Expect = 6e-06, Method: Composition-based stats. Identities = 19/42 (45%), Positives = 26/42 (61%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 MA + + CP + T V+RNG +T+G Q Y C C KT+QL Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQLS 42 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 51.5 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 17/49 (34%), Positives = 27/49 (55%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ 49 M ++ + C C T+ V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRACH 49 >UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVZ4_9ACTO Length = 225 Score = 51.1 bits (121), Expect = 1e-05, Method: Composition-based stats. 
Identities = 13/57 (22%), Positives = 30/57 (52%), Gaps = 5/57 (8%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 + CP+C+ + RNGK+++G QR+ C C ++ + +A + + + ++ Sbjct: 41 MKCPACN--TPLKRNGKTSSGSQRWRCKECGRSKVGKIDNSAKELN---RFLSWLLS 92 >UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PWW0_METMA Length = 155 Score = 51.1 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 18/101 (17%), Positives = 39/101 (38%), Gaps = 14/101 (13%) Query: 4 VSISCPS--CSA-----TDGVVRNGKSTAGHQR---YLCSHCRKTWQLQ----FTYTASQ 49 + CP+ C + ++ NG ++R Y+C C + + + F Sbjct: 11 TDVFCPNKDCKLYGITGKENIIGNGTYEIKNKRVRKYICRECGRVFNDRTGTFFDNVRKD 70 Query: 50 PGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + I MA+ G+ +A + ++ V T+ L + + Sbjct: 71 ESDIKLAIKMAIKGMSIQAISDVLEVQPATVSNWLFRAAKQ 111 >UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia RepID=B2A0V7_NATTJ Length = 353 Score = 50.7 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 19/53 (35%), Gaps = 4/53 (7%) Query: 1 MASVSISCPSCS--ATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPG 51 M + CP C+ +D + G GHQ+Y C C + + Sbjct: 1 MTK--VVCPRCNNNCSDKFYKFGFDNHGHQKYQCQECFSQFAPKTLSKGGDKR 51 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 16/116 (13%), Positives = 29/116 (25%), Gaps = 30/116 (25%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLC--SHCRKTWQ------------------ 40 M SCP C + + C C ++ Sbjct: 55 MPRKYPSCPKCGKATFLH---HDYEFYSNLRCCDKSCNHSFYVPKPQSIPEPSQLDINGK 111 Query: 41 LQFTYTASQPGTHQKIIDMA-MNGVGCRATARIM------GVGLNTIFRHLKNSGR 89 + F+ T + + + +NG R ++ + V TI K Sbjct: 112 VDFSNMRHPLHTIIRALYLYFINGSSTRGVSQFLLDCEGIKVSHVTIADWTKKFAP 167 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 50.7 bits (120), Expect = 1e-05, Method: Composition-based stats. 
Identities = 11/36 (30%), Positives = 20/36 (55%) Query: 50 PGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 + + M++NG+G RA R+ G+ NTI ++ Sbjct: 19 SDVKELCVKMSLNGMGFRAIERVTGISHNTILNWVR 54 >UniRef50_Q2RQJ8 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RQJ8_RHORT Length = 150 Score = 50.3 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 18/99 (18%), Positives = 33/99 (33%), Gaps = 16/99 (16%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQR--YLCSHCRKTWQLQFTYTASQPGT----HQK 55 A CP C R+ G QR + C+ CRK + + + Sbjct: 41 ARSRPVCPHCG-----FRHAYRLEGSQRVRFKCARCRKQYSARRGTVMERSNVPTAGWLT 95 Query: 56 IIDMAMN----GVGCRATARIMGVGLNTIFRHLKNSGRS 90 + + ++ G+ R R GV T + ++ + Sbjct: 96 ALRLFISAPGAGLPAR-IERATGVSYKTAWSMVQRMRAA 133 >UniRef50_C5S2C5 Putative transposase n=1 Tax=Actinobacillus minor NM305 RepID=C5S2C5_9PAST Length = 394 Score = 50.3 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 30/94 (31%), Gaps = 12/94 (12%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDM 59 I CP C T Q++ C C++ + L F T+ + Sbjct: 52 EKICCPHCQRTQP-----YFIKSRQKWRCRGCKREFSLTSGTLFASHKLPLRTYLLALVF 106 Query: 60 AMN---GVGCRATARIMGVGLNTIFRHLKNSGRS 90 +N G+ + AR + V T F S Sbjct: 107 YINAKQGITSKRLARELAVNYRTAFMLSHKIRES 140 >UniRef50_B4VTL4 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VTL4_9CYAN Length = 124 Score = 50.3 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 33/93 (35%), Gaps = 12/93 (12%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMA 60 CP C +T + + +RY C+ C ++ + F T I + Sbjct: 20 YPQCPYCQST-----HSRRLKKERRYQCNECFTSYSVTVGTLFHKTHVDLEKWVLAIYLV 74 Query: 61 M---NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + + R A+ +GV NT + ++ Sbjct: 75 LNPPERISVRQLAKKIGVNKNTASYMIARIRQA 107 >UniRef50_B4WUH8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. 
PCC 7335 RepID=B4WUH8_9SYNE Length = 76 Score = 50.3 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 27/56 (48%), Gaps = 3/56 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT-YTASQPGTHQKIIDMA 60 ++CP C ++ + +NG G Q Y+C+ CR+ + ++ ++ + M Sbjct: 1 MACPECQ-SEHIRKNGH-KRGKQNYICADCRRQFVENPKEHSGYSDEERKQCLSMY 54 >UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT4_9LACT Length = 426 Score = 49.9 bits (118), Expect = 3e-05, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 35/104 (33%), Gaps = 21/104 (20%) Query: 7 SCPSCSATDGVVRNGKSTAGH----------------QRYLCSHCRKTWQLQ----FTYT 46 SCP C ++ V+++ QR++C CRKTW + Sbjct: 46 SCPYC-SSKNVIKHSPMEHKIRIPHLYGNKTLLELKVQRFICKDCRKTWVTDCPLVPKNS 104 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 +I+ + A+++ + T+ R +K Sbjct: 105 NISYDLACQIMLYLKENFSRKTIAKLLSISDKTVERVMKKFKIK 148 >UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JSC1_YERPY Length = 53 Score = 49.9 bits (118), Expect = 3e-05, Method: Composition-based stats. Identities = 16/37 (43%), Positives = 21/37 (56%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRK 37 MA + CP C D V ++G +GHQRY C H +K Sbjct: 1 MAKIDEKCPFCERKDLVKKHGYGKSGHQRYRCPHAKK 37 >UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5E64 Length = 173 Score = 49.5 bits (117), Expect = 3e-05, Method: Composition-based stats. 
Identities = 21/101 (20%), Positives = 32/101 (31%), Gaps = 21/101 (20%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQF----TYT 46 CP C ++RNG + QR+LC C KT+ Y Sbjct: 48 KCPFCGE-KHIIRNGTKLSKIKILDVSNTPSYLYLRKQRFLCKSCSKTFSASTNFVRKYC 106 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNS 87 I + N + + A+ V +T+ R L Sbjct: 107 NIADSIKLSIALESKNIISEKDIAKRFRVSSSTVKRSLLQY 147 >UniRef50_Q7NH53 TetR family transcriptional regulatory protein n=1 Tax=Gloeobacter violaceus RepID=Q7NH53_GLOVI Length = 227 Score = 49.5 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 2/37 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 + CP C ++ + RNG QR LC C + + L Sbjct: 188 MKCPRCG-SERLSRNGH-RHDRQRLLCKDCSRQFLLP 222 >UniRef50_D1PSS1 Insertion element protein (Fragment) n=14 Tax=Prevotella RepID=D1PSS1_9BACT Length = 113 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 27/78 (34%), Gaps = 7/78 (8%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCR 67 C C + VRNG G Q Y+C C + + T S+ + Sbjct: 1 CSVC-KSKHTVRNG-VRQGKQLYMCKECHSQF--RAGNTVSEDELWRSYQQ---EKQTIA 53 Query: 68 ATARIMGVGLNTIFRHLK 85 + G+ L T+ R L Sbjct: 54 ELSSRFGISLATVKRRLH 71 >UniRef50_Q03NU3 Transposase n=12 Tax=Lactobacillus RepID=Q03NU3_LACBA Length = 423 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 31/103 (30%), Gaps = 21/103 (20%) Query: 8 CPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFT----YTA 47 CP C+ VV NG T QR+ C C KT Q Sbjct: 47 CPYCAQRQ-VVCNGHKTVYVRLPNVSERTVILILRKQRFRCKACGKTSIAQTPVVRRQHQ 105 Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 T I + R+ A V N++ R + G+ Sbjct: 106 ISENTRHAIDKTLIEDRTMRSIADQYNVSTNSVSRRILALGKQ 148 >UniRef50_C1DPZ8 Transposase n=4 Tax=Bacteria RepID=C1DPZ8_AZOVD Length = 236 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. 
Identities = 8/46 (17%), Positives = 12/46 (26%) Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + Q + + R A+ GV NT F Sbjct: 2 ARLRKRHLWQGYAEALTQSLTVRRAAKHCGVSKNTAFLWRHRFLTQ 47 >UniRef50_D1QQX4 Putative uncharacterized protein n=15 Tax=Prevotella RepID=D1QQX4_9BACT Length = 318 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 26/82 (31%), Gaps = 7/82 (8%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C +T +NG G Q Y C C + + S Sbjct: 4 MRCCVCGST-HTKKNG-VRKGLQLYKCQDCGYQF--RSGSQVSNDELWTAYQQ---QKQT 56 Query: 66 CRATARIMGVGLNTIFRHLKNS 87 + + + ++T+ R L + Sbjct: 57 IKELSVRFKISVSTVKRRLHDI 78 >UniRef50_Q9V1K2 Putative uncharacterized protein n=2 Tax=Pyrococcus RepID=Q9V1K2_PYRAB Length = 141 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 33/92 (35%), Gaps = 9/92 (9%) Query: 3 SVSISCPSCSATDGVVRNGK----STAGHQRYLCSHCRKTWQLQFTYTASQPG-THQKII 57 I+CP C + +V+ G QRY C +C +T+ +I Sbjct: 31 KGRITCPYC-KSPNIVKIGYIMRSGNFKIQRYKCKNCNRTFTELDGTPLKGAHSLKDIVI 89 Query: 58 DMAMN---GVGCRATARIMGVGLNTIFRHLKN 86 + + + A+I+ + ++R K Sbjct: 90 VAYLTLDLKLPPSSIAKILPINRPKLYRAYKR 121 >UniRef50_Q11ZU0 Putative uncharacterized protein n=1 Tax=Polaromonas sp. JS666 RepID=Q11ZU0_POLSJ Length = 590 Score = 49.2 bits (116), Expect = 5e-05, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 31/91 (34%), Gaps = 14/91 (15%) Query: 8 CPSCSATDGVV---------RNGKSTAGHQRYLCSHCRKTWQLQFT-----YTASQPGTH 53 CP C ++ +V G +TAG Y C C KT+ ++ + + Sbjct: 104 CPDCMCSNHLVPITQPKAYHSFGLTTAGSHCYRCKVCSKTFSVKPKGINPIARQLRSDKN 163 Query: 54 QKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 ++ M + R V ++ + Sbjct: 164 PPVLRMLTGKMPLRRICEAADVAPKVLYERI 194 >UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMW7_ACAM1 Length = 75 Score = 48.8 bits (115), Expect = 5e-05, Method: Composition-based stats. 
Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 1/61 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+ + + P C + + GK++ G QRY C C++T+ F + ++I Sbjct: 1 MSYLLMQSPLCDHP-KIHKPGKTSKGSQRYRCLDCQQTFSETFDTLYYRLQISSEMIQAI 59 Query: 61 M 61 + Sbjct: 60 L 60 >UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VF2_TRIEI Length = 59 Score = 48.8 bits (115), Expect = 6e-05, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 1/59 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM 59 M+ + CPSC ++ +V+NG Q+Y C +C++ + T + I + Sbjct: 1 MSIHKLICPSCG-SNHIVKNGTIHNKKQKYQCQNCQRQFVENSQRDYISNETKELIDKL 58 >UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepID=Q70JT0_MICAE Length = 112 Score = 48.8 bits (115), Expect = 6e-05, Method: Composition-based stats. Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 1/46 (2%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH 53 CPSC + ++NG G + C C + + + T P T Sbjct: 35 CPSCG-SHHTIKNGYLPKGKPKRHCQECGQPFVINPTNKTISPDTK 79 >UniRef50_Q7VL05 Possible transposase n=4 Tax=Pasteurellaceae RepID=Q7VL05_HAEDU Length = 363 Score = 48.8 bits (115), Expect = 6e-05, Method: Composition-based stats. Identities = 12/94 (12%), Positives = 25/94 (26%), Gaps = 11/94 (11%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTY-------TASQPGTHQKI 56 I CP C V Q++ C HC + + + + + Sbjct: 49 NDIECPHC----HVRHEAYFIKTRQQWQCKHCCYRFSITAGTIFHLAKLSLRKILKALRY 104 Query: 57 IDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + G+ + + V T + + Sbjct: 105 FALKSKGLSAIELSHEINVQYKTAWGLRHKFREA 138 >UniRef50_C0WLQ9 Transposase n=3 Tax=Lactobacillus RepID=C0WLQ9_LACBU Length = 418 Score = 48.8 bits (115), Expect = 6e-05, Method: Composition-based stats. 
Identities = 20/96 (20%), Positives = 31/96 (32%), Gaps = 20/96 (20%) Query: 7 SCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQF----TYT 46 CP+C D +V++G T QR+LC HC Q Sbjct: 49 RCPNCGFADCLVKDGHKTVNLKLSPQRFHLLILRLAKQRFLCKHCGSIITSQTDAVKPNC 108 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFR 82 Q ++ + + A+ GV T+ R Sbjct: 109 QISKNVWQSVVMDFHDNMAATLIAKQNGVSAGTVNR 144 >UniRef50_B8F7V2 ISRssp2, family IS1595 n=4 Tax=Pasteurellaceae RepID=B8F7V2_HAEPS Length = 378 Score = 48.4 bits (114), Expect = 6e-05, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 30/93 (32%), Gaps = 11/93 (11%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDM 59 + CP C + + +R+ C HC++ + + F + I + Sbjct: 43 NDVCCPFCG----IRHHAYFLQSRKRWTCKHCKRNFYITTNTAFAFHKLPLVDILLAISL 98 Query: 60 AMN---GVGCRATARIMGVGLNTIFRHLKNSGR 89 +N G+ +R + V T F Sbjct: 99 FVNEVKGISAITMSRHLNVNYKTAFVLCHKLRE 131 >UniRef50_C9CRL2 Transposase n=3 Tax=Alphaproteobacteria RepID=C9CRL2_9RHOB Length = 432 Score = 48.4 bits (114), Expect = 6e-05, Method: Composition-based stats. Identities = 13/95 (13%), Positives = 28/95 (29%), Gaps = 16/95 (16%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA---- 60 CP+C D +R+ C+ C + + + + +D+ Sbjct: 42 EPICPACGCVDV-----YDLTTRRRFKCAACHRQFSVTSGTIFASR--KLAFVDLLGAIC 94 Query: 61 -----MNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 G+ +R + V T F + + Sbjct: 95 LFVNAAKGLSAVQMSRDLDVQHKTAFVLMHKLREA 129 >UniRef50_B3GXU2 Transposase n=15 Tax=Pasteurellaceae RepID=B3GXU2_ACTP7 Length = 373 Score = 48.4 bits (114), Expect = 7e-05, Method: Composition-based stats. 
Identities = 12/93 (12%), Positives = 28/93 (30%), Gaps = 11/93 (11%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYT---ASQPGTHQKIIDMA- 60 + CP C + + +R+ C HC++ + + P + Sbjct: 43 DVCCPHCG----IRHHAYFLQSRKRWCCKHCQRHFYITTNTAFAFHKLPFVDILAATLLF 98 Query: 61 ---MNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + G+ +R + + T F + Sbjct: 99 ANEVKGISAITMSRHLNISYKTAFVLCHKLREA 131 >UniRef50_D0U1S9 Transposase n=1 Tax=Enterococcus faecium RepID=D0U1S9_ENTFC Length = 427 Score = 48.4 bits (114), Expect = 8e-05, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 33/104 (31%), Gaps = 23/104 (22%) Query: 4 VSISCPSCSATD---GVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFT 44 V C C A + + +NG T+ QR++C C K++ + Sbjct: 39 VPKECAHCEAPNVGFSIYKNGTQTSRVTFPMAGILPTYLRIRKQRFMCKCCGKSFTARTP 98 Query: 45 ----YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 +I+ + + A+ V ++ R L Sbjct: 99 VVERNCFISNYIKAQILTQSGETRSVKDIAKHTNVSEASVQRVL 142 >UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7J3_CYAP7 Length = 354 Score = 48.0 bits (113), Expect = 9e-05, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 24/63 (38%), Gaps = 3/63 (4%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP-GTHQKIIDMAM 61 + I CP C + +NG AG QRY C C + + + I+ + + Sbjct: 1 MILIQCPKC-KSKNYRKNGTI-AGKQRYQCKSCGRNFLAVSLSQSLPCFEQGMSILLLDV 58 Query: 62 NGV 64 + Sbjct: 59 ENM 61 >UniRef50_A7JMB8 Predicted protein n=8 Tax=Francisella RepID=A7JMB8_FRANO Length = 82 Score = 48.0 bits (113), Expect = 9e-05, Method: Composition-based stats. 
Identities = 18/83 (21%), Positives = 30/83 (36%), Gaps = 4/83 (4%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK--IIDMAMNG 63 I C C ++G+ + G QRY C C + L +I ++ Sbjct: 2 IKCNRCH-SEGIHKTGVVRN-KQRYKCKSCGYNFVLSDGRIKPDIAIKLALTVIMYSLGK 59 Query: 64 VGCRATARIMGVGLNTIFRHLKN 86 A++ GV + TI L+ Sbjct: 60 YSYGFIAKLFGVRMTTIQNWLEQ 82 >UniRef50_C7RJT2 Conserved possible transposase n=21 Tax=Proteobacteria RepID=C7RJT2_9PROT Length = 342 Score = 47.6 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 31/94 (32%), Gaps = 11/94 (11%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDM 59 + CP C R+ A + C+ C++ + + F + + + + Sbjct: 40 EEVVCPHCGMAH---RHYFRPARKI-WRCAGCQEDFSVTSGTIFAFHKLPLRLYLAAVIL 95 Query: 60 AMN---GVGCRATARIMGVGLNTIFRHLKNSGRS 90 N G+ R +GV T + L S Sbjct: 96 FTNAVKGISALQVGRDLGVSHKTAYVLLHKIRES 129 >UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderiaceae RepID=B5S3H3_RALSO Length = 460 Score = 47.6 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 11/88 (12%), Positives = 26/88 (29%), Gaps = 6/88 (6%) Query: 6 ISCPSCSATDGVVRNGKSTAGH-QRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAM 61 CP C +R ++ G+ + C C++++ +I + Sbjct: 103 PRCPHCDGLR--IRPDRNKGGNLPSFFCHGCKRSFNRLTGTPFSHLVNRAKGAAMIPLLS 160 Query: 62 NGVGCRATARIMGVGLNTIFRHLKNSGR 89 + + +G + L R Sbjct: 161 RQMSLDQAGKRLGRTKKAVLSWLLAFRR 188 Score = 45.7 bits (107), Expect = 5e-04, Method: Composition-based stats. 
Identities = 13/86 (15%), Positives = 27/86 (31%), Gaps = 3/86 (3%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP---GTHQKIIDMAMNG 63 SCP C + +G + C C + P + + M Sbjct: 350 SCPWCGSDQTKYHPAPRPSGLPGFRCRACLAYFTRVSNTPLVHPMARAYASRFVPMLGWH 409 Query: 64 VGCRATARIMGVGLNTIFRHLKNSGR 89 AR +G+ + T+ +++ + Sbjct: 410 ETGAGAARELGIAMGTLHTWVRSWRQ 435 >UniRef50_Q6MK35 Putative transposase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MK35_BDEBA Length = 300 Score = 47.6 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 33/96 (34%), Gaps = 15/96 (15%) Query: 5 SISCPSCS-------ATDGVVRNGKSTAGHQ-----RYLCSHCRKTWQLQFTYTAS---Q 49 ++ CP C A + R G+ R C C K++ + Sbjct: 8 NLKCPYCHLQRDPKDANRTIRRLGRYYRKSDGQTLTRLWCVRCGKSFSAATQSRLKGQKK 67 Query: 50 PGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 ++ I D+ + R AR++ + T+ R + Sbjct: 68 RHLNKLIRDLLTGEMSQREIARVLKINRKTVVRKFR 103 >UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_METMA Length = 148 Score = 47.2 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 26/77 (33%), Gaps = 4/77 (5%) Query: 12 SATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNGVGCR 67 + + V++ H + C C+K + F + + I M R Sbjct: 34 NQGNIVLKERYGKNNHALFKCKTCKKCFSETKGTIFFELNTPDEEVLRTIAMLPEKGSIR 93 Query: 68 ATARIMGVGLNTIFRHL 84 AR G +TI R L Sbjct: 94 GVARATGHSKDTICRWL 110 >UniRef50_D2LK53 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LK53_RHOVA Length = 249 Score = 47.2 bits (111), Expect = 2e-04, Method: Composition-based stats. 
Identities = 17/93 (18%), Positives = 25/93 (26%), Gaps = 12/93 (12%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK----IIDMA 60 + CP C T+ + C C K + L + I + Sbjct: 19 NPVCPECGGTNH-----YDLKSRPVWKCKACSKQFSLTSGTIFHSRKLRIRDILGAIAIF 73 Query: 61 MN---GVGCRATARIMGVGLNTIFRHLKNSGRS 90 N G +R +G T F L S Sbjct: 74 TNGAKGYSALQLSRDLGCDYKTCFVLLHKLRES 106 >UniRef50_B9JNY3 Transposase n=4 Tax=Alphaproteobacteria RepID=B9JNY3_AGRRK Length = 365 Score = 46.8 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 31/95 (32%), Gaps = 15/95 (15%) Query: 8 CPSCSATDGVVRNGKSTAGHQR-----YLCS--HCRKTWQLQFTYTAS----QPGTHQKI 56 CP+C + G+ G +R Y CS CR + + T K Sbjct: 43 CPACGYKRSIAIAGRDM-GKRRARPGLYQCSSGDCRFQFTVTTHTPLHATKLPLRTWLKA 101 Query: 57 IDMAMN---GVGCRATARIMGVGLNTIFRHLKNSG 88 + + + G+ A +GV T +R Sbjct: 102 MWLLLQSDKGLSSVRLAETLGVSQPTAWRIGHALR 136 >UniRef50_Q035C5 Transposase n=27 Tax=Lactobacillales RepID=Q035C5_LACC3 Length = 414 Score = 46.8 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 39/105 (37%), Gaps = 21/105 (20%) Query: 7 SCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFT----YT 46 CP C + + NG TA QR+ C +C T + Sbjct: 50 RCPLCGF-EALHPNGFYTAHVRVLNGVEIPTVIDLHKQRWRCHNCYHTVSAKTPLVQPNH 108 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 ++I+ +A + + ARI+G+ +++ R + + + R Sbjct: 109 TIAAHMTERIMKLAHERLPVKTIARIIGISASSVQRIIDQNLKLR 153 >UniRef50_A8UDH0 Transposase n=5 Tax=Bacteria RepID=A8UDH0_9LACT Length = 439 Score = 46.8 bits (110), Expect = 2e-04, Method: Composition-based stats. 
Identities = 21/109 (19%), Positives = 32/109 (29%), Gaps = 23/109 (21%) Query: 5 SISCPSCSATDG---VVRNGKSTAG----------------HQRYLCSHCRKTWQLQFT- 44 C C + +V+NG T+ QR+LC C T+ Q Sbjct: 46 PSHCECCGMKNHSYSIVKNGYLTSRVKWVSSTHYPTYIQLKKQRFLCRECGVTFVAQSPE 105 Query: 45 ---YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 Q I + + ++ V T R LK +S Sbjct: 106 IEQGCFIAKRVKQSIAVELADTTSVKDLSKRHFVSPTTTDRVLKQLNQS 154 >UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostocaceae RepID=B2J098_NOSP7 Length = 133 Score = 46.8 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 23/38 (60%), Gaps = 1/38 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF 43 + CP C+ + + ++G+ G QRY+C +C + ++ + Sbjct: 34 MECPKCN-SHLLGKHGREPDGVQRYICKNCSRIFRARP 70 >UniRef50_C5RB59 Possible transposase n=1 Tax=Weissella paramesenteroides ATCC 33313 RepID=C5RB59_WEIPA Length = 228 Score = 46.5 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 29/95 (30%), Gaps = 22/95 (23%) Query: 8 CPSCSATDGVVRNG----------------KSTAGHQRYLCSHCRKT----WQLQFTYTA 47 CP C+ + +NG + Q+Y+C C +T + Sbjct: 30 CPQCAC--LMNKNGTKLVQHIASRAANIFNQLAIRKQKYICPQCHQTALAEFTDIKAGDH 87 Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFR 82 Q + V + A+ + +T+ R Sbjct: 88 IIANVKQAAAMELSDNVSQKHIAQAYNISPHTVMR 122 >UniRef50_C8SCF8 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF8_FERPL Length = 317 Score = 46.5 bits (109), Expect = 2e-04, Method: Composition-based stats. 
Identities = 15/74 (20%), Positives = 26/74 (35%), Gaps = 5/74 (6%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGH---QRYLCSHCRKTWQLQ--FTYTASQPGTHQK 55 + + SCP C++ + + ++YLC C T+ F +T P Sbjct: 28 LNKWNPSCPHCNSYHIIKKTDIKRERKGYAKKYLCRDCNSTFTFDNCFEWTHYPPRVVGD 87 Query: 56 IIDMAMNGVGCRAT 69 I + G R Sbjct: 88 IFHLIAKGESYRDI 101 >UniRef50_B2J7N9 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J7N9_NOSP7 Length = 428 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 2/47 (4%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT 52 + CP C +T +NG Q YLC +C + + + + Sbjct: 1 MKCPRCEST-SCRQNGC-RNDKQNYLCKNCGQQFLEPVFPHSLKGEL 45 >UniRef50_C6QEP3 ISSpo8, transposase n=4 Tax=Alphaproteobacteria RepID=C6QEP3_9RHIZ Length = 330 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 14/103 (13%), Positives = 34/103 (33%), Gaps = 18/103 (17%) Query: 6 ISCPSCSATDGVV-------RNGK-STAGHQRY---LCSHCRKTWQLQ----FTYTASQP 50 CP C A + + K + G +R+ C CRK + ++ F + Sbjct: 28 PVCPHCGADKRIYDLKGVRSKPSKRNPKGVERHGLKKCGACRKQFTVRVGTVFESSHIPL 87 Query: 51 GTHQKIIDMAMN---GVGCRATARIMGVGLNTIFRHLKNSGRS 90 + + + + G+ R++ + + + + Sbjct: 88 HLWLQAVHLMCSSKKGISSHQLHRVLEIKYQSAWFMSHRIREA 130 >UniRef50_Q93CQ1 Transposase TnpA n=1 Tax=Enterococcus faecium RepID=Q93CQ1_ENTFC Length = 446 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. 
Identities = 15/93 (16%), Positives = 26/93 (27%), Gaps = 19/93 (20%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 C C + ++GK QRY C C +T+ + Sbjct: 36 RCQKCGTIANLYKHGKKRQLFFDLPMHAKRVGIYLKRQRYKCRDCNETFFEKLPDLDDAR 95 Query: 51 GTHQK---IIDMAMNGVGCRATARIMGVGLNTI 80 ++ I + A +GV T+ Sbjct: 96 SVTKRLNNFIQEVSLEKTFTSVAEEIGVDEKTV 128 >UniRef50_UPI0001C31088 transcriptional regulator, TetR family n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31088 Length = 349 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 27/88 (30%), Gaps = 10/88 (11%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMN- 62 CP C A + G RY C C + F + + GT M Sbjct: 31 CPRCGADRPF----RLRRGDIRYACRVCEMALDPRAGTAFEGSRTPLGTWFVATAMLRED 86 Query: 63 -GVGCRATARIMGVGLNTIFRHLKNSGR 89 + A A GV T +R L+ Sbjct: 87 PQLTPTALAAEAGVSYATSWRMLRRLRE 114 >UniRef50_B0CG58 Transcriptional regulator, TetR family n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CG58_ACAM1 Length = 260 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 + CP C +D + +NGK Q Y+C CRK + Sbjct: 221 MICPHCQ-SDRLSKNGKRRNQ-QCYVCKDCRKQFVES 255 >UniRef50_B9ZCS9 DNA topoisomerase type IA zn finger domain protein n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZCS9_NATMA Length = 244 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. 
Identities = 19/91 (20%), Positives = 37/91 (40%), Gaps = 10/91 (10%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQ-KIIDMA 60 ++CP C +D V+NG S Q YLC +C +T+ + F ++ I Sbjct: 26 VTCPRC-RSDLTVKNG-SYGHFQHYLCKNCDRTFNDKTGTIFAHSKVALRKWLFSIYAFL 83 Query: 61 MNGVGCRATARIMGVG-LNTIFRHLKNSGRS 90 + + TI++H++ ++ Sbjct: 84 RFNTSLHQL--QLEIDQYKTIYQHIERFTKA 112 >UniRef50_A9IG79 ISSod11, transposase n=14 Tax=Proteobacteria RepID=A9IG79_BORPD Length = 223 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 26/91 (28%), Gaps = 13/91 (14%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ----PGTHQKIIDMAMN- 62 CP C V R A R +C C+ + + N Sbjct: 46 CPRCGNAGDVYR-----ASRTRLMCRSCQYQGTVTSGTIFDKTRTPLRVWLAAAWYLTNQ 100 Query: 63 --GVGCRATARIMGV-GLNTIFRHLKNSGRS 90 GV R++G+ T + L R+ Sbjct: 101 KQGVSALGLQRVLGLGSYQTAWTMLHRFRRA 131 >UniRef50_Q5LW63 ISSpo8, transposase n=4 Tax=Rhodobacterales RepID=Q5LW63_SILPO Length = 355 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 28/93 (30%), Gaps = 12/93 (12%) Query: 7 SCPSCSA-TDGVVRNGKSTAGHQRYLC--SHCRKTWQLQFTYTAS----QPGTHQKIIDM 59 CP C + + +R + G Y C C + + I + Sbjct: 39 HCPHCGSLSSTPIRGRTARPGL--YQCAERECCLQFTVTTKTPMHATKLDLRIWIAAIFL 96 Query: 60 AM---NGVGCRATARIMGVGLNTIFRHLKNSGR 89 + G+ ARI+GV T ++ Sbjct: 97 MLTSSKGISSVVMARILGVNQKTAWKLGHAIRE 129 >UniRef50_Q2J1M8 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris HaA2 RepID=Q2J1M8_RHOP2 Length = 204 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. 
Identities = 20/91 (21%), Positives = 31/91 (34%), Gaps = 11/91 (12%) Query: 6 ISCPSCS--ATDGVVRNGKSTA-GHQRYLCSHCRKTWQLQFTY----TASQPGTHQKIID 58 CP C + V GKS G YLCS CR+ + + T Sbjct: 19 PECPHCGVGSPSVVAIAGKSHRPGL--YLCSACRRQFTVTVGTPLEGTKLPLKLWIGAAH 76 Query: 59 MAMNGVGC--RATARIMGVGLNTIFRHLKNS 87 + + R R +GV T ++ ++ Sbjct: 77 LLNSHQPIAVREIERALGVTYKTAWKVVQRL 107 >UniRef50_A8YX76 Transposase n=42 Tax=Lactobacillus RepID=A8YX76_LACH4 Length = 426 Score = 45.7 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 33/106 (31%), Gaps = 22/106 (20%) Query: 4 VSISCPSCSATDGVVRNGK-----------------STAGHQRYLCSHCRK----TWQLQ 42 + +C C + D + NG QR C C + +L Sbjct: 43 IQPACLFCGSLDLLH-NGHLITNIHYPTANASLPVIIRLAKQRVKCRDCERWSMAQSELV 101 Query: 43 FTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSG 88 Y + + K++ + AR V +NT+ R L N Sbjct: 102 NKYCSISNASKLKVLSALTEDRSMTSIARENNVSINTVQRVLGNCS 147 >UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synechococcus sp. PCC 7335 RepID=B4WVD1_9SYNE Length = 298 Score = 45.7 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 13/94 (13%), Positives = 27/94 (28%), Gaps = 10/94 (10%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA----SQPGTHQKIIDM-A 60 + CP C + + K+ G+ + C CR+T+ + +++ Sbjct: 27 MDCPHCQSPRVSLLQRKTNLGYDMFRCKRCRRTFNERTGTPFNFIEVPTDIIFQVLLCRV 86 Query: 61 MNGVGCRATA-----RIMGVGLNTIFRHLKNSGR 89 + R A R T+ Sbjct: 87 RYKLSYRDVAEFFLLRGFQFTHETVRDWEARFLP 120 >UniRef50_Q2P6H2 ISXo5 transposase n=74 Tax=Xanthomonas RepID=Q2P6H2_XANOM Length = 332 Score = 45.7 bits (107), Expect = 5e-04, Method: Composition-based stats. 
Identities = 16/92 (17%), Positives = 30/92 (32%), Gaps = 14/92 (15%) Query: 8 CPSCSATDG--VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT--HQKIIDMAMNG 63 CP C+AT R+G + C+ C + L+ ++ M + G Sbjct: 53 CPRCAATAHSRFQRHGTMY-----WQCTACYRQTSLRSGTVMDNSKLPLRTWLLGMYLLG 107 Query: 64 VGCRATA-----RIMGVGLNTIFRHLKNSGRS 90 + R +GV T + ++ Sbjct: 108 QSKTNLSALELMRHLGVSYPTAWPMKHKLMQA 139 >UniRef50_Q3Y3Y3 Transposase, IS204/IS1001/IS1096/IS1165 n=11 Tax=Lactobacillales RepID=Q3Y3Y3_ENTFC Length = 424 Score = 45.7 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 34/105 (32%), Gaps = 21/105 (20%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQ----------------RYLCSHCRKTWQLQF----T 44 C C + ++RNG T Q R+LC C +T+ + Sbjct: 44 PSHCEHCGSI-RIIRNGSYTTRTQILKVKEKLTILELKRTRFLCYDCGQTFSAKTDLVDE 102 Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGR 89 + Q I+ + A+ V T+ R L+ + + Sbjct: 103 HHQLTKELKQAILMELYENQSRKLIAKKYFVSDGTVTRILREATK 147 >UniRef50_Q03IY7 Transposase n=198 Tax=Lactobacillales RepID=Q03IY7_STRTD Length = 442 Score = 45.7 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 12/65 (18%), Positives = 23/65 (35%), Gaps = 4/65 (6%) Query: 27 HQRYLCSHCRKTWQLQFTYTASQPG----THQKIIDMAMNGVGCRATARIMGVGLNTIFR 82 +R+ C C K + + +QKI + + A+ + V +T+ R Sbjct: 101 KRRFKCKECGKMAVAETSLVKKNHQIATVVYQKIAQLLIEKQSMTDIAKRLAVSTSTVSR 160 Query: 83 HLKNS 87 L Sbjct: 161 KLNEF 165 >UniRef50_A5FLG0 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FLG0_FLAJ1 Length = 311 Score = 45.7 bits (107), Expect = 5e-04, Method: Composition-based stats. 
Identities = 19/92 (20%), Positives = 31/92 (33%), Gaps = 11/92 (11%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQK---IID 58 +CP C + V NG + R C C+K + ++ F T II Sbjct: 39 PTCPYCESEKVKVLNGTTK----RLKCYGCKKQFGVKVGTIFHDTKISLRKWFIAVYIIT 94 Query: 59 MAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 G+ +R + V T + L + Sbjct: 95 AHKKGISSHQLSRDLKVTQKTAWFMLHRVREA 126 >UniRef50_A7HMZ5 Transposase IS204/IS1001/IS1096/IS1165 family protein n=14 Tax=Bacteria RepID=A7HMZ5_FERNB Length = 395 Score = 45.7 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 14/104 (13%), Positives = 30/104 (28%), Gaps = 19/104 (18%) Query: 7 SCPSCS---------ATDGVV------RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPG 51 CP C T V + +RY+C C K + ++ Sbjct: 40 KCPKCGNITSKVHDYHTQKVKDVPIMGKKTYLIIRKRRYVCKACGKKFFEHISFLGKSQR 99 Query: 52 THQKIIDMAMNGV----GCRATARIMGVGLNTIFRHLKNSGRSR 91 ++ ++ + + A+ V + T+ R + Sbjct: 100 MTNRLAAYIISQLGSLTSMKEIAKHTNVSVTTVMRLFDKVNPGQ 143 >UniRef50_D1UAU0 Transposase, putative n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1UAU0_9DELT Length = 320 Score = 45.3 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 12/86 (13%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPG-----THQKIIDMAMN 62 CP C R +G +R C+ C+ T+ F+ G +I + ++ Sbjct: 65 CPRCGH-----RKVYDLSG-ERLRCADCKYTF-HPFSGRWINNGALTSLEWLNLITLFVD 117 Query: 63 GVGCRATARIMGVGLNTIFRHLKNSG 88 + +G+ NT+++ L Sbjct: 118 ECSVHQMKQRLGLSYNTVYKALTAIR 143 >UniRef50_B1IC92 Transposase n=24 Tax=Lactobacillales RepID=B1IC92_STRPI Length = 415 Score = 45.3 bits (106), Expect = 6e-04, Method: Composition-based stats. 
Identities = 18/97 (18%), Positives = 32/97 (32%), Gaps = 19/97 (19%) Query: 7 SCPSCS----ATDGVVR--------NGKSTA---GHQRYLCSHCRKTW----QLQFTYTA 47 C +C DG + NG+ QRY C C T+ L Sbjct: 50 RCRNCGFPTVNKDGFRKTHVRLASLNGRRYELELRKQRYKCKSCHTTFGAITNLTKENQT 109 Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 +I+ +A G+ + A + +++ R + Sbjct: 110 LSSDLKNQIMLLARKGLSGQLIAEMCHCSPSSVRRTI 146 >UniRef50_B9Y9S5 Putative uncharacterized protein (Fragment) n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y9S5_9FIRM Length = 238 Score = 45.3 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 8/40 (20%), Positives = 15/40 (37%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 Q+ ++ + G AR+ + NT FR + Sbjct: 3 DQWQRFLECYLRGESLDVCARVAQIHRNTAFRWRHKVNDA 42 >UniRef50_B2JXE0 Putative uncharacterized protein n=2 Tax=Burkholderiaceae RepID=B2JXE0_BURP8 Length = 358 Score = 44.9 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 32/88 (36%), Gaps = 7/88 (7%) Query: 8 CPSCSATDGVVRNGKSTAGH---QRYLCSHCRKTWQLQFTYTASQPGTHQKI---IDMAM 61 CP C T +++ G + Y C C + S+ Q+ I + Sbjct: 59 CPRCRGT-RILKKGYARLRTGPLPTYRCEQCGHCFSRLSGTPLSKRPVRQQAGELIALLP 117 Query: 62 NGVGCRATARIMGVGLNTIFRHLKNSGR 89 + C AR +GV +T+ ++ R Sbjct: 118 QEISCAEAARQLGVMEHTVLETVRLVRR 145 >UniRef50_B2SIA3 ISXo5 transposase n=157 Tax=Proteobacteria RepID=B2SIA3_XANOP Length = 341 Score = 44.9 bits (105), Expect = 8e-04, Method: Composition-based stats. 
Identities = 14/90 (15%), Positives = 27/90 (30%), Gaps = 10/90 (11%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT--HQKIIDMAMNGVG 65 CP C+A G + C+ C + L+ ++ M + G Sbjct: 62 CPRCAANAHSR---FQRQGTTYWQCTACYRQTSLRSGTVMDNSKLPLRTWLLGMYLLGQS 118 Query: 66 CRATA-----RIMGVGLNTIFRHLKNSGRS 90 + R +GV T + ++ Sbjct: 119 KTNLSALELMRHLGVSYPTAWPMKHKLMQA 148 >UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVK5_PARL1 Length = 608 Score = 44.9 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 27/64 (42%), Gaps = 2/64 (3%) Query: 23 STAGHQRYLCSHCRKTWQLQFTYTASQ--PGTHQKIIDMAMNGVGCRATARIMGVGLNTI 80 S G QR+ C C++T+ + T Q P ++ + ++ R + G+ + Sbjct: 133 SRGGAQRFRCKACQRTFSVALKSTVRQRAPHLNRTVFAEVVSKKPLRGIMEVTGLSAAAV 192 Query: 81 FRHL 84 + L Sbjct: 193 YDKL 196 >UniRef50_B7C761 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C761_9FIRM Length = 74 Score = 44.9 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 12/62 (19%), Positives = 23/62 (37%), Gaps = 4/62 (6%) Query: 30 YLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 + C C+K + + Y+ ++I +NGV + TA + V +F Sbjct: 2 FRCKECKKRFVVDRGQLTFYSHHDQSKWNELILDTLNGVSLKETAAKINVNERNVFNMRH 61 Query: 86 NS 87 Sbjct: 62 KL 63 >UniRef50_Q7N9S9 Transposase TnpA, ISL3 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N9S9_PHOLL Length = 429 Score = 44.9 bits (105), Expect = 9e-04, Method: Composition-based stats. 
Identities = 23/105 (21%), Positives = 35/105 (33%), Gaps = 20/105 (19%) Query: 2 ASVSISCPSC--------SATDGVVR----NGKSTA---GHQRYLCSHCRKTWQLQFTYT 46 A+ C C D V+ NGK T +RY C C KT+ + Sbjct: 29 ANPPTHCIHCKHPEIVGFGRRDEVIMDTPVNGKRTGIILNRRRYRCQICCKTFMEPVPHK 88 Query: 47 ASQPGTHQKIIDMAMNGVGCRAT----ARIMGVGLNTIFRHLKNS 87 + ++I + R T A +GV T+ +S Sbjct: 89 DGKRQMTHRLIQ-YIERESLRRTFSSVAEDVGVDEKTVRNIFHDS 132 >UniRef50_UPI000186E028 transcription factor Sp4, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E028 Length = 749 Score = 44.9 bits (105), Expect = 9e-04, Method: Composition-based stats. Identities = 11/63 (17%), Positives = 28/63 (44%), Gaps = 2/63 (3%) Query: 6 ISC-PSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL-QFTYTASQPGTHQKIIDMAMNG 63 + C + +D + R+ ++ G +R+ C C+K + + QK+++ A + Sbjct: 655 MYCGKRFTRSDELQRHRRTHTGEKRFQCPDCQKKFMRSDHLSKHIKTHQKQKLMEAATST 714 Query: 64 VGC 66 + Sbjct: 715 ISL 717 >UniRef50_C7P9K3 Transcriptional regulator, ArsR family n=2 Tax=Methanocaldococcus RepID=C7P9K3_METFA Length = 206 Score = 44.5 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 12/37 (32%), Positives = 18/37 (48%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNS 87 KI+ NG G R TA+ +G+ T+ R +K Sbjct: 146 DRWVKILKSLYNGCGVRETAKNLGLSPATVSREVKKL 182 >UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family protein n=1 Tax=Comamonas testosteroni KF-1 RepID=B7X577_COMTE Length = 471 Score = 44.5 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 25/78 (32%), Gaps = 17/78 (21%) Query: 8 CPSCSATDGVVRNG----------------KSTAGHQRYLCSHCRKTWQLQFTYTASQPG 51 CP C D + R+G K A QRY C+ C++T+ Sbjct: 36 CPKCGTLDCIYRHGTKATTYVDIPMRGKPAKLRAKVQRYRCTSCKETFLQPLGGILEGRR 95 Query: 52 THQKIIDMAMNGVGCRAT 69 ++ + R T Sbjct: 96 MTERCA-TYIKAHSLRDT 112 >UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L491_AMOA5 Length = 119 Score = 44.5 bits (104), Expect = 0.001, Method: Composition-based stats. 
Identities = 9/53 (16%), Positives = 23/53 (43%) Query: 38 TWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + Q T + + + + G+G RA ++ + T+++ ++ SG Sbjct: 38 SLYSQPKSGVKPIQTKRLALQLYLEGLGFRAIGNLLQISYGTVYQWIEASGEQ 90 >UniRef50_B0SXP6 Putative transposase n=1 Tax=Caulobacter sp. K31 RepID=B0SXP6_CAUSK Length = 334 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 26/90 (28%), Gaps = 14/90 (15%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK----IIDMAMNG 63 C C D V A + + C C K + L + + I + +NG Sbjct: 24 CAHCGC-DAVYEY----AARRIFKCKACEKQFSLTSGTIFASRKLAIRDILTAIALFVNG 78 Query: 64 ----VGCRATARIMGVGLNTIFRHLKNSGR 89 R R + V T F L Sbjct: 79 ANGHAALR-MGRDLNVSYKTAFVLLHKLRE 107 >UniRef50_B9JG85 Putative uncharacterized protein n=1 Tax=Agrobacterium radiobacter K84 RepID=B9JG85_AGRRK Length = 191 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 29/78 (37%), Gaps = 3/78 (3%) Query: 15 DGVVR-NGKSTAGHQRYLCSHCRKTW--QLQFTYTASQPGTHQKIIDMAMNGVGCRATAR 71 DGVVR G S AG + C C + + + + + + A Sbjct: 9 DGVVRARGPSEAGLPVFRCLACDVHFRRTTGTPPSGLKFRKLELFVRLLSQQRPITDAAE 68 Query: 72 IMGVGLNTIFRHLKNSGR 89 ++ V + T+ R +K + Sbjct: 69 MIDVKVVTVIRWVKRMRQ 86 >UniRef50_C3MUP9 Resolvase helix-turn-helix domain protein n=40 Tax=Sulfolobus RepID=C3MUP9_SULIM Length = 369 Score = 44.1 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 9/39 (23%), Positives = 20/39 (51%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGR 89 T + I + + G+ A+I+ V +T++R +K + Sbjct: 22 ETKARAILLHLEGMKISQIAKILQVHKSTVYRWVKEFEK 60 >UniRef50_Q1GHU2 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GHU2_SILST Length = 124 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 14/67 (20%), Positives = 26/67 (38%), Gaps = 6/67 (8%) Query: 24 TAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG------VGCRATARIMGVGL 77 QRY C C+KT+ + ++ D+ + R AR +G+ Sbjct: 2 RTNVQRYRCGSCKKTFSGRTGTRIARIHRPGLFFDVLKDMPGPRPLSSVRVLARCLGLNK 61 Query: 78 NTIFRHL 84 +T++R Sbjct: 62 HTVWRWR 68 >UniRef50_A5VLK7 Transposase, IS204/IS1001/IS1096/IS1165 family protein n=19 Tax=Lactobacillus reuteri RepID=A5VLK7_LACRD Length = 343 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 15/105 (14%), Positives = 31/105 (29%), Gaps = 22/105 (20%) Query: 8 CPSCSATDG--VVRNGKSTAGH----------------QRYLCSHCRKTWQLQF----TY 45 CP+C + +++ G A H QR+ C C T+ Sbjct: 47 CPNCGVINRGQILKYGFYQAKHKYGQFRTQPLVLLVKTQRFQCPDCHTTFNATSYLFEKQ 106 Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 +++I + A + + ++ R L + Sbjct: 107 RTISRDLRREVILRLTRIQTIKDIAHDLFISEASVQRILLDLADQ 151 >UniRef50_A2V378 Putative uncharacterized protein n=1 Tax=Shewanella putrefaciens 200 RepID=A2V378_SHEPU Length = 214 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 29/90 (32%), Gaps = 12/90 (13%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT----YTASQPGTHQKIIDMAM-- 61 CPSC + S Y C+ C + L T I + Sbjct: 29 CPSCGGKEYCKLKRHSL-----YQCNTCHQQTSLTAGTILDNTKLPLTKWFLAIFLLTQV 83 Query: 62 -NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 NG+ +R++ V NT +R ++ Sbjct: 84 KNGISALELSRLIEVSYNTAWRMKHKLMQA 113 >UniRef50_A7HYI5 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HYI5_PARL1 Length = 594 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 1/58 (1%) Query: 20 NGKSTAGHQRYLCSHCRKTWQLQ-FTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 GK+ G RY C CRKT+ + T + I+ +N + V Sbjct: 135 FGKTAKGDARYQCKACRKTFSIGLPTRRHKKTDKTGAIMRGLVNKMAMSRLCETAQVT 192 >UniRef50_B2UM39 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UM39_AKKM8 Length = 313 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 34/95 (35%), Gaps = 14/95 (14%) Query: 6 ISCPSCSATDG---VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT--HQKIIDMA 60 + CP C ++ V RNG + C C K + ++ + H+ I Sbjct: 35 VVCPFCGKSEKQYRVKRNGVEGY----FECGECGKVYTVRTGTIFERSHVPLHKWIFAFY 90 Query: 61 M-----NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 + G+ ++ +GV T + L+ + Sbjct: 91 LVVTSRKGISSMQLSKEIGVTQKTAWFMLQRIREA 125 >UniRef50_C2H217 Possible transposase n=5 Tax=Enterococcaceae RepID=C2H217_ENTFA Length = 438 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 32/104 (30%), Gaps = 21/104 (20%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFT 44 + + C C D V+R+ + QR+ C+ CR T+ + Sbjct: 53 LTGEAPRCEYCGF-DSVIRHSYQDSWIQLLPYQEVPTYLHLYKQRFRCTRCRHTFSAKTY 111 Query: 45 YTA----SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 Y A I + + A+ V T+ R L Sbjct: 112 YVAENCYISQALKFAIAVDLKKKISMKDIAQRYFVSTKTVERVL 155 >UniRef50_D0MDA7 Transposase-like protein n=7 Tax=Bacteria RepID=D0MDA7_RHOM4 Length = 279 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. 
Identities = 18/88 (20%), Positives = 27/88 (30%), Gaps = 10/88 (11%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS----QPGTHQKIIDMAMN 62 CP C + G+ Y C CR+ W + G + + Sbjct: 31 HCPYCKSEHL----GRVRRRF--YKCYRCRREWSPRKGSLLEGLRLPLGKFLLALKLFEL 84 Query: 63 GVGCRATARIMGVGLNTIFRHLKNSGRS 90 V R AR +G+ NT+ R Sbjct: 85 EVSARRAARELGLAYNTVHRLFLLFRER 112 >UniRef50_Q8R819 Transposase n=2 Tax=Thermoanaerobacter tengcongensis RepID=Q8R819_THETN Length = 455 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 1/45 (2%) Query: 7 SCPSCSAT-DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP 50 +CP C A D + GK G Q+ C C+ W Sbjct: 95 NCPVCGAPPDYLYSFGKDPDGFQKLQCKVCKHQWAPGKPAPKKSR 139 >UniRef50_C6HZQ4 Transposase n=2 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZQ4_9BACT Length = 443 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 29/93 (31%), Gaps = 20/93 (21%) Query: 7 SCPSCSATDGV--------VR----NGK---STAGHQRYLCSHCRKTWQLQFTYTASQPG 51 C C + D V +R +GK +R+ C C +T+ + Sbjct: 35 RCVHCGSIDLVGFGRREQWIRDLPIHGKRVGIAVDTRRFRCKSCGRTFYEPLPAVDDKRL 94 Query: 52 THQKIIDMAMNGVGCR----ATARIMGVGLNTI 80 + + + R A GVG TI Sbjct: 95 MTTR-LKTWLEKKSLRPPFSQLAEETGVGALTI 126 >UniRef50_C0WEV9 Transposase (Fragment) n=1 Tax=Acidaminococcus sp. D21 RepID=C0WEV9_9FIRM Length = 358 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 28/96 (29%), Gaps = 19/96 (19%) Query: 6 ISCPSCS-ATDGV--VRNGKSTAG------------HQRYLCSHCRKTWQLQFT----YT 46 + CP+C TD + R + G +RY+C C +T+ Y Sbjct: 45 VQCPNCHAKTDRIKDYRWQRIAIGSILHQQAFVRLHKRRYVCPCCGRTFFETVPFLQRYQ 104 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFR 82 +I+ A T+ R Sbjct: 105 RKSKDLQMQIMVSCFQKRSFTDIAADFHTSTTTVIR 140 >UniRef50_D1W685 Putative uncharacterized protein n=2 Tax=Prevotella RepID=D1W685_9BACT Length = 298 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. 
Identities = 15/74 (20%), Positives = 25/74 (33%), Gaps = 6/74 (8%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 VV+ G QR+ C C +T+ T T + A V Sbjct: 2 VVKRGFHKN-RQRWYCKSCGRTFVGHKRLTEETVNTRYS-----KGNLTVEDLATEYAVS 55 Query: 77 LNTIFRHLKNSGRS 90 T++R L + ++ Sbjct: 56 TRTVYRRLSKTYKA 69 >UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJK4_ACIJU Length = 460 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 30/93 (32%), Gaps = 20/93 (21%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 CP C +D + ++G +RY C C+ T+ + T Sbjct: 33 KCPKCG-SDQLYKHGTKPVIYRDIPRHMKPTVINVEVKRYRCKSCKATFLQEVTGIYPDT 91 Query: 51 GTHQKI---IDMAMNGVGCRATARIMGVGLNTI 80 ++ I TAR+MG TI Sbjct: 92 RMTERFVKKIQDICLDYTFSDTARMMGCDSKTI 124 >UniRef50_A8A9S5 Transcriptional regulator, AsnC family n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8A9S5_IGNH4 Length = 152 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 15/47 (31%), Gaps = 2/47 (4%) Query: 46 TASQPGTHQKIIDMAMNGV--GCRATARIMGVGLNTIFRHLKNSGRS 90 KI+ + M G R AR + V T+ LK Sbjct: 4 HKDLDELDYKILSLLMENARKGVREIARELNVSPATVHNRLKKMLSK 50 >UniRef50_C4RAC5 Transposase n=2 Tax=magnetite-containing magnetic vibrio RepID=C4RAC5_9PROT Length = 367 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 28/90 (31%), Gaps = 11/90 (12%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ----PGTHQKIIDMA--- 60 CP C + D + + Y C CR + + + Sbjct: 40 CPKCGSKDAFLLTSLAYT----YACKECRGHTSVTAGTIMHRTKLALRVWFWAAHLMATH 95 Query: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 NG+ R A MG+ ++ + + RS Sbjct: 96 SNGISARQLAAQMGIAYSSAWLLEQKLRRS 125 >UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C34261 Length = 387 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. 
Identities = 12/35 (34%), Positives = 18/35 (51%), Gaps = 1/35 (2%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 CP C + + R G++ G QR C C+K W + Sbjct: 74 CPDCYQRETI-RYGRNPQGSQRVQCRACKKVWTPK 107 >UniRef50_C0W2A4 Transposase (Fragment) n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W2A4_9ACTO Length = 195 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 12/50 (24%), Positives = 19/50 (38%), Gaps = 3/50 (6%) Query: 20 NGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRAT 69 +GK+ AG QR+ C C T A + ++G + T Sbjct: 1 HGKTKAGRQRWRCKSCSITNLNPINTDAKNLEL---FLSWLLSGKTLKDT 47 >UniRef50_Q894I5 Phage-related protein n=1 Tax=Clostridium tetani RepID=Q894I5_CLOTE Length = 142 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 11/36 (30%), Positives = 20/36 (55%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKN 86 K +++ +NG TA+I+GV TI+R ++ Sbjct: 4 REKIKAMELLLNGETITDTAKIVGVERKTIYRWMEK 39 >UniRef50_B8FWC8 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FWC8_DESHD Length = 60 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 9/34 (26%), Positives = 16/34 (47%) Query: 55 KIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSG 88 I+D+ G R A+ +GV T+ ++ G Sbjct: 12 LILDLYKQGYTSREIAKQVGVSPTTVMNRIRKYG 45 >UniRef50_Q6V7R1 Bcep22gp32 n=1 Tax=Burkholderia phage Bcep22 RepID=Q6V7R1_9CAUD Length = 482 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. 
Identities = 13/92 (14%), Positives = 21/92 (22%), Gaps = 19/92 (20%) Query: 8 CPSCSATDGVVRNG----------------KSTAGHQRYLCSHCRKTWQLQFTYTASQPG 51 C C V ++G A QR+ C C T+ Sbjct: 35 CQKCGVIGRVYKHGPKMIIFRDSPIRGRPVSIEANAQRFRCRDCGGTFIQPLGGIHPATR 94 Query: 52 THQKIIDMAMNGV---GCRATARIMGVGLNTI 80 + + A +G T+ Sbjct: 95 MTARCVQYIEEQCLRDTFTRIAEHVGCDDKTV 126 >UniRef50_UPI00016C448A hypothetical protein GobsU_12575 n=6 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C448A Length = 234 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 35/95 (36%), Gaps = 12/95 (12%) Query: 8 CPSCSATDGVVR-NG-------KSTAGHQRYLCSHCRKTWQLQFTYTAS----QPGTHQK 55 C + D R +G + CS C+ + + T Sbjct: 9 CRNADCPDHGKRGHGNLTVPARYGPNRTRVLRCSTCQARFSERKGTPLYGTRLSAQTVTA 68 Query: 56 IIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 ++ G G R TAR++GV +T+ R+++ +G Sbjct: 69 VLAHVAEGAGTRKTARLVGVHRDTVTRYIRQAGHQ 103 >UniRef50_Q54X15 Type-2 histone deacetylase 1 n=1 Tax=Dictyostelium discoideum RepID=HDA21_DICDI Length = 1489 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 11/34 (32%), Positives = 13/34 (38%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 CP R GK+ G QR+ C C W Sbjct: 32 HCPRIEDVLYSQRKGKTNKGAQRWRCKACGTKWT 65 >UniRef50_UPI00016C46F4 hypothetical protein GobsU_15563 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C46F4 Length = 139 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 25/73 (34%), Gaps = 4/73 (5%) Query: 20 NGKSTAGHQRYLCSHCRKTWQLQFTYT----ASQPGTHQKIIDMAMNGVGCRATARIMGV 75 G + C+ C K + + I + G G R T R+ G Sbjct: 32 WSSKPRGIRCLRCTACGKNFSERKGTPLFGLHMSDEKALDIAHHLVEGNGMRPTGRLCGG 91 Query: 76 GLNTIFRHLKNSG 88 LNT+ R + +G Sbjct: 92 TLNTVLRFARKAG 104 >UniRef50_A2A935 PR domain zinc finger protein 16 n=35 Tax=Euteleostomi RepID=PRD16_MOUSE Length = 1275 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. 
Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 948 KERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 1004 >UniRef50_UPI00016C5DA1 ISSpo8, transposase n=1 Tax=Clostridium difficile QCD-76w55 RepID=UPI00016C5DA1 Length = 144 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 13/91 (14%), Positives = 32/91 (35%), Gaps = 13/91 (14%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP----GTHQKIID 58 S+ CP C + K + + + C+ C + + + I+ Sbjct: 27 KNSLICPYCYSD-------KISTYNDLFHCNICNSKFSITTNTIMHKTKLDYRKWLLAIN 79 Query: 59 MAM--NGVGCRATARIMGVGLNTIFRHLKNS 87 + + + R ++ + V NT ++ +K Sbjct: 80 LFIFDENISYRKLSKNIVVNKNTAYKMIKKL 110 >UniRef50_Q8U293 Transposase n=53 Tax=Pyrococcus RepID=Q8U293_PYRFU Length = 314 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 9/49 (18%), Positives = 22/49 (44%) Query: 42 QFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRS 90 F P + +++ + G+ R TARI+ + T++ ++ + Sbjct: 97 PFRRNKIPPEKKIRGVELYLRGLSYRQTARILKISHVTVWEAVQKLAEA 145 >UniRef50_Q9HAZ2 PR domain zinc finger protein 16 n=26 Tax=Euteleostomi RepID=PRD16_HUMAN Length = 1276 Score = 42.2 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 948 KERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 1004 >UniRef50_C1F2K1 Unclassified family transposase n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F2K1_ACIC5 Length = 327 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. 
Identities = 15/92 (16%), Positives = 27/92 (29%), Gaps = 12/92 (13%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA----SQPGTHQKIIDMAM 61 CP C + R T + C CRK + ++ + + + Sbjct: 39 PVCPHCGSA----RYSFLTTRRI-WKCKSCRKQYSVKSGTIFEDSPIPLDKWLMAVWLVV 93 Query: 62 ---NGVGCRATARIMGVGLNTIFRHLKNSGRS 90 NGV R + V + + L + Sbjct: 94 NCKNGVSSYEIMRAVKVTQKSAWFMLHRIRLA 125 >UniRef50_Q87RY6 Putative resolvase n=3 Tax=Vibrio parahaemolyticus RepID=Q87RY6_VIBPA Length = 216 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 23/48 (47%) Query: 39 WQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKN 86 W +F + HQ+II++ + G R A+ +G +T+ R K Sbjct: 161 WAEKFQGRKANTKQHQRIIELLLEGKSIRGVAQELGCNASTVQRVKKK 208 >UniRef50_Q12Y80 Transposase n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12Y80_METBU Length = 382 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 15/109 (13%), Positives = 30/109 (27%), Gaps = 23/109 (21%) Query: 4 VSISCPSCSATD------GVVRNGKSTAGHQ-----RYLCSHCRKTWQLQFTYTASQPGT 52 + CP C + + + G Q RY C C K + + Sbjct: 65 LHPQCPVCGSNKINKQEYYTRKLKLAEFGSQIIHVRRYYCKKCSKRFTTPLDPIVKKGHQ 124 Query: 53 HQKIIDMAMNG------VGCRATARI------MGVGLNTIFRHLKNSGR 89 + + + + R +I TI+ ++ S + Sbjct: 125 YARTYEQYIEDSYETGYCSFRHLQKIFSSLYDCSPSHQTIYNWIRKSNK 173 >UniRef50_A9BGL8 Transposase IS204/IS1001/IS1096/IS1165 family protein n=9 Tax=Petrotoga mobilis SJ95 RepID=A9BGL8_PETMO Length = 455 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. 
Identities = 22/101 (21%), Positives = 37/101 (36%), Gaps = 22/101 (21%) Query: 5 SISCPSCSAT-DGVVRNGKSTAGH-----------------QRYLCSHCRKTWQLQF-TY 45 C C + +VRNGK+ QRY+C KT++ + +Y Sbjct: 72 PYKCKGCKDKREYIVRNGKAKERIIKAGKVGTQRIYLIHRPQRYMCKKTGKTFRDEKISY 131 Query: 46 TASQ--PGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 + + I+ + ATA+ GV + T+ L Sbjct: 132 RWQRITRAETENIVKGLR-KMSISATAKEFGVSVRTVSNLL 171 >UniRef50_D2EIL2 Transposase n=1 Tax=Pediococcus acidilactici 7_4 RepID=D2EIL2_PEDAC Length = 212 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 29/96 (30%), Gaps = 21/96 (21%) Query: 7 SCPSCSATDGVVRN-------------GKS---TAGHQRYLCSHCRKTWQLQFTYTASQ- 49 SC C T +V+N GKS QR+LC C + Q Sbjct: 54 SCTYCH-TRSIVKNEFKTVYIRDIPFNGKSVILQIDKQRFLCKACHLSIIAQTNLIKKHA 112 Query: 50 ---PGTHQKIIDMAMNGVGCRATARIMGVGLNTIFR 82 II+ + + + V ++ R Sbjct: 113 QLTQRLKFSIINYLAKNLSVDNIVQKLNVSPVSVNR 148 >UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolobus islandicus RepID=D2PJ85_SULIS Length = 82 Score = 42.2 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 2/39 (5%) Query: 27 HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 QRYLC C + + Y ++ + M NGV Sbjct: 5 RQRYLCRDCGRYFLGDAIYH--SRELREEALKMYSNGVS 41 >UniRef50_D2MKS9 ISXo5 transposase n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MKS9_9BACT Length = 293 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. 
Identities = 15/94 (15%), Positives = 30/94 (31%), Gaps = 9/94 (9%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDM 59 +CP C + + + G R+ C CR ++ + F T I + Sbjct: 27 EEPTCPHCESPHVARKADGTRQG--RWNCHGCRSSFTVLSGTIFEKTRIPLQKWFLAIGL 84 Query: 60 AMN---GVGCRATARIMGVGLNTIFRHLKNSGRS 90 +N + AR + + T + + Sbjct: 85 IVNAKKSLSSCQLARDLSLTQPTAWYIQARIRSA 118 >UniRef50_Q4S840 Chromosome 9 SCAF14710, whole genome shotgun sequence n=5 Tax=Tetraodontidae RepID=Q4S840_TETNG Length = 1167 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 831 KERYACRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 887 >UniRef50_Q4L7B5 Transposase for ISSha1 n=49 Tax=Staphylococcus RepID=Q4L7B5_STAHJ Length = 438 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 19/101 (18%), Positives = 35/101 (34%), Gaps = 23/101 (22%) Query: 7 SCPSCSATD---GVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFT--- 44 C +CS + +V+NGK T+ QR+ C C + + Sbjct: 45 HCENCSTKNENFSIVKNGKKTSTITLLKIMEMPAYLELQKQRFYCKSCDSHFTAKSNIVD 104 Query: 45 -YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 + T ++D A ++ A+ V T+ R + Sbjct: 105 AHCFISNKTKLAVLDKAQEYRSQKSIAKSCLVSSMTVSRVI 145 >UniRef50_C3XYB0 Putative uncharacterized protein n=2 Tax=Chordata RepID=C3XYB0_BRAFL Length = 1482 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 786 KDRYTCRYCGKLFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 842 >UniRef50_C6MCG8 Putative uncharacterized protein n=17 Tax=Proteobacteria RepID=C6MCG8_9PROT Length = 297 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. 
Identities = 17/91 (18%), Positives = 28/91 (30%), Gaps = 12/91 (13%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMN 62 CP C +G + + CS C K L F T I + + Sbjct: 35 RCPYCGHD-----HGYAITTRHNWECSRCHKQTSLTAGTLFHSTNLPLVKWFWAIYLLAS 89 Query: 63 ---GVGCRATARIMGVGLNTIFRHLKNSGRS 90 G+ ++ + V T R L+ + Sbjct: 90 DKGGISALRLSKQINVSWITASRMLRKIRIA 120 >UniRef50_A0Q207 Transcriptional regulator n=3 Tax=Clostridium RepID=A0Q207_CLONN Length = 520 Score = 41.8 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 10/35 (28%), Positives = 15/35 (42%) Query: 53 HQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNS 87 + II R TA+++GV TI +K Sbjct: 481 KEAIIKALKKNKTFRKTAKVLGVSHTTIINKIKKY 515 >UniRef50_A3VEU0 ISSpo8, transposase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VEU0_9RHOB Length = 308 Score = 41.8 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 27/95 (28%), Gaps = 14/95 (14%) Query: 5 SISCPSCSAT--DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS----QPGTHQKIID 58 C C + + Y C CRK + ++ + I Sbjct: 34 EPVCGHCGSVSVTECKDHKPMP-----YRCKDCRKHFSVRTGTVLAESRLPLQKWLLAIF 88 Query: 59 MAMN---GVGCRATARIMGVGLNTIFRHLKNSGRS 90 M + G+ AR +GV T + + + Sbjct: 89 MLTSARKGIPSTQMARELGVTQKTAWFLAQRIRET 123 >UniRef50_C9RDH8 Regulatory protein LacI n=1 Tax=Ammonifex degensii KC4 RepID=C9RDH8_AMMDK Length = 435 Score = 41.8 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 11/34 (32%), Positives = 17/34 (50%) Query: 55 KIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSG 88 +++ + G R AR +GV NT+ R L N Sbjct: 398 RVVRLRAEGRSLREIAREVGVSKNTVARWLNNLS 431 >UniRef50_D1VTW0 Transposase (Fragment) n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VTW0_9FIRM Length = 179 Score = 41.8 bits (97), Expect = 0.007, Method: Composition-based stats. 
Identities = 18/109 (16%), Positives = 32/109 (29%), Gaps = 23/109 (21%) Query: 1 MASVSISCPSCSATD---GVVRNGKST----------------AGHQRYLCSHCRKTWQL 41 + +CP C + +++ G QR+LC C T+ Sbjct: 42 LTYNPKACPCCGHVNESFNIIKYGSKACKIKLPSISNIPTILFLKKQRFLCKECGSTFSA 101 Query: 42 QF----TYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKN 86 + ++ +KI ++ A V L LKN Sbjct: 102 KTDIVSEFSNISNDVKRKIAIDLTKISSFKSIAESNNVSLIPFLEFLKN 150 >UniRef50_Q03112 MDS1 and EVI1 complex locus protein EVI1 n=58 Tax=Euteleostomi RepID=EVI1_HUMAN Length = 1051 Score = 41.8 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 730 KERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 786 >UniRef50_C6P8Q1 Transposase IS3/IS911 family protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6P8Q1_CLOTS Length = 91 Score = 41.8 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 9/44 (20%), Positives = 18/44 (40%) Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 Q+I ++ +G +R GV TI++ +K + Sbjct: 8 YSEEFKQQIAELYQSGQSVLDLSREYGVTTVTIYKWIKQLSPVK 51 >UniRef50_A7BQK2 Transposase n=3 Tax=Bacteria RepID=A7BQK2_9GAMM Length = 391 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 10/35 (28%), Positives = 19/35 (54%) Query: 52 THQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKN 86 +II ++ G G R +R++G+ +T+ R K Sbjct: 28 VRSRIILLSDEGFGSRKVSRMLGISRDTVQRWRKR 62 >UniRef50_Q30XD0 Transposase n=4 Tax=Proteobacteria RepID=Q30XD0_DESDG Length = 422 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. 
Identities = 15/104 (14%), Positives = 33/104 (31%), Gaps = 20/104 (19%) Query: 4 VSISCPSCS-ATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYT 46 ++ P C + ++G R+ C C K + ++ Sbjct: 29 ETVEVPLCGGCNTLMHKHGTRKNKFMDTPLYMEPVRLEVQRPRFRCESCGKMSMPELSFL 88 Query: 47 ASQPGTHQKIIDMAMN---GVGCRATARIMGVGLNTIFRHLKNS 87 + ++++D+ G A A GV +NT+ + Sbjct: 89 DDKRRATKRLVDVIRQQCLGTTFHALAEQTGVAVNTVKNIAHDL 132 >UniRef50_C7XW38 Transposase ISLasa4v n=4 Tax=Lactobacillus RepID=C7XW38_9LACO Length = 436 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 23/110 (20%), Positives = 29/110 (26%), Gaps = 30/110 (27%) Query: 3 SVSISCPSCSATDGVVRNGKSTAG-------------------HQRYLCS---HCRKTWQ 40 S+ + CP C + RNG Q+YLC C Sbjct: 37 SLPMHCPVCG--QLMQRNGWLRRRPVKIKILSIAGQPTVLSIIKQQYLCKPSASCPHPVT 94 Query: 41 LQFTYTASQPG------THQKIIDMAMNGVGCRATARIMGVGLNTIFRHL 84 Q G Q I + AR V NT+ R L Sbjct: 95 CVAPIQGIQKGCRIANLVKQHITLELTQNISQTTIARQHNVSTNTVSRVL 144 >UniRef50_C1SJT0 Transposase family protein, COG3464 n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJT0_9BACT Length = 436 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 29/98 (29%), Gaps = 20/98 (20%) Query: 2 ASVSISCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTY 45 S SC C + V ++GK T QRY C C Sbjct: 30 VSEPESCSGCG-SKPVYKHGKRTHVYADTPMHGMPVKVEIERQRYRCQSCGTVIVPNIPS 88 Query: 46 TASQPGTHQKIIDMA---MNGVGCRATARIMGVGLNTI 80 + +++I+ A G+ +NTI Sbjct: 89 LDEKRVVTKRLIEFVQARCFNNTFTLLANETGLAVNTI 126 >UniRef50_Q5ZT03 Transposase (IS652) n=29 Tax=Gammaproteobacteria RepID=Q5ZT03_LEGPH Length = 399 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. 
Identities = 17/99 (17%), Positives = 33/99 (33%), Gaps = 19/99 (19%) Query: 6 ISCPSCSATDG------VVRNGKSTAGHQR---------YLCSHCRKTWQLQF----TYT 46 + C C + R + G +R Y C C + + +F Y Sbjct: 42 VRCIHCGNKKLRVKDSFIRRIRHESIGLRRSYLCLKAHKYYCPSCGRYFNQRFPGIGKYQ 101 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 + +++ GV + AR + +G +T+ R Sbjct: 102 RASESLRKQVFHYHSKGVSQKDLARDLKLGKSTVERWYH 140 >UniRef50_B2JMI5 Transposase n=2 Tax=Burkholderia RepID=B2JMI5_BURP8 Length = 75 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 17/38 (44%) Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLK 85 + + M NG ATA+I+G+ T+ +K Sbjct: 8 YTLEFKIEAVRMVRNGQSQAATAKILGISTQTLNAWIK 45 >UniRef50_B8F7J2 Putative uncharacterized protein n=1 Tax=Haemophilus parasuis SH0165 RepID=B8F7J2_HAEPS Length = 233 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 2/41 (4%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT 44 C C ++D + ++G QRY C+ C KT+ L+ Sbjct: 35 EPKKCHFCHSSD-IRKHG-IRNNIQRYKCNACNKTFTLKKK 73 >UniRef50_Q9SVC5 Dof zinc finger protein DOF3.5 n=2 Tax=Arabidopsis thaliana RepID=DOF35_ARATH Length = 247 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 1/40 (2%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQ-RYLCSHCRKTWQ 40 A ++ SCP C +++ + + Q RY C CR+ W Sbjct: 21 AEITPSCPRCGSSNTKFCYYNNYSLTQPRYFCKGCRRYWT 60 >UniRef50_A5KRX5 ISSpo8, transposase n=2 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KRX5_9BACT Length = 275 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. 
Identities = 18/90 (20%), Positives = 31/90 (34%), Gaps = 11/90 (12%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS----QPGTHQKIIDM-- 59 + CP C D V+ K +G R C+ CR + ++ I + Sbjct: 32 VVCPKCGEID--VKYYKLASG--RMKCASCRSPFTVRMGSIFEESPVPLQKWFLAIYLCT 87 Query: 60 -AMNGVGCRATARIMGVGLNTIFRHLKNSG 88 GV ++ +GV T + L+ Sbjct: 88 SLKKGVSSIQLSKYIGVTQKTAWFMLQRIR 117 >UniRef50_D2M0Z3 Two component transcriptional regulator, LuxR family n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2M0Z3_BACS4 Length = 204 Score = 41.5 bits (96), Expect = 0.010, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 28/69 (40%) Query: 22 KSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTIF 81 K G Q +L + + ++ +++ M + G + TA+I+ + T+ Sbjct: 117 KVLNGEQAFLNNGSPRNYREIDEEFHELSKREEEVFYMKLRGYTVKDTAQILNISPKTVE 176 Query: 82 RHLKNSGRS 90 H +N + Sbjct: 177 NHRRNIRKK 185 >UniRef50_A6Q3M3 Transposase n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q3M3_NITSB Length = 211 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 13/85 (15%), Positives = 27/85 (31%), Gaps = 13/85 (15%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C+A ++ NG C C++ + + +KI+ + Sbjct: 1 MRCIYCNAPTYLLGNG-------NRKCKRCKRKFSPEKIAR------KEKIVKCFCENLS 47 Query: 66 CRATARIMGVGLNTIFRHLKNSGRS 90 R G TI + + + Sbjct: 48 VNECMRQSGYNYVTIKNYYEMFRKK 72 >UniRef50_B2SSB8 Transposase TnpA, ISL3 family n=6 Tax=Bacteria RepID=B2SSB8_XANOP Length = 472 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. 
Identities = 13/99 (13%), Positives = 32/99 (32%), Gaps = 20/99 (20%) Query: 8 CPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQPG 51 C +C +D ++ +G++ +R+ C C KT+ ++ Sbjct: 35 CTACG-SDRLIGHGRNEQVVRDLPTHGKRLAIYVDTRRWRCQSCGKTFMEPLPAVNAKRE 93 Query: 52 THQKIIDMA---MNGVGCRATARIMGVGLNTIFRHLKNS 87 +++ + A G+ TI ++ Sbjct: 94 MTDRLVKWIGQQSLKRTFASIADDTGLDEKTIRNIFRDY 132 >UniRef50_B2TB85 Transposase IS3/IS911 family protein n=2 Tax=Burkholderia RepID=B2TB85_BURPP Length = 145 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 11/49 (22%), Positives = 18/49 (36%), Gaps = 1/49 (2%) Query: 44 TYTASQPGTHQKIIDMAMNGV-GCRATARIMGVGLNTIFRHLKNSGRSR 91 Y P ++I++ ++G AR V N +F K R Sbjct: 29 KYRRRTPDEKRQIVEETLSGGGSVAEIARSHKVSANQVFDWRKQYLDGR 77 >UniRef50_A9VK04 Transposase IS3/IS911 family protein n=12 Tax=Bacillales RepID=A9VK04_BACWK Length = 98 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 7/41 (17%), Positives = 20/41 (48%), Gaps = 1/41 (2%) Query: 48 SQPGTHQKIIDMAM-NGVGCRATARIMGVGLNTIFRHLKNS 87 ++ ID+ + G+ + A+ +G+ + + R +K+ Sbjct: 8 YDVEFKKQAIDLYLKEGMSYKTIAKELGIHHSVVSRWVKHF 48 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.147 0.513 Lambda K H 0.267 0.0461 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 563,754,248 Number of Sequences: 3077464 Number of extensions: 19350501 Number of successful extensions: 209645 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 294 Number of HSP's successfully gapped in prelim test: 545 Number of HSP's that attempted gapping in prelim test: 207465 Number of HSP's gapped (non-prelim): 2253 length of query: 91 length of database: 1,040,396,356 effective HSP length: 61 effective length of query: 30 effective length of database: 852,671,052 effective search space: 
25580131560
effective search space used: 25580131560
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (20.8 bits)
S2: 87 (38.0 bits)
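
Note on the search-space figures above: the numbers in the statistics footer appear to be related by simple arithmetic. The following Python sketch is an illustration only, not BLAST source code. It assumes the usual length correction (the effective HSP length is subtracted once from the query and once per database sequence) and the standard relation E = m * n * 2^(-S') between bit score and expected value. The function and variable names are hypothetical. Because this run reports "Composition-based stats", the E-values printed in the report may differ slightly from what the plain formula gives.

```python
# Illustrative sketch: re-derive the footer statistics of this report.
# All input values are copied from the report; the names are hypothetical.

def effective_query_length(query_len: int, hsp_len: int) -> int:
    """Query length after subtracting the effective HSP length."""
    return query_len - hsp_len


def effective_db_length(db_letters: int, num_seqs: int, hsp_len: int) -> int:
    """Database length after subtracting the HSP length once per sequence."""
    return db_letters - num_seqs * hsp_len


def expect_from_bits(bit_score: float, search_space: int) -> float:
    """Expected number of chance hits for a bit score: E = m*n * 2^(-S')."""
    return search_space * 2.0 ** (-bit_score)


QUERY_LEN = 91                # "length of query"
DB_LETTERS = 1_040_396_356    # "length of database"
NUM_SEQS = 3_077_464          # "Number of Sequences"
HSP_LEN = 61                  # "effective HSP length"

eff_query = effective_query_length(QUERY_LEN, HSP_LEN)
eff_db = effective_db_length(DB_LETTERS, NUM_SEQS, HSP_LEN)
search_space = eff_query * eff_db

print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", search_space)

# Compare the plain formula with a few score/E-value pairs from the report.
for bits in (44.1, 42.6, 41.1):
    print(f"{bits} bits -> E ~ {expect_from_bits(bits, search_space):.4f}")
```

Under these assumptions the three derived quantities match the footer (30, 852,671,052 and 25,580,131,560), and the computed E-values for 44.1, 42.6 and 41.1 bits come out near the reported 0.001, 0.004 and 0.010.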