BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (91 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo... 188 5e-47 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 144 1e-33 UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI... 108 6e-23 UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter... 95 8e-19 UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax... 92 4e-18 UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr... 89 5e-17 UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl... 86 3e-16 UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin... 84 1e-15 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 79 6e-14 UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter... 67 2e-10 UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling... 65 7e-10 UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX 63 3e-09 UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI... 59 6e-08 UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p... 58 1e-07 UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea... 55 8e-07 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 55 1e-06 UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon... 54 1e-06 UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp.... 50 2e-05 UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria... 47 1e-04 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 47 2e-04 UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw... 47 2e-04 UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid... 47 3e-04 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 46 3e-04 UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae... 44 0.001 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 44 0.002 UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale... 44 0.002 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 44 0.002 UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano... 43 0.002 UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE 43 0.004 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 42 0.006 UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S... 41 0.012 UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseud... 40 0.016 UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru... 40 0.022 UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ... 40 0.025 UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 R... 39 0.043 UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur... 39 0.044 UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcti... 39 0.060 >UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX Length = 91 Score = 188 bits (477), Expect = 5e-47, Method: Compositional matrix adjust. Identities = 90/91 (98%), Positives = 90/91 (98%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 MNGVGCRATARIMGVGLNTI RHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 144 bits (362), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 67/68 (98%), Positives = 68/68 (100%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 +VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG Sbjct: 1 MVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 60 Query: 77 LNTILRHL 84 LNTILRHL Sbjct: 61 LNTILRHL 68 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 108 bits (270), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 51/52 (98%), Positives = 51/52 (98%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGV LNTILRHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 94.7 bits (234), Expect = 8e-19, Method: Compositional matrix adjust. Identities = 40/90 (44%), Positives = 57/90 (63%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V + CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 MN G R TAR + + +N ++R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 92.4 bits (228), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 42/90 (46%), Positives = 60/90 (66%), Gaps = 1/90 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASV+I CP C + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 NG G R TAR + +G+NT++R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQS 89 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 89.0 bits (219), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 38/87 (43%), Positives = 56/87 (64%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA++ + C C+ T+ V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 MN G R TA ++ V NT+L LKNS Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNS 87 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 86.3 bits (212), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 41/65 (63%), Positives = 47/65 (72%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ-KIIDM 59 MAS+ + PSC+ T+GV RNGKSTAGHQ YLC CRK W L FTYT SQ THQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AMNGV 64 + + Sbjct: 67 TIMAL 71 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 84.0 bits (206), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 39/82 (47%), Positives = 51/82 (62%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 I+CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M +G Sbjct: 5 DIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMMNDGS 64 Query: 65 GCRATARIMGVGLNTILRHLKN 86 R AR +GV L T+LRHLK+ Sbjct: 65 EQRDIARKLGVSLETVLRHLKD 86 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 78.6 bits (192), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V+I CP C + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 NG G R TAR + +G NT++R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 67.0 bits (162), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 32/81 (39%), Positives = 47/81 (58%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + + C C +D VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRATARIMGVGLNTIL 81 G RAT+R + V NT+L Sbjct: 61 AQNHGKRATSRHLQVSYNTVL 81 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 65.1 bits (157), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M +++C T + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 +NG G R TAR++GV NT+ K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 62.8 bits (151), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA+V++ P C+ +D V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 58.5 bits (140), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 32/87 (36%), Positives = 50/87 (57%), Gaps = 4/87 (4%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSH--C-RKTWQLQFTYTASQPGTHQKIIDMA 60 ++I CP C +TD VV+NG S G QRY C + C R+++ ++Y + ++I M Sbjct: 5 LAIECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMV 63 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 +NG G R TAR++ + T+ LK S Sbjct: 64 VNGSGIRDTARVLEISPITVASELKKS 90 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 57.8 bits (138), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 31/84 (36%), Positives = 49/84 (58%), Gaps = 6/84 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 +SC C TD V R+GK + G+QR+ CS C++T+QL++ Y A + H++ + G Sbjct: 1 MSCRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVADR---HER---YSPGNAG 54 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 R TAR++ VG + R K + R Sbjct: 55 IRDTARVLKVGCMGLTRFRKLNPR 78 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 55.1 bits (131), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 3/77 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+SV+I CP C + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MN--GVGCRATARIMGV 75 N G+ AR+ G+ Sbjct: 60 FNEPGMMLARMARLHGI 76 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 26/84 (30%), Positives = 45/84 (53%), Gaps = 1/84 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I+CP C V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKE-KKVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 R+TARI+ + T+L+ + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGR 94 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 53.9 bits (128), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 1/64 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MAS++I CP C+ +D V R+GK+ AG+ RY C C +QL +TY A P + ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 4/82 (4%) Query: 9 PSCSATDGVVRNGKSTAGHQRYLCSHC---RKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 PSC ++D VV+ + T G QRY C + R T+ Q+ Y Q+I++M +NG G Sbjct: 9 PSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIVEMVVNGSG 67 Query: 66 CRATARIMGVGLNTILRHLKNS 87 R AR++ + T+ LK S Sbjct: 68 TRDPARVLKISRTTVTETLKKS 89 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 47.4 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 31/82 (37%), Positives = 42/82 (51%), Gaps = 8/82 (9%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIIDMAMNG 63 CP C+ +D V+NGK+ HQRY+C C KT+ + T G K ID +N Sbjct: 48 CPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCLVNK 105 Query: 64 VGCRATARIMGVGLNT--ILRH 83 R TA+I G+ L T + RH Sbjct: 106 YPLRKTAKICGISLPTAFVWRH 127 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 2/87 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 +SCPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSC-GSHHVVKCGRPL-GRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRATARIMGVGLNTILRHLKNSGRSR 91 RA +R++ V L T+ +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 46.6 bits (109), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 21/51 (41%), Positives = 31/51 (60%) Query: 41 LQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 L Y A + ++II+MA G G R TA + +G+NT++R LKNS +S Sbjct: 23 LTLAYEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 46.6 bits (109), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 26/88 (29%), Positives = 42/88 (47%), Gaps = 10/88 (11%) Query: 5 SISCPSC----SATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 +++CP C S DG+VR G QRY C CR + + T +K + + Sbjct: 3 TMNCPRCNNAHSCKDGIVR------GRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLY 56 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSG 88 + G+G RA RI+ + T+ + +K G Sbjct: 57 LEGLGFRAIGRILNISYGTVYQWVKACG 84 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 46.2 bits (108), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 11/90 (12%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH-----QKIIDMA 60 + CP C AT+ + +NGK G Q ++C+ C + QF P + Q+ ++M Sbjct: 1 MQCPYCGATE-IRKNGKR-RGKQNHICTKCER----QFIDVYDPPKGYSEELKQECLEMY 54 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 +NG+G R R+ GV TI+ +K G Sbjct: 55 LNGMGFRPIERVKGVHHTTIIFWVKQMGEK 84 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust. Identities = 19/41 (46%), Positives = 26/41 (63%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL 41 MA + + CP + T V+RNG +T+G Q Y C C KT+QL Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQL 41 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust. Identities = 17/47 (36%), Positives = 27/47 (57%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA 47 M ++ + C C T+ V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRA 47 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust. Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 2/80 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + VV+NG G Q YLC C + ++ + + M++NG+G Sbjct: 1 MDCPYCQSHK-VVKNGHRQ-GKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMG 58 Query: 66 CRATARIMGVGLNTILRHLK 85 RA R+ G+ NTIL ++ Sbjct: 59 FRAIERVTGISHNTILNWVR 78 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 45/82 (54%), Gaps = 3/82 (3%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC 66 +CPSC +D V++NG S+ G +Y C+ CR+T+ + S+ ++I+ +N + Sbjct: 70 NCPSCK-SDKVIKNG-SSRGKTKYKCNVCRRTFYDANSRRMSRE-QKERILKEYLNRMSM 126 Query: 67 RATARIMGVGLNTILRHLKNSG 88 R A++ G L T+ +K G Sbjct: 127 RGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 43.1 bits (100), Expect = 0.002, Method: Compositional matrix adjust. Identities = 22/89 (24%), Positives = 45/89 (50%), Gaps = 10/89 (11%) Query: 6 ISCPSCSAT----DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 ++CP C+++ +G+V G Q Y C C + ++ TAS P ++ + + + Sbjct: 1 MNCPRCNSSTHKKNGIV------FGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYL 54 Query: 62 NGVGCRATARIMGVGLNTILRHLKNSGRS 90 G+G R+ R +GV ++ + +K G+ Sbjct: 55 EGLGFRSIGRFLGVSHVSVQKWIKKFGQE 83 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 10/83 (12%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT---YTASQPGTHQ--KIIDMAMN 62 CP C D V +NGKS G QRY+C CR ++ +FT ++ ++ G + K ++ + Sbjct: 53 CPKCQCKD-VNKNGKSN-GRQRYICKRCRTSFD-EFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRATARIMGVGLNT--ILRH 83 G+ R A +GVG+ T +RH Sbjct: 110 GLSIRKCAEEVGVGVKTSFYMRH 132 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 42.0 bits (97), Expect = 0.006, Method: Compositional matrix adjust. Identities = 21/89 (23%), Positives = 45/89 (50%), Gaps = 10/89 (11%) Query: 6 ISCPSCSAT----DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 ++CP C+++ +G+V G QRY C C + ++ T+ P ++ + + + Sbjct: 1 MNCPRCNSSTHKKNGIV------FGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYL 54 Query: 62 NGVGCRATARIMGVGLNTILRHLKNSGRS 90 G+G R+ R +GV ++ + +K G+ Sbjct: 55 EGLGFRSIGRFLGVSHVSVQKWIKKFGQE 83 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 40.8 bits (94), Expect = 0.012, Method: Compositional matrix adjust. Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 7/82 (8%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + + T ++ ID MNG Sbjct: 52 CPLCGCIH-VVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRATARIMGVGLNT--ILRH 83 + R TA G+ NT + RH Sbjct: 111 LSIRKTAVACGIHRNTAFLWRH 132 >UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JSC1_YERPY Length = 53 Score = 40.4 bits (93), Expect = 0.016, Method: Compositional matrix adjust. Identities = 16/37 (43%), Positives = 21/37 (56%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRK 37 MA + CP C D V ++G +GHQRY C H +K Sbjct: 1 MAKIDEKCPFCERKDLVKKHGYGKSGHQRYRCPHAKK 37 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 40.0 bits (92), Expect = 0.022, Method: Compositional matrix adjust. Identities = 25/85 (29%), Positives = 38/85 (44%), Gaps = 1/85 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + C C +++ +V+NG S +G Q+Y C C L +KI+ Sbjct: 1 MIKETYECRECGSSN-IVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLK 85 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLK 84 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 40.0 bits (92), Expect = 0.025, Method: Composition-based stats. Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 8/77 (10%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS--QPGTHQKIIDMA---MN 62 CP C ++ + RNGK G QRY+C C+KT+ FT +A+ T K + A +N Sbjct: 54 CPLC-GSETISRNGKYN-GKQRYICKSCKKTFT-DFTNSATYKSKKTLDKWLKYAKCMIN 110 Query: 63 GVGCRATARIMGVGLNT 79 G R +A+I+ + + T Sbjct: 111 GYSIRKSAKIVEINIAT 127 >UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 RepID=P73782_SYNY3 Length = 141 Score = 39.3 bits (90), Expect = 0.043, Method: Compositional matrix adjust. Identities = 25/87 (28%), Positives = 44/87 (50%), Gaps = 10/87 (11%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT----HQKIIDMAMNG 63 CP C + VV+NG G QR+ C C Q +FT + + G + + + M G Sbjct: 7 CPQCGHGN-VVKNG-FVKGKQRFKCKRC----QYKFTNLSKERGKLLWMKLEAVLLYMGG 60 Query: 64 VGCRATARIMGVGLNTILRHLKNSGRS 90 + ATA+++GV ++L +++ G + Sbjct: 61 MSMNATAKLLGVSTQSLLNWIRDFGEA 87 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 39.3 bits (90), Expect = 0.044, Method: Composition-based stats. Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 4/79 (5%) Query: 9 PSCSAT-DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKI-IDMAMNGVGC 66 PSC G+V+NGK+ AG QR+LC C + +T+ H KI ID ++G Sbjct: 7 PSCDMCGHGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDI--RHFKIFIDWILSGESA 64 Query: 67 RATARIMGVGLNTILRHLK 85 A+ +GV T+ R K Sbjct: 65 DHLAKRLGVTRRTLTRWFK 83 >UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FRR6_PSYA2 Length = 108 Score = 38.5 bits (88), Expect = 0.060, Method: Compositional matrix adjust. Identities = 27/88 (30%), Positives = 40/88 (45%), Gaps = 7/88 (7%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIID----M 59 + ISCP C + + +NG + G Q Y C C++ Q + + G H +I D M Sbjct: 5 IDISCPDCHSI-SLKKNGIKSYGKQNYQCKDCQR--QFIGDHALTYQGCHSRIEDRIRLM 61 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNS 87 G G R A I V + +L L +S Sbjct: 62 TARGCGIRDIAVITSVSIGKVLSTLGSS 89 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo... 126 2e-28 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 118 5e-26 UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax... 115 3e-25 UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter... 110 2e-23 UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr... 108 4e-23 UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling... 105 3e-22 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 102 3e-21 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 100 1e-20 UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin... 99 6e-20 UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale... 98 6e-20 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 98 1e-19 UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea... 97 1e-19 UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter... 92 7e-18 UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI... 91 1e-17 UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp.... 89 5e-17 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 89 6e-17 UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria... 84 2e-15 UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid... 84 2e-15 UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI... 83 3e-15 UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p... 80 3e-14 UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon... 75 5e-13 UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX 75 5e-13 UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl... 72 5e-12 UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw... 67 2e-10 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 61 1e-08 UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae... 53 3e-06 Sequences not found previously or not previously below threshold: UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriale... 85 5e-16 UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano... 84 1e-15 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 81 8e-15 UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S... 75 5e-13 UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ... 74 1e-12 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 69 5e-11 UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru... 68 1e-10 UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 R... 67 2e-10 UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcti... 66 3e-10 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 65 5e-10 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 65 7e-10 UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes ... 64 1e-09 UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=H... 63 2e-09 UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae... 63 2e-09 UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavoba... 63 3e-09 UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp... 63 3e-09 UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ... 62 7e-09 UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorick... 60 2e-08 UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID... 60 3e-08 UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marin... 59 5e-08 UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=enviro... 58 9e-08 UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani ... 58 9e-08 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 57 1e-07 UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyri... 57 1e-07 UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Met... 57 2e-07 UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus pl... 57 3e-07 UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum l... 56 3e-07 UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillu... 56 3e-07 UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria ... 56 4e-07 UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyric... 55 7e-07 UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryoc... 55 8e-07 UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodoba... 54 1e-06 UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 54 1e-06 UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros... 54 1e-06 UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellula... 54 2e-06 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 52 4e-06 UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE 52 5e-06 UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitroso... 52 7e-06 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 52 8e-06 UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax... 52 8e-06 UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl... 52 8e-06 UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. ... 52 8e-06 UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 51 1e-05 UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur... 51 1e-05 UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=D... 51 1e-05 UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gamm... 51 1e-05 UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryoc... 51 1e-05 UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichod... 51 1e-05 UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD 50 2e-05 UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoi... 50 2e-05 UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus... 50 2e-05 UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_V... 50 2e-05 UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus ... 50 3e-05 UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_... 50 3e-05 UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseud... 49 4e-05 UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelan... 49 4e-05 UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia ... 49 4e-05 UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q... 49 5e-05 UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU 49 5e-05 UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=... 49 5e-05 UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervido... 49 6e-05 UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae... 49 6e-05 UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nod... 49 7e-05 UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q... 49 7e-05 UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultu... 49 7e-05 UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobact... 48 8e-05 UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methano... 48 9e-05 UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodoba... 48 1e-04 UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=... 47 1e-04 UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C... 47 2e-04 UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candida... 47 2e-04 UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultu... 47 3e-04 UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax... 47 3e-04 UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacte... 46 4e-04 UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12... 46 5e-04 UniRef50_B4WUH8 Putative uncharacterized protein n=1 Tax=Synecho... 45 5e-04 UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synecho... 45 5e-04 UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_E... 45 5e-04 UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax... 45 6e-04 UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryoc... 45 6e-04 UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=... 45 7e-04 UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanoth... 45 0.001 UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium ... 44 0.001 UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia R... 44 0.001 UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus ... 44 0.001 UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus p... 44 0.001 UniRef50_P04137 Uncharacterized protein in transposable element ... 44 0.001 UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobi... 44 0.002 UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacte... 44 0.002 UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidob... 44 0.002 UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT... 44 0.002 UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 2... 44 0.002 UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostoca... 44 0.002 UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepI... 44 0.002 UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methano... 43 0.003 UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychro... 43 0.003 UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobact... 43 0.004 UniRef50_Q7NH53 TetR family transcriptional regulatory protein n... 42 0.004 UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichod... 42 0.004 UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolo... 42 0.004 UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_... 42 0.005 UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderi... 42 0.005 UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracas... 42 0.005 UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacteriu... 42 0.005 UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax... 42 0.006 UniRef50_D2LYX8 Tn5468, transposition protein D n=1 Tax=Bacillus... 42 0.006 UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH... 41 0.010 UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2... 41 0.011 UniRef50_Q5NZ47 Putative uncharacterized protein n=1 Tax=Aromato... 41 0.012 UniRef50_C8SCF8 Putative uncharacterized protein n=1 Tax=Ferrogl... 41 0.012 UniRef50_Q9H5H4 Zinc finger protein 768 n=9 Tax=Theria RepID=ZN7... 41 0.015 UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family pr... 41 0.015 UniRef50_B2J7N9 Putative uncharacterized protein n=1 Tax=Nostoc ... 40 0.016 UniRef50_B0CG58 Transcriptional regulator, TetR family n=1 Tax=A... 40 0.018 UniRef50_Q9V1K2 Putative uncharacterized protein n=2 Tax=Pyrococ... 40 0.020 UniRef50_UPI000186EB06 zinc finger protein 705A, putative n=1 Ta... 40 0.027 UniRef50_Q8U293 Transposase n=53 Tax=Pyrococcus RepID=Q8U293_PYRFU 40 0.029 UniRef50_UPI000051A053 PREDICTED: similar to CG3407-PA n=1 Tax=A... 40 0.030 UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=... 40 0.032 UniRef50_A3HUJ9 Transcriptional regulator, LuxR family protein n... 40 0.033 UniRef50_A7JMB8 Predicted protein n=8 Tax=Francisella RepID=A7JM... 39 0.038 UniRef50_Q0W590 Putative uncharacterized protein n=1 Tax=uncultu... 39 0.043 UniRef50_B8X8Z3 Resolvase n=1 Tax=Pectobacterium atrosepticum Re... 39 0.044 UniRef50_C3XYB0 Putative uncharacterized protein n=2 Tax=Chordat... 39 0.045 UniRef50_A2A935 PR domain zinc finger protein 16 n=35 Tax=Eutele... 39 0.045 UniRef50_B8F7J2 Putative uncharacterized protein n=1 Tax=Haemoph... 39 0.049 UniRef50_UPI0001793699 PREDICTED: similar to zinc-finger homeodo... 39 0.052 UniRef50_Q9HAZ2 PR domain zinc finger protein 16 n=26 Tax=Eutele... 39 0.052 UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_M... 39 0.053 UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parviba... 39 0.056 UniRef50_Q4S840 Chromosome 9 SCAF14710, whole genome shotgun seq... 39 0.056 UniRef50_A8NLR1 Predicted protein n=1 Tax=Coprinopsis cinerea ok... 39 0.058 UniRef50_Q03112 MDS1 and EVI1 complex locus protein EVI1 n=58 Ta... 39 0.060 UniRef50_Q6MK35 Putative transposase n=1 Tax=Bdellovibrio bacter... 39 0.063 UniRef50_B7UNR4 Predicted protein n=51 Tax=Enterobacteriaceae Re... 39 0.072 UniRef50_B9ZCS9 DNA topoisomerase type IA zn finger domain prote... 39 0.072 UniRef50_B2JXE0 Putative uncharacterized protein n=2 Tax=Burkhol... 38 0.085 UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellula... 38 0.087 UniRef50_B7UI65 Predicted protein n=52 Tax=Enterobacteriaceae Re... 38 0.089 UniRef50_UPI0001792185 PREDICTED: similar to AGAP008232-PA n=1 T... 38 0.091 >UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX Length = 91 Score = 126 bits (316), Expect = 2e-28, Method: Composition-based stats. Identities = 90/91 (98%), Positives = 90/91 (98%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 MNGVGCRATARIMGVGLNTI RHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 118 bits (296), Expect = 5e-26, Method: Composition-based stats. Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V+I CP C + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 NG G R TAR + +G NT++R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 115 bits (289), Expect = 3e-25, Method: Composition-based stats. Identities = 42/91 (46%), Positives = 60/91 (65%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASV+I CP C + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 NG G R TAR + +G+NT++R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQSE 90 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 110 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 40/90 (44%), Positives = 57/90 (63%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V + CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 MN G R TAR + + +N ++R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 108 bits (271), Expect = 4e-23, Method: Composition-based stats. Identities = 38/91 (41%), Positives = 58/91 (63%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA++ + C C+ T+ V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 MN G R TA ++ V NT+L LKNS + + Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNSRQGK 91 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 105 bits (263), Expect = 3e-22, Method: Composition-based stats. Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M +++C T + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 +NG G R TAR++GV NT+ K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 2/87 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 +SCPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSCG-SHHVVKCGR-PLGRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRATARIMGVGLNTILRHLKNSGRSR 91 RA +R++ V L T+ +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. Identities = 67/74 (90%), Positives = 68/74 (91%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 +VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG Sbjct: 1 MVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 60 Query: 77 LNTILRHLKNSGRS 90 LNTILRHL Sbjct: 61 LNTILRHLNKLRPQ 74 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 98.6 bits (244), Expect = 6e-20, Method: Composition-based stats. Identities = 40/91 (43%), Positives = 54/91 (59%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M I+CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M Sbjct: 1 MKMGDIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMM 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 +G R AR +GV L T+LRHLK+ ++ Sbjct: 61 NDGSEQRDIARKLGVSLETVLRHLKDLRLNK 91 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 98.2 bits (243), Expect = 6e-20, Method: Composition-based stats. Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 2/80 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + VV+NG G Q YLC C + ++ + + M++NG+G Sbjct: 1 MDCPYCQ-SHKVVKNGHRQ-GKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMG 58 Query: 66 CRATARIMGVGLNTILRHLK 85 RA R+ G+ NTIL ++ Sbjct: 59 FRAIERVTGISHNTILNWVR 78 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 97.8 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 3/86 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF-TYTASQPGTHQKIIDMAMNGV 64 + CP C AT+ + +NGK G Q ++C+ C + + + Q+ ++M +NG+ Sbjct: 1 MQCPYCGATE-IRKNGKR-RGKQNHICTKCERQFIDVYDPPKGYSEELKQECLEMYLNGM 58 Query: 65 GCRATARIMGVGLNTILRHLKNSGRS 90 G R R+ GV TI+ +K G Sbjct: 59 GFRPIERVKGVHHTTIIFWVKQMGEK 84 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 97.4 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 29/89 (32%), Positives = 47/89 (52%), Gaps = 3/89 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+SV+I CP C + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MN--GVGCRATARIMGVGLNTILRHLKNS 87 N G+ AR+ G+ + + K Sbjct: 60 FNEPGMMLARMARLHGIQPCQLFKWKKQY 88 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 91.7 bits (226), Expect = 7e-18, Method: Composition-based stats. Identities = 32/82 (39%), Positives = 47/82 (57%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + + C C +D VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRATARIMGVGLNTILR 82 G RAT+R + V NT+L Sbjct: 61 AQNHGKRATSRHLQVSYNTVLS 82 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 90.5 bits (223), Expect = 1e-17, Method: Composition-based stats. Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 4/87 (4%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLC--SHC-RKTWQLQFTYTASQPGTHQKIIDMA 60 ++I CP C +TD VV+NG S G QRY C C R+++ ++Y + ++I M Sbjct: 5 LAIECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMV 63 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 +NG G R TAR++ + T+ LK S Sbjct: 64 VNGSGIRDTARVLEISPITVASELKKS 90 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 88.6 bits (218), Expect = 5e-17, Method: Composition-based stats. Identities = 30/94 (31%), Positives = 47/94 (50%), Gaps = 4/94 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHC---RKTWQLQFTYTASQPGTHQKII 57 M + PSC ++D VV+ + T G QRY C + R T+ Q+ Y Q+I+ Sbjct: 1 MVLEPVLYPSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIV 59 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 +M +NG G R AR++ + T+ LK S + Sbjct: 60 EMVVNGSGTRDPARVLKISRTTVTETLKKSSSAE 93 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 88.6 bits (218), Expect = 6e-17, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 1/85 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I+CP C V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKE-KKVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRATARIMGVGLNTILRHLKNSGRS 90 R+TARI+ + T+L+ + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGRK 95 >UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriales RepID=Q116V8_TRIEI Length = 108 Score = 85.5 bits (210), Expect = 5e-16, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 38/81 (46%), Gaps = 2/81 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + +V+NG G Q YLC C + ++ + + M ++G+G Sbjct: 1 MHCPYCQ-SHKIVKNGHR-NGKQSYLCRKCGRQFRENPCPIGYSSEVKEACLKMFLSGMG 58 Query: 66 CRATARIMGVGLNTILRHLKN 86 RA R G+ N++L ++ Sbjct: 59 FRAIERATGISHNSVLNWVRR 79 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 84.0 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++CP C+++ +NG G Q Y C C + ++ TAS P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYLEGLG 58 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 R+ R +GV ++ + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 84.0 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 29/85 (34%), Positives = 39/85 (45%), Gaps = 6/85 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIIDMAMN 62 CP C+ +D V+NGK+ HQRY+C C KT+ + T G K ID +N Sbjct: 47 HCPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCLVN 104 Query: 63 GVGCRATARIMGVGLNTILRHLKNS 87 R TA+I G+ L T Sbjct: 105 KYPLRKTAKICGISLPTAFVWRHKI 129 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 83.6 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 10/88 (11%) Query: 5 SISCPSCS----ATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 +++CP C+ DG+VR G QRY C CR + + T +K + + Sbjct: 3 TMNCPRCNNAHSCKDGIVR------GRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLY 56 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSG 88 + G+G RA RI+ + T+ + +K G Sbjct: 57 LEGLGFRAIGRILNISYGTVYQWVKACG 84 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 83.2 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 51/52 (98%), Positives = 51/52 (98%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGV LNTILRHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 81.3 bits (199), Expect = 8e-15, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++CP C+++ +NG G QRY C C + ++ T+ P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYLEGLG 58 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 R+ R +GV ++ + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 79.7 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 6/85 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 +SC C TD V R+GK + G+QR+ CS C++T+QL++ Y A + + G Sbjct: 1 MSCRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVADRHE------RYSPGNAG 54 Query: 66 CRATARIMGVGLNTILRHLKNSGRS 90 R TAR++ VG + R K + R Sbjct: 55 IRDTARVLKVGCMGLTRFRKLNPRQ 79 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 75.5 bits (184), Expect = 5e-13, Method: Composition-based stats. Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 1/64 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MAS++I CP C+ +D V R+GK+ AG+ RY C C +QL +TY A P + ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 75.5 bits (184), Expect = 5e-13, Method: Composition-based stats. Identities = 27/84 (32%), Positives = 35/84 (41%), Gaps = 5/84 (5%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + T ++ ID MNG Sbjct: 52 CPLCGCI-HVVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRATARIMGVGLNTILRHLKNS 87 + R TA G+ NT Sbjct: 111 LSIRKTAVACGIHRNTAFLWRHKI 134 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 75.5 bits (184), Expect = 5e-13, Method: Composition-based stats. Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA+V++ P C+ +D V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I CP CS + +NG G Q Y+C C + + + + + +NG+G Sbjct: 11 IQCPDCSC-QHIPKNGHQP-GKQNYICVACSHQFIKPYHPQEYSDNVKRLFLRIYVNGMG 68 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 R A + GV TI+ +K++ Sbjct: 69 IRRIAWVKGVTYPTIINLIKHTRE 92 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 72.0 bits (175), Expect = 5e-12, Method: Composition-based stats. Identities = 41/65 (63%), Positives = 47/65 (72%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ-KIIDM 59 MAS+ + PSC+ T+GV RNGKSTAGHQ YLC CRK W L FTYT SQ THQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AMNGV 64 + + Sbjct: 67 TIMAL 71 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 68.6 bits (166), Expect = 5e-11, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 3/87 (3%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDMA 60 ++ I CP+C +D + +NG + G Q Y C C++ + TY KI + Sbjct: 4 TLYIKCPAC-LSDNIKKNGFKSYGKQNYKCKDCKRQFIGDHALTYQGCHSQKDSKIRYLM 62 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 + G G + A + + +L LK Sbjct: 63 VRGSGIKDIACVERISKGKVLATLKKC 89 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 67.8 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + C C +++ +V+NG S +G Q+Y C C L +KI+ Sbjct: 1 MIKETYECRECGSSN-IVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKN 86 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLKK 85 >UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 RepID=P73782_SYNY3 Length = 141 Score = 66.6 bits (161), Expect = 2e-10, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 38/84 (45%), Gaps = 2/84 (2%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC 66 CP C VV+NG G QR+ C C+ + + + + M G+ Sbjct: 6 HCPQCG-HGNVVKNGF-VKGKQRFKCKRCQYKFTNLSKERGKLLWMKLEAVLLYMGGMSM 63 Query: 67 RATARIMGVGLNTILRHLKNSGRS 90 ATA+++GV ++L +++ G + Sbjct: 64 NATAKLLGVSTQSLLNWIRDFGEA 87 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 66.6 bits (161), Expect = 2e-10, Method: Composition-based stats. Identities = 21/52 (40%), Positives = 31/52 (59%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 L Y A + ++II+MA G G R TA + +G+NT++R LKNS +S Sbjct: 22 LLTLAYEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FRR6_PSYA2 Length = 108 Score = 66.2 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 3/88 (3%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDM 59 + ISCP C + + +NG + G Q Y C C++ + TY +I M Sbjct: 3 TQIDISCPDCHSI-SLKKNGIKSYGKQNYQCKDCQRQFIGDHALTYQGCHSRIEDRIRLM 61 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNS 87 G G R A I V + +L L +S Sbjct: 62 TARGCGIRDIAVITSVSIGKVLSTLGSS 89 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 65.5 bits (158), Expect = 5e-10, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C + + NGK G QRY C C + + I + + +G Sbjct: 1 MECKGCKSNKTI-NNGK-VRGKQRYNCKSCGFNFVEVDERRGKNIDKQRMAIHLYLENMG 58 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 RA R++GV +L+ ++ +G Sbjct: 59 FRAIGRVLGVSNLAVLKWIRAAGE 82 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 65.1 bits (157), Expect = 7e-10, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 3/85 (3%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 + +CPSC +D V++NG S+ G +Y C+ CR+T+ ++I+ +N Sbjct: 67 IRPNCPSC-KSDKVIKNG-SSRGKTKYKCNVCRRTFY-DANSRRMSREQKERILKEYLNR 123 Query: 64 VGCRATARIMGVGLNTILRHLKNSG 88 + R A++ G L T+ +K G Sbjct: 124 MSMRGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes RepID=D2QCU0_9SPHI Length = 139 Score = 64.3 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 38/87 (43%), Gaps = 4/87 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA++ CP C++ D V RNG QR+ C C + + K + + Sbjct: 1 MATL--KCPKCNSVDAV-RNG-IVNQRQRFRCKKCNYNFTVGKVGKGISTYYVIKALQLY 56 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 + GV R R++G+ +++ +K Sbjct: 57 IEGVSFREIERLLGISHVSVMNWVKKY 83 >UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V2B8_9AQUI Length = 125 Score = 63.2 bits (152), Expect = 2e-09, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 36/84 (42%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I CP C ++ + GK+T G QRY C+ C + + Y + M G+ Sbjct: 14 IKCPECG-SNWCKKFGKNT-GKQRYKCNECGRHFYEGAKYHKHPEKVKLLALKMYSKGMS 71 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 A AR++ + T+ R G+ Sbjct: 72 KSAIARVLNLPYRTVARWTYEVGK 95 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 63.2 bits (152), Expect = 2e-09, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 5/80 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCR 67 CP+C + ++NG G + C C + + + T T Q I + + G+ R Sbjct: 37 CPNCG-SHHTIKNGSIHNGKPKRQCKECGRQFVINPTNKTVSDETKQLIDKLLLEGISLR 95 Query: 68 ATARIMGVGLNTILRHLKNS 87 AR+ G L+N Sbjct: 96 VIARVTGAS----WSWLQNY 111 >UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X4X6_FLAB3 Length = 169 Score = 63.2 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 36/87 (41%), Gaps = 6/87 (6%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ--KIIDMAMNGV 64 +CP C VV++G QR+LC C + + Q + K + + + G+ Sbjct: 35 TCPKCQQ-QNVVKSGIVKE-RQRFLCRSCNYYFTV--KKLGKQIDDYYVTKALQLYLEGL 90 Query: 65 GCRATARIMGVGLNTILRHLKNSGRSR 91 R RI+GV TI ++ R Sbjct: 91 SYREIERILGVSHVTISSWVRKYNIKR 117 >UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JAS9_9ALTE Length = 181 Score = 62.8 bits (151), Expect = 3e-09, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 32/81 (39%), Gaps = 4/81 (4%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNG 63 CP C + +R G S QRY C C KT+ Y + + ++ G Sbjct: 56 QCPYCQ-SKTFIRWGSSENERQRYRCKRCAKTFNALVGSPLYRMRKEELWLEYVETMRYG 114 Query: 64 VGCRATARIMGVGLNTILRHL 84 + R A++ GV L T R Sbjct: 115 LSLRKAAKVTGVSLRTAFRWR 135 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 61.6 bits (148), Expect = 7e-09, Method: Composition-based stats. Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 6/84 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQ-PGTHQKIIDMAMNG 63 CP C ++ + RNGK G QRY+C C+KT + TY + + K +NG Sbjct: 54 CPLCG-SETISRNGK-YNGKQRYICKSCKKTFTDFTNSATYKSKKTLDKWLKYAKCMING 111 Query: 64 VGCRATARIMGVGLNTILRHLKNS 87 R +A+I+ + + T Sbjct: 112 YSIRKSAKIVEINIATSFFWRHKI 135 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 61.2 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 17/49 (34%), Positives = 27/49 (55%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ 49 M ++ + C C T+ V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRACH 49 >UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDS3_NEOSM Length = 134 Score = 60.1 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 37/81 (45%), Gaps = 3/81 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C++ + ++GK+ QRY C +C + + A + + ++G+ Sbjct: 1 MHCPKCNSVRFI-KSGKAKE-KQRYKCLNCGCQFSRNEKHGA-PLRLKMHAVQLFLSGIS 57 Query: 66 CRATARIMGVGLNTILRHLKN 86 + A+I V T++R + Sbjct: 58 MNSIAKIFSVSPPTVMRWVNQ 78 >UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID=A9VV42_BACWK Length = 342 Score = 59.7 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 33/83 (39%), Gaps = 5/83 (6%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNG 63 CP C A++ VVR GK QRY C C KT+ Y + +D G Sbjct: 55 ECPHC-ASEHVVRFGK-HNNRQRYRCKCCSKTFTDTTNTVLYRTRKGNEWITFVDCMFKG 112 Query: 64 VGCRATARIMGVGLNTILRHLKN 86 R +A I+GV T+ Sbjct: 113 YSLRKSAEIVGVTWVTLFYWRHK 135 >UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marina EX-H1 RepID=C0QU68_PERMH Length = 94 Score = 58.5 bits (140), Expect = 5e-08, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 38/87 (43%), Gaps = 2/87 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M ISCP C ++ V+NGK+ G Q YLC C + + + ++ +++ Sbjct: 1 MGGKKISCPHC-ESERCVKNGKA-NGKQTYLCKECYYRFTINASKRKYPFKIRREAVNLY 58 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 G ++ + + + TI +K Sbjct: 59 KEGYTLTEISKKLNIKVQTIHHWVKKY 85 >UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=environmental samples RepID=Q64EP4_9ARCH Length = 164 Score = 58.2 bits (139), Expect = 9e-08, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 4/77 (5%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ----KIIDMAMNGVGCRATARI 72 +VR G G QR+ C C K + + I + + G RA RI Sbjct: 38 IVRYGHDKNGRQRFKCKTCGKVFVETKNTVFYNRKLSEDQIILICKLLVEKNGIRAIERI 97 Query: 73 MGVGLNTILRHLKNSGR 89 M + +TI +K+ R Sbjct: 98 MEIHRDTISDVVKDLAR 114 >UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani RepID=Q891N5_CLOTE Length = 279 Score = 57.8 bits (138), Expect = 9e-08, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 37/85 (43%), Gaps = 8/85 (9%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT-----YTASQPGTHQKIIDMAMN 62 C C ++ +V+NGK QRY+C C KT+ +T Y+ + + Sbjct: 59 CVHC-KSENIVKNGKYKE-KQRYICKDCHKTF-TNYTNSPISYSKKNISKWIEYTKCMLA 115 Query: 63 GVGCRATARIMGVGLNTILRHLKNS 87 G R +++++G+ L+T Sbjct: 116 GYSLRKSSKLVGISLSTAFYWRHKI 140 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 57.4 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 35/85 (41%), Gaps = 2/85 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++C +C + R GK G QRY C C ++Q + Y A +I + Sbjct: 1 MNCKNCDQAHCIKR-GKR-NGIQRYYCKICFTSFQENYHYKAYDSSIDTLLISLLRECCS 58 Query: 66 CRATARIMGVGLNTILRHLKNSGRS 90 AR++ + NT+L + + Sbjct: 59 VLGIARVLKISKNTVLSRMLKISKQ 83 >UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyricum RepID=B1QSI6_CLOBU Length = 336 Score = 57.4 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 8/87 (9%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSH--CRKTWQLQ----FTYTASQPGTHQKIIDMA 60 SCP C ++ GK QRY C + C KT+ + Y QP + I++ Sbjct: 34 SCPYCGC-KHFIKYGK-YQDIQRYKCKNEECGKTFSNTTFSVWKYLKYQPEKWIEFIELM 91 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 G+ ++ARI+ + T Sbjct: 92 CEGMTLESSARILKITTTTAFYWRHKI 118 >UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Methanocaldococcus infernus ME RepID=C5U8R8_9EURY Length = 100 Score = 57.0 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 43/91 (47%), Gaps = 6/91 (6%) Query: 6 ISCPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGTHQKIID-MA 60 I C C+ +D VV+ GK + Q YLC C++ + + +K++ + Sbjct: 5 IRCKYCN-SDKVVKAGKHKSEKYGVRQMYLCKKCKRRFVEESKAPRYSDSFKEKVVRSVV 63 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 G+G R R+ + TILR +K+ +++ Sbjct: 64 FEGLGIRQAGRVFKLSTTTILRWIKDFKKTK 94 >UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SD87_FERPL Length = 94 Score = 56.6 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 38/91 (41%), Gaps = 5/91 (5%) Query: 6 ISCPSCSATDGVVR---NGKSTAGHQRYLCSHCRKTWQLQF-TYTASQPGTHQKIIDMAM 61 + CP C + V + KS QRY C +C +T+ L + ++ + Sbjct: 1 MMCPHCKSIKTVKMGCYHTKSGERRQRYKCKNCGRTFVLNPIKPRNYPEEFKEMVVKAVV 60 Query: 62 -NGVGCRATARIMGVGLNTILRHLKNSGRSR 91 GVG R +RI + NT+ ++ + R Sbjct: 61 REGVGVRQASRIFKLSPNTVTAWVREFSKKR 91 >UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HSL0_PARL1 Length = 342 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 32/94 (34%), Gaps = 11/94 (11%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCS-----HCRKTW---QLQFTYTASQPGTHQKIIDM 59 CP C D +V++G+ G QR+ C C +T+ +P M Sbjct: 55 CPHCG-HDDIVKHGRDRGGRQRFRCRRSGSSGCGQTFNALTGTAFTRMRKPEKWAAYARM 113 Query: 60 AMNGVGCRATARI--MGVGLNTILRHLKNSGRSR 91 G + +G+ T R R++ Sbjct: 114 MATGFKSVDDVKTSGLGISRLTAWRWRHRLLRAQ 147 >UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillus RepID=A6CNB6_9BACI Length = 335 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 30/88 (34%), Gaps = 5/88 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQPGTHQKIIDM 59 + C C + V RNGK QRYLC C K++ G K M Sbjct: 49 KEGLGCIHCGSV-KVKRNGKYRE-RQRYLCRDCGKSFNELSNTPIAGTRYLGKWAKYFHM 106 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNS 87 + G A+ + + ++T Sbjct: 107 MVEGYTLPKIAKRLKIHISTAFYWRHKI 134 >UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria RepID=B4WSN9_9SYNE Length = 83 Score = 55.8 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 5/84 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH----QKIIDMAM 61 + CP C ++GK++ G QRY C+ CR+T+ F + + I+ + Sbjct: 1 MDCPFCDHPTP-HKHGKTSKGSQRYRCTACRRTFTETFDTLYDRRQVTSEQVKLILQTYV 59 Query: 62 NGVGCRATARIMGVGLNTILRHLK 85 G R +RI T++ ++ Sbjct: 60 EGSSLRGISRIGKRAYGTVVDIVR 83 >UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyricum RepID=C4IIL3_CLOBU Length = 325 Score = 55.1 bits (131), Expect = 7e-07, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 31/83 (37%), Gaps = 6/83 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNG 63 CP C + + + GK G QRY C C+KT+ + Y P K I++ Sbjct: 35 CPHCKNVEFI-KFGKYD-GIQRYRCKSCKKTFSYTTNSLWKYLKHPPEKWFKFIELLGEK 92 Query: 64 VGCRATARIMGVGLNTILRHLKN 86 A+ + + + T Sbjct: 93 KTLEYCAKTLKISIVTAFNWRHK 115 >UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMX8_ACAM1 Length = 134 Score = 54.7 bits (130), Expect = 8e-07, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 34/89 (38%), Gaps = 9/89 (10%) Query: 6 ISCPSCSATDGVVRNG----KSTAGHQRYLCSHCRKTW-QLQFTYTASQP---GTHQKII 57 + CP C ++ +++ G + QRY C C + + + T A I Sbjct: 1 MECPYCQ-SEKILKRGFDSLQDGTLVQRYQCKDCNRRFNERTGTPMARLRTASSVVSYAI 59 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHLKN 86 G+G R+ R G TI+R K Sbjct: 60 KARTEGMGVRSAGRTFGKSHTTIMRWEKR 88 >UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B4C9_9RHOB Length = 321 Score = 54.3 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 7/84 (8%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP----GTHQKIIDMAMN 62 +CP C A D R G++ AG QRY C C KT+ + + +Q +Q + DM + Sbjct: 49 TCPHCGAVDR-QRWGRTRAGSQRYRCQGCLKTFNGRTGSSIAQLQKLDQFYQVLKDMFSD 107 Query: 63 GVG--CRATARIMGVGLNTILRHL 84 G R AR + V +TI R Sbjct: 108 GPPRSIRRLARQLDVNKDTIWRWR 131 >UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IZ3_CLOAB Length = 142 Score = 54.3 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 36/84 (42%), Gaps = 6/84 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQ-PGTHQKIIDMAMNG 63 CP C ++ + RN K G Q Y+C C+K+ + TY + + K +NG Sbjct: 54 CPICG-SETISRNSK-YNGKQGYICKSCKKSFTDFTNSATYKSKKTLDKWLKYAKCMVNG 111 Query: 64 VGCRATARIMGVGLNTILRHLKNS 87 R +A+++ + + T Sbjct: 112 YSIRKSAKVVEINIATSFFWRHKI 135 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 53.9 bits (128), Expect = 1e-06, Method: Composition-based stats. Identities = 12/35 (34%), Positives = 21/35 (60%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLK 85 + + M++NG+G RA R+ G+ NTIL ++ Sbjct: 20 DVKELCVKMSLNGMGFRAIERVTGISHNTILNWVR 54 >UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellular organisms RepID=B0ABB1_9CLOT Length = 454 Score = 53.5 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 6/81 (7%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIID 58 + CP C + D + +NGK T QRY+C +CR T+ + + T T K Sbjct: 136 KNDLKCPKCGSFD-LNKNGK-TNQRQRYICKNCRTTFDERSFSPLSNTKLSLDTWLKYCQ 193 Query: 59 MAMNGVGCRATARIMGVGLNT 79 + G + A+ +GV + T Sbjct: 194 FMIEGGTIKYCAQKVGVSIPT 214 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 53.1 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 19/42 (45%), Positives = 26/42 (61%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 MA + + CP + T V+RNG +T+G Q Y C C KT+QL Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQLS 42 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 52.4 bits (124), Expect = 4e-06, Method: Composition-based stats. Identities = 28/85 (32%), Positives = 37/85 (43%), Gaps = 3/85 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDMAMNG 63 ISCP CS+ + +NGK Q YLC C + + TY Q+I+ M + G Sbjct: 7 ISCPKCSSCQ-IKKNGKKPNNKQNYLCKCCGRQFIGDHALTYRGCHSKISQRILIMLVRG 65 Query: 64 VGCRATARIMGVGLNTILRHLKNSG 88 G R A I V +L L N Sbjct: 66 CGIRDVAAIEKVSCTKVLSVLLNVR 90 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 52.0 bits (123), Expect = 5e-06, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 6/77 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIIDMAMN 62 CP C D V +NGKS G QRY+C CR ++ F+ T K ++ + Sbjct: 52 ECPKCQCKD-VNKNGKS-NGRQRYICKRCRTSFDEFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRATARIMGVGLNT 79 G+ R A +GVG+ T Sbjct: 110 GLSIRKCAEEVGVGVKT 126 >UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitrosomonas europaea RepID=Q81ZP0_NITEU Length = 323 Score = 51.6 bits (122), Expect = 7e-06, Method: Composition-based stats. Identities = 21/86 (24%), Positives = 31/86 (36%), Gaps = 5/86 (5%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ---LQFTYTASQPGTHQKIID 58 +S CP C ++ R G AG QR+ C C+ T+ Sbjct: 43 SSFEPICPVCQ-SNHFYRWGY-QAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA 100 Query: 59 MAMNGVGCRATARIMGVGLNTILRHL 84 + G+ RA+AR + NT R Sbjct: 101 ALIEGLTVRASARQCRIDKNTSFRWR 126 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 51.6 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 1/90 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M C C+ + ++ G ++ QRY C C+K + +++Y A Q T+ I + Sbjct: 1 MNKRRNRCIHCNYS-YCIKAGITSQNKQRYQCKKCKKKFIGKYSYRAYQKSTNHNIQQLI 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 GVG R +R++ V T+L+ + Sbjct: 60 KEGVGIRGISRLLNVSKTTVLKKILKIASK 89 >UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AFFE Length = 357 Score = 51.6 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 5/78 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAMNG 63 CP C + +NGK HQRY+C C K++ F ++ I++ + Sbjct: 50 CPICGSV-HFKKNGKDKNRHQRYICLDCHKSFSDRTNTLFYWSHFTLDQWLHFIELELYK 108 Query: 64 VGCRATARIMGVGLNTIL 81 + A+++ T Sbjct: 109 MPLEGEAQVLETSKTTCF 126 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 51.6 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 1/79 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C +D V +NG + Q + C C K W T + + + V Sbjct: 1 MRCTHCG-SDLVKKNGYTRHEKQNFRCLECGKQWSENKEAKIINEQTKELVRKALLEKVS 59 Query: 66 CRATARIMGVGLNTILRHL 84 RI V + +L + Sbjct: 60 LNGICRIFDVSMPWLLDFI 78 >UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6ARX2_9BACT Length = 133 Score = 51.6 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 31/85 (36%), Gaps = 5/85 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDM 59 S CP C + V + G+ G QRY C CR+ + + ++ Sbjct: 47 SEHPRCPHCQD-EHVAKWGR-VKGLQRYRCEACRRQFTPLTNTPLSGLRKREKWGAYLEA 104 Query: 60 AMNGVGCRATARIMGVGLNTILRHL 84 +G+ R A+ +GV T Sbjct: 105 MEDGLSVRKAAQRIGVNHKTTFLWR 129 >UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IP3_CLOAB Length = 171 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 32/90 (35%), Gaps = 11/90 (12%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTY-----TASQPGTHQKII 57 V + C + RNGK QRY+C C+KT+ FTY + + Sbjct: 50 KVYLHC----KLEMFSRNGKHDE-KQRYVCKTCKKTF-TDFTYSPISSSKKPLDKWLQYA 103 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHLKNS 87 + G R A+ + + + T Sbjct: 104 KCMIVGYSIRKCAKTVNINIATSFFWRHKI 133 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 2/80 (2%) Query: 7 SCPSCS-ATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + PSC G+V+NGK+ AG QR+LC C + +T+ ID ++G Sbjct: 5 NRPSCDMCGHGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDIRHFK-IFIDWILSGES 63 Query: 66 CRATARIMGVGLNTILRHLK 85 A+ +GV T+ R K Sbjct: 64 ADHLAKRLGVTRRTLTRWFK 83 >UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W2G4_DYAFD Length = 388 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 32/79 (40%), Gaps = 6/79 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I C C+ DG+++ G G QRYLC C + + + +K Sbjct: 2 IECVKCAQVDGIMKAGY-VRGKQRYLCKWCNYYFTHAEKDDSIESLVKRKRHQTT----- 55 Query: 66 CRATARIMGVGLNTILRHL 84 A+ +GV +T+ R L Sbjct: 56 IIDIAKSLGVSNSTVSRAL 74 >UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gammaproteobacteria RepID=A1SXI4_PSYIN Length = 319 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 32/92 (34%), Gaps = 7/92 (7%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAM 61 S CP C + GK+ + QRY C C KT+ + K + Sbjct: 52 SPQCPHCHCA-HFTKWGKAGS-VQRYKCFSCHKTFNNKTKTPLAKLHRCELWDKYAECMS 109 Query: 62 NGVGCRATARIMGVGLNT--ILRHLKNSGRSR 91 + R A + + L T + RH +S Sbjct: 110 LKLTLREAAAVCNINLKTSFLWRHRFLMAQSE 141 >UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZNT7_ACAM1 Length = 188 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 9/89 (10%) Query: 6 ISCPSCSATDGVVRNG----KSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + C C ++ VV+NG K+ Q +LC C + + + ++ T + I MA+ Sbjct: 1 MQCIHCQ-SENVVKNGTKTLKTAQVVQYFLCKDCGRRFNERSGTPMARLRTPVETISMAI 59 Query: 62 N----GVGCRATARIMGVGLNTILRHLKN 86 N G+G RA R++ N+I+ K Sbjct: 60 NARTEGLGIRAAGRVLRKSPNSIILWEKR 88 >UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZA4_TRIEI Length = 469 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 2/37 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 + CP+C +T + +NG+ QRY C C + + +Q Sbjct: 1 MKCPTCGST-SLRKNGR-PNNRQRYRCKDCGRQFMVQ 35 >UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD Length = 317 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 31/87 (35%), Gaps = 5/87 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKII 57 M + SCP C +++ ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHL 84 + + R A+ GV NT Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWR 122 >UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoides sp. BAV1 RepID=A5FST1_DEHSB Length = 319 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 9/92 (9%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA--SQPGTHQKIIDMAMNG 63 I C C + R G S A QR+LC+ C T+ + P + M G Sbjct: 8 IECKYCG-SRHTRRYGHSRAQKQRWLCNDCCHTFVETSAQPGMRTPPEQIGAAVSMFYEG 66 Query: 64 VGCRATAR----IMGVGL--NTILRHLKNSGR 89 + A R I + T+ + + Sbjct: 67 LSLSAICRQMKQIHNISPSDGTVYGWITKYSK 98 >UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus communis RepID=B9TDK1_RICCO Length = 321 Score = 50.1 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 29/80 (36%), Gaps = 5/80 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKIIDMAMNGV 64 CP C R G++ +G QR+ C HC ++ + + + Sbjct: 52 CPHCGCARK-HRCGQA-SGLQRFRCLHCGRSHNALTKTPLARLRKKECWLPYLQCVLESR 109 Query: 65 GCRATARIMGVGLNTILRHL 84 R A+I+GV T R Sbjct: 110 TVRDAAQIVGVHRTTSFRWR 129 >UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_VIBFM Length = 489 Score = 50.1 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 34/92 (36%), Gaps = 12/92 (13%) Query: 9 PSCSATD------GVVRNGKS------TAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKI 56 PSC+ ++ V+ + + + QRY C C T+ +++ + QK+ Sbjct: 79 PSCNNSECEHFGFDVLTHRELYHAFGYSGDRQRYRCKSCASTFVDKWSGENQKSLIQQKL 138 Query: 57 IDMAMNGVGCRATARIMGVGLNTILRHLKNSG 88 + G R R + + T H+ Sbjct: 139 LGFLFTGYSVREICRRLHINPKTFYDHINQIA 170 >UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L9I6_MAGSM Length = 89 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 6/90 (6%) Query: 1 MAS--VSISCPSCSATDGVVRNGKSTAGHQRYLCSH--CRKT-WQLQFTYTASQPGTHQK 55 MA+ V + CP C + D V++ GK G QR+ C+ C +T + + ++ Sbjct: 1 MATMEVHVHCPDCGSLD-VIKFGKDRHGRQRFRCNDHFCDRTIFMMDDPDWWRFEEVKKQ 59 Query: 56 IIDMAMNGVGCRATARIMGVGLNTILRHLK 85 I ++G G TA +G+ + R K Sbjct: 60 IALHLLSGNGIHQTAHNLGLHPEFVNRMAK 89 >UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_LACF3 Length = 428 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 33/98 (33%), Gaps = 21/98 (21%) Query: 8 CPSCSATDGVVRNGKSTA-----------------GHQRYLCSHCRKTW--QLQFTYTAS 48 CP C D ++NG S QR C +C+ ++ + Sbjct: 45 CPHCGFADTFIKNGHSYQTIKYLSINESCPTMLRIDKQRLRCKNCQDSFMAKTNVVDKYC 104 Query: 49 QP--GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 K + M + V + ++ GV +TI R L Sbjct: 105 SIAKAVKHKALTMLESNVSQKDVSKFTGVSPSTIGRLL 142 >UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JSC1_YERPY Length = 53 Score = 49.3 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 16/37 (43%), Positives = 21/37 (56%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRK 37 MA + CP C D V ++G +GHQRY C H +K Sbjct: 1 MAKIDEKCPFCERKDLVKKHGYGKSGHQRYRCPHAKK 37 >UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelandii RepID=Q9AMR3_AZOVI Length = 214 Score = 48.9 bits (115), Expect = 4e-05, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 31/87 (35%), Gaps = 5/87 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKII 57 M + SCP C +++ ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHL 84 + + R A+ GV NT Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWR 122 >UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia RepID=B0K4X0_THEPX Length = 343 Score = 48.9 bits (115), Expect = 4e-05, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 4/60 (6%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 V + CP C+ T + GK G+Q+YLC C + P K + G Sbjct: 5 VPLKCPKCNNTHLFYKYGKDKDGYQKYLCRKCYHQF----APDKPSPKKTSKYPRCPVCG 60 >UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q8PSY9_METMA Length = 146 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 37/88 (42%), Gaps = 6/88 (6%) Query: 5 SISCPSCSATDG--VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM--- 59 + C +G +++ GK GHQRY C HC K + + ++ I M Sbjct: 12 NPKCSYYLKAEGRAIIKRGKYKTGHQRYYCKHCEKFFMDTIGTAIYRKHLSKEEIRMIYR 71 Query: 60 -AMNGVGCRATARIMGVGLNTILRHLKN 86 + G R+ RI G +TI LK+ Sbjct: 72 LFLEKNGIRSIERITGHHRDTISNLLKD 99 >UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU Length = 507 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 13/82 (15%), Positives = 30/82 (36%), Gaps = 4/82 (4%) Query: 7 SCPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 C + + ++ G QRY C C+ T+ +++ + + ++ + Sbjct: 103 DCANFGLSVHTHKHLYHAFGYSGDRQRYRCKSCQSTFVDKWSGANKKLQFQENLMGLLFT 162 Query: 63 GVGCRATARIMGVGLNTILRHL 84 G R R + + T H+ Sbjct: 163 GYSVREICRKLAINPKTFYDHV 184 >UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WF86_9ACTN Length = 243 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 26/83 (31%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTH-QKIIDMAMNG 63 CP C + +GK+ G +RY C C + A P +I ++ + Sbjct: 54 CPDCGSVRP-RLDGKAPNGARRYRCRECGCRFSALTGTIFADAKLPLHKIMRIAEVMCHS 112 Query: 64 VGCRATARIMGVGLNTILRHLKN 86 R + V T Sbjct: 113 ASLRLMELVAEVSHGTAFLWRHK 135 >UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ99_FERNB Length = 261 Score = 48.5 bits (114), Expect = 6e-05, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 1/48 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS 48 M ++ + CP C +++ + +NG +Q + C C++ ++L FT Sbjct: 1 MTNIQLKCPHCGSSNFI-KNGHDKFKNQIFFCKDCKRYFKLSFTKKHK 47 >UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae BGR1 RepID=C5A9A4_BURGB Length = 284 Score = 48.5 bits (114), Expect = 6e-05, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 40/87 (45%), Gaps = 9/87 (10%) Query: 14 TDGVVRNGKSTAGH-----QRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNGVG 65 D +NG H RY C C K + Q++ + +P + ++ MA++ VG Sbjct: 21 ADFYRKNGYRRTKHNGQPVPRYQCKACGKNFCATQVKPIHGQHRPDLNTQVFKMAVSRVG 80 Query: 66 CRATARIMGVGLNTILRHLK-NSGRSR 91 R A ++ G TI R ++ +G SR Sbjct: 81 IRRMATVLDCGRETIQRKIEYLAGESR 107 >UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ24_FERNB Length = 316 Score = 48.5 bits (114), Expect = 7e-05, Method: Composition-based stats. Identities = 14/44 (31%), Positives = 27/44 (61%), Gaps = 1/44 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT 44 M + ++SCP C +T + +NG G+Q++LC C +++L + Sbjct: 1 MNNSTLSCPKCGST-SLYKNGHDKYGNQQFLCKLCHHSFKLSHS 43 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 48.5 bits (114), Expect = 7e-05, Method: Composition-based stats. Identities = 12/54 (22%), Positives = 24/54 (44%), Gaps = 3/54 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ 54 M +I CP C ++ + + G +Q+Y C C + + +S+P + Sbjct: 1 MNKTNIKCPRCH-SEKLYKFGFDKQANQKYQCKECGRQFAPD--SVSSRPKSKY 51 >UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4F0_UNCMA Length = 141 Score = 48.5 bits (114), Expect = 7e-05, Method: Composition-based stats. Identities = 27/89 (30%), Positives = 41/89 (46%), Gaps = 6/89 (6%) Query: 7 SCPSCSATDG--VVRNGKSTAGHQRYLCSHCRKTWQLQF-TYTASQPGTHQKII---DMA 60 SC ++G VV+ G S AGHQ + C HC + + T + T + +I + Sbjct: 17 SCEFYLKSEGSRVVKKGFSRAGHQVFQCRHCGRHFCETINTPMYGRRITREDVILIGKLL 76 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGR 89 G RA RI G +T++R K+ R Sbjct: 77 NERNGIRAIERITGHHRDTVMRVAKDLAR 105 >UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MXF0_9DELT Length = 512 Score = 48.1 bits (113), Expect = 8e-05, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 2/68 (2%) Query: 19 RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH--QKIIDMAMNGVGCRATARIMGVG 76 R G++ AG +RY C C +T+ + TA Q TH +KI +N + + Sbjct: 43 RFGETAAGARRYRCKLCSRTFSINGKPTARQRDTHKNKKIYMHLVNKSPFKRICEQAEIS 102 Query: 77 LNTILRHL 84 T+ R + Sbjct: 103 PATLYRKI 110 >UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FRB5_METHJ Length = 138 Score = 48.1 bits (113), Expect = 9e-05, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 6/91 (6%) Query: 5 SISCPSCSATDG--VVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQPGTHQKII-D 58 + C DG + +NG ++AG+Q+Y C HCR+ + Y + P T II Sbjct: 14 NPDCTYFQIEDGKNITKNGHNSAGNQQYYCHHCRRFFIETKNTPLYDSRLPRTAVLIIAK 73 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 + R +R+ G +TI R+ G Sbjct: 74 HSTEKTSIRGVSRVTGHHRDTISRYYHLIGE 104 >UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WNK0_RHOS5 Length = 481 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 1/67 (1%) Query: 19 RNGKSTAGHQRYLCSHCRKTW-QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGL 77 R GK+ G R+ C C KT+ + + ++ ++DM N + +RI G+ Sbjct: 132 RFGKTKGGDARWRCKGCGKTFSVGKPARRHKRSDKNRLVLDMLCNDLSFAKMSRISGLAY 191 Query: 78 NTILRHL 84 I R + Sbjct: 192 RDIYRRV 198 >UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=Enterococcus faecium RepID=Q3Y3Y2_ENTFC Length = 401 Score = 47.4 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 26/100 (26%), Positives = 41/100 (41%), Gaps = 25/100 (25%) Query: 8 CPSCS-ATDGVVRNGK-------STAG---------HQRYLCSHCRKTW------QLQFT 44 CP C +T +V+NGK + +G QRYLC C+K + F Sbjct: 47 CPCCKDSTKQIVKNGKKISMILLNRSGNKRTYLRLKKQRYLCRACKKYFTARTYLVTPFC 106 Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 + + Q H KI++ +A + V + T+ R L Sbjct: 107 FISKQ--IHYKILEELTERQSIKAIGKHCDVSVTTVQRTL 144 >UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C9BRL5_ENTFC Length = 433 Score = 47.4 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 33/101 (32%), Gaps = 25/101 (24%) Query: 8 CPSCSATDG---VVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTAS 48 CP C + +V+NGK + QRY C C + TY Sbjct: 47 CPLCKQMNHEGMIVKNGKKKSLIQLNKCANQLTYLALAKQRYHCRGCHTYFTAN-TYIVD 105 Query: 49 Q-----PGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 + KI++ A+ GV +T+ R L Sbjct: 106 RNCFIAKQVRYKILEELTEKQAMTTIAKHCGVSWSTVSRTL 146 >UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L491_AMOA5 Length = 119 Score = 47.0 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 9/49 (18%), Positives = 21/49 (42%) Query: 42 QFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 Q T + + + + G+G RA ++ + T+ + ++ SG Sbjct: 42 QPKSGVKPIQTKRLALQLYLEGLGFRAIGNLLQISYGTVYQWIEASGEQ 90 >UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultured archaeon RepID=D1JAI8_9ARCH Length = 192 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 4/71 (5%) Query: 20 NGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNGVGCRATARIMGV 75 GK Q C C K + + + G I + G G RATARIMG+ Sbjct: 36 YGKGEKRTQMLKCKVCGKRFSIHKGTPLFNLKADEGAFYGTIAHLVEGNGIRATARIMGI 95 Query: 76 GLNTILRHLKN 86 +T+ + LK Sbjct: 96 NKDTVSKWLKK 106 >UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C348D8 Length = 467 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 37/89 (41%), Gaps = 3/89 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M ++ +CPSC +T+ + + G + G RY C +C + L+ K+I+ Sbjct: 67 MKNIEKACPSCYSTENI-KYGTTAIGTVRYQCKNCNNVYSLKNLNKFDDVD--NKLIESL 123 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGR 89 + + + + + R L+N Sbjct: 124 LKNTKVSTIFKELKITPASFYRRLENINE 152 >UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacteriaceae RepID=A4W908_ENT38 Length = 414 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 12/33 (36%), Positives = 18/33 (54%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 CP+C D ++RNG G QR+ C C ++ Sbjct: 68 CPTCGQGDALIRNGCGLRGAQRWRCRTCNSSFT 100 >UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5CBF Length = 184 Score = 45.8 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 38/106 (35%), Gaps = 21/106 (19%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQF- 43 + + +CP C + V+NG T+ QR+LC C ++ L+ Sbjct: 42 LTKDTCACPHCH-SQTTVKNGFKTSKVRYLPFQNYPIIIALKKQRFLCKECHHSFTLETP 100 Query: 44 ---TYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKN 86 Y + +++ + A+ + + T+ R LK Sbjct: 101 IVKKYASISQTLKLSVLNSLQENMSLSLIAKQHRISIPTVQRILKQ 146 >UniRef50_B4WUH8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WUH8_9SYNE Length = 76 Score = 45.4 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 27/56 (48%), Gaps = 3/56 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT-YTASQPGTHQKIIDMA 60 ++CP C ++ + +NG G Q Y+C+ CR+ + ++ ++ + M Sbjct: 1 MACPECQ-SEHIRKNGH-KRGKQNYICADCRRQFVENPKEHSGYSDEERKQCLSMY 54 >UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WST6_9SYNE Length = 81 Score = 45.4 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 28/71 (39%), Gaps = 1/71 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + P+C + VV+NGK G Q + C +C + + T I D+ Sbjct: 1 MLDHQPTRPACHSKQ-VVKNGKIHNGKQNHRCKNCGRQFVKDPQQKRISDATKALIDDLL 59 Query: 61 MNGVGCRATAR 71 + + ++ Sbjct: 60 LERLSMNNPSK 70 >UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_ENTFA Length = 446 Score = 45.4 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 30/69 (43%), Gaps = 4/69 (5%) Query: 26 GHQRYLCSHCRKTWQLQFTYTASQ----PGTHQKIIDMAMNGVGCRATARIMGVGLNTIL 81 QR+ C HC KT+ + + + + Q I+++ + AR+ + T++ Sbjct: 85 NKQRFKCKHCGKTFLAEDSVSDRRCSIARRVKQAILELLSEPISMSLIARMKHISPTTVI 144 Query: 82 RHLKNSGRS 90 R L++ Sbjct: 145 RILRSLRPK 153 >UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax=Enterococcus RepID=Q3Y1C3_ENTFC Length = 431 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 28/106 (26%), Positives = 39/106 (36%), Gaps = 27/106 (25%) Query: 7 SCPSCSAT--DG-----VVRNGKSTA----------------GHQRYLCSHCRKTWQLQF 43 +C +C +T DG VV+NGK QRY C +CR W Q Sbjct: 44 TCRNCGSTVVDGNGKVIVVKNGKKETIVRFEQYNHMPLVMRLKKQRYTCKNCRTHWTTQS 103 Query: 44 TYTASQPGT----HQKIIDMAMNGVGCRATARIMGVGLNTILRHLK 85 + + KI + V A+ V L T++R LK Sbjct: 104 YFVQPRHSIANHVRYKIASLLTEKVSLSFIAKNCQVSLTTVIRTLK 149 >UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMW7_ACAM1 Length = 75 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 27/58 (46%), Gaps = 1/58 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIID 58 M+ + + P C + + GK++ G QRY C C++T+ F + ++I Sbjct: 1 MSYLLMQSPLCD-HPKIHKPGKTSKGSQRYRCLDCQQTFSETFDTLYYRLQISSEMIQ 57 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 45.1 bits (105), Expect = 7e-04, Method: Composition-based stats. Identities = 27/98 (27%), Positives = 39/98 (39%), Gaps = 17/98 (17%) Query: 4 VSISCPSCSATDGVV--RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + CP CS+T V RNG G Q + C CR + A + Q II A+ Sbjct: 71 IVPECPKCSSTVRVKAGRNG----GRQMFQCKQCRTRYV-SRGPGARKTRYSQDIISAAL 125 Query: 62 N----GVGCRATARIMG------VGLNTILRHLKNSGR 89 N G+ R TA + + NTI+ + + Sbjct: 126 NKVMSGMSYRKTAEEVNTAHGRDLSPNTIMFWTRKYTQ 163 >UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7J3_CYAP7 Length = 354 Score = 44.7 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 + I CP C + +NG + AG QRY C C + + Sbjct: 2 ILIQCPKC-KSKNYRKNG-TIAGKQRYQCKSCGRNFL 36 >UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium jeikeium K411 RepID=Q4JT92_CORJK Length = 165 Score = 44.3 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 29/85 (34%), Gaps = 2/85 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + SCP C + +NG ++ R+ C+HC ++ T I A Sbjct: 1 MTTNRPSCPLCG--NNTKKNGTTSKSTTRWRCTHCGHSFTRNTQTHNKNTATMALFIQWA 58 Query: 61 MNGVGCRATARIMGVGLNTILRHLK 85 A GV T+ + Sbjct: 59 TGTQSLTTFAAHHGVTRQTMHHRFR 83 >UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia RepID=B2A0V7_NATTJ Length = 353 Score = 44.3 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 13/59 (22%), Positives = 21/59 (35%), Gaps = 5/59 (8%) Query: 6 ISCPSCS--ATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA---SQPGTHQKIIDM 59 + CP C+ +D + G GHQ+Y C C + + P +K Sbjct: 4 VVCPRCNNNCSDKFYKFGFDNHGHQKYQCQECFSQFAPKTLSKGGDKRGPNMPRKYPSC 62 >UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVZ4_9ACTO Length = 225 Score = 44.3 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 9/72 (12%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPG-------THQKIID 58 + CP+C+ + RNGK+++G QR+ C C ++ + +A + + Q+ D Sbjct: 41 MKCPACNT--PLKRNGKTSSGSQRWRCKECGRSKVGKIDNSAKELNRFLSWLLSRQRQKD 98 Query: 59 MAMNGVGCRATA 70 M G R A Sbjct: 99 MPGAGRTFRRHA 110 >UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF9_FERPL Length = 357 Score = 44.3 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 35/92 (38%), Gaps = 11/92 (11%) Query: 7 SCPSCSATDGVVRNG--KSTAG-HQRYLCSHCRKTW--QLQFTYTASQPGTHQKIIDMAM 61 +C +C D V++ G + +G Q Y C C K + + F + +D+ Sbjct: 85 TCKNCGRDDEVIKKGIRYNKSGPVQMYYCKRCGKKFSARTGFGGMKKRAEAIVAALDLYF 144 Query: 62 NGVGCRATARIMGVGLN------TILRHLKNS 87 G+ R A+ + N T+ +K Sbjct: 145 RGLSLRQVAQHLKASYNVEVCHKTVHNWIKRY 176 >UniRef50_P04137 Uncharacterized protein in transposable element ISH50 n=11 Tax=Halobacteriaceae RepID=YIH50_HALSA Length = 294 Score = 43.9 bits (102), Expect = 0.001, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 7/90 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAM 61 + CPSC + V+R G S QRYLC C +T+ Q F ++A + + Sbjct: 26 VYCPSC-RAESVIRYG-SYRVFQRYLCKDCDRTFNDQTGTVFEHSAVALRKWFLAVYTYI 83 Query: 62 N-GVGCRATARIMGVGLNTILRHLKNSGRS 90 R + V T+ R ++ R+ Sbjct: 84 RLNTSIRQLDAEIDVSYKTVYRRVQRFLRA 113 >UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobiales RepID=Q07NT9_RHOP5 Length = 577 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 31/86 (36%), Gaps = 11/86 (12%) Query: 7 SCP--SCSATDGVV--------RNGKSTAGHQRYLCSHCRKTW-QLQFTYTASQPGTHQK 55 CP SC + + R+G S G RY C CRKT+ + ++ Sbjct: 103 HCPDDSCENYNKLFDSHPKSYFRHGTSAIGAPRYRCKACRKTFSVRTGHSRHRKSHENKT 162 Query: 56 IIDMAMNGVGCRATARIMGVGLNTIL 81 + + ++ V +I + + Sbjct: 163 VFQLLVSKVPITKIGQITDLSPAAVY 188 >UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacteria RepID=Q5LYW0_STRT1 Length = 448 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 21/103 (20%), Positives = 36/103 (34%), Gaps = 19/103 (18%) Query: 1 MASVSISCPSCSAT---DGVVRNGK----STAG--------HQRYLCSHCRKTWQLQFTY 45 + +++ SCP C +N K AG +R+ C CR+ + + Sbjct: 15 LITLAPSCPHCQGKMIKYDFQKNSKISLLEQAGTPTLLRLKKRRFQCKSCRRVTVAETSI 74 Query: 46 TASQPGT----HQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 QK+ + V AR + V +T+ R L Sbjct: 75 VEKNCQISNLVRQKVTQLLTEKVSLTDIARRLRVSTSTVYRKL 117 >UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BSX6_9BIFI Length = 352 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 5/78 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIIDMAMNG 63 C C + ++R G+ G QR+ C +C +T+ + + G + ++ ++ Sbjct: 55 CVRCGSI-RIIRKGRGRDGSQRWKCMNCNRTFGVRTNRVMGMSKLKAGVWMRFLECFVDC 113 Query: 64 VGCRATARIMGVGLNTIL 81 + R A+ GV L T Sbjct: 114 LSLRKCAQRCGVCLKTAF 131 >UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT4_9LACT Length = 426 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 34/100 (34%), Gaps = 21/100 (21%) Query: 7 SCPSCSATDGVVRNGKSTAGH----------------QRYLCSHCRKTWQLQFTYTASQP 50 SCP C ++ V+++ QR++C CRKTW Sbjct: 46 SCPYC-SSKNVIKHSPMEHKIRIPHLYGNKTLLELKVQRFICKDCRKTWVTDCPLVPKNS 104 Query: 51 GTHQ----KIIDMAMNGVGCRATARIMGVGLNTILRHLKN 86 +I+ + A+++ + T+ R +K Sbjct: 105 NISYDLACQIMLYLKENFSRKTIAKLLSISDKTVERVMKK 144 >UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 238 RepID=B5K5I7_9RHOB Length = 319 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 32/87 (36%), Gaps = 5/87 (5%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW---QLQFTYTASQPGTHQKIIDMAMNG 63 +CP C+A V+R + G +RY C C KT+ + +G Sbjct: 50 NCPHCAAGGAVIRG--RSNGLKRYFCKICSKTFNALTGTPLARLRHKDCWTEFAGSLSDG 107 Query: 64 VGCRATARIMGVGLNTILRHLKNSGRS 90 + +A GV +T R R+ Sbjct: 108 DTVKTSAARCGVASSTAFRWRHRFLRA 134 >UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostocaceae RepID=B2J098_NOSP7 Length = 133 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 10/35 (28%), Positives = 22/35 (62%), Gaps = 1/35 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 + CP C+ + + ++G+ G QRY+C +C + ++ Sbjct: 34 MECPKCN-SHLLGKHGREPDGVQRYICKNCSRIFR 67 >UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepID=Q70JT0_MICAE Length = 112 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 12/47 (25%), Positives = 20/47 (42%), Gaps = 1/47 (2%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH 53 +CPSC + ++NG G + C C + + + T P T Sbjct: 34 TCPSCG-SHHTIKNGYLPKGKPKRHCQECGQPFVINPTNKTISPDTK 79 >UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PWW0_METMA Length = 155 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 38/94 (40%), Gaps = 14/94 (14%) Query: 5 SISCPS--CS-----ATDGVVRNGKSTAGHQR---YLCSHCRKTW---QLQFTYTASQPG 51 + CP+ C + ++ NG ++R Y+C C + + F + Sbjct: 12 DVFCPNKDCKLYGITGKENIIGNGTYEIKNKRVRKYICRECGRVFNDRTGTFFDNVRKDE 71 Query: 52 TH-QKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 + + I MA+ G+ +A + ++ V T+ L Sbjct: 72 SDIKLAIKMAIKGMSIQAISDVLEVQPATVSNWL 105 >UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VPB4_9FLAO Length = 343 Score = 42.7 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 26/83 (31%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQPGTHQKIIDMAMNGV 64 CP C + VR G G QRY C C +++ + + + + + Sbjct: 51 CPHC-LHEKYVRFG-VDKGSQRYKCKSCNRSFTEYTGTWMAGLQRKDMISSYLSLMVQEK 108 Query: 65 GCRATARIMGVGLNTILRHLKNS 87 + +G+ T Sbjct: 109 SLDKISSELGINKKTAFDWRHKI 131 >UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MKY8_9DELT Length = 632 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 21 GKSTAGHQRYLCSHCRKTWQLQFTY--TASQPGTHQKIIDMAMNGVGCRATARIMGVGLN 78 G + AG QR+ C C KT+ + + G ++ + V R AR VG Sbjct: 123 GHTKAGSQRFRCKICHKTFSIPLAANLRQRKKGKSTEVFRLLTCQVAIRKMARNARVGKE 182 Query: 79 TILRHLKNSGRS 90 T+ R++ R Sbjct: 183 TVHRYIHLIHRQ 194 >UniRef50_Q7NH53 TetR family transcriptional regulatory protein n=1 Tax=Gloeobacter violaceus RepID=Q7NH53_GLOVI Length = 227 Score = 42.4 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 2/36 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL 41 + CP C ++ + RNG QR LC C + + L Sbjct: 188 MKCPRCG-SERLSRNGHRH-DRQRLLCKDCSRQFLL 221 >UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VF2_TRIEI Length = 59 Score = 42.4 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 1/59 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM 59 M+ + CPSC ++ +V+NG Q+Y C +C++ + T + I + Sbjct: 1 MSIHKLICPSCG-SNHIVKNGTIHNKKQKYQCQNCQRQFVENSQRDYISNETKELIDKL 58 >UniRef50_D2PJ85 Putative uncharacterized protein n=5 Tax=Sulfolobus islandicus RepID=D2PJ85_SULIS Length = 82 Score = 42.4 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 2/39 (5%) Query: 27 HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 QRYLC C + + Y ++ + M NGV Sbjct: 5 RQRYLCRDCGRYFLGDAIY--HSRELREEALKMYSNGVS 41 >UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_FUSNN Length = 428 Score = 42.4 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 17/101 (16%), Positives = 34/101 (33%), Gaps = 21/101 (20%) Query: 7 SCPSCSATDGVVRNGKSTAGH----------------QRYLCSHCRKTWQLQFTYTASQP 50 +CP C ++ +V+NG QRY+C C+KT+ + Sbjct: 52 TCPHC-SSKNIVKNGSRHRKIKYIPIQNHNIELELTVQRYICKDCKKTFSPSTNIVSDNS 110 Query: 51 GT----HQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 I + + A+ + + ++ R + N Sbjct: 111 SISNNLKYAIALELQKNISLTSIAKRYNISIPSVQRIMDNC 151 >UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderiaceae RepID=B5S3H3_RALSO Length = 460 Score = 42.4 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 14/94 (14%), Positives = 31/94 (32%), Gaps = 9/94 (9%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQPGTHQKIID 58 ++ + SCP C + +G + C C + + + Sbjct: 345 STHAASCPWCGSDQTKYHPAPRPSGLPGFRCRACLAYFTRVSNTPLVHPMARAYASRFVP 404 Query: 59 MA---MNGVGCRATARIMGVGLNTILRHLKNSGR 89 M G G AR +G+ + T+ +++ + Sbjct: 405 MLGWHETGAG---AARELGIAMGTLHTWVRSWRQ 435 >UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracasei subsp. paracasei ATCC 25302 RepID=C2FEQ0_LACPA Length = 425 Score = 42.4 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 20/96 (20%), Positives = 29/96 (30%), Gaps = 20/96 (20%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 CP+C +VR G QR+ C CR +Q + Y + Sbjct: 48 HCPACGFASKLVRYGFERTCVLMPSYSYRPTYMKLSRQRFRCELCRSVFQSETDYVRPRS 107 Query: 51 GTHQKIIDM----AMNGVGCRATARIMGVGLNTILR 82 + M A + AR V T+ R Sbjct: 108 TISTPVRQMVLFEAFSNCSLTDIARRFHVADKTVQR 143 >UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5E64 Length = 173 Score = 42.4 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 21/103 (20%), Positives = 34/103 (33%), Gaps = 25/103 (24%) Query: 7 SCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKT------WQLQFT 44 CP C ++RNG + QR+LC C KT + ++ Sbjct: 48 KCPFCGE-KHIIRNGTKLSKIKILDVSNTPSYLYLRKQRFLCKSCSKTFSASTNFVRKYC 106 Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 A I + N + + A+ V +T+ R L Sbjct: 107 NIADS--IKLSIALESKNIISEKDIAKRFRVSSSTVKRSLLQY 147 >UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CFC8 Length = 262 Score = 42.0 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 11/34 (32%), Positives = 15/34 (44%), Gaps = 1/34 (2%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 C C + D + G G QRY+C C K + Sbjct: 97 QCLFCGSHD-FTKYGHKKDGTQRYICKGCGKRFT 129 >UniRef50_D2LYX8 Tn5468, transposition protein D n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LYX8_BACS4 Length = 609 Score = 42.0 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 38/87 (43%), Gaps = 12/87 (13%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQL-------QFTYTASQPG----THQKIIDMAMN-GV 64 + R K+ R+ C C ++ Y S+ ++ + + +N G+ Sbjct: 339 IRRCEKTKKLIGRFTCHTCDFSYTRKGMDPNKDDCYKFSRIMDFGFLWKRELQLLLNKGL 398 Query: 65 GCRATARIMGVGLNTILRHLKNSGRSR 91 R ARI+GV NT++++ K + ++ Sbjct: 399 SYREVARILGVDTNTVIKYEKKNIENK 425 >UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJK4_ACIJU Length = 460 Score = 41.2 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 31/93 (33%), Gaps = 20/93 (21%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 CP C +D + ++G +RY C C+ T+ + T Sbjct: 33 KCPKCG-SDQLYKHGTKPVIYRDIPRHMKPTVINVEVKRYRCKSCKATFLQEVTGIYPDT 91 Query: 51 GTHQKIIDMAMN---GVGCRATARIMGVGLNTI 80 ++ + + TAR+MG TI Sbjct: 92 RMTERFVKKIQDICLDYTFSDTARMMGCDSKTI 124 >UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2CJK1_9FIRM Length = 422 Score = 41.2 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 13/99 (13%), Positives = 28/99 (28%), Gaps = 20/99 (20%) Query: 8 CPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFT--YTASQ 49 CP C + +++ G ++ Q+ C C K + L+ Sbjct: 48 CPHCGSNHNLIKYGFKSSNVRCSRAGDYPVIIDLKKQKMFCKSCNKYFLLETKIVDKHCN 107 Query: 50 PG--THQKIIDMAMNGVGCRATARIMGVGLNTILRHLKN 86 + I+ + + V T+ R + Sbjct: 108 ISNQIKRHILASLTKKLSMKDIGSNNYVSTTTVARFMAK 146 >UniRef50_Q5NZ47 Putative uncharacterized protein n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5NZ47_AZOSE Length = 266 Score = 41.2 bits (95), Expect = 0.012, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 9/81 (11%) Query: 5 SISCPSCSATDGVVR------NGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIID 58 +++CPSC + D + G + Y C C + +P + Sbjct: 6 TVTCPSCGSADCRKSKWQSEGERQVQTGKRPYRCRACTHRF---HAPEHKRPRWRDRASF 62 Query: 59 MAMNGVGCRATARIMGVGLNT 79 M + A A ++ VG T Sbjct: 63 MVPALLMGAAIAAVIVVGART 83 >UniRef50_C8SCF8 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF8_FERPL Length = 317 Score = 40.8 bits (94), Expect = 0.012, Method: Composition-based stats. Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 5/70 (7%) Query: 5 SISCPSCSATDGVVR---NGKSTAGHQRYLCSHCRKTWQLQ--FTYTASQPGTHQKIIDM 59 + SCP C++ + + + ++YLC C T+ F +T P I + Sbjct: 32 NPSCPHCNSYHIIKKTDIKRERKGYAKKYLCRDCNSTFTFDNCFEWTHYPPRVVGDIFHL 91 Query: 60 AMNGVGCRAT 69 G R Sbjct: 92 IAKGESYRDI 101 >UniRef50_Q9H5H4 Zinc finger protein 768 n=9 Tax=Theria RepID=ZN768_HUMAN Length = 540 Score = 40.8 bits (94), Expect = 0.015, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 11/78 (14%) Query: 7 SCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTW-------QLQFTYTASQPGTHQK 55 CP C + ++R+ ++ +G + Y C HC K + + Q T++ +P + + Sbjct: 318 KCPRCGKAFADSSYLLRHQRTHSGQKPYKCPHCGKAFGDSSYLLRHQRTHSHERPYSCTE 377 Query: 56 IIDMAMNGVGCRATARIM 73 R+ R+ Sbjct: 378 CGKCYSQNSSLRSHQRVH 395 >UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family protein n=1 Tax=Comamonas testosteroni KF-1 RepID=B7X577_COMTE Length = 471 Score = 40.8 bits (94), Expect = 0.015, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 25/78 (32%), Gaps = 17/78 (21%) Query: 8 CPSCSATDGVVRNG----------------KSTAGHQRYLCSHCRKTWQLQFTYTASQPG 51 CP C D + R+G K A QRY C+ C++T+ Sbjct: 36 CPKCGTLDCIYRHGTKATTYVDIPMRGKPAKLRAKVQRYRCTSCKETFLQPLGGILEGRR 95 Query: 52 THQKIIDMAMNGVGCRAT 69 ++ + R T Sbjct: 96 MTERCAT-YIKAHSLRDT 112 >UniRef50_B2J7N9 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J7N9_NOSP7 Length = 428 Score = 40.4 bits (93), Expect = 0.016, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 13/86 (15%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C +T +NG Q YLC +C + + + + ++ N G Sbjct: 1 MKCPRCEST-SCRQNGCR-NDKQNYLCKNCGQQFLEPVFPHSLKG-------ELLANSNG 51 Query: 66 CRATARIMGVGLNTILRHLKNSGRSR 91 + + + T + +KN + Sbjct: 52 QTKVS----MAVTTEVFLVKNLPEEK 73 >UniRef50_B0CG58 Transcriptional regulator, TetR family n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CG58_ACAM1 Length = 260 Score = 40.4 bits (93), Expect = 0.018, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 + CP C +D + +NGK Q Y+C CRK + Sbjct: 221 MICPHCQ-SDRLSKNGKR-RNQQCYVCKDCRKQFVES 255 >UniRef50_Q9V1K2 Putative uncharacterized protein n=2 Tax=Pyrococcus RepID=Q9V1K2_PYRAB Length = 141 Score = 40.0 bits (92), Expect = 0.020, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 9/89 (10%) Query: 6 ISCPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQL-QFTYTASQPGTHQKIIDMA 60 I+CP C + +V+ G QRY C +C +T+ T +I Sbjct: 34 ITCPYC-KSPNIVKIGYIMRSGNFKIQRYKCKNCNRTFTELDGTPLKGAHSLKDIVIVAY 92 Query: 61 MN---GVGCRATARIMGVGLNTILRHLKN 86 + + + A+I+ + + R K Sbjct: 93 LTLDLKLPPSSIAKILPINRPKLYRAYKR 121 >UniRef50_UPI000186EB06 zinc finger protein 705A, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186EB06 Length = 555 Score = 39.7 bits (91), Expect = 0.027, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 5/70 (7%) Query: 8 CPSCSAT---DGVVRNGKSTAGHQRY--LCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 CP C T +R + T GH ++ C+ C K + + +P QK + + Sbjct: 469 CPECGKTFADRSNLRAHQRTRGHHKWEWRCASCNKAFSQERYLDRHRPEACQKYLQYTVR 528 Query: 63 GVGCRATARI 72 G + + I Sbjct: 529 QHGVKKFSEI 538 >UniRef50_Q8U293 Transposase n=53 Tax=Pyrococcus RepID=Q8U293_PYRFU Length = 314 Score = 39.7 bits (91), Expect = 0.029, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 27/73 (36%), Gaps = 4/73 (5%) Query: 18 VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGL 77 R G + Y K ++ P + +++ + G+ R TARI+ + Sbjct: 77 KRRGNMKSETIIYWVVSALKPFRRNKI----PPEKKIRGVELYLRGLSYRQTARILKISH 132 Query: 78 NTILRHLKNSGRS 90 T+ ++ + Sbjct: 133 VTVWEAVQKLAEA 145 >UniRef50_UPI000051A053 PREDICTED: similar to CG3407-PA n=1 Tax=Apis mellifera RepID=UPI000051A053 Length = 644 Score = 39.7 bits (91), Expect = 0.030, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 27/79 (34%), Gaps = 10/79 (12%) Query: 7 SCPSCSATDGVVR-----NGKSTAGHQRYLCSHCRKTWQ-LQFTYTASQPGTHQKIIDMA 60 C C D + + + G + Y C +C KT++ + + T +K A Sbjct: 468 QCEYCGK-DFARKYSLIVHRRIHTGEKNYRCEYCNKTFRASSYLQNHRRIHTGEKPHQCA 526 Query: 61 MNGVGCR---ATARIMGVG 76 + G R R M Sbjct: 527 VCGKPFRVRSDMKRHMHTH 545 >UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C34261 Length = 387 Score = 39.7 bits (91), Expect = 0.032, Method: Composition-based stats. Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 1/33 (3%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 CP C + + R G++ G QR C C+K W Sbjct: 74 CPDCYQRETI-RYGRNPQGSQRVQCRACKKVWT 105 >UniRef50_A3HUJ9 Transcriptional regulator, LuxR family protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HUJ9_9SPHI Length = 269 Score = 39.7 bits (91), Expect = 0.033, Method: Composition-based stats. Identities = 9/37 (24%), Positives = 21/37 (56%) Query: 54 QKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 ++I+ M ++G+ + +++ + NT+ H KN R Sbjct: 211 KQIVKMILDGMKSKEIGQVLNISFNTVSTHRKNILRK 247 >UniRef50_A7JMB8 Predicted protein n=8 Tax=Francisella RepID=A7JMB8_FRANO Length = 82 Score = 39.3 bits (90), Expect = 0.038, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 30/83 (36%), Gaps = 4/83 (4%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH--QKIIDMAMNG 63 I C C ++G+ + G QRY C C + L +I ++ Sbjct: 2 IKCNRCH-SEGIHKTG-VVRNKQRYKCKSCGYNFVLSDGRIKPDIAIKLALTVIMYSLGK 59 Query: 64 VGCRATARIMGVGLNTILRHLKN 86 A++ GV + TI L+ Sbjct: 60 YSYGFIAKLFGVRMTTIQNWLEQ 82 >UniRef50_Q0W590 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W590_UNCMA Length = 166 Score = 39.3 bits (90), Expect = 0.043, Method: Composition-based stats. Identities = 11/63 (17%), Positives = 22/63 (34%), Gaps = 8/63 (12%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGH------QRYLCSHCRKTWQLQFTYTASQPGTHQ 54 M +++ CP C+ G Q+Y C C ++ + + P + Sbjct: 1 MTRLTVHCPRCAGEPHEYAVAWRRTGKKKWLSLQKYRCLKCGHRFEKKHS--GKPPKSLA 58 Query: 55 KII 57 + I Sbjct: 59 RAI 61 >UniRef50_B8X8Z3 Resolvase n=1 Tax=Pectobacterium atrosepticum RepID=B8X8Z3_ERWCT Length = 109 Score = 39.3 bits (90), Expect = 0.044, Method: Composition-based stats. Identities = 12/46 (26%), Positives = 22/46 (47%), Gaps = 1/46 (2%) Query: 44 TYTASQPGTHQKIID-MAMNGVGCRATARIMGVGLNTILRHLKNSG 88 Y A P Q+I+ NG+ + + + GV +T+ +K+S Sbjct: 63 FYRAQPPEIQQQIMQNAYNNGMTVKDISEVTGVATSTVYSKIKSSR 108 >UniRef50_C3XYB0 Putative uncharacterized protein n=2 Tax=Chordata RepID=C3XYB0_BRAFL Length = 1482 Score = 38.9 bits (89), Expect = 0.045, Method: Composition-based stats. Identities = 10/53 (18%), Positives = 21/53 (39%), Gaps = 4/53 (7%) Query: 7 SCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 790 TCRYCGKLFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 842 >UniRef50_A2A935 PR domain zinc finger protein 16 n=35 Tax=Euteleostomi RepID=PRD16_MOUSE Length = 1275 Score = 38.9 bits (89), Expect = 0.045, Method: Composition-based stats. Identities = 10/53 (18%), Positives = 21/53 (39%), Gaps = 4/53 (7%) Query: 7 SCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 952 TCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 1004 >UniRef50_B8F7J2 Putative uncharacterized protein n=1 Tax=Haemophilus parasuis SH0165 RepID=B8F7J2_HAEPS Length = 233 Score = 38.9 bits (89), Expect = 0.049, Method: Composition-based stats. Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 2/35 (5%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL 41 C C ++D + ++G QRY C+ C KT+ L Sbjct: 38 KCHFCHSSD-IRKHGIR-NNIQRYKCNACNKTFTL 70 >UniRef50_UPI0001793699 PREDICTED: similar to zinc-finger homeodomain protein 1, partial n=1 Tax=Acyrthosiphon pisum RepID=UPI0001793699 Length = 1011 Score = 38.9 bits (89), Expect = 0.052, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 4/69 (5%) Query: 7 SCPSCSAT----DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 CP C + + + +G + + CS+C K + +Y++ II++ M Sbjct: 246 KCPECEKAFKFKHHLKEHIRIHSGEKPFECSNCGKRFSHSGSYSSHMTSKKCLIINLKMG 305 Query: 63 GVGCRATAR 71 G G +A R Sbjct: 306 GRGSQANNR 314 >UniRef50_Q9HAZ2 PR domain zinc finger protein 16 n=26 Tax=Euteleostomi RepID=PRD16_HUMAN Length = 1276 Score = 38.9 bits (89), Expect = 0.052, Method: Composition-based stats. Identities = 10/53 (18%), Positives = 21/53 (39%), Gaps = 4/53 (7%) Query: 7 SCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 952 TCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 1004 >UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_METMA Length = 148 Score = 38.9 bits (89), Expect = 0.053, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 29/81 (35%), Gaps = 4/81 (4%) Query: 12 SATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNGVGCR 67 + + V++ H + C C+K + F + + I M R Sbjct: 34 NQGNIVLKERYGKNNHALFKCKTCKKCFSETKGTIFFELNTPDEEVLRTIAMLPEKGSIR 93 Query: 68 ATARIMGVGLNTILRHLKNSG 88 AR G +TI R L+ +G Sbjct: 94 GVARATGHSKDTICRWLEIAG 114 >UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVK5_PARL1 Length = 608 Score = 38.9 bits (89), Expect = 0.056, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 26/64 (40%), Gaps = 2/64 (3%) Query: 23 STAGHQRYLCSHCRKTWQLQFTYTASQ--PGTHQKIIDMAMNGVGCRATARIMGVGLNTI 80 S G QR+ C C++T+ + T Q P ++ + ++ R + G+ + Sbjct: 133 SRGGAQRFRCKACQRTFSVALKSTVRQRAPHLNRTVFAEVVSKKPLRGIMEVTGLSAAAV 192 Query: 81 LRHL 84 L Sbjct: 193 YDKL 196 >UniRef50_Q4S840 Chromosome 9 SCAF14710, whole genome shotgun sequence n=5 Tax=Tetraodontidae RepID=Q4S840_TETNG Length = 1167 Score = 38.9 bits (89), Expect = 0.056, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 20/52 (38%), Gaps = 4/52 (7%) Query: 8 CPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 836 CRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 887 >UniRef50_A8NLR1 Predicted protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8NLR1_COPC7 Length = 146 Score = 38.5 bits (88), Expect = 0.058, Method: Composition-based stats. Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 5/59 (8%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT-----HQKIIDMAMNGVGCRATA 70 + +NG+ RY C CR T + I+ + G G RATA Sbjct: 77 IRKNGRDPEDIFRYQCPMCRATVNESPVPNYVLRSLVSELQRELIVFLMARGTGYRATA 135 >UniRef50_Q03112 MDS1 and EVI1 complex locus protein EVI1 n=58 Tax=Euteleostomi RepID=EVI1_HUMAN Length = 1051 Score = 38.5 bits (88), Expect = 0.060, Method: Composition-based stats. Identities = 10/53 (18%), Positives = 21/53 (39%), Gaps = 4/53 (7%) Query: 7 SCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 734 TCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 786 >UniRef50_Q6MK35 Putative transposase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MK35_BDEBA Length = 300 Score = 38.5 bits (88), Expect = 0.063, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 37/97 (38%), Gaps = 15/97 (15%) Query: 4 VSISCPSCS-------ATDGVVRNGK--STAGHQ---RYLCSHCRKTW---QLQFTYTAS 48 +++ CP C A + R G+ + Q R C C K++ Sbjct: 7 MNLKCPYCHLQRDPKDANRTIRRLGRYYRKSDGQTLTRLWCVRCGKSFSAATQSRLKGQK 66 Query: 49 QPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLK 85 + ++ I D+ + R AR++ + T++R + Sbjct: 67 KRHLNKLIRDLLTGEMSQREIARVLKINRKTVVRKFR 103 >UniRef50_B7UNR4 Predicted protein n=51 Tax=Enterobacteriaceae RepID=B7UNR4_ECO27 Length = 374 Score = 38.5 bits (88), Expect = 0.072, Method: Composition-based stats. Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 1/33 (3%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ 40 CP C TD + G + G QR C +C+K W Sbjct: 74 CPVCYGTDMIC-YGHNPQGSQRIQCRNCKKVWT 105 >UniRef50_B9ZCS9 DNA topoisomerase type IA zn finger domain protein n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZCS9_NATMA Length = 244 Score = 38.5 bits (88), Expect = 0.072, Method: Composition-based stats. Identities = 15/35 (42%), Positives = 23/35 (65%), Gaps = 4/35 (11%) Query: 6 ISCPSCSATDGVVRNGKSTAGH-QRYLCSHCRKTW 39 ++CP C +D V+NG + GH Q YLC +C +T+ Sbjct: 26 VTCPRC-RSDLTVKNG--SYGHFQHYLCKNCDRTF 57 >UniRef50_B2JXE0 Putative uncharacterized protein n=2 Tax=Burkholderiaceae RepID=B2JXE0_BURP8 Length = 358 Score = 38.1 bits (87), Expect = 0.085, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 33/87 (37%), Gaps = 5/87 (5%) Query: 8 CPSCSATDGVVR-NGKSTAG-HQRYLCSHCRKTWQLQFTYTASQPGTHQK---IIDMAMN 62 CP C T + + + G Y C C + S+ Q+ +I + Sbjct: 59 CPRCRGTRILKKGYARLRTGPLPTYRCEQCGHCFSRLSGTPLSKRPVRQQAGELIALLPQ 118 Query: 63 GVGCRATARIMGVGLNTILRHLKNSGR 89 + C AR +GV +T+L ++ R Sbjct: 119 EISCAEAARQLGVMEHTVLETVRLVRR 145 >UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellular organisms RepID=Q64DF0_9ARCH Length = 337 Score = 38.1 bits (87), Expect = 0.087, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 32/88 (36%), Gaps = 9/88 (10%) Query: 7 SCPSCSATDGV---VRNGKSTAG-HQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIID 58 CP C +++ V + + G Q LC C ++ + T K++ Sbjct: 9 KCPKC-SSENVRFDYKYDTISNGSRQMLLCRGCGASFSETKNTFLQNIRTPVSTIWKVLK 67 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKN 86 G AT R+ + NT+L + Sbjct: 68 SRTEGTSLNATCRVFDIAKNTLLAWERK 95 >UniRef50_B7UI65 Predicted protein n=52 Tax=Enterobacteriaceae RepID=B7UI65_ECO27 Length = 419 Score = 38.1 bits (87), Expect = 0.089, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 31/83 (37%), Gaps = 3/83 (3%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC 66 C +C T + + G S G +R C HC KT+ + + P Q + M G Sbjct: 67 QCSTCGGT-SLKKYGYSAQGQRRMYCHHCEKTFI-TLEHVITTPRGAQLAL-MIEQGEAL 123 Query: 67 RATARIMGVGLNTILRHLKNSGR 89 + + + + R L R Sbjct: 124 ADIRKSLLLNSTGLSRELLKLAR 146 >UniRef50_UPI0001792185 PREDICTED: similar to AGAP008232-PA n=1 Tax=Acyrthosiphon pisum RepID=UPI0001792185 Length = 984 Score = 38.1 bits (87), Expect = 0.091, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 22/52 (42%), Gaps = 4/52 (7%) Query: 8 CPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 C C + + R+ ++ G Q Y C+HC +++ + H++ Sbjct: 829 CKYCGKVFPRSANLTRHVRTHTGEQPYRCAHCERSFSISSNLQRHVRNIHRQ 880 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=roo... 105 3e-22 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 103 2e-21 UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax... 100 1e-20 UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma ling... 98 1e-19 UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacter... 96 3e-19 UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriale... 96 4e-19 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 95 7e-19 UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_S... 93 2e-18 UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriacea... 93 3e-18 UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammapr... 92 6e-18 UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwin... 92 6e-18 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 91 8e-18 UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria... 90 1e-17 UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methano... 89 4e-17 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 89 4e-17 UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp... 88 7e-17 UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriale... 87 2e-16 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 85 7e-16 UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp.... 83 2e-15 UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ru... 83 2e-15 UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ... 83 2e-15 UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis ... 83 3e-15 UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes ... 83 3e-15 UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepI... 82 4e-15 UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 R... 82 4e-15 UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD 82 5e-15 UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillu... 82 5e-15 UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyri... 82 8e-15 UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candid... 81 1e-14 UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID... 81 1e-14 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 80 2e-14 UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitroso... 80 2e-14 UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelan... 80 2e-14 UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum l... 79 4e-14 UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacter... 79 4e-14 UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyric... 79 4e-14 UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. ... 78 6e-14 UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavoba... 78 6e-14 UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani ... 78 1e-13 UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellula... 77 1e-13 UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae... 77 1e-13 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 77 2e-13 UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryoc... 77 2e-13 UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorick... 76 3e-13 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 76 3e-13 UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 75 5e-13 UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=enviro... 75 6e-13 UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax... 75 7e-13 UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus... 75 8e-13 UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl... 75 8e-13 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 74 1e-12 UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium... 74 1e-12 UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE 74 2e-12 UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gamm... 73 2e-12 UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 2... 73 2e-12 UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea p... 73 3e-12 UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryoc... 73 3e-12 UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodoba... 73 3e-12 UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU 72 4e-12 UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoi... 72 4e-12 UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=D... 72 5e-12 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 71 9e-12 UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcti... 71 9e-12 UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=H... 71 1e-11 UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium ... 71 1e-11 UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=... 70 2e-11 UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus pl... 68 6e-11 UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus ... 68 8e-11 UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidob... 68 1e-10 UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aur... 68 1e-10 UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmon... 67 1e-10 UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marin... 67 1e-10 UniRef50_P04137 Uncharacterized protein in transposable element ... 67 2e-10 UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_V... 67 2e-10 UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_... 66 3e-10 UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12... 66 3e-10 UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax... 66 4e-10 UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria ... 65 7e-10 UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Met... 65 7e-10 UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus p... 65 9e-10 UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultu... 65 1e-09 UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methano... 64 1e-09 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 63 2e-09 UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q... 63 3e-09 UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacte... 63 4e-09 UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepI... 62 4e-09 UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX 62 5e-09 UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobact... 62 5e-09 UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C... 62 6e-09 UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodoba... 62 6e-09 UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_E... 60 2e-08 UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=... 60 2e-08 UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax... 60 2e-08 UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae... 60 2e-08 UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synecho... 60 2e-08 UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobi... 60 2e-08 UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultu... 60 3e-08 UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=... 59 5e-08 UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia ... 58 6e-08 UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Kl... 58 6e-08 UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT... 58 8e-08 UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q... 58 9e-08 UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichod... 57 2e-07 UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervido... 57 3e-07 UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nod... 57 3e-07 UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus ... 55 6e-07 UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryoc... 53 4e-06 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 53 4e-06 UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edw... 52 5e-06 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 52 5e-06 UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacte... 52 7e-06 UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia R... 52 8e-06 UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae... 51 1e-05 UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanoth... 50 2e-05 UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseud... 50 3e-05 UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros... 49 4e-05 UniRef50_B4WUH8 Putative uncharacterized protein n=1 Tax=Synecho... 49 4e-05 UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candida... 46 4e-04 Sequences not found previously or not previously below threshold: UniRef50_C1I4B6 Putative uncharacterized protein n=2 Tax=Clostri... 75 6e-13 UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychro... 67 2e-10 UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobact... 62 4e-09 UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax... 58 6e-08 UniRef50_Q4JSN3 Transposase for IS3507b n=53 Tax=Actinobacterida... 58 8e-08 UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_... 57 2e-07 UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracas... 57 3e-07 UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida... 55 9e-07 UniRef50_C7N1Y2 Putative uncharacterized protein n=1 Tax=Slackia... 54 1e-06 UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methano... 53 4e-06 UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellula... 52 4e-06 UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2... 52 6e-06 UniRef50_A1VN28 Insertion element protein n=1 Tax=Polaromonas na... 52 9e-06 UniRef50_C9BRL4 Transposase n=30 Tax=Enterococcus RepID=C9BRL4_E... 52 9e-06 UniRef50_Q9V1K2 Putative uncharacterized protein n=2 Tax=Pyrococ... 50 2e-05 UniRef50_Q7NH53 TetR family transcriptional regulatory protein n... 50 2e-05 UniRef50_C5S2C5 Putative transposase n=1 Tax=Actinobacillus mino... 50 2e-05 UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_M... 50 2e-05 UniRef50_Q03IY7 Transposase n=198 Tax=Lactobacillales RepID=Q03I... 50 2e-05 UniRef50_B4VTL4 Putative uncharacterized protein n=1 Tax=Microco... 50 2e-05 UniRef50_Q11ZU0 Putative uncharacterized protein n=1 Tax=Polarom... 50 2e-05 UniRef50_Q03NU3 Transposase n=12 Tax=Lactobacillus RepID=Q03NU3_... 50 3e-05 UniRef50_D0U1S9 Transposase n=1 Tax=Enterococcus faecium RepID=D... 50 3e-05 UniRef50_Q2RQJ8 Putative uncharacterized protein n=1 Tax=Rhodosp... 49 3e-05 UniRef50_A7JMB8 Predicted protein n=8 Tax=Francisella RepID=A7JM... 49 5e-05 UniRef50_Q6MK35 Putative transposase n=1 Tax=Bdellovibrio bacter... 49 5e-05 UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacteriu... 49 5e-05 UniRef50_C4W7G8 Transposase for ISSha1 n=2 Tax=Staphylococcus Re... 48 9e-05 UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichod... 48 9e-05 UniRef50_D1QQX4 Putative uncharacterized protein n=15 Tax=Prevot... 48 1e-04 UniRef50_B3GXU2 Transposase n=15 Tax=Pasteurellaceae RepID=B3GXU... 48 1e-04 UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostoca... 47 1e-04 UniRef50_B8F7V2 ISRssp2, family IS1595 n=4 Tax=Pasteurellaceae R... 47 1e-04 UniRef50_D1PSS1 Insertion element protein (Fragment) n=14 Tax=Pr... 47 1e-04 UniRef50_Q035C5 Transposase n=27 Tax=Lactobacillales RepID=Q035C... 47 2e-04 UniRef50_Q7VL05 Possible transposase n=4 Tax=Pasteurellaceae Rep... 47 2e-04 UniRef50_C8SCF8 Putative uncharacterized protein n=1 Tax=Ferrogl... 47 2e-04 UniRef50_D1W685 Putative uncharacterized protein n=2 Tax=Prevote... 47 2e-04 UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepI... 47 2e-04 UniRef50_B2J7N9 Putative uncharacterized protein n=1 Tax=Nostoc ... 47 2e-04 UniRef50_C7RJT2 Conserved possible transposase n=21 Tax=Proteoba... 47 2e-04 UniRef50_Q93CQ1 Transposase TnpA n=1 Tax=Enterococcus faecium Re... 47 2e-04 UniRef50_B0CG58 Transcriptional regulator, TetR family n=1 Tax=A... 47 2e-04 UniRef50_A5FLG0 Putative uncharacterized protein n=1 Tax=Flavoba... 47 2e-04 UniRef50_A7HMZ5 Transposase IS204/IS1001/IS1096/IS1165 family pr... 47 3e-04 UniRef50_D2LK53 Putative uncharacterized protein n=1 Tax=Rhodomi... 46 3e-04 UniRef50_Q3Y3Y3 Transposase, IS204/IS1001/IS1096/IS1165 n=11 Tax... 46 3e-04 UniRef50_C1DPZ8 Transposase n=4 Tax=Bacteria RepID=C1DPZ8_AZOVD 46 3e-04 UniRef50_C6HZQ4 Transposase n=2 Tax=Leptospirillum ferrodiazotro... 46 4e-04 UniRef50_Q6V7R1 Bcep22gp32 n=1 Tax=Burkholderia phage Bcep22 Rep... 46 4e-04 UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synecho... 46 4e-04 UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderi... 46 4e-04 UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family pr... 46 4e-04 UniRef50_A8UDH0 Transposase n=5 Tax=Bacteria RepID=A8UDH0_9LACT 46 4e-04 UniRef50_C0WLQ9 Transposase n=3 Tax=Lactobacillus RepID=C0WLQ9_L... 46 4e-04 UniRef50_Q7N9S9 Transposase TnpA, ISL3 family n=1 Tax=Photorhabd... 46 5e-04 UniRef50_C9CRL2 Transposase n=3 Tax=Alphaproteobacteria RepID=C9... 46 5e-04 UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parviba... 45 5e-04 UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH... 45 6e-04 UniRef50_Q2J1M8 Putative uncharacterized protein n=1 Tax=Rhodops... 45 6e-04 UniRef50_UPI0001C31088 transcriptional regulator, TetR family n=... 45 7e-04 UniRef50_B9ZCS9 DNA topoisomerase type IA zn finger domain prote... 45 8e-04 UniRef50_A7HYI5 Putative uncharacterized protein n=1 Tax=Parviba... 45 9e-04 UniRef50_UPI00016C448A hypothetical protein GobsU_12575 n=6 Tax=... 45 9e-04 UniRef50_C2H217 Possible transposase n=5 Tax=Enterococcaceae Rep... 45 0.001 UniRef50_A5VLK7 Transposase, IS204/IS1001/IS1096/IS1165 family p... 45 0.001 UniRef50_B9JNY3 Transposase n=4 Tax=Alphaproteobacteria RepID=B9... 45 0.001 UniRef50_C6QEP3 ISSpo8, transposase n=4 Tax=Alphaproteobacteria ... 45 0.001 UniRef50_Q1GHU2 Putative uncharacterized protein n=1 Tax=Ruegeri... 44 0.001 UniRef50_D1UAU0 Transposase, putative n=1 Tax=Desulfovibrio aesp... 44 0.001 UniRef50_D0MDA7 Transposase-like protein n=7 Tax=Bacteria RepID=... 44 0.001 UniRef50_C5RB59 Possible transposase n=1 Tax=Weissella paramesen... 44 0.001 UniRef50_B1IC92 Transposase n=24 Tax=Lactobacillales RepID=B1IC9... 44 0.001 UniRef50_Q2P6H2 ISXo5 transposase n=74 Tax=Xanthomonas RepID=Q2P... 44 0.001 UniRef50_A8YX76 Transposase n=42 Tax=Lactobacillus RepID=A8YX76_... 44 0.001 UniRef50_Q87RY6 Putative resolvase n=3 Tax=Vibrio parahaemolytic... 44 0.001 UniRef50_C7P9K3 Transcriptional regulator, ArsR family n=2 Tax=M... 44 0.001 UniRef50_UPI000186E028 transcription factor Sp4, putative n=1 Ta... 44 0.001 UniRef50_C0MDD8 Putative transposase n=1 Tax=Steptococcus equi s... 44 0.002 UniRef50_A2V378 Putative uncharacterized protein n=1 Tax=Shewane... 44 0.002 UniRef50_B2SSB8 Transposase TnpA, ISL3 family n=6 Tax=Bacteria R... 43 0.002 UniRef50_Q5ZT03 Transposase (IS652) n=29 Tax=Gammaproteobacteria... 43 0.002 UniRef50_Q12Y80 Transposase n=1 Tax=Methanococcoides burtonii DS... 43 0.002 UniRef50_A9IG79 ISSod11, transposase n=14 Tax=Proteobacteria Rep... 43 0.002 UniRef50_C3MUP9 Resolvase helix-turn-helix domain protein n=40 T... 43 0.002 UniRef50_B2SIA3 ISXo5 transposase n=157 Tax=Proteobacteria RepID... 43 0.002 UniRef50_C0W2A4 Transposase (Fragment) n=1 Tax=Actinomyces coleo... 43 0.003 UniRef50_Q8R819 Transposase n=2 Tax=Thermoanaerobacter tengconge... 43 0.003 UniRef50_C0WEV9 Transposase (Fragment) n=1 Tax=Acidaminococcus s... 43 0.003 UniRef50_B9JG85 Putative uncharacterized protein n=1 Tax=Agrobac... 43 0.003 UniRef50_Q894I5 Phage-related protein n=1 Tax=Clostridium tetani... 43 0.004 UniRef50_A3VEU0 ISSpo8, transposase n=1 Tax=Rhodobacterales bact... 43 0.004 UniRef50_B2JXE0 Putative uncharacterized protein n=2 Tax=Burkhol... 43 0.004 UniRef50_D2EIL2 Transposase n=1 Tax=Pediococcus acidilactici 7_4... 43 0.004 UniRef50_B9Y9S5 Putative uncharacterized protein (Fragment) n=1 ... 43 0.004 UniRef50_UPI00016C46F4 hypothetical protein GobsU_15563 n=2 Tax=... 43 0.004 UniRef50_Q5LW63 ISSpo8, transposase n=4 Tax=Rhodobacterales RepI... 43 0.004 UniRef50_A9BGL8 Transposase IS204/IS1001/IS1096/IS1165 family pr... 43 0.004 UniRef50_C9RDH8 Regulatory protein LacI n=1 Tax=Ammonifex degens... 42 0.004 UniRef50_C1F2K1 Unclassified family transposase n=1 Tax=Acidobac... 42 0.004 UniRef50_UPI00005872C7 PREDICTED: similar to ENSANGP00000019944,... 42 0.005 UniRef50_B8FWC8 Putative uncharacterized protein n=1 Tax=Desulfi... 42 0.005 UniRef50_B2UM39 Putative uncharacterized protein n=1 Tax=Akkerma... 42 0.005 UniRef50_Q4L7B5 Transposase for ISSha1 n=49 Tax=Staphylococcus R... 42 0.005 UniRef50_Q8U293 Transposase n=53 Tax=Pyrococcus RepID=Q8U293_PYRFU 42 0.005 UniRef50_D2MKS9 ISXo5 transposase n=1 Tax=Candidatus Poribacteri... 42 0.005 UniRef50_A2A935 PR domain zinc finger protein 16 n=35 Tax=Eutele... 42 0.005 UniRef50_B8HUB6 Putative uncharacterized protein n=1 Tax=Cyanoth... 42 0.006 UniRef50_C1SJT0 Transposase family protein, COG3464 n=1 Tax=Deni... 42 0.006 UniRef50_A0Q207 Transcriptional regulator n=3 Tax=Clostridium Re... 42 0.006 UniRef50_Q9HAZ2 PR domain zinc finger protein 16 n=26 Tax=Eutele... 42 0.006 UniRef50_B7C761 Putative uncharacterized protein n=1 Tax=Eubacte... 42 0.006 UniRef50_A8A9S5 Transcriptional regulator, AsnC family n=1 Tax=I... 42 0.006 UniRef50_A3YWR5 IS1595 transposase n=2 Tax=Synechococcus sp. WH ... 42 0.007 UniRef50_UPI0001793827 PREDICTED: similar to CG5669 CG5669-PA n=... 42 0.007 UniRef50_A5KRX5 ISSpo8, transposase n=2 Tax=candidate division T... 42 0.007 UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=... 42 0.007 UniRef50_C3XYB0 Putative uncharacterized protein n=2 Tax=Chordat... 42 0.007 UniRef50_B0SXP6 Putative transposase n=1 Tax=Caulobacter sp. K31... 42 0.007 UniRef50_Q54X15 Type-2 histone deacetylase 1 n=1 Tax=Dictyosteli... 42 0.007 UniRef50_Q9H5H4 Zinc finger protein 768 n=9 Tax=Theria RepID=ZN7... 42 0.007 UniRef50_C0QSB1 Integrase core domain protein n=1 Tax=Persephone... 42 0.008 UniRef50_A2FJ98 Transposase family protein n=4 Tax=cellular orga... 42 0.008 UniRef50_UPI00016C406B hypothetical protein GobsU_27181 n=1 Tax=... 42 0.008 UniRef50_A7BQK2 Transposase n=3 Tax=Bacteria RepID=A7BQK2_9GAMM 42 0.009 UniRef50_A1SYK0 Putative uncharacterized protein n=2 Tax=Psychro... 42 0.009 UniRef50_Q4S840 Chromosome 9 SCAF14710, whole genome shotgun seq... 42 0.009 UniRef50_Q03112 MDS1 and EVI1 complex locus protein EVI1 n=58 Ta... 41 0.010 UniRef50_Q3C030 Putative sigma-54-dependent transcriptional regu... 41 0.010 UniRef50_B7UNR4 Predicted protein n=51 Tax=Enterobacteriaceae Re... 41 0.010 UniRef50_Q51984 TnpA n=3 Tax=Gammaproteobacteria RepID=Q51984_PSEPU 41 0.010 UniRef50_C7PCU2 Two component transcriptional regulator, LuxR fa... 41 0.011 >UniRef50_P0C653 Insertion element IS1 protein insA n=185 Tax=root RepID=INSA2_ECOLX Length = 91 Score = 105 bits (263), Expect = 3e-22, Method: Composition-based stats. Identities = 90/91 (98%), Positives = 90/91 (98%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA Sbjct: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 MNGVGCRATARIMGVGLNTI RHLKNSGRSR Sbjct: 61 MNGVGCRATARIMGVGLNTIFRHLKNSGRSR 91 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V+I CP C + V R+G++ G R C C + +QL +TY A +PG + I +MA Sbjct: 1 MARVNIHCPRCQSAQ-VYRHGQNPKGRDRLRCRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 NG G R TAR + +G NT++R LK R Sbjct: 60 FNGAGVRDTARTLKIGSNTVIRTLKKLAPKR 90 >UniRef50_P03829 Insertion element iso-IS1N protein insA n=87 Tax=Gammaproteobacteria RepID=INA2_SHIDY Length = 90 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 42/91 (46%), Positives = 60/91 (65%), Gaps = 1/91 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MASV+I CP C + V R+G++ GH R+ C C + +QL +TY A +PG + I +MA Sbjct: 1 MASVNIHCPRCQSAQ-VYRHGQNPKGHDRFRCRDCHRVFQLTYTYEARKPGIKELITEMA 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 NG G R TAR + +G+NT++R LKNS +S Sbjct: 60 FNGAGVRDTARTLKIGINTVIRTLKNSRQSE 90 >UniRef50_D2QQ17 Insertion element protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQ17_9SPHI Length = 107 Score = 97.8 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 32/91 (35%), Positives = 49/91 (53%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M +++C T + R G + AG QRY C C +T+ +T+ A P ++I M Sbjct: 1 MVLEAVTCKHFGQTQHIKRYGTTCAGTQRYRCFDCGRTFVQTYTHKARDPLVKEQITQMV 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 +NG G R TAR++GV NT+ K +G +R Sbjct: 61 LNGAGIRDTARVLGVNRNTVSAQFKKNGAAR 91 >UniRef50_A4TI48 Insertion sequence protein n=36 Tax=Enterobacteriaceae RepID=A4TI48_YERPP Length = 91 Score = 96.2 bits (238), Expect = 3e-19, Method: Composition-based stats. Identities = 40/90 (44%), Positives = 57/90 (63%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA V + CP C V ++G GHQRY C CR+++QL++ Y A PG ++I+D+A Sbjct: 1 MAKVDVKCPFCEQFHPVKKHGPGRTGHQRYRCQACRRSFQLEYEYRACHPGMKEQIVDLA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 MN G R TAR + + +N ++R LKNS RS Sbjct: 61 MNNAGIRDTARALHISINAVMRTLKNSRRS 90 >UniRef50_B5W4N9 Insertion element protein n=3 Tax=Oscillatoriales RepID=B5W4N9_SPIMA Length = 163 Score = 95.8 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 2/80 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + VV+NG G Q YLC C + ++ + + M++NG+G Sbjct: 1 MDCPYCQ-SHKVVKNGH-RQGKQSYLCRECGRQFRENPCPGGYSSDVKELCVKMSLNGMG 58 Query: 66 CRATARIMGVGLNTILRHLK 85 RA R+ G+ NTIL ++ Sbjct: 59 FRAIERVTGISHNTILNWVR 78 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 94.7 bits (234), Expect = 7e-19, Method: Composition-based stats. Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 2/87 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 +SCPSC + VV+ G+ G Q++LC C K + +Y ++ + M NG+ Sbjct: 10 DVSCPSCG-SHHVVKCGR-PLGRQKFLCRDCGKYFLGDASYHHHSRKLREEALRMYANGM 67 Query: 65 GCRATARIMGVGLNTILRHLKNSGRSR 91 RA +R++ V L T+ +K GR + Sbjct: 68 SMRAISRVLNVPLGTVFTWIKRYGRKK 94 >UniRef50_Q4KRH3 Transposase-like n=4 Tax=Bacteria RepID=Q4KRH3_STRAG Length = 345 Score = 93.1 bits (230), Expect = 2e-18, Method: Composition-based stats. Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 5/87 (5%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIIDMAMNG 63 CP C VVRNG G QRY+C C K++ + T ++ ID MNG Sbjct: 52 CPLCGCI-HVVRNGHRKDGTQRYVCKDCGKSFVIATNSIVSGTRKDLSVWEQYIDCMMNG 110 Query: 64 VGCRATARIMGVGLNTILRHLKNSGRS 90 + R TA G+ NT + Sbjct: 111 LSIRKTAVACGIHRNTAFLWRHKILDA 137 >UniRef50_B7LWW4 Transposase ORF A, IS1 n=3 Tax=Enterobacteriaceae RepID=B7LWW4_ECO55 Length = 134 Score = 92.8 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 29/91 (31%), Positives = 47/91 (51%), Gaps = 3/91 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+SV+I CP C + V R+G++ G R+ C + +QL +TY A +PG + I +MA Sbjct: 1 MSSVNIHCPRCQSAQ-VYRHGQNPKGRDRFRYRDCHRVFQLTYTYQARKPGMKELITEMA 59 Query: 61 MN--GVGCRATARIMGVGLNTILRHLKNSGR 89 N G+ AR+ G+ + + K Sbjct: 60 FNEPGMMLARMARLHGIQPCQLFKWKKQYLE 90 >UniRef50_A7N597 Putative uncharacterized protein n=6 Tax=Gammaproteobacteria RepID=A7N597_VIBHB Length = 91 Score = 92.0 bits (227), Expect = 6e-18, Method: Composition-based stats. Identities = 38/91 (41%), Positives = 58/91 (63%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA++ + C C+ T+ V ++GK +G R+ C CRK++QL + Y A +P +KI+DMA Sbjct: 1 MATIQVQCRFCNKTESVRKHGKGHSGFPRFRCIECRKSFQLDYVYEARKPNVKEKIVDMA 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 MN G R TA ++ V NT+L LKNS + + Sbjct: 61 MNSSGVRETAGVLNVAYNTVLSTLKNSRQGK 91 >UniRef50_D0FXR2 Putative insertion element protein n=2 Tax=Erwinia RepID=D0FXR2_ERWPY Length = 92 Score = 92.0 bits (227), Expect = 6e-18, Method: Composition-based stats. Identities = 40/91 (43%), Positives = 54/91 (59%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M I+CP CS + + RNG+S +G QRY C C KT+QL F Y S P + II+M Sbjct: 1 MKMGDIACPRCSESARIRRNGRSASGIQRYRCQGCLKTFQLHFYYAGSSPNMQKTIIEMM 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 +G R AR +GV L T+LRHLK+ ++ Sbjct: 61 NDGSEQRDIARKLGVSLETVLRHLKDLRLNK 91 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 91.2 bits (225), Expect = 8e-18, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 3/86 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF-TYTASQPGTHQKIIDMAMNGV 64 + CP C AT+ + +NGK G Q ++C+ C + + + Q+ ++M +NG+ Sbjct: 1 MQCPYCGATE-IRKNGK-RRGKQNHICTKCERQFIDVYDPPKGYSEELKQECLEMYLNGM 58 Query: 65 GCRATARIMGVGLNTILRHLKNSGRS 90 G R R+ GV TI+ +K G Sbjct: 59 GFRPIERVKGVHHTTIIFWVKQMGEK 84 >UniRef50_C4MEL4 Transposase-like protein n=13 Tax=Proteobacteria RepID=C4MEL4_CAMCO Length = 339 Score = 90.4 bits (223), Expect = 1e-17, Method: Composition-based stats. Identities = 29/85 (34%), Positives = 38/85 (44%), Gaps = 6/85 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT----YTASQPGTHQKIIDMAMN 62 CP C+ +D V+NGK+ HQRY+C C KT+ T G K ID +N Sbjct: 47 HCPYCN-SDKFVKNGKAKT-HQRYICKTCNKTFTDTNKTILFNTKKDIGIWYKYIDCLVN 104 Query: 63 GVGCRATARIMGVGLNTILRHLKNS 87 R TA+I G+ L T Sbjct: 105 KYPLRKTAKICGISLPTAFVWRHKI 129 >UniRef50_Q46CV1 Putative uncharacterized protein n=4 Tax=Methanosarcina RepID=Q46CV1_METBF Length = 139 Score = 89.3 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++CP C+++ +NG G Q Y C C + ++ TAS P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQHYKCHDCGYNYTVEVKSTASSPSVKRQALQLYLEGLG 58 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 R+ R +GV ++ + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 88.9 bits (219), Expect = 4e-17, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++CP C+++ +NG G QRY C C + ++ T+ P ++ + + + G+G Sbjct: 1 MNCPRCNSSTH-KKNG-IVFGRQRYKCHDCGYNYTVEVKSTSISPSVKRQALQLYLEGLG 58 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 R+ R +GV ++ + +K G+ Sbjct: 59 FRSIGRFLGVSHVSVQKWIKKFGQ 82 >UniRef50_A3JAS9 Hypothetical transposase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JAS9_9ALTE Length = 181 Score = 88.1 bits (217), Expect = 7e-17, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 33/87 (37%), Gaps = 4/87 (4%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ---LQFTYTASQPGTHQKIIDMAMNG 63 CP C + +R G S QRY C C KT+ Y + + ++ G Sbjct: 56 QCPYCQ-SKTFIRWGSSENERQRYRCKRCAKTFNALVGSPLYRMRKEELWLEYVETMRYG 114 Query: 64 VGCRATARIMGVGLNTILRHLKNSGRS 90 + R A++ GV L T R S Sbjct: 115 LSLRKAAKVTGVSLRTAFRWRHAFLSS 141 >UniRef50_Q116V8 Insertion element protein n=2 Tax=Oscillatoriales RepID=Q116V8_TRIEI Length = 108 Score = 86.6 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 38/81 (46%), Gaps = 2/81 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C + +V+NG G Q YLC C + ++ + + M ++G+G Sbjct: 1 MHCPYCQ-SHKIVKNGH-RNGKQSYLCRKCGRQFRENPCPIGYSSEVKEACLKMFLSGMG 58 Query: 66 CRATARIMGVGLNTILRHLKN 86 RA R G+ N++L ++ Sbjct: 59 FRAIERATGISHNSVLNWVRR 79 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 85.0 bits (209), Expect = 7e-16, Method: Composition-based stats. Identities = 67/73 (91%), Positives = 67/73 (91%) Query: 18 VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGL 77 VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGL Sbjct: 2 VRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGL 61 Query: 78 NTILRHLKNSGRS 90 NTILRHL Sbjct: 62 NTILRHLNKLRPQ 74 >UniRef50_B8HXD1 Insertion element protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXD1_CYAP4 Length = 95 Score = 83.1 bits (204), Expect = 2e-15, Method: Composition-based stats. Identities = 30/94 (31%), Positives = 47/94 (50%), Gaps = 4/94 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHC---RKTWQLQFTYTASQPGTHQKII 57 M + PSC ++D VV+ + T G QRY C + R T+ Q+ Y Q+I+ Sbjct: 1 MVLEPVLYPSCGSSD-VVKPRQLTEGIQRYKCRNAEWSRCTFIRQYAYRGYLVEVKQQIV 59 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 +M +NG G R AR++ + T+ LK S + Sbjct: 60 EMVVNGSGTRDPARVLKISRTTVTETLKKSSSAE 93 >UniRef50_Q2S4N1 ISSru3, transposase insA n=3 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N1_SALRD Length = 92 Score = 83.1 bits (204), Expect = 2e-15, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + C C +++ +V+NG S +G Q+Y C C L +KI+ Sbjct: 1 MIKETYECRECGSSN-IVKNGHSASGSQQYHCKDCGAHKVLDPEPRGYSEEEKEKILRAY 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKN 86 RA +RI G+ NT+ R LK Sbjct: 60 RERGSKRAISRIFGISRNTLTRWLKK 85 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 83.1 bits (204), Expect = 2e-15, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 37/85 (43%), Gaps = 2/85 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 I CP CS + +NG G Q Y+C C + + + + + +NG+ Sbjct: 10 PIQCPDCSC-QHIPKNGHQP-GKQNYICVACSHQFIKPYHPQEYSDNVKRLFLRIYVNGM 67 Query: 65 GCRATARIMGVGLNTILRHLKNSGR 89 G R A + GV TI+ +K++ Sbjct: 68 GIRRIAWVKGVTYPTIINLIKHTRE 92 >UniRef50_C6GT28 Putative transposase n=1 Tax=Streptococcus suis BM407 RepID=C6GT28_STRS4 Length = 341 Score = 82.7 bits (203), Expect = 3e-15, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 35/84 (41%), Gaps = 6/84 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMNG 63 CP C ++ + RNGK G QRY+C C+KT+ + K +NG Sbjct: 54 CPLCG-SETISRNGK-YNGKQRYICKSCKKTFTDFTNSATYKSKKTLDKWLKYAKCMING 111 Query: 64 VGCRATARIMGVGLNTILRHLKNS 87 R +A+I+ + + T Sbjct: 112 YSIRKSAKIVEINIATSFFWRHKI 135 >UniRef50_D2QCU0 Insertion element protein n=7 Tax=Bacteroidetes RepID=D2QCU0_9SPHI Length = 139 Score = 82.7 bits (203), Expect = 3e-15, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 38/87 (43%), Gaps = 4/87 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA++ CP C++ D V RNG QR+ C C + + K + + Sbjct: 1 MATL--KCPKCNSVDAV-RNG-IVNQRQRFRCKKCNYNFTVGKVGKGISTYYVIKALQLY 56 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 + GV R R++G+ +++ +K Sbjct: 57 IEGVSFREIERLLGISHVSVMNWVKKY 83 >UniRef50_A3IXU3 Iso-IS1 ORF1 n=1 Tax=Cyanothece sp. CCY0110 RepID=A3IXU3_9CHRO Length = 92 Score = 82.4 bits (202), Expect = 4e-15, Method: Composition-based stats. Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 4/87 (4%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLC--SHC-RKTWQLQFTYTASQPGTHQKIIDMA 60 ++I CP C +TD VV+NG S G QRY C C R+++ ++Y + ++I M Sbjct: 5 LAIECPHCHSTD-VVKNGFSGEGKQRYFCQNKSCERRSFIRDYSYNGCRKEVKKQIPKMV 63 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 +NG G R TAR++ + T+ LK S Sbjct: 64 VNGSGIRDTARVLEISPITVASELKKS 90 >UniRef50_P73782 Transposase n=3 Tax=Synechocystis sp. PCC 6803 RepID=P73782_SYNY3 Length = 141 Score = 82.4 bits (202), Expect = 4e-15, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 40/88 (45%), Gaps = 2/88 (2%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 S CP C + VV+NG G QR+ C C+ + + + + M Sbjct: 2 STHCHCPQCGHGN-VVKNGFVK-GKQRFKCKRCQYKFTNLSKERGKLLWMKLEAVLLYMG 59 Query: 63 GVGCRATARIMGVGLNTILRHLKNSGRS 90 G+ ATA+++GV ++L +++ G + Sbjct: 60 GMSMNATAKLLGVSTQSLLNWIRDFGEA 87 >UniRef50_C1DQR5 Transposase n=5 Tax=Bacteria RepID=C1DQR5_AZOVD Length = 317 Score = 82.4 bits (202), Expect = 5e-15, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 31/93 (33%), Gaps = 5/93 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKII 57 M + SCP C +++ ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + + R A+ GV NT Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHRFLTQ 128 >UniRef50_A6CNB6 Putative uncharacterized protein n=8 Tax=Bacillus RepID=A6CNB6_9BACI Length = 335 Score = 82.0 bits (201), Expect = 5e-15, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 30/88 (34%), Gaps = 5/88 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQPGTHQKIIDM 59 + C C + V RNGK QRYLC C K++ G K M Sbjct: 49 KEGLGCIHCGSV-KVKRNGKYRE-RQRYLCRDCGKSFNELSNTPIAGTRYLGKWAKYFHM 106 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNS 87 + G A+ + + ++T Sbjct: 107 MVEGYTLPKIAKRLKIHISTAFYWRHKI 134 >UniRef50_B1QSI6 Putative transposase n=10 Tax=Clostridium butyricum RepID=B1QSI6_CLOBU Length = 336 Score = 81.6 bits (200), Expect = 8e-15, Method: Composition-based stats. Identities = 22/95 (23%), Positives = 36/95 (37%), Gaps = 8/95 (8%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSH--CRKTWQLQ----FTYTASQPGTHQK 55 SCP C ++ GK QRY C + C KT+ + Y QP + Sbjct: 29 IKEYESCPYCGC-KHFIKYGK-YQDIQRYKCKNEECGKTFSNTTFSVWKYLKYQPEKWIE 86 Query: 56 IIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 I++ G+ ++ARI+ + T + Sbjct: 87 FIELMCEGMTLESSARILKITTTTAFYWRHKILHA 121 >UniRef50_C3L432 Putative uncharacterized protein n=16 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L432_AMOA5 Length = 118 Score = 80.8 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 22/86 (25%), Positives = 40/86 (46%), Gaps = 2/86 (2%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGV 64 +++CP C+ ++G G QRY C CR + + T +K + + + G+ Sbjct: 3 TMNCPRCNNAHSC-KDG-IVRGRQRYQCKSCRFRYTVSHKSDVKPLSTKRKALQLYLEGL 60 Query: 65 GCRATARIMGVGLNTILRHLKNSGRS 90 G RA RI+ + T+ + +K G Sbjct: 61 GFRAIGRILNISYGTVYQWVKACGDQ 86 >UniRef50_A9VV42 Insertion element protein n=7 Tax=Bacillus RepID=A9VV42_BACWK Length = 342 Score = 80.8 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 34/91 (37%), Gaps = 5/91 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT---YTASQPGTHQKIIDM 59 CP C A++ VVR GK QRY C C KT+ Y + +D Sbjct: 51 KEGFECPHC-ASEHVVRFGK-HNNRQRYRCKCCSKTFTDTTNTVLYRTRKGNEWITFVDC 108 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 G R +A I+GV T+ + Sbjct: 109 MFKGYSLRKSAEIVGVTWVTLFYWRHKLLSA 139 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 80.4 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 1/85 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I+CP C V++NG + Q+Y C C + +T A + +QK + + G+G Sbjct: 12 INCPKCKE-KKVIKNGTTKNNKQQYYCKMCFYRFIQNYTNQAYKLDINQKNVQLTKEGLG 70 Query: 66 CRATARIMGVGLNTILRHLKNSGRS 90 R+TARI+ + T+L+ + + GR Sbjct: 71 IRSTARILEISATTLLKRIVSIGRK 95 >UniRef50_Q81ZP0 Putative uncharacterized protein n=2 Tax=Nitrosomonas europaea RepID=Q81ZP0_NITEU Length = 323 Score = 80.0 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 31/89 (34%), Gaps = 5/89 (5%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ---LQFTYTASQPGTHQKIID 58 +S CP C ++ R G AG QR+ C C+ T+ Sbjct: 43 SSFEPICPVCQ-SNHFYRWGY-QAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA 100 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNS 87 + G+ RA+AR + NT R Sbjct: 101 ALIEGLTVRASARQCRIDKNTSFRWRHRF 129 >UniRef50_Q9AMR3 Putative transposase n=1 Tax=Azotobacter vinelandii RepID=Q9AMR3_AZOVI Length = 214 Score = 80.0 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 31/93 (33%), Gaps = 5/93 (5%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKII 57 M + SCP C +++ ++ S+ G RY C C KT + Q Sbjct: 38 MIATPSSCPHCQSSE--LQPWGSSGGLPRYRCKFCGKTSNPLTGTPMARLRKRHLWQGYA 95 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + + R A+ GV NT Sbjct: 96 EALTQSLTVRRAAKHCGVSKNTAFLWRHRFLTQ 128 >UniRef50_A7HSL0 Insertion element protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HSL0_PARL1 Length = 342 Score = 79.3 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 32/94 (34%), Gaps = 11/94 (11%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCS-----HCRKTWQ---LQFTYTASQPGTHQKIIDM 59 CP C D +V++G+ G QR+ C C +T+ +P M Sbjct: 55 CPHCG-HDDIVKHGRDRGGRQRFRCRRSGSSGCGQTFNALTGTAFTRMRKPEKWAAYARM 113 Query: 60 AMNGVGCRATARI--MGVGLNTILRHLKNSGRSR 91 G + +G+ T R R++ Sbjct: 114 MATGFKSVDDVKTSGLGISRLTAWRWRHRLLRAQ 147 >UniRef50_Q1V9Z0 Putative transposase A n=2 Tax=Gammaproteobacteria RepID=Q1V9Z0_VIBAL Length = 88 Score = 79.3 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 32/86 (37%), Positives = 47/86 (54%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + + C C +D VV++G GHQRY C C +T+Q+ + Y A +PG +II+M Sbjct: 1 MTTNNPHCHFCCKSDSVVKHGYGPKGHQRYRCLSCCRTFQVNYCYEACKPGIRSRIIEMT 60 Query: 61 MNGVGCRATARIMGVGLNTILRHLKN 86 G RAT+R + V NT+L Sbjct: 61 AQNHGKRATSRHLQVSYNTVLSACHR 86 >UniRef50_C4IIL3 Putative transposase n=2 Tax=Clostridium butyricum RepID=C4IIL3_CLOBU Length = 325 Score = 79.3 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 31/89 (34%), Gaps = 6/89 (6%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKII 57 CP C + ++ GK G QRY C C+KT+ + Y P K I Sbjct: 29 IKEYSCCPHCKNVE-FIKFGK-YDGIQRYRCKSCKKTFSYTTNSLWKYLKHPPEKWFKFI 86 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHLKN 86 ++ A+ + + + T Sbjct: 87 ELLGEKKTLEYCAKTLKISIVTAFNWRHK 115 >UniRef50_B6ARX2 Probable transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6ARX2_9BACT Length = 133 Score = 78.5 bits (192), Expect = 6e-14, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 31/89 (34%), Gaps = 5/89 (5%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQPGTHQKIIDM 59 S CP C + V + G+ G QRY C CR+ + + ++ Sbjct: 47 SEHPRCPHCQD-EHVAKWGRVK-GLQRYRCEACRRQFTPLTNTPLSGLRKREKWGAYLEA 104 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNSG 88 +G+ R A+ +GV T Sbjct: 105 MEDGLSVRKAAQRIGVNHKTTFLWRHRFS 133 >UniRef50_C6X4X6 Putative uncharacterized protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X4X6_FLAB3 Length = 169 Score = 78.5 bits (192), Expect = 6e-14, Method: Composition-based stats. Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 2/85 (2%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGC 66 +CP C VV++G QR+LC C + ++ K + + + G+ Sbjct: 35 TCPKCQQ-QNVVKSGIVKE-RQRFLCRSCNYYFTVKKLGKQIDDYYVTKALQLYLEGLSY 92 Query: 67 RATARIMGVGLNTILRHLKNSGRSR 91 R RI+GV TI ++ R Sbjct: 93 REIERILGVSHVTISSWVRKYNIKR 117 >UniRef50_Q891N5 Putative transposase n=1 Tax=Clostridium tetani RepID=Q891N5_CLOTE Length = 279 Score = 77.7 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 6/87 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMNG 63 C C ++ +V+NGK QRY+C C KT+ +Y+ + + G Sbjct: 59 CVHC-KSENIVKNGKYKE-KQRYICKDCHKTFTNYTNSPISYSKKNISKWIEYTKCMLAG 116 Query: 64 VGCRATARIMGVGLNTILRHLKNSGRS 90 R +++++G+ L+T S Sbjct: 117 YSLRKSSKLVGISLSTAFYWRHKILNS 143 >UniRef50_B0ABB1 Putative uncharacterized protein n=2 Tax=cellular organisms RepID=B0ABB1_9CLOT Length = 454 Score = 77.3 bits (189), Expect = 1e-13, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 36/89 (40%), Gaps = 6/89 (6%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIID 58 + CP C + D + +NGK+ QRY+C +CR T+ + T T K Sbjct: 136 KNDLKCPKCGSFD-LNKNGKT-NQRQRYICKNCRTTFDERSFSPLSNTKLSLDTWLKYCQ 193 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNS 87 + G + A+ +GV + T Sbjct: 194 FMIEGGTIKYCAQKVGVSIPTSFFMRHRI 222 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 77.3 bits (189), Expect = 1e-13, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 5/80 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCR 67 CP+C + ++NG G + C C + + + T T Q I + + G+ R Sbjct: 37 CPNCG-SHHTIKNGSIHNGKPKRQCKECGRQFVINPTNKTVSDETKQLIDKLLLEGISLR 95 Query: 68 ATARIMGVGLNTILRHLKNS 87 AR+ G L+N Sbjct: 96 VIARVTGAS----WSWLQNY 111 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 76.6 bits (187), Expect = 2e-13, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 3/85 (3%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 + +CPSC +D V++NG S+ G +Y C+ CR+T+ ++I+ +N Sbjct: 67 IRPNCPSC-KSDKVIKNG-SSRGKTKYKCNVCRRTFYDA-NSRRMSREQKERILKEYLNR 123 Query: 64 VGCRATARIMGVGLNTILRHLKNSG 88 + R A++ G L T+ +K G Sbjct: 124 MSMRGIAKVEGKPLTTVYSLIKRKG 148 >UniRef50_A8ZMX8 Putative uncharacterized protein n=5 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMX8_ACAM1 Length = 134 Score = 76.6 bits (187), Expect = 2e-13, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 35/93 (37%), Gaps = 9/93 (9%) Query: 6 ISCPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGT----HQKII 57 + CP C ++ +++ G + QRY C C + + + ++ T I Sbjct: 1 MECPYCQ-SEKILKRGFDSLQDGTLVQRYQCKDCNRRFNERTGTPMARLRTASSVVSYAI 59 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 G+G R+ R G TI+R K Sbjct: 60 KARTEGMGVRSAGRTFGKSHTTIMRWEKRLADQ 92 >UniRef50_Q2GDS3 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDS3_NEOSM Length = 134 Score = 76.2 bits (186), Expect = 3e-13, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 3/85 (3%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + CP C++ +++GK+ QRY C +C + + A + + ++G+ Sbjct: 1 MHCPKCNSV-RFIKSGKAKE-KQRYKCLNCGCQFSRNEKHGA-PLRLKMHAVQLFLSGIS 57 Query: 66 CRATARIMGVGLNTILRHLKNSGRS 90 + A+I V T++R + S Sbjct: 58 MNSIAKIFSVSPPTVMRWVNQFSDS 82 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 76.2 bits (186), Expect = 3e-13, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 3/88 (3%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDM 59 ++ I CP+C +D + +NG + G Q Y C C++ + TY KI + Sbjct: 3 ITLYIKCPAC-LSDNIKKNGFKSYGKQNYKCKDCKRQFIGDHALTYQGCHSQKDSKIRYL 61 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNS 87 + G G + A + + +L LK Sbjct: 62 MVRGSGIKDIACVERISKGKVLATLKKC 89 >UniRef50_Q97IP3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IP3_CLOAB Length = 171 Score = 75.4 bits (184), Expect = 5e-13, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 31/92 (33%), Gaps = 9/92 (9%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIID 58 V + C + RNGK QRY+C C+KT+ + + + Sbjct: 50 KVYLHC----KLEMFSRNGKHDE-KQRYVCKTCKKTFTDFTYSPISSSKKPLDKWLQYAK 104 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + G R A+ + + + T + Sbjct: 105 CMIVGYSIRKCAKTVNINIATSFFWRHKILEA 136 >UniRef50_C1I4B6 Putative uncharacterized protein n=2 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4B6_9CLOT Length = 361 Score = 75.0 bits (183), Expect = 6e-13, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 36/86 (41%), Gaps = 8/86 (9%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLC--SHCRKTWQLQ----FTYTASQPGTHQKIIDMAM 61 CP+C+ ++ ++ GK G QR+ C C KT+ + F+ + K + + Sbjct: 57 CPNCN-SNNFIKYGK-YRGLQRFKCLNKDCCKTFSQKTNSIFSNSKKPLELWLKYLILMN 114 Query: 62 NGVGCRATARIMGVGLNTILRHLKNS 87 N R + I+G+ L T Sbjct: 115 NKFSLRKCSSILGINLATSFYWRHKF 140 >UniRef50_Q64EP4 Putative uncharacterized protein n=10 Tax=environmental samples RepID=Q64EP4_9ARCH Length = 164 Score = 75.0 bits (183), Expect = 6e-13, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 32/91 (35%), Gaps = 6/91 (6%) Query: 5 SISCPSCSA--TDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIID 58 +C +VR G G QR+ C C K + F I Sbjct: 24 DPNCRDYGKRGEGNIVRYGHDKNGRQRFKCKTCGKVFVETKNTVFYNRKLSEDQIILICK 83 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 + + G RA RIM + +TI +K+ R Sbjct: 84 LLVEKNGIRAIERIMEIHRDTISDVVKDLAR 114 >UniRef50_UPI000196AFFE hypothetical protein CATMIT_00334 n=2 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196AFFE Length = 357 Score = 75.0 bits (183), Expect = 7e-13, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 32/84 (38%), Gaps = 5/84 (5%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNG 63 CP C + +NGK HQRY+C C K++ + F ++ I++ + Sbjct: 50 CPICGSV-HFKKNGKDKNRHQRYICLDCHKSFSDRTNTLFYWSHFTLDQWLHFIELELYK 108 Query: 64 VGCRATARIMGVGLNTILRHLKNS 87 + A+++ T Sbjct: 109 MPLEGEAQVLETSKTTCFYMRHKL 132 >UniRef50_B9TDK1 Putative uncharacterized protein n=2 Tax=Ricinus communis RepID=B9TDK1_RICCO Length = 321 Score = 74.6 bits (182), Expect = 8e-13, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 28/83 (33%), Gaps = 5/83 (6%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKT---WQLQFTYTASQPGTHQKIIDMAMNGV 64 CP C R G+ +G QR+ C HC ++ + + + Sbjct: 52 CPHCGCARK-HRCGQ-ASGLQRFRCLHCGRSHNALTKTPLARLRKKECWLPYLQCVLESR 109 Query: 65 GCRATARIMGVGLNTILRHLKNS 87 R A+I+GV T R Sbjct: 110 TVRDAAQIVGVHRTTSFRWRHRF 132 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 74.6 bits (182), Expect = 8e-13, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 1/79 (1%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C +D V +NG + Q + C C K W T + + + V Sbjct: 1 MRCTHCG-SDLVKKNGYTRHEKQNFRCLECGKQWSENKEAKIINEQTKELVRKALLEKVS 59 Query: 66 CRATARIMGVGLNTILRHL 84 RI V + +L + Sbjct: 60 LNGICRIFDVSMPWLLDFI 78 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 36/84 (42%), Gaps = 2/84 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C ++ + NGK G QRY C C + + I + + +G Sbjct: 1 MECKGC-KSNKTINNGK-VRGKQRYNCKSCGFNFVEVDERRGKNIDKQRMAIHLYLENMG 58 Query: 66 CRATARIMGVGLNTILRHLKNSGR 89 RA R++GV +L+ ++ +G Sbjct: 59 FRAIGRVLGVSNLAVLKWIRAAGE 82 >UniRef50_Q97IZ3 Zn-finger DNA-binding domain n=1 Tax=Clostridium acetobutylicum RepID=Q97IZ3_CLOAB Length = 142 Score = 73.9 bits (180), Expect = 1e-12, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 6/84 (7%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMNG 63 CP C ++ + RN K G Q Y+C C+K++ + K +NG Sbjct: 54 CPICG-SETISRNSK-YNGKQGYICKSCKKSFTDFTNSATYKSKKTLDKWLKYAKCMVNG 111 Query: 64 VGCRATARIMGVGLNTILRHLKNS 87 R +A+++ + + T Sbjct: 112 YSIRKSAKVVEINIATSFFWRHKI 135 >UniRef50_C9WIW2 Transposase n=4 Tax=Firmicutes RepID=C9WIW2_CLOPE Length = 348 Score = 73.9 bits (180), Expect = 2e-12, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 37/85 (43%), Gaps = 6/85 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQKIIDMAMN 62 CP C D V +NGKS G QRY+C CR ++ F+ T K ++ + Sbjct: 52 ECPKCQCKD-VNKNGKS-NGRQRYICKRCRTSFDEFTMSPFSNTKLGLDKWIKYCELMIL 109 Query: 63 GVGCRATARIMGVGLNTILRHLKNS 87 G+ R A +GVG+ T Sbjct: 110 GLSIRKCAEEVGVGVKTSFYMRHRI 134 >UniRef50_A1SXI4 Conserved hypothetical transposase n=18 Tax=Gammaproteobacteria RepID=A1SXI4_PSYIN Length = 319 Score = 73.5 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 17/89 (19%), Positives = 29/89 (32%), Gaps = 5/89 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTY---TASQPGTHQKIIDMAMN 62 CP C + GK+ + QRY C C KT+ + + K + Sbjct: 53 PQCPHCHCA-HFTKWGKAGS-VQRYKCFSCHKTFNNKTKTPLAKLHRCELWDKYAECMSL 110 Query: 63 GVGCRATARIMGVGLNTILRHLKNSGRSR 91 + R A + + L T ++ Sbjct: 111 KLTLREAAAVCNINLKTSFLWRHRFLMAQ 139 >UniRef50_B5K5I7 Transposase n=4 Tax=Octadecabacter antarcticus 238 RepID=B5K5I7_9RHOB Length = 319 Score = 73.5 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 32/87 (36%), Gaps = 5/87 (5%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ---LQFTYTASQPGTHQKIIDMAMNG 63 +CP C+A V+R + G +RY C C KT+ + +G Sbjct: 50 NCPHCAAGGAVIR--GRSNGLKRYFCKICSKTFNALTGTPLARLRHKDCWTEFAGSLSDG 107 Query: 64 VGCRATARIMGVGLNTILRHLKNSGRS 90 + +A GV +T R R+ Sbjct: 108 DTVKTSAARCGVASSTAFRWRHRFLRA 134 >UniRef50_Q6AKY5 Probable transposase InsA n=1 Tax=Desulfotalea psychrophila RepID=Q6AKY5_DESPS Length = 101 Score = 73.1 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 6/85 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 +SC C TD V R+GK + G+QR+ CS C++T+QL++ Y A + + G Sbjct: 1 MSCRFCGGTDEVRRHGKDSNGNQRFRCSDCKRTFQLEYPYVADRHE------RYSPGNAG 54 Query: 66 CRATARIMGVGLNTILRHLKNSGRS 90 R TAR++ VG + R K + R Sbjct: 55 IRDTARVLKVGCMGLTRFRKLNPRQ 79 >UniRef50_A8ZNT7 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZNT7_ACAM1 Length = 188 Score = 73.1 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 41/93 (44%), Gaps = 9/93 (9%) Query: 6 ISCPSCSATDGVVRNG----KSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + C C ++ VV+NG K+ Q +LC C + + + ++ T + I MA+ Sbjct: 1 MQCIHCQ-SENVVKNGTKTLKTAQVVQYFLCKDCGRRFNERSGTPMARLRTPVETISMAI 59 Query: 62 ----NGVGCRATARIMGVGLNTILRHLKNSGRS 90 G+G RA R++ N+I+ K Sbjct: 60 NARTEGLGIRAAGRVLRKSPNSIILWEKRLSAQ 92 >UniRef50_B6B4C9 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B4C9_9RHOB Length = 321 Score = 72.7 bits (177), Expect = 3e-12, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 7/90 (7%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT----HQKIIDMAMN 62 +CP C A D R G++ AG QRY C C KT+ + + +Q +Q + DM + Sbjct: 49 TCPHCGAVDR-QRWGRTRAGSQRYRCQGCLKTFNGRTGSSIAQLQKLDQFYQVLKDMFSD 107 Query: 63 G--VGCRATARIMGVGLNTILRHLKNSGRS 90 G R AR + V +TI R +S Sbjct: 108 GPPRSIRRLARQLDVNKDTIWRWRLLIFQS 137 >UniRef50_Q8D3Z3 Transposase n=48 Tax=Vibrionales RepID=Q8D3Z3_VIBVU Length = 507 Score = 72.3 bits (176), Expect = 4e-12, Method: Composition-based stats. Identities = 13/86 (15%), Positives = 32/86 (37%), Gaps = 4/86 (4%) Query: 7 SCPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMN 62 C + + ++ G QRY C C+ T+ +++ + + ++ + Sbjct: 103 DCANFGLSVHTHKHLYHAFGYSGDRQRYRCKSCQSTFVDKWSGANKKLQFQENLMGLLFT 162 Query: 63 GVGCRATARIMGVGLNTILRHLKNSG 88 G R R + + T H+++ Sbjct: 163 GYSVREICRKLAINPKTFYDHVEHIA 188 >UniRef50_A5FST1 Integrase, catalytic region n=1 Tax=Dehalococcoides sp. BAV1 RepID=A5FST1_DEHSB Length = 319 Score = 72.3 bits (176), Expect = 4e-12, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 33/94 (35%), Gaps = 9/94 (9%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ--PGTHQKIIDMAM 61 + I C C + R G S A QR+LC+ C T+ + P + M Sbjct: 6 LPIECKYCG-SRHTRRYGHSRAQKQRWLCNDCCHTFVETSAQPGMRTPPEQIGAAVSMFY 64 Query: 62 NGVGCRATAR----IMGVGLN--TILRHLKNSGR 89 G+ A R I + + T+ + + Sbjct: 65 EGLSLSAICRQMKQIHNISPSDGTVYGWITKYSK 98 >UniRef50_C6W2G4 Transcriptional regulator, LacI family n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W2G4_DYAFD Length = 388 Score = 71.9 bits (175), Expect = 5e-12, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 32/80 (40%), Gaps = 6/80 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 I C C+ DG+++ G G QRYLC C + + + +K Sbjct: 2 IECVKCAQVDGIMKAGY-VRGKQRYLCKWCNYYFTHAEKDDSIESLVKRKRHQ-----TT 55 Query: 66 CRATARIMGVGLNTILRHLK 85 A+ +GV +T+ R L Sbjct: 56 IIDIAKSLGVSNSTVSRALH 75 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 71.2 bits (173), Expect = 9e-12, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 35/85 (41%), Gaps = 2/85 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 ++C +C + R GK G QRY C C ++Q + Y A +I + Sbjct: 1 MNCKNCDQAHCIKR-GK-RNGIQRYYCKICFTSFQENYHYKAYDSSIDTLLISLLRECCS 58 Query: 66 CRATARIMGVGLNTILRHLKNSGRS 90 AR++ + NT+L + + Sbjct: 59 VLGIARVLKISKNTVLSRMLKISKQ 83 >UniRef50_Q4FRR6 Putative transposase n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FRR6_PSYA2 Length = 108 Score = 71.2 bits (173), Expect = 9e-12, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 35/85 (41%), Gaps = 3/85 (3%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDM 59 + ISCP C + + +NG + G Q Y C C++ + TY +I M Sbjct: 3 TQIDISCPDCHSI-SLKKNGIKSYGKQNYQCKDCQRQFIGDHALTYQGCHSRIEDRIRLM 61 Query: 60 AMNGVGCRATARIMGVGLNTILRHL 84 G G R A I V + +L L Sbjct: 62 TARGCGIRDIAVITSVSIGKVLSTL 86 >UniRef50_A8V2B8 Hypothetical insertion element protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V2B8_9AQUI Length = 125 Score = 71.2 bits (173), Expect = 1e-11, Method: Composition-based stats. Identities = 22/86 (25%), Positives = 36/86 (41%), Gaps = 2/86 (2%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 I CP C ++ + GK+T G QRY C+ C + + Y + M G Sbjct: 12 EHIKCPECG-SNWCKKFGKNT-GKQRYKCNECGRHFYEGAKYHKHPEKVKLLALKMYSKG 69 Query: 64 VGCRATARIMGVGLNTILRHLKNSGR 89 + A AR++ + T+ R G+ Sbjct: 70 MSKSAIARVLNLPYRTVARWTYEVGK 95 >UniRef50_Q4JT92 Transposase for IS3503f n=4 Tax=Corynebacterium jeikeium K411 RepID=Q4JT92_CORJK Length = 165 Score = 71.2 bits (173), Expect = 1e-11, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 29/85 (34%), Gaps = 2/85 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + SCP C + +NG ++ R+ C+HC ++ T I A Sbjct: 1 MTTNRPSCPLCG--NNTKKNGTTSKSTTRWRCTHCGHSFTRNTQTHNKNTATMALFIQWA 58 Query: 61 MNGVGCRATARIMGVGLNTILRHLK 85 A GV T+ + Sbjct: 59 TGTQSLTTFAAHHGVTRQTMHHRFR 83 >UniRef50_D0WF86 Putative Zn-finger DNA-binding domain protein n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WF86_9ACTN Length = 243 Score = 70.0 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 25/86 (29%), Gaps = 5/86 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAM 61 CP C + +GK+ G +RY C C + F +I ++ Sbjct: 52 PVCPDCGSVRP-RLDGKAPNGARRYRCRECGCRFSALTGTIFADAKLPLHKIMRIAEVMC 110 Query: 62 NGVGCRATARIMGVGLNTILRHLKNS 87 + R + V T Sbjct: 111 HSASLRLMELVAEVSHGTAFLWRHKV 136 >UniRef50_C8SD87 Insertion element protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SD87_FERPL Length = 94 Score = 68.5 bits (166), Expect = 6e-11, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 38/91 (41%), Gaps = 5/91 (5%) Query: 6 ISCPSCSATDGVVR---NGKSTAGHQRYLCSHCRKTWQLQF-TYTASQPGTHQKIIDMAM 61 + CP C + V + KS QRY C +C +T+ L + ++ + Sbjct: 1 MMCPHCKSIKTVKMGCYHTKSGERRQRYKCKNCGRTFVLNPIKPRNYPEEFKEMVVKAVV 60 Query: 62 -NGVGCRATARIMGVGLNTILRHLKNSGRSR 91 GVG R +RI + NT+ ++ + R Sbjct: 61 REGVGVRQASRIFKLSPNTVTAWVREFSKKR 91 >UniRef50_A0L9I6 Insertion element protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L9I6_MAGSM Length = 89 Score = 68.1 bits (165), Expect = 8e-11, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 6/90 (6%) Query: 1 MAS--VSISCPSCSATDGVVRNGKSTAGHQRYLCSH--CRKT-WQLQFTYTASQPGTHQK 55 MA+ V + CP C + D V++ GK G QR+ C+ C +T + + ++ Sbjct: 1 MATMEVHVHCPDCGSLD-VIKFGKDRHGRQRFRCNDHFCDRTIFMMDDPDWWRFEEVKKQ 59 Query: 56 IIDMAMNGVGCRATARIMGVGLNTILRHLK 85 I ++G G TA +G+ + R K Sbjct: 60 IALHLLSGNGIHQTAHNLGLHPEFVNRMAK 89 >UniRef50_C0BSX6 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BSX6_9BIFI Length = 352 Score = 67.7 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 35/86 (40%), Gaps = 5/86 (5%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIIDMAMNG 63 C C + ++R G+ G QR+ C +C +T+ + + G + ++ ++ Sbjct: 55 CVRCGSI-RIIRKGRGRDGSQRWKCMNCNRTFGVRTNRVMGMSKLKAGVWMRFLECFVDC 113 Query: 64 VGCRATARIMGVGLNTILRHLKNSGR 89 + R A+ GV L T + Sbjct: 114 LSLRKCAQRCGVCLKTAFLMRQRVIE 139 >UniRef50_C3PIK6 Putative transposase n=5 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PIK6_CORA7 Length = 403 Score = 67.7 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 4/88 (4%) Query: 1 MASVS-ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM 59 MA+ + SC C G+V+NGK+ AG QR+LC C + +T+ ID Sbjct: 1 MANRNRPSCDMCG--HGLVKNGKTAAGTQRWLCPQCNVSSINTRAHTSDIRHFKI-FIDW 57 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNS 87 ++G A+ +GV T+ R K Sbjct: 58 ILSGESADHLAKRLGVTRRTLTRWFKLL 85 >UniRef50_A4SSR9 InsA n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SSR9_AERS4 Length = 91 Score = 67.3 bits (163), Expect = 1e-10, Method: Composition-based stats. Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 1/64 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MAS++I CP C+ +D V R+GK+ AG+ RY C C +QL +TY A P + ++++ Sbjct: 10 MASITIHCPRCN-SDHVYRHGKTPAGNIRYRCPACPHVFQLTYTYEARNPASKRRLLIWR 68 Query: 61 MNGV 64 G+ Sbjct: 69 STGL 72 >UniRef50_C0QU68 Putative transposase n=1 Tax=Persephonella marina EX-H1 RepID=C0QU68_PERMH Length = 94 Score = 67.3 bits (163), Expect = 1e-10, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 2/87 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M ISCP C ++ V+NGK G Q YLC C + + + ++ +++ Sbjct: 1 MGGKKISCPHCE-SERCVKNGK-ANGKQTYLCKECYYRFTINASKRKYPFKIRREAVNLY 58 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 G ++ + + + TI +K Sbjct: 59 KEGYTLTEISKKLNIKVQTIHHWVKKY 85 >UniRef50_P04137 Uncharacterized protein in transposable element ISH50 n=11 Tax=Halobacteriaceae RepID=YIH50_HALSA Length = 294 Score = 67.3 bits (163), Expect = 2e-10, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 7/90 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAM 61 + CPSC + V+R G S QRYLC C +T+ Q F ++A + + Sbjct: 26 VYCPSC-RAESVIRYG-SYRVFQRYLCKDCDRTFNDQTGTVFEHSAVALRKWFLAVYTYI 83 Query: 62 N-GVGCRATARIMGVGLNTILRHLKNSGRS 90 R + V T+ R ++ R+ Sbjct: 84 RLNTSIRQLDAEIDVSYKTVYRRVQRFLRA 113 >UniRef50_B5ETH8 Transposase n=15 Tax=Vibrionaceae RepID=B5ETH8_VIBFM Length = 489 Score = 66.9 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 27/89 (30%), Gaps = 4/89 (4%) Query: 4 VSISCPSCSATDGVVRNGKSTAG----HQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM 59 + C R G QRY C C T+ +++ + QK++ Sbjct: 82 NNSECEHFGFDVLTHRELYHAFGYSGDRQRYRCKSCASTFVDKWSGENQKSLIQQKLLGF 141 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNSG 88 G R R + + T H+ Sbjct: 142 LFTGYSVREICRRLHINPKTFYDHINQIA 170 >UniRef50_Q1VPB4 Putative uncharacterized protein n=4 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VPB4_9FLAO Length = 343 Score = 66.9 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 26/86 (30%), Gaps = 5/86 (5%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA---SQPGTHQKIIDMAMNGV 64 CP C + VR G G QRY C C +++ + + + + Sbjct: 51 CPHC-LHEKYVRFGVDK-GSQRYKCKSCNRSFTEYTGTWMAGLQRKDMISSYLSLMVQEK 108 Query: 65 GCRATARIMGVGLNTILRHLKNSGRS 90 + +G+ T S Sbjct: 109 SLDKISSELGINKKTAFDWRHKILAS 134 >UniRef50_B2GC72 Transposase n=11 Tax=Lactobacillus RepID=B2GC72_LACF3 Length = 428 Score = 66.2 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 35/99 (35%), Gaps = 21/99 (21%) Query: 7 SCPSCSATDGVVRNGKSTA-----------------GHQRYLCSHCRKTWQLQF----TY 45 CP C D ++NG S QR C +C+ ++ + Y Sbjct: 44 RCPHCGFADTFIKNGHSYQTIKYLSINESCPTMLRIDKQRLRCKNCQDSFMAKTNVVDKY 103 Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 + K + M + V + ++ GV +TI R L Sbjct: 104 CSIAKAVKHKALTMLESNVSQKDVSKFTGVSPSTIGRLL 142 >UniRef50_UPI0001BC5CBF transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5CBF Length = 184 Score = 66.2 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 38/106 (35%), Gaps = 21/106 (19%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQF- 43 + + +CP C + V+NG T+ QR+LC C ++ L+ Sbjct: 42 LTKDTCACPHCH-SQTTVKNGFKTSKVRYLPFQNYPIIIALKKQRFLCKECHHSFTLETP 100 Query: 44 ---TYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKN 86 Y + +++ + A+ + + T+ R LK Sbjct: 101 IVKKYASISQTLKLSVLNSLQENMSLSLIAKQHRISIPTVQRILKQ 146 >UniRef50_UPI0001C348D8 hypothetical protein PretD1_11772 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C348D8 Length = 467 Score = 65.8 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 36/89 (40%), Gaps = 3/89 (3%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M ++ +CPSC +T+ + + G + G RY C +C + L K+I+ Sbjct: 67 MKNIEKACPSCYSTENI-KYGTTAIGTVRYQCKNCNNVYSL--KNLNKFDDVDNKLIESL 123 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGR 89 + + + + + R L+N Sbjct: 124 LKNTKVSTIFKELKITPASFYRRLENINE 152 >UniRef50_B4WSN9 Insertion element protein n=3 Tax=Cyanobacteria RepID=B4WSN9_9SYNE Length = 83 Score = 65.0 bits (157), Expect = 7e-10, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 5/84 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT----HQKIIDMAM 61 + CP C ++GK++ G QRY C+ CR+T+ F + + I+ + Sbjct: 1 MDCPFCDHPTP-HKHGKTSKGSQRYRCTACRRTFTETFDTLYDRRQVTSEQVKLILQTYV 59 Query: 62 NGVGCRATARIMGVGLNTILRHLK 85 G R +RI T++ ++ Sbjct: 60 EGSSLRGISRIGKRAYGTVVDIVR 83 >UniRef50_C5U8R8 Insertion element protein (Fragment) n=2 Tax=Methanocaldococcus infernus ME RepID=C5U8R8_9EURY Length = 100 Score = 65.0 bits (157), Expect = 7e-10, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 43/91 (47%), Gaps = 6/91 (6%) Query: 6 ISCPSCSATDGVVRNGKSTAGH----QRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 I C C+ +D VV+ GK + Q YLC C++ + + +K++ + Sbjct: 5 IRCKYCN-SDKVVKAGKHKSEKYGVRQMYLCKKCKRRFVEESKAPRYSDSFKEKVVRSVV 63 Query: 62 -NGVGCRATARIMGVGLNTILRHLKNSGRSR 91 G+G R R+ + TILR +K+ +++ Sbjct: 64 FEGLGIRQAGRVFKLSTTTILRWIKDFKKTK 94 >UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF9_FERPL Length = 357 Score = 64.6 bits (156), Expect = 9e-10, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 32/96 (33%), Gaps = 11/96 (11%) Query: 3 SVSISCPSCSATDGVVRNGKSTAGH---QRYLCSHCRKTW--QLQFTYTASQPGTHQKII 57 +C +C D V++ G Q Y C C K + + F + + Sbjct: 81 KEERTCKNCGRDDEVIKKGIRYNKSGPVQMYYCKRCGKKFSARTGFGGMKKRAEAIVAAL 140 Query: 58 DMAMNGVGCRATARIMGVGLN------TILRHLKNS 87 D+ G+ R A+ + N T+ +K Sbjct: 141 DLYFRGLSLRQVAQHLKASYNVEVCHKTVHNWIKRY 176 >UniRef50_Q0W4F0 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4F0_UNCMA Length = 141 Score = 64.6 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 38/89 (42%), Gaps = 6/89 (6%) Query: 7 SCPSCSATDG--VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK----IIDMA 60 SC ++G VV+ G S AGHQ + C HC + + ++ I + Sbjct: 17 SCEFYLKSEGSRVVKKGFSRAGHQVFQCRHCGRHFCETINTPMYGRRITREDVILIGKLL 76 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGR 89 G RA RI G +T++R K+ R Sbjct: 77 NERNGIRAIERITGHHRDTVMRVAKDLAR 105 >UniRef50_Q2FRB5 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FRB5_METHJ Length = 138 Score = 64.2 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 6/91 (6%) Query: 5 SISCPSCSATD--GVVRNGKSTAGHQRYLCSHCRKTWQL---QFTYTASQPGTHQKII-D 58 + C D + +NG ++AG+Q+Y C HCR+ + Y + P T II Sbjct: 14 NPDCTYFQIEDGKNITKNGHNSAGNQQYYCHHCRRFFIETKNTPLYDSRLPRTAVLIIAK 73 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 + R +R+ G +TI R+ G Sbjct: 74 HSTEKTSIRGVSRVTGHHRDTISRYYHLIGE 104 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 63.5 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 1/90 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M C C+ + ++ G ++ QRY C C+K + +++Y A Q T+ I + Sbjct: 1 MNKRRNRCIHCNYS-YCIKAGITSQNKQRYQCKKCKKKFIGKYSYRAYQKSTNHNIQQLI 59 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 GVG R +R++ V T+L+ + Sbjct: 60 KEGVGIRGISRLLNVSKTTVLKKILKIASK 89 >UniRef50_Q8PSY9 Conserved protein n=2 Tax=Methanosarcina RepID=Q8PSY9_METMA Length = 146 Score = 63.1 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 24/93 (25%), Positives = 40/93 (43%), Gaps = 6/93 (6%) Query: 5 SISCPSCSATDG--VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM--- 59 + C +G +++ GK GHQRY C HC K + + ++ I M Sbjct: 12 NPKCSYYLKAEGRAIIKRGKYKTGHQRYYCKHCEKFFMDTIGTAIYRKHLSKEEIRMIYR 71 Query: 60 -AMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 + G R+ RI G +TI LK++ ++ Sbjct: 72 LFLEKNGIRSIERITGHHRDTISNLLKDTVKNE 104 >UniRef50_Q5LYW0 IS1193, transposase, ISL3 family n=185 Tax=Bacteria RepID=Q5LYW0_STRT1 Length = 448 Score = 62.7 bits (151), Expect = 4e-09, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 35/110 (31%), Gaps = 19/110 (17%) Query: 1 MASVSISCPSCSAT---DGVVRNGKSTAGHQ------------RYLCSHCRKTWQLQFTY 45 + +++ SCP C +N K + Q R+ C CR+ + + Sbjct: 15 LITLAPSCPHCQGKMIKYDFQKNSKISLLEQAGTPTLLRLKKRRFQCKSCRRVTVAETSI 74 Query: 46 TASQPGT----HQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 QK+ + V AR + V +T+ R L Sbjct: 75 VEKNCQISNLVRQKVTQLLTEKVSLTDIARRLRVSTSTVYRKLYQFTFKE 124 >UniRef50_Q32IP8 IS1 ORF1 n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IP8_SHIDS Length = 101 Score = 62.3 bits (150), Expect = 4e-09, Method: Composition-based stats. Identities = 51/52 (98%), Positives = 51/52 (98%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGV LNTILRHLKNSGRSR Sbjct: 50 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVSLNTILRHLKNSGRSR 101 >UniRef50_C6MKY8 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MKY8_9DELT Length = 632 Score = 62.3 bits (150), Expect = 4e-09, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 21 GKSTAGHQRYLCSHCRKTWQLQFTY--TASQPGTHQKIIDMAMNGVGCRATARIMGVGLN 78 G + AG QR+ C C KT+ + + G ++ + V R AR VG Sbjct: 123 GHTKAGSQRFRCKICHKTFSIPLAANLRQRKKGKSTEVFRLLTCQVAIRKMARNARVGKE 182 Query: 79 TILRHLKNSGRS 90 T+ R++ R Sbjct: 183 TVHRYIHLIHRQ 194 >UniRef50_Q6YGS8 Iso-IS1 insA n=8 Tax=Escherichia RepID=Q6YGS8_ECOLX Length = 71 Score = 62.3 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 26/65 (40%), Positives = 45/65 (69%), Gaps = 1/65 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 MA+V++ P C+ +D V R+G+S + H+R+ C C++ +QL ++Y A +PG + I++MA Sbjct: 1 MATVTVHRPRCN-SDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPGFKELIVEMA 59 Query: 61 MNGVG 65 NG G Sbjct: 60 HNGTG 64 >UniRef50_C6MXF0 Putative uncharacterized protein n=1 Tax=Geobacter sp. M18 RepID=C6MXF0_9DELT Length = 512 Score = 61.9 bits (149), Expect = 5e-09, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 35/87 (40%), Gaps = 7/87 (8%) Query: 5 SISCPSCSAT-----DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH--QKII 57 + CP+ R G++ AG +RY C C +T+ + TA Q TH +KI Sbjct: 24 TPECPNHERDVDSHPKEYHRFGETAAGARRYRCKLCSRTFSINGKPTARQRDTHKNKKIY 83 Query: 58 DMAMNGVGCRATARIMGVGLNTILRHL 84 +N + + T+ R + Sbjct: 84 MHLVNKSPFKRICEQAEISPATLYRKI 110 >UniRef50_C9BRL5 Transposase n=2 Tax=Enterococcus faecium RepID=C9BRL5_ENTFC Length = 433 Score = 61.9 bits (149), Expect = 6e-09, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 32/107 (29%), Gaps = 23/107 (21%) Query: 1 MASVSISCPSCSATDG---VVRNGKSTA----------------GHQRYLCSHCRKTWQL 41 ++ CP C + +V+NGK + QRY C C + Sbjct: 40 LSKEIRRCPLCKQMNHEGMIVKNGKKKSLIQLNKCANQLTYLALAKQRYHCRGCHTYFTA 99 Query: 42 QFTYTASQP----GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 KI++ A+ GV +T+ R L Sbjct: 100 NTYIVDRNCFIAKQVRYKILEELTEKQAMTTIAKHCGVSWSTVSRTL 146 >UniRef50_A4WNK0 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WNK0_RHOS5 Length = 481 Score = 61.9 bits (149), Expect = 6e-09, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 1/67 (1%) Query: 19 RNGKSTAGHQRYLCSHCRKTWQ-LQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGL 77 R GK+ G R+ C C KT+ + + ++ ++DM N + +RI G+ Sbjct: 132 RFGKTKGGDARWRCKGCGKTFSVGKPARRHKRSDKNRLVLDMLCNDLSFAKMSRISGLAY 191 Query: 78 NTILRHL 84 I R + Sbjct: 192 RDIYRRV 198 >UniRef50_C0X375 Transposase n=23 Tax=Enterococcus RepID=C0X375_ENTFA Length = 446 Score = 60.4 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 37/112 (33%), Gaps = 22/112 (19%) Query: 1 MASVSISCPSC--SATDGVVRNGKST----------------AGHQRYLCSHCRKTWQLQ 42 + C C +++ G QR+ C HC KT+ + Sbjct: 42 LTYQPEECFHCHYQNKQTIIKWGWKKVSILLNDVSNYKTILRINKQRFKCKHCGKTFLAE 101 Query: 43 FTYTASQ----PGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + + + Q I+++ + AR+ + T++R L++ Sbjct: 102 DSVSDRRCSIARRVKQAILELLSEPISMSLIARMKHISPTTVIRILRSLRPK 153 >UniRef50_Q3Y3Y2 Transposase, IS204/IS1001/IS1096/IS1165 n=4 Tax=Enterococcus faecium RepID=Q3Y3Y2_ENTFC Length = 401 Score = 60.4 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 23/99 (23%), Positives = 36/99 (36%), Gaps = 21/99 (21%) Query: 7 SCPSCS-ATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYT--- 46 CP C +T +V+NGK + QRYLC C+K + + Sbjct: 46 RCPCCKDSTKQIVKNGKKISMILLNRSGNKRTYLRLKKQRYLCRACKKYFTARTYLVTPF 105 Query: 47 -ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 H KI++ +A + V + T+ R L Sbjct: 106 CFISKQIHYKILEELTERQSIKAIGKHCDVSVTTVQRTL 144 >UniRef50_Q3Y1C3 Transposase, IS204/IS1001/IS1096/IS1165 n=51 Tax=Enterococcus RepID=Q3Y1C3_ENTFC Length = 431 Score = 60.4 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 26/108 (24%), Positives = 37/108 (34%), Gaps = 27/108 (25%) Query: 7 SCPSCSATDG-------VVRNGKSTA----------------GHQRYLCSHCRKTWQLQF 43 +C +C +T VV+NGK QRY C +CR W Q Sbjct: 44 TCRNCGSTVVDGNGKVIVVKNGKKETIVRFEQYNHMPLVMRLKKQRYTCKNCRTHWTTQS 103 Query: 44 TYTASQP----GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 + + KI + V A+ V L T++R LK Sbjct: 104 YFVQPRHSIANHVRYKIASLLTEKVSLSFIAKNCQVSLTTVIRTLKEF 151 >UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae BGR1 RepID=C5A9A4_BURGB Length = 284 Score = 60.4 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 22/100 (22%), Positives = 39/100 (39%), Gaps = 15/100 (15%) Query: 1 MASVSISCPSCSATDGV-------VRNGKSTAGH-----QRYLCSHCRKTWQ---LQFTY 45 M + CP+ +NG H RY C C K + ++ + Sbjct: 1 MRNPRPVCPNPDCVHHTNPPADFYRKNGYRRTKHNGQPVPRYQCKACGKNFCATQVKPIH 60 Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLK 85 +P + ++ MA++ VG R A ++ G TI R ++ Sbjct: 61 GQHRPDLNTQVFKMAVSRVGIRRMATVLDCGRETIQRKIE 100 >UniRef50_B4WST6 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WST6_9SYNE Length = 81 Score = 60.0 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 28/71 (39%), Gaps = 1/71 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + P+C + VV+NGK G Q + C +C + + T I D+ Sbjct: 1 MLDHQPTRPACHSKQ-VVKNGKIHNGKQNHRCKNCGRQFVKDPQQKRISDATKALIDDLL 59 Query: 61 MNGVGCRATAR 71 + + ++ Sbjct: 60 LERLSMNNPSK 70 >UniRef50_Q07NT9 Putative uncharacterized protein n=2 Tax=Rhizobiales RepID=Q07NT9_RHOP5 Length = 577 Score = 60.0 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 17/89 (19%), Positives = 35/89 (39%), Gaps = 11/89 (12%) Query: 7 SCP--SCSATDGVV--------RNGKSTAGHQRYLCSHCRKTWQLQFTY-TASQPGTHQK 55 CP SC + + R+G S G RY C CRKT+ ++ + + ++ Sbjct: 103 HCPDDSCENYNKLFDSHPKSYFRHGTSAIGAPRYRCKACRKTFSVRTGHSRHRKSHENKT 162 Query: 56 IIDMAMNGVGCRATARIMGVGLNTILRHL 84 + + ++ V +I + + + Sbjct: 163 VFQLLVSKVPITKIGQITDLSPAAVYDKI 191 >UniRef50_D1JAI8 Putative uncharacterized protein n=1 Tax=uncultured archaeon RepID=D1JAI8_9ARCH Length = 192 Score = 59.6 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 36/90 (40%), Gaps = 10/90 (11%) Query: 7 SCPSC---SATDGVVR---NGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKI 56 +CP+ + ++R GK Q C C K + + G Sbjct: 17 TCPNYARLGPDNKILRAGYYGKGEKRTQMLKCKVCGKRFSIHKGTPLFNLKADEGAFYGT 76 Query: 57 IDMAMNGVGCRATARIMGVGLNTILRHLKN 86 I + G G RATARIMG+ +T+ + LK Sbjct: 77 IAHLVEGNGIRATARIMGINKDTVSKWLKK 106 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 58.9 bits (141), Expect = 5e-08, Method: Composition-based stats. Identities = 27/98 (27%), Positives = 40/98 (40%), Gaps = 17/98 (17%) Query: 4 VSISCPSCSATDGVV--RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 + CP CS+T V RNG G Q + C CR + + A + Q II A+ Sbjct: 71 IVPECPKCSSTVRVKAGRNG----GRQMFQCKQCRTRYVSR-GPGARKTRYSQDIISAAL 125 Query: 62 N----GVGCRATARIMG------VGLNTILRHLKNSGR 89 N G+ R TA + + NTI+ + + Sbjct: 126 NKVMSGMSYRKTAEEVNTAHGRDLSPNTIMFWTRKYTQ 163 >UniRef50_B0K4X0 Integrase, catalytic region n=11 Tax=Clostridia RepID=B0K4X0_THEPX Length = 343 Score = 58.5 bits (140), Expect = 6e-08, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 23/62 (37%), Gaps = 4/62 (6%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG 63 V + CP C+ T + GK G+Q+YLC C + P K + G Sbjct: 5 VPLKCPKCNNTHLFYKYGKDKDGYQKYLCRKCYHQFA----PDKPSPKKTSKYPRCPVCG 60 Query: 64 VG 65 Sbjct: 61 KS 62 >UniRef50_UPI000196CFC8 hypothetical protein CATMIT_02848 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CFC8 Length = 262 Score = 58.5 bits (140), Expect = 6e-08, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 23/82 (28%), Gaps = 5/82 (6%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW----QLQFTYTASQPGTHQKIIDMAMN 62 C C + D + G G QRY+C C K + F + + Sbjct: 97 QCLFCGSHD-FTKYGHKKDGTQRYICKGCGKRFTPLTNTIFDSKKIPISEWIEYLLHLFE 155 Query: 63 GVGCRATARIMGVGLNTILRHL 84 +TA T L Sbjct: 156 FHSINSTAYDNRNSPTTGKYWL 177 >UniRef50_C8T8A4 Putative uncharacterized protein insA n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T8A4_KLEPR Length = 83 Score = 58.5 bits (140), Expect = 6e-08, Method: Composition-based stats. Identities = 41/62 (66%), Positives = 46/62 (74%), Gaps = 1/62 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ-KIIDM 59 MAS+ + PSC+ T+GV RNGKSTAGHQ YLC CRK W L FTYT SQ THQ KIIDM Sbjct: 7 MASIYVGSPSCAVTEGVDRNGKSTAGHQHYLCRQCRKPWTLTFTYTTSQRSTHQRKIIDM 66 Query: 60 AM 61 + Sbjct: 67 TI 68 >UniRef50_D0BMT4 Transposase n=10 Tax=Lactobacillales RepID=D0BMT4_9LACT Length = 426 Score = 58.1 bits (139), Expect = 8e-08, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 34/104 (32%), Gaps = 21/104 (20%) Query: 7 SCPSCSATDGVVRNGKSTAGH----------------QRYLCSHCRKTWQLQFTYTASQP 50 SCP C ++ V+++ QR++C CRKTW Sbjct: 46 SCPYC-SSKNVIKHSPMEHKIRIPHLYGNKTLLELKVQRFICKDCRKTWVTDCPLVPKNS 104 Query: 51 GTHQ----KIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 +I+ + A+++ + T+ R +K Sbjct: 105 NISYDLACQIMLYLKENFSRKTIAKLLSISDKTVERVMKKFKIK 148 >UniRef50_Q4JSN3 Transposase for IS3507b n=53 Tax=Actinobacteridae RepID=Q4JSN3_CORJK Length = 422 Score = 58.1 bits (139), Expect = 8e-08, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 30/85 (35%), Gaps = 4/85 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+ P C + RNG ++ G R+ C HC + + + G ID Sbjct: 31 MSKNQ---PRCHCGGEMKRNGTTSKGTTRWRCKHCGASSVKRRIDITNSTGF-TAFIDHL 86 Query: 61 MNGVGCRATARIMGVGLNTILRHLK 85 G A +G T+ R + Sbjct: 87 TTGASLDTIASRVGCSPRTLQRRFE 111 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 58.1 bits (139), Expect = 9e-08, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 21/52 (40%), Gaps = 1/52 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT 52 M +I CP C ++ + + G +Q+Y C C + + + + Sbjct: 1 MNKTNIKCPRCH-SEKLYKFGFDKQANQKYQCKECGRQFAPDSVSSRPKSKY 51 >UniRef50_Q8RFT8 Transposase n=17 Tax=Fusobacterium RepID=Q8RFT8_FUSNN Length = 428 Score = 56.9 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 36/104 (34%), Gaps = 21/104 (20%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGH----------------QRYLCSHCRKTWQLQFT 44 + S +CP C ++ +V+NG QRY+C C+KT+ Sbjct: 46 LKSDYCTCPHC-SSKNIVKNGSRHRKIKYIPIQNHNIELELTVQRYICKDCKKTFSPSTN 104 Query: 45 ----YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 ++ I + + A+ + + ++ R + Sbjct: 105 IVSDNSSISNNLKYAIALELQKNISLTSIAKRYNISIPSVQRIM 148 >UniRef50_Q10ZA4 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZA4_TRIEI Length = 469 Score = 56.9 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 8/67 (11%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS------QPGTHQKIIDM 59 + CP+C +T + +NG+ QRY C C + + +Q + P + + + Sbjct: 1 MKCPTCGST-SLRKNGR-PNNRQRYRCKDCGRQFMVQSPTSNIEQKISVNPSESKALAEA 58 Query: 60 AMNGVGC 66 +G+ Sbjct: 59 PKSGMAI 65 >UniRef50_A7HJ99 Putative uncharacterized protein n=2 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ99_FERNB Length = 261 Score = 56.5 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 1/48 (2%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTAS 48 M ++ + CP C +++ ++NG +Q + C C++ ++L FT Sbjct: 1 MTNIQLKCPHCGSSN-FIKNGHDKFKNQIFFCKDCKRYFKLSFTKKHK 47 >UniRef50_C2FEQ0 IS1181 transposase n=1 Tax=Lactobacillus paracasei subsp. paracasei ATCC 25302 RepID=C2FEQ0_LACPA Length = 425 Score = 56.5 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 30/98 (30%), Gaps = 20/98 (20%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 CP+C +VR G QR+ C CR +Q + Y + Sbjct: 48 HCPACGFASKLVRYGFERTCVLMPSYSYRPTYMKLSRQRFRCELCRSVFQSETDYVRPRS 107 Query: 51 GTHQKIIDMAM----NGVGCRATARIMGVGLNTILRHL 84 + M + + AR V T+ R + Sbjct: 108 TISTPVRQMVLFEAFSNCSLTDIARRFHVADKTVQRII 145 >UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ24_FERNB Length = 316 Score = 56.5 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 15/66 (22%), Positives = 31/66 (46%), Gaps = 3/66 (4%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M + ++SCP C +T + +NG G+Q++LC C +++L +++ + Sbjct: 1 MNNSTLSCPKCGST-SLYKNGHDKYGNQQFLCKLCHHSFKL--SHSQKRKNFPFPYPKCT 57 Query: 61 MNGVGC 66 G Sbjct: 58 SCGKSM 63 >UniRef50_C2BVZ4 Possible IS3509a transposase n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVZ4_9ACTO Length = 225 Score = 55.4 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 9/75 (12%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPG-------THQKIID 58 + CP+C+ + RNGK+++G QR+ C C ++ + +A + + Q+ D Sbjct: 41 MKCPACNT--PLKRNGKTSSGSQRWRCKECGRSKVGKIDNSAKELNRFLSWLLSRQRQKD 98 Query: 59 MAMNGVGCRATARIM 73 M G R A Sbjct: 99 MPGAGRTFRRHAAKF 113 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 54.6 bits (130), Expect = 9e-07, Method: Composition-based stats. Identities = 10/66 (15%), Positives = 21/66 (31%) Query: 19 RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLN 78 + G G Q+Y C C + + L + T + + + V + Sbjct: 11 KKGHIHNGKQKYQCLACGRQFVLNPSQKIIDERTRLLTKKTLLECIALEGVCWVFDVSMP 70 Query: 79 TILRHL 84 +L + Sbjct: 71 WLLEFI 76 >UniRef50_C7N1Y2 Putative uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1Y2_SLAHD Length = 332 Score = 54.2 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 28/87 (32%), Gaps = 5/87 (5%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMA 60 CP C + + V R ++ AG + + C C + + F + I + Sbjct: 51 RPPCPRCGSGETVGRG-RTGAGRRFWECRDCGRKYTSLAGTIFESSKKPLSAWVLFIRLM 109 Query: 61 MNGVGCRATARIMGVGLNTILRHLKNS 87 V A A + G+ T Sbjct: 110 CYNVQLDAAAELCGMSHQTAWEWRHRV 136 >UniRef50_A8ZMW7 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZMW7_ACAM1 Length = 75 Score = 52.7 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 1/61 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA 60 M+ + + P C + + GK++ G QRY C C++T+ F + ++I Sbjct: 1 MSYLLMQSPLCD-HPKIHKPGKTSKGSQRYRCLDCQQTFSETFDTLYYRLQISSEMIQAI 59 Query: 61 M 61 + Sbjct: 60 L 60 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 52.7 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 17/52 (32%), Positives = 27/52 (51%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT 52 M ++ + C C T+ V ++ K A HQRY C C + +QL++ Y A Sbjct: 1 METLEVKCRFCQQTEFVKKHSKGDADHQRYRCFSCNQIFQLEYAYRACHCSY 52 >UniRef50_Q8PWW0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PWW0_METMA Length = 155 Score = 52.7 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 18/101 (17%), Positives = 39/101 (38%), Gaps = 14/101 (13%) Query: 4 VSISCPS--CS-----ATDGVVRNGKSTAGHQR---YLCSHCRKTWQLQ----FTYTASQ 49 + CP+ C + ++ NG ++R Y+C C + + + F Sbjct: 11 TDVFCPNKDCKLYGITGKENIIGNGTYEIKNKRVRKYICRECGRVFNDRTGTFFDNVRKD 70 Query: 50 PGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + I MA+ G+ +A + ++ V T+ L + + Sbjct: 71 ESDIKLAIKMAIKGMSIQAISDVLEVQPATVSNWLFRAAKQ 111 >UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellular organisms RepID=Q64DF0_9ARCH Length = 337 Score = 52.3 bits (124), Expect = 4e-06, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 31/92 (33%), Gaps = 9/92 (9%) Query: 5 SISCPSCSATDGV---VRNGKSTAG-HQRYLCSHCRKTWQLQFTYTASQPGTHQ----KI 56 CP C +++ V + + G Q LC C ++ T K+ Sbjct: 7 PCKCPKC-SSENVRFDYKYDTISNGSRQMLLCRGCGASFSETKNTFLQNIRTPVSTIWKV 65 Query: 57 IDMAMNGVGCRATARIMGVGLNTILRHLKNSG 88 + G AT R+ + NT+L + Sbjct: 66 LKSRTEGTSLNATCRVFDIAKNTLLAWERKFS 97 >UniRef50_C5B8T3 IS1-family insertion element protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8T3_EDWI9 Length = 73 Score = 52.3 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 20/47 (42%), Positives = 30/47 (63%) Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 Y A + ++II+MA G G R TA + +G+NT++R LKNS +S Sbjct: 27 YEAHKLDIKEQIIEMAFKGSGVRDTANTLKIGINTVIRTLKNSRQSE 73 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 52.3 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 28/87 (32%), Positives = 37/87 (42%), Gaps = 3/87 (3%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDMAM 61 ISCP CS+ + +NGK Q YLC C + + TY Q+I+ M + Sbjct: 5 TPISCPKCSSCQ-IKKNGKKPNNKQNYLCKCCGRQFIGDHALTYRGCHSKISQRILIMLV 63 Query: 62 NGVGCRATARIMGVGLNTILRHLKNSG 88 G G R A I V +L L N Sbjct: 64 RGCGIRDVAAIEKVSCTKVLSVLLNVR 90 >UniRef50_C2CJK1 ISSha1 transposase n=7 Tax=Anaerococcus RepID=C2CJK1_9FIRM Length = 422 Score = 51.9 bits (123), Expect = 6e-06, Method: Composition-based stats. Identities = 13/100 (13%), Positives = 28/100 (28%), Gaps = 20/100 (20%) Query: 8 CPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFTYTASQP- 50 CP C + +++ G ++ Q+ C C K + L+ Sbjct: 48 CPHCGSNHNLIKYGFKSSNVRCSRAGDYPVIIDLKKQKMFCKSCNKYFLLETKIVDKHCN 107 Query: 51 ---GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 + I+ + + V T+ R + Sbjct: 108 ISNQIKRHILASLTKKLSMKDIGSNNYVSTTTVARFMAKL 147 >UniRef50_A4W908 Putative cytoplasmic protein n=2 Tax=Enterobacteriaceae RepID=A4W908_ENT38 Length = 414 Score = 51.5 bits (122), Expect = 7e-06, Method: Composition-based stats. Identities = 14/41 (34%), Positives = 22/41 (53%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT 44 V + CP+C D ++RNG G QR+ C C ++ + T Sbjct: 64 VLLYCPTCGQGDALIRNGCGLRGAQRWRCRTCNSSFTDKST 104 >UniRef50_B2A0V7 Integrase catalytic region n=15 Tax=Clostridia RepID=B2A0V7_NATTJ Length = 353 Score = 51.5 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 15/68 (22%), Positives = 23/68 (33%), Gaps = 7/68 (10%) Query: 1 MASVSISCPSCS--ATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA---SQPGTHQK 55 M + CP C+ +D + G GHQ+Y C C + + P +K Sbjct: 1 MTK--VVCPRCNNNCSDKFYKFGFDNHGHQKYQCQECFSQFAPKTLSKGGDKRGPNMPRK 58 Query: 56 IIDMAMNG 63 G Sbjct: 59 YPSCPKCG 66 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 16/116 (13%), Positives = 29/116 (25%), Gaps = 30/116 (25%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLC--SHCRKTWQLQ---------------- 42 M SCP C + + C C ++ + Sbjct: 55 MPRKYPSCPKCGKATFLH---HDYEFYSNLRCCDKSCNHSFYVPKPQSIPEPSQLDINGK 111 Query: 43 --FTYTASQPGTHQKIIDMA-MNGVGCRATARIM------GVGLNTILRHLKNSGR 89 F+ T + + + +NG R ++ + V TI K Sbjct: 112 VDFSNMRHPLHTIIRALYLYFINGSSTRGVSQFLLDCEGIKVSHVTIADWTKKFAP 167 >UniRef50_A1VN28 Insertion element protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VN28_POLNA Length = 324 Score = 51.5 bits (122), Expect = 9e-06, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 35/91 (38%), Gaps = 5/91 (5%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTY---TASQPGTHQKIID 58 A+ CP C T+ + R+G +G QRY C CR+T+ + G + Sbjct: 47 ATEPRCCPHCQGTE-LYRHGH-VSGLQRYRCRTCRRTFNALTGTALARLRKKGKWFGFSE 104 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 + R A + V NT LR R Sbjct: 105 ALAASLTLRRAATALQVHRNTALRWRHRFLR 135 >UniRef50_C9BRL4 Transposase n=30 Tax=Enterococcus RepID=C9BRL4_ENTFC Length = 431 Score = 51.5 bits (122), Expect = 9e-06, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 32/105 (30%), Gaps = 22/105 (20%) Query: 7 SCPSCSAT--DGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTAS 48 C C ++R G +T QR+ C C++T+ + Sbjct: 48 ECSHCLCVVPSRIIRWGTTTVRLLLNDVSEYRTYLELKKQRFKCKSCQRTFVADTSVAEK 107 Query: 49 QPGTHQK----IIDMAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 QK +I AR + +++ R +K R Sbjct: 108 HCFISQKVRWSVIARLKENTSMTEIARQKNISTSSVYRVMKRFYR 152 >UniRef50_C5BFY7 IS1, transposase OrfA n=3 Tax=Enterobacteriaceae RepID=C5BFY7_EDWI9 Length = 46 Score = 51.1 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 19/42 (45%), Positives = 26/42 (61%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 MA + + CP + T V+RNG +T+G Q Y C C KT+QL Sbjct: 1 MAKIDVVCPRGAKTQDVIRNGHATSGAQVYRCKLCLKTFQLS 42 >UniRef50_Q9V1K2 Putative uncharacterized protein n=2 Tax=Pyrococcus RepID=Q9V1K2_PYRAB Length = 141 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 30/93 (32%), Gaps = 9/93 (9%) Query: 3 SVSISCPSCSATDGVVRNGK----STAGHQRYLCSHCRKTWQL----QFTYTASQPGTHQ 54 I+CP C + +V+ G QRY C +C +T+ S Sbjct: 31 KGRITCPYC-KSPNIVKIGYIMRSGNFKIQRYKCKNCNRTFTELDGTPLKGAHSLKDIVI 89 Query: 55 KIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 + + A+I+ + + R K Sbjct: 90 VAYLTLDLKLPPSSIAKILPINRPKLYRAYKRV 122 >UniRef50_Q7NH53 TetR family transcriptional regulatory protein n=1 Tax=Gloeobacter violaceus RepID=Q7NH53_GLOVI Length = 227 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 2/42 (4%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTA 47 + CP C ++ + RNG QR LC C + + L +A Sbjct: 188 MKCPRCG-SERLSRNGHRHD-RQRLLCKDCSRQFLLPVGQSA 227 >UniRef50_C5S2C5 Putative transposase n=1 Tax=Actinobacillus minor NM305 RepID=C5S2C5_9PAST Length = 394 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 29/94 (30%), Gaps = 12/94 (12%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDM 59 I CP C T Q++ C C++ + L F T+ + Sbjct: 52 EKICCPHCQRTQP-----YFIKSRQKWRCRGCKREFSLTSGTLFASHKLPLRTYLLALVF 106 Query: 60 AMN---GVGCRATARIMGVGLNTILRHLKNSGRS 90 +N G+ + AR + V T S Sbjct: 107 YINAKQGITSKRLARELAVNYRTAFMLSHKIRES 140 >UniRef50_Q8PRR9 Conserved protein n=2 Tax=Archaea RepID=Q8PRR9_METMA Length = 148 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 27/78 (34%), Gaps = 4/78 (5%) Query: 12 SATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGT----HQKIIDMAMNGVGCR 67 + + V++ H + C C+K + + T + I M R Sbjct: 34 NQGNIVLKERYGKNNHALFKCKTCKKCFSETKGTIFFELNTPDEEVLRTIAMLPEKGSIR 93 Query: 68 ATARIMGVGLNTILRHLK 85 AR G +TI R L+ Sbjct: 94 GVARATGHSKDTICRWLE 111 >UniRef50_B7K7J3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7J3_CYAP7 Length = 354 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 2/36 (5%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTW 39 + I CP C + +NG + AG QRY C C + + Sbjct: 2 ILIQCPKC-KSKNYRKNG-TIAGKQRYQCKSCGRNF 35 >UniRef50_Q03IY7 Transposase n=198 Tax=Lactobacillales RepID=Q03IY7_STRTD Length = 442 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 30/103 (29%), Gaps = 19/103 (18%) Query: 4 VSISCPSCSA---TDGVVRNGKSTA------------GHQRYLCSHCRKTWQLQFTYTAS 48 S CP+C + K +R+ C C K + + Sbjct: 63 ESPKCPACKGQMGKYDFQKASKIPYLECAGYRTLIRLKKRRFKCKECGKMAVAETSLVKK 122 Query: 49 QPGT----HQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 +QKI + + A+ + V +T+ R L Sbjct: 123 NHQIATVVYQKIAQLLIEKQSMTDIAKRLAVSTSTVSRKLNEF 165 >UniRef50_B4VTL4 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VTL4_9CYAN Length = 124 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 32/93 (34%), Gaps = 12/93 (12%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMA 60 CP C +T + + +RY C+ C ++ F T I + Sbjct: 20 YPQCPYCQST-----HSRRLKKERRYQCNECFTSYSVTVGTLFHKTHVDLEKWVLAIYLV 74 Query: 61 M---NGVGCRATARIMGVGLNTILRHLKNSGRS 90 + + R A+ +GV NT + ++ Sbjct: 75 LNPPERISVRQLAKKIGVNKNTASYMIARIRQA 107 >UniRef50_Q11ZU0 Putative uncharacterized protein n=1 Tax=Polaromonas sp. JS666 RepID=Q11ZU0_POLSJ Length = 590 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 30/91 (32%), Gaps = 14/91 (15%) Query: 8 CPSCSATDGVV---------RNGKSTAGHQRYLCSHCRKTWQLQFT-----YTASQPGTH 53 CP C ++ +V G +TAG Y C C KT+ ++ + + Sbjct: 104 CPDCMCSNHLVPITQPKAYHSFGLTTAGSHCYRCKVCSKTFSVKPKGINPIARQLRSDKN 163 Query: 54 QKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 ++ M + R V + + Sbjct: 164 PPVLRMLTGKMPLRRICEAADVAPKVLYERI 194 >UniRef50_Q03NU3 Transposase n=12 Tax=Lactobacillus RepID=Q03NU3_LACBA Length = 423 Score = 50.0 bits (118), Expect = 3e-05, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 32/103 (31%), Gaps = 21/103 (20%) Query: 8 CPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQPG 51 CP C+ VV NG T QR+ C C KT Q Q Sbjct: 47 CPYCAQRQ-VVCNGHKTVYVRLPNVSERTVILILRKQRFRCKACGKTSIAQTPVVRRQHQ 105 Query: 52 THQK----IIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + I + R+ A V N++ R + G+ Sbjct: 106 ISENTRHAIDKTLIEDRTMRSIADQYNVSTNSVSRRILALGKQ 148 >UniRef50_B1JSC1 Insertion element protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JSC1_YERPY Length = 53 Score = 49.6 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 16/37 (43%), Positives = 21/37 (56%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRK 37 MA + CP C D V ++G +GHQRY C H +K Sbjct: 1 MAKIDEKCPFCERKDLVKKHGYGKSGHQRYRCPHAKK 37 >UniRef50_D0U1S9 Transposase n=1 Tax=Enterococcus faecium RepID=D0U1S9_ENTFC Length = 427 Score = 49.6 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 37/113 (32%), Gaps = 23/113 (20%) Query: 1 MASVSISCPSCSATD---GVVRNGKSTAG----------------HQRYLCSHCRKTWQL 41 ++ V C C A + + +NG T+ QR++C C K++ Sbjct: 36 LSYVPKECAHCEAPNVGFSIYKNGTQTSRVTFPMAGILPTYLRIRKQRFMCKCCGKSFTA 95 Query: 42 QFTYTASQP----GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + +I+ + + A+ V ++ R L G S Sbjct: 96 RTPVVERNCFISNYIKAQILTQSGETRSVKDIAKHTNVSEASVQRVLTLEGES 148 >UniRef50_Q2RQJ8 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RQJ8_RHORT Length = 150 Score = 49.2 bits (116), Expect = 3e-05, Method: Composition-based stats. Identities = 18/98 (18%), Positives = 31/98 (31%), Gaps = 14/98 (14%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQR--YLCSHCRKTWQLQFTYTASQPGT----HQK 55 A CP C R+ G QR + C+ CRK + + + Sbjct: 41 ARSRPVCPHCG-----FRHAYRLEGSQRVRFKCARCRKQYSARRGTVMERSNVPTAGWLT 95 Query: 56 IIDMAMN--GVGC-RATARIMGVGLNTILRHLKNSGRS 90 + + ++ G G R GV T ++ + Sbjct: 96 ALRLFISAPGAGLPARIERATGVSYKTAWSMVQRMRAA 133 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 12/36 (33%), Positives = 21/36 (58%) Query: 50 PGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLK 85 + + M++NG+G RA R+ G+ NTIL ++ Sbjct: 19 SDVKELCVKMSLNGMGFRAIERVTGISHNTILNWVR 54 >UniRef50_B4WUH8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WUH8_9SYNE Length = 76 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 27/56 (48%), Gaps = 3/56 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT-YTASQPGTHQKIIDMA 60 ++CP C ++ + +NG G Q Y+C+ CR+ + ++ ++ + M Sbjct: 1 MACPECQ-SEHIRKNGH-KRGKQNYICADCRRQFVENPKEHSGYSDEERKQCLSMY 54 >UniRef50_A7JMB8 Predicted protein n=8 Tax=Francisella RepID=A7JMB8_FRANO Length = 82 Score = 48.8 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 30/83 (36%), Gaps = 4/83 (4%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK--IIDMAMNG 63 I C C ++G+ + G QRY C C + L +I ++ Sbjct: 2 IKCNRCH-SEGIHKTGVVRN-KQRYKCKSCGYNFVLSDGRIKPDIAIKLALTVIMYSLGK 59 Query: 64 VGCRATARIMGVGLNTILRHLKN 86 A++ GV + TI L+ Sbjct: 60 YSYGFIAKLFGVRMTTIQNWLEQ 82 >UniRef50_Q6MK35 Putative transposase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MK35_BDEBA Length = 300 Score = 48.8 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 34/96 (35%), Gaps = 15/96 (15%) Query: 5 SISCPSCS-------ATDGVVRNGKSTAGHQ-----RYLCSHCRKTWQLQFTYT---ASQ 49 ++ CP C A + R G+ R C C K++ + Sbjct: 8 NLKCPYCHLQRDPKDANRTIRRLGRYYRKSDGQTLTRLWCVRCGKSFSAATQSRLKGQKK 67 Query: 50 PGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLK 85 ++ I D+ + R AR++ + T++R + Sbjct: 68 RHLNKLIRDLLTGEMSQREIARVLKINRKTVVRKFR 103 >UniRef50_UPI0001BC5E64 putative transposase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5E64 Length = 173 Score = 48.8 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 32/101 (31%), Gaps = 21/101 (20%) Query: 7 SCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFTYTASQP 50 CP C ++RNG + QR+LC C KT+ + Sbjct: 48 KCPFCGE-KHIIRNGTKLSKIKILDVSNTPSYLYLRKQRFLCKSCSKTFSASTNFVRKYC 106 Query: 51 G----THQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 I + N + + A+ V +T+ R L Sbjct: 107 NIADSIKLSIALESKNIISEKDIAKRFRVSSSTVKRSLLQY 147 >UniRef50_C4W7G8 Transposase for ISSha1 n=2 Tax=Staphylococcus RepID=C4W7G8_STAWA Length = 434 Score = 48.1 bits (113), Expect = 9e-05, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 37/113 (32%), Gaps = 23/113 (20%) Query: 1 MASVSISCPSCSATDG---VVRNGKSTAG----------------HQRYLCSHCRKTWQL 41 + + + C C + ++++G Q + C C +T+ Sbjct: 39 LTYIPMGCECCGIKNDNHLIIKHGFRETKVYMGLILERPAYLQLKKQSFYCKECGQTFTA 98 Query: 42 QFTYTASQPGTHQKIIDMAMNGV----GCRATARIMGVGLNTILRHLKNSGRS 90 Q Y + + + M M + + A + +T+ R+LK S Sbjct: 99 QTPYIEPRCRISKDVKLMMMKKLAKVSSEKDVANSLFHSPSTVHRYLKEVSSS 151 >UniRef50_Q10VF2 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VF2_TRIEI Length = 59 Score = 48.1 bits (113), Expect = 9e-05, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 1/59 (1%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDM 59 M+ + CPSC ++ +V+NG Q+Y C +C++ + T + I + Sbjct: 1 MSIHKLICPSCG-SNHIVKNGTIHNKKQKYQCQNCQRQFVENSQRDYISNETKELIDKL 58 >UniRef50_D1QQX4 Putative uncharacterized protein n=15 Tax=Prevotella RepID=D1QQX4_9BACT Length = 318 Score = 47.7 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 29/82 (35%), Gaps = 7/82 (8%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 + C C +T +NG G Q Y C C ++ + + T + Sbjct: 4 MRCCVCGST-HTKKNG-VRKGLQLYKCQDCGYQFRSGSQVSNDELWTAYQ-----QQKQT 56 Query: 66 CRATARIMGVGLNTILRHLKNS 87 + + + ++T+ R L + Sbjct: 57 IKELSVRFKISVSTVKRRLHDI 78 >UniRef50_B3GXU2 Transposase n=15 Tax=Pasteurellaceae RepID=B3GXU2_ACTP7 Length = 373 Score = 47.7 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 11/96 (11%), Positives = 27/96 (28%), Gaps = 11/96 (11%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYT---ASQPGTHQKIID 58 + CP C + + +R+ C HC++ + + P Sbjct: 40 NLHDVCCPHCG----IRHHAYFLQSRKRWCCKHCQRHFYITTNTAFAFHKLPFVDILAAT 95 Query: 59 MA----MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + + G+ +R + + T + Sbjct: 96 LLFANEVKGISAITMSRHLNISYKTAFVLCHKLREA 131 >UniRef50_B2J098 Putative uncharacterized protein n=2 Tax=Nostocaceae RepID=B2J098_NOSP7 Length = 133 Score = 47.3 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 23/38 (60%), Gaps = 1/38 (2%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQF 43 + CP C+ + + ++G+ G QRY+C +C + ++ + Sbjct: 34 MECPKCN-SHLLGKHGREPDGVQRYICKNCSRIFRARP 70 >UniRef50_B8F7V2 ISRssp2, family IS1595 n=4 Tax=Pasteurellaceae RepID=B8F7V2_HAEPS Length = 378 Score = 47.3 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 14/93 (15%), Positives = 28/93 (30%), Gaps = 11/93 (11%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDM 59 + CP C + + +R+ C HC++ + F + I + Sbjct: 43 NDVCCPFCG----IRHHAYFLQSRKRWTCKHCKRNFYITTNTAFAFHKLPLVDILLAISL 98 Query: 60 AMN---GVGCRATARIMGVGLNTILRHLKNSGR 89 +N G+ +R + V T Sbjct: 99 FVNEVKGISAITMSRHLNVNYKTAFVLCHKLRE 131 >UniRef50_D1PSS1 Insertion element protein (Fragment) n=14 Tax=Prevotella RepID=D1PSS1_9BACT Length = 113 Score = 47.3 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 27/78 (34%), Gaps = 7/78 (8%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCR 67 C C + VRNG G Q Y+C C + + T S+ + Sbjct: 1 CSVC-KSKHTVRNG-VRQGKQLYMCKECHSQF--RAGNTVSEDELWRSYQQ---EKQTIA 53 Query: 68 ATARIMGVGLNTILRHLK 85 + G+ L T+ R L Sbjct: 54 ELSSRFGISLATVKRRLH 71 >UniRef50_Q035C5 Transposase n=27 Tax=Lactobacillales RepID=Q035C5_LACC3 Length = 414 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 39/105 (37%), Gaps = 21/105 (20%) Query: 7 SCPSCSATDGVVRNGKST------AG----------HQRYLCSHCRKTWQLQFTYTASQP 50 CP C + + NG T G QR+ C +C T + Sbjct: 50 RCPLCGF-EALHPNGFYTAHVRVLNGVEIPTVIDLHKQRWRCHNCYHTVSAKTPLVQPNH 108 Query: 51 ----GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 ++I+ +A + + ARI+G+ +++ R + + + R Sbjct: 109 TIAAHMTERIMKLAHERLPVKTIARIIGISASSVQRIIDQNLKLR 153 >UniRef50_Q7VL05 Possible transposase n=4 Tax=Pasteurellaceae RepID=Q7VL05_HAEDU Length = 363 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 12/94 (12%), Positives = 24/94 (25%), Gaps = 11/94 (11%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTY-------TASQPGTHQKI 56 I CP C V Q++ C HC + + + + + Sbjct: 49 NDIECPHC----HVRHEAYFIKTRQQWQCKHCCYRFSITAGTIFHLAKLSLRKILKALRY 104 Query: 57 IDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + G+ + + V T + Sbjct: 105 FALKSKGLSAIELSHEINVQYKTAWGLRHKFREA 138 >UniRef50_C8SCF8 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF8_FERPL Length = 317 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 15/74 (20%), Positives = 26/74 (35%), Gaps = 5/74 (6%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAGH---QRYLCSHCRKTWQLQ--FTYTASQPGTHQK 55 + + SCP C++ + + ++YLC C T+ F +T P Sbjct: 28 LNKWNPSCPHCNSYHIIKKTDIKRERKGYAKKYLCRDCNSTFTFDNCFEWTHYPPRVVGD 87 Query: 56 IIDMAMNGVGCRAT 69 I + G R Sbjct: 88 IFHLIAKGESYRDI 101 >UniRef50_D1W685 Putative uncharacterized protein n=2 Tax=Prevotella RepID=D1W685_9BACT Length = 298 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 24/74 (32%), Gaps = 6/74 (8%) Query: 17 VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVG 76 VV+ G QR+ C C +T+ + + + A V Sbjct: 2 VVKRGFHKN-RQRWYCKSCGRTFVG-----HKRLTEETVNTRYSKGNLTVEDLATEYAVS 55 Query: 77 LNTILRHLKNSGRS 90 T+ R L + ++ Sbjct: 56 TRTVYRRLSKTYKA 69 >UniRef50_Q70JT0 IsmA protein n=2 Tax=Microcystis aeruginosa RepID=Q70JT0_MICAE Length = 112 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 12/47 (25%), Positives = 20/47 (42%), Gaps = 1/47 (2%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTH 53 +CPSC + ++NG G + C C + + + T P T Sbjct: 34 TCPSCG-SHHTIKNGYLPKGKPKRHCQECGQPFVINPTNKTISPDTK 79 >UniRef50_B2J7N9 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J7N9_NOSP7 Length = 428 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 11/44 (25%), Positives = 19/44 (43%), Gaps = 2/44 (4%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ 49 + CP C +T +NG Q YLC +C + + + + Sbjct: 1 MKCPRCEST-SCRQNGC-RNDKQNYLCKNCGQQFLEPVFPHSLK 42 >UniRef50_C7RJT2 Conserved possible transposase n=21 Tax=Proteobacteria RepID=C7RJT2_9PROT Length = 342 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 29/94 (30%), Gaps = 11/94 (11%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDM 59 + CP C R+ A + C+ C++ + F + + + + Sbjct: 40 EEVVCPHCGMAH---RHYFRPARKI-WRCAGCQEDFSVTSGTIFAFHKLPLRLYLAAVIL 95 Query: 60 AMN---GVGCRATARIMGVGLNTILRHLKNSGRS 90 N G+ R +GV T L S Sbjct: 96 FTNAVKGISALQVGRDLGVSHKTAYVLLHKIRES 129 >UniRef50_Q93CQ1 Transposase TnpA n=1 Tax=Enterococcus faecium RepID=Q93CQ1_ENTFC Length = 446 Score = 46.5 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 27/100 (27%), Gaps = 19/100 (19%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 C C + ++GK QRY C C +T+ + Sbjct: 36 RCQKCGTIANLYKHGKKRQLFFDLPMHAKRVGIYLKRQRYKCRDCNETFFEKLPDLDDAR 95 Query: 51 GTHQK---IIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 ++ I + A +GV T+ + Sbjct: 96 SVTKRLNNFIQEVSLEKTFTSVAEEIGVDEKTVRNIFNDY 135 >UniRef50_B0CG58 Transcriptional regulator, TetR family n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CG58_ACAM1 Length = 260 Score = 46.5 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 + CP C +D + +NGK Q Y+C CRK + Sbjct: 221 MICPHCQ-SDRLSKNGKRRNQ-QCYVCKDCRKQFVES 255 >UniRef50_A5FLG0 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FLG0_FLAJ1 Length = 311 Score = 46.5 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 28/92 (30%), Gaps = 11/92 (11%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTH---QKIID 58 +CP C + V NG + R C C+K + F T II Sbjct: 39 PTCPYCESEKVKVLNGTTK----RLKCYGCKKQFGVKVGTIFHDTKISLRKWFIAVYIIT 94 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 G+ +R + V T L + Sbjct: 95 AHKKGISSHQLSRDLKVTQKTAWFMLHRVREA 126 >UniRef50_A7HMZ5 Transposase IS204/IS1001/IS1096/IS1165 family protein n=14 Tax=Bacteria RepID=A7HMZ5_FERNB Length = 395 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 14/104 (13%), Positives = 31/104 (29%), Gaps = 19/104 (18%) Query: 7 SCPSCSA---------TDGVV------RNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPG 51 CP C T V + +RY+C C K + ++ Sbjct: 40 KCPKCGNITSKVHDYHTQKVKDVPIMGKKTYLIIRKRRYVCKACGKKFFEHISFLGKSQR 99 Query: 52 THQKIIDMAMNGV----GCRATARIMGVGLNTILRHLKNSGRSR 91 ++ ++ + + A+ V + T++R + Sbjct: 100 MTNRLAAYIISQLGSLTSMKEIAKHTNVSVTTVMRLFDKVNPGQ 143 >UniRef50_D2LK53 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LK53_RHOVA Length = 249 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 24/93 (25%), Gaps = 12/93 (12%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK----IIDMA 60 + CP C T+ + C C K + L + I + Sbjct: 19 NPVCPECGGTNH-----YDLKSRPVWKCKACSKQFSLTSGTIFHSRKLRIRDILGAIAIF 73 Query: 61 MN---GVGCRATARIMGVGLNTILRHLKNSGRS 90 N G +R +G T L S Sbjct: 74 TNGAKGYSALQLSRDLGCDYKTCFVLLHKLRES 106 >UniRef50_Q3Y3Y3 Transposase, IS204/IS1001/IS1096/IS1165 n=11 Tax=Lactobacillales RepID=Q3Y3Y3_ENTFC Length = 424 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 34/105 (32%), Gaps = 21/105 (20%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQ----------------RYLCSHCRKTWQLQF----T 44 C C + ++RNG T Q R+LC C +T+ + Sbjct: 44 PSHCEHCGSI-RIIRNGSYTTRTQILKVKEKLTILELKRTRFLCYDCGQTFSAKTDLVDE 102 Query: 45 YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 + Q I+ + A+ V T+ R L+ + + Sbjct: 103 HHQLTKELKQAILMELYENQSRKLIAKKYFVSDGTVTRILREATK 147 >UniRef50_C1DPZ8 Transposase n=4 Tax=Bacteria RepID=C1DPZ8_AZOVD Length = 236 Score = 46.1 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 7/45 (15%), Positives = 11/45 (24%) Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 + Q + + R A+ GV NT Sbjct: 3 RLRKRHLWQGYAEALTQSLTVRRAAKHCGVSKNTAFLWRHRFLTQ 47 >UniRef50_C6HZQ4 Transposase n=2 Tax=Leptospirillum ferrodiazotrophum RepID=C6HZQ4_9BACT Length = 443 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 31/102 (30%), Gaps = 20/102 (19%) Query: 7 SCPSCSATDGV--------VR----NGKSTA---GHQRYLCSHCRKTWQLQFTYTASQPG 51 C C + D V +R +GK +R+ C C +T+ + Sbjct: 35 RCVHCGSIDLVGFGRREQWIRDLPIHGKRVGIAVDTRRFRCKSCGRTFYEPLPAVDDKRL 94 Query: 52 THQKIIDMAMNGVGCR----ATARIMGVGLNTILRHLKNSGR 89 ++ R A GVG TI + + + Sbjct: 95 MTTRLKTWL-EKKSLRPPFSQLAEETGVGALTIRKVFDDCAQ 135 >UniRef50_Q6V7R1 Bcep22gp32 n=1 Tax=Burkholderia phage Bcep22 RepID=Q6V7R1_9CAUD Length = 482 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 13/92 (14%), Positives = 21/92 (22%), Gaps = 19/92 (20%) Query: 8 CPSCSATDGVVRNGKS----------------TAGHQRYLCSHCRKTWQLQFTYTASQPG 51 C C V ++G A QR+ C C T+ Sbjct: 35 CQKCGVIGRVYKHGPKMIIFRDSPIRGRPVSIEANAQRFRCRDCGGTFIQPLGGIHPATR 94 Query: 52 THQKIIDMAMN---GVGCRATARIMGVGLNTI 80 + + A +G T+ Sbjct: 95 MTARCVQYIEEQCLRDTFTRIAEHVGCDDKTV 126 >UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synechococcus sp. PCC 7335 RepID=B4WVD1_9SYNE Length = 298 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 29/94 (30%), Gaps = 10/94 (10%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKII-DMAMN-- 62 + CP C + + K+ G+ + C CR+T+ + + II + + Sbjct: 27 MDCPHCQSPRVSLLQRKTNLGYDMFRCKRCRRTFNERTGTPFNFIEVPTDIIFQVLLCRV 86 Query: 63 --GVGCRATA-----RIMGVGLNTILRHLKNSGR 89 + R A R T+ Sbjct: 87 RYKLSYRDVAEFFLLRGFQFTHETVRDWEARFLP 120 >UniRef50_B5S3H3 Probable transposase protein n=6 Tax=Burkholderiaceae RepID=B5S3H3_RALSO Length = 460 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 14/88 (15%), Positives = 29/88 (32%), Gaps = 6/88 (6%) Query: 6 ISCPSCSATDGVVRNGKSTAGH-QRYLCSHCRKTWQLQFTYTASQPGTHQK---IIDMAM 61 CP C +R ++ G+ + C C++++ S K +I + Sbjct: 103 PRCPHCDGLR--IRPDRNKGGNLPSFFCHGCKRSFNRLTGTPFSHLVNRAKGAAMIPLLS 160 Query: 62 NGVGCRATARIMGVGLNTILRHLKNSGR 89 + + +G +L L R Sbjct: 161 RQMSLDQAGKRLGRTKKAVLSWLLAFRR 188 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 13/91 (14%), Positives = 30/91 (32%), Gaps = 3/91 (3%) Query: 2 ASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP---GTHQKIID 58 ++ + SCP C + +G + C C + P + + Sbjct: 345 STHAASCPWCGSDQTKYHPAPRPSGLPGFRCRACLAYFTRVSNTPLVHPMARAYASRFVP 404 Query: 59 MAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 M AR +G+ + T+ +++ + Sbjct: 405 MLGWHETGAGAARELGIAMGTLHTWVRSWRQ 435 >UniRef50_C3L491 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L491_AMOA5 Length = 119 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 9/51 (17%), Positives = 21/51 (41%) Query: 40 QLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 Q T + + + + G+G RA ++ + T+ + ++ SG Sbjct: 40 YSQPKSGVKPIQTKRLALQLYLEGLGFRAIGNLLQISYGTVYQWIEASGEQ 90 >UniRef50_B7X577 Transposase IS204/IS1001/IS1096/IS1165 family protein n=1 Tax=Comamonas testosteroni KF-1 RepID=B7X577_COMTE Length = 471 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 28/92 (30%), Gaps = 19/92 (20%) Query: 8 CPSCSATDGVVRNG----------------KSTAGHQRYLCSHCRKTWQLQFTYTASQPG 51 CP C D + R+G K A QRY C+ C++T+ Sbjct: 36 CPKCGTLDCIYRHGTKATTYVDIPMRGKPAKLRAKVQRYRCTSCKETFLQPLGGILEGRR 95 Query: 52 THQK---IIDMAMNGVGCRATARIMGVGLNTI 80 ++ I A +G T+ Sbjct: 96 MTERCATYIKAHSLRDTFTRIAENVGCDDKTV 127 >UniRef50_A8UDH0 Transposase n=5 Tax=Bacteria RepID=A8UDH0_9LACT Length = 439 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 33/113 (29%), Gaps = 23/113 (20%) Query: 1 MASVSISCPSCSATDG---VVRNGKSTA----------------GHQRYLCSHCRKTWQL 41 + C C + +V+NG T+ QR+LC C T+ Sbjct: 42 LTYQPSHCECCGMKNHSYSIVKNGYLTSRVKWVSSTHYPTYIQLKKQRFLCRECGVTFVA 101 Query: 42 QFTYT----ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 Q Q I + + ++ V T R LK +S Sbjct: 102 QSPEIEQGCFIAKRVKQSIAVELADTTSVKDLSKRHFVSPTTTDRVLKQLNQS 154 >UniRef50_C0WLQ9 Transposase n=3 Tax=Lactobacillus RepID=C0WLQ9_LACBU Length = 418 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 20/96 (20%), Positives = 31/96 (32%), Gaps = 20/96 (20%) Query: 7 SCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFTYTASQP 50 CP+C D +V++G T QR+LC HC Q Sbjct: 49 RCPNCGFADCLVKDGHKTVNLKLSPQRFHLLILRLAKQRFLCKHCGSIITSQTDAVKPNC 108 Query: 51 GT----HQKIIDMAMNGVGCRATARIMGVGLNTILR 82 Q ++ + + A+ GV T+ R Sbjct: 109 QISKNVWQSVVMDFHDNMAATLIAKQNGVSAGTVNR 144 >UniRef50_Q7N9S9 Transposase TnpA, ISL3 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N9S9_PHOLL Length = 429 Score = 45.8 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 23/107 (21%), Positives = 35/107 (32%), Gaps = 20/107 (18%) Query: 2 ASVSISCPSC--------SATDGVVR----NGKSTA---GHQRYLCSHCRKTWQLQFTYT 46 A+ C C D V+ NGK T +RY C C KT+ + Sbjct: 29 ANPPTHCIHCKHPEIVGFGRRDEVIMDTPVNGKRTGIILNRRRYRCQICCKTFMEPVPHK 88 Query: 47 ASQPGTHQKIIDMAMNGVGCRAT----ARIMGVGLNTILRHLKNSGR 89 + ++I + R T A +GV T+ +S Sbjct: 89 DGKRQMTHRLIQ-YIERESLRRTFSSVAEDVGVDEKTVRNIFHDSCE 134 >UniRef50_C9CRL2 Transposase n=3 Tax=Alphaproteobacteria RepID=C9CRL2_9RHOB Length = 432 Score = 45.8 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 27/95 (28%), Gaps = 16/95 (16%) Query: 5 SISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMA---- 60 CP+C D +R+ C+ C + + + + +D+ Sbjct: 42 EPICPACGCVDV-----YDLTTRRRFKCAACHRQFSVTSGTIFASR--KLAFVDLLGAIC 94 Query: 61 -----MNGVGCRATARIMGVGLNTILRHLKNSGRS 90 G+ +R + V T + + Sbjct: 95 LFVNAAKGLSAVQMSRDLDVQHKTAFVLMHKLREA 129 >UniRef50_A7HVK5 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVK5_PARL1 Length = 608 Score = 45.4 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 26/64 (40%), Gaps = 2/64 (3%) Query: 23 STAGHQRYLCSHCRKTWQLQFTYTASQ--PGTHQKIIDMAMNGVGCRATARIMGVGLNTI 80 S G QR+ C C++T+ + T Q P ++ + ++ R + G+ + Sbjct: 133 SRGGAQRFRCKACQRTFSVALKSTVRQRAPHLNRTVFAEVVSKKPLRGIMEVTGLSAAAV 192 Query: 81 LRHL 84 L Sbjct: 193 YDKL 196 >UniRef50_D0SJK4 Predicted protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJK4_ACIJU Length = 460 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 30/93 (32%), Gaps = 20/93 (21%) Query: 7 SCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQFTYTASQP 50 CP C +D + ++G +RY C C+ T+ + T Sbjct: 33 KCPKCG-SDQLYKHGTKPVIYRDIPRHMKPTVINVEVKRYRCKSCKATFLQEVTGIYPDT 91 Query: 51 GTHQKI---IDMAMNGVGCRATARIMGVGLNTI 80 ++ I TAR+MG TI Sbjct: 92 RMTERFVKKIQDICLDYTFSDTARMMGCDSKTI 124 >UniRef50_Q2J1M8 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris HaA2 RepID=Q2J1M8_RHOP2 Length = 204 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 29/91 (31%), Gaps = 11/91 (12%) Query: 6 ISCPSCS--ATDGVVRNGKSTA-GHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIID 58 CP C + V GKS G YLCS CR+ + T Sbjct: 19 PECPHCGVGSPSVVAIAGKSHRPGL--YLCSACRRQFTVTVGTPLEGTKLPLKLWIGAAH 76 Query: 59 MAMNGVGC--RATARIMGVGLNTILRHLKNS 87 + + R R +GV T + ++ Sbjct: 77 LLNSHQPIAVREIERALGVTYKTAWKVVQRL 107 >UniRef50_UPI0001C31088 transcriptional regulator, TetR family n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31088 Length = 349 Score = 45.0 bits (105), Expect = 7e-04, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 25/88 (28%), Gaps = 10/88 (11%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAMNG 63 CP C A + G RY C C F + + GT M Sbjct: 31 CPRCGADRPF----RLRRGDIRYACRVCEMALDPRAGTAFEGSRTPLGTWFVATAMLRED 86 Query: 64 --VGCRATARIMGVGLNTILRHLKNSGR 89 + A A GV T R L+ Sbjct: 87 PQLTPTALAAEAGVSYATSWRMLRRLRE 114 >UniRef50_B9ZCS9 DNA topoisomerase type IA zn finger domain protein n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZCS9_NATMA Length = 244 Score = 45.0 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 36/91 (39%), Gaps = 10/91 (10%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAM 61 ++CP C +D V+NG Q YLC +C +T+ + F ++ I + Sbjct: 26 VTCPRC-RSDLTVKNGSYGH-FQHYLCKNCDRTFNDKTGTIFAHSKVALRKWLFSIYAFL 83 Query: 62 N-GVGCRATARIMGV-GLNTILRHLKNSGRS 90 + + TI +H++ ++ Sbjct: 84 RFNTSLHQL--QLEIDQYKTIYQHIERFTKA 112 >UniRef50_A7HYI5 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HYI5_PARL1 Length = 594 Score = 44.6 bits (104), Expect = 9e-04, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 28/89 (31%), Gaps = 11/89 (12%) Query: 7 SCPS--CSATD--------GVVRNGKSTAGHQRYLCSHCRKTWQLQ-FTYTASQPGTHQK 55 +CP+ C GK+ G RY C CRKT+ + T + Sbjct: 112 TCPNAVCGNHRLPIGLMPSAYRLFGKTAKGDARYQCKACRKTFSIGLPTRRHKKTDKTGA 171 Query: 56 IIDMAMNGVGCRATARIMGVGLNTILRHL 84 I+ +N + V I + Sbjct: 172 IMRGLVNKMAMSRLCETAQVTFPHIHSKI 200 >UniRef50_UPI00016C448A hypothetical protein GobsU_12575 n=6 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C448A Length = 234 Score = 44.6 bits (104), Expect = 9e-04, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 36/97 (37%), Gaps = 16/97 (16%) Query: 10 SCSATDGVVRNGKSTAGH----QRY--------LCSHCRKTWQLQFTYTAS----QPGTH 53 C +GK G+ RY CS C+ + + T Sbjct: 7 FCCRNADCPDHGKRGHGNLTVPARYGPNRTRVLRCSTCQARFSERKGTPLYGTRLSAQTV 66 Query: 54 QKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 ++ G G R TAR++GV +T+ R+++ +G Sbjct: 67 TAVLAHVAEGAGTRKTARLVGVHRDTVTRYIRQAGHQ 103 >UniRef50_C2H217 Possible transposase n=5 Tax=Enterococcaceae RepID=C2H217_ENTFA Length = 438 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 32/104 (30%), Gaps = 21/104 (20%) Query: 1 MASVSISCPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFT 44 + + C C D V+R+ + QR+ C+ CR T+ + Sbjct: 53 LTGEAPRCEYCGF-DSVIRHSYQDSWIQLLPYQEVPTYLHLYKQRFRCTRCRHTFSAKTY 111 Query: 45 YTA----SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 Y A I + + A+ V T+ R L Sbjct: 112 YVAENCYISQALKFAIAVDLKKKISMKDIAQRYFVSTKTVERVL 155 >UniRef50_A5VLK7 Transposase, IS204/IS1001/IS1096/IS1165 family protein n=19 Tax=Lactobacillus reuteri RepID=A5VLK7_LACRD Length = 343 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 15/105 (14%), Positives = 31/105 (29%), Gaps = 22/105 (20%) Query: 8 CPSCSATDG--VVRNGKSTAGH----------------QRYLCSHCRKTWQLQF----TY 45 CP+C + +++ G A H QR+ C C T+ Sbjct: 47 CPNCGVINRGQILKYGFYQAKHKYGQFRTQPLVLLVKTQRFQCPDCHTTFNATSYLFEKQ 106 Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 +++I + A + + ++ R L + Sbjct: 107 RTISRDLRREVILRLTRIQTIKDIAHDLFISEASVQRILLDLADQ 151 >UniRef50_B9JNY3 Transposase n=4 Tax=Alphaproteobacteria RepID=B9JNY3_AGRRK Length = 365 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 20/95 (21%), Positives = 30/95 (31%), Gaps = 15/95 (15%) Query: 8 CPSCSATDGVVRNGKSTAGHQR-----YLCS--HCRKTWQ----LQFTYTASQPGTHQKI 56 CP+C + G+ G +R Y CS CR + T T K Sbjct: 43 CPACGYKRSIAIAGRDM-GKRRARPGLYQCSSGDCRFQFTVTTHTPLHATKLPLRTWLKA 101 Query: 57 IDMAMN---GVGCRATARIMGVGLNTILRHLKNSG 88 + + + G+ A +GV T R Sbjct: 102 MWLLLQSDKGLSSVRLAETLGVSQPTAWRIGHALR 136 >UniRef50_C6QEP3 ISSpo8, transposase n=4 Tax=Alphaproteobacteria RepID=C6QEP3_9RHIZ Length = 330 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 18/103 (17%) Query: 6 ISCPSCSATDGVV-------RNGK-STAGHQRY---LCSHCRKTWQ----LQFTYTASQP 50 CP C A + + K + G +R+ C CRK + F + Sbjct: 28 PVCPHCGADKRIYDLKGVRSKPSKRNPKGVERHGLKKCGACRKQFTVRVGTVFESSHIPL 87 Query: 51 GTHQKIIDMAMN---GVGCRATARIMGVGLNTILRHLKNSGRS 90 + + + + G+ R++ + + + Sbjct: 88 HLWLQAVHLMCSSKKGISSHQLHRVLEIKYQSAWFMSHRIREA 130 >UniRef50_Q1GHU2 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GHU2_SILST Length = 124 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 25/63 (39%), Gaps = 6/63 (9%) Query: 28 QRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNG------VGCRATARIMGVGLNTIL 81 QRY C C+KT+ + ++ D+ + R AR +G+ +T+ Sbjct: 6 QRYRCGSCKKTFSGRTGTRIARIHRPGLFFDVLKDMPGPRPLSSVRVLARCLGLNKHTVW 65 Query: 82 RHL 84 R Sbjct: 66 RWR 68 >UniRef50_D1UAU0 Transposase, putative n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1UAU0_9DELT Length = 320 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 30/86 (34%), Gaps = 12/86 (13%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQ-----PGTHQKIIDMAMN 62 CP C R +G +R C+ C+ T+ F+ +I + ++ Sbjct: 65 CPRCG-----HRKVYDLSG-ERLRCADCKYTF-HPFSGRWINNGALTSLEWLNLITLFVD 117 Query: 63 GVGCRATARIMGVGLNTILRHLKNSG 88 + +G+ NT+ + L Sbjct: 118 ECSVHQMKQRLGLSYNTVYKALTAIR 143 >UniRef50_D0MDA7 Transposase-like protein n=7 Tax=Bacteria RepID=D0MDA7_RHOM4 Length = 279 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 27/90 (30%), Gaps = 16/90 (17%) Query: 7 SCPSCSATDGVVRNGKSTAGHQR---YLCSHCRKTWQLQFTYT----ASQPGTHQKIIDM 59 CP C + G R Y C CR+ W + G + + Sbjct: 31 HCPYCKSEHL---------GRVRRRFYKCYRCRREWSPRKGSLLEGLRLPLGKFLLALKL 81 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNSGR 89 V R AR +G+ NT+ R Sbjct: 82 FELEVSARRAARELGLAYNTVHRLFLLFRE 111 >UniRef50_C5RB59 Possible transposase n=1 Tax=Weissella paramesenteroides ATCC 33313 RepID=C5RB59_WEIPA Length = 228 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 30/95 (31%), Gaps = 22/95 (23%) Query: 8 CPSCSATDGVVRNG----------------KSTAGHQRYLCSHCRKT----WQLQFTYTA 47 CP C+ + +NG + Q+Y+C C +T + Sbjct: 30 CPQCAC--LMNKNGTKLVQHIASRAANIFNQLAIRKQKYICPQCHQTALAEFTDIKAGDH 87 Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILR 82 Q + V + A+ + +T++R Sbjct: 88 IIANVKQAAAMELSDNVSQKHIAQAYNISPHTVMR 122 >UniRef50_B1IC92 Transposase n=24 Tax=Lactobacillales RepID=B1IC92_STRPI Length = 415 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 32/97 (32%), Gaps = 19/97 (19%) Query: 7 SCPSCS----ATDGVVR--------NGKSTA---GHQRYLCSHCRKTW----QLQFTYTA 47 C +C DG + NG+ QRY C C T+ L Sbjct: 50 RCRNCGFPTVNKDGFRKTHVRLASLNGRRYELELRKQRYKCKSCHTTFGAITNLTKENQT 109 Query: 48 SQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 +I+ +A G+ + A + +++ R + Sbjct: 110 LSSDLKNQIMLLARKGLSGQLIAEMCHCSPSSVRRTI 146 >UniRef50_Q2P6H2 ISXo5 transposase n=74 Tax=Xanthomonas RepID=Q2P6H2_XANOM Length = 332 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 30/92 (32%), Gaps = 14/92 (15%) Query: 8 CPSCSATDG--VVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK--IIDMAMNG 63 CP C+AT R+G + C+ C + L+ + ++ M + G Sbjct: 53 CPRCAATAHSRFQRHGTMY-----WQCTACYRQTSLRSGTVMDNSKLPLRTWLLGMYLLG 107 Query: 64 VGCRATA-----RIMGVGLNTILRHLKNSGRS 90 + R +GV T ++ Sbjct: 108 QSKTNLSALELMRHLGVSYPTAWPMKHKLMQA 139 >UniRef50_A8YX76 Transposase n=42 Tax=Lactobacillus RepID=A8YX76_LACH4 Length = 426 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 33/106 (31%), Gaps = 22/106 (20%) Query: 4 VSISCPSCSATDGVVRNGKSTAG-----------------HQRYLCSHCRK----TWQLQ 42 + +C C + D + NG QR C C + +L Sbjct: 43 IQPACLFCGSLDLLH-NGHLITNIHYPTANASLPVIIRLAKQRVKCRDCERWSMAQSELV 101 Query: 43 FTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSG 88 Y + + K++ + AR V +NT+ R L N Sbjct: 102 NKYCSISNASKLKVLSALTEDRSMTSIARENNVSINTVQRVLGNCS 147 >UniRef50_Q87RY6 Putative resolvase n=3 Tax=Vibrio parahaemolyticus RepID=Q87RY6_VIBPA Length = 216 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 25/52 (48%) Query: 39 WQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 W +F + HQ+II++ + G R A+ +G +T+ R K + + Sbjct: 161 WAEKFQGRKANTKQHQRIIELLLEGKSIRGVAQELGCNASTVQRVKKKAVEA 212 >UniRef50_C7P9K3 Transcriptional regulator, ArsR family n=2 Tax=Methanocaldococcus RepID=C7P9K3_METFA Length = 206 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 12/39 (30%), Positives = 18/39 (46%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 KI+ NG G R TA+ +G+ T+ R +K Sbjct: 146 DRWVKILKSLYNGCGVRETAKNLGLSPATVSREVKKLQE 184 >UniRef50_UPI000186E028 transcription factor Sp4, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E028 Length = 749 Score = 43.8 bits (102), Expect = 0.001, Method: Composition-based stats. Identities = 11/62 (17%), Positives = 26/62 (41%), Gaps = 5/62 (8%) Query: 10 SCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQL-QFTYTASQPGTHQKIIDMAMNGV 64 C +D + R+ ++ G +R+ C C+K + + QK+++ A + + Sbjct: 656 YCGKRFTRSDELQRHRRTHTGEKRFQCPDCQKKFMRSDHLSKHIKTHQKQKLMEAATSTI 715 Query: 65 GC 66 Sbjct: 716 SL 717 >UniRef50_C0MDD8 Putative transposase n=1 Tax=Steptococcus equi subsp. zooepidemicus H70 RepID=C0MDD8_STRS7 Length = 243 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 25/102 (24%), Gaps = 19/102 (18%) Query: 5 SISCPSCSATD---GVVRNGKSTA------------GHQRYLCSHCRKTWQLQFTYTASQ 49 C C + K R+ C CRK + Sbjct: 52 PPKCKHCKRAQIKYDFQKPSKIPFIEIGGFPSLIRLKKSRFQCQTCRKVAVSETNLVKKN 111 Query: 50 PGT----HQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 QKI + +N A + + +T+ LK Sbjct: 112 CQISEPVRQKISQLLLNKEAFTHIAAKLAISTSTVYHKLKQC 153 >UniRef50_A2V378 Putative uncharacterized protein n=1 Tax=Shewanella putrefaciens 200 RepID=A2V378_SHEPU Length = 214 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 28/90 (31%), Gaps = 12/90 (13%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT----YTASQPGTHQKIIDMAM-- 61 CPSC + S Y C+ C + L T I + Sbjct: 29 CPSCGGKEYCKLKRHSL-----YQCNTCHQQTSLTAGTILDNTKLPLTKWFLAIFLLTQV 83 Query: 62 -NGVGCRATARIMGVGLNTILRHLKNSGRS 90 NG+ +R++ V NT R ++ Sbjct: 84 KNGISALELSRLIEVSYNTAWRMKHKLMQA 113 >UniRef50_B2SSB8 Transposase TnpA, ISL3 family n=6 Tax=Bacteria RepID=B2SSB8_XANOP Length = 472 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 17/98 (17%), Positives = 33/98 (33%), Gaps = 18/98 (18%) Query: 8 CPSCSA--------TDGVVR----NGKSTA---GHQRYLCSHCRKTWQLQFTYTASQPGT 52 C +C + + VVR +GK A +R+ C C KT+ ++ Sbjct: 35 CTACGSDRLIGHGRNEQVVRDLPTHGKRLAIYVDTRRWRCQSCGKTFMEPLPAVNAKREM 94 Query: 53 HQKIIDMAMN---GVGCRATARIMGVGLNTILRHLKNS 87 +++ + A G+ TI ++ Sbjct: 95 TDRLVKWIGQQSLKRTFASIADDTGLDEKTIRNIFRDY 132 >UniRef50_Q5ZT03 Transposase (IS652) n=29 Tax=Gammaproteobacteria RepID=Q5ZT03_LEGPH Length = 399 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 17/99 (17%), Positives = 33/99 (33%), Gaps = 19/99 (19%) Query: 6 ISCPSCSATD------GVVRNGKSTAGHQR---------YLCSHCRKTWQLQF----TYT 46 + C C + R + G +R Y C C + + +F Y Sbjct: 42 VRCIHCGNKKLRVKDSFIRRIRHESIGLRRSYLCLKAHKYYCPSCGRYFNQRFPGIGKYQ 101 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLK 85 + +++ GV + AR + +G +T+ R Sbjct: 102 RASESLRKQVFHYHSKGVSQKDLARDLKLGKSTVERWYH 140 >UniRef50_Q12Y80 Transposase n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12Y80_METBU Length = 382 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 15/109 (13%), Positives = 29/109 (26%), Gaps = 23/109 (21%) Query: 4 VSISCPSCSATD------GVVRNGKSTAGHQ-----RYLCSHCRKTWQLQFTYTASQPGT 52 + CP C + + + G Q RY C C K + + Sbjct: 65 LHPQCPVCGSNKINKQEYYTRKLKLAEFGSQIIHVRRYYCKKCSKRFTTPLDPIVKKGHQ 124 Query: 53 HQKIIDMAMNG------VGCRATARI------MGVGLNTILRHLKNSGR 89 + + + + R +I TI ++ S + Sbjct: 125 YARTYEQYIEDSYETGYCSFRHLQKIFSSLYDCSPSHQTIYNWIRKSNK 173 >UniRef50_A9IG79 ISSod11, transposase n=14 Tax=Proteobacteria RepID=A9IG79_BORPD Length = 223 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 27/91 (29%), Gaps = 13/91 (14%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRK----TWQLQFTYTASQPGTHQKIIDMAMN- 62 CP C V R A R +C C+ T F T + N Sbjct: 46 CPRCGNAGDVYR-----ASRTRLMCRSCQYQGTVTSGTIFDKTRTPLRVWLAAAWYLTNQ 100 Query: 63 --GVGCRATARIMGV-GLNTILRHLKNSGRS 90 GV R++G+ T L R+ Sbjct: 101 KQGVSALGLQRVLGLGSYQTAWTMLHRFRRA 131 >UniRef50_C3MUP9 Resolvase helix-turn-helix domain protein n=40 Tax=Sulfolobus RepID=C3MUP9_SULIM Length = 369 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 9/39 (23%), Positives = 19/39 (48%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGR 89 T + I + + G+ A+I+ V +T+ R +K + Sbjct: 22 ETKARAILLHLEGMKISQIAKILQVHKSTVYRWVKEFEK 60 >UniRef50_B2SIA3 ISXo5 transposase n=157 Tax=Proteobacteria RepID=B2SIA3_XANOP Length = 341 Score = 43.1 bits (100), Expect = 0.002, Method: Composition-based stats. Identities = 14/90 (15%), Positives = 27/90 (30%), Gaps = 10/90 (11%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK--IIDMAMNGVG 65 CP C+A G + C+ C + L+ + ++ M + G Sbjct: 62 CPRCAANAHSR---FQRQGTTYWQCTACYRQTSLRSGTVMDNSKLPLRTWLLGMYLLGQS 118 Query: 66 CRATA-----RIMGVGLNTILRHLKNSGRS 90 + R +GV T ++ Sbjct: 119 KTNLSALELMRHLGVSYPTAWPMKHKLMQA 148 >UniRef50_C0W2A4 Transposase (Fragment) n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W2A4_9ACTO Length = 195 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 12/50 (24%), Positives = 19/50 (38%), Gaps = 3/50 (6%) Query: 20 NGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRAT 69 +GK+ AG QR+ C C T A + ++G + T Sbjct: 1 HGKTKAGRQRWRCKSCSITNLNPINTDAKNLE---LFLSWLLSGKTLKDT 47 >UniRef50_Q8R819 Transposase n=2 Tax=Thermoanaerobacter tengcongensis RepID=Q8R819_THETN Length = 455 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 13/49 (26%), Positives = 19/49 (38%), Gaps = 1/49 (2%) Query: 7 SCPSCSAT-DGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQ 54 +CP C A D + GK G Q+ C C+ W T++ Sbjct: 95 NCPVCGAPPDYLYSFGKDPDGFQKLQCKVCKHQWAPGKPAPKKSRPTYR 143 >UniRef50_C0WEV9 Transposase (Fragment) n=1 Tax=Acidaminococcus sp. D21 RepID=C0WEV9_9FIRM Length = 358 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 17/101 (16%), Positives = 30/101 (29%), Gaps = 19/101 (18%) Query: 6 ISCPSCS-ATDGV--VRNGKSTAG------------HQRYLCSHCRKTWQLQFT----YT 46 + CP+C TD + R + G +RY+C C +T+ Y Sbjct: 45 VQCPNCHAKTDRIKDYRWQRIAIGSILHQQAFVRLHKRRYVCPCCGRTFFETVPFLQRYQ 104 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNS 87 +I+ A T++R+ Sbjct: 105 RKSKDLQMQIMVSCFQKRSFTDIAADFHTSTTTVIRYFDRL 145 >UniRef50_B9JG85 Putative uncharacterized protein n=1 Tax=Agrobacterium radiobacter K84 RepID=B9JG85_AGRRK Length = 191 Score = 42.7 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 31/78 (39%), Gaps = 3/78 (3%) Query: 15 DGVVR-NGKSTAGHQRYLCSHCRKTWQLQF--TYTASQPGTHQKIIDMAMNGVGCRATAR 71 DGVVR G S AG + C C ++ + + + + + A Sbjct: 9 DGVVRARGPSEAGLPVFRCLACDVHFRRTTGTPPSGLKFRKLELFVRLLSQQRPITDAAE 68 Query: 72 IMGVGLNTILRHLKNSGR 89 ++ V + T++R +K + Sbjct: 69 MIDVKVVTVIRWVKRMRQ 86 >UniRef50_Q894I5 Phage-related protein n=1 Tax=Clostridium tetani RepID=Q894I5_CLOTE Length = 142 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 11/36 (30%), Positives = 19/36 (52%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKN 86 K +++ +NG TA+I+GV TI R ++ Sbjct: 4 REKIKAMELLLNGETITDTAKIVGVERKTIYRWMEK 39 >UniRef50_A3VEU0 ISSpo8, transposase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VEU0_9RHOB Length = 308 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 22/68 (32%), Gaps = 7/68 (10%) Query: 30 YLCSHCRKTWQLQFTY----TASQPGTHQKIIDMAMN---GVGCRATARIMGVGLNTILR 82 Y C CRK + ++ + I M + G+ AR +GV T Sbjct: 56 YRCKDCRKHFSVRTGTVLAESRLPLQKWLLAIFMLTSARKGIPSTQMARELGVTQKTAWF 115 Query: 83 HLKNSGRS 90 + + Sbjct: 116 LAQRIRET 123 >UniRef50_B2JXE0 Putative uncharacterized protein n=2 Tax=Burkholderiaceae RepID=B2JXE0_BURP8 Length = 358 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 33/90 (36%), Gaps = 7/90 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGH---QRYLCSHCRKTWQLQFTYTASQPGTHQKI---IDM 59 CP C T +++ G + Y C C + S+ Q+ I + Sbjct: 57 PPCPRCRGT-RILKKGYARLRTGPLPTYRCEQCGHCFSRLSGTPLSKRPVRQQAGELIAL 115 Query: 60 AMNGVGCRATARIMGVGLNTILRHLKNSGR 89 + C AR +GV +T+L ++ R Sbjct: 116 LPQEISCAEAARQLGVMEHTVLETVRLVRR 145 Score = 38.8 bits (89), Expect = 0.050, Method: Composition-based stats. Identities = 11/68 (16%), Positives = 23/68 (33%), Gaps = 5/68 (7%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ---FTYTASQPGTHQKIIDMAMN 62 +CP+C + R G + +G R+ C C + + +++I Sbjct: 208 PACPACGGC-HIRRKG-TVSGLPRFRCPACGVQFNRRTGTPFTRNRDAARQRELIRYLGL 265 Query: 63 GVGCRATA 70 + A Sbjct: 266 PLPLAQLA 273 >UniRef50_D2EIL2 Transposase n=1 Tax=Pediococcus acidilactici 7_4 RepID=D2EIL2_PEDAC Length = 212 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 29/96 (30%), Gaps = 21/96 (21%) Query: 7 SCPSCSATDGVVRN-------------GKS---TAGHQRYLCSHCR----KTWQLQFTYT 46 SC C T +V+N GKS QR+LC C L + Sbjct: 54 SCTYCH-TRSIVKNEFKTVYIRDIPFNGKSVILQIDKQRFLCKACHLSIIAQTNLIKKHA 112 Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILR 82 II+ + + + V ++ R Sbjct: 113 QLTQRLKFSIINYLAKNLSVDNIVQKLNVSPVSVNR 148 >UniRef50_B9Y9S5 Putative uncharacterized protein (Fragment) n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y9S5_9FIRM Length = 238 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 7/40 (17%), Positives = 14/40 (35%) Query: 51 GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 Q+ ++ + G AR+ + NT R + Sbjct: 3 DQWQRFLECYLRGESLDVCARVAQIHRNTAFRWRHKVNDA 42 >UniRef50_UPI00016C46F4 hypothetical protein GobsU_15563 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C46F4 Length = 139 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 26/73 (35%), Gaps = 4/73 (5%) Query: 20 NGKSTAGHQRYLCSHCRKTWQLQFTYT----ASQPGTHQKIIDMAMNGVGCRATARIMGV 75 G + C+ C K + + I + G G R T R+ G Sbjct: 32 WSSKPRGIRCLRCTACGKNFSERKGTPLFGLHMSDEKALDIAHHLVEGNGMRPTGRLCGG 91 Query: 76 GLNTILRHLKNSG 88 LNT+LR + +G Sbjct: 92 TLNTVLRFARKAG 104 >UniRef50_Q5LW63 ISSpo8, transposase n=4 Tax=Rhodobacterales RepID=Q5LW63_SILPO Length = 355 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 27/93 (29%), Gaps = 12/93 (12%) Query: 7 SCPSCSA-TDGVVRNGKSTAGHQRYLC--SHCRKTWQLQFTYTAS----QPGTHQKIIDM 59 CP C + + +R + G Y C C + + I + Sbjct: 39 HCPHCGSLSSTPIRGRTARPGL--YQCAERECCLQFTVTTKTPMHATKLDLRIWIAAIFL 96 Query: 60 AMN---GVGCRATARIMGVGLNTILRHLKNSGR 89 + G+ ARI+GV T + Sbjct: 97 MLTSSKGISSVVMARILGVNQKTAWKLGHAIRE 129 >UniRef50_A9BGL8 Transposase IS204/IS1001/IS1096/IS1165 family protein n=9 Tax=Petrotoga mobilis SJ95 RepID=A9BGL8_PETMO Length = 455 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 22/105 (20%), Positives = 34/105 (32%), Gaps = 30/105 (28%) Query: 5 SISCPSCS-ATDGVVRNGKSTAGH-----------------QRYLCSHCRKTWQLQFTYT 46 C C + +VRNGK+ QRY+C KT++ + Sbjct: 72 PYKCKGCKDKREYIVRNGKAKERIIKAGKVGTQRIYLIHRPQRYMCKKTGKTFRDE---- 127 Query: 47 ASQPGTHQKIIDMAMNG-------VGCRATARIMGVGLNTILRHL 84 Q+I + ATA+ GV + T+ L Sbjct: 128 -KISYRWQRITRAETENIVKGLRKMSISATAKEFGVSVRTVSNLL 171 >UniRef50_C9RDH8 Regulatory protein LacI n=1 Tax=Ammonifex degensii KC4 RepID=C9RDH8_AMMDK Length = 435 Score = 42.3 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 11/34 (32%), Positives = 17/34 (50%) Query: 55 KIIDMAMNGVGCRATARIMGVGLNTILRHLKNSG 88 +++ + G R AR +GV NT+ R L N Sbjct: 398 RVVRLRAEGRSLREIAREVGVSKNTVARWLNNLS 431 >UniRef50_C1F2K1 Unclassified family transposase n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F2K1_ACIC5 Length = 327 Score = 42.3 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 27/92 (29%), Gaps = 12/92 (13%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDMAM 61 CP C + R T + + C CRK + F + + + + Sbjct: 39 PVCPHCGSA----RYSFLTT-RRIWKCKSCRKQYSVKSGTIFEDSPIPLDKWLMAVWLVV 93 Query: 62 ---NGVGCRATARIMGVGLNTILRHLKNSGRS 90 NGV R + V + L + Sbjct: 94 NCKNGVSSYEIMRAVKVTQKSAWFMLHRIRLA 125 >UniRef50_UPI00005872C7 PREDICTED: similar to ENSANGP00000019944, partial n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005872C7 Length = 929 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 29/94 (30%), Gaps = 8/94 (8%) Query: 2 ASVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKII 57 A+ +C C T + R+ ++ G + Y C C K + + + + H + Sbjct: 751 ATKRYNCAFCGKGFNDTFDLKRHVRTHTGVRPYKCEKCGKAFTQRCSLESHLSKVHGEQH 810 Query: 58 D-MAMNGVGCRATARIMGVGLNTI---LRHLKNS 87 G+ T H+K Sbjct: 811 RYCYKQRRSKIFVCEECGITTETADFHYDHIKEI 844 >UniRef50_B8FWC8 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense DCB-2 RepID=B8FWC8_DESHD Length = 60 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 9/34 (26%), Positives = 17/34 (50%) Query: 55 KIIDMAMNGVGCRATARIMGVGLNTILRHLKNSG 88 I+D+ G R A+ +GV T++ ++ G Sbjct: 12 LILDLYKQGYTSREIAKQVGVSPTTVMNRIRKYG 45 >UniRef50_B2UM39 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UM39_AKKM8 Length = 313 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 30/95 (31%), Gaps = 14/95 (14%) Query: 6 ISCPSCSATDG---VVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIID 58 + CP C ++ V RNG + C C K + F + Sbjct: 35 VVCPFCGKSEKQYRVKRNGVEGY----FECGECGKVYTVRTGTIFERSHVPLHKWIFAFY 90 Query: 59 MAMN---GVGCRATARIMGVGLNTILRHLKNSGRS 90 + + G+ ++ +GV T L+ + Sbjct: 91 LVVTSRKGISSMQLSKEIGVTQKTAWFMLQRIREA 125 >UniRef50_Q4L7B5 Transposase for ISSha1 n=49 Tax=Staphylococcus RepID=Q4L7B5_STAHJ Length = 438 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 37/111 (33%), Gaps = 23/111 (20%) Query: 1 MASVSISCPSCSATD---GVVRNGKSTA----------------GHQRYLCSHCRKTWQL 41 + C +CS + +V+NGK T+ QR+ C C + Sbjct: 39 LTYQPTHCENCSTKNENFSIVKNGKKTSTITLLKIMEMPAYLELQKQRFYCKSCDSHFTA 98 Query: 42 QF----TYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSG 88 + + T ++D A ++ A+ V T+ R + + Sbjct: 99 KSNIVDAHCFISNKTKLAVLDKAQEYRSQKSIAKSCLVSSMTVSRVINQAA 149 >UniRef50_Q8U293 Transposase n=53 Tax=Pyrococcus RepID=Q8U293_PYRFU Length = 314 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 8/44 (18%), Positives = 20/44 (45%) Query: 47 ASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 P + +++ + G+ R TARI+ + T+ ++ + Sbjct: 102 KIPPEKKIRGVELYLRGLSYRQTARILKISHVTVWEAVQKLAEA 145 >UniRef50_D2MKS9 ISXo5 transposase n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MKS9_9BACT Length = 293 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 28/94 (29%), Gaps = 9/94 (9%) Query: 4 VSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQ----LQFTYTASQPGTHQKIIDM 59 +CP C + + + G R+ C CR ++ F T I + Sbjct: 27 EEPTCPHCESPHVARKADGTRQG--RWNCHGCRSSFTVLSGTIFEKTRIPLQKWFLAIGL 84 Query: 60 AMN---GVGCRATARIMGVGLNTILRHLKNSGRS 90 +N + AR + + T + Sbjct: 85 IVNAKKSLSSCQLARDLSLTQPTAWYIQARIRSA 118 >UniRef50_A2A935 PR domain zinc finger protein 16 n=35 Tax=Euteleostomi RepID=PRD16_MOUSE Length = 1275 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 948 KERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 1004 >UniRef50_B8HUB6 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HUB6_CYAP4 Length = 144 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 2/38 (5%) Query: 6 ISCPSCSATDGVVRN--GKSTAGHQRYLCSHCRKTWQL 41 + CP C + G G QR+ C CRKT+ L Sbjct: 107 LQCPYCEGRYLTKKGFTGCYQTGRQRWFCKDCRKTFSL 144 >UniRef50_C1SJT0 Transposase family protein, COG3464 n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJT0_9BACT Length = 436 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 29/98 (29%), Gaps = 20/98 (20%) Query: 2 ASVSISCPSCSATDGVVRNGKSTA----------------GHQRYLCSHCRKTWQLQ-FT 44 S SC C + V ++GK T QRY C C + Sbjct: 30 VSEPESCSGCG-SKPVYKHGKRTHVYADTPMHGMPVKVEIERQRYRCQSCGTVIVPNIPS 88 Query: 45 YTASQPGTHQKI--IDMAMNGVGCRATARIMGVGLNTI 80 + T + I + A G+ +NTI Sbjct: 89 LDEKRVVTKRLIEFVQARCFNNTFTLLANETGLAVNTI 126 >UniRef50_A0Q207 Transcriptional regulator n=3 Tax=Clostridium RepID=A0Q207_CLONN Length = 520 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 10/36 (27%), Positives = 16/36 (44%) Query: 53 HQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSG 88 + II R TA+++GV TI+ +K Sbjct: 481 KEAIIKALKKNKTFRKTAKVLGVSHTTIINKIKKYN 516 >UniRef50_Q9HAZ2 PR domain zinc finger protein 16 n=26 Tax=Euteleostomi RepID=PRD16_HUMAN Length = 1276 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 948 KERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 1004 >UniRef50_B7C761 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7C761_9FIRM Length = 74 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 11/63 (17%), Positives = 22/63 (34%), Gaps = 4/63 (6%) Query: 29 RYLCSHCRKTWQLQ----FTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 + C C+K + + Y+ ++I +NGV + TA + V + Sbjct: 1 MFRCKECKKRFVVDRGQLTFYSHHDQSKWNELILDTLNGVSLKETAAKINVNERNVFNMR 60 Query: 85 KNS 87 Sbjct: 61 HKL 63 >UniRef50_A8A9S5 Transcriptional regulator, AsnC family n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8A9S5_IGNH4 Length = 152 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 11/42 (26%), Positives = 15/42 (35%), Gaps = 2/42 (4%) Query: 51 GTHQKIIDMAMNGV--GCRATARIMGVGLNTILRHLKNSGRS 90 KI+ + M G R AR + V T+ LK Sbjct: 9 ELDYKILSLLMENARKGVREIARELNVSPATVHNRLKKMLSK 50 >UniRef50_A3YWR5 IS1595 transposase n=2 Tax=Synechococcus sp. WH 5701 RepID=A3YWR5_9SYNE Length = 315 Score = 41.9 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 28/91 (30%), Gaps = 10/91 (10%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTY----TASQPGTHQKIIDMAMN 62 CP C + + G+ H+RY C CR L T T + Sbjct: 39 RCPRCEGKEYGLIGGRR---HKRYQCRSCRHQATLTAGTIMEATKLPLTTWFLAFYLVGQ 95 Query: 63 -GVGCRATA--RIMGVGLNTILRHLKNSGRS 90 G + A R +GV T ++ Sbjct: 96 AKTGISSLALMRHLGVNYRTAWLVHNKIMQA 126 >UniRef50_UPI0001793827 PREDICTED: similar to CG5669 CG5669-PA n=1 Tax=Acyrthosiphon pisum RepID=UPI0001793827 Length = 640 Score = 41.9 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 21/61 (34%), Gaps = 7/61 (11%) Query: 10 SCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVG 65 C +D + R+ ++ G +R+ C+ C K + Q + M G Sbjct: 536 HCGKRFTRSDELQRHNRTHTGEKRFQCNECPKRFMR---SDHLQKHVRTHLKQKLMEGNS 592 Query: 66 C 66 Sbjct: 593 V 593 >UniRef50_A5KRX5 ISSpo8, transposase n=2 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KRX5_9BACT Length = 275 Score = 41.9 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 31/90 (34%), Gaps = 11/90 (12%) Query: 6 ISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTY----TASQPGTHQKIIDM-- 59 + CP C D V+ K +G R C+ CR + ++ + I + Sbjct: 32 VVCPKCGEID--VKYYKLASG--RMKCASCRSPFTVRMGSIFEESPVPLQKWFLAIYLCT 87 Query: 60 -AMNGVGCRATARIMGVGLNTILRHLKNSG 88 GV ++ +GV T L+ Sbjct: 88 SLKKGVSSIQLSKYIGVTQKTAWFMLQRIR 117 >UniRef50_UPI0001C34261 hypothetical protein CATC2_13668 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C34261 Length = 387 Score = 41.5 bits (96), Expect = 0.007, Method: Composition-based stats. Identities = 12/35 (34%), Positives = 18/35 (51%), Gaps = 1/35 (2%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 CP C + + R G++ G QR C C+K W + Sbjct: 74 CPDCYQRETI-RYGRNPQGSQRVQCRACKKVWTPK 107 >UniRef50_C3XYB0 Putative uncharacterized protein n=2 Tax=Chordata RepID=C3XYB0_BRAFL Length = 1482 Score = 41.5 bits (96), Expect = 0.007, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 786 KDRYTCRYCGKLFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 842 >UniRef50_B0SXP6 Putative transposase n=1 Tax=Caulobacter sp. K31 RepID=B0SXP6_CAUSK Length = 334 Score = 41.5 bits (96), Expect = 0.007, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 24/89 (26%), Gaps = 12/89 (13%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK----IIDMAMNG 63 C C D V A + + C C K + L + + I + +NG Sbjct: 24 CAHCGC-DAVYEY----AARRIFKCKACEKQFSLTSGTIFASRKLAIRDILTAIALFVNG 78 Query: 64 V---GCRATARIMGVGLNTILRHLKNSGR 89 R + V T L Sbjct: 79 ANGHAALRMGRDLNVSYKTAFVLLHKLRE 107 >UniRef50_Q54X15 Type-2 histone deacetylase 1 n=1 Tax=Dictyostelium discoideum RepID=HDA21_DICDI Length = 1489 Score = 41.5 bits (96), Expect = 0.007, Method: Composition-based stats. Identities = 11/36 (30%), Positives = 14/36 (38%) Query: 7 SCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQ 42 CP R GK+ G QR+ C C W + Sbjct: 32 HCPRIEDVLYSQRKGKTNKGAQRWRCKACGTKWTSK 67 >UniRef50_Q9H5H4 Zinc finger protein 768 n=9 Tax=Theria RepID=ZN768_HUMAN Length = 540 Score = 41.5 bits (96), Expect = 0.007, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 32/80 (40%), Gaps = 11/80 (13%) Query: 5 SISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTW-------QLQFTYTASQPGTH 53 CP C + ++R+ ++ +G + Y C HC K + + Q T++ +P + Sbjct: 316 PYKCPRCGKAFADSSYLLRHQRTHSGQKPYKCPHCGKAFGDSSYLLRHQRTHSHERPYSC 375 Query: 54 QKIIDMAMNGVGCRATARIM 73 + R+ R+ Sbjct: 376 TECGKCYSQNSSLRSHQRVH 395 >UniRef50_C0QSB1 Integrase core domain protein n=1 Tax=Persephonella marina EX-H1 RepID=C0QSB1_PERMH Length = 484 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 9/39 (23%), Positives = 15/39 (38%) Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHL 84 K+I + G R A I G+ +T+ R + Sbjct: 1 MRKNTKAKAKVIRLYSQGFSIRQIADITGISKSTVHRIV 39 >UniRef50_A2FJ98 Transposase family protein n=4 Tax=cellular organisms RepID=A2FJ98_TRIVA Length = 357 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 34/104 (32%), Gaps = 19/104 (18%) Query: 7 SCPSCSAT------DGVVR------NGKS---TAGHQRYLCSHCRKTWQLQFTYTASQPG 51 +CP C + + + GK RY C C ++ + + A Sbjct: 3 TCPHCGSKKIWVHDHRIQKIKDTHIRGKKCLIHLKKTRYDCKSCGCRFERELDFIAKGHT 62 Query: 52 THQK----IIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR 91 + I+ + + A+ V NT+LR L SR Sbjct: 63 MTNRLVFSIVSEFDDVYSISSIAKRYNVSSNTVLRILNCLSVSR 106 >UniRef50_UPI00016C406B hypothetical protein GobsU_27181 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C406B Length = 208 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 4/63 (6%) Query: 31 LCSHCRKTWQLQFTYTAS----QPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKN 86 CS C+ + + T ++ G G R TAR++GV +T+ R+L+ Sbjct: 3 RCSTCKARFSERKGTPLYGIRLSADTVVSVLAHVAEGAGTRKTARLVGVHRDTVTRYLRQ 62 Query: 87 SGR 89 +G Sbjct: 63 AGE 65 >UniRef50_A7BQK2 Transposase n=3 Tax=Bacteria RepID=A7BQK2_9GAMM Length = 391 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 10/35 (28%), Positives = 19/35 (54%) Query: 52 THQKIIDMAMNGVGCRATARIMGVGLNTILRHLKN 86 +II ++ G G R +R++G+ +T+ R K Sbjct: 28 VRSRIILLSDEGFGSRKVSRMLGISRDTVQRWRKR 62 >UniRef50_A1SYK0 Putative uncharacterized protein n=2 Tax=Psychromonas RepID=A1SYK0_PSYIN Length = 453 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 15/88 (17%), Positives = 31/88 (35%), Gaps = 9/88 (10%) Query: 8 CPSCSATDGVVR------NGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 C C+ + +G ++AG QR CS C + L + + I++ M Sbjct: 75 CRKCAQYFFFEKSTKSISHGYTSAGTQRKKCSDCHAVFTLPYF---KNINALRLILNSIM 131 Query: 62 NGVGCRATARIMGVGLNTILRHLKNSGR 89 + + + G+ +L + Sbjct: 132 ANQEIKESIKATGLSARLYYFYLNKLAQ 159 >UniRef50_Q4S840 Chromosome 9 SCAF14710, whole genome shotgun sequence n=5 Tax=Tetraodontidae RepID=Q4S840_TETNG Length = 1167 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 831 KERYACRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 887 >UniRef50_Q03112 MDS1 and EVI1 complex locus protein EVI1 n=58 Tax=Euteleostomi RepID=EVI1_HUMAN Length = 1051 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 3 SVSISCPSCSA----TDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQK 55 +C C + + R+ ++ G Q Y C +C +++ + H K Sbjct: 730 KERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNK 786 >UniRef50_Q3C030 Putative sigma-54-dependent transcriptional regulator n=1 Tax=Xanthomonas campestris pv. vesicatoria str. 85-10 RepID=Q3C030_XANC5 Length = 363 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 9/32 (28%), Positives = 18/32 (56%) Query: 55 KIIDMAMNGVGCRATARIMGVGLNTILRHLKN 86 +++ + +G+ RA A+ +GV T+ R L Sbjct: 332 QVMRLHADGLSMRAIAKHVGVSAATVSRWLNK 363 >UniRef50_B7UNR4 Predicted protein n=51 Tax=Enterobacteriaceae RepID=B7UNR4_ECO27 Length = 374 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 1/54 (1%) Query: 8 CPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAM 61 CP C TD + G + G QR C +C+K W + + + + + Sbjct: 74 CPVCYGTDMIC-YGHNPQGSQRIQCRNCKKVWTPKKYQKEITHPQAIETVQLFI 126 >UniRef50_Q51984 TnpA n=3 Tax=Gammaproteobacteria RepID=Q51984_PSEPU Length = 584 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 28/88 (31%), Gaps = 19/88 (21%) Query: 8 CPSCSATDGVVRNGKSTAG----------------HQRYLCSHCRKTWQLQFTYTASQPG 51 CP+C VVR GK +R++C C T+ A Sbjct: 168 CPNCGVCGEVVRFGKKQVKFRDLPLHGRWVTLWLIRRRFVCRACNTTFSPALPEMAENNR 227 Query: 52 THQKIIDMAMN---GVGCRATARIMGVG 76 Q++ + + G R +GV Sbjct: 228 MTQRLAEYIIKAAVGRTNSDVGREVGVN 255 >UniRef50_C7PCU2 Two component transcriptional regulator, LuxR family n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PCU2_CHIPD Length = 220 Score = 41.1 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 7/45 (15%), Positives = 18/45 (40%) Query: 46 TASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRS 90 T +++ M G+ + A+ + + + T+ H N + Sbjct: 151 HHKLSKTEFRVMQMIAEGMSTKEIAQSLNISIKTVENHRHNISKK 195 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.140 0.495 Lambda K H 0.267 0.0441 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 538,245,450 Number of Sequences: 3077464 Number of extensions: 17501508 Number of successful extensions: 172479 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 297 Number of HSP's successfully gapped in prelim test: 613 Number of HSP's that attempted gapping in prelim test: 169666 Number of HSP's gapped (non-prelim): 2967 length of query: 91 length of database: 1,040,396,356 effective HSP length: 61 effective length of query: 30 effective length of database: 852,671,052 effective search space: 25580131560 effective search space used: 25580131560 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 87 (38.0 bits)