BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements",
Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (94 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

UniRef50_Q9JMS9 Uncharacterized protein yuaK n=3 Tax=Escherichia...  193   2e-48
UniRef50_Q9S116 Orf51 protein n=166 Tax=root RepID=Q9S116_ECOLX       92   4e-18
UniRef50_B3WYX6 Lysyl-tRNA synthetase, heat inducible n=4 Tax=Ga...   92   6e-18
UniRef50_Q31T57 Putative uncharacterized protein n=3 Tax=Enterob...   90   2e-17
UniRef50_Q8GAR2 Orf51 (Fragment) n=2 Tax=Proteobacteria RepID=Q8...   89   6e-17
UniRef50_B1EN06 IS element transposase n=1 Tax=Escherichia alber...   78   9e-14
UniRef50_C2DPG4 Nitrite extrusion protein 2 n=1 Tax=Escherichia ...   70   3e-11
UniRef50_D2WFI4 IS66 transposase n=1 Tax=Escherichia coli O26:H-...   60   3e-08
UniRef50_Q320F8 ISSfl4 ORF3 n=21 Tax=Enterobacteriaceae RepID=Q3...   55   9e-07
UniRef50_C1HRA2 Putative uncharacterized protein n=1 Tax=Escheri...   55   1e-06
UniRef50_B3HDR4 IS66 family element, transposase n=7 Tax=Enterob...   55   1e-06
UniRef50_Q7WTH2 Putative uncharacterized protein n=1 Tax=Escheri...   54   1e-06
UniRef50_Q327R1 Putative uncharacterized protein n=1 Tax=Shigell...   53   3e-06
UniRef50_B5K4C7 Transposase IS66 n=27 Tax=Rhodobacterales RepID=...   51   1e-05
UniRef50_B7LIJ6 Putative uncharacterized protein n=1 Tax=Escheri...   51   1e-05
UniRef50_Q1RPJ6 ECs1339 protein n=175 Tax=Bacteria RepID=Q1RPJ6_...   50   2e-05
UniRef50_Q3ZU23 OrfD, ISEc8 n=37 Tax=Proteobacteria RepID=Q3ZU23...   50   2e-05
UniRef50_B2AJ19 Transposase, IS66 familly n=24 Tax=cellular orga...   48   1e-04
UniRef50_A4JH71 Transposase IS66 n=4 Tax=Proteobacteria RepID=A4...   46   3e-04
UniRef50_C7XFR9 IS66 family transposase n=7 Tax=Bacteroidales Re...   45   7e-04
UniRef50_B5EKE2 Transposase IS66 n=6 Tax=Acidithiobacillus RepID...   44   0.002
UniRef50_A2V4E0 Transposase IS66 n=1 Tax=Shewanella putrefaciens...   44   0.002
UniRef50_D1N8F2 Transposase IS66 n=1 Tax=Victivallis vadensis AT...   42   0.004
UniRef50_A6FSH9 Transposase and inactivated derivative n=7 Tax=B...   42   0.009
UniRef50_Q11ZA7 Transposase IS66 n=6 Tax=Burkholderiales RepID=Q...   40   0.018
UniRef50_C8CGL3 Putative uncharacterized protein n=3 Tax=Escheri...   40   0.020
UniRef50_A9EDY1 Transposase n=3 Tax=Kordia algicida OT-1 RepID=A...   40   0.022
UniRef50_P50360 Uncharacterized protein y4hP n=117 Tax=cellular ...   40   0.022
UniRef50_Q6EZC1 L0015-like protein n=30 Tax=Enterobacteriaceae R...   40   0.024
UniRef50_A7IP73 Transposase IS66 n=23 Tax=Alphaproteobacteria Re...   39   0.064
UniRef50_A9ML85 Putative uncharacterized protein n=1 Tax=Salmone...   38   0.081
UniRef50_D1PFL7 Putative cytOchrome o ubiquinol oxidase, subunit...   38   0.083
UniRef50_Q2W201 Transposase and inactivated derivative n=22 Tax=...   38   0.089

>UniRef50_Q9JMS9 Uncharacterized protein yuaK n=3 Tax=Escherichia coli RepID=YUAK_ECOLI
Length = 94

Score = 193 bits (490), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 94/94 (100%), Positives = 94/94 (100%)

Query: 1  MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60
          MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS
Sbjct: 1  MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60

Query: 61 LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94
          LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL
Sbjct: 61 LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94

>UniRef50_Q9S116 Orf51 protein n=166 Tax=root RepID=Q9S116_ECOLX
Length = 523

Score = 92.4 bits (228), Expect = 4e-18, Method: Composition-based stats.
Identities = 43/50 (86%), Positives = 43/50 (86%)

Query: 45  LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94
            AGS SGGEHAAVLYSLIGTCRLNNVE EKWL Y IEHIQDW AN VRDL
Sbjct: 464 FAGSDSGGEHAAVLYSLIGTCRLNNVEPEKWLRYVIEHIQDWPANRVRDL 513

>UniRef50_B3WYX6 Lysyl-tRNA synthetase, heat inducible n=4 Tax=Gammaproteobacteria RepID=B3WYX6_SHIDY
Length = 258

Score = 91.7 bits (226), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 43/45 (95%), Positives = 44/45 (97%)

Query: 1  MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45
          MVRRLRFSGPKTSIIC+PMTSLKTSIKTITYLSD GCLEIQGASL
Sbjct: 1  MVRRLRFSGPKTSIICTPMTSLKTSIKTITYLSDTGCLEIQGASL 45

>UniRef50_Q31T57 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=Q31T57_SHIBS
Length = 197

Score = 89.7 bits (221), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/45 (95%), Positives = 44/45 (97%)

Query: 1  MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45
          MVRRLRFSGPKTSIIC+PMTSLKTSIKTITYLSD GCLEIQGASL
Sbjct: 1  MVRRLRFSGPKTSIICTPMTSLKTSIKTITYLSDTGCLEIQGASL 45

>UniRef50_Q8GAR2 Orf51 (Fragment) n=2 Tax=Proteobacteria RepID=Q8GAR2_ECOLX
Length = 92

Score = 88.6 bits (218), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 42/50 (84%), Positives = 42/50 (84%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 AGS S GEHAAVLYSLIGTCRLNNVE EKWL Y IEHIQDW AN VRDL Sbjct: 20 FAGSDSSGEHAAVLYSLIGTCRLNNVEPEKWLRYVIEHIQDWPANRVRDL 69 >UniRef50_B1EN06 IS element transposase n=1 Tax=Escherichia albertii TW07627 RepID=B1EN06_9ESCH Length = 89 Score = 78.2 bits (191), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 37/50 (74%), Positives = 38/50 (76%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 A S SGGE AVLYS IGTCRLNNVE EKWL Y IE+IQDW AN RDL Sbjct: 30 FARSDSGGEQPAVLYSQIGTCRLNNVEPEKWLSYVIENIQDWPANRGRDL 79 >UniRef50_C2DPG4 Nitrite extrusion protein 2 n=1 Tax=Escherichia coli 83972 RepID=C2DPG4_ECOLX Length = 77 Score = 69.7 bits (169), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 34/45 (75%), Positives = 39/45 (86%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 +VRRL F+G K +IICS MT+LKT I+TITYLSDIGCLEIQGA L Sbjct: 20 VVRRLHFTGSKLTIICSLMTTLKTCIRTITYLSDIGCLEIQGACL 64 >UniRef50_D2WFI4 IS66 transposase n=1 Tax=Escherichia coli O26:H- RepID=D2WFI4_ECOLX Length = 522 Score = 59.7 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 27/48 (56%), Positives = 34/48 (70%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GS GGE AAV+YSLIG+C+LN +E E WL + I I W AN V++L Sbjct: 465 GSDGGGESAAVMYSLIGSCKLNGIEPETWLRHVISVINTWPANRVKEL 512 >UniRef50_Q320F8 ISSfl4 ORF3 n=21 Tax=Enterobacteriaceae RepID=Q320F8_SHIBS Length = 187 Score = 54.7 bits (130), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 25/48 (52%), Positives = 32/48 (66%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GS GGE AA++YSL+ TC+ N VE E WL IE + DW +N V +L Sbjct: 131 GSDKGGESAAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHEL 178 >UniRef50_C1HRA2 Putative uncharacterized protein n=1 Tax=Escherichia sp. 
3_2_53FAA RepID=C1HRA2_9ESCH Length = 223 Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 26/48 (54%), Positives = 34/48 (70%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GSG GGE A+LYSLIGTC+LN+V+ E +L + + I DW N V +L Sbjct: 166 GSGHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVSEL 213 >UniRef50_B3HDR4 IS66 family element, transposase n=7 Tax=Enterobacteriaceae RepID=B3HDR4_ECOLX Length = 461 Score = 54.7 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 26/48 (54%), Positives = 33/48 (68%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GSG GGE A+LYSLIGTC+LN+V+ E +L + I DW N V +L Sbjct: 404 GSGHGGERGALLYSLIGTCKLNDVDPESYLRHVPGVIADWPVNRVSEL 451 >UniRef50_Q7WTH2 Putative uncharacterized protein n=1 Tax=Escherichia coli RepID=Q7WTH2_ECOLX Length = 80 Score = 54.3 bits (129), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 27/35 (77%), Positives = 28/35 (80%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGE 53 MTSLKTSIKTITYLSD GCLEIQGASL + G Sbjct: 1 MTSLKTSIKTITYLSDTGCLEIQGASLRCTNPRGR 35 >UniRef50_Q327R1 Putative uncharacterized protein n=1 Tax=Shigella dysenteriae Sd197 RepID=Q327R1_SHIDS Length = 70 Score = 52.8 bits (125), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 26/27 (96%), Positives = 26/27 (96%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASL 45 MTSLKTSIKTITYLSD GCLEIQGASL Sbjct: 1 MTSLKTSIKTITYLSDTGCLEIQGASL 27 >UniRef50_B5K4C7 Transposase IS66 n=27 Tax=Rhodobacterales RepID=B5K4C7_9RHOB Length = 552 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. 
Identities = 23/48 (47%), Positives = 32/48 (66%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GS +GG+ AA+ Y+LI T ++N V E WL + +E IQD AN + DL Sbjct: 483 GSEAGGKSAAIAYTLIETAKMNKVNPEAWLAWVLERIQDHPANRINDL 530 >UniRef50_B7LIJ6 Putative uncharacterized protein n=1 Tax=Escherichia coli ED1a RepID=B7LIJ6_ECO81 Length = 85 Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 23/42 (54%), Positives = 30/42 (71%) Query: 53 EHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 E AAV+YSLIG+C+LN +E E WL + I I W AN V++L Sbjct: 25 ESAAVMYSLIGSCKLNGIEPETWLRHVISVINTWPANCVKEL 66 >UniRef50_Q1RPJ6 ECs1339 protein n=175 Tax=Bacteria RepID=Q1RPJ6_ECOLX Length = 537 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 24/58 (41%), Positives = 35/58 (60%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 CL + GS GGE A+LY LIGTCRLN ++ E +L + + + +W +N V +L Sbjct: 470 CLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNRVDEL 527 >UniRef50_Q3ZU23 OrfD, ISEc8 n=37 Tax=Proteobacteria RepID=Q3ZU23_ECOLX Length = 331 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 24/58 (41%), Positives = 35/58 (60%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 CL + GS GGE A+LY LIGTCRLN ++ E +L + + + +W +N V +L Sbjct: 264 CLGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNRVDEL 321 >UniRef50_B2AJ19 Transposase, IS66 familly n=24 Tax=cellular organisms RepID=B2AJ19_CUPTR Length = 523 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 32/50 (64%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 AG+ SGGE AA +YSLIGT +LN V+ E +L + + I D N V +L Sbjct: 461 FAGADSGGERAAAIYSLIGTAKLNGVDPEAYLRFVLARIADHPINRVDEL 510 >UniRef50_A4JH71 Transposase IS66 n=4 Tax=Proteobacteria RepID=A4JH71_BURVG Length = 514 Score = 46.2 bits (108), Expect = 3e-04, Method: Composition-based stats. 
Identities = 24/50 (48%), Positives = 31/50 (62%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 AGS GG+ AAV+YSLIGT RLN++E +L E I D N + +L Sbjct: 449 FAGSDGGGQSAAVIYSLIGTARLNDIEPFAYLHTVFERIADHPINRIDEL 498 >UniRef50_C7XFR9 IS66 family transposase n=7 Tax=Bacteroidales RepID=C7XFR9_9PORP Length = 557 Score = 45.1 bits (105), Expect = 7e-04, Method: Composition-based stats. Identities = 19/48 (39%), Positives = 25/48 (52%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 G+ E AAVLYS G C+ + WL Y +EHI D+ + DL Sbjct: 483 GNNDAAEDAAVLYSFFGCCKAAGADFRTWLIYFLEHIHDYDDDYSMDL 530 >UniRef50_B5EKE2 Transposase IS66 n=6 Tax=Acidithiobacillus RepID=B5EKE2_ACIF5 Length = 545 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 20/50 (40%), Positives = 29/50 (58%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 G+ +GGE AA YS+I TC+LN VE +LC +E + W + +L Sbjct: 485 FVGNDAGGERAASFYSIIETCKLNGVEPFAYLCDVLEKLPTWPNKRLHEL 534 >UniRef50_A2V4E0 Transposase IS66 n=1 Tax=Shewanella putrefaciens 200 RepID=A2V4E0_SHEPU Length = 517 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 21/50 (42%), Positives = 32/50 (64%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 AGS +GGE AAVLY+++GT RLN++ ++L ++ I N V +L Sbjct: 452 FAGSKAGGERAAVLYTILGTARLNDINPNQYLTAVLKRIGQHQINKVDEL 501 >UniRef50_D1N8F2 Transposase IS66 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N8F2_9BACT Length = 496 Score = 42.4 bits (98), Expect = 0.004, Method: Composition-based stats. 
Identities = 26/88 (29%), Positives = 40/88 (45%), Gaps = 10/88 (11%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 + PK +I +P L + I CL AGS +GG+ A+LYS +C+ Sbjct: 409 LNNPKLNIDNNPAERLNRGVAIIRK----NCL------FAGSETGGQRLAILYSFAASCK 458 Query: 67 LNNVELEKWLCYGIEHIQDWSANLVRDL 94 NN+ +WL + + SAN + L Sbjct: 459 ANNICFRQWLEDVLPRLSSTSANQIESL 486 >UniRef50_A6FSH9 Transposase and inactivated derivative n=7 Tax=Bacteria RepID=A6FSH9_9RHOB Length = 509 Score = 41.6 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 18/48 (37%), Positives = 31/48 (64%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GS GG+ AA+ Y+LI T ++N+V+ E WL + ++ + D N + +L Sbjct: 453 GSIGGGKAAAIAYTLIETAKMNDVDPEAWLTWVLQRLPDHKINRIDEL 500 >UniRef50_Q11ZA7 Transposase IS66 n=6 Tax=Burkholderiales RepID=Q11ZA7_POLSJ Length = 537 Score = 40.4 bits (93), Expect = 0.018, Method: Composition-based stats. Identities = 20/48 (41%), Positives = 29/48 (60%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GS GG AAV+Y+LIGT +L + + +L Y +E I D N + +L Sbjct: 469 GSDGGGHTAAVIYTLIGTAKLCGINPQTYLRYVLERIADHPINRIDEL 516 >UniRef50_C8CGL3 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C8CGL3_ECOLX Length = 162 Score = 40.4 bits (93), Expect = 0.020, Method: Compositional matrix adjust. Identities = 23/44 (52%), Positives = 29/44 (65%), Gaps = 4/44 (9%) Query: 2 VRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 ++RL FSGP+TSII L S+KT+ LSD C +I GASL Sbjct: 18 LQRLHFSGPETSII----RILTISLKTVGKLSDASCQDIHGASL 57 >UniRef50_A9EDY1 Transposase n=3 Tax=Kordia algicida OT-1 RepID=A9EDY1_9FLAO Length = 485 Score = 40.0 bits (92), Expect = 0.022, Method: Composition-based stats. 
Identities = 18/50 (36%), Positives = 29/50 (58%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 AGS ++AA++YS TC++N+V KWL E + + AN + +L Sbjct: 425 FAGSHKAAQNAAMMYSFFATCKINDVNPYKWLHDVFERLPEHKANKLEEL 474 >UniRef50_P50360 Uncharacterized protein y4hP n=117 Tax=cellular organisms RepID=Y4HP_RHISN Length = 552 Score = 40.0 bits (92), Expect = 0.022, Method: Composition-based stats. Identities = 17/42 (40%), Positives = 27/42 (64%) Query: 44 SLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + AGS G + AAV+ ++I TCRLN+++ + WL + I D Sbjct: 485 TFAGSQRGADRAAVMLTVITTCRLNDIDPKAWLADVLARIAD 526 >UniRef50_Q6EZC1 L0015-like protein n=30 Tax=Enterobacteriaceae RepID=Q6EZC1_ECOLX Length = 321 Score = 40.0 bits (92), Expect = 0.024, Method: Composition-based stats. Identities = 20/58 (34%), Positives = 31/58 (53%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 CL + G+ GE A+LY L G CRLN ++ E +L + + + W +N V +L Sbjct: 254 CLGKKNYMFFGNDHVGERGALLYGLTGNCRLNGIDPEAYLRHILSVLPKWLSNRVDEL 311 >UniRef50_A7IP73 Transposase IS66 n=23 Tax=Alphaproteobacteria RepID=A7IP73_XANP2 Length = 510 Score = 38.5 bits (88), Expect = 0.064, Method: Composition-based stats. Identities = 19/49 (38%), Positives = 29/49 (59%) Query: 41 QGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + A AGS G EH AV+ SL+ TC+LN+++ + +L I I + N Sbjct: 441 KNALFAGSDGGAEHWAVIASLVETCKLNDIDPQAYLADVITRIVNGHPN 489 >UniRef50_A9ML85 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9ML85_SALAR Length = 94 Score = 38.1 bits (87), Expect = 0.081, Method: Compositional matrix adjust. Identities = 16/27 (59%), Positives = 21/27 (77%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELE 73 GS GGE AA++YSL+G C+LN VE+ Sbjct: 26 GSDRGGEAAAIIYSLLGMCKLNGVEVR 52 >UniRef50_D1PFL7 Putative cytOchrome o ubiquinol oxidase, subunit I n=1 Tax=Prevotella copri DSM 18205 RepID=D1PFL7_9BACT Length = 69 Score = 38.1 bits (87), Expect = 0.083, Method: Compositional matrix adjust. 
Identities = 14/32 (43%), Positives = 23/32 (71%) Query: 53 EHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 + AA++YSL G C++ + E+WLCY ++HI Sbjct: 18 KRAAMMYSLFGACKVLDKNPERWLCYVLKHID 49 >UniRef50_Q2W201 Transposase and inactivated derivative n=22 Tax=Proteobacteria RepID=Q2W201_MAGSA Length = 532 Score = 38.1 bits (87), Expect = 0.089, Method: Composition-based stats. Identities = 19/50 (38%), Positives = 29/50 (58%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 AGS +GG+ AA +Y+L T +LN ++ E +L + I D N + DL Sbjct: 476 FAGSDAGGDRAAAIYTLTETAKLNGLDPEAYLRDVLTRIADHPVNRIADL 525 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q9JMS9 Uncharacterized protein yuaK n=3 Tax=Escherichia... 159 3e-38 UniRef50_Q9S116 Orf51 protein n=166 Tax=root RepID=Q9S116_ECOLX 100 1e-20 UniRef50_Q1RPJ6 ECs1339 protein n=175 Tax=Bacteria RepID=Q1RPJ6_... 99 4e-20 UniRef50_Q8GAR2 Orf51 (Fragment) n=2 Tax=Proteobacteria RepID=Q8... 99 6e-20 UniRef50_Q3ZU23 OrfD, ISEc8 n=37 Tax=Proteobacteria RepID=Q3ZU23... 97 2e-19 UniRef50_D2WFI4 IS66 transposase n=1 Tax=Escherichia coli O26:H-... 96 3e-19 UniRef50_B3WYX6 Lysyl-tRNA synthetase, heat inducible n=4 Tax=Ga... 90 3e-17 UniRef50_Q31T57 Putative uncharacterized protein n=3 Tax=Enterob... 90 3e-17 UniRef50_A4JH71 Transposase IS66 n=4 Tax=Proteobacteria RepID=A4... 89 3e-17 UniRef50_B2AJ19 Transposase, IS66 familly n=24 Tax=cellular orga... 89 4e-17 UniRef50_B1EN06 IS element transposase n=1 Tax=Escherichia alber... 89 5e-17 UniRef50_Q320F8 ISSfl4 ORF3 n=21 Tax=Enterobacteriaceae RepID=Q3... 87 1e-16 UniRef50_B5EKE2 Transposase IS66 n=6 Tax=Acidithiobacillus RepID... 87 2e-16 UniRef50_C1HRA2 Putative uncharacterized protein n=1 Tax=Escheri... 87 2e-16 UniRef50_B5K4C7 Transposase IS66 n=27 Tax=Rhodobacterales RepID=... 85 4e-16 UniRef50_B3HDR4 IS66 family element, transposase n=7 Tax=Enterob... 
84 1e-15 UniRef50_A2V4E0 Transposase IS66 n=1 Tax=Shewanella putrefaciens... 80 2e-14 UniRef50_C2DPG4 Nitrite extrusion protein 2 n=1 Tax=Escherichia ... 79 5e-14 UniRef50_C7XFR9 IS66 family transposase n=7 Tax=Bacteroidales Re... 76 3e-13 UniRef50_B7LIJ6 Putative uncharacterized protein n=1 Tax=Escheri... 74 1e-12 UniRef50_Q7WTH2 Putative uncharacterized protein n=1 Tax=Escheri... 62 6e-09 UniRef50_Q327R1 Putative uncharacterized protein n=1 Tax=Shigell... 53 2e-06 Sequences not found previously or not previously below threshold: UniRef50_Q6EZC1 L0015-like protein n=30 Tax=Enterobacteriaceae R... 85 6e-16 UniRef50_Q11ZA7 Transposase IS66 n=6 Tax=Burkholderiales RepID=Q... 82 6e-15 UniRef50_Q2W201 Transposase and inactivated derivative n=22 Tax=... 81 1e-14 UniRef50_A6FSH9 Transposase and inactivated derivative n=7 Tax=B... 80 2e-14 UniRef50_Q07SJ6 Putative uncharacterized protein n=4 Tax=Rhizobi... 76 4e-13 UniRef50_B9TBF4 Putative uncharacterized protein n=1 Tax=Ricinus... 75 7e-13 UniRef50_Q5P8K4 Transposase n=7 Tax=Proteobacteria RepID=Q5P8K4_... 74 1e-12 UniRef50_A9EDY1 Transposase n=3 Tax=Kordia algicida OT-1 RepID=A... 73 2e-12 UniRef50_P50360 Uncharacterized protein y4hP n=117 Tax=cellular ... 72 5e-12 UniRef50_A6BYX4 TnpC protein n=2 Tax=Planctomyces maris DSM 8797... 72 8e-12 UniRef50_Q1NGZ9 TnpC protein n=5 Tax=Sphingomonas RepID=Q1NGZ9_9... 70 3e-11 UniRef50_A7HGL2 Transposase IS66 n=7 Tax=Cystobacterineae RepID=... 69 3e-11 UniRef50_A7IP73 Transposase IS66 n=23 Tax=Alphaproteobacteria Re... 68 7e-11 UniRef50_C6MJH1 Transposase IS66 n=1 Tax=Nitrosomonas sp. AL212 ... 67 1e-10 UniRef50_A1ZEN9 TnpC protein n=6 Tax=Microscilla marina ATCC 231... 67 2e-10 UniRef50_C8QDU0 Transposase IS66 n=9 Tax=Enterobacteriaceae RepI... 65 6e-10 UniRef50_Q322B9 ISSfl3 orfC n=24 Tax=Enterobacteriaceae RepID=Q3... 65 6e-10 UniRef50_A6WXB5 Transposase IS66 n=39 Tax=Alphaproteobacteria Re... 65 7e-10 UniRef50_C5SQQ1 Transposase IS66 n=1 Tax=Asticcacaulis excentric... 
64 1e-09 UniRef50_A9G0V4 Transposase n=6 Tax=Sorangium cellulosum 'So ce ... 64 2e-09 UniRef50_C7I5Z5 Transposase IS66 n=2 Tax=Thiomonas intermedia K1... 63 3e-09 UniRef50_D1UHS9 Transposase n=2 Tax=Burkholderiales RepID=D1UHS9... 63 3e-09 UniRef50_D2LKV6 Transposase IS66 n=1 Tax=Rhodomicrobium vannieli... 62 5e-09 UniRef50_A3X2G6 Putative transposase n=5 Tax=Alphaproteobacteria... 62 8e-09 UniRef50_Q1D8I4 Transposase, IS66 family, truncated n=1 Tax=Myxo... 61 1e-08 UniRef50_C8SL35 Putative uncharacterized protein n=1 Tax=Mesorhi... 61 1e-08 UniRef50_Q1VRT6 Transposase n=5 Tax=Psychroflexus torquis ATCC 7... 58 8e-08 UniRef50_D1N8F2 Transposase IS66 n=1 Tax=Victivallis vadensis AT... 58 1e-07 UniRef50_B4D7C9 Transposase IS66 n=5 Tax=Chthoniobacter flavus E... 58 1e-07 UniRef50_P55630 Uncharacterized protein y4qI n=19 Tax=Alphaprote... 57 2e-07 UniRef50_C9CRY0 Transposase IS66 n=2 Tax=Silicibacter sp. TrichC... 57 2e-07 UniRef50_B9TJY8 Putative uncharacterized protein n=1 Tax=Ricinus... 57 2e-07 UniRef50_D0TYT2 Integron integrase n=1 Tax=Bacteroides sp. 2_1_2... 56 3e-07 UniRef50_C3QM77 Transposase IS66 n=1 Tax=Bacteroides sp. D1 RepI... 56 3e-07 UniRef50_UPI00003825AD COG3436: Transposase and inactivated deri... 56 3e-07 UniRef50_UPI0001C38241 transposase IS66 n=1 Tax=Arthrospira plat... 56 3e-07 UniRef50_D1PFL7 Putative cytOchrome o ubiquinol oxidase, subunit... 56 4e-07 UniRef50_A6KXM9 Transposase n=23 Tax=Bacteroides RepID=A6KXM9_BACV8 56 4e-07 UniRef50_Q08RD8 Transposase IS66 family n=6 Tax=Stigmatella aura... 56 4e-07 UniRef50_A1VVP8 Transposase IS66 n=8 Tax=Proteobacteria RepID=A1... 56 4e-07 UniRef50_Q3IV06 Transposase IS66 n=9 Tax=Bacteria RepID=Q3IV06_R... 55 5e-07 UniRef50_A3X278 Putative transposase n=1 Tax=Nitrobacter sp. Nb-... 55 5e-07 UniRef50_A3ZP54 Putative uncharacterized protein n=1 Tax=Blastop... 55 5e-07 UniRef50_Q08VL0 Transposase and inactivated derivative n=13 Tax=... 
55 6e-07 UniRef50_B9K420 Transposase n=13 Tax=Alphaproteobacteria RepID=B... 55 6e-07 UniRef50_Q2YKH5 Transposase IS66 family n=37 Tax=Brucella RepID=... 55 9e-07 UniRef50_C3RGW7 Transposase n=13 Tax=Bacteroides RepID=C3RGW7_9BACE 55 9e-07 UniRef50_C4ZMM2 Transposase IS66 n=23 Tax=Proteobacteria RepID=C... 54 2e-06 UniRef50_A4XN94 Transposase IS66 n=25 Tax=Gammaproteobacteria Re... 54 2e-06 UniRef50_A9HSI6 Probable insertion sequence transposase protein ... 53 2e-06 UniRef50_B8FCN1 Transposase IS66 n=18 Tax=Bacteria RepID=B8FCN1_... 53 3e-06 UniRef50_C6MXH4 Transposase IS66 n=3 Tax=Legionella drancourtii ... 53 3e-06 UniRef50_UPI000197B598 hypothetical protein BACCOPRO_01649 n=1 T... 52 4e-06 UniRef50_P55504 Uncharacterized protein y4jD n=13 Tax=cellular o... 52 5e-06 UniRef50_B9JRL5 Transposase n=55 Tax=cellular organisms RepID=B9... 52 6e-06 UniRef50_D1W152 IS66 family element, transposase n=3 Tax=Prevote... 52 6e-06 UniRef50_Q1NKD7 ISPsy5, transposase n=1 Tax=delta proteobacteriu... 52 8e-06 UniRef50_B4S337 Transposase n=3 Tax=Alteromonas macleodii 'Deep ... 51 9e-06 UniRef50_A6KWF8 Transposase n=7 Tax=Bacteroides RepID=A6KWF8_BACV8 51 9e-06 UniRef50_D1PX91 Cytochrome o ubiquinol oxidase n=1 Tax=Prevotell... 51 1e-05 UniRef50_Q1GW50 Transposase and inactivated derivative n=2 Tax=S... 51 1e-05 UniRef50_Q5P882 IS66 Orf1 transposase n=5 Tax=cellular organisms... 50 2e-05 UniRef50_A9GG21 Transposase n=1 Tax=Sorangium cellulosum 'So ce ... 50 2e-05 UniRef50_C0ZIN3 Putative uncharacterized protein n=1 Tax=Breviba... 50 2e-05 UniRef50_UPI00016C41EC ISPpu13, transposase Orf2 n=1 Tax=Gemmata... 50 2e-05 UniRef50_Q3IV15 Putative transposase n=1 Tax=Rhodobacter sphaero... 50 2e-05 UniRef50_A5ZJI7 Putative uncharacterized protein n=1 Tax=Bactero... 50 2e-05 UniRef50_B8KMJ5 ISPsy5, transposase n=1 Tax=gamma proteobacteriu... 
50 2e-05 UniRef50_C6ZBZ0 Transposase n=26 Tax=Bacteroides RepID=C6ZBZ0_9BACE 50 2e-05 UniRef50_A9ML85 Putative uncharacterized protein n=1 Tax=Salmone... 50 2e-05 UniRef50_D2QZY9 Transposase IS66 n=1 Tax=Pirellula staleyi DSM 6... 50 2e-05 UniRef50_Q13ZD6 Transposase ISPpu14 orf3 like, IS66 family n=51 ... 50 3e-05 UniRef50_C8WYG4 Transposase IS66 n=5 Tax=Bacteria RepID=C8WYG4_A... 50 3e-05 UniRef50_A8YU80 Transposase ORF_C n=22 Tax=Lactobacillales RepID... 50 3e-05 UniRef50_C6N6I2 Truncated transposase IS66 n=2 Tax=Legionella Re... 50 3e-05 UniRef50_C0QE05 Transposase n=3 Tax=Deltaproteobacteria RepID=C0... 49 4e-05 UniRef50_B6EI61 Transposase n=28 Tax=Gammaproteobacteria RepID=B... 49 4e-05 UniRef50_A3DCS3 Transposase IS66 n=7 Tax=Clostridiales RepID=A3D... 49 4e-05 UniRef50_Q2K2K4 Putative insertion sequence transposase protein ... 49 4e-05 UniRef50_A6L6H8 Transposase n=8 Tax=Bacteroidales RepID=A6L6H8_B... 49 5e-05 UniRef50_UPI0001BC30F4 transposase n=1 Tax=Butyrivibrio crossotu... 48 6e-05 UniRef50_A5WEB8 Transposase IS66 n=20 Tax=Proteobacteria RepID=A... 48 7e-05 UniRef50_C4ZDI2 Transposase n=7 Tax=Clostridiales RepID=C4ZDI2_E... 48 8e-05 UniRef50_C7XG25 Transposase n=7 Tax=Bacteroidales RepID=C7XG25_9... 48 9e-05 UniRef50_A9HWT3 Probable insertion sequence transposase protein ... 48 1e-04 UniRef50_B8I6W2 Transposase IS66 n=3 Tax=Clostridiales RepID=B8I... 48 1e-04 UniRef50_A6TJJ0 Transposase IS66 n=1 Tax=Alkaliphilus metallired... 48 1e-04 UniRef50_A8S0A7 Putative uncharacterized protein n=1 Tax=Clostri... 47 2e-04 UniRef50_B9YEG7 Putative uncharacterized protein n=1 Tax=Holdema... 47 2e-04 UniRef50_B5JDW6 Transposase IS66 family n=1 Tax=Verrucomicrobiae... 47 2e-04 UniRef50_Q0ABQ0 Integron integrase n=29 Tax=Proteobacteria RepID... 47 2e-04 UniRef50_UPI0001C376A4 transposase IS66 n=2 Tax=Ruminococcus fla... 47 3e-04 UniRef50_UPI00016C567B ISPsy5, transposase n=1 Tax=Gemmata obscu... 
47 3e-04 UniRef50_A8VYR6 Transposase and inactivated derivatives-like pro... 46 4e-04 UniRef50_B7CDZ5 Putative uncharacterized protein n=1 Tax=Eubacte... 46 4e-04 UniRef50_Q24RY6 Putative uncharacterized protein n=1 Tax=Desulfi... 46 4e-04 UniRef50_D1JMC2 Transposase n=1 Tax=Bacteroides sp. 2_1_16 RepID... 46 4e-04 UniRef50_A6NXJ3 Putative uncharacterized protein n=1 Tax=Bactero... 46 4e-04 UniRef50_UPI0001C34DE2 transposase IS66 n=4 Tax=Clostridium sp. ... 46 4e-04 UniRef50_C6LKU9 ISPsy5, transposase n=1 Tax=Bryantella formatexi... 45 5e-04 UniRef50_Q07GD7 Putative uncharacterized protein n=1 Tax=Roseoba... 45 7e-04 UniRef50_Q1A683 Transposase (Fragment) n=4 Tax=Clostridiales Rep... 45 8e-04 UniRef50_A0LAY3 Transposase IS66 n=7 Tax=Magnetococcus sp. MC-1 ... 45 9e-04 UniRef50_A6WIY2 Transposase IS66 n=9 Tax=Gammaproteobacteria Rep... 45 0.001 UniRef50_A3JL62 ISPpu15, transposase n=2 Tax=Marinobacter sp. EL... 45 0.001 UniRef50_A9AMV4 ISBmu30 transposase n=26 Tax=Proteobacteria RepI... 45 0.001 UniRef50_B2AIZ9 Fused transposase IS66/IS21 n=38 Tax=Proteobacte... 45 0.001 UniRef50_D0DW05 Transposase IS66 n=1 Tax=Lactobacillus fermentum... 44 0.001 UniRef50_A3JTL7 Probable insertion sequence transposase protein ... 44 0.001 UniRef50_C4I9X2 Fused transposase IS66/IS21 n=8 Tax=Burkholderia... 44 0.002 UniRef50_Q0AUV2 Transposase and inactivated derivatives-like pro... 44 0.002 UniRef50_A6LHQ0 Transposase n=2 Tax=Bacteroidales RepID=A6LHQ0_P... 44 0.002 UniRef50_A0LPQ3 ISPpu15, transposase Orf2 n=1 Tax=Syntrophobacte... 44 0.002 UniRef50_UPI0001744C57 ISPpu14, transposase Orf3 n=1 Tax=Verruco... 43 0.002 UniRef50_A9EDB3 Putative transposase n=1 Tax=Oceanibulbus indoli... 43 0.002 UniRef50_B3PIF8 IS66 family element, transposase n=8 Tax=Bacteri... 43 0.002 UniRef50_B9KMR1 Transposase IS66 n=14 Tax=Alphaproteobacteria Re... 43 0.003 UniRef50_A9DHW8 Putative uncharacterized protein n=10 Tax=Shewan... 
43 0.003 UniRef50_B3WDV3 Putative uncharacterized protein n=1 Tax=Lactoba... 43 0.003 UniRef50_Q6LRS5 Hypothetical transposase n=10 Tax=Gammaproteobac... 43 0.003 UniRef50_D1PHR2 Putative transposase IS66 n=1 Tax=Prevotella cop... 43 0.004 UniRef50_C0ETK9 Putative uncharacterized protein n=1 Tax=Eubacte... 43 0.004 UniRef50_C9K6B9 IS66 family transposase n=1 Tax=Sphingomonas sp.... 42 0.005 UniRef50_C9LAQ6 ISPpu13, transposase Orf2 n=15 Tax=Clostridiales... 42 0.005 UniRef50_A5FT16 Putative uncharacterized protein n=2 Tax=Acidiph... 42 0.006 UniRef50_C6LKX4 Transposase IS66 n=13 Tax=Clostridiales RepID=C6... 42 0.006 UniRef50_UPI0001C34DDA transposase n=1 Tax=Clostridium sp. M62/1... 42 0.007 UniRef50_C2EYW0 Putative uncharacterized protein n=2 Tax=Lactoba... 42 0.007 UniRef50_UPI0001C34846 hypothetical protein PretD1_08486 n=2 Tax... 42 0.008 UniRef50_A3JZ99 Putative transposase n=1 Tax=Sagittula stellata ... 41 0.010 UniRef50_A1B0T3 Putative uncharacterized protein n=1 Tax=Paracoc... 41 0.010 UniRef50_Q2K2A5 Hypothetical conserved protein n=1 Tax=Rhizobium... 41 0.014 UniRef50_A6LGZ8 Putative uncharacterized protein n=1 Tax=Parabac... 41 0.014 UniRef50_UPI00017448EA transposase IS66 n=4 Tax=Verrucomicrobium... 41 0.015 UniRef50_C8CGL3 Putative uncharacterized protein n=3 Tax=Escheri... 40 0.017 UniRef50_A2UYM3 Conserved hypothetical ISPpu15, transposase n=1 ... 40 0.018 UniRef50_A8S332 Putative uncharacterized protein (Fragment) n=3 ... 40 0.020 UniRef50_B0UII5 Putative uncharacterized protein n=1 Tax=Methylo... 40 0.022 UniRef50_B3DSN6 Transposase n=4 Tax=Bifidobacterium longum RepID... 40 0.029 UniRef50_Q8VSS7 Putative uncharacterized protein n=2 Tax=Bactero... 40 0.030 UniRef50_B0NH70 Putative uncharacterized protein n=2 Tax=Clostri... 39 0.058 >UniRef50_Q9JMS9 Uncharacterized protein yuaK n=3 Tax=Escherichia coli RepID=YUAK_ECOLI Length = 94 Score = 159 bits (402), Expect = 3e-38, Method: Composition-based stats. 
Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS Sbjct: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60 Query: 61 LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL Sbjct: 61 LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 >UniRef50_Q9S116 Orf51 protein n=166 Tax=root RepID=Q9S116_ECOLX Length = 523 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. Identities = 43/57 (75%), Positives = 45/57 (78%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + AGS SGGEHAAVLYSLIGTCRLNNVE EKWL Y IEHIQDW AN VRDL Sbjct: 457 VGRKNWLFAGSDSGGEHAAVLYSLIGTCRLNNVEPEKWLRYVIEHIQDWPANRVRDL 513 >UniRef50_Q1RPJ6 ECs1339 protein n=175 Tax=Bacteria RepID=Q1RPJ6_ECOLX Length = 537 Score = 99.3 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 24/58 (41%), Positives = 35/58 (60%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 CL + GS GGE A+LY LIGTCRLN ++ E +L + + + +W +N V +L Sbjct: 470 CLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNRVDEL 527 >UniRef50_Q8GAR2 Orf51 (Fragment) n=2 Tax=Proteobacteria RepID=Q8GAR2_ECOLX Length = 92 Score = 98.5 bits (244), Expect = 6e-20, Method: Composition-based stats. Identities = 42/57 (73%), Positives = 44/57 (77%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + AGS S GEHAAVLYSLIGTCRLNNVE EKWL Y IEHIQDW AN VRDL Sbjct: 13 VGRKNWLFAGSDSSGEHAAVLYSLIGTCRLNNVEPEKWLRYVIEHIQDWPANRVRDL 69 >UniRef50_Q3ZU23 OrfD, ISEc8 n=37 Tax=Proteobacteria RepID=Q3ZU23_ECOLX Length = 331 Score = 96.6 bits (239), Expect = 2e-19, Method: Composition-based stats. 
Identities = 24/58 (41%), Positives = 35/58 (60%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 CL + GS GGE A+LY LIGTCRLN ++ E +L + + + +W +N V +L Sbjct: 264 CLGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNRVDEL 321 >UniRef50_D2WFI4 IS66 transposase n=1 Tax=Escherichia coli O26:H- RepID=D2WFI4_ECOLX Length = 522 Score = 96.2 bits (238), Expect = 3e-19, Method: Composition-based stats. Identities = 28/57 (49%), Positives = 36/57 (63%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + GS GGE AAV+YSLIG+C+LN +E E WL + I I W AN V++L Sbjct: 456 LGRRNYMFFGSDGGGESAAVMYSLIGSCKLNGIEPETWLRHVISVINTWPANRVKEL 512 >UniRef50_B3WYX6 Lysyl-tRNA synthetase, heat inducible n=4 Tax=Gammaproteobacteria RepID=B3WYX6_SHIDY Length = 258 Score = 89.7 bits (221), Expect = 3e-17, Method: Composition-based stats. Identities = 43/45 (95%), Positives = 44/45 (97%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 MVRRLRFSGPKTSIIC+PMTSLKTSIKTITYLSD GCLEIQGASL Sbjct: 1 MVRRLRFSGPKTSIICTPMTSLKTSIKTITYLSDTGCLEIQGASL 45 >UniRef50_Q31T57 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=Q31T57_SHIBS Length = 197 Score = 89.7 bits (221), Expect = 3e-17, Method: Composition-based stats. Identities = 43/45 (95%), Positives = 44/45 (97%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 MVRRLRFSGPKTSIIC+PMTSLKTSIKTITYLSD GCLEIQGASL Sbjct: 1 MVRRLRFSGPKTSIICTPMTSLKTSIKTITYLSDTGCLEIQGASL 45 >UniRef50_A4JH71 Transposase IS66 n=4 Tax=Proteobacteria RepID=A4JH71_BURVG Length = 514 Score = 89.3 bits (220), Expect = 3e-17, Method: Composition-based stats. 
Identities = 25/57 (43%), Positives = 33/57 (57%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS GG+ AAV+YSLIGT RLN++E +L E I D N + +L Sbjct: 442 LGRRNYLFAGSDGGGQSAAVIYSLIGTARLNDIEPFAYLHTVFERIADHPINRIDEL 498 >UniRef50_B2AJ19 Transposase, IS66 familly n=24 Tax=cellular organisms RepID=B2AJ19_CUPTR Length = 523 Score = 89.3 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 24/57 (42%), Positives = 34/57 (59%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + AG+ SGGE AA +YSLIGT +LN V+ E +L + + I D N V +L Sbjct: 454 IGRRNYLFAGADSGGERAAAIYSLIGTAKLNGVDPEAYLRFVLARIADHPINRVDEL 510 >UniRef50_B1EN06 IS element transposase n=1 Tax=Escherichia albertii TW07627 RepID=B1EN06_9ESCH Length = 89 Score = 88.9 bits (219), Expect = 5e-17, Method: Composition-based stats. Identities = 37/55 (67%), Positives = 39/55 (70%) Query: 40 IQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + A S SGGE AVLYS IGTCRLNNVE EKWL Y IE+IQDW AN RDL Sbjct: 25 RKNWLFARSDSGGEQPAVLYSQIGTCRLNNVEPEKWLSYVIENIQDWPANRGRDL 79 >UniRef50_Q320F8 ISSfl4 ORF3 n=21 Tax=Enterobacteriaceae RepID=Q320F8_SHIBS Length = 187 Score = 87.4 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 25/57 (43%), Positives = 34/57 (59%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + GS GGE AA++YSL+ TC+ N VE E WL IE + DW +N V +L Sbjct: 122 VGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEVEPEDWLREVIEKLNDWPSNQVHEL 178 >UniRef50_B5EKE2 Transposase IS66 n=6 Tax=Acidithiobacillus RepID=B5EKE2_ACIF5 Length = 545 Score = 87.0 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 20/57 (35%), Positives = 31/57 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + G+ +GGE AA YS+I TC+LN VE +LC +E + W + +L Sbjct: 478 IGRKNFLFVGNDAGGERAASFYSIIETCKLNGVEPFAYLCDVLEKLPTWPNKRLHEL 534 >UniRef50_C1HRA2 Putative uncharacterized protein n=1 Tax=Escherichia sp. 
3_2_53FAA RepID=C1HRA2_9ESCH Length = 223 Score = 86.6 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 27/56 (48%), Positives = 35/56 (62%) Query: 39 EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 E GSG GGE A+LYSLIGTC+LN+V+ E +L + + I DW N V +L Sbjct: 158 EKMKTLFFGSGHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVSEL 213 >UniRef50_B5K4C7 Transposase IS66 n=27 Tax=Rhodobacterales RepID=B5K4C7_9RHOB Length = 552 Score = 85.4 bits (210), Expect = 4e-16, Method: Composition-based stats. Identities = 23/57 (40%), Positives = 34/57 (59%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + GS +GG+ AA+ Y+LI T ++N V E WL + +E IQD AN + DL Sbjct: 474 VGRKNYLFMGSEAGGKSAAIAYTLIETAKMNKVNPEAWLAWVLERIQDHPANRINDL 530 >UniRef50_Q6EZC1 L0015-like protein n=30 Tax=Enterobacteriaceae RepID=Q6EZC1_ECOLX Length = 321 Score = 85.4 bits (210), Expect = 6e-16, Method: Composition-based stats. Identities = 20/58 (34%), Positives = 31/58 (53%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 CL + G+ GE A+LY L G CRLN ++ E +L + + + W +N V +L Sbjct: 254 CLGKKNYMFFGNDHVGERGALLYGLTGNCRLNGIDPEAYLRHILSVLPKWLSNRVDEL 311 >UniRef50_B3HDR4 IS66 family element, transposase n=7 Tax=Enterobacteriaceae RepID=B3HDR4_ECOLX Length = 461 Score = 83.9 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 27/56 (48%), Positives = 34/56 (60%) Query: 39 EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 E GSG GGE A+LYSLIGTC+LN+V+ E +L + I DW N V +L Sbjct: 396 EKMKTLFFGSGHGGERGALLYSLIGTCKLNDVDPESYLRHVPGVIADWPVNRVSEL 451 >UniRef50_Q11ZA7 Transposase IS66 n=6 Tax=Burkholderiales RepID=Q11ZA7_POLSJ Length = 537 Score = 82.0 bits (201), Expect = 6e-15, Method: Composition-based stats. 
Identities = 20/57 (35%), Positives = 31/57 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + GS GG AAV+Y+LIGT +L + + +L Y +E I D N + +L Sbjct: 460 IGRKNYLHFGSDGGGHTAAVIYTLIGTAKLCGINPQTYLRYVLERIADHPINRIDEL 516 >UniRef50_Q2W201 Transposase and inactivated derivative n=22 Tax=Proteobacteria RepID=Q2W201_MAGSA Length = 532 Score = 80.8 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 20/57 (35%), Positives = 31/57 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS +GG+ AA +Y+L T +LN ++ E +L + I D N + DL Sbjct: 469 LGRKNWLFAGSDAGGDRAAAIYTLTETAKLNGLDPEAYLRDVLTRIADHPVNRIADL 525 >UniRef50_A6FSH9 Transposase and inactivated derivative n=7 Tax=Bacteria RepID=A6FSH9_9RHOB Length = 509 Score = 80.0 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 33/57 (57%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + GS GG+ AA+ Y+LI T ++N+V+ E WL + ++ + D N + +L Sbjct: 444 LGRKNYLFMGSIGGGKAAAIAYTLIETAKMNDVDPEAWLTWVLQRLPDHKINRIDEL 500 >UniRef50_A2V4E0 Transposase IS66 n=1 Tax=Shewanella putrefaciens 200 RepID=A2V4E0_SHEPU Length = 517 Score = 80.0 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 22/57 (38%), Positives = 34/57 (59%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS +GGE AAVLY+++GT RLN++ ++L ++ I N V +L Sbjct: 445 LGRKNYLFAGSKAGGERAAVLYTILGTARLNDINPNQYLTAVLKRIGQHQINKVDEL 501 >UniRef50_C2DPG4 Nitrite extrusion protein 2 n=1 Tax=Escherichia coli 83972 RepID=C2DPG4_ECOLX Length = 77 Score = 78.9 bits (193), Expect = 5e-14, Method: Composition-based stats. 
Identities = 34/45 (75%), Positives = 39/45 (86%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 +VRRL F+G K +IICS MT+LKT I+TITYLSDIGCLEIQGA L Sbjct: 20 VVRRLHFTGSKLTIICSLMTTLKTCIRTITYLSDIGCLEIQGACL 64 >UniRef50_C7XFR9 IS66 family transposase n=7 Tax=Bacteroidales RepID=C7XFR9_9PORP Length = 557 Score = 76.2 bits (186), Expect = 3e-13, Method: Composition-based stats. Identities = 19/56 (33%), Positives = 26/56 (46%) Query: 39 EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + G+ E AAVLYS G C+ + WL Y +EHI D+ + DL Sbjct: 475 GRRNYLFCGNNDAAEDAAVLYSFFGCCKAAGADFRTWLIYFLEHIHDYDDDYSMDL 530 >UniRef50_Q07SJ6 Putative uncharacterized protein n=4 Tax=Rhizobiales RepID=Q07SJ6_RHOP5 Length = 95 Score = 75.8 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 18/57 (31%), Positives = 31/57 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + GS GG+ AA++YSLI T ++N+V+ + WL + I + + +L Sbjct: 23 LGRKSWLFCGSDRGGDRAALMYSLIVTAKMNDVDPQAWLADVLARIAEHPVQRLDEL 79 >UniRef50_B9TBF4 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TBF4_RICCO Length = 103 Score = 75.0 bits (183), Expect = 7e-13, Method: Composition-based stats. Identities = 18/57 (31%), Positives = 30/57 (52%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + GS +GG+ AAV YS++ TCR N++ +L + I D N + +L Sbjct: 15 IARKNHLFFGSEAGGKVAAVFYSMLATCRANDINPYDYLSDVLGRINDHPINRIEEL 71 >UniRef50_B7LIJ6 Putative uncharacterized protein n=1 Tax=Escherichia coli ED1a RepID=B7LIJ6_ECO81 Length = 85 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. 
Identities = 25/50 (50%), Positives = 32/50 (64%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 LA E AAV+YSLIG+C+LN +E E WL + I I W AN V++L Sbjct: 17 LAVRHPTTESAAVMYSLIGSCKLNGIEPETWLRHVISVINTWPANCVKEL 66 >UniRef50_Q5P8K4 Transposase n=7 Tax=Proteobacteria RepID=Q5P8K4_AZOSE Length = 525 Score = 73.9 bits (180), Expect = 1e-12, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 29/57 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS + G+ AAV+ SL+ T R N E WL +E + W + + +L Sbjct: 458 LGKKNWMFAGSEAAGKRAAVIQSLLATARANGFEPLAWLSDTLEKLPAWPNSRIDEL 514 >UniRef50_A9EDY1 Transposase n=3 Tax=Kordia algicida OT-1 RepID=A9EDY1_9FLAO Length = 485 Score = 73.1 bits (178), Expect = 2e-12, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 31/57 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS ++AA++YS TC++N+V KWL E + + AN + +L Sbjct: 418 LGRKNYLFAGSHKAAQNAAMMYSFFATCKINDVNPYKWLHDVFERLPEHKANKLEEL 474 >UniRef50_P50360 Uncharacterized protein y4hP n=117 Tax=cellular organisms RepID=Y4HP_RHISN Length = 552 Score = 72.3 bits (176), Expect = 5e-12, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 32/57 (56%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + + AGS G + AAV+ ++I TCRLN+++ + WL + I D + +L Sbjct: 479 LGRRNWTFAGSQRGADRAAVMLTVITTCRLNDIDPKAWLADVLARIADHPVTRLYEL 535 >UniRef50_A6BYX4 TnpC protein n=2 Tax=Planctomyces maris DSM 8797 RepID=A6BYX4_9PLAN Length = 507 Score = 71.6 bits (174), Expect = 8e-12, Method: Composition-based stats. 
Identities = 20/57 (35%), Positives = 31/57 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + GS GGE AAV YSL+ +C+ N VE +L + I D +A+ + +L Sbjct: 433 IGRKNYLFVGSDRGGEAAAVHYSLMASCKANEVEPFAYLRDVLAQITDHAADRLEEL 489 >UniRef50_Q1NGZ9 TnpC protein n=5 Tax=Sphingomonas RepID=Q1NGZ9_9SPHN Length = 517 Score = 70.0 bits (170), Expect = 3e-11, Method: Composition-based stats. Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ-DWSANLVRDL 94 L + AGS +GG+ AA +YS+I T +LN +E + ++ I I +W A +L Sbjct: 444 LGRKNWLFAGSKAGGDRAAAIYSVIETAKLNGLEPQAYIADVIARIAGNWPATRWDEL 501 >UniRef50_A7HGL2 Transposase IS66 n=7 Tax=Cystobacterineae RepID=A7HGL2_ANADF Length = 515 Score = 69.3 bits (168), Expect = 3e-11, Method: Composition-based stats. Identities = 18/60 (30%), Positives = 28/60 (46%) Query: 35 IGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + L + G+ GE+ A LYSLI TC N V +L + +Q A+ + +L Sbjct: 442 VAALGRKNFLFVGTNEAGENLAGLYSLIATCEANGVNPVDYLADVLIRVQTHPASQIDEL 501 >UniRef50_A7IP73 Transposase IS66 n=23 Tax=Alphaproteobacteria RepID=A7IP73_XANP2 Length = 510 Score = 68.1 bits (165), Expect = 7e-11, Method: Composition-based stats. Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 L + A AGS G EH AV+ SL+ TC+LN+++ + +L I I + + DL Sbjct: 438 LTRKNALFAGSDGGAEHWAVIASLVETCKLNDIDPQAYLADVITRIVNGHPNSRIDDL 495 >UniRef50_C6MJH1 Transposase IS66 n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MJH1_9PROT Length = 527 Score = 67.3 bits (163), Expect = 1e-10, Method: Composition-based stats. 
Identities = 14/57 (24%), Positives = 27/57 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + GS G+ AA + +L+GT +LN + WL + + W + + +L Sbjct: 456 IGKKNWLFTGSQRAGQRAANIQTLLGTAQLNGLNPGAWLNDILTKLPTWPNSRIDEL 512 >UniRef50_A1ZEN9 TnpC protein n=6 Tax=Microscilla marina ATCC 23134 RepID=A1ZEN9_9SPHI Length = 395 Score = 66.9 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 18/57 (31%), Positives = 28/57 (49%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS E A+ YSLIG+C++ V +WL I++I + + L Sbjct: 329 LGRKNYLFAGSQDAAERTALFYSLIGSCKMAGVNPLEWLTDVIKNINNQPIQKLHLL 385 >UniRef50_C8QDU0 Transposase IS66 n=9 Tax=Enterobacteriaceae RepID=C8QDU0_9ENTR Length = 531 Score = 65.4 bits (158), Expect = 6e-10, Method: Composition-based stats. Identities = 16/57 (28%), Positives = 29/57 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + AGS + GE AA + L+ T ++N +E WL ++ + WS + + +L Sbjct: 465 MGRNNWLFAGSLAAGERAARIMGLLETAKMNGLEPHAWLSDVLKRLPSWSEDRLDEL 521 >UniRef50_Q322B9 ISSfl3 orfC n=24 Tax=Enterobacteriaceae RepID=Q322B9_SHIBS Length = 533 Score = 65.0 bits (157), Expect = 6e-10, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 27/57 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS GE AA + SL+ T + N +E WL + + +W + +L Sbjct: 467 LGRKSWLFAGSQMAGERAAQIMSLLETAKRNGLEPHAWLTDVLMRLPEWPEERLAEL 523 >UniRef50_A6WXB5 Transposase IS66 n=39 Tax=Alphaproteobacteria RepID=A6WXB5_OCHA4 Length = 545 Score = 65.0 bits (157), Expect = 7e-10, Method: Composition-based stats. 
Identities = 13/57 (22%), Positives = 28/57 (49%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + AG+ +G E A ++I T ++N + + +L ++ I D N + +L Sbjct: 473 VGRRNWLFAGADTGAETLARAMTIIETAKMNGINPQAYLADVLDRIHDHKINRLDEL 529 >UniRef50_C5SQQ1 Transposase IS66 n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SQQ1_9CAUL Length = 537 Score = 64.2 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD-WSANLVRDL 94 + + + AGS GG A++ SLI T R+N V + WL ++ I D W+ + + +L Sbjct: 468 ITRKNSLFAGSDGGGRTWAIIASLIQTARMNGVNPQAWLTQTLQRIADGWTVSRLDEL 525 >UniRef50_A9G0V4 Transposase n=6 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G0V4_SORC5 Length = 554 Score = 63.9 bits (154), Expect = 2e-09, Method: Composition-based stats. Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD-WSANLVRDL 94 L + AGS G + A+ Y++ G+CR++ V W I +Q W + + +L Sbjct: 476 LGRKNYLFAGSDKGAQRLAIGYTIFGSCRMHGVNPLAWATDVIGRLQAGWQRDRLDEL 533 >UniRef50_C7I5Z5 Transposase IS66 n=2 Tax=Thiomonas intermedia K12 RepID=C7I5Z5_THIIN Length = 513 Score = 63.1 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 15/57 (26%), Positives = 26/57 (45%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + GS G AAVL +LI + +L V+ +L + + W + + +L Sbjct: 442 LGRKNWLFVGSPQAGSRAAVLMTLIESAKLCEVDPWAYLKDVLTKLPTWPNSRLSEL 498 >UniRef50_D1UHS9 Transposase n=2 Tax=Burkholderiales RepID=D1UHS9_9BURK Length = 57 Score = 62.7 bits (151), Expect = 3e-09, Method: Composition-based stats. 
Identities = 14/37 (37%), Positives = 24/37 (64%) Query: 58 LYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +YSLIG+C+LN ++ +L + + HI D N + +L Sbjct: 1 MYSLIGSCKLNGIDPRAYLSHVLAHIADHKVNRIDEL 37 >UniRef50_D2LKV6 Transposase IS66 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LKV6_RHOVA Length = 535 Score = 62.3 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 L + A AG G EH ++ SLI T +LN V+ + WL + + W + +L Sbjct: 463 LNRKNALFAGHDRGAEHWGIVASLIETAKLNGVDPQAWLASILSRLVNGWPMRKIDEL 520 >UniRef50_Q7WTH2 Putative uncharacterized protein n=1 Tax=Escherichia coli RepID=Q7WTH2_ECOLX Length = 80 Score = 61.9 bits (149), Expect = 6e-09, Method: Composition-based stats. Identities = 27/35 (77%), Positives = 28/35 (80%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGE 53 MTSLKTSIKTITYLSD GCLEIQGASL + G Sbjct: 1 MTSLKTSIKTITYLSDTGCLEIQGASLRCTNPRGR 35 >UniRef50_A3X2G6 Putative transposase n=5 Tax=Alphaproteobacteria RepID=A3X2G6_9BRAD Length = 513 Score = 61.5 bits (148), Expect = 8e-09, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 8/75 (10%) Query: 21 SLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGI 80 +++ +I+ +T L + AGS G + A + SLI T +LN+VE +L + Sbjct: 434 TVERAIRPVT-------LGRKNHLFAGSDGGAQRWATVCSLITTAKLNDVEPFTYLKDIL 486 Query: 81 EHI-QDWSANLVRDL 94 E + + + L Sbjct: 487 ERMSAGHPMSRLDQL 501 >UniRef50_Q1D8I4 Transposase, IS66 family, truncated n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D8I4_MYXXD Length = 184 Score = 60.8 bits (146), Expect = 1e-08, Method: Composition-based stats. 
Identities = 15/57 (26%), Positives = 25/57 (43%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + G + GE+ LY+L+ TC N V E +L + Q + + +L Sbjct: 106 LSRKNFLFVGHEAAGENLEGLYALVATCAANGVNPETYLTDVLLRAQTHPNSRIGEL 162 >UniRef50_C8SL35 Putative uncharacterized protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SL35_9RHIZ Length = 227 Score = 60.8 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 1/55 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLV 91 L + A AGS G EH AV+ SLI TC+LN VE +L + I + + Sbjct: 113 LNRKNALFAGSDGGAEHWAVVASLIETCKLNGVEPLGYLADVLARIVNGHPNSKL 167 >UniRef50_Q1VRT6 Transposase n=5 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRT6_9FLAO Length = 502 Score = 58.1 bits (139), Expect = 8e-08, Method: Composition-based stats. Identities = 16/57 (28%), Positives = 28/57 (49%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS + A++YS C+ + V +WL Y +E+I + ++DL Sbjct: 435 LGRKNYLFAGSHDAAQRGAIMYSFFAICKKHEVNPYQWLKYTLENIMSINHKNIKDL 491 >UniRef50_D1N8F2 Transposase IS66 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N8F2_9BACT Length = 496 Score = 57.7 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 10/88 (11%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 + PK +I +P L + I + AGS +GG+ A+LYS +C+ Sbjct: 409 LNNPKLNIDNNPAERLNRGVAII----------RKNCLFAGSETGGQRLAILYSFAASCK 458 Query: 67 LNNVELEKWLCYGIEHIQDWSANLVRDL 94 NN+ +WL + + SAN + L Sbjct: 459 ANNICFRQWLEDVLPRLSSTSANQIESL 486 >UniRef50_B4D7C9 Transposase IS66 n=5 Tax=Chthoniobacter flavus Ellin428 RepID=B4D7C9_9BACT Length = 527 Score = 57.7 bits (138), Expect = 1e-07, Method: Composition-based stats. 
Identities = 11/57 (19%), Positives = 27/57 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + G + GE +A+LY++I +CR ++ +L + + ++D+ Sbjct: 453 IGKKNWLFFGEAAAGERSAILYTIIESCRRRGIDPFAYLRDVFTRLPSMTNWQIKDI 509 >UniRef50_P55630 Uncharacterized protein y4qI n=19 Tax=Alphaproteobacteria RepID=Y4QI_RHISN Length = 539 Score = 56.9 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD-WSANLVRDL 94 + + + AGS GG A + +L+ TC++N V+ WL + I W A+ + L Sbjct: 471 ITRKNSLFAGSEGGGRTWATVATLLQTCKMNGVDPLDWLSQTLTRIAQGWPASEIEAL 528 >UniRef50_C9CRY0 Transposase IS66 n=2 Tax=Silicibacter sp. TrichCH4B RepID=C9CRY0_9RHOB Length = 542 Score = 56.9 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 L + A AG+ G A + SL+GTC+L+ + + +L + +E I ++DL Sbjct: 472 LTRKNALFAGNDDGAVTWARMASLVGTCKLSGINPQAYLEHVLEKILNGHMQENIKDL 529 >UniRef50_B9TJY8 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TJY8_RICCO Length = 104 Score = 56.9 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 14/42 (33%), Positives = 22/42 (52%) Query: 53 EHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +Y LIGTC+L+ V + Y + HI D+ N + +L Sbjct: 43 REGRAMYDLIGTCKLDGVNPFTYFEYVLTHIADYKVNRIDEL 84 >UniRef50_D0TYT2 Integron integrase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYT2_9BACE Length = 443 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 30/58 (51%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 C+ + GS G ++A++LYS+I TC++N + K++ + + N + L Sbjct: 378 CMGRKNYLFCGSELGAKNASMLYSIIETCKMNGLRPVKYIAEILTKLTAGETNYMSLL 435 >UniRef50_C3QM77 Transposase IS66 n=1 Tax=Bacteroides sp. 
D1 RepID=C3QM77_9BACE Length = 257 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 30/58 (51%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 C+ + GS G ++A++LYS+I TC++N + K++ + + N + L Sbjct: 192 CMGRKNYLFCGSELGAKNASMLYSIIETCKMNGLRPVKYIAEILTKLTAGETNYMSLL 249 >UniRef50_UPI00003825AD COG3436: Transposase and inactivated derivatives n=1 Tax=Magnetospirillum magnetotacticum MS-1 RepID=UPI00003825AD Length = 229 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 24/56 (42%), Gaps = 1/56 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRD 93 + + AGS G E AAV Y+LI T +LN + I D A + D Sbjct: 144 VGRKNYLFAGSDLGAERAAVFYTLIETAKLNRLIPRPPARRV-TRIADHPAKRLAD 198 >UniRef50_UPI0001C38241 transposase IS66 n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38241 Length = 535 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 24/57 (42%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + AGS S G AA + SL+ T + N ++ WL + + + L Sbjct: 468 VGRKNWLFAGSQSAGVRAAAIMSLLATAKANGLDPHAWLSDVLTRLPTTKDRDIDTL 524 >UniRef50_D1PFL7 Putative cytOchrome o ubiquinol oxidase, subunit I n=1 Tax=Prevotella copri DSM 18205 RepID=D1PFL7_9BACT Length = 69 Score = 55.8 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 15/57 (26%), Positives = 27/57 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + + AA++YSL G C++ + E+WLCY ++HI + L Sbjct: 3 MGKKAYLFCRDLDACKRAAMMYSLFGACKVLDKNPERWLCYVLKHIDSMPEDKYYTL 59 >UniRef50_A6KXM9 Transposase n=23 Tax=Bacteroides RepID=A6KXM9_BACV8 Length = 521 Score = 55.8 bits (133), Expect = 4e-07, Method: Composition-based stats. 
Identities = 18/52 (34%), Positives = 27/52 (51%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 L + + GS G E A+LY++ TCR+N V L ++L I +W N Sbjct: 451 LSRKNSLFFGSHKGAERGAILYTIALTCRMNKVNLFEYLTDIINRTAEWQPN 502 >UniRef50_Q08RD8 Transposase IS66 family n=6 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08RD8_STIAU Length = 529 Score = 55.8 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 25/57 (43%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + GS + A L++LI T RL+ ++ E +L + + W +L Sbjct: 432 VGRKAWLFVGSDDHAQSAGHLFTLIATARLHRLDPEAYLRDLLRVLAHWPRERYLEL 488 >UniRef50_A1VVP8 Transposase IS66 n=8 Tax=Proteobacteria RepID=A1VVP8_POLNA Length = 528 Score = 55.8 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 12/57 (21%), Positives = 27/57 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + AGS G+ AAV+ SL+ + +L+ + +L + + + + +L Sbjct: 463 MGRRAWLFAGSELAGQRAAVVMSLLQSAKLHGHDPWAYLKDVLTRLPGHMNSRIDEL 519 >UniRef50_Q3IV06 Transposase IS66 n=9 Tax=Bacteria RepID=Q3IV06_RHOS4 Length = 523 Score = 55.4 bits (132), Expect = 5e-07, Method: Composition-based stats. Identities = 20/48 (41%), Positives = 26/48 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 L + A AGS G + A + SLIGTCRLN V E ++ + I D Sbjct: 459 LTRKNALFAGSTDGAKTWARIASLIGTCRLNGVNPEAYIAATLRKILD 506 >UniRef50_A3X278 Putative transposase n=1 Tax=Nitrobacter sp. Nb-311A RepID=A3X278_9BRAD Length = 232 Score = 55.4 bits (132), Expect = 5e-07, Method: Composition-based stats. 
Identities = 20/59 (33%), Positives = 30/59 (50%), Gaps = 1/59 (1%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIE-HIQDWSANLVRDL 94 CL + A AG G E+ A+L S++ TC+LN+V + +E I + V DL Sbjct: 161 CLTRKNALFAGHEIGAENWALLGSIVATCKLNDVNPVAYNAETLEAIIAGHPQSKVDDL 219 >UniRef50_A3ZP54 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZP54_9PLAN Length = 561 Score = 55.4 bits (132), Expect = 5e-07, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 27/56 (48%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRD 93 + + S +GGE AAVL S++ TC+ N VE +L E + N R+ Sbjct: 476 IGRKNWLFVASRTGGERAAVLMSVVQTCKRNQVEPWAYLRDVFEQLPSLGENPTRE 531 >UniRef50_Q08VL0 Transposase and inactivated derivative n=13 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08VL0_STIAU Length = 526 Score = 55.4 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 13/59 (22%), Positives = 28/59 (47%), Gaps = 2/59 (3%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD--WSANLVRDL 94 + AGS + G+ AA+ Y+L+ +C ++ +L + + D + A + +L Sbjct: 455 IGRNNYLFAGSDAAGQRAALAYTLVLSCYRLGMDPWAYLRDVLPKLGDTRFPAARLAEL 513 >UniRef50_B9K420 Transposase n=13 Tax=Alphaproteobacteria RepID=B9K420_AGRVS Length = 553 Score = 55.4 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD-WSANLVRDL 94 + + A AG GG + A SLIGTC++N +E +LC + + A + L Sbjct: 483 MNRRNALFAGHDEGGRNWARFASLIGTCKMNGIEPYAYLCDLFTRLANGHIAKDIDAL 540 >UniRef50_Q2YKH5 Transposase IS66 family n=37 Tax=Brucella RepID=Q2YKH5_BRUA2 Length = 523 Score = 54.6 bits (130), Expect = 9e-07, Method: Composition-based stats. 
Identities = 17/46 (36%), Positives = 24/46 (52%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 L + AGS G E A+L SL+ T +LN ++ WL +E I Sbjct: 447 LGRVNSLFAGSDGGAETWAILGSLLTTAKLNGLDPYTWLNDVLERI 492 >UniRef50_C3RGW7 Transposase n=13 Tax=Bacteroides RepID=C3RGW7_9BACE Length = 523 Score = 54.6 bits (130), Expect = 9e-07, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 30/57 (52%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + G+ ++ A++ SL+ +C+ +N+ +WL I + ++AN +DL Sbjct: 451 LSRKNFLFCGNHEAAQNTAIICSLLASCKASNINPREWLTEVIALLPYYAANKEKDL 507 >UniRef50_C4ZMM2 Transposase IS66 n=23 Tax=Proteobacteria RepID=C4ZMM2_THASP Length = 531 Score = 53.8 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 15/57 (26%), Positives = 27/57 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + + + G +H V+ SLI TCRL+ ++ +L ++ + A V L Sbjct: 452 LGRKNWMFSWTELGAQHVGVVQSLIATCRLHELDPYDYLVDVLQRVDQHPAADVAQL 508 >UniRef50_A4XN94 Transposase IS66 n=25 Tax=Gammaproteobacteria RepID=A4XN94_PSEMY Length = 524 Score = 53.8 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 16/56 (28%), Positives = 27/56 (48%), Gaps = 2/56 (3%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRD 93 + + + + G +A +YSLI T + N E WL + +E + +AN V D Sbjct: 456 IGRKNWLFSDTPKGATASAQIYSLIETAKANGQEPYAWLRHILERLP--AANSVED 509 >UniRef50_A9HSI6 Probable insertion sequence transposase protein n=6 Tax=Rhodobacteraceae RepID=A9HSI6_9RHOB Length = 87 Score = 53.5 bits (127), Expect = 2e-06, Method: Composition-based stats. 
Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 L+ + A AG +G ++ AV+ SLI TC+LN +E +L + I + L Sbjct: 23 LQRKNALFAGHDAGAQNWAVIASLIETCKLNKIEPHSYLTGLLTAIVNGHKKKDIDQL 80 >UniRef50_Q327R1 Putative uncharacterized protein n=1 Tax=Shigella dysenteriae Sd197 RepID=Q327R1_SHIDS Length = 70 Score = 53.5 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 26/27 (96%), Positives = 26/27 (96%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASL 45 MTSLKTSIKTITYLSD GCLEIQGASL Sbjct: 1 MTSLKTSIKTITYLSDTGCLEIQGASL 27 >UniRef50_B8FCN1 Transposase IS66 n=18 Tax=Bacteria RepID=B8FCN1_DESAA Length = 546 Score = 53.1 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 23/47 (48%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 + + AGS G E +A+ +SLI T + N +E +L E I Sbjct: 462 VGRKNWLFAGSPRGAEASALFFSLIETAKANGLEPFAYLKVLFERIP 508 >UniRef50_C6MXH4 Transposase IS66 n=3 Tax=Legionella drancourtii LLAP12 RepID=C6MXH4_9GAMM Length = 491 Score = 52.7 bits (125), Expect = 3e-06, Method: Composition-based stats. Identities = 16/48 (33%), Positives = 24/48 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + G+ G A LYSLI TC+ + V++ W Y + HIQ Sbjct: 411 IGRKNWLFHGNDIGARAGATLYSLIETCKYHKVDVFSWFKYALTHIQQ 458 >UniRef50_UPI000197B598 hypothetical protein BACCOPRO_01649 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B598 Length = 518 Score = 52.3 bits (124), Expect = 4e-06, Method: Composition-based stats. Identities = 9/48 (18%), Positives = 21/48 (43%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 L + G+ A++Y++I +C+LN + + ++L Sbjct: 456 LGMNNYLFFGNHESARRGAIIYTIIESCKLNGINVFEYLTDVFSREPQ 503 >UniRef50_P55504 Uncharacterized protein y4jD n=13 Tax=cellular organisms RepID=Y4JD_RHISN Length = 511 Score = 52.3 bits (124), Expect = 5e-06, Method: Composition-based stats. 
Identities = 16/46 (34%), Positives = 25/46 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 L + + G+ GGE AVL SLI + +L+ ++ WL +E I Sbjct: 439 LTRKNSMFVGNVQGGETFAVLASLINSAKLSGLDPYAWLADVLERI 484 >UniRef50_B9JRL5 Transposase n=55 Tax=cellular organisms RepID=B9JRL5_AGRVS Length = 537 Score = 51.9 bits (123), Expect = 6e-06, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 L + A AG +G E+ A + SLI +C+LN V+ +L + I + + +L Sbjct: 467 LNRKNALFAGHDAGAENWATIASLIESCKLNAVDPLAYLSSTLTAIVNGHKQSKIDEL 524 >UniRef50_D1W152 IS66 family element, transposase n=3 Tax=Prevotella RepID=D1W152_9BACT Length = 552 Score = 51.9 bits (123), Expect = 6e-06, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 7/61 (11%) Query: 22 LKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIE 81 ++ +I+ IT L + G+ G E+ A+ Y+ + CR +++ KW+ + Sbjct: 480 MEQAIRPIT-------LGRKNYLFCGNNEGAENNAIFYTFVACCREADIDPYKWMKKILS 532 Query: 82 H 82 Sbjct: 533 K 533 >UniRef50_Q1NKD7 ISPsy5, transposase n=1 Tax=delta proteobacterium MLMS-1 RepID=Q1NKD7_9DELT Length = 197 Score = 51.5 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 23/47 (48%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 + + +G+ G + +A +YS+I T + N +E +L E + Sbjct: 127 VGRKNWLFSGTAQGAKASAAIYSIIETAKANGLEPYWYLRALFERLP 173 >UniRef50_B4S337 Transposase n=3 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4S337_ALTMD Length = 525 Score = 51.1 bits (121), Expect = 9e-06, Method: Composition-based stats. 
Identities = 16/57 (28%), Positives = 24/57 (42%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + S G E +L SL+ TCRL V +L ++ + A V DL Sbjct: 444 MGRKNYLFCWSELGAEQLGILQSLMVTCRLQGVNPYHYLVDVLQRVALHPARDVIDL 500 >UniRef50_A6KWF8 Transposase n=7 Tax=Bacteroides RepID=A6KWF8_BACV8 Length = 537 Score = 51.1 bits (121), Expect = 9e-06, Method: Composition-based stats. Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 7/70 (10%) Query: 16 CSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKW 75 C ++ SI+ +T L + +GS AA+ +SL+G CR N V + W Sbjct: 456 CIDNNPVERSIRPLT-------LNRKNTLFSGSHEAAHAAAIFFSLMGCCRENKVNPKLW 508 Query: 76 LCYGIEHIQD 85 + + +Q+ Sbjct: 509 MQDVLIRVQE 518 >UniRef50_D1PX91 Cytochrome o ubiquinol oxidase n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PX91_9BACT Length = 105 Score = 51.1 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 7/62 (11%) Query: 21 SLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGI 80 S + +I+ IT L + G+ G E+ A+ Y+ + CR ++ KW+ + Sbjct: 32 SQEQAIRPIT-------LGGKNYLFCGNNEGAENNAIFYTFMACCRQAGLQPSKWMREFL 84 Query: 81 EH 82 Sbjct: 85 SK 86 >UniRef50_Q1GW50 Transposase and inactivated derivative n=2 Tax=Sphingomonadaceae RepID=Q1GW50_SPHAL Length = 57 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 12/38 (31%), Positives = 20/38 (52%) Query: 57 VLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +++SL GT RLN V+ W + I D + + +L Sbjct: 1 MMFSLFGTARLNGVDPLAWFTDVLTRIADIPQSRLHEL 38 >UniRef50_Q5P882 IS66 Orf1 transposase n=5 Tax=cellular organisms RepID=Q5P882_AZOSE Length = 538 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. 
Identities = 12/57 (21%), Positives = 26/57 (45%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + + G ++ + SLI TCRL++++ +L ++ + A V L Sbjct: 459 MGRRNWLFCWTEVGAKYVGIAQSLIATCRLHDIDPYDYLVDVLQRVGQHPAADVAQL 515 >UniRef50_A9GG21 Transposase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GG21_SORC5 Length = 200 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 13/51 (25%), Positives = 24/51 (47%) Query: 44 SLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GS AA L+SL +C++++++ E +L I + W +L Sbjct: 111 LFFGSDDHASAAANLFSLAASCKVHHLDPEAYLADVIRVMPYWPRERYPEL 161 >UniRef50_C0ZIN3 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZIN3_BREBN Length = 217 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 24/48 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + A S G + +A++YSL+ T + N + ++L Y E I Sbjct: 141 IGRKNWLFANSPRGAKASAIIYSLLETAKENQLNPFQYLNYLFEQIPQ 188 >UniRef50_UPI00016C41EC ISPpu13, transposase Orf2 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C41EC Length = 323 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 15/43 (34%), Positives = 23/43 (53%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGI 80 L + GS GG AAVL+S+IGTC+ ++ +L + Sbjct: 242 LGRNNWGVFGSAGGGRTAAVLFSVIGTCKHLGLDPFAYLREAL 284 >UniRef50_Q3IV15 Putative transposase n=1 Tax=Rhodobacter sphaeroides 2.4.1 RepID=Q3IV15_RHOS4 Length = 176 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. 
Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 L + A AG G E+ A+L SL+ TC++++V ++ + I A+ + DL Sbjct: 106 LTRKNALFAGHEVGAENWAMLASLVATCKMSDVNPVSYIAETLRAILNGHPASRIEDL 163 >UniRef50_A5ZJI7 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZJI7_9BACE Length = 71 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 12/57 (21%), Positives = 24/57 (42%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + Q + +AAV+Y+ C++ + EKWL ++ I + +L Sbjct: 1 MGKQNHLSCQNDEPCHYAAVMYTFFAACKVLGINPEKWLSDVLDKISLTPKEKLSEL 57 >UniRef50_B8KMJ5 ISPsy5, transposase n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KMJ5_9GAMM Length = 176 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 24/47 (51%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 + + A + G +A +YSLI T + N++E +L + +E I Sbjct: 95 VGRKAWLFADTTRGAHASATMYSLIETAKANHLEPRSYLLHVLERIG 141 >UniRef50_C6ZBZ0 Transposase n=26 Tax=Bacteroides RepID=C6ZBZ0_9BACE Length = 585 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 14/46 (30%), Positives = 29/46 (63%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 L ++ GS + E++A ++SLI +C+LN+++ + +L + E I Sbjct: 520 LLLKNCMNIGSEAAAENSAFIFSLIESCKLNDIDPQDYLKHLFECI 565 >UniRef50_A9ML85 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9ML85_SALAR Length = 94 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. 
Identities = 17/34 (50%), Positives = 22/34 (64%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVE 71 L + GS GGE AA++YSL+G C+LN VE Sbjct: 17 LGRRNYLFFGSDRGGEAAAIIYSLLGMCKLNGVE 50 >UniRef50_D2QZY9 Transposase IS66 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QZY9_9PLAN Length = 536 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 12/55 (21%), Positives = 26/55 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVR 92 + + AG+ G AA+LYSLI + + ++ +++L + + + V Sbjct: 465 IGRKNWLFAGNDRAGGTAALLYSLIASAERHQLDPQRYLTSVLARLPALPPSDVN 519 >UniRef50_Q13ZD6 Transposase ISPpu14 orf3 like, IS66 family n=51 Tax=Proteobacteria RepID=Q13ZD6_BURXL Length = 540 Score = 50.0 bits (118), Expect = 3e-05, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 25/57 (43%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + AGS G+ AA + SLI + +LN + +L + + A+ + L Sbjct: 472 IGRANWLFAGSLRAGQRAAAIMSLIRSAQLNGHDPHAYLKDILTRLPIHKASDISAL 528 >UniRef50_C8WYG4 Transposase IS66 n=5 Tax=Bacteria RepID=C8WYG4_ALIAD Length = 529 Score = 49.6 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 22/48 (45%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + A + G +AV YS++ T + N + +L Y E + + Sbjct: 455 IGRKNWLFANTPRGARASAVTYSIVETAKENGLNPTAYLTYLFERMPN 502 >UniRef50_A8YU80 Transposase ORF_C n=22 Tax=Lactobacillales RepID=A8YU80_LACH4 Length = 445 Score = 49.6 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 28/48 (58%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + + S +G + +A++ SLI T + N ++ EK+L Y + ++ + Sbjct: 373 IGRKNWLFSQSFNGAQSSAIILSLIETAKRNGLDPEKYLVYLLSNLPN 420 >UniRef50_C6N6I2 Truncated transposase IS66 n=2 Tax=Legionella RepID=C6N6I2_9GAMM Length = 213 Score = 49.6 bits (117), Expect = 3e-05, Method: Composition-based stats. 
Identities = 15/48 (31%), Positives = 22/48 (45%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 L + +GS G +A+ YSLI T N +L Y E+I+ Sbjct: 146 LGRKNWLFSGSPRGAHASALFYSLIATAIANGWNPFNYLRYLFENIRT 193 >UniRef50_C0QE05 Transposase n=3 Tax=Deltaproteobacteria RepID=C0QE05_DESAH Length = 531 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 24/48 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + +G+ G +A +YSLI T + N +E +L + E++ Sbjct: 457 IGRKNWLFSGAPEGATASAGIYSLIETAKANGLEPYWYLRFLFENLPQ 504 >UniRef50_B6EI61 Transposase n=28 Tax=Gammaproteobacteria RepID=B6EI61_ALISL Length = 495 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 10/46 (21%), Positives = 26/46 (56%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 + + +GS +G + +A+LYS++ T + N + ++ Y ++ + Sbjct: 431 IGRKNWLFSGSTAGADSSAMLYSIVETAKANGLIPYDYIRYCLDRL 476 >UniRef50_A3DCS3 Transposase IS66 n=7 Tax=Clostridiales RepID=A3DCS3_CLOTH Length = 511 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 17/70 (24%), Positives = 33/70 (47%), Gaps = 8/70 (11%) Query: 16 CSPMTSL-KTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 CS +L + SI+ T L + +GS G + +A +YS++ + + N++ K Sbjct: 423 CSISNNLSENSIRPFT-------LGRKNWLFSGSPRGADASAAVYSIVESAKANDINPYK 475 Query: 75 WLCYGIEHIQ 84 +L Y + Sbjct: 476 YLYYIFSELP 485 >UniRef50_Q2K2K4 Putative insertion sequence transposase protein n=1 Tax=Rhizobium etli CFN 42 RepID=Q2K2K4_RHIEC Length = 113 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. 
Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVEL-EKWLCYGIEHIQDWSANLVRDL 94 G GGE A + ++I T +L+ E +L + IQD + +++L Sbjct: 45 GPDKGGERIANILTIIETAKLHGHNPPEIYLTDVLTRIQDHPKDHLQEL 93 >UniRef50_A6L6H8 Transposase n=8 Tax=Bacteroidales RepID=A6L6H8_BACV8 Length = 537 Score = 48.8 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 11/49 (22%), Positives = 22/49 (44%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDW 86 L + G+ E+ A++ SL+ TC+ + +WL I + + Sbjct: 460 LSRKNFLFCGNHEAAENTAIICSLLATCKAQEINPREWLNDVIAKLPYY 508 >UniRef50_UPI0001BC30F4 transposase n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC30F4 Length = 542 Score = 48.4 bits (114), Expect = 6e-05, Method: Composition-based stats. Identities = 9/58 (15%), Positives = 23/58 (39%) Query: 35 IGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVR 92 C+ + + G +A++YS+ T +LNN+ + + + + + Sbjct: 459 TFCIGKKNWMFHNTAKGAGASALVYSISETAKLNNLRPYYYFRHILTELPKYCDEKGN 516 >UniRef50_A5WEB8 Transposase IS66 n=20 Tax=Proteobacteria RepID=A5WEB8_PSYWF Length = 579 Score = 48.4 bits (114), Expect = 7e-05, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 27/57 (47%), Gaps = 2/57 (3%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + AGS G+ AA + S+I + RLN +++ +L + + + +L Sbjct: 516 LGRKNWLFAGSLRSGQRAANIMSIIQSARLNGLDVSAYLTDVLRRLPTQED--LDEL 570 >UniRef50_C4ZDI2 Transposase n=7 Tax=Clostridiales RepID=C4ZDI2_EUBR3 Length = 90 Score = 48.1 bits (113), Expect = 8e-05, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 27/58 (46%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 C+ + + + +G E +A++YS+ T + NN++ + Y +E I D Sbjct: 14 CVGKKNWVMIDTVAGAEASAMIYSIAETAKANNLKPYNYFKYLLEEIPRHMDEHGVDF 71 >UniRef50_C7XG25 Transposase n=7 Tax=Bacteroidales RepID=C7XG25_9PORP Length = 561 Score = 48.1 bits (113), Expect = 9e-05, Method: Composition-based stats. 
Identities = 13/49 (26%), Positives = 23/49 (46%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDW 86 L + G+ E+ AV+ SL+G+C+ V +WL I + + Sbjct: 464 LTRKNMLFCGNQQAAENTAVICSLLGSCKECGVNPREWLNDVISKLPYY 512 >UniRef50_A9HWT3 Probable insertion sequence transposase protein n=2 Tax=Alphaproteobacteria RepID=A9HWT3_9RHOB Length = 62 Score = 48.1 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 15/46 (32%), Positives = 26/46 (56%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 L+ + A AG + ++ A+L SLI TC+ N++E +L + I Sbjct: 3 LQRKNALFAGHDARAQNWAMLASLIETCKFNSIEPHGYLLGVLITI 48 >UniRef50_B8I6W2 Transposase IS66 n=3 Tax=Clostridiales RepID=B8I6W2_CLOCE Length = 529 Score = 47.7 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 23/47 (48%) Query: 39 EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + A + G + +A++YS+I + + N + +L Y + + D Sbjct: 450 GRKNWLFADTTRGAKASAIVYSMIESAKANQLNPYMYLVYLLSKLPD 496 >UniRef50_A6TJJ0 Transposase IS66 n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJJ0_ALKMQ Length = 533 Score = 47.7 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 13/51 (25%), Positives = 23/51 (45%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSA 88 L + A S G +A+ YS+I T + N + ++L Y E + + Sbjct: 455 LGRKNYLFAKSPKGATASALCYSIIETAKANKLIPFQYLTYLFEQLPNLDI 505 >UniRef50_A8S0A7 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S0A7_9CLOT Length = 389 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. 
Identities = 17/71 (23%), Positives = 33/71 (46%), Gaps = 9/71 (12%) Query: 18 PMT--SLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKW 75 PMT + + SI+ T L + L + G + +A+ YS+ T + NN++ ++ Sbjct: 205 PMTNNAAEQSIRPFT-------LGRKNWYLIDTSGGAKSSAIAYSIAETAKANNLKPYEY 257 Query: 76 LCYGIEHIQDW 86 Y +E + Sbjct: 258 FKYLLEELPKH 268 >UniRef50_B9YEG7 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YEG7_9FIRM Length = 535 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 11/52 (21%), Positives = 29/52 (55%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + + + + SG +++ YSL+ + +LN++++ +L Y + IQ+ + Sbjct: 464 MGRKAWLFSKTKSGARMSSIYYSLVESAKLNHLDIHLYLEYVLTQIQEHPDS 515 >UniRef50_B5JDW6 Transposase IS66 family n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JDW6_9BACT Length = 496 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 27/57 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L ++ GS G +A+L++L+ + + + ++ ++ + + + + + L Sbjct: 426 LGLKNWMFIGSEGSGRTSAILFTLVESAKRHGLDPYGYIKELLRRLPESTNWQIPQL 482 >UniRef50_Q0ABQ0 Integron integrase n=29 Tax=Proteobacteria RepID=Q0ABQ0_ALHEH Length = 694 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 9/53 (16%), Positives = 24/53 (45%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANL 90 + + + + G +A++YS+I T + N +E ++L + + + Sbjct: 626 VGRKNWLFSHTTQGAAASAMIYSVIETAKANGLEPYEYLEDVLTRLPAADTDQ 678 >UniRef50_UPI0001C376A4 transposase IS66 n=2 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C376A4 Length = 532 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. 
Identities = 10/56 (17%), Positives = 28/56 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRD 93 + + + + G + +A + S+I T + NN+++ +L + + + +W N + Sbjct: 452 INRKNFLFSDTEKGADASAAVMSIIETAKRNNLDVYGYLTHLLTVLPEWGKNPTDE 507 >UniRef50_UPI00016C567B ISPsy5, transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C567B Length = 220 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 29/65 (44%), Gaps = 12/65 (18%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGI------------EHIQD 85 + + GS GG AA LYS++GTC+ +++ +L + E ++D Sbjct: 136 VGRNNWGVVGSEVGGRTAATLYSVVGTCKHLSIDPWTYLRDTLPGIFALGDEPTAEQLRD 195 Query: 86 WSANL 90 W + Sbjct: 196 WLPDR 200 >UniRef50_A8VYR6 Transposase and inactivated derivatives-like protein n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VYR6_9BACI Length = 495 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 9/47 (19%), Positives = 26/47 (55%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 + + + + G + ++++YS+I T + N ++ + +L Y E++ Sbjct: 411 IGRKNWIFSNTPRGAKSSSIIYSMIETAKENQLKPQAYLNYLFENLP 457 >UniRef50_B7CDZ5 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CDZ5_9FIRM Length = 523 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 12/49 (24%), Positives = 22/49 (44%) Query: 39 EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWS 87 + + S SG E +A YS+I T + N ++ K+L ++ Sbjct: 452 GRKNWLFSASVSGAESSANAYSIIETAKANGLDPYKYLTTIFTYLPSQD 500 >UniRef50_Q24RY6 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense Y51 RepID=Q24RY6_DESHY Length = 533 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. 
Identities = 10/48 (20%), Positives = 23/48 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + + + +G +A+ YSLI T R N + ++L + + + Sbjct: 444 IGRKNWLFSNTPNGARASAIYYSLIVTARENGLNPFEYLAWIFANSPN 491 >UniRef50_D1JMC2 Transposase n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JMC2_9BACE Length = 528 Score = 46.1 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 12/45 (26%), Positives = 22/45 (48%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEH 82 L + + GS +G E + YSL +CRL+ + ++L + Sbjct: 456 LSRKNSLFFGSHAGAERGCIFYSLACSCRLHKINFFEYLTDILNR 500 >UniRef50_A6NXJ3 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NXJ3_9BACE Length = 200 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 10/43 (23%), Positives = 25/43 (58%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGI 80 + + + + +G + +AV+YSLI T + N++ ++L + + Sbjct: 127 MGRKNWLFSNTPAGAQSSAVVYSLIETAKENDLAPYRYLVWLL 169 >UniRef50_UPI0001C34DE2 transposase IS66 n=4 Tax=Clostridium sp. M62/1 RepID=UPI0001C34DE2 Length = 394 Score = 45.8 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 9/49 (18%), Positives = 26/49 (53%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDW 86 + + + G ++++YSL+ T +LN++ + +L + ++ D+ Sbjct: 344 VGRKNFLFHDTVKGARASSIIYSLVETAKLNDLNIYAYLETVLLYMPDY 392 >UniRef50_C6LKU9 ISPsy5, transposase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LKU9_9FIRM Length = 549 Score = 45.4 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 25/48 (52%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + + S G + +A +YS+ T LN ++ +L Y +E ++D Sbjct: 472 IGRRNWLFSKSIRGAQTSATVYSITETALLNGLKPYNYLTYVLEKMKD 519 >UniRef50_Q07GD7 Putative uncharacterized protein n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q07GD7_ROSDO Length = 111 Score = 45.0 bits (105), Expect = 7e-04, Method: Composition-based stats. 
Identities = 14/46 (30%), Positives = 25/46 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 L+ + A + +G ++ A+L SLI +LN+VE +L + I Sbjct: 22 LQRKNALFSDHDAGAQNWAMLASLIEIGKLNDVEPHSYLTSVLSAI 67 >UniRef50_Q1A683 Transposase (Fragment) n=4 Tax=Clostridiales RepID=Q1A683_9FIRM Length = 244 Score = 45.0 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 11/49 (22%), Positives = 27/49 (55%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDW 86 + + S +G +AV+YS++ T + NN+ + ++L + ++ D+ Sbjct: 155 VGRKAFLFHTSEAGAGASAVMYSIVETAKANNLNIFQYLYMVLLYMPDY 203 >UniRef50_A0LAY3 Transposase IS66 n=7 Tax=Magnetococcus sp. MC-1 RepID=A0LAY3_MAGSM Length = 526 Score = 44.6 bits (104), Expect = 9e-04, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 20/48 (41%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + + S G +A LYSLI T + N E ++ E + Sbjct: 456 IGRKNWLFSNSVRGARASANLYSLIETAKANGWEPFEYFTKVFEGLAT 503 >UniRef50_A6WIY2 Transposase IS66 n=9 Tax=Gammaproteobacteria RepID=A6WIY2_SHEB8 Length = 514 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 25/57 (43%), Gaps = 1/57 (1%) Query: 39 EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDW-SANLVRDL 94 + + S +G +A LYSL+ TCR N++ + + + + + DL Sbjct: 447 GRKNWMFSTSVNGAHASANLYSLVMTCRANDISPYYYFRHLFTELPKRLPTDDLTDL 503 >UniRef50_A3JL62 ISPpu15, transposase n=2 Tax=Marinobacter sp. ELB17 RepID=A3JL62_9ALTE Length = 77 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 10/45 (22%), Positives = 21/45 (46%) Query: 40 IQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 + A + G +A YSL+ T + N +E ++ + +E + Sbjct: 4 RKAWLFADTSQGARASATCYSLVETAKANKLEPSVYIQHVLERVA 48 >UniRef50_A9AMV4 ISBmu30 transposase n=26 Tax=Proteobacteria RepID=A9AMV4_BURM1 Length = 546 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. 
Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 1/56 (1%) Query: 40 IQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDW-SANLVRDL 94 + + + G + +A +YSL+ TCR VE +L + + + V DL Sbjct: 475 RKSWLFSDTVDGAKASATVYSLVLTCRACGVEPYDYLLHVLTELPQRAPDADVTDL 530 >UniRef50_B2AIZ9 Fused transposase IS66/IS21 n=38 Tax=Proteobacteria RepID=B2AIZ9_CUPTR Length = 936 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 24/47 (51%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 + + + +G +A LYSL+ TC+ N+V+ ++L + + Sbjct: 461 IGRRNFLFCDTVAGANASASLYSLVETCKANDVDSYQYLVALFKALP 507 >UniRef50_D0DW05 Transposase IS66 n=1 Tax=Lactobacillus fermentum 28-3-CHN RepID=D0DW05_LACFE Length = 507 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 30/57 (52%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + + A S +G + A++Y++I T +LNN+ + +L Y + A V DL Sbjct: 438 LVRKNSLFATSKNGAKTNAMIYTIIQTAKLNNLRIFDYLKYVFDQYTKRVAVKVEDL 494 >UniRef50_A3JTL7 Probable insertion sequence transposase protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JTL7_9RHOB Length = 74 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 28/65 (43%), Gaps = 2/65 (3%) Query: 31 YLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSAN 89 YL L + AS AG + E+ +L S+I C LN +E + +L I Sbjct: 4 YLPST-TLNRKNASFAGHDANAENWVILVSIIEICILNKIEPQVYLTGVFTAIAHGHRQK 62 Query: 90 LVRDL 94 + DL Sbjct: 63 DIEDL 67 >UniRef50_C4I9X2 Fused transposase IS66/IS21 n=8 Tax=Burkholderia RepID=C4I9X2_BURPS Length = 172 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 11/48 (22%), Positives = 24/48 (50%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 C+ +G + + G A LY+L+ T + N ++L ++L + + Sbjct: 79 CVGRRGWLFSDTVDGANACANLYTLVETSKTNGIDLYRYLAWLFRRLP 126 >UniRef50_Q0AUV2 Transposase and inactivated derivatives-like protein n=15 Tax=Firmicutes RepID=Q0AUV2_SYNWW Length = 530 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 10/48 (20%), Positives = 24/48 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + + G +A++YS+I T + NN++ ++ Y E + + Sbjct: 448 IGRKNWLFTNTPRGARGSAIIYSVIETAKENNLKPYNYMFYLFEQLPN 495 >UniRef50_A6LHQ0 Transposase n=2 Tax=Bacteroidales RepID=A6LHQ0_PARD8 Length = 528 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 31/61 (50%), Gaps = 4/61 (6%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI----QDWSANLVRD 93 L + + GS G + AA++YSL +CR+NN+ ++ + + N++R+ Sbjct: 460 LSRRNSLFCGSHQGVKRAALIYSLACSCRMNNINTFEYFKELLNKAVSLNPNTDKNVLRE 519 Query: 94 L 94 L Sbjct: 520 L 520 >UniRef50_A0LPQ3 ISPpu15, transposase Orf2 n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LPQ3_SYNFM Length = 177 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 23/47 (48%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 + + +G + AA LYSLI T ++ +E ++L Y E + Sbjct: 103 IGRKNWLFSGHPNSANAAATLYSLIETAKVCRLEAYQYLRYLFERLP 149 >UniRef50_UPI0001744C57 ISPpu14, transposase Orf3 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744C57 Length = 73 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. 
Identities = 13/48 (27%), Positives = 19/48 (39%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 G G AA Y+LIG C N ++ +L + + V L Sbjct: 2 GDAQSGARAATFYTLIGNCHRNGIDAFAYLSDVFTRLPRETNRTVHRL 49 >UniRef50_A9EDB3 Putative transposase n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EDB3_9RHOB Length = 184 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 22/47 (46%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 L + A G G E A+L SLI C++ +V+ +L + + Sbjct: 122 LIRKNALFVGHEGGAESWALLASLIANCKMCDVDPVSYLSDTLRVLP 168 >UniRef50_B3PIF8 IS66 family element, transposase n=8 Tax=Bacteria RepID=B3PIF8_CELJU Length = 527 Score = 43.4 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 21/48 (43%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + + S G +A LYS+I T + N +E +L + + Sbjct: 460 IGRKNWLFSTSPKGATASANLYSVIETAKANGLEPYGYLKTIFTELPN 507 >UniRef50_B9KMR1 Transposase IS66 n=14 Tax=Alphaproteobacteria RepID=B9KMR1_RHOSK Length = 53 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 7/35 (20%), Positives = 18/35 (51%) Query: 60 SLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +LI + +L+ ++ + +L + I D + +L Sbjct: 2 TLIESAKLSGLDPQAYLADVLARINDHINPRLHEL 36 >UniRef50_A9DHW8 Putative uncharacterized protein n=10 Tax=Shewanella benthica KT99 RepID=A9DHW8_9GAMM Length = 465 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 10/44 (22%), Positives = 21/44 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIE 81 + + + G E +A+LYS+I T + N + ++ +E Sbjct: 422 IGRKNWLFNHNHRGAEASAILYSIIETAKANGLTPFDYIERCLE 465 >UniRef50_B3WDV3 Putative uncharacterized protein n=1 Tax=Lactobacillus casei BL23 RepID=B3WDV3_LACCB Length = 63 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. 
Identities = 7/50 (14%), Positives = 23/50 (46%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWS 87 + + +G G + A+L ++I T + N ++ ++ ++ + + Sbjct: 1 MGRKNFLFSGVPEGAKINAILMTMIETAKANALDPMTYIGSLLDELAQFP 50 >UniRef50_Q6LRS5 Hypothetical transposase n=10 Tax=Gammaproteobacteria RepID=Q6LRS5_PHOPR Length = 514 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 11/52 (21%), Positives = 27/52 (51%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + + + + +G +A+LYSL+ T + NN+ + ++ ++ I + N Sbjct: 451 IGRKAWLFSYTNTGANASAILYSLVETAKANNLLVHDYIATCLQQIAEKPNN 502 >UniRef50_D1PHR2 Putative transposase IS66 n=1 Tax=Prevotella copri DSM 18205 RepID=D1PHR2_9BACT Length = 641 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 27/53 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANL 90 + + AGS E+ A +YSL +C++NN+ +++ + ++D + Sbjct: 570 MGRRNLGKAGSHEAAENLAFMYSLYESCKMNNLNFGRYIEDILTRMKDGDKDY 622 >UniRef50_C0ETK9 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0ETK9_9FIRM Length = 129 Score = 42.7 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 11/46 (23%), Positives = 25/46 (54%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 + + A S G + +A++YS++ T L+ ++ +L Y +E + Sbjct: 50 VGRRNWLFAKSIRGADASAIVYSIVETALLSGLKPYLYLTYVLEKL 95 >UniRef50_C9K6B9 IS66 family transposase n=1 Tax=Sphingomonas sp. NP5 RepID=C9K6B9_9SPHN Length = 536 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 24/47 (51%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ 84 L + A + G H AVL SL+ T +NNV+ WL + ++ + Sbjct: 468 LTRRNALFCATHEGAAHWAVLASLLHTAHINNVDPLAWLTHALDTLA 514 >UniRef50_C9LAQ6 ISPpu13, transposase Orf2 n=15 Tax=Clostridiales RepID=C9LAQ6_RUMHA Length = 532 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. 
Identities = 14/72 (19%), Positives = 29/72 (40%), Gaps = 15/72 (20%) Query: 29 ITYLSDIGC---------------LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELE 73 +TYL D C + + + S G +A++Y+++ + N++ Sbjct: 437 MTYLEDGHCSLSNNLSENAIRPFTVGRKNWLFSASPKGAASSAIVYTMVEMAKANDLNTY 496 Query: 74 KWLCYGIEHIQD 85 K+L Y + D Sbjct: 497 KYLTYLLSQRPD 508 >UniRef50_A5FT16 Putative uncharacterized protein n=2 Tax=Acidiphilium cryptum JF-5 RepID=A5FT16_ACICJ Length = 81 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 17/49 (34%), Positives = 26/49 (53%) Query: 35 IGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 +G L G S +G EH A++ +LIG+ RL+ VE L ++ I Sbjct: 5 VGVLRRTGHHEVASDTGEEHCALMATLIGSARLSGVEPLARLTDVLQRI 53 >UniRef50_C6LKX4 Transposase IS66 n=13 Tax=Clostridiales RepID=C6LKX4_9FIRM Length = 542 Score = 41.9 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 9/52 (17%), Positives = 25/52 (48%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + + + + +G +AV+YS+ T + NN++ ++ + + I + Sbjct: 467 VGRKNWQMIDTINGANASAVIYSIAETAKANNLKPYEYFEHLLSEIPKHMDD 518 >UniRef50_UPI0001C34DDA transposase n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34DDA Length = 69 Score = 41.9 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 20/37 (54%) Query: 53 EHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + ++YS+ T + NN+ ++L Y + ++D + Sbjct: 4 KRLPIIYSITETAKANNLNPFRYLDYVLTVVKDHQDD 40 >UniRef50_C2EYW0 Putative uncharacterized protein n=2 Tax=Lactobacillus reuteri RepID=C2EYW0_LACRE Length = 88 Score = 41.9 bits (97), Expect = 0.007, Method: Composition-based stats. 
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 5/56 (8%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-----QDWSA 88 + + S G + A+ YS+I T +LN + + ++L Y +E DW A Sbjct: 17 IGRKNYLFTKSEVGAKANAMWYSIIQTAKLNKLRVREYLEYLLEAFAWTDQPDWKA 72 >UniRef50_UPI0001C34846 hypothetical protein PretD1_08486 n=2 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34846 Length = 464 Score = 41.5 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 13/35 (37%), Positives = 18/35 (51%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVEL 72 + AGS + GE AA + L+ T RLN +E Sbjct: 428 MGRSNWLFAGSQAAGERAAKIMGLLETARLNGLEP 462 >UniRef50_A3JZ99 Putative transposase n=1 Tax=Sagittula stellata E-37 RepID=A3JZ99_9RHOB Length = 77 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 14/46 (30%), Positives = 24/46 (52%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 L + A GS G +L S + TC+L+N+ +E +L ++ I Sbjct: 7 LLRKNALFIGSDEGAHAWGILSSNVETCKLDNINVESYLTRILDQI 52 >UniRef50_A1B0T3 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B0T3_PARDP Length = 124 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats. Identities = 15/36 (41%), Positives = 22/36 (61%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVEL 72 C +G +L GG+ AA +YSLI T R+N+V+ Sbjct: 28 CPREKGLALRRFARGGDRAAFIYSLIVTARMNDVDP 63 >UniRef50_Q2K2A5 Hypothetical conserved protein n=1 Tax=Rhizobium etli CFN 42 RepID=Q2K2A5_RHIEC Length = 104 Score = 40.7 bits (94), Expect = 0.014, Method: Composition-based stats. Identities = 13/32 (40%), Positives = 20/32 (62%) Query: 50 SGGEHAAVLYSLIGTCRLNNVELEKWLCYGIE 81 SG + AA + +LI T +LNN+E + WL + Sbjct: 22 SGADRAACMATLIMTAKLNNIEPQAWLADVLA 53 >UniRef50_A6LGZ8 Putative uncharacterized protein n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LGZ8_PARD8 Length = 104 Score = 40.7 bits (94), Expect = 0.014, Method: Composition-based stats. 
Identities = 15/87 (17%), Positives = 31/87 (35%), Gaps = 10/87 (11%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60 M+ R+ FS + ++ + M + I L + GS +G A + YS Sbjct: 13 MIGRMAFSMALSKLLDNAMERINRCIS----------LMRHNSLFFGSHAGASRAVIYYS 62 Query: 61 LIGTCRLNNVELEKWLCYGIEHIQDWS 87 L +C + +++ + Sbjct: 63 LACSCSQRGINFFEYISDIMNRAAILP 89 >UniRef50_UPI00017448EA transposase IS66 n=4 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448EA Length = 503 Score = 40.7 bits (94), Expect = 0.015, Method: Composition-based stats. Identities = 7/57 (12%), Positives = 21/57 (36%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + G G AA Y+L+ + + ++L + + ++++ Sbjct: 435 IGKKNWLFVGDAQAGVRAATFYTLLDNAKRAGADAYEYLKDLFTKLPAMTNQQMKEI 491 >UniRef50_C8CGL3 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=C8CGL3_ECOLX Length = 162 Score = 40.4 bits (93), Expect = 0.017, Method: Composition-based stats. Identities = 23/44 (52%), Positives = 29/44 (65%), Gaps = 4/44 (9%) Query: 2 VRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 ++RL FSGP+TSII L S+KT+ LSD C +I GASL Sbjct: 18 LQRLHFSGPETSII----RILTISLKTVGKLSDASCQDIHGASL 57 >UniRef50_A2UYM3 Conserved hypothetical ISPpu15, transposase n=1 Tax=Shewanella putrefaciens 200 RepID=A2UYM3_SHEPU Length = 211 Score = 40.4 bits (93), Expect = 0.018, Method: Composition-based stats. Identities = 8/52 (15%), Positives = 25/52 (48%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + + + G E +A+ YS+I + N + ++ + +E + + +++ Sbjct: 146 IGHKNWLFNHNHRGAETSAIFYSIIKMAKANELTPFDYIEHCLEQLSNSNSD 197 >UniRef50_A8S332 Putative uncharacterized protein (Fragment) n=3 Tax=Clostridiales RepID=A8S332_9CLOT Length = 408 Score = 40.4 bits (93), Expect = 0.020, Method: Composition-based stats. 
Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 9/71 (12%) Query: 18 PMT--SLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKW 75 PMT + + SI+ T + + SG + +A+ YS+ T + N ++ ++ Sbjct: 314 PMTNNAAERSIRPFT-------VGRNNWFQIDTVSGAKASAIAYSIAETAKANQLKPYEY 366 Query: 76 LCYGIEHIQDW 86 Y +E + Sbjct: 367 FRYLLEELPKH 377 >UniRef50_B0UII5 Putative uncharacterized protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UII5_METS4 Length = 86 Score = 40.0 bits (92), Expect = 0.022, Method: Composition-based stats. Identities = 8/23 (34%), Positives = 12/23 (52%) Query: 66 RLNNVELEKWLCYGIEHIQDWSA 88 RLN+V+ WL + I D + Sbjct: 39 RLNDVDPRAWLADVLARINDHPS 61 >UniRef50_B3DSN6 Transposase n=4 Tax=Bifidobacterium longum RepID=B3DSN6_BIFLD Length = 133 Score = 39.6 bits (91), Expect = 0.029, Method: Composition-based stats. Identities = 6/48 (12%), Positives = 21/48 (43%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + + + + G +A +YS+ T + N + ++ + + + + Sbjct: 25 VGRKNWLFSDAPRGARASAAIYSVTTTAKANGLNPRLYVEWLLTEMPN 72 >UniRef50_Q8VSS7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=Q8VSS7_BACFR Length = 193 Score = 39.6 bits (91), Expect = 0.030, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 27/73 (36%), Gaps = 15/73 (20%) Query: 26 IKTITYLSDIGCL---------------EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNV 70 I+ YL D C E + + GS +A Y++I TC++ V Sbjct: 99 IQLFAYLKDGSCTIDNSIAERFICPLSGERKNSLFFGSDKMARVSAAYYTIISTCKMQGV 158 Query: 71 ELEKWLCYGIEHI 83 ++ ++ I Sbjct: 159 PALRYFKMFLQAI 171 >UniRef50_B0NH70 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0NH70_EUBSP Length = 229 Score = 38.8 bits (89), Expect = 0.058, Method: Composition-based stats. 
Identities = 8/49 (16%), Positives = 23/49 (46%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDW 86 + + + G + +A++YS+ T R NN+ + ++ + + + Sbjct: 149 IGRKNWVTINTVRGAQASAIIYSITETARANNLNVYYYIKHLLTQLPQH 197

Searching..................................................done

Results from round 3

                                                                 Score     E
Sequences producing significant alignments:                      (bits)  Value
Sequences used in model and found again:

UniRef50_Q9JMS9 Uncharacterized protein yuaK n=3 Tax=Escherichia...  130  1e-29
UniRef50_B5EKE2 Transposase IS66 n=6 Tax=Acidithiobacillus RepID...  101  8e-21
UniRef50_Q2W201 Transposase and inactivated derivative n=22 Tax=...  101  1e-20
UniRef50_Q9S116 Orf51 protein n=166 Tax=root RepID=Q9S116_ECOLX  100  2e-20
UniRef50_A6FSH9 Transposase and inactivated derivative n=7 Tax=B...  98  7e-20
UniRef50_Q3ZU23 OrfD, ISEc8 n=37 Tax=Proteobacteria RepID=Q3ZU23...  98  8e-20
UniRef50_A9EDY1 Transposase n=3 Tax=Kordia algicida OT-1 RepID=A...  98  9e-20
UniRef50_Q1RPJ6 ECs1339 protein n=175 Tax=Bacteria RepID=Q1RPJ6_...  98  1e-19
UniRef50_A4JH71 Transposase IS66 n=4 Tax=Proteobacteria RepID=A4...  98  1e-19
UniRef50_Q11ZA7 Transposase IS66 n=6 Tax=Burkholderiales RepID=Q...  97  2e-19
UniRef50_B2AJ19 Transposase, IS66 familly n=24 Tax=cellular orga...  95  6e-19
UniRef50_C7I5Z5 Transposase IS66 n=2 Tax=Thiomonas intermedia K1...  95  8e-19
UniRef50_B5K4C7 Transposase IS66 n=27 Tax=Rhodobacterales RepID=...  95  9e-19
UniRef50_Q5P8K4 Transposase n=7 Tax=Proteobacteria RepID=Q5P8K4_...  93  2e-18
UniRef50_Q8GAR2 Orf51 (Fragment) n=2 Tax=Proteobacteria RepID=Q8...  93  3e-18
UniRef50_C6MJH1 Transposase IS66 n=1 Tax=Nitrosomonas sp. AL212 ...  92  6e-18
UniRef50_P50360 Uncharacterized protein y4hP n=117 Tax=cellular ...  92  6e-18
UniRef50_D1N8F2 Transposase IS66 n=1 Tax=Victivallis vadensis AT...  92  8e-18
UniRef50_D2WFI4 IS66 transposase n=1 Tax=Escherichia coli O26:H-...  91  1e-17
UniRef50_Q07SJ6 Putative uncharacterized protein n=4 Tax=Rhizobi...  91  1e-17
UniRef50_A2V4E0 Transposase IS66 n=1 Tax=Shewanella putrefaciens...  91  1e-17
UniRef50_A7IP73 Transposase IS66 n=23 Tax=Alphaproteobacteria Re...  91  1e-17
UniRef50_A6BYX4 TnpC protein n=2 Tax=Planctomyces maris DSM 8797...  91  1e-17
UniRef50_Q1NGZ9 TnpC protein n=5 Tax=Sphingomonas RepID=Q1NGZ9_9...  90  2e-17
UniRef50_UPI0001C38241 transposase IS66 n=1 Tax=Arthrospira plat...  89  3e-17
UniRef50_A6WXB5 Transposase IS66 n=39 Tax=Alphaproteobacteria Re...  89  4e-17
UniRef50_A3X2G6 Putative transposase n=5 Tax=Alphaproteobacteria...  89  4e-17
UniRef50_A7HGL2 Transposase IS66 n=7 Tax=Cystobacterineae RepID=...  89  4e-17
UniRef50_C8QDU0 Transposase IS66 n=9 Tax=Enterobacteriaceae RepI...  89  5e-17
UniRef50_Q320F8 ISSfl4 ORF3 n=21 Tax=Enterobacteriaceae RepID=Q3...  88  7e-17
UniRef50_Q6EZC1 L0015-like protein n=30 Tax=Enterobacteriaceae R...  88  1e-16
UniRef50_Q322B9 ISSfl3 orfC n=24 Tax=Enterobacteriaceae RepID=Q3...  87  1e-16
UniRef50_B9TBF4 Putative uncharacterized protein n=1 Tax=Ricinus...  87  2e-16
UniRef50_A1VVP8 Transposase IS66 n=8 Tax=Proteobacteria RepID=A1...  87  2e-16
UniRef50_D2LKV6 Transposase IS66 n=1 Tax=Rhodomicrobium vannieli...  85  6e-16
UniRef50_C1HRA2 Putative uncharacterized protein n=1 Tax=Escheri...  85  7e-16
UniRef50_B9JRL5 Transposase n=55 Tax=cellular organisms RepID=B9...  85  7e-16
UniRef50_B1EN06 IS element transposase n=1 Tax=Escherichia alber...  84  2e-15
UniRef50_A4XN94 Transposase IS66 n=25 Tax=Gammaproteobacteria Re...  84  2e-15
UniRef50_A3DCS3 Transposase IS66 n=7 Tax=Clostridiales RepID=A3D...  83  2e-15
UniRef50_Q0ABQ0 Integron integrase n=29 Tax=Proteobacteria RepID...  83  2e-15
UniRef50_C5SQQ1 Transposase IS66 n=1 Tax=Asticcacaulis excentric...  83  2e-15
UniRef50_B4D7C9 Transposase IS66 n=5 Tax=Chthoniobacter flavus E...  83  3e-15
UniRef50_C4ZMM2 Transposase IS66 n=23 Tax=Proteobacteria RepID=C...  83  3e-15
UniRef50_C4ZDI2 Transposase n=7 Tax=Clostridiales RepID=C4ZDI2_E...  83  4e-15
UniRef50_C0QE05 Transposase n=3 Tax=Deltaproteobacteria RepID=C0...  82  5e-15
UniRef50_B9K420 Transposase n=13 Tax=Alphaproteobacteria RepID=B...  82  6e-15
UniRef50_C0ZIN3 Putative uncharacterized protein n=1 Tax=Breviba...  82  7e-15
UniRef50_A6WIY2 Transposase IS66 n=9 Tax=Gammaproteobacteria Rep...  82  7e-15
UniRef50_C9CRY0 Transposase IS66 n=2 Tax=Silicibacter sp. TrichC...  82  8e-15
UniRef50_B8FCN1 Transposase IS66 n=18 Tax=Bacteria RepID=B8FCN1_...  82  8e-15
UniRef50_A1ZEN9 TnpC protein n=6 Tax=Microscilla marina ATCC 231...  82  8e-15
UniRef50_C8SL35 Putative uncharacterized protein n=1 Tax=Mesorhi...  81  1e-14
UniRef50_B4S337 Transposase n=3 Tax=Alteromonas macleodii 'Deep ...  81  1e-14
UniRef50_Q5P882 IS66 Orf1 transposase n=5 Tax=cellular organisms...  81  1e-14
UniRef50_B3HDR4 IS66 family element, transposase n=7 Tax=Enterob...  81  1e-14
UniRef50_Q13ZD6 Transposase ISPpu14 orf3 like, IS66 family n=51 ...  80  2e-14
UniRef50_C8WYG4 Transposase IS66 n=5 Tax=Bacteria RepID=C8WYG4_A...  80  2e-14
UniRef50_Q1NKD7 ISPsy5, transposase n=1 Tax=delta proteobacteriu...  80  3e-14
UniRef50_C3RGW7 Transposase n=13 Tax=Bacteroides RepID=C3RGW7_9BACE  79  4e-14
UniRef50_A3ZP54 Putative uncharacterized protein n=1 Tax=Blastop...  79  5e-14
UniRef50_A3X278 Putative transposase n=1 Tax=Nitrobacter sp. Nb-...  79  5e-14
UniRef50_D0TYT2 Integron integrase n=1 Tax=Bacteroides sp. 2_1_2...  78  6e-14
UniRef50_A9G0V4 Transposase n=6 Tax=Sorangium cellulosum 'So ce ...  78  6e-14
UniRef50_A8S0A7 Putative uncharacterized protein n=1 Tax=Clostri...  78  6e-14
UniRef50_A6KWF8 Transposase n=7 Tax=Bacteroides RepID=A6KWF8_BACV8  78  7e-14
UniRef50_P55630 Uncharacterized protein y4qI n=19 Tax=Alphaprote...  78  7e-14
UniRef50_Q1VRT6 Transposase n=5 Tax=Psychroflexus torquis ATCC 7...  78  9e-14
UniRef50_C3QM77 Transposase IS66 n=1 Tax=Bacteroides sp. D1 RepI...  78  1e-13
UniRef50_A8VYR6 Transposase and inactivated derivatives-like pro...
78 1e-13 UniRef50_A6TJJ0 Transposase IS66 n=1 Tax=Alkaliphilus metallired... 78 1e-13 UniRef50_Q3IV06 Transposase IS66 n=9 Tax=Bacteria RepID=Q3IV06_R... 78 1e-13 UniRef50_B6EI61 Transposase n=28 Tax=Gammaproteobacteria RepID=B... 77 1e-13 UniRef50_UPI0001BC30F4 transposase n=1 Tax=Butyrivibrio crossotu... 77 2e-13 UniRef50_A0LAY3 Transposase IS66 n=7 Tax=Magnetococcus sp. MC-1 ... 77 2e-13 UniRef50_D2QZY9 Transposase IS66 n=1 Tax=Pirellula staleyi DSM 6... 77 3e-13 UniRef50_C6N6I2 Truncated transposase IS66 n=2 Tax=Legionella Re... 76 3e-13 UniRef50_B8I6W2 Transposase IS66 n=3 Tax=Clostridiales RepID=B8I... 76 4e-13 UniRef50_Q0AUV2 Transposase and inactivated derivatives-like pro... 75 5e-13 UniRef50_C6MXH4 Transposase IS66 n=3 Tax=Legionella drancourtii ... 75 6e-13 UniRef50_D1W152 IS66 family element, transposase n=3 Tax=Prevote... 75 6e-13 UniRef50_A6L6H8 Transposase n=8 Tax=Bacteroidales RepID=A6L6H8_B... 75 6e-13 UniRef50_Q24RY6 Putative uncharacterized protein n=1 Tax=Desulfi... 75 8e-13 UniRef50_B7CDZ5 Putative uncharacterized protein n=1 Tax=Eubacte... 75 9e-13 UniRef50_A9HSI6 Probable insertion sequence transposase protein ... 75 1e-12 UniRef50_B8KMJ5 ISPsy5, transposase n=1 Tax=gamma proteobacteriu... 75 1e-12 UniRef50_A9AMV4 ISBmu30 transposase n=26 Tax=Proteobacteria RepI... 74 1e-12 UniRef50_B5JDW6 Transposase IS66 family n=1 Tax=Verrucomicrobiae... 74 1e-12 UniRef50_A9GG21 Transposase n=1 Tax=Sorangium cellulosum 'So ce ... 73 2e-12 UniRef50_C7XFR9 IS66 family transposase n=7 Tax=Bacteroidales Re... 73 2e-12 UniRef50_Q1D8I4 Transposase, IS66 family, truncated n=1 Tax=Myxo... 73 2e-12 UniRef50_Q2YKH5 Transposase IS66 family n=37 Tax=Brucella RepID=... 73 2e-12 UniRef50_A5WEB8 Transposase IS66 n=20 Tax=Proteobacteria RepID=A... 73 3e-12 UniRef50_UPI000197B598 hypothetical protein BACCOPRO_01649 n=1 T... 73 3e-12 UniRef50_Q08VL0 Transposase and inactivated derivative n=13 Tax=... 
73 3e-12 UniRef50_D0DW05 Transposase IS66 n=1 Tax=Lactobacillus fermentum... 73 4e-12 UniRef50_Q08RD8 Transposase IS66 family n=6 Tax=Stigmatella aura... 72 4e-12 UniRef50_Q3IV15 Putative transposase n=1 Tax=Rhodobacter sphaero... 72 5e-12 UniRef50_A6KXM9 Transposase n=23 Tax=Bacteroides RepID=A6KXM9_BACV8 72 6e-12 UniRef50_UPI00003825AD COG3436: Transposase and inactivated deri... 72 6e-12 UniRef50_A8YU80 Transposase ORF_C n=22 Tax=Lactobacillales RepID... 72 7e-12 UniRef50_C7XG25 Transposase n=7 Tax=Bacteroidales RepID=C7XG25_9... 72 7e-12 UniRef50_B3WYX6 Lysyl-tRNA synthetase, heat inducible n=4 Tax=Ga... 71 9e-12 UniRef50_A5ZJI7 Putative uncharacterized protein n=1 Tax=Bactero... 71 1e-11 UniRef50_A6NXJ3 Putative uncharacterized protein n=1 Tax=Bactero... 71 1e-11 UniRef50_Q31T57 Putative uncharacterized protein n=3 Tax=Enterob... 71 1e-11 UniRef50_A0LPQ3 ISPpu15, transposase Orf2 n=1 Tax=Syntrophobacte... 71 1e-11 UniRef50_B7LIJ6 Putative uncharacterized protein n=1 Tax=Escheri... 70 2e-11 UniRef50_B9YEG7 Putative uncharacterized protein n=1 Tax=Holdema... 70 2e-11 UniRef50_B2AIZ9 Fused transposase IS66/IS21 n=38 Tax=Proteobacte... 70 3e-11 UniRef50_C6LKU9 ISPsy5, transposase n=1 Tax=Bryantella formatexi... 69 4e-11 UniRef50_UPI0001C376A4 transposase IS66 n=2 Tax=Ruminococcus fla... 69 5e-11 UniRef50_Q07GD7 Putative uncharacterized protein n=1 Tax=Roseoba... 68 6e-11 UniRef50_C4I9X2 Fused transposase IS66/IS21 n=8 Tax=Burkholderia... 68 6e-11 UniRef50_UPI0001C34DE2 transposase IS66 n=4 Tax=Clostridium sp. ... 67 2e-10 UniRef50_D1PX91 Cytochrome o ubiquinol oxidase n=1 Tax=Prevotell... 67 2e-10 UniRef50_D1PFL7 Putative cytOchrome o ubiquinol oxidase, subunit... 67 3e-10 UniRef50_P55504 Uncharacterized protein y4jD n=13 Tax=cellular o... 67 3e-10 UniRef50_UPI00016C41EC ISPpu13, transposase Orf2 n=1 Tax=Gemmata... 66 3e-10 UniRef50_A3JTL7 Probable insertion sequence transposase protein ... 
66 4e-10 UniRef50_A6LHQ0 Transposase n=2 Tax=Bacteroidales RepID=A6LHQ0_P... 66 4e-10 UniRef50_A9HWT3 Probable insertion sequence transposase protein ... 64 1e-09 UniRef50_C2DPG4 Nitrite extrusion protein 2 n=1 Tax=Escherichia ... 64 2e-09 UniRef50_C6ZBZ0 Transposase n=26 Tax=Bacteroides RepID=C6ZBZ0_9BACE 64 2e-09 UniRef50_UPI00016C567B ISPsy5, transposase n=1 Tax=Gemmata obscu... 63 2e-09 UniRef50_D1JMC2 Transposase n=1 Tax=Bacteroides sp. 2_1_16 RepID... 63 2e-09 UniRef50_D1UHS9 Transposase n=2 Tax=Burkholderiales RepID=D1UHS9... 63 2e-09 UniRef50_Q2K2K4 Putative insertion sequence transposase protein ... 63 3e-09 UniRef50_Q1A683 Transposase (Fragment) n=4 Tax=Clostridiales Rep... 63 3e-09 UniRef50_A3JL62 ISPpu15, transposase n=2 Tax=Marinobacter sp. EL... 62 7e-09 UniRef50_B9TJY8 Putative uncharacterized protein n=1 Tax=Ricinus... 61 1e-08 UniRef50_Q1GW50 Transposase and inactivated derivative n=2 Tax=S... 60 2e-08 UniRef50_A9ML85 Putative uncharacterized protein n=1 Tax=Salmone... 54 1e-06 Sequences not found previously or not previously below threshold: UniRef50_B3PIF8 IS66 family element, transposase n=8 Tax=Bacteri... 78 1e-13 UniRef50_A8S332 Putative uncharacterized protein (Fragment) n=3 ... 76 3e-13 UniRef50_C6LKX4 Transposase IS66 n=13 Tax=Clostridiales RepID=C6... 76 4e-13 UniRef50_C9LAQ6 ISPpu13, transposase Orf2 n=15 Tax=Clostridiales... 73 2e-12 UniRef50_A9DHW8 Putative uncharacterized protein n=10 Tax=Shewan... 72 7e-12 UniRef50_A1RLT8 Transposase IS66 n=94 Tax=Gammaproteobacteria Re... 70 3e-11 UniRef50_C0ETK9 Putative uncharacterized protein n=1 Tax=Eubacte... 69 4e-11 UniRef50_B0NH70 Putative uncharacterized protein n=2 Tax=Clostri... 68 7e-11 UniRef50_C7H2D2 ISPsy5, transposase n=1 Tax=Faecalibacterium pra... 66 3e-10 UniRef50_B0ACF8 Putative uncharacterized protein n=1 Tax=Clostri... 66 4e-10 UniRef50_Q6LRS5 Hypothetical transposase n=10 Tax=Gammaproteobac... 66 4e-10 UniRef50_B3DSN6 Transposase n=4 Tax=Bifidobacterium longum RepID... 
65 6e-10 UniRef50_UPI00017448EA transposase IS66 n=4 Tax=Verrucomicrobium... 65 7e-10 UniRef50_A2UYM3 Conserved hypothetical ISPpu15, transposase n=1 ... 65 9e-10 UniRef50_UPI0001C35C79 hypothetical protein ChatD1_36139 n=1 Tax... 63 2e-09 UniRef50_A5VI71 Transposase IS66 n=28 Tax=Lactobacillus RepID=A5... 63 3e-09 UniRef50_Q3Y235 Transposase IS66 n=11 Tax=Enterococcus RepID=Q3Y... 63 4e-09 UniRef50_A9EDB3 Putative transposase n=1 Tax=Oceanibulbus indoli... 62 6e-09 UniRef50_D1PHR2 Putative transposase IS66 n=1 Tax=Prevotella cop... 61 1e-08 UniRef50_D0AFL5 Transposase n=5 Tax=Enterococcus faecium RepID=D... 61 1e-08 UniRef50_C9K6B9 IS66 family transposase n=1 Tax=Sphingomonas sp.... 60 2e-08 UniRef50_B3WDV3 Putative uncharacterized protein n=1 Tax=Lactoba... 60 3e-08 UniRef50_C2EYW0 Putative uncharacterized protein n=2 Tax=Lactoba... 59 4e-08 UniRef50_B3PCE0 IS66 family element, transposase n=2 Tax=Gammapr... 58 6e-08 UniRef50_B3W8G2 Transposase n=10 Tax=Lactobacillus RepID=B3W8G2_... 58 1e-07 UniRef50_A6LGZ8 Putative uncharacterized protein n=1 Tax=Parabac... 56 4e-07 UniRef50_UPI0001C34846 hypothetical protein PretD1_08486 n=2 Tax... 56 5e-07 UniRef50_C6IUV9 Transposase n=4 Tax=Bacteroides RepID=C6IUV9_9BACE 55 7e-07 UniRef50_A3JZ99 Putative transposase n=1 Tax=Sagittula stellata ... 55 8e-07 UniRef50_UPI0001744C57 ISPpu14, transposase Orf3 n=1 Tax=Verruco... 51 8e-06 UniRef50_C7GII7 ISPsy5, transposase (Fragment) n=3 Tax=Clostridi... 51 1e-05 UniRef50_B9KMR1 Transposase IS66 n=14 Tax=Alphaproteobacteria Re... 51 1e-05 UniRef50_UPI0001744E99 transposase IS66 n=3 Tax=Verrucomicrobium... 50 3e-05 UniRef50_C9L9U8 Putative uncharacterized protein n=1 Tax=Blautia... 50 3e-05 UniRef50_UPI000190820F insertion sequence transposase protein n=... 49 4e-05 UniRef50_B9KUH0 Putative uncharacterized protein n=1 Tax=Rhodoba... 49 4e-05 UniRef50_UPI0001C34DDA transposase n=1 Tax=Clostridium sp. M62/1... 
48 8e-05 UniRef50_Q024Q9 Transposase IS66 n=1 Tax=Candidatus Solibacter u... 48 1e-04 UniRef50_Q8VSS7 Putative uncharacterized protein n=2 Tax=Bactero... 47 2e-04 UniRef50_Q2K2A5 Hypothetical conserved protein n=1 Tax=Rhizobium... 46 3e-04 UniRef50_D1K0Z1 Putative uncharacterized protein n=2 Tax=Bactero... 46 4e-04 UniRef50_A5FT16 Putative uncharacterized protein n=2 Tax=Acidiph... 45 6e-04 UniRef50_P39351 Uncharacterized protein yjgZ n=7 Tax=Escherichia... 45 8e-04 UniRef50_D1PFL8 Putative transposase number 3 n=1 Tax=Prevotella... 45 0.001 UniRef50_A7V2V7 Putative uncharacterized protein n=1 Tax=Bactero... 45 0.001 UniRef50_C7RJI1 Transposase IS66 n=1 Tax=Candidatus Accumulibact... 45 0.001 UniRef50_D1RLE0 Putative transposase n=3 Tax=Legionella longbeac... 44 0.002 UniRef50_A1VVP9 Putative uncharacterized protein n=1 Tax=Polarom... 43 0.003 UniRef50_Q2JA04 Transposase IS66 n=3 Tax=Frankia sp. CcI3 RepID=... 43 0.004 UniRef50_C0GVV2 Transposase IS66 n=2 Tax=Desulfonatronospira thi... 43 0.004 UniRef50_B7NGN8 Putative uncharacterized protein n=1 Tax=Escheri... 42 0.004 UniRef50_Q2JEM7 Transposase IS66 n=5 Tax=Frankia RepID=Q2JEM7_FRASC 41 0.010 UniRef50_Q3J215 Putative uncharacterized protein n=1 Tax=Rhodoba... 41 0.010 UniRef50_Q5WUX9 Putative uncharacterized protein n=1 Tax=Legione... 41 0.011 UniRef50_Q7WTH2 Putative uncharacterized protein n=1 Tax=Escheri... 40 0.017 UniRef50_A6L7G0 Transposase n=15 Tax=Bacteroides RepID=A6L7G0_BACV8 40 0.018 UniRef50_B0UII5 Putative uncharacterized protein n=1 Tax=Methylo... 40 0.021 UniRef50_C6HW71 Probable transposase n=1 Tax=Leptospirillum ferr... 40 0.025 >UniRef50_Q9JMS9 Uncharacterized protein yuaK n=3 Tax=Escherichia coli RepID=YUAK_ECOLI Length = 94 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. 
Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS Sbjct: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60 Query: 61 LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL Sbjct: 61 LIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 >UniRef50_B5EKE2 Transposase IS66 n=6 Tax=Acidithiobacillus RepID=B5EKE2_ACIF5 Length = 545 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 5 LRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGT 64 L + SI +P+ + + + G+ +GGE AA YS+I T Sbjct: 455 LYLEEGQLSIDNNPVERALRGVA----------IGRKNFLFVGNDAGGERAASFYSIIET 504 Query: 65 CRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 C+LN VE +LC +E + W + +L Sbjct: 505 CKLNGVEPFAYLCDVLEKLPTWPNKRLHEL 534 >UniRef50_Q2W201 Transposase and inactivated derivative n=22 Tax=Proteobacteria RepID=Q2W201_MAGSA Length = 532 Score = 101 bits (251), Expect = 1e-20, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 37/76 (48%), Gaps = 7/76 (9%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + + +++ + L + AGS +GG+ AA +Y+L T +LN ++ E +L Sbjct: 457 NNAAERAMRPL-------ALGRKNWLFAGSDAGGDRAAAIYTLTETAKLNGLDPEAYLRD 509 Query: 79 GIEHIQDWSANLVRDL 94 + I D N + DL Sbjct: 510 VLTRIADHPVNRIADL 525 >UniRef50_Q9S116 Orf51 protein n=166 Tax=root RepID=Q9S116_ECOLX Length = 523 Score = 99.6 bits (247), Expect = 2e-20, Method: Composition-based stats. 
Identities = 45/87 (51%), Positives = 50/87 (57%), Gaps = 10/87 (11%) Query: 8 SGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRL 67 S I + + + + + AGS SGGEHAAVLYSLIGTCRL Sbjct: 437 SNGWVEIDNNIAENALRGVA----------VGRKNWLFAGSDSGGEHAAVLYSLIGTCRL 486 Query: 68 NNVELEKWLCYGIEHIQDWSANLVRDL 94 NNVE EKWL Y IEHIQDW AN VRDL Sbjct: 487 NNVEPEKWLRYVIEHIQDWPANRVRDL 513 >UniRef50_A6FSH9 Transposase and inactivated derivative n=7 Tax=Bacteria RepID=A6FSH9_9RHOB Length = 509 Score = 98.1 bits (243), Expect = 7e-20, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + + SI+ I L + GS GG+ AA+ Y+LI T ++N+V+ E WL Sbjct: 431 DNNTCERSIRPI-------ALGRKNYLFMGSIGGGKAAAIAYTLIETAKMNDVDPEAWLT 483 Query: 78 YGIEHIQDWSANLVRDL 94 + ++ + D N + +L Sbjct: 484 WVLQRLPDHKINRIDEL 500 >UniRef50_Q3ZU23 OrfD, ISEc8 n=37 Tax=Proteobacteria RepID=Q3ZU23_ECOLX Length = 331 Score = 98.1 bits (243), Expect = 8e-20, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + + +++ + CL + GS GGE A+LY LIGTCRLN ++ E +L Sbjct: 251 TDNNTAERALRAV-------CLGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYL 303 Query: 77 CYGIEHIQDWSANLVRDL 94 + + + +W +N V +L Sbjct: 304 RHILSVLPEWPSNRVDEL 321 >UniRef50_A9EDY1 Transposase n=3 Tax=Kordia algicida OT-1 RepID=A9EDY1_9FLAO Length = 485 Score = 98.1 bits (243), Expect = 9e-20, Method: Composition-based stats. 
Identities = 20/76 (26%), Positives = 37/76 (48%), Gaps = 7/76 (9%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 ++ +I+ + L + AGS ++AA++YS TC++N+V KWL Sbjct: 406 NNLIENAIRPL-------ALGRKNYLFAGSHKAAQNAAMMYSFFATCKINDVNPYKWLHD 458 Query: 79 GIEHIQDWSANLVRDL 94 E + + AN + +L Sbjct: 459 VFERLPEHKANKLEEL 474 >UniRef50_Q1RPJ6 ECs1339 protein n=175 Tax=Bacteria RepID=Q1RPJ6_ECOLX Length = 537 Score = 97.7 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + + +++ + CL + GS GGE A+LY LIGTCRLN ++ E +L Sbjct: 457 ADNNAAERALRAV-------CLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEAYL 509 Query: 77 CYGIEHIQDWSANLVRDL 94 + + + +W +N V +L Sbjct: 510 RHILSVLPEWPSNRVDEL 527 >UniRef50_A4JH71 Transposase IS66 n=4 Tax=Proteobacteria RepID=A4JH71_BURVG Length = 514 Score = 97.7 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + + +I+ + L + AGS GG+ AAV+YSLIGT RLN++E +L Sbjct: 429 DNNTAERAIRPLV-------LGRRNYLFAGSDGGGQSAAVIYSLIGTARLNDIEPFAYLH 481 Query: 78 YGIEHIQDWSANLVRDL 94 E I D N + +L Sbjct: 482 TVFERIADHPINRIDEL 498 >UniRef50_Q11ZA7 Transposase IS66 n=6 Tax=Burkholderiales RepID=Q11ZA7_POLSJ Length = 537 Score = 97.0 bits (240), Expect = 2e-19, Method: Composition-based stats. 
Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 7/80 (8%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 I + + S++ + + + GS GG AAV+Y+LIGT +L + + Sbjct: 444 IEADNNIAERSLRGV-------AIGRKNYLHFGSDGGGHTAAVIYTLIGTAKLCGINPQT 496 Query: 75 WLCYGIEHIQDWSANLVRDL 94 +L Y +E I D N + +L Sbjct: 497 YLRYVLERIADHPINRIDEL 516 >UniRef50_B2AJ19 Transposase, IS66 familly n=24 Tax=cellular organisms RepID=B2AJ19_CUPTR Length = 523 Score = 95.4 bits (236), Expect = 6e-19, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 39/84 (46%), Gaps = 10/84 (11%) Query: 11 KTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNV 70 + I P+ + + + AG+ SGGE AA +YSLIGT +LN V Sbjct: 437 RLEIDNLPVERALRGVA----------IGRRNYLFAGADSGGERAAAIYSLIGTAKLNGV 486 Query: 71 ELEKWLCYGIEHIQDWSANLVRDL 94 + E +L + + I D N V +L Sbjct: 487 DPEAYLRFVLARIADHPINRVDEL 510 >UniRef50_C7I5Z5 Transposase IS66 n=2 Tax=Thiomonas intermedia K12 RepID=C7I5Z5_THIIN Length = 513 Score = 94.6 bits (234), Expect = 8e-19, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 10/87 (11%) Query: 8 SGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRL 67 S + I +P ++ +I+ + L + GS G AAVL +LI + +L Sbjct: 422 SDGRVPIDNNP---VENAIRPL-------ALGRKNWLFVGSPQAGSRAAVLMTLIESAKL 471 Query: 68 NNVELEKWLCYGIEHIQDWSANLVRDL 94 V+ +L + + W + + +L Sbjct: 472 CEVDPWAYLKDVLTKLPTWPNSRLSEL 498 >UniRef50_B5K4C7 Transposase IS66 n=27 Tax=Rhodobacterales RepID=B5K4C7_9RHOB Length = 552 Score = 94.6 bits (234), Expect = 9e-19, Method: Composition-based stats. 
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + + ++ + + + GS +GG+ AA+ Y+LI T ++N V E WL Sbjct: 461 DNNTAENAVHPV-------AVGRKNYLFMGSEAGGKSAAIAYTLIETAKMNKVNPEAWLA 513 Query: 78 YGIEHIQDWSANLVRDL 94 + +E IQD AN + DL Sbjct: 514 WVLERIQDHPANRINDL 530 >UniRef50_Q5P8K4 Transposase n=7 Tax=Proteobacteria RepID=Q5P8K4_AZOSE Length = 525 Score = 93.5 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 35/78 (44%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 ++ +I+ I L + AGS + G+ AAV+ SL+ T R N E WL Sbjct: 444 IDNNPIENAIRPI-------ALGKKNWMFAGSEAAGKRAAVIQSLLATARANGFEPLAWL 496 Query: 77 CYGIEHIQDWSANLVRDL 94 +E + W + + +L Sbjct: 497 SDTLEKLPAWPNSRIDEL 514 >UniRef50_Q8GAR2 Orf51 (Fragment) n=2 Tax=Proteobacteria RepID=Q8GAR2_ECOLX Length = 92 Score = 92.7 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 42/58 (72%), Positives = 44/58 (75%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + AGS S GEHAAVLYSLIGTCRLNNVE EKWL Y IEHIQDW AN VRDL Sbjct: 12 AVGRKNWLFAGSDSSGEHAAVLYSLIGTCRLNNVEPEKWLRYVIEHIQDWPANRVRDL 69 >UniRef50_C6MJH1 Transposase IS66 n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MJH1_9PROT Length = 527 Score = 91.9 bits (227), Expect = 6e-18, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 ++ SI+ I + + GS G+ AA + +L+GT +LN + WL Sbjct: 442 IDNNPVENSIRPI-------AIGKKNWLFTGSQRAGQRAANIQTLLGTAQLNGLNPGAWL 494 Query: 77 CYGIEHIQDWSANLVRDL 94 + + W + + +L Sbjct: 495 NDILTKLPTWPNSRIDEL 512 >UniRef50_P50360 Uncharacterized protein y4hP n=117 Tax=cellular organisms RepID=Y4HP_RHISN Length = 552 Score = 91.6 bits (226), Expect = 6e-18, Method: Composition-based stats. 
Identities = 20/80 (25%), Positives = 41/80 (51%), Gaps = 7/80 (8%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 +C + + +++++ L + + AGS G + AAV+ ++I TCRLN+++ + Sbjct: 463 VCLTNNAAERALRSV-------ALGRRNWTFAGSQRGADRAAVMLTVITTCRLNDIDPKA 515 Query: 75 WLCYGIEHIQDWSANLVRDL 94 WL + I D + +L Sbjct: 516 WLADVLARIADHPVTRLYEL 535 >UniRef50_D1N8F2 Transposase IS66 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N8F2_9BACT Length = 496 Score = 91.6 bits (226), Expect = 8e-18, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 10/88 (11%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 + PK +I +P L + I + AGS +GG+ A+LYS +C+ Sbjct: 409 LNNPKLNIDNNPAERLNRGVAII----------RKNCLFAGSETGGQRLAILYSFAASCK 458 Query: 67 LNNVELEKWLCYGIEHIQDWSANLVRDL 94 NN+ +WL + + SAN + L Sbjct: 459 ANNICFRQWLEDVLPRLSSTSANQIESL 486 >UniRef50_D2WFI4 IS66 transposase n=1 Tax=Escherichia coli O26:H- RepID=D2WFI4_ECOLX Length = 522 Score = 91.2 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 28/60 (46%), Positives = 37/60 (61%) Query: 35 IGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + L + GS GGE AAV+YSLIG+C+LN +E E WL + I I W AN V++L Sbjct: 453 VVALGRRNYMFFGSDGGGESAAVMYSLIGSCKLNGIEPETWLRHVISVINTWPANRVKEL 512 >UniRef50_Q07SJ6 Putative uncharacterized protein n=4 Tax=Rhizobiales RepID=Q07SJ6_RHOP5 Length = 95 Score = 91.2 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 31/58 (53%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + GS GG+ AA++YSLI T ++N+V+ + WL + I + + +L Sbjct: 22 TLGRKSWLFCGSDRGGDRAALMYSLIVTAKMNDVDPQAWLADVLARIAEHPVQRLDEL 79 >UniRef50_A2V4E0 Transposase IS66 n=1 Tax=Shewanella putrefaciens 200 RepID=A2V4E0_SHEPU Length = 517 Score = 90.8 bits (224), Expect = 1e-17, Method: Composition-based stats. 
Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 S + SI+ I L + AGS +GGE AAVLY+++GT RLN++ ++L Sbjct: 431 IDNNSAERSIRPI-------ALGRKNYLFAGSKAGGERAAVLYTILGTARLNDINPNQYL 483 Query: 77 CYGIEHIQDWSANLVRDL 94 ++ I N V +L Sbjct: 484 TAVLKRIGQHQINKVDEL 501 >UniRef50_A7IP73 Transposase IS66 n=23 Tax=Alphaproteobacteria RepID=A7IP73_XANP2 Length = 510 Score = 90.8 bits (224), Expect = 1e-17, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 38/78 (48%), Gaps = 8/78 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ SI+ + L + A AGS G EH AV+ SL+ TC+LN+++ + +L Sbjct: 425 DNNIVERSIRPL-------ALTRKNALFAGSDGGAEHWAVIASLVETCKLNDIDPQAYLA 477 Query: 78 YGIEHI-QDWSANLVRDL 94 I I + + DL Sbjct: 478 DVITRIVNGHPNSRIDDL 495 >UniRef50_A6BYX4 TnpC protein n=2 Tax=Planctomyces maris DSM 8797 RepID=A6BYX4_9PLAN Length = 507 Score = 90.8 bits (224), Expect = 1e-17, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 37/81 (45%), Gaps = 7/81 (8%) Query: 14 IICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELE 73 I+ + +++ + + GS GGE AAV YSL+ +C+ N VE Sbjct: 416 ILSIDNNLAERTLRP-------CAIGRKNYLFVGSDRGGEAAAVHYSLMASCKANEVEPF 468 Query: 74 KWLCYGIEHIQDWSANLVRDL 94 +L + I D +A+ + +L Sbjct: 469 AYLRDVLAQITDHAADRLEEL 489 >UniRef50_Q1NGZ9 TnpC protein n=5 Tax=Sphingomonas RepID=Q1NGZ9_9SPHN Length = 517 Score = 90.0 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 36/89 (40%), Gaps = 11/89 (12%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 + I + + L + AGS +GG+ AA +YS+I T + Sbjct: 423 LDDGRLEIDNNIAERAMRCVA----------LGRKNWLFAGSKAGGDRAAAIYSVIETAK 472 Query: 67 LNNVELEKWLCYGIEHIQ-DWSANLVRDL 94 LN +E + ++ I I +W A +L Sbjct: 473 LNGLEPQAYIADVIARIAGNWPATRWDEL 501 >UniRef50_UPI0001C38241 transposase IS66 n=1 Tax=Arthrospira platensis str. 
Paraca RepID=UPI0001C38241 Length = 535 Score = 89.2 bits (220), Expect = 3e-17, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 31/78 (39%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 +++ +I+ I + + AGS S G AA + SL+ T + N ++ WL Sbjct: 454 IDNNAVENAIRPI-------AVGRKNWLFAGSQSAGVRAAAIMSLLATAKANGLDPHAWL 506 Query: 77 CYGIEHIQDWSANLVRDL 94 + + + L Sbjct: 507 SDVLTRLPTTKDRDIDTL 524 >UniRef50_A6WXB5 Transposase IS66 n=39 Tax=Alphaproteobacteria RepID=A6WXB5_OCHA4 Length = 545 Score = 89.2 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + + +++ I + + AG+ +G E A ++I T ++N + + +L Sbjct: 460 DNNAAERALRPIG-------VGRRNWLFAGADTGAETLARAMTIIETAKMNGINPQAYLA 512 Query: 78 YGIEHIQDWSANLVRDL 94 ++ I D N + +L Sbjct: 513 DVLDRIHDHKINRLDEL 529 >UniRef50_A3X2G6 Putative transposase n=5 Tax=Alphaproteobacteria RepID=A3X2G6_9BRAD Length = 513 Score = 89.2 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 35/78 (44%), Gaps = 8/78 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 +++ +I+ +T L + AGS G + A + SLI T +LN+VE +L Sbjct: 431 DTNTVERAIRPVT-------LGRKNHLFAGSDGGAQRWATVCSLITTAKLNDVEPFTYLK 483 Query: 78 YGIEHI-QDWSANLVRDL 94 +E + + + L Sbjct: 484 DILERMSAGHPMSRLDQL 501 >UniRef50_A7HGL2 Transposase IS66 n=7 Tax=Cystobacterineae RepID=A7HGL2_ANADF Length = 515 Score = 88.9 bits (219), Expect = 4e-17, Method: Composition-based stats. 
Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 10/90 (11%) Query: 5 LRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGT 64 L + P I + + +++ + L + G+ GE+ A LYSLI T Sbjct: 422 LFLTDPHLPIDNNAS---ERALR-------VAALGRKNFLFVGTNEAGENLAGLYSLIAT 471 Query: 65 CRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 C N V +L + +Q A+ + +L Sbjct: 472 CEANGVNPVDYLADVLIRVQTHPASQIDEL 501 >UniRef50_C8QDU0 Transposase IS66 n=9 Tax=Enterobacteriaceae RepID=C8QDU0_9ENTR Length = 531 Score = 88.9 bits (219), Expect = 5e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + I+ + + AGS + GE AA + L+ T ++N +E WL Sbjct: 453 NNRAERVIRPV-------AMGRNNWLFAGSLAAGERAARIMGLLETAKMNGLEPHAWLSD 505 Query: 79 GIEHIQDWSANLVRDL 94 ++ + WS + + +L Sbjct: 506 VLKRLPSWSEDRLDEL 521 >UniRef50_Q320F8 ISSfl4 ORF3 n=21 Tax=Enterobacteriaceae RepID=Q320F8_SHIBS Length = 187 Score = 88.5 bits (218), Expect = 7e-17, Method: Composition-based stats. Identities = 27/83 (32%), Positives = 39/83 (46%), Gaps = 10/83 (12%) Query: 12 TSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVE 71 I + + S+ + + GS GGE AA++YSL+ TC+ N VE Sbjct: 106 VEIDNNIGENALRSVA----------VGRKNYLFFGSDKGGESAAIIYSLLVTCKQNEVE 155 Query: 72 LEKWLCYGIEHIQDWSANLVRDL 94 E WL IE + DW +N V +L Sbjct: 156 PEDWLREVIEKLNDWPSNQVHEL 178 >UniRef50_Q6EZC1 L0015-like protein n=30 Tax=Enterobacteriaceae RepID=Q6EZC1_ECOLX Length = 321 Score = 87.7 bits (216), Expect = 1e-16, Method: Composition-based stats. 
Identities = 20/78 (25%), Positives = 38/78 (48%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + + +++ + CL + G+ GE A+LY L G CRLN ++ E +L Sbjct: 241 ADNNTAERALRAV-------CLGKKNYMFFGNDHVGERGALLYGLTGNCRLNGIDPEAYL 293 Query: 77 CYGIEHIQDWSANLVRDL 94 + + + W +N V +L Sbjct: 294 RHILSVLPKWLSNRVDEL 311 >UniRef50_Q322B9 ISSfl3 orfC n=24 Tax=Enterobacteriaceae RepID=Q322B9_SHIBS Length = 533 Score = 87.3 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 32/77 (41%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + +IK + L + AGS GE AA + SL+ T + N +E WL Sbjct: 454 DNNVCERAIKNVV-------LGRKSWLFAGSQMAGERAAQIMSLLETAKRNGLEPHAWLT 506 Query: 78 YGIEHIQDWSANLVRDL 94 + + +W + +L Sbjct: 507 DVLMRLPEWPEERLAEL 523 >UniRef50_B9TBF4 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TBF4_RICCO Length = 103 Score = 86.9 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ +I+ + + + GS +GG+ AAV YS++ TCR N++ +L Sbjct: 2 DSNLIERAIRRV-------AIARKNHLFFGSEAGGKVAAVFYSMLATCRANDINPYDYLS 54 Query: 78 YGIEHIQDWSANLVRDL 94 + I D N + +L Sbjct: 55 DVLGRINDHPINRIEEL 71 >UniRef50_A1VVP8 Transposase IS66 n=8 Tax=Proteobacteria RepID=A1VVP8_POLNA Length = 528 Score = 86.6 bits (213), Expect = 2e-16, Method: Composition-based stats. 
Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 L+ I+ + + AGS G+ AAV+ SL+ + +L+ + +L Sbjct: 449 IDNNHLENLIRP-------WAMGRRAWLFAGSELAGQRAAVVMSLLQSAKLHGHDPWAYL 501 Query: 77 CYGIEHIQDWSANLVRDL 94 + + + + +L Sbjct: 502 KDVLTRLPGHMNSRIDEL 519 >UniRef50_D2LKV6 Transposase IS66 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LKV6_RHOVA Length = 535 Score = 85.4 bits (210), Expect = 6e-16, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 8/78 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 +++ SI+ + L + A AG G EH ++ SLI T +LN V+ + WL Sbjct: 450 DTNTVERSIRPL-------ALNRKNALFAGHDRGAEHWGIVASLIETAKLNGVDPQAWLA 502 Query: 78 YGIEHI-QDWSANLVRDL 94 + + W + +L Sbjct: 503 SILSRLVNGWPMRKIDEL 520 >UniRef50_C1HRA2 Putative uncharacterized protein n=1 Tax=Escherichia sp. 3_2_53FAA RepID=C1HRA2_9ESCH Length = 223 Score = 85.0 bits (209), Expect = 7e-16, Method: Composition-based stats. Identities = 28/72 (38%), Positives = 39/72 (54%) Query: 23 KTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEH 82 + K + + E GSG GGE A+LYSLIGTC+LN+V+ E +L + + Sbjct: 142 QRKTKPLLKSLESWLREKMKTLFFGSGHGGERGALLYSLIGTCKLNDVDPESYLRHVLGV 201 Query: 83 IQDWSANLVRDL 94 I DW N V +L Sbjct: 202 IADWPVNRVSEL 213 >UniRef50_B9JRL5 Transposase n=55 Tax=cellular organisms RepID=B9JRL5_AGRVS Length = 537 Score = 85.0 bits (209), Expect = 7e-16, Method: Composition-based stats. 
Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 11/92 (11%) Query: 4 RLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIG 63 +L + + I S++ +I+ I L + A AG +G E+ A + SLI Sbjct: 443 KLFLTDGRIEIDN---NSVERTIRPI-------ALNRKNALFAGHDAGAENWATIASLIE 492 Query: 64 TCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 +C+LN V+ +L + I + + +L Sbjct: 493 SCKLNAVDPLAYLSSTLTAIVNGHKQSKIDEL 524 >UniRef50_B1EN06 IS element transposase n=1 Tax=Escherichia albertii TW07627 RepID=B1EN06_9ESCH Length = 89 Score = 83.9 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 38/65 (58%), Positives = 41/65 (63%) Query: 30 TYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + L + A S SGGE AVLYS IGTCRLNNVE EKWL Y IE+IQDW AN Sbjct: 15 SALRQSSARNRKNWLFARSDSGGEQPAVLYSQIGTCRLNNVEPEKWLSYVIENIQDWPAN 74 Query: 90 LVRDL 94 RDL Sbjct: 75 RGRDL 79 >UniRef50_A4XN94 Transposase IS66 n=25 Tax=Gammaproteobacteria RepID=A4XN94_PSEMY Length = 524 Score = 83.9 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 30/77 (38%), Gaps = 9/77 (11%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + +I+ + + + + G +A +YSLI T + N E WL Sbjct: 442 IDNNRAENAIRPFV-------IGRKNWLFSDTPKGATASAQIYSLIETAKANGQEPYAWL 494 Query: 77 CYGIEHIQDWSANLVRD 93 + +E + AN V D Sbjct: 495 RHILERLPA--ANSVED 509 >UniRef50_A3DCS3 Transposase IS66 n=7 Tax=Clostridiales RepID=A3DCS3_CLOTH Length = 511 Score = 83.5 bits (205), Expect = 2e-15, Method: Composition-based stats. 
Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 8/80 (10%) Query: 7 FSGPKTSIICSPMTSL-KTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTC 65 F CS +L + SI+ T L + +GS G + +A +YS++ + Sbjct: 414 FMNYLLDGNCSISNNLSENSIRPFT-------LGRKNWLFSGSPRGADASAAVYSIVESA 466 Query: 66 RLNNVELEKWLCYGIEHIQD 85 + N++ K+L Y + Sbjct: 467 KANDINPYKYLYYIFSELPG 486 >UniRef50_Q0ABQ0 Integron integrase n=29 Tax=Proteobacteria RepID=Q0ABQ0_ALHEH Length = 694 Score = 83.5 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 11/84 (13%), Positives = 32/84 (38%), Gaps = 10/84 (11%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 + + +P + +I+ + + + + G +A++YS+I T + Sbjct: 605 LDDGRIPLDNNPA---ENAIRPFV-------VGRKNWLFSHTTQGAAASAMIYSVIETAK 654 Query: 67 LNNVELEKWLCYGIEHIQDWSANL 90 N +E ++L + + + Sbjct: 655 ANGLEPYEYLEDVLTRLPAADTDQ 678 >UniRef50_C5SQQ1 Transposase IS66 n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SQQ1_9CAUL Length = 537 Score = 83.5 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 8/78 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ +I+ T + + + AGS GG A++ SLI T R+N V + WL Sbjct: 455 DSNIVERAIRPQT-------ITRKNSLFAGSDGGGRTWAIIASLIQTARMNGVNPQAWLT 507 Query: 78 YGIEHIQD-WSANLVRDL 94 ++ I D W+ + + +L Sbjct: 508 QTLQRIADGWTVSRLDEL 525 >UniRef50_B4D7C9 Transposase IS66 n=5 Tax=Chthoniobacter flavus Ellin428 RepID=B4D7C9_9BACT Length = 527 Score = 83.1 bits (204), Expect = 3e-15, Method: Composition-based stats. 
Identities = 13/88 (14%), Positives = 35/88 (39%), Gaps = 10/88 (11%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 + I + ++ +I+ + + G + GE +A+LY++I +CR Sbjct: 432 LEDGRLEIDNNL---VENAIRP-------TAIGKKNWLFFGEAAAGERSAILYTIIESCR 481 Query: 67 LNNVELEKWLCYGIEHIQDWSANLVRDL 94 ++ +L + + ++D+ Sbjct: 482 RRGIDPFAYLRDVFTRLPSMTNWQIKDI 509 >UniRef50_C4ZMM2 Transposase IS66 n=23 Tax=Proteobacteria RepID=C4ZMM2_THASP Length = 531 Score = 82.7 bits (203), Expect = 3e-15, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 36/88 (40%), Gaps = 10/88 (11%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 + P I L+ +++ I L + + + G +H V+ SLI TCR Sbjct: 431 LADPDVPID---TNHLERALRPIP-------LGRKNWMFSWTELGAQHVGVVQSLIATCR 480 Query: 67 LNNVELEKWLCYGIEHIQDWSANLVRDL 94 L+ ++ +L ++ + A V L Sbjct: 481 LHELDPYDYLVDVLQRVDQHPAADVAQL 508 >UniRef50_C4ZDI2 Transposase n=7 Tax=Clostridiales RepID=C4ZDI2_EUBR3 Length = 90 Score = 82.7 bits (203), Expect = 4e-15, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 32/77 (41%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + + SI+ C+ + + + +G E +A++YS+ T + NN++ + Sbjct: 2 DNNAAEQSIRPF-------CVGKKNWVMIDTVAGAEASAMIYSIAETAKANNLKPYNYFK 54 Query: 78 YGIEHIQDWSANLVRDL 94 Y +E I D Sbjct: 55 YLLEEIPRHMDEHGVDF 71 >UniRef50_C0QE05 Transposase n=3 Tax=Deltaproteobacteria RepID=C0QE05_DESAH Length = 531 Score = 81.9 bits (201), Expect = 5e-15, Method: Composition-based stats. 
Identities = 17/75 (22%), Positives = 33/75 (44%), Gaps = 7/75 (9%) Query: 11 KTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNV 70 +TS + + SI+ T + + +G+ G +A +YSLI T + N + Sbjct: 437 ETSHVTPDNNMAENSIRPFT-------IGRKNWLFSGAPEGATASAGIYSLIETAKANGL 489 Query: 71 ELEKWLCYGIEHIQD 85 E +L + E++ Sbjct: 490 EPYWYLRFLFENLPQ 504 >UniRef50_B9K420 Transposase n=13 Tax=Alphaproteobacteria RepID=B9K420_AGRVS Length = 553 Score = 81.9 bits (201), Expect = 6e-15, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 37/92 (40%), Gaps = 11/92 (11%) Query: 4 RLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIG 63 RL I + ++ +I+ + + A AG GG + A SLIG Sbjct: 459 RLFLDDSHVDIDSNL---VENAIR-------RPAMNRRNALFAGHDEGGRNWARFASLIG 508 Query: 64 TCRLNNVELEKWLCYGIEHIQD-WSANLVRDL 94 TC++N +E +LC + + A + L Sbjct: 509 TCKMNGIEPYAYLCDLFTRLANGHIAKDIDAL 540 >UniRef50_C0ZIN3 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZIN3_BREBN Length = 217 Score = 81.5 bits (200), Expect = 7e-15, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 31/78 (39%), Gaps = 8/78 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + SIK + + A S G + +A++YSL+ T + N + ++L Sbjct: 128 DNNRSERSIKPFV-------IGRKNWLFANSPRGAKASAIIYSLLETAKENQLNPFQYLN 180 Query: 78 YGIEHIQDWSA-NLVRDL 94 Y E I S +L Sbjct: 181 YLFEQIPQLSDVKNGEEL 198 >UniRef50_A6WIY2 Transposase IS66 n=9 Tax=Gammaproteobacteria RepID=A6WIY2_SHEB8 Length = 514 Score = 81.5 bits (200), Expect = 7e-15, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 30/81 (37%), Gaps = 8/81 (9%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 I + I+ T + + S +G +A LYSL+ TCR N++ Sbjct: 430 ISIDNNVTERDIRPFTT-------GRKNWMFSTSVNGAHASANLYSLVMTCRANDISPYY 482 Query: 75 WLCYGIEHIQDW-SANLVRDL 94 + + + + + DL Sbjct: 483 YFRHLFTELPKRLPTDDLTDL 503 >UniRef50_C9CRY0 Transposase IS66 n=2 Tax=Silicibacter sp. 
TrichCH4B RepID=C9CRY0_9RHOB Length = 542 Score = 81.5 bits (200), Expect = 8e-15, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 37/78 (47%), Gaps = 8/78 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 +++ +I+ I L + A AG+ G A + SL+GTC+L+ + + +L Sbjct: 459 DTNAVENAIRPIP-------LTRKNALFAGNDDGAVTWARMASLVGTCKLSGINPQAYLE 511 Query: 78 YGIEHI-QDWSANLVRDL 94 + +E I ++DL Sbjct: 512 HVLEKILNGHMQENIKDL 529 >UniRef50_B8FCN1 Transposase IS66 n=18 Tax=Bacteria RepID=B8FCN1_DESAA Length = 546 Score = 81.5 bits (200), Expect = 8e-15, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 7/67 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + + +I+ + + AGS G E +A+ +SLI T + N +E +L Sbjct: 449 DNNAAENAIRPFV-------VGRKNWLFAGSPRGAEASALFFSLIETAKANGLEPFAYLK 501 Query: 78 YGIEHIQ 84 E I Sbjct: 502 VLFERIP 508 >UniRef50_A1ZEN9 TnpC protein n=6 Tax=Microscilla marina ATCC 23134 RepID=A1ZEN9_9SPHI Length = 395 Score = 81.5 bits (200), Expect = 8e-15, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 33/78 (42%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 ++ +I+ L + AGS E A+ YSLIG+C++ V +WL Sbjct: 315 IDNNLIENAIRP-------AALGRKNYLFAGSQDAAERTALFYSLIGSCKMAGVNPLEWL 367 Query: 77 CYGIEHIQDWSANLVRDL 94 I++I + + L Sbjct: 368 TDVIKNINNQPIQKLHLL 385 >UniRef50_C8SL35 Putative uncharacterized protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SL35_9RHIZ Length = 227 Score = 81.2 bits (199), Expect = 1e-14, Method: Composition-based stats. 
Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 8/75 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ SI+ I L + A AGS G EH AV+ SLI TC+LN VE +L Sbjct: 100 DSNIVERSIRPI-------ALNRKNALFAGSDGGAEHWAVVASLIETCKLNGVEPLGYLA 152 Query: 78 YGIEHI-QDWSANLV 91 + I + + Sbjct: 153 DVLARIVNGHPNSKL 167 >UniRef50_B4S337 Transposase n=3 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4S337_ALTMD Length = 525 Score = 81.2 bits (199), Expect = 1e-14, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 35/91 (38%), Gaps = 10/91 (10%) Query: 4 RLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIG 63 ++ S P + L+ +++ I + + S G E +L SL+ Sbjct: 420 KVFLSNPALPMD---TNHLERALRVIP-------MGRKNYLFCWSELGAEQLGILQSLMV 469 Query: 64 TCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 TCRL V +L ++ + A V DL Sbjct: 470 TCRLQGVNPYHYLVDVLQRVALHPARDVIDL 500 >UniRef50_Q5P882 IS66 Orf1 transposase n=5 Tax=cellular organisms RepID=Q5P882_AZOSE Length = 538 Score = 80.8 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 37/90 (41%), Gaps = 10/90 (11%) Query: 5 LRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGT 64 + P+ +I L+ +++ + + + + G ++ + SLI T Sbjct: 436 VYLRDPEVAID---TNHLERALRVVP-------MGRRNWLFCWTEVGAKYVGIAQSLIAT 485 Query: 65 CRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 CRL++++ +L ++ + A V L Sbjct: 486 CRLHDIDPYDYLVDVLQRVGQHPAADVAQL 515 >UniRef50_B3HDR4 IS66 family element, transposase n=7 Tax=Enterobacteriaceae RepID=B3HDR4_ECOLX Length = 461 Score = 80.8 bits (198), Expect = 1e-14, Method: Composition-based stats. 
Identities = 28/72 (38%), Positives = 38/72 (52%) Query: 23 KTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEH 82 + K + + E GSG GGE A+LYSLIGTC+LN+V+ E +L + Sbjct: 380 QRKTKPLLKSLESWLREKMKTLFFGSGHGGERGALLYSLIGTCKLNDVDPESYLRHVPGV 439 Query: 83 IQDWSANLVRDL 94 I DW N V +L Sbjct: 440 IADWPVNRVSEL 451 >UniRef50_Q13ZD6 Transposase ISPpu14 orf3 like, IS66 family n=51 Tax=Proteobacteria RepID=Q13ZD6_BURXL Length = 540 Score = 80.0 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 29/78 (37%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 ++ I+ + AGS G+ AA + SLI + +LN + +L Sbjct: 458 IDNNWVENQIRP-------WAIGRANWLFAGSLRAGQRAAAIMSLIRSAQLNGHDPHAYL 510 Query: 77 CYGIEHIQDWSANLVRDL 94 + + A+ + L Sbjct: 511 KDILTRLPIHKASDISAL 528 >UniRef50_C8WYG4 Transposase IS66 n=5 Tax=Bacteria RepID=C8WYG4_ALIAD Length = 529 Score = 80.0 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 26/74 (35%), Gaps = 7/74 (9%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + S+K + + A + G +AV YS++ T + N + +L Sbjct: 441 IDNNRCERSLKPFV-------IGRKNWLFANTPRGARASAVTYSIVETAKENGLNPTAYL 493 Query: 77 CYGIEHIQDWSANL 90 Y E + + Sbjct: 494 TYLFERMPNIDLKD 507 >UniRef50_Q1NKD7 ISPsy5, transposase n=1 Tax=delta proteobacterium MLMS-1 RepID=Q1NKD7_9DELT Length = 197 Score = 79.6 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 12/67 (17%), Positives = 27/67 (40%), Gaps = 7/67 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + +I+ + + +G+ G + +A +YS+I T + N +E +L Sbjct: 114 DNNLAENAIRPFV-------VGRKNWLFSGTAQGAKASAAIYSIIETAKANGLEPYWYLR 166 Query: 78 YGIEHIQ 84 E + Sbjct: 167 ALFERLP 173 >UniRef50_C3RGW7 Transposase n=13 Tax=Bacteroides RepID=C3RGW7_9BACE Length = 523 Score = 79.2 bits (194), Expect = 4e-14, Method: Composition-based stats. 
Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 RLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIG 63 R + I + + +I+ I L + G+ ++ A++ SL+ Sbjct: 427 RNYLKDGRLKIDNNLA---ENAIRPI-------ALSRKNFLFCGNHEAAQNTAIICSLLA 476 Query: 64 TCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +C+ +N+ +WL I + ++AN +DL Sbjct: 477 SCKASNINPREWLTEVIALLPYYAANKEKDL 507 >UniRef50_A3ZP54 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZP54_9PLAN Length = 561 Score = 78.8 bits (193), Expect = 5e-14, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 33/77 (42%), Gaps = 7/77 (9%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + +++ + + + S +GGE AAVL S++ TC+ N VE +L Sbjct: 462 IDNNAAERTMRPV-------AIGRKNWLFVASRTGGERAAVLMSVVQTCKRNQVEPWAYL 514 Query: 77 CYGIEHIQDWSANLVRD 93 E + N R+ Sbjct: 515 RDVFEQLPSLGENPTRE 531 >UniRef50_A3X278 Putative transposase n=1 Tax=Nitrobacter sp. Nb-311A RepID=A3X278_9BRAD Length = 232 Score = 78.8 bits (193), Expect = 5e-14, Method: Composition-based stats. Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 8/74 (10%) Query: 22 LKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIE 81 ++ +I+ I CL + A AG G E+ A+L S++ TC+LN+V + +E Sbjct: 153 VENAIRPI-------CLTRKNALFAGHEIGAENWALLGSIVATCKLNDVNPVAYNAETLE 205 Query: 82 H-IQDWSANLVRDL 94 I + V DL Sbjct: 206 AIIAGHPQSKVDDL 219 >UniRef50_D0TYT2 Integron integrase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYT2_9BACE Length = 443 Score = 78.5 bits (192), Expect = 6e-14, Method: Composition-based stats. 
Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + +K I C+ + GS G ++A++LYS+I TC++N + K++ Sbjct: 365 IDNNTAERMMKPI-------CMGRKNYLFCGSELGAKNASMLYSIIETCKMNGLRPVKYI 417 Query: 77 CYGIEHIQDWSANLVRDL 94 + + N + L Sbjct: 418 AEILTKLTAGETNYMSLL 435 >UniRef50_A9G0V4 Transposase n=6 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G0V4_SORC5 Length = 554 Score = 78.5 bits (192), Expect = 6e-14, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 35/92 (38%), Gaps = 11/92 (11%) Query: 4 RLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIG 63 R F+ + I + + L + AGS G + A+ Y++ G Sbjct: 452 RRCFTDGRFEIDNGEVERQLRRVA----------LGRKNYLFAGSDKGAQRLAIGYTIFG 501 Query: 64 TCRLNNVELEKWLCYGIEHIQ-DWSANLVRDL 94 +CR++ V W I +Q W + + +L Sbjct: 502 SCRMHGVNPLAWATDVIGRLQAGWQRDRLDEL 533 >UniRef50_A8S0A7 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S0A7_9CLOT Length = 389 Score = 78.5 bits (192), Expect = 6e-14, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 7/71 (9%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + + SI+ T L + L + G + +A+ YS+ T + NN++ ++ Y Sbjct: 208 NNAAEQSIRPFT-------LGRKNWYLIDTSGGAKSSAIAYSIAETAKANNLKPYEYFKY 260 Query: 79 GIEHIQDWSAN 89 +E + A Sbjct: 261 LLEELPKHGAE 271 >UniRef50_A6KWF8 Transposase n=7 Tax=Bacteroides RepID=A6KWF8_BACV8 Length = 537 Score = 78.5 bits (192), Expect = 7e-14, Method: Composition-based stats. 
Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 10/82 (12%) Query: 16 CSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKW 75 C ++ SI+ +T L + +GS AA+ +SL+G CR N V + W Sbjct: 456 CIDNNPVERSIRPLT-------LNRKNTLFSGSHEAAHAAAIFFSLMGCCRENKVNPKLW 508 Query: 76 LCYGIEHIQD---WSANLVRDL 94 + + +Q+ N DL Sbjct: 509 MQDVLIRVQEKEREEKNDYTDL 530 >UniRef50_P55630 Uncharacterized protein y4qI n=19 Tax=Alphaproteobacteria RepID=Y4QI_RHISN Length = 539 Score = 78.1 bits (191), Expect = 7e-14, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 36/81 (44%), Gaps = 8/81 (9%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 I ++ +I+ T + + + AGS GG A + +L+ TC++N V+ Sbjct: 455 IEIDSNIVERAIRPQT-------ITRKNSLFAGSEGGGRTWATVATLLQTCKMNGVDPLD 507 Query: 75 WLCYGIEHIQD-WSANLVRDL 94 WL + I W A+ + L Sbjct: 508 WLSQTLTRIAQGWPASEIEAL 528 >UniRef50_Q1VRT6 Transposase n=5 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRT6_9FLAO Length = 502 Score = 78.1 bits (191), Expect = 9e-14, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 35/81 (43%), Gaps = 7/81 (8%) Query: 14 IICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELE 73 ++ + +I+ + L + AGS + A++YS C+ + V Sbjct: 418 VLEIDNNLTENAIRKL-------ALGRKNYLFAGSHDAAQRGAIMYSFFAICKKHEVNPY 470 Query: 74 KWLCYGIEHIQDWSANLVRDL 94 +WL Y +E+I + ++DL Sbjct: 471 QWLKYTLENIMSINHKNIKDL 491 >UniRef50_C3QM77 Transposase IS66 n=1 Tax=Bacteroides sp. D1 RepID=C3QM77_9BACE Length = 257 Score = 77.7 bits (190), Expect = 1e-13, Method: Composition-based stats. 
Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + +K I C+ + GS G ++A++LYS+I TC++N + K++ Sbjct: 179 IDNNTAERMMKPI-------CMGRKNYLFCGSELGAKNASMLYSIIETCKMNGLRPVKYI 231 Query: 77 CYGIEHIQDWSANLVRDL 94 + + N + L Sbjct: 232 AEILTKLTAGETNYMSLL 249 >UniRef50_B3PIF8 IS66 family element, transposase n=8 Tax=Bacteria RepID=B3PIF8_CELJU Length = 527 Score = 77.7 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 26/69 (37%), Gaps = 7/69 (10%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + +I+ + + + S G +A LYS+I T + N +E +L Sbjct: 446 IDNNAAENAIRPFV-------IGRKNWLFSTSPKGATASANLYSVIETAKANGLEPYGYL 498 Query: 77 CYGIEHIQD 85 + + Sbjct: 499 KTIFTELPN 507 >UniRef50_A8VYR6 Transposase and inactivated derivatives-like protein n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VYR6_9BACI Length = 495 Score = 77.7 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 36/91 (39%), Gaps = 10/91 (10%) Query: 4 RLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIG 63 R + +I + SIK + + + + G + ++++YS+I Sbjct: 387 RTFLKDGRLAIDN---NRAERSIKPFV-------IGRKNWIFSNTPRGAKSSSIIYSMIE 436 Query: 64 TCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 T + N ++ + +L Y E++ + + Sbjct: 437 TAKENQLKPQAYLNYLFENLPSSKQSEMEQF 467 >UniRef50_A6TJJ0 Transposase IS66 n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJJ0_ALKMQ Length = 533 Score = 77.7 bits (190), Expect = 1e-13, Method: Composition-based stats. 
Identities = 17/80 (21%), Positives = 28/80 (35%), Gaps = 7/80 (8%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 I + IK L + A S G +A+ YS+I T + N + + Sbjct: 439 IAIDNNLAERGIKPFV-------LGRKNYLFAKSPKGATASALCYSIIETAKANKLIPFQ 491 Query: 75 WLCYGIEHIQDWSANLVRDL 94 +L Y E + + L Sbjct: 492 YLTYLFEQLPNLDIEDPEAL 511 >UniRef50_Q3IV06 Transposase IS66 n=9 Tax=Bacteria RepID=Q3IV06_RHOS4 Length = 523 Score = 77.7 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 36/78 (46%), Gaps = 8/78 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ +I+ I L + A AGS G + A + SLIGTCRLN V E ++ Sbjct: 446 DTNGVENAIRPIP-------LTRKNALFAGSTDGAKTWARIASLIGTCRLNGVNPEAYIA 498 Query: 78 YGIEHIQD-WSANLVRDL 94 + I D + + +L Sbjct: 499 ATLRKILDQHMQSDIAEL 516 >UniRef50_B6EI61 Transposase n=28 Tax=Gammaproteobacteria RepID=B6EI61_ALISL Length = 495 Score = 77.3 bits (189), Expect = 1e-13, Method: Composition-based stats. Identities = 11/70 (15%), Positives = 32/70 (45%), Gaps = 7/70 (10%) Query: 14 IICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELE 73 ++ + ++K + + +GS +G + +A+LYS++ T + N + Sbjct: 414 LLSIDNNRAERAVKPFV-------IGRKNWLFSGSTAGADSSAMLYSIVETAKANGLIPY 466 Query: 74 KWLCYGIEHI 83 ++ Y ++ + Sbjct: 467 DYIRYCLDRL 476 >UniRef50_UPI0001BC30F4 transposase n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC30F4 Length = 542 Score = 77.3 bits (189), Expect = 2e-13, Method: Composition-based stats. Identities = 11/74 (14%), Positives = 30/74 (40%), Gaps = 7/74 (9%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 ++ + +I+T C+ + + G +A++YS+ T +LNN+ + Sbjct: 448 IDNSASERAIRTF-------CIGKKNWMFHNTAKGAGASALVYSISETAKLNNLRPYYYF 500 Query: 77 CYGIEHIQDWSANL 90 + + + + Sbjct: 501 RHILTELPKYCDEK 514 >UniRef50_A0LAY3 Transposase IS66 n=7 Tax=Magnetococcus sp. MC-1 RepID=A0LAY3_MAGSM Length = 526 Score = 76.9 bits (188), Expect = 2e-13, Method: Composition-based stats. 
Identities = 15/78 (19%), Positives = 26/78 (33%), Gaps = 7/78 (8%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + +I+ + + + S G +A LYSLI T + N E ++ Sbjct: 442 IDNNRAENAIRPFV-------IGRKNWLFSNSVRGARASANLYSLIETAKANGWEPFEYF 494 Query: 77 CYGIEHIQDWSANLVRDL 94 E + DL Sbjct: 495 TKVFEGLATAQTVDEFDL 512 >UniRef50_D2QZY9 Transposase IS66 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QZY9_9PLAN Length = 536 Score = 76.5 bits (187), Expect = 3e-13, Method: Composition-based stats. Identities = 13/78 (16%), Positives = 33/78 (42%), Gaps = 7/78 (8%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 + + + ++K + + + AG+ G AA+LYSLI + + ++ ++ Sbjct: 449 LNIDNNAAERALKRV-------AIGRKNWLFAGNDRAGGTAALLYSLIASAERHQLDPQR 501 Query: 75 WLCYGIEHIQDWSANLVR 92 +L + + + V Sbjct: 502 YLTSVLARLPALPPSDVN 519 >UniRef50_A8S332 Putative uncharacterized protein (Fragment) n=3 Tax=Clostridiales RepID=A8S332_9CLOT Length = 408 Score = 76.2 bits (186), Expect = 3e-13, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 32/80 (40%), Gaps = 11/80 (13%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + + SI+ T + + SG + +A+ YS+ T + N ++ ++ Y Sbjct: 317 NNAAERSIRPFT-------VGRNNWFQIDTVSGAKASAIAYSIAETAKANQLKPYEYFRY 369 Query: 79 GIEHIQDWSA----NLVRDL 94 +E + + V +L Sbjct: 370 LLEELPKHGELEELSYVEEL 389 >UniRef50_C6N6I2 Truncated transposase IS66 n=2 Tax=Legionella RepID=C6N6I2_9GAMM Length = 213 Score = 76.2 bits (186), Expect = 3e-13, Method: Composition-based stats. 
Identities = 16/67 (23%), Positives = 24/67 (35%), Gaps = 7/67 (10%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + I+ L + +GS G +A+ YSLI T N +L Sbjct: 132 IDNNGAENQIRPF-------ALGRKNWLFSGSPRGAHASALFYSLIATAIANGWNPFNYL 184 Query: 77 CYGIEHI 83 Y E+I Sbjct: 185 RYLFENI 191 >UniRef50_C6LKX4 Transposase IS66 n=13 Tax=Clostridiales RepID=C6LKX4_9FIRM Length = 542 Score = 75.8 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 13/81 (16%), Positives = 34/81 (41%), Gaps = 11/81 (13%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + + +I+ T + + + + +G +AV+YS+ T + NN++ ++ Sbjct: 454 DNNASERAIRGFT-------VGRKNWQMIDTINGANASAVIYSIAETAKANNLKPYEYFE 506 Query: 78 YGIEHIQDWSANL----VRDL 94 + + I + + DL Sbjct: 507 HLLSEIPKHMDDHGLAFLEDL 527 >UniRef50_B8I6W2 Transposase IS66 n=3 Tax=Clostridiales RepID=B8I6W2_CLOCE Length = 529 Score = 75.8 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 11/67 (16%), Positives = 27/67 (40%), Gaps = 7/67 (10%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + +I+ + A + G + +A++YS+I + + N + +L Y Sbjct: 437 NNRAENAIRPYVT-------GRKNWLFADTTRGAKASAIVYSMIESAKANQLNPYMYLVY 489 Query: 79 GIEHIQD 85 + + D Sbjct: 490 LLSKLPD 496 >UniRef50_Q0AUV2 Transposase and inactivated derivatives-like protein n=15 Tax=Firmicutes RepID=Q0AUV2_SYNWW Length = 530 Score = 75.4 bits (184), Expect = 5e-13, Method: Composition-based stats. Identities = 13/69 (18%), Positives = 28/69 (40%), Gaps = 7/69 (10%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + SIK + + + G +A++YS+I T + NN++ ++ Sbjct: 434 IDNNRAERSIKPFV-------IGRKNWLFTNTPRGARGSAIIYSVIETAKENNLKPYNYM 486 Query: 77 CYGIEHIQD 85 Y E + + Sbjct: 487 FYLFEQLPN 495 >UniRef50_C6MXH4 Transposase IS66 n=3 Tax=Legionella drancourtii LLAP12 RepID=C6MXH4_9GAMM Length = 491 Score = 75.4 bits (184), Expect = 6e-13, Method: Composition-based stats. 
Identities = 20/75 (26%), Positives = 31/75 (41%), Gaps = 10/75 (13%) Query: 11 KTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNV 70 + I + + SIK + + G+ G A LYSLI TC+ + V Sbjct: 394 RLEIDNNLS---ERSIKPFV-------IGRKNWLFHGNDIGARAGATLYSLIETCKYHKV 443 Query: 71 ELEKWLCYGIEHIQD 85 ++ W Y + HIQ Sbjct: 444 DVFSWFKYALTHIQQ 458 >UniRef50_D1W152 IS66 family element, transposase n=3 Tax=Prevotella RepID=D1W152_9BACT Length = 552 Score = 75.4 bits (184), Expect = 6e-13, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 10/72 (13%) Query: 11 KTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNV 70 + +I + M + +I+ IT L + G+ G E+ A+ Y+ + CR ++ Sbjct: 472 RFNIDNNLM---EQAIRPIT-------LGRKNYLFCGNNEGAENNAIFYTFVACCREADI 521 Query: 71 ELEKWLCYGIEH 82 + KW+ + Sbjct: 522 DPYKWMKKILSK 533 >UniRef50_A6L6H8 Transposase n=8 Tax=Bacteroidales RepID=A6L6H8_BACV8 Length = 537 Score = 75.0 bits (183), Expect = 6e-13, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 34/93 (36%), Gaps = 14/93 (15%) Query: 6 RFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTC 65 I + + +I+ +T L + G+ E+ A++ SL+ TC Sbjct: 438 YLKDGNLKIDNNLA---ENAIRPLT-------LSRKNFLFCGNHEAAENTAIICSLLATC 487 Query: 66 RLNNVELEKWLCYGIEHIQDWSANL----VRDL 94 + + +WL I + + VR+L Sbjct: 488 KAQEINPREWLNDVIAKLPYYLEKDSGKNVREL 520 >UniRef50_Q24RY6 Putative uncharacterized protein n=1 Tax=Desulfitobacterium hafniense Y51 RepID=Q24RY6_DESHY Length = 533 Score = 74.6 bits (182), Expect = 8e-13, Method: Composition-based stats. 
Identities = 13/67 (19%), Positives = 27/67 (40%), Gaps = 7/67 (10%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + SIK + + + + +G +A+ YSLI T R N + ++L + Sbjct: 432 NNRAERSIKPFV-------IGRKNWLFSNTPNGARASAIYYSLIVTARENGLNPFEYLAW 484 Query: 79 GIEHIQD 85 + + Sbjct: 485 IFANSPN 491 >UniRef50_B7CDZ5 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CDZ5_9FIRM Length = 523 Score = 74.6 bits (182), Expect = 9e-13, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 30/73 (41%), Gaps = 9/73 (12%) Query: 17 SPMTSL--KTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 PMT+ + I+ T + + S SG E +A YS+I T + N ++ K Sbjct: 435 IPMTNSLDERVIRPFTT-------GRKNWLFSASVSGAESSANAYSIIETAKANGLDPYK 487 Query: 75 WLCYGIEHIQDWS 87 +L ++ Sbjct: 488 YLTTIFTYLPSQD 500 >UniRef50_A9HSI6 Probable insertion sequence transposase protein n=6 Tax=Rhodobacteraceae RepID=A9HSI6_9RHOB Length = 87 Score = 74.6 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 1/59 (1%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 L+ + A AG +G ++ AV+ SLI TC+LN +E +L + I + L Sbjct: 22 ALQRKNALFAGHDAGAQNWAVIASLIETCKLNKIEPHSYLTGLLTAIVNGHKKKDIDQL 80 >UniRef50_B8KMJ5 ISPsy5, transposase n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KMJ5_9GAMM Length = 176 Score = 74.6 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 7/69 (10%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 +C + +I+ + + A + G +A +YSLI T + N++E Sbjct: 79 LCISNALAENAIRPF-------AVGRKAWLFADTTRGAHASATMYSLIETAKANHLEPRS 131 Query: 75 WLCYGIEHI 83 +L + +E I Sbjct: 132 YLLHVLERI 140 >UniRef50_A9AMV4 ISBmu30 transposase n=26 Tax=Proteobacteria RepID=A9AMV4_BURM1 Length = 546 Score = 74.2 bits (181), Expect = 1e-12, Method: Composition-based stats. 
Identities = 16/82 (19%), Positives = 32/82 (39%), Gaps = 10/82 (12%) Query: 16 CSP--MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELE 73 C+P ++ ++ + + + G + +A +YSL+ TCR VE Sbjct: 456 CAPIDNNVIERDVRPFAT-------SRKSWLFSDTVDGAKASATVYSLVLTCRACGVEPY 508 Query: 74 KWLCYGIEHIQDW-SANLVRDL 94 +L + + + V DL Sbjct: 509 DYLLHVLTELPQRAPDADVTDL 530 >UniRef50_B5JDW6 Transposase IS66 family n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JDW6_9BACT Length = 496 Score = 73.8 bits (180), Expect = 1e-12, Method: Composition-based stats. Identities = 9/81 (11%), Positives = 34/81 (41%), Gaps = 7/81 (8%) Query: 14 IICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELE 73 ++ ++ +I+ L ++ GS G +A+L++L+ + + + ++ Sbjct: 409 VVEIDNNLVENAIRPTK-------LGLKNWMFIGSEGSGRTSAILFTLVESAKRHGLDPY 461 Query: 74 KWLCYGIEHIQDWSANLVRDL 94 ++ + + + + + L Sbjct: 462 GYIKELLRRLPESTNWQIPQL 482 >UniRef50_C9LAQ6 ISPpu13, transposase Orf2 n=15 Tax=Clostridiales RepID=C9LAQ6_RUMHA Length = 532 Score = 73.5 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 35/82 (42%), Gaps = 10/82 (12%) Query: 16 CSPMTSL-KTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 CS +L + +I+ T + + + S G +A++Y+++ + N++ K Sbjct: 445 CSLSNNLSENAIRPFT-------VGRKNWLFSASPKGAASSAIVYTMVEMAKANDLNTYK 497 Query: 75 WLCYGIEHIQD--WSANLVRDL 94 +L Y + D S + L Sbjct: 498 YLTYLLSQRPDAKMSDEQLEQL 519 >UniRef50_A9GG21 Transposase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GG21_SORC5 Length = 200 Score = 73.5 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 13/55 (23%), Positives = 24/55 (43%) Query: 40 IQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 GS AA L+SL +C++++++ E +L I + W +L Sbjct: 107 RNAWLFFGSDDHASAAANLFSLAASCKVHHLDPEAYLADVIRVMPYWPRERYPEL 161 >UniRef50_C7XFR9 IS66 family transposase n=7 Tax=Bacteroidales RepID=C7XFR9_9PORP Length = 557 Score = 73.5 bits (179), Expect = 2e-12, Method: Composition-based stats. 
Identities = 19/77 (24%), Positives = 31/77 (40%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ ++ + + G+ E AAVLYS G C+ + WL Sbjct: 461 DNNEIENKVRPVAC-------GRRNYLFCGNNDAAEDAAVLYSFFGCCKAAGADFRTWLI 513 Query: 78 YGIEHIQDWSANLVRDL 94 Y +EHI D+ + DL Sbjct: 514 YFLEHIHDYDDDYSMDL 530 >UniRef50_Q1D8I4 Transposase, IS66 family, truncated n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D8I4_MYXXD Length = 184 Score = 73.5 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 15/59 (25%), Positives = 25/59 (42%) Query: 36 GCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 L + G + GE+ LY+L+ TC N V E +L + Q + + +L Sbjct: 104 AALSRKNFLFVGHEAAGENLEGLYALVATCAANGVNPETYLTDVLLRAQTHPNSRIGEL 162 >UniRef50_Q2YKH5 Transposase IS66 family n=37 Tax=Brucella RepID=Q2YKH5_BRUA2 Length = 523 Score = 73.5 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 7/66 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 +++ +++++ L + AGS G E A+L SL+ T +LN ++ WL Sbjct: 434 DSNTVERNMRSV-------ALGRVNSLFAGSDGGAETWAILGSLLTTAKLNGLDPYTWLN 486 Query: 78 YGIEHI 83 +E I Sbjct: 487 DVLERI 492 >UniRef50_A5WEB8 Transposase IS66 n=20 Tax=Proteobacteria RepID=A5WEB8_PSYWF Length = 579 Score = 73.1 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 30/78 (38%), Gaps = 9/78 (11%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + ++ L + AGS G+ AA + S+I + RLN +++ +L Sbjct: 502 IDNNWAENQMRP-------WALGRKNWLFAGSLRSGQRAANIMSIIQSARLNGLDVSAYL 554 Query: 77 CYGIEHIQDWSANLVRDL 94 + + + +L Sbjct: 555 TDVLRRLPT--QEDLDEL 570 >UniRef50_UPI000197B598 hypothetical protein BACCOPRO_01649 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B598 Length = 518 Score = 73.1 bits (178), Expect = 3e-12, Method: Composition-based stats. 
Identities = 9/69 (13%), Positives = 27/69 (39%), Gaps = 7/69 (10%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + + +++ L + G+ A++Y++I +C+LN + + ++L Sbjct: 442 ADNNAAERAMRPCK-------LGMNNYLFFGNHESARRGAIIYTIIESCKLNGINVFEYL 494 Query: 77 CYGIEHIQD 85 Sbjct: 495 TDVFSREPQ 503 >UniRef50_Q08VL0 Transposase and inactivated derivative n=13 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08VL0_STIAU Length = 526 Score = 72.7 bits (177), Expect = 3e-12, Method: Composition-based stats. Identities = 15/79 (18%), Positives = 33/79 (41%), Gaps = 9/79 (11%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ I+ I + AGS + G+ AA+ Y+L+ +C ++ +L Sbjct: 442 DNGEVERLIRLI-------AIGRNNYLFAGSDAAGQRAALAYTLVLSCYRLGMDPWAYLR 494 Query: 78 YGIEHIQD--WSANLVRDL 94 + + D + A + +L Sbjct: 495 DVLPKLGDTRFPAARLAEL 513 >UniRef50_D0DW05 Transposase IS66 n=1 Tax=Lactobacillus fermentum 28-3-CHN RepID=D0DW05_LACFE Length = 507 Score = 72.7 bits (177), Expect = 4e-12, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 10/88 (11%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 F + + +P ++ SI+ T L + + A S +G + A++Y++I T + Sbjct: 417 FDNAQLPLDNNP---VEQSIRPTT-------LVRKNSLFATSKNGAKTNAMIYTIIQTAK 466 Query: 67 LNNVELEKWLCYGIEHIQDWSANLVRDL 94 LNN+ + +L Y + A V DL Sbjct: 467 LNNLRIFDYLKYVFDQYTKRVAVKVEDL 494 >UniRef50_Q08RD8 Transposase IS66 family n=6 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08RD8_STIAU Length = 529 Score = 72.3 bits (176), Expect = 4e-12, Method: Composition-based stats. 
Identities = 13/58 (22%), Positives = 25/58 (43%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + GS + A L++LI T RL+ ++ E +L + + W +L Sbjct: 431 AVGRKAWLFVGSDDHAQSAGHLFTLIATARLHRLDPEAYLRDLLRVLAHWPRERYLEL 488 >UniRef50_Q3IV15 Putative transposase n=1 Tax=Rhodobacter sphaeroides 2.4.1 RepID=Q3IV15_RHOS4 Length = 176 Score = 71.9 bits (175), Expect = 5e-12, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 33/75 (44%), Gaps = 1/75 (1%) Query: 21 SLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGI 80 L+ + L + A AG G E+ A+L SL+ TC++++V ++ + Sbjct: 89 RLEMDTNPVENQIRRVALTRKNALFAGHEVGAENWAMLASLVATCKMSDVNPVSYIAETL 148 Query: 81 EHI-QDWSANLVRDL 94 I A+ + DL Sbjct: 149 RAILNGHPASRIEDL 163 >UniRef50_A6KXM9 Transposase n=23 Tax=Bacteroides RepID=A6KXM9_BACV8 Length = 521 Score = 71.9 bits (175), Expect = 6e-12, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 10/83 (12%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 F+ T + + + + I L + + GS G E A+LY++ TCR Sbjct: 430 FNYGDTYLDNNIVERMNRYIS----------LSRKNSLFFGSHKGAERGAILYTIALTCR 479 Query: 67 LNNVELEKWLCYGIEHIQDWSAN 89 +N V L ++L I +W N Sbjct: 480 MNKVNLFEYLTDIINRTAEWQPN 502 >UniRef50_UPI00003825AD COG3436: Transposase and inactivated derivatives n=1 Tax=Magnetospirillum magnetotacticum MS-1 RepID=UPI00003825AD Length = 229 Score = 71.9 bits (175), Expect = 6e-12, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 30/87 (34%), Gaps = 11/87 (12%) Query: 7 FSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66 F + ++ + + + + AGS G E AAV Y+LI T + Sbjct: 123 FDDGRIALDNNAAERALRGVA----------VGRKNYLFAGSDLGAERAAVFYTLIETAK 172 Query: 67 LNNVELEKWLCYGIEHIQDWSANLVRD 93 LN + I D A + D Sbjct: 173 LNRLIPRPPARRV-TRIADHPAKRLAD 198 >UniRef50_A8YU80 Transposase ORF_C n=22 Tax=Lactobacillales RepID=A8YU80_LACH4 Length = 445 Score = 71.9 bits (175), Expect = 7e-12, Method: Composition-based stats. 
Identities = 14/67 (20%), Positives = 34/67 (50%), Gaps = 7/67 (10%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + +IK++ + + + S +G + +A++ SLI T + N ++ EK+L Y Sbjct: 361 NNLAERAIKSLV-------IGRKNWLFSQSFNGAQSSAIILSLIETAKRNGLDPEKYLVY 413 Query: 79 GIEHIQD 85 + ++ + Sbjct: 414 LLSNLPN 420 >UniRef50_A9DHW8 Putative uncharacterized protein n=10 Tax=Shewanella benthica KT99 RepID=A9DHW8_9GAMM Length = 465 Score = 71.5 bits (174), Expect = 7e-12, Method: Composition-based stats. Identities = 12/76 (15%), Positives = 28/76 (36%), Gaps = 10/76 (13%) Query: 6 RFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTC 65 + +I + ++K + + + G E +A+LYS+I T Sbjct: 400 YLEDGRLNIDN---NRAERAVKPFV-------IGRKNWLFNHNHRGAEASAILYSIIETA 449 Query: 66 RLNNVELEKWLCYGIE 81 + N + ++ +E Sbjct: 450 KANGLTPFDYIERCLE 465 >UniRef50_C7XG25 Transposase n=7 Tax=Bacteroidales RepID=C7XG25_9PORP Length = 561 Score = 71.5 bits (174), Expect = 7e-12, Method: Composition-based stats. Identities = 16/84 (19%), Positives = 33/84 (39%), Gaps = 11/84 (13%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 I + +++ + L + G+ E+ AV+ SL+G+C+ V + Sbjct: 448 IFIDNNRAENALRPMV-------LTRKNMLFCGNQQAAENTAVICSLLGSCKECGVNPRE 500 Query: 75 WLCYGIEHIQDW----SANLVRDL 94 WL I + + S + +L Sbjct: 501 WLNDVISKLPYYLTPKSEKKLTEL 524 >UniRef50_B3WYX6 Lysyl-tRNA synthetase, heat inducible n=4 Tax=Gammaproteobacteria RepID=B3WYX6_SHIDY Length = 258 Score = 71.1 bits (173), Expect = 9e-12, Method: Composition-based stats. Identities = 43/45 (95%), Positives = 44/45 (97%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 MVRRLRFSGPKTSIIC+PMTSLKTSIKTITYLSD GCLEIQGASL Sbjct: 1 MVRRLRFSGPKTSIICTPMTSLKTSIKTITYLSDTGCLEIQGASL 45 >UniRef50_A5ZJI7 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZJI7_9BACE Length = 71 Score = 71.1 bits (173), Expect = 1e-11, Method: Composition-based stats. 
Identities = 12/57 (21%), Positives = 24/57 (42%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + Q + +AAV+Y+ C++ + EKWL ++ I + +L Sbjct: 1 MGKQNHLSCQNDEPCHYAAVMYTFFAACKVLGINPEKWLSDVLDKISLTPKEKLSEL 57 >UniRef50_A6NXJ3 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NXJ3_9BACE Length = 200 Score = 71.1 bits (173), Expect = 1e-11, Method: Composition-based stats. Identities = 13/66 (19%), Positives = 29/66 (43%), Gaps = 7/66 (10%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + SIK + + + + +G + +AV+YSLI T + N++ ++L + Sbjct: 115 NNRAERSIKPFV-------MGRKNWLFSNTPAGAQSSAVVYSLIETAKENDLAPYRYLVW 167 Query: 79 GIEHIQ 84 + Sbjct: 168 LLNTAP 173 >UniRef50_Q31T57 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=Q31T57_SHIBS Length = 197 Score = 71.1 bits (173), Expect = 1e-11, Method: Composition-based stats. Identities = 43/45 (95%), Positives = 44/45 (97%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 MVRRLRFSGPKTSIIC+PMTSLKTSIKTITYLSD GCLEIQGASL Sbjct: 1 MVRRLRFSGPKTSIICTPMTSLKTSIKTITYLSDTGCLEIQGASL 45 >UniRef50_A0LPQ3 ISPpu15, transposase Orf2 n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LPQ3_SYNFM Length = 177 Score = 70.8 bits (172), Expect = 1e-11, Method: Composition-based stats. Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 7/70 (10%) Query: 15 ICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 I + + +I+ + + +G + AA LYSLI T ++ +E + Sbjct: 87 ISPDNNAAENAIRRFV-------IGRKNWLFSGHPNSANAAATLYSLIETAKVCRLEAYQ 139 Query: 75 WLCYGIEHIQ 84 +L Y E + Sbjct: 140 YLRYLFERLP 149 >UniRef50_B7LIJ6 Putative uncharacterized protein n=1 Tax=Escherichia coli ED1a RepID=B7LIJ6_ECO81 Length = 85 Score = 70.4 bits (171), Expect = 2e-11, Method: Composition-based stats. 
Identities = 25/52 (48%), Positives = 32/52 (61%) Query: 43 ASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 LA E AAV+YSLIG+C+LN +E E WL + I I W AN V++L Sbjct: 15 TGLAVRHPTTESAAVMYSLIGSCKLNGIEPETWLRHVISVINTWPANCVKEL 66 >UniRef50_B9YEG7 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YEG7_9FIRM Length = 535 Score = 70.0 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 12/71 (16%), Positives = 32/71 (45%), Gaps = 7/71 (9%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + +K + + + + SG +++ YSL+ + +LN++++ +L Y Sbjct: 452 NNRAERMVKPFV-------MGRKAWLFSKTKSGARMSSIYYSLVESAKLNHLDIHLYLEY 504 Query: 79 GIEHIQDWSAN 89 + IQ+ + Sbjct: 505 VLTQIQEHPDS 515 >UniRef50_B2AIZ9 Fused transposase IS66/IS21 n=38 Tax=Proteobacteria RepID=B2AIZ9_CUPTR Length = 936 Score = 69.6 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 32/77 (41%), Gaps = 8/77 (10%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + +I+ + + + +G +A LYSL+ TC+ N+V+ ++L Sbjct: 449 NNPCENAIRPFV-------IGRRNFLFCDTVAGANASASLYSLVETCKANDVDSYQYLVA 501 Query: 79 GIEHIQ-DWSANLVRDL 94 + + +A+ L Sbjct: 502 LFKALPHAQTADDYEAL 518 >UniRef50_A1RLT8 Transposase IS66 n=94 Tax=Gammaproteobacteria RepID=A1RLT8_SHESW Length = 500 Score = 69.6 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 12/84 (14%), Positives = 32/84 (38%), Gaps = 11/84 (13%) Query: 11 KTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNV 70 + SI + ++K + + + + +G +A LYS++ T ++N + Sbjct: 421 RLSIDN---NRAERAVKPFV-------IGRKNWLFSQTANGAHASATLYSIVETAKVNGL 470 Query: 71 ELEKWLCYGIEHIQDWSANLVRDL 94 ++ + + A + L Sbjct: 471 VPFDYIMACLNELCQ-PAPDIDSL 493 >UniRef50_C6LKU9 ISPsy5, transposase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LKU9_9FIRM Length = 549 Score = 69.2 bits (168), Expect = 4e-11, Method: Composition-based stats. 
Identities = 16/80 (20%), Positives = 33/80 (41%), Gaps = 10/80 (12%) Query: 6 RFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTC 65 + SI S + ++K + + + S G + +A +YS+ T Sbjct: 450 YLEDGQLSIDNSVA---ERALK-------NFAIGRRNWLFSKSIRGAQTSATVYSITETA 499 Query: 66 RLNNVELEKWLCYGIEHIQD 85 LN ++ +L Y +E ++D Sbjct: 500 LLNGLKPYNYLTYVLEKMKD 519 >UniRef50_C0ETK9 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0ETK9_9FIRM Length = 129 Score = 68.8 bits (167), Expect = 4e-11, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 33/80 (41%), Gaps = 10/80 (12%) Query: 4 RLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIG 63 R SI S + +IK + + A S G + +A++YS++ Sbjct: 26 RRYLEDGHLSIDN---NSAERAIK-------NFAVGRRNWLFAKSIRGADASAIVYSIVE 75 Query: 64 TCRLNNVELEKWLCYGIEHI 83 T L+ ++ +L Y +E + Sbjct: 76 TALLSGLKPYLYLTYVLEKL 95 >UniRef50_UPI0001C376A4 transposase IS66 n=2 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C376A4 Length = 532 Score = 68.8 bits (167), Expect = 5e-11, Method: Composition-based stats. Identities = 10/56 (17%), Positives = 28/56 (50%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRD 93 + + + + G + +A + S+I T + NN+++ +L + + + +W N + Sbjct: 452 INRKNFLFSDTEKGADASAAVMSIIETAKRNNLDVYGYLTHLLTVLPEWGKNPTDE 507 >UniRef50_Q07GD7 Putative uncharacterized protein n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q07GD7_ROSDO Length = 111 Score = 68.4 bits (166), Expect = 6e-11, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 32/73 (43%), Gaps = 8/73 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 +++ +I+ I L+ + A + +G ++ A+L SLI +LN+VE +L Sbjct: 9 DSNAVEHTIRPIV-------LQRKNALFSDHDAGAQNWAMLASLIEIGKLNDVEPHSYLT 61 Query: 78 YGIEHI-QDWSAN 89 + I Sbjct: 62 SVLSAIVNGHKQK 74 >UniRef50_C4I9X2 Fused transposase IS66/IS21 n=8 Tax=Burkholderia RepID=C4I9X2_BURPS Length = 172 Score = 68.4 bits (166), Expect = 6e-11, Method: Composition-based stats. 
Identities = 12/66 (18%), Positives = 29/66 (43%), Gaps = 7/66 (10%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + +I+ + C+ +G + + G A LY+L+ T + N ++L ++L + Sbjct: 68 NNPCENAIR-------LFCVGRRGWLFSDTVDGANACANLYTLVETSKTNGIDLYRYLAW 120 Query: 79 GIEHIQ 84 + Sbjct: 121 LFRRLP 126 >UniRef50_B0NH70 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0NH70_EUBSP Length = 229 Score = 68.4 bits (166), Expect = 7e-11, Method: Composition-based stats. Identities = 12/83 (14%), Positives = 32/83 (38%), Gaps = 10/83 (12%) Query: 4 RLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIG 63 ++ S + SI S + + + + + G + +A++YS+ Sbjct: 125 KVFLSDGEVSIDDSASERVLRN----------FTIGRKNWVTINTVRGAQASAIIYSITE 174 Query: 64 TCRLNNVELEKWLCYGIEHIQDW 86 T R NN+ + ++ + + + Sbjct: 175 TARANNLNVYYYIKHLLTQLPQH 197 >UniRef50_UPI0001C34DE2 transposase IS66 n=4 Tax=Clostridium sp. M62/1 RepID=UPI0001C34DE2 Length = 394 Score = 67.3 bits (163), Expect = 2e-10, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 32/73 (43%), Gaps = 6/73 (8%) Query: 16 CSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKW 75 CS ++ +I + + + G ++++YSL+ T +LN++ + + Sbjct: 328 CSISNNIAENIA------RPYAVGRKNFLFHDTVKGARASSIIYSLVETAKLNDLNIYAY 381 Query: 76 LCYGIEHIQDWSA 88 L + ++ D+ Sbjct: 382 LETVLLYMPDYKN 394 >UniRef50_D1PX91 Cytochrome o ubiquinol oxidase n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PX91_9BACT Length = 105 Score = 66.9 bits (162), Expect = 2e-10, Method: Composition-based stats. 
Identities = 15/70 (21%), Positives = 30/70 (42%), Gaps = 8/70 (11%) Query: 21 SLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGI 80 S + +I+ IT L + G+ G E+ A+ Y+ + CR ++ KW+ + Sbjct: 32 SQEQAIRPIT-------LGGKNYLFCGNNEGAENNAIFYTFMACCRQAGLQPSKWMREFL 84 Query: 81 EH-IQDWSAN 89 + D + Sbjct: 85 SKPLPDMTEE 94 >UniRef50_D1PFL7 Putative cytOchrome o ubiquinol oxidase, subunit I n=1 Tax=Prevotella copri DSM 18205 RepID=D1PFL7_9BACT Length = 69 Score = 66.5 bits (161), Expect = 3e-10, Method: Composition-based stats. Identities = 15/57 (26%), Positives = 27/57 (47%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + + + AA++YSL G C++ + E+WLCY ++HI + L Sbjct: 3 MGKKAYLFCRDLDACKRAAMMYSLFGACKVLDKNPERWLCYVLKHIDSMPEDKYYTL 59 >UniRef50_P55504 Uncharacterized protein y4jD n=13 Tax=cellular organisms RepID=Y4JD_RHISN Length = 511 Score = 66.5 bits (161), Expect = 3e-10, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 9/79 (11%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ S+K++ L + + G+ GGE AVL SLI + +L+ ++ WL Sbjct: 426 DSNIVERSMKSV-------ALTRKNSMFVGNVQGGETFAVLASLINSAKLSGLDPYAWLA 478 Query: 78 YGIEHI--QDWSANLVRDL 94 +E I + N + L Sbjct: 479 DVLERIVSGSTTINQLETL 497 >UniRef50_UPI00016C41EC ISPpu13, transposase Orf2 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C41EC Length = 323 Score = 66.1 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 10/69 (14%) Query: 12 TSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVE 71 ++ + +I L + GS GG AAVL+S+IGTC+ ++ Sbjct: 226 LALDNNLSERTLRAIA----------LGRNNWGVFGSAGGGRTAAVLFSVIGTCKHLGLD 275 Query: 72 LEKWLCYGI 80 +L + Sbjct: 276 PFAYLREAL 284 >UniRef50_C7H2D2 ISPsy5, transposase n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H2D2_9FIRM Length = 84 Score = 66.1 bits (160), Expect = 3e-10, Method: Composition-based stats. 
Identities = 11/65 (16%), Positives = 29/65 (44%), Gaps = 7/65 (10%) Query: 13 SIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVEL 72 S+ P + I+ T + + + + +G + +AV+YS++ T + N++ Sbjct: 9 SMEKQPSNRAENQIRPFT-------VGRKNWLFSDTPAGAKASAVIYSIVETAKANSLVP 61 Query: 73 EKWLC 77 ++ Sbjct: 62 RDYIQ 66 >UniRef50_B0ACF8 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0ACF8_9CLOT Length = 101 Score = 65.7 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 13/67 (19%), Positives = 27/67 (40%), Gaps = 7/67 (10%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + +IK + + + G +A +YS+I T + NN E++L Sbjct: 12 IDNNAAERAIKPFV-------IGRKNWLFCKNQKGAHVSATIYSIIETAKANNSITERYL 64 Query: 77 CYGIEHI 83 Y + + Sbjct: 65 AYLFDKM 71 >UniRef50_A3JTL7 Probable insertion sequence transposase protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JTL7_9RHOB Length = 74 Score = 65.7 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 28/65 (43%), Gaps = 2/65 (3%) Query: 31 YLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQ-DWSAN 89 YL L + AS AG + E+ +L S+I C LN +E + +L I Sbjct: 4 YLPST-TLNRKNASFAGHDANAENWVILVSIIEICILNKIEPQVYLTGVFTAIAHGHRQK 62 Query: 90 LVRDL 94 + DL Sbjct: 63 DIEDL 67 >UniRef50_A6LHQ0 Transposase n=2 Tax=Bacteroidales RepID=A6LHQ0_PARD8 Length = 528 Score = 65.7 bits (159), Expect = 4e-10, Method: Composition-based stats. 
Identities = 18/89 (20%), Positives = 38/89 (42%), Gaps = 14/89 (15%) Query: 10 PKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNN 69 P+ + + + + I L + + GS G + AA++YSL +CR+NN Sbjct: 442 PEYELDNNAIERINRYIS----------LSRRNSLFCGSHQGVKRAALIYSLACSCRMNN 491 Query: 70 VELEKWLCYGIEHI----QDWSANLVRDL 94 + ++ + + N++R+L Sbjct: 492 INTFEYFKELLNKAVSLNPNTDKNVLREL 520 >UniRef50_Q6LRS5 Hypothetical transposase n=10 Tax=Gammaproteobacteria RepID=Q6LRS5_PHOPR Length = 514 Score = 65.7 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 36/89 (40%), Gaps = 11/89 (12%) Query: 6 RFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTC 65 + SI + ++K + + + + +G +A+LYSL+ T Sbjct: 429 YLEDGRLSIDN---NRAERAVKPFV-------IGRKAWLFSYTNTGANASAILYSLVETA 478 Query: 66 RLNNVELEKWLCYGIEHIQDWSANLVRDL 94 + NN+ + ++ ++ I + N + L Sbjct: 479 KANNLLVHDYIATCLQQIAEKPNN-IDAL 506 >UniRef50_B3DSN6 Transposase n=4 Tax=Bifidobacterium longum RepID=B3DSN6_BIFLD Length = 133 Score = 65.0 bits (157), Expect = 6e-10, Method: Composition-based stats. Identities = 7/51 (13%), Positives = 22/51 (43%) Query: 35 IGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 I + + + + G +A +YS+ T + N + ++ + + + + Sbjct: 22 IFVVGRKNWLFSDAPRGARASAAIYSVTTTAKANGLNPRLYVEWLLTEMPN 72 >UniRef50_UPI00017448EA transposase IS66 n=4 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448EA Length = 503 Score = 65.0 bits (157), Expect = 7e-10, Method: Composition-based stats. 
Identities = 8/77 (10%), Positives = 26/77 (33%), Gaps = 7/77 (9%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ +I+ + + G G AA Y+L+ + + ++L Sbjct: 422 DNNLVENTIRP-------SAIGKKNWLFVGDAQAGVRAATFYTLLDNAKRAGADAYEYLK 474 Query: 78 YGIEHIQDWSANLVRDL 94 + + ++++ Sbjct: 475 DLFTKLPAMTNQQMKEI 491 >UniRef50_A2UYM3 Conserved hypothetical ISPpu15, transposase n=1 Tax=Shewanella putrefaciens 200 RepID=A2UYM3_SHEPU Length = 211 Score = 64.6 bits (156), Expect = 9e-10, Method: Composition-based stats. Identities = 12/86 (13%), Positives = 33/86 (38%), Gaps = 11/86 (12%) Query: 9 GPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLN 68 + +I + ++K + + + G E +A+ YS+I + N Sbjct: 127 NSQLNIYN---NRAEQAVKPFV-------IGHKNWLFNHNHRGAETSAIFYSIIKMAKAN 176 Query: 69 NVELEKWLCYGIEHIQDWSANLVRDL 94 + ++ + +E + + S + + L Sbjct: 177 ELTPFDYIEHCLEQLSN-SNSDLNSL 201 >UniRef50_A9HWT3 Probable insertion sequence transposase protein n=2 Tax=Alphaproteobacteria RepID=A9HWT3_9RHOB Length = 62 Score = 64.2 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 1/59 (1%) Query: 37 CLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD-WSANLVRDL 94 L+ + A AG + ++ A+L SLI TC+ N++E +L + I + L Sbjct: 2 ALQRKNALFAGHDARAQNWAMLASLIETCKFNSIEPHGYLLGVLITITGKHKQTDINRL 60 >UniRef50_C2DPG4 Nitrite extrusion protein 2 n=1 Tax=Escherichia coli 83972 RepID=C2DPG4_ECOLX Length = 77 Score = 63.8 bits (154), Expect = 2e-09, Method: Composition-based stats. Identities = 34/45 (75%), Positives = 39/45 (86%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASL 45 +VRRL F+G K +IICS MT+LKT I+TITYLSDIGCLEIQGA L Sbjct: 20 VVRRLHFTGSKLTIICSLMTTLKTCIRTITYLSDIGCLEIQGACL 64 >UniRef50_C6ZBZ0 Transposase n=26 Tax=Bacteroides RepID=C6ZBZ0_9BACE Length = 585 Score = 63.8 bits (154), Expect = 2e-09, Method: Composition-based stats. 
Identities = 14/46 (30%), Positives = 29/46 (63%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83 L ++ GS + E++A ++SLI +C+LN+++ + +L + E I Sbjct: 520 LLLKNCMNIGSEAAAENSAFIFSLIESCKLNDIDPQDYLKHLFECI 565 >UniRef50_UPI00016C567B ISPsy5, transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C567B Length = 220 Score = 63.4 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 29/72 (40%), Gaps = 10/72 (13%) Query: 12 TSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVE 71 +I + ++ + + GS GG AA LYS++GTC+ +++ Sbjct: 120 LAIDNNLAERTLRAVA----------VGRNNWGVVGSEVGGRTAATLYSVVGTCKHLSID 169 Query: 72 LEKWLCYGIEHI 83 +L + I Sbjct: 170 PWTYLRDTLPGI 181 >UniRef50_D1JMC2 Transposase n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JMC2_9BACE Length = 528 Score = 63.4 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 15/75 (20%), Positives = 29/75 (38%), Gaps = 10/75 (13%) Query: 8 SGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRL 67 +G S + + + I L + + GS +G E + YSL +CRL Sbjct: 436 TGGDYSWDNNLIERINRYIS----------LSRKNSLFFGSHAGAERGCIFYSLACSCRL 485 Query: 68 NNVELEKWLCYGIEH 82 + + ++L + Sbjct: 486 HKINFFEYLTDILNR 500 >UniRef50_UPI0001C35C79 hypothetical protein ChatD1_36139 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35C79 Length = 80 Score = 63.4 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 7/65 (10%), Positives = 25/65 (38%), Gaps = 7/65 (10%) Query: 26 IKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 ++ +T L + + S G + ++Y+++ + + + +L Y ++ Sbjct: 1 LRPVT-------LGRKNWLFSDSQDGANASMIVYTMVEMAKAHGLHPYNYLKYLLDSRPG 53 Query: 86 WSANL 90 + Sbjct: 54 TDTSD 58 >UniRef50_D1UHS9 Transposase n=2 Tax=Burkholderiales RepID=D1UHS9_9BURK Length = 57 Score = 63.4 bits (153), Expect = 2e-09, Method: Composition-based stats. 
Identities = 14/37 (37%), Positives = 24/37 (64%) Query: 58 LYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +YSLIG+C+LN ++ +L + + HI D N + +L Sbjct: 1 MYSLIGSCKLNGIDPRAYLSHVLAHIADHKVNRIDEL 37 >UniRef50_A5VI71 Transposase IS66 n=28 Tax=Lactobacillus RepID=A5VI71_LACRD Length = 514 Score = 63.1 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 33/74 (44%), Gaps = 7/74 (9%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 ++ +I+ T L + + A S +G + A+ Y+L+ T N++ + K+ Y Sbjct: 433 NNPVEQAIRPST-------LIRKNSLFAKSSAGAQANAIFYTLVATANQNHLNIYKYFKY 485 Query: 79 GIEHIQDWSANLVR 92 +++ + + Sbjct: 486 LFDYLPNRKDAGLE 499 >UniRef50_Q2K2K4 Putative insertion sequence transposase protein n=1 Tax=Rhizobium etli CFN 42 RepID=Q2K2K4_RHIEC Length = 113 Score = 63.1 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 1/50 (2%) Query: 46 AGSGSGGEHAAVLYSLIGTCRLNNVEL-EKWLCYGIEHIQDWSANLVRDL 94 G GGE A + ++I T +L+ E +L + IQD + +++L Sbjct: 44 IGPDKGGERIANILTIIETAKLHGHNPPEIYLTDVLTRIQDHPKDHLQEL 93 >UniRef50_Q1A683 Transposase (Fragment) n=4 Tax=Clostridiales RepID=Q1A683_9FIRM Length = 244 Score = 63.1 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 11/57 (19%), Positives = 29/57 (50%) Query: 33 SDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + + + S +G +AV+YS++ T + NN+ + ++L + ++ D+ + Sbjct: 150 AKSYAVGRKAFLFHTSEAGAGASAVMYSIVETAKANNLNIFQYLYMVLLYMPDYMNS 206 >UniRef50_Q3Y235 Transposase IS66 n=11 Tax=Enterococcus RepID=Q3Y235_ENTFC Length = 515 Score = 62.7 bits (151), Expect = 4e-09, Method: Composition-based stats. 
Identities = 13/79 (16%), Positives = 29/79 (36%), Gaps = 10/79 (12%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + +IK + + + + S G + ++ S+ T LN + K+L + Sbjct: 430 NNRAERNIKELV-------IGRKNWLHSTSLEGARTSGIILSVYKTAELNGLNPVKYLEF 482 Query: 79 GIEHIQDWS---ANLVRDL 94 + I + A + L Sbjct: 483 LFDKIPNLPVLSAETLDQL 501 >UniRef50_A9EDB3 Putative transposase n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EDB3_9RHOB Length = 184 Score = 61.9 bits (149), Expect = 6e-09, Method: Composition-based stats. Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 7/68 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ I+ I L + A G G E A+L SLI C++ +V+ +L Sbjct: 109 DTNPVENQIRKI-------ALIRKNALFVGHEGGAESWALLASLIANCKMCDVDPVSYLS 161 Query: 78 YGIEHIQD 85 + + Sbjct: 162 DTLRVLPG 169 >UniRef50_A3JL62 ISPpu15, transposase n=2 Tax=Marinobacter sp. ELB17 RepID=A3JL62_9ALTE Length = 77 Score = 61.9 bits (149), Expect = 7e-09, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 21/47 (44%) Query: 39 EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + A + G +A YSL+ T + N +E ++ + +E + Sbjct: 3 NRKAWLFADTSQGARASATCYSLVETAKANKLEPSVYIQHVLERVAG 49 >UniRef50_D1PHR2 Putative transposase IS66 n=1 Tax=Prevotella copri DSM 18205 RepID=D1PHR2_9BACT Length = 641 Score = 61.1 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 13/70 (18%), Positives = 32/70 (45%), Gaps = 7/70 (10%) Query: 21 SLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGI 80 +++ + I + + AGS E+ A +YSL +C++NN+ +++ + Sbjct: 560 AIERCFRHI-------AMGRRNLGKAGSHEAAENLAFMYSLYESCKMNNLNFGRYIEDIL 612 Query: 81 EHIQDWSANL 90 ++D + Sbjct: 613 TRMKDGDKDY 622 >UniRef50_B9TJY8 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TJY8_RICCO Length = 104 Score = 60.7 bits (146), Expect = 1e-08, Method: Composition-based stats. 
Identities = 14/42 (33%), Positives = 22/42 (52%) Query: 53 EHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +Y LIGTC+L+ V + Y + HI D+ N + +L Sbjct: 43 REGRAMYDLIGTCKLDGVNPFTYFEYVLTHIADYKVNRIDEL 84 >UniRef50_D0AFL5 Transposase n=5 Tax=Enterococcus faecium RepID=D0AFL5_ENTFC Length = 503 Score = 60.7 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 11/67 (16%), Positives = 25/67 (37%), Gaps = 7/67 (10%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 + +IK + + + + S G ++ S+I T N + + K+L + Sbjct: 420 NNRAERAIKELV-------IGRKNWLFSKSLKGARSNGIILSIIQTAVANGLNIRKYLNH 472 Query: 79 GIEHIQD 85 I + Sbjct: 473 LFTEIPN 479 >UniRef50_C9K6B9 IS66 family transposase n=1 Tax=Sphingomonas sp. NP5 RepID=C9K6B9_9SPHN Length = 536 Score = 60.4 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 ++ +IK L + A + G H AVL SL+ T +NNV+ WL Sbjct: 455 DTNIVERAIKP-------QALTRRNALFCATHEGAAHWAVLASLLHTAHINNVDPLAWLT 507 Query: 78 YGIEHIQ 84 + ++ + Sbjct: 508 HALDTLA 514 >UniRef50_Q1GW50 Transposase and inactivated derivative n=2 Tax=Sphingomonadaceae RepID=Q1GW50_SPHAL Length = 57 Score = 60.0 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 12/38 (31%), Positives = 20/38 (52%) Query: 57 VLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +++SL GT RLN V+ W + I D + + +L Sbjct: 1 MMFSLFGTARLNGVDPLAWFTDVLTRIADIPQSRLHEL 38 >UniRef50_B3WDV3 Putative uncharacterized protein n=1 Tax=Lactobacillus casei BL23 RepID=B3WDV3_LACCB Length = 63 Score = 59.6 bits (143), Expect = 3e-08, Method: Composition-based stats. 
Identities = 7/53 (13%), Positives = 23/53 (43%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANL 90 + + +G G + A+L ++I T + N ++ ++ ++ + + Sbjct: 1 MGRKNFLFSGVPEGAKINAILMTMIETAKANALDPMTYIGSLLDELAQFPEWR 53 >UniRef50_C2EYW0 Putative uncharacterized protein n=2 Tax=Lactobacillus reuteri RepID=C2EYW0_LACRE Length = 88 Score = 59.2 bits (142), Expect = 4e-08, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 30/76 (39%), Gaps = 12/76 (15%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLC 77 + I+ T + + S G + A+ YS+I T +LN + + ++L Sbjct: 4 DNNHDEQQIRPTT-------IGRKNYLFTKSEVGAKANAMWYSIIQTAKLNKLRVREYLE 56 Query: 78 YGIEHI-----QDWSA 88 Y +E DW A Sbjct: 57 YLLEAFAWTDQPDWKA 72 >UniRef50_B3PCE0 IS66 family element, transposase n=2 Tax=Gammaproteobacteria RepID=B3PCE0_CELJU Length = 74 Score = 58.4 bits (140), Expect = 6e-08, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 19/41 (46%) Query: 45 LAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD 85 + S G +A LYS+I T + N +E +L + + Sbjct: 14 FSTSPKGATASANLYSVIETAKANGLEPYGYLKTIFTELPN 54 >UniRef50_B3W8G2 Transposase n=10 Tax=Lactobacillus RepID=B3W8G2_LACCB Length = 500 Score = 57.7 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 8/67 (11%), Positives = 27/67 (40%), Gaps = 7/67 (10%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCY 78 ++ ++K + + + + S G + ++ S++ + N ++ K+L Y Sbjct: 415 NNRVERAVKELV-------IGRKNWLFSTSFKGARSSGIILSVMRSAEANGLDCRKYLEY 467 Query: 79 GIEHIQD 85 + + Sbjct: 468 LFTELPN 474 >UniRef50_A6LGZ8 Putative uncharacterized protein n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LGZ8_PARD8 Length = 104 Score = 55.7 bits (133), Expect = 4e-07, Method: Composition-based stats. 
Identities = 15/88 (17%), Positives = 31/88 (35%), Gaps = 10/88 (11%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60 M+ R+ FS + ++ + M + I L + GS +G A + YS Sbjct: 13 MIGRMAFSMALSKLLDNAMERINRCIS----------LMRHNSLFFGSHAGASRAVIYYS 62 Query: 61 LIGTCRLNNVELEKWLCYGIEHIQDWSA 88 L +C + +++ + Sbjct: 63 LACSCSQRGINFFEYISDIMNRAAILPP 90 >UniRef50_UPI0001C34846 hypothetical protein PretD1_08486 n=2 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34846 Length = 464 Score = 55.7 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 7/57 (12%) Query: 18 PMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 + +I+ + + AGS + GE AA + L+ T RLN +E Sbjct: 415 DNNRCERAIRRVV-------MGRSNWLFAGSQAAGERAAKIMGLLETARLNGLEPYD 464 >UniRef50_C6IUV9 Transposase n=4 Tax=Bacteroides RepID=C6IUV9_9BACE Length = 571 Score = 55.0 bits (131), Expect = 7e-07, Method: Composition-based stats. Identities = 12/67 (17%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + + +I+ IT ++ + + GS G +++A+ + I TC+ V + Sbjct: 495 IDNMTAERAIRPIT-------VQRKNSLFFGSVKGIQNSAIYNTFIETCKQAGVSFRNYF 547 Query: 77 CYGIEHI 83 C + + Sbjct: 548 CKLLREL 554 >UniRef50_A3JZ99 Putative transposase n=1 Tax=Sagittula stellata E-37 RepID=A3JZ99_9RHOB Length = 77 Score = 55.0 bits (131), Expect = 8e-07, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 1/58 (1%) Query: 38 LEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-QDWSANLVRDL 94 L + A GS G +L S + TC+L+N+ +E +L ++ I L Sbjct: 7 LLRKNALFIGSDEGAHAWGILSSNVETCKLDNINVESYLTRILDQIEAKLPRRDYAKL 64 >UniRef50_A9ML85 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9ML85_SALAR Length = 94 Score = 54.2 bits (129), Expect = 1e-06, Method: Composition-based stats. 
Identities = 18/60 (30%), Positives = 26/60 (43%), Gaps = 10/60 (16%) Query: 12 TSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVE 71 I + + + L + GS GGE AA++YSL+G C+LN VE Sbjct: 1 MEIDNNICENALRCVA----------LGRRNYLFFGSDRGGEAAAIIYSLLGMCKLNGVE 50 >UniRef50_UPI0001744C57 ISPpu14, transposase Orf3 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744C57 Length = 73 Score = 51.5 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 19/48 (39%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 G G AA Y+LIG C N ++ +L + + V L Sbjct: 2 GDAQSGARAATFYTLIGNCHRNGIDAFAYLSDVFTRLPRETNRTVHRL 49 >UniRef50_C7GII7 ISPsy5, transposase (Fragment) n=3 Tax=Clostridiales RepID=C7GII7_9FIRM Length = 69 Score = 50.7 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 1/48 (2%) Query: 48 SGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDW-SANLVRDL 94 S G + +AV+YS+ T LN ++ +L Y ++ ++ DL Sbjct: 2 SIRGADASAVVYSIAETALLNGLKPYVYLSYVLDELRKMGPFPKPDDL 49 >UniRef50_B9KMR1 Transposase IS66 n=14 Tax=Alphaproteobacteria RepID=B9KMR1_RHOSK Length = 53 Score = 50.7 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 7/36 (19%), Positives = 18/36 (50%) Query: 59 YSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVRDL 94 +LI + +L+ ++ + +L + I D + +L Sbjct: 1 MTLIESAKLSGLDPQAYLADVLARINDHINPRLHEL 36 >UniRef50_UPI0001744E99 transposase IS66 n=3 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744E99 Length = 560 Score = 50.0 bits (118), Expect = 3e-05, Method: Composition-based stats. 
Identities = 9/81 (11%), Positives = 31/81 (38%), Gaps = 10/81 (12%) Query: 17 SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76 + +++ + L + GS G A + +++ + + + + ++L Sbjct: 465 IDNNWCENAMRPV-------ALGRKNWLHLGSHESGPKVAAILTVLASAQRLGLNVREYL 517 Query: 77 CYGIEHIQD---WSANLVRDL 94 +E + D ++ + +L Sbjct: 518 GEALETLCDGEGFNITRIGEL 538 >UniRef50_C9L9U8 Putative uncharacterized protein n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L9U8_RUMHA Length = 193 Score = 49.6 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 10/50 (20%), Positives = 22/50 (44%), Gaps = 2/50 (4%) Query: 47 GSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQD--WSANLVRDL 94 G +A++Y+++ + N++ K+L Y + D S + L Sbjct: 131 HHLKGAASSAIVYTMVEMAKANDLNTYKYLTYLLSQRPDAKMSDEQLEQL 180 >UniRef50_UPI000190820F insertion sequence transposase protein n=1 Tax=Rhizobium etli Kim 5 RepID=UPI000190820F Length = 127 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 26/69 (37%), Gaps = 9/69 (13%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60 M R LR P I ++ +I I L + A A + E+ A + S Sbjct: 1 MGRALRL--PDCGRIDIDNNRVERAISPI-------ALNRKHALFAVHDAAAENWATIAS 51 Query: 61 LIGTCRLNN 69 LI T + N Sbjct: 52 LIETGKFNG 60 >UniRef50_B9KUH0 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides KD131 RepID=B9KUH0_RHOSK Length = 61 Score = 49.2 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 7/49 (14%) Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRL 67 +++ I+ +T L + A AG GG A S+IGTCRL Sbjct: 3 SNPVESRIRPLT-------LGQKNALFAGHDEGGRSWARFASVIGTCRL 44 >UniRef50_UPI0001C34DDA transposase n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34DDA Length = 69 Score = 48.0 bits (113), Expect = 8e-05, Method: Composition-based stats. 
Identities = 8/37 (21%), Positives = 20/37 (54%) Query: 53 EHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSAN 89 + ++YS+ T + NN+ ++L Y + ++D + Sbjct: 4 KRLPIIYSITETAKANNLNPFRYLDYVLTVVKDHQDD 40 >UniRef50_Q024Q9 Transposase IS66 n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q024Q9_SOLUE Length = 500 Score = 47.6 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 32/83 (38%), Gaps = 12/83 (14%) Query: 1 MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYS 60 +VR L + + S + S++ + + + GS G A + S Sbjct: 409 LVRCLEYEEVELS-----NNLAENSMRPV-------AVGRKNWLHVGSVKAGPKVAAILS 456 Query: 61 LIGTCRLNNVELEKWLCYGIEHI 83 ++ +CR V + ++L + + Sbjct: 457 VVESCRRIGVPVREYLGGMLPGL 479 >UniRef50_Q8VSS7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=Q8VSS7_BACFR Length = 193 Score = 46.9 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 32/80 (40%), Gaps = 8/80 (10%) Query: 16 CSPMTSL-KTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEK 74 C+ S+ + I ++ E + + GS +A Y++I TC++ V + Sbjct: 110 CTIDNSIAERFICPLSG-------ERKNSLFFGSDKMARVSAAYYTIISTCKMQGVPALR 162 Query: 75 WLCYGIEHIQDWSANLVRDL 94 + ++ I + N L Sbjct: 163 YFKMFLQAIVNGRRNYENLL 182 >UniRef50_Q2K2A5 Hypothetical conserved protein n=1 Tax=Rhizobium etli CFN 42 RepID=Q2K2A5_RHIEC Length = 104 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 15/45 (33%), Positives = 23/45 (51%), Gaps = 5/45 (11%) Query: 49 GSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEH-----IQDWSA 88 SG + AA + +LI T +LNN+E + WL + + WS Sbjct: 21 PSGADRAACMATLIMTAKLNNIEPQAWLADVLAASLTRLLPGWSN 65 >UniRef50_D1K0Z1 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=D1K0Z1_9BACE Length = 91 Score = 45.7 bits (107), Expect = 4e-04, Method: Composition-based stats. 
 Identities = 14/77 (18%), Positives = 29/77 (37%), Gaps = 12/77 (15%)

Query: 12 TSIICSPMTS-----LKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCR 66
          + C M++ + I+ + E + + GS +AV +++I TCR
Sbjct: 3  LKVFCRTMSADNRVCAERFIRPLAG-------ERKNSLFFGSDKMAGVSAVYHTIIFTCR 55

Query: 67 LNNVELEKWLCYGIEHI 83
          + V + + I
Sbjct: 56 MQGVSVLDYFKRFFSEI 72

>UniRef50_A5FT16 Putative uncharacterized protein n=2 Tax=Acidiphilium cryptum JF-5 RepID=A5FT16_ACICJ
          Length = 81

 Score = 45.3 bits (106), Expect = 6e-04, Method: Composition-based stats.
 Identities = 17/49 (34%), Positives = 26/49 (53%)

Query: 35 IGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI 83
          +G L G S +G EH A++ +LIG+ RL+ VE L ++ I
Sbjct: 5  VGVLRRTGHHEVASDTGEEHCALMATLIGSARLSGVEPLARLTDVLQRI 53

>UniRef50_P39351 Uncharacterized protein yjgZ n=7 Tax=Escherichia coli RepID=YJGZ_ECOLI
          Length = 109

 Score = 44.9 bits (105), Expect = 8e-04, Method: Composition-based stats.
 Identities = 5/25 (20%), Positives = 11/25 (44%)

Query: 70 VELEKWLCYGIEHIQDWSANLVRDL 94
          +E WL + + +W + +L
Sbjct: 75 LEPHAWLTDVLTRLPEWPEERLAEL 99

>UniRef50_D1PFL8 Putative transposase number 3 n=1 Tax=Prevotella copri DSM 18205 RepID=D1PFL8_9BACT
          Length = 78

 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats.
 Identities = 9/64 (14%), Positives = 20/64 (31%), Gaps = 7/64 (10%)

Query: 20 TSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYG 79
          + +++ T L + S G A + + TC+L + +
Sbjct: 3  NCCERAVRPFTNL-------RKNFGGFSSEQGARVTATFLTFVETCKLMAMAPLDFFRGF 55

Query: 80 IEHI 83
          + I
Sbjct: 56 FDMI 59

>UniRef50_A7V2V7 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V2V7_BACUN
          Length = 218

 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats.
 Identities = 8/67 (11%), Positives = 23/67 (34%), Gaps = 7/67 (10%)

Query: 17  SPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76
           + +I+ + + S G A Y++I T ++ +++ +L
Sbjct: 142 IDNMVAERAIRPF-------AVSRNNSLHFSSEEGVNVAMTFYTIIETAKMYLSDIKGYL 194

Query: 77  CYGIEHI 83
           + +
Sbjct: 195 IHVFREL 201

>UniRef50_C7RJI1 Transposase IS66 n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RJI1_9PROT
          Length = 564

 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats.
 Identities = 13/72 (18%), Positives = 26/72 (36%), Gaps = 10/72 (13%)

Query: 10  PKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNN 69
           P+ + + + + + +GS G AA ++SL+ T RL
Sbjct: 460 PEVPMDNNVAERDQR----------TPVVARKNFYGSGSPWSGALAATMFSLLMTMRLWG 509

Query: 70  VELEKWLCYGIE 81
           + WL ++
Sbjct: 510 INPRTWLTAYLD 521

>UniRef50_D1RLE0 Putative transposase n=3 Tax=Legionella longbeachae D-4968 RepID=D1RLE0_LEGLO
          Length = 308

 Score = 43.8 bits (102), Expect = 0.002, Method: Composition-based stats.
 Identities = 9/38 (23%), Positives = 16/38 (42%)

Query: 39  EIQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWL 76
           + + G +VL S+I TC L + ++L
Sbjct: 236 GRKTWLFYKTEYGAMVGSVLTSIIYTCELAGINPFEYL 273

>UniRef50_A1VVP9 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VVP9_POLNA
          Length = 81

 Score = 43.0 bits (100), Expect = 0.003, Method: Composition-based stats.
 Identities = 7/41 (17%), Positives = 16/41 (39%)

Query: 52 GEHAAVLYSLIGTCRLNNVELEKWLCYGIEHIQDWSANLVR 92
          G+ A + + I + LN +L +L + + +
Sbjct: 31 GQRTAAIINFIQSVNLNGHDLYAYLKDVLTRLPTHKNRQIA 71

>UniRef50_Q2JA04 Transposase IS66 n=3 Tax=Frankia sp. CcI3 RepID=Q2JA04_FRASC
          Length = 546

 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats.
 Identities = 8/71 (11%), Positives = 24/71 (33%), Gaps = 10/71 (14%)

Query: 10  PKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNN 69
           P + + + +++ + + A + AA +++++ T N
Sbjct: 439 PDLDLDN---NAAERALR-------TPVVGRKNYYGAHAEWAAHLAARVWTIVATAERNG 488

Query: 70  VELEKWLCYGI 80
           E +L +
Sbjct: 489 REPLAFLTGYL 499

>UniRef50_C0GVV2 Transposase IS66 n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GVV2_9DELT
          Length = 530

 Score = 42.6 bits (99), Expect = 0.004, Method: Composition-based stats.
 Identities = 13/61 (21%), Positives = 22/61 (36%), Gaps = 7/61 (11%)

Query: 40  IQGASLAGSGSGGEHAAVLYSLIGTCRLNNVELEKWLCYGIEHI-------QDWSANLVR 92
           + + + G + S I TC+LN V +L ++I Q W R
Sbjct: 465 RKNSLFYRTLRGARIGDLFMSFIHTCQLNKVNSFDYLTQLQKNIEEALAAPQKWMPWNYR 524

Query: 93  D 93
           +
Sbjct: 525 E 525

>UniRef50_B7NGN8 Putative uncharacterized protein n=1 Tax=Escherichia coli UMN026 RepID=B7NGN8_ECOLU
          Length = 73

 Score = 42.3 bits (98), Expect = 0.004, Method: Composition-based stats.
 Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 7/47 (14%)

Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTC 65
          + +I+ + + + AGS G AA + SL+GT
Sbjct: 27 NNICERAIRPVV-------MGRKAWLFAGSLVAGNRAAQIMSLLGTA 66

>UniRef50_Q2JEM7 Transposase IS66 n=5 Tax=Frankia RepID=Q2JEM7_FRASC
          Length = 577

 Score = 41.5 bits (96), Expect = 0.010, Method: Composition-based stats.
 Identities = 11/76 (14%), Positives = 30/76 (39%), Gaps = 10/76 (13%)

Query: 6   RFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTC 65
           P + +P + +I+ + + A + + HAA ++++ T
Sbjct: 476 HRDYPMIGMDNNPA---ERAIR-------GPVVTRRNAGGSRTEDTARHAATIFTVTATA 525

Query: 66  RLNNVELEKWLCYGIE 81
           ++N+ L +L ++
Sbjct: 526 AMHNLNLLTYLENYLD 541

>UniRef50_Q3J215 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides 2.4.1 RepID=Q3J215_RHOS4
          Length = 130

 Score = 41.1 bits (95), Expect = 0.010, Method: Composition-based stats.
 Identities = 12/90 (13%), Positives = 28/90 (31%), Gaps = 13/90 (14%)

Query: 14 IICSPMTSLKTSIKTITYLSDIGCLEIQGASLAG--------SGSGGEH-AAVLYSLI-- 62
          I + L ++ IT + C G + + L +L+
Sbjct: 6  IDQHHLADLMREVREITKGEFLICYCESGWIAMANGVRYSTIHPTPADAVGEALNTLLDD 65

Query: 63 --GTCRLNNVELEKWLCYGIEHIQDWSANL 90
          + +L+ ++ + +L + I D
Sbjct: 66 EAESAKLSGLDPQAYLADVLARINDHINPR 95

>UniRef50_Q5WUX9 Putative uncharacterized protein n=1 Tax=Legionella pneumophila str. Lens RepID=Q5WUX9_LEGPL
          Length = 79

 Score = 41.1 bits (95), Expect = 0.011, Method: Composition-based stats.
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 53 EHAAVLYSLIGTCRLNNVELEKWLCYG 79
          A+LYSLI TC+ + V+ W Y
Sbjct: 34 RAGAILYSLIETCKYHQVDDFSWFKYV 60

>UniRef50_Q7WTH2 Putative uncharacterized protein n=1 Tax=Escherichia coli RepID=Q7WTH2_ECOLX
          Length = 80

 Score = 40.3 bits (93), Expect = 0.017, Method: Composition-based stats.
 Identities = 27/35 (77%), Positives = 28/35 (80%)

Query: 19 MTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGE 53
          MTSLKTSIKTITYLSD GCLEIQGASL + G
Sbjct: 1  MTSLKTSIKTITYLSDTGCLEIQGASLRCTNPRGR 35

>UniRef50_A6L7G0 Transposase n=15 Tax=Bacteroides RepID=A6L7G0_BACV8
          Length = 324

 Score = 40.3 bits (93), Expect = 0.018, Method: Composition-based stats.
 Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 10/78 (12%)

Query: 6   RFSGPKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTC 65
           + I + + +I+ +T + + GS +G E AA +S+IGT
Sbjct: 235 YLDDGELPIDNNLA---ERTIRKLTT-------QRNNSLHYGSDAGAEMAATYHSVIGTV 284

Query: 66  RLNNVELEKWLCYGIEHI 83
           +L + ++ ++I
Sbjct: 285 KLLGSSIWNFIGTFFKNI 302

>UniRef50_B0UII5 Putative uncharacterized protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UII5_METS4
          Length = 86

 Score = 40.3 bits (93), Expect = 0.021, Method: Composition-based stats.
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 66 RLNNVELEKWLCYGIEHIQDWSA 88
          RLN+V+ WL + I D +
Sbjct: 39 RLNDVDPRAWLADVLARINDHPS 61

>UniRef50_C6HW71 Probable transposase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HW71_9BACT
          Length = 277

 Score = 39.9 bits (92), Expect = 0.025, Method: Composition-based stats.
 Identities = 15/71 (21%), Positives = 24/71 (33%), Gaps = 10/71 (14%)

Query: 10  PKTSIICSPMTSLKTSIKTITYLSDIGCLEIQGASLAGSGSGGEHAAVLYSLIGTCRLNN 69
           P+ + + + + + + AGS G AA LYSL GT
Sbjct: 156 PQVPMDNNIAERAQRGVA----------VGRKVFYGAGSLVSGHMAAHLYSLFGTWSNAG 205

Query: 70  VELEKWLCYGI 80
           + L L +
Sbjct: 206 LNLRTTLTDYL 216

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.311    0.144    0.393

Lambda     K      H
   0.267   0.0440    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 549,637,128
Number of Sequences: 3077464
Number of extensions: 17991812
Number of successful extensions: 41840
Number of sequences better than 1.0e-01: 192
Number of HSP's better than 0.1 without gapping: 378
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 41449
Number of HSP's gapped (non-prelim): 389
length of query: 94
length of database: 1,040,396,356
effective HSP length: 63
effective length of query: 31
effective length of database: 846,516,124
effective search space: 26241999844
effective search space used: 26241999844
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 87 (38.0 bits)
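
The search statistics reported above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the BLAST output: the function names are hypothetical, and it assumes the usual Karlin-Altschul relations (effective length = length minus effective HSP length per sequence, bit score = (lambda * S - ln K) / ln 2, E = search space * 2^-bits). It also assumes that the first Lambda/K row is the ungapped set and the second row the gapped set, which the report itself does not label.

```python
import math

# Values copied from the statistics block of the report.
QUERY_LEN = 94
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 63  # "effective HSP length"


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # One HSP length is subtracted for every database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, search_space = effective_lengths(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_query, eff_db, search_space)

# Assumption: row 1 (0.311, 0.144) is ungapped, row 2 (0.267, 0.0440) is gapped.
print(round(bit_score(41, 0.311, 0.144), 1))   # compare with "S1: 41 (21.2 bits)"
print(round(bit_score(87, 0.267, 0.0440), 1))  # compare with "S2: 87 (38.0 bits)"

# Compare with "Score = 49.6 bits (117), Expect = 3e-05" for UniRef50_C9L9U8.
print(f"{expect(49.6, search_space):.1e}")
```

Composition-based statistics rescale scores per subject sequence, so E-values recomputed this way should only be expected to agree approximately with those printed for individual hits.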