BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (167 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=r... 342 2e-93 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 133 2e-30 UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepI... 122 4e-27 UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Ta... 121 7e-27 UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ... 100 2e-20 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 98 8e-20 UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO 92 5e-18 UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis... 80 2e-14 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 78 9e-14 UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C... 78 1e-13 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 78 1e-13 UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID... 77 2e-13 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 75 1e-12 UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldoc... 71 1e-11 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 71 1e-11 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 71 2e-11 UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyan... 68 1e-10 UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 Rep... 68 1e-10 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 66 4e-10 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 65 1e-09 UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis R... 59 8e-08 UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1... 57 3e-07 UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 56 4e-07 UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ru... 48 1e-04 UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis Rep... 45 6e-04 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 43 0.005 UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pest... 42 0.006 UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methano... 42 0.007 UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanoba... 42 0.008 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 42 0.008 UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC... 41 0.013 UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryoc... 41 0.018 UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN 41 0.019 UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryoc... 40 0.024 UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida... 40 0.033 UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methan... 40 0.035 UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-14... 40 0.040 UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8... 39 0.059 >UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=root RepID=INSB4_ECOLI Length = 167 Score = 342 bits (877), Expect = 2e-93, Method: Compositional matrix adjust. Identities = 164/167 (98%), Positives = 165/167 (98%) Query: 1 MPGNRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 MPGN PHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF Sbjct: 1 MPGNSPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 Query: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR Sbjct: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 Query: 121 YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 YTQRIER+NLNLRQHLARLGRKSLSFSKSVE HDKVIGHYLNIKHYQ Sbjct: 121 YTQRIERYNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 133 bits (335), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 64/143 (44%), Positives = 93/143 (65%) Query: 19 FKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGER 78 KKL P+ +TS +DV E+DEQW YVG+K+RQ W++YAY+ V+A+ FG R Sbjct: 83 LKKLAPKRITSSPVTHADVAFICELDEQWSYVGSKARQHWIWYAYNTKTGGVLAYTFGPR 142 Query: 79 TMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLAR 138 T T L++LL+PF++ + +D W Y + H+ K +TQ IER+NL LR + R Sbjct: 143 TDQTCRELLALLTPFNIGMLTSDDWGSYGREVPKNKHLTGKIFTQCIERNNLTLRTRIKR 202 Query: 139 LGRKSLSFSKSVEQHDKVIGHYL 161 LGRK++ FS+SVE H+KVIG ++ Sbjct: 203 LGRKTICFSRSVEIHEKVIGAFI 225 >UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepID=B2K0W2_YERPB Length = 122 Score = 122 bits (306), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 60/120 (50%), Positives = 82/120 (68%), Gaps = 1/120 (0%) Query: 47 WGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLY 106 W +VG K +QRWL+YA++ K ++AHVFG R+ T +L+ LLS F++V W TD + Y Sbjct: 2 WSFVGNKKQQRWLWYAWEPRLKRIIAHVFGRRSKKTFRQLLGLLSGFNIVFWCTDNFSAY 61 Query: 107 ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 E L + H+ SK YTQRIER NLN+R L RL RK+L SKS E HD++IG ++ +HY Sbjct: 62 EM-LPDEKHIRSKLYTQRIERENLNIRNRLKRLNRKTLGDSKSAEMHDRIIGTFIEREHY 120 >UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Tax=Gammaproteobacteria RepID=INBN_SHIDY Length = 131 Score = 121 bits (304), Expect = 7e-27, Method: Compositional matrix adjust. Identities = 57/126 (45%), Positives = 85/126 (67%), Gaps = 1/126 (0%) Query: 37 VIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVV 96 ++C E+DEQW +VG+K+RQ WL+YAY+ V+A+ FG RT T L++LL+PF++ Sbjct: 2 ALIC-ELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCRELLALLTPFNIG 60 Query: 97 IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 + +D W Y + H+ K +TQRIER+NL LR + RL RK++ FS+SVE H+KV Sbjct: 61 MLTSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRIKRLARKTICFSRSVEIHEKV 120 Query: 157 IGHYLN 162 IG ++ Sbjct: 121 IGTFIE 126 >UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ0_EDWI9 Length = 78 Score = 100 bits (248), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 45/78 (57%), Positives = 56/78 (71%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + F++ +MTD WP+Y + L HV+SK+YTQRIERHNLNLR HL RL R+++ FS S Sbjct: 1 MRKFNIAFYMTDAWPVYRTLLDPAHHVVSKKYTQRIERHNLNLRTHLKRLTRRTICFSNS 60 Query: 150 VEQHDKVIGHYLNIKHYQ 167 E HDKVIG YL I HY Sbjct: 61 EEMHDKVIGWYLTINHYH 78 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 98.2 bits (243), Expect = 8e-20, Method: Compositional matrix adjust. Identities = 44/46 (95%), Positives = 44/46 (95%) Query: 19 FKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 64 KLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD Sbjct: 68 LNKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 113 >UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO Length = 138 Score = 92.4 bits (228), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 48/122 (39%), Positives = 72/122 (59%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 AE+D+ +V K +RWL++A D T++A+V G+RT +L ++L PF + + T Sbjct: 8 AEVDKMKIFVAKKEHERWLWHAIDHQTGTILAYVLGQRTDQMFLKLKTMLKPFGISEFYT 67 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 D W Y+ L + +SK Q+IER +L LR + RL RK++ FSK HD VIG Y Sbjct: 68 DNWGSYKRHLSDEQRTVSKYKMQKIERKHLTLRTRIKRLQRKTICFSKISPMHDLVIGLY 127 Query: 161 LN 162 +N Sbjct: 128 IN 129 >UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1IXR6_DEIGD Length = 148 Score = 80.5 bits (197), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 54/131 (41%), Positives = 73/131 (55%), Gaps = 6/131 (4%) Query: 25 QSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLG 84 Q+V + P +V+V E+DE W +VG K + RWL+ A +R + V+A V G+R+ T Sbjct: 3 QTVPVCLTPPEEVVV--ELDELWTFVGKKKQARWLWIALERSTRKVLAWVLGDRSEQTAF 60 Query: 85 RLMSLL--SPFDVV--IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLG 140 +L L SP + + TD W Y+ L G + K T +ER N LRQ L RL Sbjct: 61 KLWDRLPLSPEQRLKGTFCTDLWRAYDEPLLGVKRLTRKGETNHVERLNCTLRQRLGRLV 120 Query: 141 RKSLSFSKSVE 151 RKSLSFSKS E Sbjct: 121 RKSLSFSKSDE 131 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 78.2 bits (191), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 12/142 (8%) Query: 9 GRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRK 68 G W Q K R Q+V +VI EMDE W YVG+K ++ W+++A +R Sbjct: 81 GEWIQAYHNQNKPKRRQAV--------EVI---EMDEMWHYVGSKKKKLWIWFALERSGG 129 Query: 69 TVVAHVFGERTMATLGRLMSLLSPFDV-VIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 +++ V G R +T RL + + TD WP Y + H +SK+ T IE Sbjct: 130 SILDFVTGSREASTGKRLWIKIKDIACRSFYATDHWPAYTQFINAHKHKVSKKQTTHIES 189 Query: 128 HNLNLRQHLARLGRKSLSFSKS 149 HN N+R +LAR RK+ +SKS Sbjct: 190 HNANVRHYLARFRRKTKCYSKS 211 >UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C5BB57_EDWI9 Length = 131 Score = 78.2 bits (191), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 40/106 (37%), Positives = 63/106 (59%) Query: 50 VGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESR 109 +G+K+RQ WL+YAY+ V+A+ FG +T + L+ L++PF++ + +D Sbjct: 1 MGSKARQHWLWYAYNTKTGGVLAYTFGPKTDESCRELLVLITPFNIGMITSDNRSSDGRE 60 Query: 110 LKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDK 155 + H+ K TQRI R+NL LR H+ RL RK++ FS+SV K Sbjct: 61 VPKDKHLTGKILTQRIVRNNLTLRTHIKRLARKTICFSRSVRSTKK 106 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 77.8 bits (190), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 2/114 (1%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVI--WM 99 E+DE W +VG KS + WL YA+DR+ K ++++V+G+R T+ RL L + Sbjct: 104 EIDEFWTFVGRKSERVWLIYAFDRVSKKIISYVWGKRNSETVMRLKIQLCKSQISFRYVY 163 Query: 100 TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQH 153 +D W + KG H + ++YT IE ++ LR + R RKS +FSKS++ H Sbjct: 164 SDRWICFRKIFKGYPHYLGRKYTIGIEGNHCLLRHRVRRFFRKSCNFSKSLKYH 217 >UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID=Q8VSP6_SHIFL Length = 67 Score = 77.4 bits (189), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 38/64 (59%), Positives = 46/64 (71%) Query: 104 PLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNI 163 P+Y + L HVISK+ TQRIERHNLNLR HL RL RK++ FSKS + H K+IG YL I Sbjct: 4 PVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSDDMHYKIIGWYLTI 63 Query: 164 KHYQ 167 H+ Sbjct: 64 NHHH 67 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 74.7 bits (182), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 44/120 (36%), Positives = 61/120 (50%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTD 101 E+DE Y+G+K WL YA D+ KTVV+ +RT TL R++ L + T Sbjct: 108 EVDEMCTYIGSKQNFIWLVYALDKNSKTVVSFNVAKRTNKTLSRVLDTLKLSEAKKIFTG 167 Query: 102 GWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 Y L K+H + + T IER NL LR HL RL R+++ SKS+ V+ Y Sbjct: 168 RLKNYRYLLDEKMHSVKRFGTNHIERKNLTLRTHLKRLNRRTICSSKSLLIFTAVLKIYF 227 >UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldococcus infernus ME RepID=C5U8R9_9EURY Length = 133 Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 3/119 (2%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWM 99 E+DE +V +K + W++ A D+ ++AH G+R+ +L +L+ + D + Sbjct: 7 EIDEMHSFVRSKDNKVWIWIAVDKNTGLIIAHKTGDRSDKSLKKLLKEIPKKVLDKCTFY 66 Query: 100 TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIG 158 TD W Y L + H I K YT+R+ER L R ARL R+ + +SKS+E H+ +I Sbjct: 67 TDKWKAYNI-LPNERHKIGKEYTRRVERTFLTFRNSCARLVRRGIRYSKSMEMHNIIID 124 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 45/141 (31%), Positives = 73/141 (51%), Gaps = 12/141 (8%) Query: 33 PGSDVI-VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMA--------TL 83 P +VI E+DE +VG+K + WL+ A + + ++A V G+ ++ T Sbjct: 89 PEENVIPEVGELDELETFVGSKKTKIWLWTAVNHFTQGILAWVLGDHSLVLSEVEVAETF 148 Query: 84 GRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKS 143 L + + ++TDGW +Y S + ++SK Y R+E N LR +LARL RK+ Sbjct: 149 KPLWENIEKWKCYFYVTDGWKVYPSFIPDGDQIVSKTYMTRVENENTRLRHYLARLHRKT 208 Query: 144 LSFSKS---VEQHDKVIGHYL 161 L +SKS + K++ HYL Sbjct: 209 LCYSKSEQILRYSIKLLLHYL 229 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 70.9 bits (172), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 44/120 (36%), Positives = 67/120 (55%), Gaps = 2/120 (1%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW- 98 C E+DE W +VG K+ ++WL YAY R +VA+V+G+R + T+ +L + L V Sbjct: 102 CLEIDELWTFVGKKTNKQWLIYAYHRDTGEIVAYVWGKRDLNTVKKLKAKLKALGVSCAR 161 Query: 99 -MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVI 157 +D W + + KG VI K +T IE +N +R + R R+S +FSK +E H K Sbjct: 162 IASDTWDSFVTGFKGFTQVIGKFFTVGIEGNNCTIRHRVRRAFRRSCNFSKKLENHFKAF 221 >UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyanobacteria RepID=B4WT39_9SYNE Length = 243 Score = 67.8 bits (164), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 50/148 (33%), Positives = 67/148 (45%), Gaps = 24/148 (16%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWM 99 E DE W +VG+KS ++W++ A +R + + G R L + L P + Sbjct: 93 ECDEAWSFVGSKSNKQWIWLAINRDTRETIGMHIGGRNREGARSLWACLPPVYRQCAVCY 152 Query: 100 TDGWP-----------------LYESRLKGKLH-VISKRY--TQRIERHNLNLRQHLARL 139 TD W YE L K H +SK T IER N LRQ ++RL Sbjct: 153 TDFWERCDPASLCGARERAPRQAYEIVLPSKRHRAVSKNSGQTNHIERFNCTLRQRVSRL 212 Query: 140 GRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 RKSLSFSK +E H I ++ I HY Sbjct: 213 VRKSLSFSKKLENHIGAIWYF--IHHYN 238 >UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 RepID=B2TXL7_SHIB3 Length = 44 Score = 67.8 bits (164), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 32/33 (96%), Positives = 32/33 (96%) Query: 135 HLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 HLARLGRKSLSFSKSVE HDKVIGHYLNIKHYQ Sbjct: 12 HLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 44 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 66.2 bits (160), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 39/123 (31%), Positives = 63/123 (51%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW 98 + E+DE Y +K+ +RW+ AY R K V+ + G RT TL ++ L + Sbjct: 99 ITIEIDELKTYTQSKTNERWVVAAYCRETKKVIDYKLGRRTTKTLQCIIDTLLYANPKKI 158 Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIG 158 +D +Y + LH +R T IER L+LR H+ RLGRKS++ ++ + D ++ Sbjct: 159 YSDRLNIYPKLIPKHLHSTKRRETNHIERKFLDLRTHIKRLGRKSINKAQRDKYTDAILR 218 Query: 159 HYL 161 Y Sbjct: 219 IYF 221 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 64.7 bits (156), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 38/121 (31%), Positives = 64/121 (52%), Gaps = 9/121 (7%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMS---LLSPFDVVIW 98 E+DE W ++G K W+ YA ++ +V+ G +T + L++ LL P + Sbjct: 97 EVDELWSFIGNKKNSTWITYAIEQKTGSVIDFFVGRKTKENIKPLINKVLLLQPTRI--- 153 Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS---VEQHDK 155 TD +Y S + ++H + T +IER NL LR H+ RL R+++ FS+ +E H K Sbjct: 154 YTDRLNIYPSLIPKEMHKRFQYCTNKIERMNLTLRTHIKRLSRRTICFSRKQEYLEAHLK 213 Query: 156 V 156 + Sbjct: 214 I 214 >UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis RepID=B2SG01_FRATM Length = 102 Score = 58.5 bits (140), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 2/103 (1%) Query: 47 WGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLY 106 W ++G+K + W+ AYDR + V G R AT RL + + TD W + Sbjct: 2 WNFIGSK--KCWIIKAYDRRVGKTIIWVTGGRDNATFRRLYKKVQHLTNCNFYTDDWVAF 59 Query: 107 ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 L K H+I K T IER N N R +LAR+ R++ S+S Sbjct: 60 VEVLPKKRHIIGKSGTVAIERDNSNTRHNLARMTRRTKVISRS 102 >UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1Z5_ACAM1 Length = 130 Score = 56.6 bits (135), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 7/125 (5%) Query: 47 WGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWP 104 W +V KS ++W++ A D + + +V G R+ +L + L + TD W Sbjct: 2 WSFVNDKSNKQWIWLALDVITREIVGVYVGARSKQGARQLWNSLPGIYRQCAVAYTDFWD 61 Query: 105 LYESRLKGKLH-VISKRYTQR--IERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 Y + H + K Q IER N +RQ ++RL RK+LSFSK +E H I ++ Sbjct: 62 AYGCVFPKQRHQAVGKETGQTCYIERFNCTMRQRVSRLVRKTLSFSKKLENHIGAI--WM 119 Query: 162 NIKHY 166 + HY Sbjct: 120 FVHHY 124 >UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 Length = 138 Score = 56.2 bits (134), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 36/106 (33%), Positives = 52/106 (49%), Gaps = 3/106 (2%) Query: 48 GYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYE 107 + + + WL+ AYDR+ ++ G R TL RL+ L+ + V + TD W Y+ Sbjct: 2 AFSSGQKNKLWLWKAYDRVTGRLIDWELGNRDSQTLSRLLERLAKWKVTVSCTDDWRPYQ 61 Query: 108 SRLK---GKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 L H ISKR T IER+N + R LAR R + S+S Sbjct: 62 QLLDEHPDAFHGISKRETVGIERNNSDNRHWLARFHRPTKVISRSA 107 >UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N0_SALRD Length = 158 Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 41/129 (31%), Positives = 61/129 (47%), Gaps = 6/129 (4%) Query: 26 SVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGR 85 SV ++P + V E+DE W YV ++ +RWL+ A R + VVA V G+R+ T R Sbjct: 11 SVAEGLRPAEEGDVL-ELDECWTYVRERANKRWLWVALCRRTRQVVAFVIGDRSARTCAR 69 Query: 86 LMSLL-SPFDVVIWMTDGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARLG 140 L S + + +D W Y G + S +ER LRQ LAR Sbjct: 70 LWSRIPEEYRQGRSFSDFWKSYRPVFAGDPSHRQVGKSSGEMAHVERFFGRLRQKLARYV 129 Query: 141 RKSLSFSKS 149 R++ + S+S Sbjct: 130 RRTRAASES 138 >UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis RepID=Q1CBA9_YERPA Length = 85 Score = 45.4 bits (106), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 1/61 (1%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 +T F T +L+ LLS F++V W TD + YE L + H+ SK YTQRIER Sbjct: 22 QTYYCSYFWSSEQKTFRQLLGLLSGFNIVFWCTDNFSAYE-MLPDEKHIRSKLYTQRIER 80 Query: 128 H 128 Sbjct: 81 E 81 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 42.7 bits (99), Expect = 0.005, Method: Compositional matrix adjust. Identities = 35/115 (30%), Positives = 54/115 (46%), Gaps = 12/115 (10%) Query: 44 DEQWGYV----GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWM 99 DE W Y+ G+K W++ A L + G+R T L++ L +V Sbjct: 173 DESWTYLRVRHGSKRENLWIWNA---LADGLPFFTTGDRDYKTFSFLLNSLPKSEV--NY 227 Query: 100 TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHD 154 TD + +Y+ HV SK+YT +E +N R HLARL R + + ++S D Sbjct: 228 TDDYSVYQVLDN---HVASKKYTYTVESYNSYCRAHLARLARDTRAVNRSERMVD 279 >UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pestis RepID=C4GXL2_YERPN Length = 111 Score = 42.4 bits (98), Expect = 0.006, Method: Compositional matrix adjust. Identities = 16/33 (48%), Positives = 25/33 (75%) Query: 47 WGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERT 79 W +VG K +QRWL+YA++ K ++AH+FG R+ Sbjct: 2 WSFVGNKKQQRWLWYAWEPRLKRIIAHIFGRRS 34 >UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46GF8_METBF Length = 112 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. Identities = 30/86 (34%), Positives = 41/86 (47%) Query: 66 LRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRI 125 L K + FG R T + L ++ MTD W Y L +H SK T + Sbjct: 6 LGKKFINCSFGSRGTETGQLIWEKLKQKEIGEVMTDHWRAYAEFLPENIHTQSKAETYTV 65 Query: 126 ERHNLNLRQHLARLGRKSLSFSKSVE 151 E +N LR LARL RK+ ++KS+E Sbjct: 66 EGYNGILRHFLARLRRKTKCYTKSIE 91 >UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=B0CCX7_ACAM1 Length = 196 Score = 42.0 bits (97), Expect = 0.008, Method: Compositional matrix adjust. Identities = 24/56 (42%), Positives = 31/56 (55%), Gaps = 1/56 (1%) Query: 98 WMTDGWPLYESRLKGK-LHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQ 152 W TDGW Y +L + +H +SK TQR+ER N LRQ R R+ F K +Q Sbjct: 93 WQTDGWEGYSRQLADEVIHHVSKALTQRLERTNGILRQQTGRWHRRQNKFGKVWQQ 148 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 42.0 bits (97), Expect = 0.008, Method: Compositional matrix adjust. Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 8/82 (9%) Query: 59 LFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVIS 118 L YAY R ++++ E+T +L+ L+ F+VV W TD + Y + L H Sbjct: 41 LEYAY---RACHCSYIWNEKT---FRKLLKKLASFNVVFWCTDNFKTY-NLLPKSQHRAG 93 Query: 119 KRYTQRIERHNLNLRQHLARLG 140 K +TQ IER NL +R + RL Sbjct: 94 KIFTQHIERENL-MRTRIKRLN 114 >UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFR2_MICAE Length = 122 Score = 41.2 bits (95), Expect = 0.013, Method: Compositional matrix adjust. Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 3/62 (4%) Query: 95 VVIWMTDGWPLYESRLKGKLH-VISKRY--TQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 + TD W Y++ + K H + K T IER N RQ ++RL R+SLSFSK +E Sbjct: 26 CAVAYTDCWESYKTGIPSKRHRPVGKETGQTNPIERLNNTFRQRISRLVRESLSFSKKME 85 Query: 152 QH 153 H Sbjct: 86 NH 87 >UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZKR3_ACAM1 Length = 241 Score = 40.8 bits (94), Expect = 0.018, Method: Compositional matrix adjust. Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 6/72 (8%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLM-----SLLSPFDVV 96 EMDE+ GYV K +Q W A D K ++ G R + RLM L P D+V Sbjct: 126 EMDERHGYVAIKQQQCWDAVAIDAASKFIIQVEVGPRNTNLIDRLMRATHKRLAHPRDLV 185 Query: 97 IWMTDGWPLYES 108 + MTDG Y + Sbjct: 186 L-MTDGDASYRT 196 >UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN Length = 111 Score = 40.8 bits (94), Expect = 0.019, Method: Compositional matrix adjust. Identities = 29/91 (31%), Positives = 47/91 (51%), Gaps = 5/91 (5%) Query: 76 GERTMATLGRLMSLLSPF--DVVIWMTDGWPLYESRLKGKLH-VISKRYTQR--IERHNL 130 G+R+ + +L + L + TD W Y++ + K H + K Q IER N Sbjct: 11 GDRSRQSAKKLWASLPGVYRQCAVAYTDFWESYKTVIPSKRHRPVGKETGQTNPIERLNN 70 Query: 131 NLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 RQ ++RL R+SLSFSK +E H + +++ Sbjct: 71 TFRQRISRLVRESLSFSKKMENHVGAVWYFI 101 >UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryochloris marina MBIC11017 RepID=B0CEC0_ACAM1 Length = 172 Score = 40.4 bits (93), Expect = 0.024, Method: Compositional matrix adjust. Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 1/56 (1%) Query: 98 WMTDGWPLYESRLKGKL-HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQ 152 W TDGW Y +L ++ H +SK TQR+ER N +RQ R R+ F K +Q Sbjct: 63 WQTDGWEGYARQLPDEVVHEVSKALTQRLERTNGIVRQQTGRWHRRQNKFGKVWQQ 118 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 39.7 bits (91), Expect = 0.033, Method: Compositional matrix adjust. Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 2/66 (3%) Query: 24 PQSVTSRIQPGSDV--IVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMA 81 P+++ + + +D +V E+DE W YVG+K+ +WL+ + VVA G R Sbjct: 84 PENLNAEVVSENDELEVVVLEVDELWSYVGSKANPQWLWLVMHSKTRQVVAMQIGPRNKE 143 Query: 82 TLGRLM 87 T +L+ Sbjct: 144 TAEKLL 149 >UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methanosarcina RepID=Q46CV2_METBF Length = 75 Score = 39.7 bits (91), Expect = 0.035, Method: Compositional matrix adjust. Identities = 22/53 (41%), Positives = 30/53 (56%) Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 MTD W Y L +H SK T +E +N L+ LARL RK+ ++KS+E Sbjct: 2 MTDHWRAYAEFLPENIHTQSKAETYTVEGYNGILKHFLARLRRKTKCYTKSIE 54 >UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B7X2_EDWI9 Length = 99 Score = 39.7 bits (91), Expect = 0.040, Method: Compositional matrix adjust. Identities = 25/75 (33%), Positives = 40/75 (53%), Gaps = 12/75 (16%) Query: 90 LSPFDVVIWMTDGW--PLYE----SRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKS 143 L+ F++ + D W P+ E L G + +TQ ER++L LR + RL RK Sbjct: 3 LTAFNIGMITRDDWGNPIREVPWGKPLTGTI------FTQHSERNSLMLRTRIKRLARKR 56 Query: 144 LSFSKSVEQHDKVIG 158 + FS+++ H+KV G Sbjct: 57 IGFSRAIALHEKVTG 71 >UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8_SHIDS Length = 94 Score = 38.9 bits (89), Expect = 0.059, Method: Compositional matrix adjust. Identities = 17/31 (54%), Positives = 24/31 (77%) Query: 125 IERHNLNLRQHLARLGRKSLSFSKSVEQHDK 155 +ER+NL LR + RL RK++ FS+SVE H+K Sbjct: 35 LERNNLPLRTRIKRLARKTICFSRSVEIHEK 65 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=r... 253 2e-66 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 205 4e-52 UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Ta... 185 4e-46 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 164 1e-39 UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO 157 1e-37 UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis... 157 1e-37 UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepI... 156 2e-37 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 152 3e-36 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 150 1e-35 UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyan... 150 2e-35 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 145 4e-34 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 143 2e-33 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 142 3e-33 UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1... 140 2e-32 UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldoc... 139 4e-32 UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ru... 137 1e-31 UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C... 136 3e-31 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 130 1e-29 UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 124 1e-27 UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis R... 120 2e-26 UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ... 110 1e-23 UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID... 91 1e-17 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 77 2e-13 UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis Rep... 74 1e-12 UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 Rep... 57 2e-07 Sequences not found previously or not previously below threshold: UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN 89 6e-17 UniRef50_Q9CJQ7 Putative uncharacterized protein n=2 Tax=Pasteur... 88 8e-17 UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methano... 83 3e-15 UniRef50_Q10VW0 ISSru3, transposase InsB n=1 Tax=Trichodesmium e... 76 4e-13 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 75 7e-13 UniRef50_O67144 Putative uncharacterized protein n=1 Tax=Aquifex... 73 4e-12 UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida... 72 7e-12 UniRef50_C0A223 Putative uncharacterized protein n=1 Tax=Opituta... 72 8e-12 UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC... 68 1e-10 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 68 1e-10 UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus p... 67 2e-10 UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl... 66 3e-10 UniRef50_Q32DI9 Iso-IS1 ORF2 n=2 Tax=Shigella RepID=Q32DI9_SHIDS 66 4e-10 UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanoba... 66 6e-10 UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryoc... 63 3e-09 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 63 4e-09 UniRef50_Q10ZU2 Putative uncharacterized protein n=3 Tax=Trichod... 63 4e-09 UniRef50_Q6MBQ1 Putative uncharacterized protein n=2 Tax=Candida... 62 5e-09 UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methan... 62 7e-09 UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-14... 62 7e-09 UniRef50_Q6MCH2 Putative uncharacterized protein n=1 Tax=Candida... 61 2e-08 UniRef50_D1JFE2 Putative uncharacterized protein n=3 Tax=uncultu... 60 3e-08 UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ... 60 3e-08 UniRef50_UPI00016C465A IS1 transposase n=1 Tax=Gemmata obscurigl... 59 5e-08 UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pest... 59 7e-08 UniRef50_A9GLN9 Putative uncharacterized protein n=2 Tax=Sorangi... 58 7e-08 UniRef50_A9GLP8 Putative uncharacterized protein n=1 Tax=Sorangi... 58 7e-08 UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros... 58 9e-08 UniRef50_B0URB1 Putative uncharacterized protein n=1 Tax=Methylo... 57 3e-07 UniRef50_A8GX98 Transposase and inactivated derivative n=2 Tax=R... 56 5e-07 UniRef50_A9FJP3 Putative uncharacterized protein n=5 Tax=Proteob... 55 1e-06 UniRef50_Q2FSQ2 Putative uncharacterized protein n=1 Tax=Methano... 55 1e-06 UniRef50_Q7NJH9 Gsl1853 protein n=1 Tax=Gloeobacter violaceus Re... 53 3e-06 UniRef50_B0CAP5 Putative uncharacterized protein n=3 Tax=Acaryoc... 52 8e-06 UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryoc... 51 1e-05 UniRef50_UPI00018554DD transposase n=1 Tax=Francisella novicida ... 50 2e-05 UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8... 50 4e-05 UniRef50_Q648U8 Putative uncharacterized protein n=6 Tax=environ... 49 5e-05 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 49 5e-05 UniRef50_Q972H6 Putative uncharacterized protein ST1154 n=1 Tax=... 49 6e-05 UniRef50_Q64CQ0 Putative uncharacterized protein n=1 Tax=uncultu... 48 1e-04 UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae... 47 2e-04 UniRef50_Q8PRQ0 Putative uncharacterized protein n=1 Tax=Methano... 47 3e-04 UniRef50_Q649W7 Putative uncharacterized protein n=1 Tax=uncultu... 47 3e-04 UniRef50_Q6MCX8 Putative uncharacterized protein n=2 Tax=Candida... 46 5e-04 UniRef50_Q10ZQ2 Putative uncharacterized protein n=7 Tax=Cyanoba... 44 0.001 UniRef50_B9K3D6 Transposase n=32 Tax=Bacteria RepID=B9K3D6_AGRVS 44 0.001 UniRef50_C7DAC3 Transposase n=36 Tax=Rhodobacterales RepID=C7DAC... 44 0.002 UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q... 44 0.002 UniRef50_A4AD66 Transposase n=19 Tax=unclassified Gammaproteobac... 43 0.004 UniRef50_Q469A1 Putative uncharacterized protein n=1 Tax=Methano... 43 0.005 UniRef50_Q6MD18 Putative uncharacterized protein n=2 Tax=Candida... 42 0.006 UniRef50_Q218S2 Putative uncharacterized protein n=1 Tax=Rhodops... 42 0.006 UniRef50_A9FZD9 Putative uncharacterized protein n=1 Tax=Sorangi... 42 0.009 UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiat... 42 0.010 UniRef50_B9K4Q6 Transposase n=2 Tax=Alphaproteobacteria RepID=B9... 41 0.016 UniRef50_Q11MN9 Transposase n=37 Tax=Bacteria RepID=Q11MN9_MESSB 41 0.018 UniRef50_B9K5F7 Transposase n=3 Tax=Bacteria RepID=B9K5F7_AGRVS 41 0.020 UniRef50_A0LBE3 Putative uncharacterized protein n=1 Tax=Magneto... 40 0.039 UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=... 40 0.041 UniRef50_A5N1B9 Transposase n=2 Tax=Clostridium kluyveri RepID=A... 40 0.044 UniRef50_A7C324 Putative uncharacterized protein n=3 Tax=Beggiat... 39 0.057 UniRef50_Q0W4E9 Putative uncharacterized protein n=1 Tax=uncultu... 39 0.063 UniRef50_A3W3Q5 Transposase n=1 Tax=Roseovarius sp. 217 RepID=A3... 38 0.080 UniRef50_A9EF44 Transposase n=2 Tax=Rhodobacteraceae RepID=A9EF4... 38 0.096 >UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=root RepID=INSB4_ECOLI Length = 167 Score = 253 bits (646), Expect = 2e-66, Method: Composition-based stats. Identities = 164/167 (98%), Positives = 165/167 (98%) Query: 1 MPGNRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 MPGN PHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF Sbjct: 1 MPGNSPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 Query: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR Sbjct: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 Query: 121 YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 YTQRIER+NLNLRQHLARLGRKSLSFSKSVE HDKVIGHYLNIKHYQ Sbjct: 121 YTQRIERYNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 205 bits (522), Expect = 4e-52, Method: Composition-based stats. Identities = 64/148 (43%), Positives = 94/148 (63%) Query: 19 FKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGER 78 KKL P+ +TS +DV E+DEQW YVG+K+RQ W++YAY+ V+A+ FG R Sbjct: 83 LKKLAPKRITSSPVTHADVAFICELDEQWSYVGSKARQHWIWYAYNTKTGGVLAYTFGPR 142 Query: 79 TMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLAR 138 T T L++LL+PF++ + +D W Y + H+ K +TQ IER+NL LR + R Sbjct: 143 TDQTCRELLALLTPFNIGMLTSDDWGSYGREVPKNKHLTGKIFTQCIERNNLTLRTRIKR 202 Query: 139 LGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 LGRK++ FS+SVE H+KVIG ++ + Sbjct: 203 LGRKTICFSRSVEIHEKVIGAFIEKHMF 230 >UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Tax=Gammaproteobacteria RepID=INBN_SHIDY Length = 131 Score = 185 bits (471), Expect = 4e-46, Method: Composition-based stats. Identities = 56/130 (43%), Positives = 85/130 (65%) Query: 37 VIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVV 96 + + E+DEQW +VG+K+RQ WL+YAY+ V+A+ FG RT T L++LL+PF++ Sbjct: 1 MALICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCRELLALLTPFNIG 60 Query: 97 IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 + +D W Y + H+ K +TQRIER+NL LR + RL RK++ FS+SVE H+KV Sbjct: 61 MLTSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRIKRLARKTICFSRSVEIHEKV 120 Query: 157 IGHYLNIKHY 166 IG ++ + Sbjct: 121 IGTFIEKHMF 130 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 164 bits (414), Expect = 1e-39, Method: Composition-based stats. Identities = 48/145 (33%), Positives = 69/145 (47%), Gaps = 12/145 (8%) Query: 9 GRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRK 68 G W Q K R Q+V EMDE W YVG+K ++ W+++A +R Sbjct: 81 GEWIQAYHNQNKPKRRQAV-----------EVIEMDEMWHYVGSKKKKLWIWFALERSGG 129 Query: 69 TVVAHVFGERTMATLGRLMSLLSPFDV-VIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 +++ V G R +T RL + + TD WP Y + H +SK+ T IE Sbjct: 130 SILDFVTGSREASTGKRLWIKIKDIACRSFYATDHWPAYTQFINAHKHKVSKKQTTHIES 189 Query: 128 HNLNLRQHLARLGRKSLSFSKSVEQ 152 HN N+R +LAR RK+ +SKS Sbjct: 190 HNANVRHYLARFRRKTKCYSKSERL 214 >UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO Length = 138 Score = 157 bits (397), Expect = 1e-37, Method: Composition-based stats. Identities = 48/127 (37%), Positives = 73/127 (57%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 AE+D+ +V K +RWL++A D T++A+V G+RT +L ++L PF + + T Sbjct: 8 AEVDKMKIFVAKKEHERWLWHAIDHQTGTILAYVLGQRTDQMFLKLKTMLKPFGISEFYT 67 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 D W Y+ L + +SK Q+IER +L LR + RL RK++ FSK HD VIG Y Sbjct: 68 DNWGSYKRHLSDEQRTVSKYKMQKIERKHLTLRTRIKRLQRKTICFSKISPMHDLVIGLY 127 Query: 161 LNIKHYQ 167 +N + Sbjct: 128 INKYEFH 134 >UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1IXR6_DEIGD Length = 148 Score = 157 bits (397), Expect = 1e-37, Method: Composition-based stats. Identities = 54/148 (36%), Positives = 74/148 (50%), Gaps = 8/148 (5%) Query: 24 PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATL 83 Q+V + P +V+V E+DE W +VG K + RWL+ A +R + V+A V G+R+ T Sbjct: 2 RQTVPVCLTPPEEVVV--ELDELWTFVGKKKQARWLWIALERSTRKVLAWVLGDRSEQTA 59 Query: 84 GRLMSLLSPFD----VVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARL 139 +L L + TD W Y+ L G + K T +ER N LRQ L RL Sbjct: 60 FKLWDRLPLSPEQRLKGTFCTDLWRAYDEPLLGVKRLTRKGETNHVERLNCTLRQRLGRL 119 Query: 140 GRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 RKSLSFSKS E + + L Y Sbjct: 120 VRKSLSFSKSDEMLEASLT--LAFHRYN 145 >UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepID=B2K0W2_YERPB Length = 122 Score = 156 bits (395), Expect = 2e-37, Method: Composition-based stats. Identities = 60/121 (49%), Positives = 82/121 (67%), Gaps = 1/121 (0%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPL 105 W +VG K +QRWL+YA++ K ++AHVFG R+ T +L+ LLS F++V W TD + Sbjct: 1 MWSFVGNKKQQRWLWYAWEPRLKRIIAHVFGRRSKKTFRQLLGLLSGFNIVFWCTDNFSA 60 Query: 106 YESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKH 165 YE L + H+ SK YTQRIER NLN+R L RL RK+L SKS E HD++IG ++ +H Sbjct: 61 YE-MLPDEKHIRSKLYTQRIERENLNIRNRLKRLNRKTLGDSKSAEMHDRIIGTFIEREH 119 Query: 166 Y 166 Y Sbjct: 120 Y 120 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 152 bits (385), Expect = 3e-36, Method: Composition-based stats. Identities = 43/131 (32%), Positives = 68/131 (51%), Gaps = 2/131 (1%) Query: 33 PGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSP 92 P E+DE W +VG KS + WL YA+DR+ K ++++V+G+R T+ RL L Sbjct: 95 PHHCFYESIEIDEFWTFVGRKSERVWLIYAFDRVSKKIISYVWGKRNSETVMRLKIQLCK 154 Query: 93 FDVVI--WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + +D W + KG H + ++YT IE ++ LR + R RKS +FSKS+ Sbjct: 155 SQISFRYVYSDRWICFRKIFKGYPHYLGRKYTIGIEGNHCLLRHRVRRFFRKSCNFSKSL 214 Query: 151 EQHDKVIGHYL 161 + H + Sbjct: 215 KYHFSAFRLMI 225 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 150 bits (380), Expect = 1e-35, Method: Composition-based stats. Identities = 45/142 (31%), Positives = 72/142 (50%), Gaps = 12/142 (8%) Query: 32 QPGSDVI-VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERT--------MAT 82 P +VI E+DE +VG+K + WL+ A + + ++A V G+ + T Sbjct: 88 VPEENVIPEVGELDELETFVGSKKTKIWLWTAVNHFTQGILAWVLGDHSLVLSEVEVAET 147 Query: 83 LGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRK 142 L + + ++TDGW +Y S + ++SK Y R+E N LR +LARL RK Sbjct: 148 FKPLWENIEKWKCYFYVTDGWKVYPSFIPDGDQIVSKTYMTRVENENTRLRHYLARLHRK 207 Query: 143 SLSFSKS---VEQHDKVIGHYL 161 +L +SKS + K++ HYL Sbjct: 208 TLCYSKSEQILRYSIKLLLHYL 229 >UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyanobacteria RepID=B4WT39_9SYNE Length = 243 Score = 150 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 54/179 (30%), Positives = 75/179 (41%), Gaps = 27/179 (15%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTV 70 W Q P+ + + G + E DE W +VG+KS ++W++ A +R + Sbjct: 65 WLQQYASEEYADVPRQAKTSPKKGP---LTLECDEAWSFVGSKSNKQWIWLAINRDTRET 121 Query: 71 VAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWP-----------------LYESRLK 111 + G R L + L P + TD W YE L Sbjct: 122 IGMHIGGRNREGARSLWACLPPVYRQCAVCYTDFWERCDPASLCGARERAPRQAYEIVLP 181 Query: 112 GKLH-VISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 K H +SK T IER N LRQ ++RL RKSLSFSK +E H I ++ I HY Sbjct: 182 SKRHRAVSKNSGQTNHIERFNCTLRQRVSRLVRKSLSFSKKLENHIGAIWYF--IHHYN 238 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 34/120 (28%), Positives = 60/120 (50%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTD 101 E+DE W ++G K W+ YA ++ +V+ G +T + L++ + TD Sbjct: 97 EVDELWSFIGNKKNSTWITYAIEQKTGSVIDFFVGRKTKENIKPLINKVLLLQPTRIYTD 156 Query: 102 GWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 +Y S + ++H + T +IER NL LR H+ RL R+++ FS+ E + + Y Sbjct: 157 RLNIYPSLIPKEMHKRFQYCTNKIERMNLTLRTHIKRLSRRTICFSRKQEYLEAHLKIYF 216 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. Identities = 40/131 (30%), Positives = 66/131 (50%) Query: 31 IQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL 90 ++P + E+DE Y +K+ +RW+ AY R K V+ + G RT TL ++ L Sbjct: 91 VKPPIPQNITIEIDELKTYTQSKTNERWVVAAYCRETKKVIDYKLGRRTTKTLQCIIDTL 150 Query: 91 SPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + +D +Y + LH +R T IER L+LR H+ RLGRKS++ ++ Sbjct: 151 LYANPKKIYSDRLNIYPKLIPKHLHSTKRRETNHIERKFLDLRTHIKRLGRKSINKAQRD 210 Query: 151 EQHDKVIGHYL 161 + D ++ Y Sbjct: 211 KYTDAILRIYF 221 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 142 bits (359), Expect = 3e-33, Method: Composition-based stats. Identities = 45/124 (36%), Positives = 62/124 (50%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWM 99 E+DE Y+G+K WL YA D+ KTVV+ +RT TL R++ L + Sbjct: 106 TYEVDEMCTYIGSKQNFIWLVYALDKNSKTVVSFNVAKRTNKTLSRVLDTLKLSEAKKIF 165 Query: 100 TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGH 159 T Y L K+H + + T IER NL LR HL RL R+++ SKS+ V+ Sbjct: 166 TGRLKNYRYLLDEKMHSVKRFGTNHIERKNLTLRTHLKRLNRRTICSSKSLLIFTAVLKI 225 Query: 160 YLNI 163 Y I Sbjct: 226 YFWI 229 >UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1Z5_ACAM1 Length = 130 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 7/127 (5%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGW 103 W +V KS ++W++ A D + + +V G R+ +L + L + TD W Sbjct: 1 MWSFVNDKSNKQWIWLALDVITREIVGVYVGARSKQGARQLWNSLPGIYRQCAVAYTDFW 60 Query: 104 PLYESRLKGKLH-VISKRYTQR--IERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 Y + H + K Q IER N +RQ ++RL RK+LSFSK +E H I + Sbjct: 61 DAYGCVFPKQRHQAVGKETGQTCYIERFNCTMRQRVSRLVRKTLSFSKKLENHIGAI--W 118 Query: 161 LNIKHYQ 167 + + HY Sbjct: 119 MFVHHYN 125 >UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldococcus infernus ME RepID=C5U8R9_9EURY Length = 133 Score = 139 bits (350), Expect = 4e-32, Method: Composition-based stats. Identities = 40/123 (32%), Positives = 66/123 (53%), Gaps = 3/123 (2%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSP--FDVV 96 + E+DE +V +K + W++ A D+ ++AH G+R+ +L +L+ + D Sbjct: 4 IHLEIDEMHSFVRSKDNKVWIWIAVDKNTGLIIAHKTGDRSDKSLKKLLKEIPKKVLDKC 63 Query: 97 IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 + TD W Y L + H I K YT+R+ER L R ARL R+ + +SKS+E H+ + Sbjct: 64 TFYTDKWKAYN-ILPNERHKIGKEYTRRVERTFLTFRNSCARLVRRGIRYSKSMEMHNII 122 Query: 157 IGH 159 I Sbjct: 123 IDL 125 >UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N0_SALRD Length = 158 Score = 137 bits (345), Expect = 1e-31, Method: Composition-based stats. Identities = 42/150 (28%), Positives = 63/150 (42%), Gaps = 6/150 (4%) Query: 18 PFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGE 77 K SV ++P + V E+DE W YV ++ +RWL+ A R + VVA V G+ Sbjct: 3 QKKGRESDSVAEGLRPAEEGDVL-ELDECWTYVRERANKRWLWVALCRRTRQVVAFVIGD 61 Query: 78 RTMATLGRLMSLL-SPFDVVIWMTDGWPLYESRLKGK----LHVISKRYTQRIERHNLNL 132 R+ T RL S + + +D W Y G S +ER L Sbjct: 62 RSARTCARLWSRIPEEYRQGRSFSDFWKSYRPVFAGDPSHRQVGKSSGEMAHVERFFGRL 121 Query: 133 RQHLARLGRKSLSFSKSVEQHDKVIGHYLN 162 RQ LAR R++ + S+S ++ Sbjct: 122 RQKLARYVRRTRAASESERMLHLTTKLFVE 151 >UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C5BB57_EDWI9 Length = 131 Score = 136 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 40/106 (37%), Positives = 63/106 (59%) Query: 50 VGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESR 109 +G+K+RQ WL+YAY+ V+A+ FG +T + L+ L++PF++ + +D Sbjct: 1 MGSKARQHWLWYAYNTKTGGVLAYTFGPKTDESCRELLVLITPFNIGMITSDNRSSDGRE 60 Query: 110 LKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDK 155 + H+ K TQRI R+NL LR H+ RL RK++ FS+SV K Sbjct: 61 VPKDKHLTGKILTQRIVRNNLTLRTHIKRLARKTICFSRSVRSTKK 106 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 45/138 (32%), Positives = 68/138 (49%), Gaps = 3/138 (2%) Query: 31 IQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATL--GRLMS 88 I P C E+DE W +VG K+ ++WL YAY R +VA+V+G+R + T+ + Sbjct: 93 ITPKQRQYDCLEIDELWTFVGKKTNKQWLIYAYHRDTGEIVAYVWGKRDLNTVKKLKAKL 152 Query: 89 LLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK 148 +D W + + KG VI K +T IE +N +R + R R+S +FSK Sbjct: 153 KALGVSCARIASDTWDSFVTGFKGFTQVIGKFFTVGIEGNNCTIRHRVRRAFRRSCNFSK 212 Query: 149 SVEQHDKVIGH-YLNIKH 165 +E H K + I H Sbjct: 213 KLENHFKAFDLAFFYINH 230 >UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 Length = 138 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 3/116 (2%) Query: 48 GYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYE 107 + + + WL+ AYDR+ ++ G R TL RL+ L+ + V + TD W Y+ Sbjct: 2 AFSSGQKNKLWLWKAYDRVTGRLIDWELGNRDSQTLSRLLERLAKWKVTVSCTDDWRPYQ 61 Query: 108 SRL---KGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 L H ISKR T IER+N + R LAR R + S+S + + + Sbjct: 62 QLLDEHPDAFHGISKRETVGIERNNSDNRHWLARFHRPTKVISRSAHMVNITMAIF 117 >UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis RepID=B2SG01_FRATM Length = 102 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 36/104 (34%), Positives = 51/104 (49%), Gaps = 2/104 (1%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPL 105 W ++G+K + W+ AYDR + V G R AT RL + + TD W Sbjct: 1 MWNFIGSK--KCWIIKAYDRRVGKTIIWVTGGRDNATFRRLYKKVQHLTNCNFYTDDWVA 58 Query: 106 YESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + L K H+I K T IER N N R +LAR+ R++ S+S Sbjct: 59 FVEVLPKKRHIIGKSGTVAIERDNSNTRHNLARMTRRTKVISRS 102 >UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ0_EDWI9 Length = 78 Score = 110 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 45/78 (57%), Positives = 56/78 (71%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + F++ +MTD WP+Y + L HV+SK+YTQRIERHNLNLR HL RL R+++ FS S Sbjct: 1 MRKFNIAFYMTDAWPVYRTLLDPAHHVVSKKYTQRIERHNLNLRTHLKRLTRRTICFSNS 60 Query: 150 VEQHDKVIGHYLNIKHYQ 167 E HDKVIG YL I HY Sbjct: 61 EEMHDKVIGWYLTINHYH 78 >UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID=Q8VSP6_SHIFL Length = 67 Score = 91.2 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 38/64 (59%), Positives = 46/64 (71%) Query: 104 PLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNI 163 P+Y + L HVISK+ TQRIERHNLNLR HL RL RK++ FSKS + H K+IG YL I Sbjct: 4 PVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSDDMHYKIIGWYLTI 63 Query: 164 KHYQ 167 H+ Sbjct: 64 NHHH 67 >UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN Length = 111 Score = 88.9 bits (219), Expect = 6e-17, Method: Composition-based stats. Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 7/105 (6%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWPLYESRLKGKLHV-ISK--RYT 122 ++ + G+R+ + +L + L + TD W Y++ + K H + K T Sbjct: 3 GKLLVAMRGDRSRQSAKKLWASLPGVYRQCAVAYTDFWESYKTVIPSKRHRPVGKETGQT 62 Query: 123 QRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 IER N RQ ++RL R+SLSFSK +E H + ++ I Y Sbjct: 63 NPIERLNNTFRQRISRLVRESLSFSKKMENHVGAVWYF--IHDYN 105 >UniRef50_Q9CJQ7 Putative uncharacterized protein n=2 Tax=Pasteurellaceae RepID=Q9CJQ7_PASMU Length = 181 Score = 88.2 bits (217), Expect = 8e-17, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 50/118 (42%), Gaps = 5/118 (4%) Query: 47 WGYV--GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVV--IWMTDG 102 W +V ++ ++ Y +VA V+G+R + T L L V D Sbjct: 62 WHFVPPNRIDQKYRIYIGYHAKTSEIVAFVWGKRDLQTALALKQRLKELKVSYERIAGDN 121 Query: 103 WPLYESRLKG-KLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGH 159 W + + + K++T+ IE +N +R L+R R+S FSKS+ H K Sbjct: 122 WDAFVNAFSDTGDQWVGKQHTKAIEGNNCRIRHRLSRAVRRSCCFSKSMFYHVKSFNI 179 >UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46GF8_METBF Length = 112 Score = 83.1 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 29/85 (34%), Positives = 40/85 (47%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 K + FG R T + L ++ MTD W Y L +H SK T +E Sbjct: 8 KKFINCSFGSRGTETGQLIWEKLKQKEIGEVMTDHWRAYAEFLPENIHTQSKAETYTVEG 67 Query: 128 HNLNLRQHLARLGRKSLSFSKSVEQ 152 +N LR LARL RK+ ++KS+E Sbjct: 68 YNGILRHFLARLRRKTKCYTKSIEM 92 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 77.4 bits (189), Expect = 2e-13, Method: Composition-based stats. Identities = 44/46 (95%), Positives = 44/46 (95%) Query: 19 FKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 64 KLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD Sbjct: 68 LNKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 113 >UniRef50_Q10VW0 ISSru3, transposase InsB n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VW0_TRIEI Length = 76 Score = 76.2 bits (186), Expect = 4e-13, Method: Composition-based stats. Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 2/73 (2%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGW 103 W +VG+K+ Q+W + A D K +VA GER +L + I TD W Sbjct: 1 MWSFVGSKNNQQWFWLAIDIETKEIVAFSLGERGEKGANQLWNSWPGIYRQCAICYTDFW 60 Query: 104 PLYESRLKGKLHV 116 Y+ + Sbjct: 61 SAYDVIFPHCRQL 73 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 75.4 bits (184), Expect = 7e-13, Method: Composition-based stats. Identities = 34/115 (29%), Positives = 53/115 (46%), Gaps = 12/115 (10%) Query: 44 DEQWGYV----GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWM 99 DE W Y+ G+K W++ A + G+R T L++ L +V Sbjct: 173 DESWTYLRVRHGSKRENLWIWNAL---ADGLPFFTTGDRDYKTFSFLLNSLPKSEVN--Y 227 Query: 100 TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHD 154 TD + +Y+ HV SK+YT +E +N R HLARL R + + ++S D Sbjct: 228 TDDYSVYQVL---DNHVASKKYTYTVESYNSYCRAHLARLARDTRAVNRSERMVD 279 >UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis RepID=Q1CBA9_YERPA Length = 85 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 1/61 (1%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 +T F T +L+ LLS F++V W TD + YE L + H+ SK YTQRIER Sbjct: 22 QTYYCSYFWSSEQKTFRQLLGLLSGFNIVFWCTDNFSAYE-MLPDEKHIRSKLYTQRIER 80 Query: 128 H 128 Sbjct: 81 E 81 >UniRef50_O67144 Putative uncharacterized protein n=1 Tax=Aquifex aeolicus RepID=O67144_AQUAE Length = 147 Score = 72.7 bits (177), Expect = 4e-12, Method: Composition-based stats. Identities = 33/145 (22%), Positives = 65/145 (44%), Gaps = 7/145 (4%) Query: 24 PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF-YAYDRLRKTVVAHVF-GERTMA 81 P+ + ++ D + DE W YVG K + W++ + T+ +F G+R++ Sbjct: 4 PEYGSEKVVKTEDNMENKPTDEMWSYVGTKGNEVWIWSVVVELKDGTIKKFLFAGDRSLR 63 Query: 82 TLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRY-TQRIERHNLNLRQHLARLG 140 T ++++ + + + TD + +YE L H++ K R E + LR L Sbjct: 64 TFLKILAKMPEAE--EYETDAYRVYE-WLPRDRHIVRKYGRVNRNEALHSKLRDKLVAFK 120 Query: 141 RKSLSFSKSVEQHDKVIGHYLNIKH 165 RK+ +F +S + + +I H Sbjct: 121 RKTKAFFRSFLYLRYALALF-SIHH 144 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 72.0 bits (175), Expect = 7e-12, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 2/83 (2%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDV--IVCAEMDEQWGYVGAKSRQRWLFYAYDRLRK 68 W K P+++ + + +D +V E+DE W YVG+K+ +WL+ + Sbjct: 71 WLLEFIGELTKELPENLNAEVVSENDELEVVVLEVDELWSYVGSKANPQWLWLVMHSKTR 130 Query: 69 TVVAHVFGERTMATLGRLMSLLS 91 VVA G R T +L+ L Sbjct: 131 QVVAMQIGPRNKETAEKLLYKLP 153 >UniRef50_C0A223 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A223_9BACT Length = 269 Score = 72.0 bits (175), Expect = 8e-12, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 50/169 (29%), Gaps = 48/169 (28%) Query: 41 AEMDEQWGYVGAKSRQR----------WLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL 90 + DE W +VG K + W + A D K V G R + + M L Sbjct: 63 IQCDEIWSFVGCKEKNVTNNGKRQGDTWTWIACDPDTKLVPCWFIGRRDSESAKKFMRRL 122 Query: 91 SP---FDVVIWMTDGWPLY-------------------ESRLKGKLHVISKRY------- 121 + TDG Y + + G H Sbjct: 123 ARHLSLGSTQITTDGLKAYINAIKEILWIETSYGMVEKKYDVSGDDHRTRYIGSEKTAIF 182 Query: 122 ---------TQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 T +ER NL +R + R RK+ +SK + H I + Sbjct: 183 GNPDPDTMNTSIVERQNLTMRMSMRRFTRKTNGYSKKIANHRYAIALHF 231 >UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFR2_MICAE Length = 122 Score = 68.1 bits (165), Expect = 1e-10, Method: Composition-based stats. Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 3/63 (4%) Query: 94 DVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + TD W Y++ + K H + K T IER N RQ ++RL R+SLSFSK + Sbjct: 25 QCAVAYTDCWESYKTGIPSKRHRPVGKETGQTNPIERLNNTFRQRISRLVRESLSFSKKM 84 Query: 151 EQH 153 E H Sbjct: 85 ENH 87 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 67.7 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 23/62 (37%), Positives = 31/62 (50%), Gaps = 2/62 (3%) Query: 79 TMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLAR 138 T +L+ L+ F+VV W TD + Y L H K +TQ IER NL +R + R Sbjct: 55 NEKTFRKLLKKLASFNVVFWCTDNFKTYN-LLPKSQHRAGKIFTQHIERENL-MRTRIKR 112 Query: 139 LG 140 L Sbjct: 113 LN 114 >UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SAB2_FERPL Length = 75 Score = 67.0 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 3/72 (4%) Query: 96 VIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDK 155 I+ TD W Y + K +I K T +ER L LR R RKS+ FSKS+E + Sbjct: 3 AIFYTDRWDAYN-LIPYKQRIIKKGGTNHVERLFLTLRNDNPRFARKSIRFSKSIEMLEN 61 Query: 156 VIGHYLNIKHYQ 167 + + I +Y Sbjct: 62 SLKLW--IHYYN 71 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 66.2 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 1/67 (1%) Query: 27 VTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRL 86 VT + +V E+DE W +VG K +WL+ + + V+A G R T L Sbjct: 93 VTCCEKDELEVAKL-EVDELWNFVGNKKNDQWLWLILHKKSRQVLAMQVGPRDKKTAELL 151 Query: 87 MSLLSPF 93 + L Sbjct: 152 FAKLPES 158 >UniRef50_Q32DI9 Iso-IS1 ORF2 n=2 Tax=Shigella RepID=Q32DI9_SHIDS Length = 94 Score = 66.2 bits (160), Expect = 4e-10, Method: Composition-based stats. Identities = 14/52 (26%), Positives = 25/52 (48%) Query: 63 YDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKL 114 + +A+ FG RT T L++LL+PF++ + +D W Y + Sbjct: 32 ITPKQGGGLAYTFGPRTDETCRELLALLTPFNIGMITSDDWGSYGREVPKDK 83 >UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=B0CCX7_ACAM1 Length = 196 Score = 65.8 bits (159), Expect = 6e-10, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 46/117 (39%), Gaps = 12/117 (10%) Query: 44 DEQWGYVGAKSR----------QRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS-P 92 DE W V K + W+ + + V+ G+ T L+ Sbjct: 28 DELWSSVKKKQKHCEPEELSLGDCWIALSLAKDSGLVLTGRIGKHTDELAQELIENTEGK 87 Query: 93 FDVVIWMTDGWPLYESRLKGK-LHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK 148 W TDGW Y +L + +H +SK TQR+ER N LRQ R R+ F K Sbjct: 88 TACHHWQTDGWEGYSRQLADEVIHHVSKALTQRLERTNGILRQQTGRWHRRQNKFGK 144 >UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryochloris marina MBIC11017 RepID=B0CEC0_ACAM1 Length = 172 Score = 63.1 bits (152), Expect = 3e-09, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 46/111 (41%), Gaps = 5/111 (4%) Query: 57 RWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGKL- 114 W+ + + V++ G+ T L+ W TDGW Y +L ++ Sbjct: 21 CWIALSLAKESGLVLSGRIGKHTDELAQELIENTEGKTACHHWQTDGWEGYARQLPDEVV 80 Query: 115 HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK---SVEQHDKVIGHYLN 162 H +SK TQR+ER N +RQ R R+ F K +++ Y N Sbjct: 81 HEVSKALTQRLERTNGIVRQQTGRWHRRQNKFGKVWQQSAMTLRLVLSYFN 131 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 63.1 bits (152), Expect = 4e-09, Method: Composition-based stats. Identities = 24/93 (25%), Positives = 32/93 (34%), Gaps = 9/93 (9%) Query: 35 SDVIVCAEMDEQWGYVGAKSRQRWLFYA----YDRLRKTVVAHVFGERTMATLGRLMSLL 90 + I EMDE Y+G K A + FG R T + L Sbjct: 90 ENEISIVEMDEMHTYIGNKKN-----IAGSGLLLIELGKFIHCSFGNRGTETGQLIWEKL 144 Query: 91 SPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQ 123 ++ MTD W Y L +H SK+ Q Sbjct: 145 KQKEIGEVMTDHWRAYAEFLPENIHTQSKKRIQ 177 >UniRef50_Q10ZU2 Putative uncharacterized protein n=3 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZU2_TRIEI Length = 79 Score = 62.7 bits (151), Expect = 4e-09, Method: Composition-based stats. Identities = 14/45 (31%), Positives = 23/45 (51%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATL 83 + + DE W +VG K+ ++WL+ A D + +V GER Sbjct: 34 LTIQCDEMWSFVGNKNNKQWLWLAIDIETQEIVGFYLGERGEKGA 78 >UniRef50_Q6MBQ1 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBQ1_PARUW Length = 138 Score = 62.3 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 1/67 (1%) Query: 27 VTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRL 86 VT + +V E+DE+W +VG K +WL+ + + V+A G R T L Sbjct: 65 VTCCEKDELEVARL-EVDERWSFVGNKKNDQWLWLILHKKSRQVLAMQVGPRDKKTAELL 123 Query: 87 MSLLSPF 93 + L Sbjct: 124 FTKLPES 130 >UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methanosarcina RepID=Q46CV2_METBF Length = 75 Score = 62.0 bits (149), Expect = 7e-09, Method: Composition-based stats. Identities = 22/54 (40%), Positives = 30/54 (55%) Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQ 152 MTD W Y L +H SK T +E +N L+ LARL RK+ ++KS+E Sbjct: 2 MTDHWRAYAEFLPENIHTQSKAETYTVEGYNGILKHFLARLRRKTKCYTKSIEM 55 >UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B7X2_EDWI9 Length = 99 Score = 62.0 bits (149), Expect = 7e-09, Method: Composition-based stats. Identities = 21/69 (30%), Positives = 36/69 (52%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 L+ F++ + D W + + +TQ ER++L LR + RL RK + FS++ Sbjct: 3 LTAFNIGMITRDDWGNPIREVPWGKPLTGTIFTQHSERNSLMLRTRIKRLARKRIGFSRA 62 Query: 150 VEQHDKVIG 158 + H+KV G Sbjct: 63 IALHEKVTG 71 >UniRef50_Q6MCH2 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCH2_PARUW Length = 121 Score = 60.8 bits (146), Expect = 2e-08, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 33/74 (44%), Gaps = 3/74 (4%) Query: 86 LMSLLSPFDVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQHLARLGRK 142 L L + TD + +Y H +SK T IER N RQ ARL RK Sbjct: 33 LQKLPESLKKAFYFTDKFNVYYETNPWSQHQPVSKQSGQTSYIERFNCTRRQRCARLVRK 92 Query: 143 SLSFSKSVEQHDKV 156 +LSFSK + H + Sbjct: 93 TLSFSKKLTNHIGL 106 >UniRef50_D1JFE2 Putative uncharacterized protein n=3 Tax=uncultured archaeon RepID=D1JFE2_9ARCH Length = 217 Score = 60.0 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 35/141 (24%), Positives = 53/141 (37%), Gaps = 37/141 (26%) Query: 57 RWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF-------DVVIWMTDGWPLYESR 109 W++ A K +AH G+R T L++L+ D +++DG Y Sbjct: 24 CWIYTAIKSDTKLHLAHCTGKRVQETANALVALVKNRGKAPDTDDKATFVSDGNNQYTKA 83 Query: 110 L-----------------KGKLHVISK-------------RYTQRIERHNLNLRQHLARL 139 L + V+ K T +ER+NL LR +++L Sbjct: 84 LFENFDVNAINYGQLVKERDNGRVVGKTRTIIFGSLEVDEIETVYVERYNLTLRHGISKL 143 Query: 140 GRKSLSFSKSVEQHDKVIGHY 160 RKSL FSK E D + Y Sbjct: 144 VRKSLCFSKCKEMLDDHLDLY 164 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 60.0 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 14/72 (19%), Positives = 26/72 (36%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWM 99 E+ E +V K + L+ R+ ++ V G + T L + + + Sbjct: 106 VGELHELETFVSDKKNKVLLWTLVYHFRQGILGWVVGNHSGDTFQPLWQAIGFWKCYFQV 165 Query: 100 TDGWPLYESRLK 111 TDG P+ Sbjct: 166 TDGNPVASRLYP 177 >UniRef50_UPI00016C465A IS1 transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C465A Length = 88 Score = 59.3 bits (142), Expect = 5e-08, Method: Composition-based stats. Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 3/68 (4%) Query: 96 VIWMTDGWPLYESRLKGKLH-VISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQ 152 V TD P + + H + K T IER L LRQ AR RK+L+FSK Sbjct: 12 VTVYTDLLPACRAAIPRARHRAVRKVTGLTAHIERFWLTLRQRCARFVRKTLTFSKCPRN 71 Query: 153 HDKVIGHY 160 H + ++ Sbjct: 72 HLGALWYF 79 >UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pestis RepID=C4GXL2_YERPN Length = 111 Score = 58.9 bits (141), Expect = 7e-08, Method: Composition-based stats. Identities = 16/44 (36%), Positives = 25/44 (56%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSL 89 W +VG K +QRWL+YA++ K ++AH+FG R+ Sbjct: 1 MWSFVGNKKQQRWLWYAWEPRLKRIIAHIFGRRSKRHFANYWGC 44 >UniRef50_A9GLN9 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLN9_SORC5 Length = 405 Score = 58.5 bits (140), Expect = 7e-08, Method: Composition-based stats. Identities = 38/178 (21%), Positives = 58/178 (32%), Gaps = 54/178 (30%) Query: 39 VCAEMDEQWGYVGAKS-----------RQRWLFYAYDRLRKTVVAHVFGERTMATLGRLM 87 + DE + YVG K + + F A D + V+A G+R M T G + Sbjct: 96 ELIQADEVFSYVGKKQARVTEKDAPGIGETYSFTALDTASRLVIAWRVGKRDMETCGPFI 155 Query: 88 SLLSPFDVVI--WMTDGWPLYESRLKGK-------------------------------- 113 + L +V+ TDG+ Y + + Sbjct: 156 ADLRSRLLVMPQITTDGFAPYIATVAEHFGLSVDYMQTVKNYRTGSYRGPDHRYEPPRDP 215 Query: 114 ---LHVI------SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLN 162 H I K T +ER N R L R+ R +FSK+ E H + + Sbjct: 216 FITKHTIYGAPDAKKASTSYVERLNGTTRHLLGRMRRLCYAFSKAPEHHRAAVALHYT 273 >UniRef50_A9GLP8 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLP8_SORC5 Length = 337 Score = 58.5 bits (140), Expect = 7e-08, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 56/179 (31%), Gaps = 58/179 (32%) Query: 37 VIVCAEMDEQWGYVGAKSRQR-----------WLFYAYDRLRKTVVAHVFGERTMAT--- 82 + +MDE W +V K + + + A D K ++ G+R Sbjct: 2 AVHVIQMDEMWSFVQKKQARVTAEDPAEHGDAYFYVALDANTKLAISFHVGKRDGENTEA 61 Query: 83 -LGRLMSLLSPFDVVIWMTDGWPLY------------------------ESRLKGKLH-- 115 + L S L+ V +DGW Y R + Sbjct: 62 FIKDLRSRLTV--VPHITSDGWQPYIEAMATSFRGSADYAQCVKNYRGGPQRSPDHRYEP 119 Query: 116 ----VISKR-----------YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGH 159 ++K T +ER NL R + R R L+FSK++ H IG Sbjct: 120 PRNPFVTKTPIFGAPKDELLSTSFVERFNLQTRHTVGRTRRLCLAFSKTLRGHRAAIGL 178 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 58.5 bits (140), Expect = 9e-08, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 31/63 (49%), Gaps = 1/63 (1%) Query: 36 DVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDV 95 ++ A++DE +VG+K W++ + ++ V G+R++ T L ++ + Sbjct: 66 EIPEIAQIDELQTFVGSKKT-IWVWTVVNTKLPGILKFVIGDRSLLTFTTLWQMIQGWAC 124 Query: 96 VIW 98 ++ Sbjct: 125 FLY 127 >UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 RepID=B2TXL7_SHIB3 Length = 44 Score = 57.3 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 32/34 (94%), Positives = 32/34 (94%) Query: 134 QHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 HLARLGRKSLSFSKSVE HDKVIGHYLNIKHYQ Sbjct: 11 THLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 44 >UniRef50_B0URB1 Putative uncharacterized protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0URB1_METS4 Length = 82 Score = 56.6 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 30/67 (44%) Query: 98 WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVI 157 + TD + Y + L H + K TQ +E +N R AR R++ SKSVE + + Sbjct: 4 FCTDNYAPYAAALPAGRHHVGKDQTQLVESNNARQRHWFARFRRRTCVVSKSVEMVEATM 63 Query: 158 GHYLNIK 164 + Sbjct: 64 ALFAFYH 70 >UniRef50_A8GX98 Transposase and inactivated derivative n=2 Tax=Rickettsia bellii RepID=A8GX98_RICB8 Length = 99 Score = 55.8 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 40/77 (51%), Gaps = 1/77 (1%) Query: 77 ERTMATLGRLMSLLSP-FDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQH 135 R +++ + L +++ I +D + +Y + K H +K+ T +E N +R + Sbjct: 4 GRDISSYLPMALRLEENYEIDISCSDHYDVYGAYKIAKRHYFTKKETALVESFNSLIRNY 63 Query: 136 LARLGRKSLSFSKSVEQ 152 LAR RK+ +SK+++ Sbjct: 64 LARFNRKTKRYSKAIDM 80 >UniRef50_A9FJP3 Putative uncharacterized protein n=5 Tax=Proteobacteria RepID=A9FJP3_SORC5 Length = 349 Score = 55.0 bits (131), Expect = 1e-06, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 50/176 (28%), Gaps = 54/176 (30%) Query: 40 CAEMDEQWGYVGAKSRQR-----------WLFYAYDRLRKTVVAHVFGERTMATLGRLMS 88 A+ DE W YV K + + F K ++++ G+R + Sbjct: 62 VAQCDEIWSYVQKKQSRVTASDPAEYGDAYTFVGMASASKLIISYRVGKRDEENTRAFVK 121 Query: 89 LLSP--FDVVIWMTDGWPLY------------------------------ESRLKGKLHV 116 L + TDGW Y + Sbjct: 122 DLRARLTTIPQLYTDGWQPYIGAVGASFTGGVDYCQVVKNYSRRPRRDDEVRYEPPRDPF 181 Query: 117 ISKR-----------YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 I+K T +ER N +R H+ R R FS+ + H + ++ Sbjct: 182 ITKTPIFGIPDVEHASTSHVERQNWTIRMHIRRFTRLCNGFSRKLANHRAAVALHV 237 >UniRef50_Q2FSQ2 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSQ2_METHJ Length = 201 Score = 54.6 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 51/147 (34%), Gaps = 37/147 (25%) Query: 57 RWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL-------SPFDVVIWMTDGWPLYESR 109 W + + R +A G+R + T ++ +P + + TDG Y Sbjct: 22 CWSYTCFKRDSGLFLAFESGKRNIDTCADMLVRFFNRMELPTPENKISIFTDGNVQYSIC 81 Query: 110 LK--------GKLHVISKRYTQR----------------------IERHNLNLRQHLARL 139 L VI + + IE +N +RQ L+R Sbjct: 82 LPELYCEPCLDYGQVIKVKEKNKLVYVIREKIMGNPDSKAISTSVIEGYNNKIRQRLSRF 141 Query: 140 GRKSLSFSKSVEQHDKVIGHYLNIKHY 166 GRK+ SFSK + + + + + ++ Sbjct: 142 GRKTASFSKKLNRFISALNIFQFVHNF 168 >UniRef50_Q7NJH9 Gsl1853 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJH9_GLOVI Length = 71 Score = 53.1 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 3/56 (5%) Query: 103 WPLYESRLKGKLH---VISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDK 155 Y L K H + T IER N +RQ + RL RK+LSFSK + H+ Sbjct: 2 LKNYGQVLASKRHRAAGKATGTTSCIERFNNTVRQRVGRLVRKALSFSKCLSNHNA 57 >UniRef50_B0CAP5 Putative uncharacterized protein n=3 Tax=Acaryochloris marina MBIC11017 RepID=B0CAP5_ACAM1 Length = 144 Score = 51.9 bits (123), Expect = 8e-06, Method: Composition-based stats. Identities = 26/108 (24%), Positives = 45/108 (41%), Gaps = 2/108 (1%) Query: 56 QRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGKL 114 + W+ + + V++ G+ T L+ W TDGW + ++ Sbjct: 5 ECWIALSLAKDSSLVLSGRIGKHTDELAQDLIENTEGKTTCHHWQTDGWEGSSRQPPDEV 64 Query: 115 -HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 H +SK TQR++R N LRQ R ++ F K +QH + + Sbjct: 65 IHHVSKVLTQRLKRTNGILRQQTGRWHQRQNKFGKVWQQHAVTLTLFY 112 >UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZKR3_ACAM1 Length = 241 Score = 50.8 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 34/78 (43%), Gaps = 4/78 (5%) Query: 38 IVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLM----SLLSPF 93 I EMDE+ GYV K +Q W A D K ++ G R + RLM L+ Sbjct: 122 IDVLEMDERHGYVAIKQQQCWDAVAIDAASKFIIQVEVGPRNTNLIDRLMRATHKRLAHP 181 Query: 94 DVVIWMTDGWPLYESRLK 111 ++ MTDG Y + Sbjct: 182 RDLVLMTDGDASYRTLFP 199 >UniRef50_UPI00018554DD transposase n=1 Tax=Francisella novicida FTG RepID=UPI00018554DD Length = 97 Score = 50.4 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 13/36 (36%), Positives = 20/36 (55%) Query: 35 SDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTV 70 D I E DE W ++G+K ++ W+ AYDR + Sbjct: 40 EDNISEIEFDEMWHFIGSKKKKCWIIKAYDRRVGKL 75 >UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8_SHIDS Length = 94 Score = 49.6 bits (117), Expect = 4e-05, Method: Composition-based stats. Identities = 19/48 (39%), Positives = 27/48 (56%), Gaps = 4/48 (8%) Query: 112 GKLHVISKR----YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDK 155 V K + +ER+NL LR + RL RK++ FS+SVE H+K Sbjct: 18 KDKQVTRKGIFIQHMLYLERNNLPLRTRIKRLARKTICFSRSVEIHEK 65 >UniRef50_Q648U8 Putative uncharacterized protein n=6 Tax=environmental samples RepID=Q648U8_9ARCH Length = 173 Score = 49.2 bits (116), Expect = 5e-05, Method: Composition-based stats. Identities = 19/39 (48%), Positives = 23/39 (58%) Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 T IER+NL LR ++RL RKSL FSK D + Y Sbjct: 81 TVYIERYNLTLRHGISRLVRKSLCFSKCKGMLDNHLDVY 119 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 49.2 bits (116), Expect = 5e-05, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 42/111 (37%), Gaps = 9/111 (8%) Query: 44 DEQWGYVGAKSRQRWLFYAYD---RLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 DE W Y+ +R + + + + G+R +T + L W++ Sbjct: 119 DEMWTYLYKNARAFYKWVFTCYVYTKLGVYLIYSVGDRDESTFLEVKKYLPDE--GRWVS 176 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 D + LY V+S E + +LR L R R + + ++S+ Sbjct: 177 DDYNLY--FWLKDHTVVSPVNPN--ESFHSSLRDRLIRFKRATKAINRSIR 223 >UniRef50_Q972H6 Putative uncharacterized protein ST1154 n=1 Tax=Sulfolobus tokodaii RepID=Q972H6_SULTO Length = 152 Score = 48.9 bits (115), Expect = 6e-05, Method: Composition-based stats. Identities = 22/112 (19%), Positives = 42/112 (37%), Gaps = 9/112 (8%) Query: 44 DEQWGYVGAKSRQRWLFYAYDR---LRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 DE W Y+ +R + + + + G+R T + L D W++ Sbjct: 27 DEMWTYLYRNTRAFYKWVFNCHVYTRLGLYIIYSVGDRDENTFREVKMYLP--DDGRWVS 84 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQ 152 D + +Y V+S E + +LR L R R + + ++S+ Sbjct: 85 DDYNVY--FWLKNHTVVSLVNPN--ESFHSSLRDRLVRFKRATKAVNRSINM 132 >UniRef50_Q64CQ0 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos1D1 RepID=Q64CQ0_9ARCH Length = 168 Score = 47.7 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 26/127 (20%), Positives = 43/127 (33%), Gaps = 37/127 (29%) Query: 71 VAHVFGERTMATLGRLMSLL-------SPFDVVIWMTDGWPLYESRLK------------ 111 +A G++T + GR+M + SP + TDG Y L Sbjct: 1 MAFSVGKQTQESCGRMMKKVFGRTEQPSPQTKMEMFTDGNDDYTYVLPDYCADACIEYGQ 60 Query: 112 ------------GKLHVI------SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQH 153 + +I T +E +N LR+ + RL RK+ FSK Sbjct: 61 LVKIRENGRVVRKEKRIIYGNPDLGDIETTDVENYNGILRERIGRLVRKTKCFSKRKRML 120 Query: 154 DKVIGHY 160 + + + Sbjct: 121 ECSLQVF 127 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 11/62 (17%), Positives = 28/62 (45%), Gaps = 2/62 (3%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTV 70 W Q+ P+ + + +++ E DE W +V +K+ + +++ DR + + Sbjct: 107 WLQNYVNNKLASVPRQIKVSDKLKGKLVI--ECDEMWSFVFSKTIKVYIWRLIDRNTREI 164 Query: 71 VA 72 + Sbjct: 165 IG 166 >UniRef50_Q8PRQ0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PRQ0_METMA Length = 129 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 18/43 (41%), Positives = 23/43 (53%) Query: 118 SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 S T ER NL +R LAR RK ++FSK+ H K I + Sbjct: 34 SYIGTSYAERINLTIRTSLARFIRKGMNFSKTKRMHQKAIDLF 76 >UniRef50_Q649W7 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos34A6 RepID=Q649W7_9ARCH Length = 217 Score = 46.5 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 51/170 (30%), Gaps = 67/170 (39%) Query: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLL-----SPFDVVIWMTDGWPL---------- 105 A + K V ++ G R L++ + ++ I+ TD W Sbjct: 2 VAQEAKTKLVTSYHVGRRAFEDAVELLAEMESRRDKSTELPIFTTDDWDAYKNALVEVYG 61 Query: 106 -------------------------YESRLK----------GKLHVISKRY--------- 121 Y +K K V Sbjct: 62 VEEQPEYKGRGRPPNSKKVPPPDLKYGQVIKYREGNEVTDVKKRVVFGNEEEVLSALKLA 121 Query: 122 -----TQRIERHNLNLRQHLARLGRKSLSFSKSVE---QHDKVIGHYLNI 163 T IER+NL +R ++RL RK+++FSK + H + + N+ Sbjct: 122 GNSINTSYIERNNLTVRNGVSRLIRKTINFSKRLNPLVMHLCLFFAWFNL 171 >UniRef50_Q6MCX8 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX8_PARUW Length = 72 Score = 45.8 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 18/59 (30%), Positives = 27/59 (45%), Gaps = 3/59 (5%) Query: 106 YESRLKGKLHV-ISKR--YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 Y + H + K+ T IER N L +R K+LSFSK + H +I ++ Sbjct: 2 YFESIPFGQHRPVGKQSDKTSYIERLNCTLGYRCSRFVGKTLSFSKKLINHIGMITSFI 60 >UniRef50_Q10ZQ2 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=Q10ZQ2_TRIEI Length = 44 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 2/40 (5%) Query: 128 HNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 N LRQ ++RL RK+LSFSK + H I ++ I HY Sbjct: 1 MNNTLRQRISRLVRKTLSFSKKLRSHLGDIWYF--INHYN 38 >UniRef50_B9K3D6 Transposase n=32 Tax=Bacteria RepID=B9K3D6_AGRVS Length = 243 Score = 44.2 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 46/132 (34%), Gaps = 11/132 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFD---VVIWM 99 +DE +G K + WL+ A D+ + V R RLM L + + Sbjct: 87 LDEVVISIGGK--KHWLWRAVDQDGFVLDVLVQSRRNAKAAKRLMRKLLKGQGRSPRVMI 144 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHN--LNLRQHLARLGRKSLSFSKSVEQH 153 TD Y + H K R E + + R+ + + + + + V H Sbjct: 145 TDKLRSYGAAKREIMPAVEHRSHKGLNNRAENSHQPIRRRERIMKRFKSARHLQRFVSIH 204 Query: 154 DKVIGHYLNIKH 165 D + + +H Sbjct: 205 DPIANLFQIPRH 216 >UniRef50_C7DAC3 Transposase n=36 Tax=Rhodobacterales RepID=C7DAC3_9RHOB Length = 237 Score = 44.2 bits (103), Expect = 0.002, Method: Composition-based stats. Identities = 25/126 (19%), Positives = 48/126 (38%), Gaps = 10/126 (7%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMT 100 +DE +V ++ +L+ A D + + A V R A + + L +T Sbjct: 85 VDEV--FVKVNGKRHYLWRAVDHEGEVLEAVVTKRRNKAAALKFLKKLMKRHGKAEEVVT 142 Query: 101 DGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVEQHD 154 D + Y++ L+ + + R+E +L R+ + R+ S K H Sbjct: 143 DRFAPYKAALRDLGALEKQSTGRWLNNRVENSHLPFRRRERAMQRFRRMRSLQKFAAVHS 202 Query: 155 KVIGHY 160 V H+ Sbjct: 203 SVYNHF 208 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 58/165 (35%), Gaps = 23/165 (13%) Query: 14 HDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAH 73 H F P+ K + + +++ SD +DE ++ K + +L+ A D + V+A Sbjct: 155 HKFAPYFKEKAKIFNAQLDLNSD---DWHVDETVVFISGK--KYYLWLAIDSETRFVLAF 209 Query: 74 VFGE-RTMATLGRLMSLLSPF-DVVIWMTDGWPLY----ESRLKGKLHV-----ISKRYT 122 + R LM+ ++TD P Y ++ L H+ S Sbjct: 210 HLTQARDSDAAFILMNQAKSMGKPNNFITDRLPSYNEAVKTVLNESTHIPVPPMSSDTNN 269 Query: 123 QRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 IE N + +K + + Y+ I HY Sbjct: 270 NLIESFNKTFKAWYK--AKKGFNSFEKANN-----LIYMFIFHYN 307 >UniRef50_A4AD66 Transposase n=19 Tax=unclassified Gammaproteobacteria RepID=A4AD66_9GAMM Length = 227 Score = 43.1 bits (100), Expect = 0.004, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 52/132 (39%), Gaps = 11/132 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF---DVVIWM 99 +DE + + K Q++L+ A D+ + V ++ +R A R L + + Sbjct: 75 IDEVFVTINGK--QQYLWRAVDQDGEVVDVYLQTKRDGAAAKRFFKRLLRSHGGEPRKIV 132 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGR-KSLSFSKS-VEQH 153 TD Y + +H+ + R E+ + R + R KS++ ++ V H Sbjct: 133 TDKLRSYGVAHRELIPETVHITEQYENNRAEQSHETTRARERGMRRFKSVAQAQRFVAAH 192 Query: 154 DKVIGHYLNIKH 165 V + +H Sbjct: 193 AAVFNLFNLGRH 204 >UniRef50_Q469A1 Putative uncharacterized protein n=1 Tax=Methanosarcina barkeri str. Fusaro RepID=Q469A1_METBF Length = 180 Score = 42.7 bits (99), Expect = 0.005, Method: Composition-based stats. Identities = 16/84 (19%), Positives = 31/84 (36%), Gaps = 11/84 (13%) Query: 41 AEMDEQWGYVGA--------KSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSP 92 EMDE W + + W++ A+ + ++ V G R +L+ + Sbjct: 81 IEMDELWIIIKKIVSRMKDYEDDGPWMWVAFVPGCQLILGFVIGPRKQYVTDKLVESVKK 140 Query: 93 F---DVVIWMTDGWPLYESRLKGK 113 + +++TDG Y L Sbjct: 141 HLSDKIPLFVTDGLNFYREALLKH 164 >UniRef50_Q6MD18 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MD18_PARUW Length = 89 Score = 42.3 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 14/32 (43%), Positives = 21/32 (65%) Query: 130 LNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 L LR AR RK+LSFSK + H ++I +++ Sbjct: 46 LLLRHRYARFVRKTLSFSKKLTNHIELIKYFI 77 >UniRef50_Q218S2 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisB18 RepID=Q218S2_RHOPB Length = 191 Score = 42.3 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 2/46 (4%) Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 T +ER NL+LR R R + FSK ++ H + Y + HY Sbjct: 56 TSYVERQNLSLRMGSRRFTRLTNGFSKKLDNHVAAVALY--VAHYN 99 >UniRef50_A9FZD9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZD9_SORC5 Length = 216 Score = 41.9 bits (97), Expect = 0.009, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 29/82 (35%), Gaps = 17/82 (20%) Query: 40 CAEMDEQWGYVGAKSRQR-----------WLFYAYDRLRKTVVAHVFGERTMAT----LG 84 +MDE W +V K + +L+ A D K ++ G+ + Sbjct: 63 VIQMDEMWSFVQKKQARVTAKDPAEHGDAYLYVALDANTKPAISFHVGKCDGENTEMFIK 122 Query: 85 RLMSLLSPFDVVIWMTDGWPLY 106 L L+ V +DGW Y Sbjct: 123 DLRGRLTV--VPHVTSDGWQPY 142 >UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C135_9GAMM Length = 372 Score = 41.5 bits (96), Expect = 0.010, Method: Composition-based stats. Identities = 20/50 (40%), Positives = 24/50 (48%) Query: 116 VISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKH 165 V S+ T IER NL RQ RL R+S FSK + D + L H Sbjct: 245 VRSEINTSFIERDNLTQRQSNRRLTRRSNGFSKELSWFDSPLWLSLAYYH 294 Score = 39.6 bits (91), Expect = 0.044, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 35/100 (35%), Gaps = 19/100 (19%) Query: 38 IVCAEMDEQWGYVGAKSRQR-------------WLFYAYDRLRKTVVAHVFGERTMATLG 84 + ++DE W ++ W++ A+ + + V+A V G Sbjct: 88 VTSLQLDELWSFILTLEHNCTEAKLYHESYGDAWVWLAFAPVWRVVLAFVIGSLPQKNAN 147 Query: 85 RLMSLLSPFD---VVIWMTDGWPLYESRLKGKLHVISKRY 121 L+ ++ + + +D + + L LH + Y Sbjct: 148 LLLDRVAHVTDAHIPFFTSDQFSSSRTAL---LHTYGQWY 184 >UniRef50_B9K4Q6 Transposase n=2 Tax=Alphaproteobacteria RepID=B9K4Q6_AGRVS Length = 232 Score = 40.8 bits (94), Expect = 0.016, Method: Composition-based stats. Identities = 23/95 (24%), Positives = 33/95 (34%), Gaps = 9/95 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVIWM 99 +DE V K R+ WL+ A D + A + R +LM L + + Sbjct: 85 LDEM--VVTFKGRKYWLWRAVDAEGYMLEALLQSRRNKKAALKLMRKLLKGQGLTPRVMV 142 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNL 130 TD Y + G H K R E +L Sbjct: 143 TDKLRSYDAAKRDIMPGVEHRSHKGLNNRAENSHL 177 >UniRef50_Q11MN9 Transposase n=37 Tax=Bacteria RepID=Q11MN9_MESSB Length = 237 Score = 40.8 bits (94), Expect = 0.018, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 34/95 (35%), Gaps = 9/95 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVIWM 99 +DE V K ++ WL+ A D + A + R A RLM L + + Sbjct: 81 LDEM--VVTIKGKKYWLWRAVDTNGYVLDALLQSRRNKAAAMRLMRKLLKDQGTAPRVMV 138 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNL 130 TD Y + G H K R E +L Sbjct: 139 TDKLRSYSAAKSQLMPGVEHRSHKGLNNRAENSHL 173 >UniRef50_B9K5F7 Transposase n=3 Tax=Bacteria RepID=B9K5F7_AGRVS Length = 196 Score = 40.8 bits (94), Expect = 0.020, Method: Composition-based stats. Identities = 24/127 (18%), Positives = 43/127 (33%), Gaps = 11/127 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFD---VVIWM 99 +DE V + ++ WL+ A D+ + V R LM L + + Sbjct: 43 LDE--AVVSIRGKKHWLWRAVDQDGFVLDVLVQSRRNAKAARHLMRQLLKGQGRAPRVMI 100 Query: 100 TDGWPLYE----SRLKGKLHVISKRYTQRIERHN--LNLRQHLARLGRKSLSFSKSVEQH 153 TD Y G H K + R E + + R+ + + + + V H Sbjct: 101 TDKLRSYGAAKWELTPGVEHRSHKGLSNRAENFHQPVRRRERIMKRFKSQRHLQRFVSIH 160 Query: 154 DKVIGHY 160 D + + Sbjct: 161 DPIANLF 167 >UniRef50_A0LBE3 Putative uncharacterized protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0LBE3_MAGSM Length = 116 Score = 39.6 bits (91), Expect = 0.039, Method: Composition-based stats. Identities = 15/38 (39%), Positives = 21/38 (55%) Query: 120 RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVI 157 T +ER+N R A GRK+L FSK + H+ V+ Sbjct: 23 IKTAFVERNNATDRHQNAHKGRKTLCFSKGWDVHNAVM 60 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 39.6 bits (91), Expect = 0.041, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 46/124 (37%), Gaps = 15/124 (12%) Query: 52 AKSRQRWLFYAYDRLRKTVVAHVF--GERTMATLGRLMSLLSPF--DVVIWMTDGWPLYE 107 K WL+ A D + ++ G RT+ ++ + +TD Y Sbjct: 197 NKGHGNWLWSAIDPRTRYLLCTRIAEGSRTLPDAESVIREARKMSEEPDYMITDSLRSYA 256 Query: 108 SR----LKGKLHVISK----RYTQ-RIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIG 158 + L H+ +K +T IER++ +R+ L + L + S + ++ Sbjct: 257 TAAAKCLPRTAHIKTKAIRDGFTNMAIERYHNEIREKLKSC--RGLHSADSAQIFMDLLR 314 Query: 159 HYLN 162 + N Sbjct: 315 IHHN 318 >UniRef50_A5N1B9 Transposase n=2 Tax=Clostridium kluyveri RepID=A5N1B9_CLOK5 Length = 127 Score = 39.6 bits (91), Expect = 0.044, Method: Composition-based stats. Identities = 20/99 (20%), Positives = 35/99 (35%), Gaps = 13/99 (13%) Query: 48 GYVGAKSRQRWLFYAYDRLRKTVVAHVFGE-RTMATLGRLM---SLLSPFDVVIWMTDGW 103 YV K +L+ D + +++ V R +L S+L+ +TD W Sbjct: 28 TYVKIKGIDYYLWLILDSKTRVIISFVLSRFRNSTQAYKLFFYSSILTRTSPKKIVTDKW 87 Query: 104 PLYESRLKG-KLHVISKRYT--------QRIERHNLNLR 133 Y +K H + +Y+ IE N + Sbjct: 88 DAYNEAIKNLHCHTLHHKYSAFSEDLNNNFIESFNKTFK 126 >UniRef50_A7C324 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C324_9GAMM Length = 137 Score = 39.2 bits (90), Expect = 0.057, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 24/45 (53%) Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 T IER NL LRQH++ L RK+L + K ++ L +Y Sbjct: 41 TSFIERFNLTLRQHVSYLTRKTLGYCKKKANFKYILWINLYNYNY 85 >UniRef50_Q0W4E9 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4E9_UNCMA Length = 160 Score = 38.8 bits (89), Expect = 0.063, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 35/113 (30%), Gaps = 31/113 (27%) Query: 71 VAHVFGERTMATLGRLMSLLSPF---DVVIWMTDGWPLYESRLKGKLH------------ 115 + G T T ++S +S V +DG Y L Sbjct: 1 MGFSVGRWTQGTCRVMLSQVSNSVQDGVFTVYSDGNDDYYYTLTDFFQEVRYGQLVKIRE 60 Query: 116 ---VISKR-------------YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQ 152 V+ K T +E N LR + RL RK+ +FSK E Sbjct: 61 KGRVVGKEIRVLIGDVDSEQVETFNVENFNSILRGRVGRLVRKTKTFSKIPEM 113 >UniRef50_A3W3Q5 Transposase n=1 Tax=Roseovarius sp. 217 RepID=A3W3Q5_9RHOB Length = 180 Score = 38.5 bits (88), Expect = 0.080, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 41/134 (30%), Gaps = 19/134 (14%) Query: 10 RWPQHDFPPFKK--LRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLR 67 RW P + R Q+ + +V+V R+ WL+ A D+ Sbjct: 22 RWTAKFGPQIARNLRRRQARPGDVWHLDEVVVKIS-----------GRKFWLWRAVDQHG 70 Query: 68 KTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMTDGWPLY----ESRLKGKLHVISKRY 121 + V +R R++ L +TD Y G H K Sbjct: 71 VVLEEIVQSKRDKRAAKRVLRRLIKCYGLPKRIVTDKLRAYGAAKREVAPGLDHWSHKDL 130 Query: 122 TQRIERHNLNLRQH 135 R E +L R+ Sbjct: 131 NNRAENSHLPFRKR 144 >UniRef50_A9EF44 Transposase n=2 Tax=Rhodobacteraceae RepID=A9EF44_9RHOB Length = 156 Score = 38.5 bits (88), Expect = 0.096, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 35/105 (33%), Gaps = 8/105 (7%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMT 100 MDE + K + WL+ A D + V R + R + L + + +T Sbjct: 1 MDEVVITIRGK--KHWLWRAIDADGDVLDILVQTRRNAKSAKRFLQRLVSQFGEPRVVIT 58 Query: 101 DGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGR 141 D Y ++ H K IE + R+ G+ Sbjct: 59 DKLRSYLKPVKTLTPNADHRAHKGLNNAIEVSHRPTRKREKIFGK 103 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=r... 210 1e-53 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 179 3e-44 UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Ta... 166 2e-40 UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyan... 163 2e-39 UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis... 151 1e-35 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 149 3e-35 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 143 2e-33 UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO 139 3e-32 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 138 7e-32 UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepI... 137 1e-31 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 136 3e-31 UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1... 133 2e-30 UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldoc... 128 7e-29 UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ru... 127 2e-28 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 123 2e-27 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 122 6e-27 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 118 7e-26 UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C... 115 7e-25 UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 112 5e-24 UniRef50_O67144 Putative uncharacterized protein n=1 Tax=Aquifex... 111 7e-24 UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis R... 107 1e-22 UniRef50_C0A223 Putative uncharacterized protein n=1 Tax=Opituta... 107 2e-22 UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanoba... 100 2e-20 UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methano... 98 8e-20 UniRef50_A9FJP3 Putative uncharacterized protein n=5 Tax=Proteob... 98 8e-20 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 98 1e-19 UniRef50_A9GLP8 Putative uncharacterized protein n=1 Tax=Sorangi... 96 5e-19 UniRef50_A9GLN9 Putative uncharacterized protein n=2 Tax=Sorangi... 95 1e-18 UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ... 94 1e-18 UniRef50_Q9CJQ7 Putative uncharacterized protein n=2 Tax=Pasteur... 94 1e-18 UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN 94 1e-18 UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida... 92 8e-18 UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryoc... 91 2e-17 UniRef50_B0CAP5 Putative uncharacterized protein n=3 Tax=Acaryoc... 90 2e-17 UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q... 88 2e-16 UniRef50_Q972H6 Putative uncharacterized protein ST1154 n=1 Tax=... 87 2e-16 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 86 5e-16 UniRef50_D1JFE2 Putative uncharacterized protein n=3 Tax=uncultu... 84 1e-15 UniRef50_B9K3D6 Transposase n=32 Tax=Bacteria RepID=B9K3D6_AGRVS 84 2e-15 UniRef50_Q2FSQ2 Putative uncharacterized protein n=1 Tax=Methano... 83 3e-15 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 83 3e-15 UniRef50_Q10VW0 ISSru3, transposase InsB n=1 Tax=Trichodesmium e... 82 6e-15 UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID... 79 5e-14 UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl... 79 6e-14 UniRef50_C7DAC3 Transposase n=36 Tax=Rhodobacterales RepID=C7DAC... 78 1e-13 UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ... 76 4e-13 UniRef50_Q6MBQ1 Putative uncharacterized protein n=2 Tax=Candida... 76 6e-13 UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus p... 74 2e-12 UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC... 73 4e-12 UniRef50_Q64CQ0 Putative uncharacterized protein n=1 Tax=uncultu... 72 7e-12 UniRef50_Q6MCH2 Putative uncharacterized protein n=1 Tax=Candida... 72 7e-12 UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methan... 71 2e-11 UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae... 69 5e-11 UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryoc... 69 6e-11 UniRef50_A8GX98 Transposase and inactivated derivative n=2 Tax=R... 67 2e-10 UniRef50_UPI00016C465A IS1 transposase n=1 Tax=Gemmata obscurigl... 67 2e-10 UniRef50_B0URB1 Putative uncharacterized protein n=1 Tax=Methylo... 67 2e-10 UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros... 66 4e-10 UniRef50_Q10ZU2 Putative uncharacterized protein n=3 Tax=Trichod... 66 6e-10 UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis Rep... 66 7e-10 UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-14... 65 1e-09 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 63 5e-09 UniRef50_Q648U8 Putative uncharacterized protein n=6 Tax=environ... 60 3e-08 UniRef50_Q32DI9 Iso-IS1 ORF2 n=2 Tax=Shigella RepID=Q32DI9_SHIDS 60 3e-08 UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pest... 60 3e-08 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 60 4e-08 UniRef50_Q649W7 Putative uncharacterized protein n=1 Tax=uncultu... 58 1e-07 UniRef50_Q7NJH9 Gsl1853 protein n=1 Tax=Gloeobacter violaceus Re... 56 3e-07 UniRef50_Q6MCX8 Putative uncharacterized protein n=2 Tax=Candida... 56 6e-07 UniRef50_Q8PRQ0 Putative uncharacterized protein n=1 Tax=Methano... 56 6e-07 UniRef50_UPI00018554DD transposase n=1 Tax=Francisella novicida ... 51 2e-05 UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8... 50 3e-05 UniRef50_Q10ZQ2 Putative uncharacterized protein n=7 Tax=Cyanoba... 47 2e-04 UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 Rep... 47 2e-04 Sequences not found previously or not previously below threshold: UniRef50_B9K5F7 Transposase n=3 Tax=Bacteria RepID=B9K5F7_AGRVS 74 1e-12 UniRef50_Q2G895 Transposase n=36 Tax=Alphaproteobacteria RepID=Q... 68 1e-10 UniRef50_A4AD66 Transposase n=19 Tax=unclassified Gammaproteobac... 68 2e-10 UniRef50_Q11MN9 Transposase n=37 Tax=Bacteria RepID=Q11MN9_MESSB 67 2e-10 UniRef50_A3W3Q5 Transposase n=1 Tax=Roseovarius sp. 217 RepID=A3... 64 2e-09 UniRef50_B9K4Q6 Transposase n=2 Tax=Alphaproteobacteria RepID=B9... 64 2e-09 UniRef50_A9FZD9 Putative uncharacterized protein n=1 Tax=Sorangi... 63 4e-09 UniRef50_Q2G8C0 Transposase n=5 Tax=Alphaproteobacteria RepID=Q2... 62 6e-09 UniRef50_Q0RZ53 Transposase n=23 Tax=Bacteria RepID=Q0RZ53_RHOSR 62 8e-09 UniRef50_A3XA77 Putative transposase n=1 Tax=Roseobacter sp. MED... 60 2e-08 UniRef50_A9EF44 Transposase n=2 Tax=Rhodobacteraceae RepID=A9EF4... 60 2e-08 UniRef50_Q0RWC6 Transposase n=24 Tax=Bacteria RepID=Q0RWC6_RHOSR 58 1e-07 UniRef50_Q0W4E9 Putative uncharacterized protein n=1 Tax=uncultu... 56 3e-07 UniRef50_Q218S2 Putative uncharacterized protein n=1 Tax=Rhodops... 54 2e-06 UniRef50_A8LAQ0 Integrase catalytic region n=1 Tax=Frankia sp. E... 54 3e-06 UniRef50_B4WST7 Putative uncharacterized protein n=3 Tax=Synecho... 52 5e-06 UniRef50_Q8TRX5 Predicted protein n=3 Tax=Methanosarcina acetivo... 52 7e-06 UniRef50_B4WU12 Putative uncharacterized protein n=1 Tax=Synecho... 52 9e-06 UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=... 52 1e-05 UniRef50_A5N1B9 Transposase n=2 Tax=Clostridium kluyveri RepID=A... 51 1e-05 UniRef50_A9HNK8 Transposase, putative n=1 Tax=Roseobacter litora... 50 3e-05 UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiat... 50 3e-05 UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nod... 49 5e-05 UniRef50_Q469A1 Putative uncharacterized protein n=1 Tax=Methano... 49 7e-05 UniRef50_C6GYT4 IS1216, transposase (Fragment) n=121 Tax=root Re... 48 1e-04 UniRef50_C7S9U1 IS6100 transposase n=358 Tax=root RepID=C7S9U1_E... 48 1e-04 UniRef50_C5AG18 IS element transposase n=2 Tax=Burkholderia RepI... 48 1e-04 UniRef50_A9VUP9 Integrase catalytic region n=149 Tax=Bacteria Re... 47 2e-04 UniRef50_A3NK27 IS6 family transposase n=29 Tax=Burkholderia Rep... 47 3e-04 UniRef50_A9EF82 Transposase, putative n=1 Tax=Oceanibulbus indol... 46 3e-04 UniRef50_B9K4C6 Transposase n=4 Tax=Proteobacteria RepID=B9K4C6_... 46 4e-04 UniRef50_Q6MAQ6 Putative uncharacterized protein n=1 Tax=Candida... 46 5e-04 UniRef50_A9VUQ5 Integrase catalytic region n=24 Tax=Bacteria Rep... 46 6e-04 UniRef50_B6ET23 Transposase n=7 Tax=Gammaproteobacteria RepID=B6... 45 7e-04 UniRef50_Q6MBH4 Putative uncharacterized protein n=1 Tax=Candida... 45 8e-04 UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellula... 45 8e-04 UniRef50_B8IVA0 Transposase and inactivated derivatives-like pro... 45 0.001 UniRef50_B9K4Z7 Transposase n=5 Tax=Bacteria RepID=B9K4Z7_AGRVS 45 0.001 UniRef50_Q6MD18 Putative uncharacterized protein n=2 Tax=Candida... 45 0.001 UniRef50_Q64DD5 Putative uncharacterized protein n=1 Tax=uncultu... 44 0.001 UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus p... 44 0.001 UniRef50_Q8PWV9 Putative uncharacterized protein n=1 Tax=Methano... 44 0.002 UniRef50_A9EG16 Transposase n=1 Tax=Oceanibulbus indolifex HEL-4... 44 0.002 UniRef50_C6IUV9 Transposase n=4 Tax=Bacteroides RepID=C6IUV9_9BACE 43 0.003 UniRef50_A8LH39 Integrase catalytic region n=26 Tax=Bacteria Rep... 43 0.004 UniRef50_A0YAP3 Transposase n=1 Tax=marine gamma proteobacterium... 43 0.004 UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synecho... 43 0.004 UniRef50_A0NLN9 Putative IS6 family transposase n=1 Tax=Labrenzi... 42 0.006 UniRef50_A0LBE3 Putative uncharacterized protein n=1 Tax=Magneto... 42 0.006 UniRef50_Q2GA88 Transposase, putative n=1 Tax=Novosphingobium ar... 42 0.007 UniRef50_UPI00016C51C4 hypothetical protein GobsU_02291 n=6 Tax=... 42 0.008 UniRef50_A3W3V5 Putative IS6 family transposase n=1 Tax=Roseovar... 42 0.009 UniRef50_B5WJN4 Integrase, catalytic region n=1 Tax=Burkholderia... 42 0.009 UniRef50_B9LVP3 Transposase n=20 Tax=Halobacteriaceae RepID=B9LV... 41 0.011 UniRef50_C2JSP7 IS431mec transposase n=2 Tax=Enterococcus faecal... 41 0.017 UniRef50_A7C324 Putative uncharacterized protein n=3 Tax=Beggiat... 40 0.023 UniRef50_A5L0K3 Putative transposase n=1 Tax=Vibrionales bacteri... 40 0.030 UniRef50_Q8R819 Transposase n=2 Tax=Thermoanaerobacter tengconge... 40 0.033 UniRef50_C6KV49 Transposase n=2 Tax=Bacteria RepID=C6KV49_9BACT 40 0.033 UniRef50_Q1J2T9 Transposase n=3 Tax=Deinococcus geothermalis DSM... 39 0.041 UniRef50_P0C1L0 Transposase for insertion sequence-like element ... 39 0.072 UniRef50_C2SK84 Transposase for insertion sequence element IS257... 39 0.078 UniRef50_Q1WCQ1 COG3316 n=1 Tax=Streptococcus thermophilus RepID... 39 0.079 UniRef50_B2JY75 Integrase catalytic region n=5 Tax=Burkholderia ... 39 0.086 >UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=root RepID=INSB4_ECOLI Length = 167 Score = 210 bits (534), Expect = 1e-53, Method: Composition-based stats. Identities = 164/167 (98%), Positives = 165/167 (98%) Query: 1 MPGNRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 MPGN PHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF Sbjct: 1 MPGNSPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 Query: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR Sbjct: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 Query: 121 YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 YTQRIER+NLNLRQHLARLGRKSLSFSKSVE HDKVIGHYLNIKHYQ Sbjct: 121 YTQRIERYNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 179 bits (454), Expect = 3e-44, Method: Composition-based stats. Identities = 64/148 (43%), Positives = 94/148 (63%) Query: 19 FKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGER 78 KKL P+ +TS +DV E+DEQW YVG+K+RQ W++YAY+ V+A+ FG R Sbjct: 83 LKKLAPKRITSSPVTHADVAFICELDEQWSYVGSKARQHWIWYAYNTKTGGVLAYTFGPR 142 Query: 79 TMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLAR 138 T T L++LL+PF++ + +D W Y + H+ K +TQ IER+NL LR + R Sbjct: 143 TDQTCRELLALLTPFNIGMLTSDDWGSYGREVPKNKHLTGKIFTQCIERNNLTLRTRIKR 202 Query: 139 LGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 LGRK++ FS+SVE H+KVIG ++ + Sbjct: 203 LGRKTICFSRSVEIHEKVIGAFIEKHMF 230 >UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Tax=Gammaproteobacteria RepID=INBN_SHIDY Length = 131 Score = 166 bits (421), Expect = 2e-40, Method: Composition-based stats. Identities = 56/130 (43%), Positives = 85/130 (65%) Query: 37 VIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVV 96 + + E+DEQW +VG+K+RQ WL+YAY+ V+A+ FG RT T L++LL+PF++ Sbjct: 1 MALICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCRELLALLTPFNIG 60 Query: 97 IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 + +D W Y + H+ K +TQRIER+NL LR + RL RK++ FS+SVE H+KV Sbjct: 61 MLTSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRIKRLARKTICFSRSVEIHEKV 120 Query: 157 IGHYLNIKHY 166 IG ++ + Sbjct: 121 IGTFIEKHMF 130 >UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyanobacteria RepID=B4WT39_9SYNE Length = 243 Score = 163 bits (411), Expect = 2e-39, Method: Composition-based stats. Identities = 54/179 (30%), Positives = 75/179 (41%), Gaps = 27/179 (15%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTV 70 W Q P+ + + G + E DE W +VG+KS ++W++ A +R + Sbjct: 65 WLQQYASEEYADVPRQAKTSPKKG---PLTLECDEAWSFVGSKSNKQWIWLAINRDTRET 121 Query: 71 VAHVFGERTMATLGRLMSLLSP--FDVVIWMTDGWP-----------------LYESRLK 111 + G R L + L P + TD W YE L Sbjct: 122 IGMHIGGRNREGARSLWACLPPVYRQCAVCYTDFWERCDPASLCGARERAPRQAYEIVLP 181 Query: 112 GKLHV-ISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 K H +SK T IER N LRQ ++RL RKSLSFSK +E H I ++ I HY Sbjct: 182 SKRHRAVSKNSGQTNHIERFNCTLRQRVSRLVRKSLSFSKKLENHIGAIWYF--IHHYN 238 >UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1IXR6_DEIGD Length = 148 Score = 151 bits (380), Expect = 1e-35, Method: Composition-based stats. Identities = 53/148 (35%), Positives = 73/148 (49%), Gaps = 8/148 (5%) Query: 24 PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATL 83 Q+V + P +V+V E+DE W +VG K + RWL+ A +R + V+A V G+R+ T Sbjct: 2 RQTVPVCLTPPEEVVV--ELDELWTFVGKKKQARWLWIALERSTRKVLAWVLGDRSEQTA 59 Query: 84 GRLMSLLS----PFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARL 139 +L L + TD W Y+ L G + K T +ER N LRQ L RL Sbjct: 60 FKLWDRLPLSPEQRLKGTFCTDLWRAYDEPLLGVKRLTRKGETNHVERLNCTLRQRLGRL 119 Query: 140 GRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 RKSLSFSKS E + + Y Sbjct: 120 VRKSLSFSKSDEMLEASLTL--AFHRYN 145 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 149 bits (377), Expect = 3e-35, Method: Composition-based stats. Identities = 48/155 (30%), Positives = 72/155 (46%), Gaps = 12/155 (7%) Query: 9 GRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRK 68 G W Q K R Q+V EMDE W YVG+K ++ W+++A +R Sbjct: 81 GEWIQAYHNQNKPKRRQAV-----------EVIEMDEMWHYVGSKKKKLWIWFALERSGG 129 Query: 69 TVVAHVFGERTMATLGRLMSLLSPFDVVIWM-TDGWPLYESRLKGKLHVISKRYTQRIER 127 +++ V G R +T RL + + TD WP Y + H +SK+ T IE Sbjct: 130 SILDFVTGSREASTGKRLWIKIKDIACRSFYATDHWPAYTQFINAHKHKVSKKQTTHIES 189 Query: 128 HNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLN 162 HN N+R +LAR RK+ +SKS + + + Sbjct: 190 HNANVRHYLARFRRKTKCYSKSERLVELSLYLLIY 224 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. Identities = 45/146 (30%), Positives = 71/146 (48%), Gaps = 9/146 (6%) Query: 31 IQPGSDVI-VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERT--------MA 81 P +VI E+DE +VG+K + WL+ A + + ++A V G+ + Sbjct: 87 DVPEENVIPEVGELDELETFVGSKKTKIWLWTAVNHFTQGILAWVLGDHSLVLSEVEVAE 146 Query: 82 TLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGR 141 T L + + ++TDGW +Y S + ++SK Y R+E N LR +LARL R Sbjct: 147 TFKPLWENIEKWKCYFYVTDGWKVYPSFIPDGDQIVSKTYMTRVENENTRLRHYLARLHR 206 Query: 142 KSLSFSKSVEQHDKVIGHYLNIKHYQ 167 K+L +SKS + I L+ YQ Sbjct: 207 KTLCYSKSEQILRYSIKLLLHYLKYQ 232 >UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO Length = 138 Score = 139 bits (350), Expect = 3e-32, Method: Composition-based stats. Identities = 48/127 (37%), Positives = 73/127 (57%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 AE+D+ +V K +RWL++A D T++A+V G+RT +L ++L PF + + T Sbjct: 8 AEVDKMKIFVAKKEHERWLWHAIDHQTGTILAYVLGQRTDQMFLKLKTMLKPFGISEFYT 67 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 D W Y+ L + +SK Q+IER +L LR + RL RK++ FSK HD VIG Y Sbjct: 68 DNWGSYKRHLSDEQRTVSKYKMQKIERKHLTLRTRIKRLQRKTICFSKISPMHDLVIGLY 127 Query: 161 LNIKHYQ 167 +N + Sbjct: 128 INKYEFH 134 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 138 bits (347), Expect = 7e-32, Method: Composition-based stats. Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 2/133 (1%) Query: 33 PGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSP 92 P E+DE W +VG KS + WL YA+DR+ K ++++V+G+R T+ RL L Sbjct: 95 PHHCFYESIEIDEFWTFVGRKSERVWLIYAFDRVSKKIISYVWGKRNSETVMRLKIQLCK 154 Query: 93 FDVVI--WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + +D W + KG H + ++YT IE ++ LR + R RKS +FSKS+ Sbjct: 155 SQISFRYVYSDRWICFRKIFKGYPHYLGRKYTIGIEGNHCLLRHRVRRFFRKSCNFSKSL 214 Query: 151 EQHDKVIGHYLNI 163 + H + Sbjct: 215 KYHFSAFRLMIWF 227 >UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepID=B2K0W2_YERPB Length = 122 Score = 137 bits (344), Expect = 1e-31, Method: Composition-based stats. Identities = 59/121 (48%), Positives = 81/121 (66%), Gaps = 1/121 (0%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPL 105 W +VG K +QRWL+YA++ K ++AHVFG R+ T +L+ LLS F++V W TD + Sbjct: 1 MWSFVGNKKQQRWLWYAWEPRLKRIIAHVFGRRSKKTFRQLLGLLSGFNIVFWCTDNFSA 60 Query: 106 YESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKH 165 Y L + H+ SK YTQRIER NLN+R L RL RK+L SKS E HD++IG ++ +H Sbjct: 61 Y-EMLPDEKHIRSKLYTQRIERENLNIRNRLKRLNRKTLGDSKSAEMHDRIIGTFIEREH 119 Query: 166 Y 166 Y Sbjct: 120 Y 120 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 136 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 34/121 (28%), Positives = 60/121 (49%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTD 101 E+DE W ++G K W+ YA ++ +V+ G +T + L++ + TD Sbjct: 97 EVDELWSFIGNKKNSTWITYAIEQKTGSVIDFFVGRKTKENIKPLINKVLLLQPTRIYTD 156 Query: 102 GWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 +Y S + ++H + T +IER NL LR H+ RL R+++ FS+ E + + Y Sbjct: 157 RLNIYPSLIPKEMHKRFQYCTNKIERMNLTLRTHIKRLSRRTICFSRKQEYLEAHLKIYF 216 Query: 162 N 162 Sbjct: 217 W 217 >UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1Z5_ACAM1 Length = 130 Score = 133 bits (334), Expect = 2e-30, Method: Composition-based stats. Identities = 39/127 (30%), Positives = 60/127 (47%), Gaps = 7/127 (5%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGW 103 W +V KS ++W++ A D + + +V G R+ +L + L + TD W Sbjct: 1 MWSFVNDKSNKQWIWLALDVITREIVGVYVGARSKQGARQLWNSLPGIYRQCAVAYTDFW 60 Query: 104 PLYESRLKGKLH-VISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 Y + H + K T IER N +RQ ++RL RK+LSFSK +E H I + Sbjct: 61 DAYGCVFPKQRHQAVGKETGQTCYIERFNCTMRQRVSRLVRKTLSFSKKLENHIGAIWMF 120 Query: 161 LNIKHYQ 167 + HY Sbjct: 121 --VHHYN 125 >UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldococcus infernus ME RepID=C5U8R9_9EURY Length = 133 Score = 128 bits (321), Expect = 7e-29, Method: Composition-based stats. Identities = 40/129 (31%), Positives = 69/129 (53%), Gaps = 3/129 (2%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSP--FDVV 96 + E+DE +V +K + W++ A D+ ++AH G+R+ +L +L+ + D Sbjct: 4 IHLEIDEMHSFVRSKDNKVWIWIAVDKNTGLIIAHKTGDRSDKSLKKLLKEIPKKVLDKC 63 Query: 97 IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 + TD W Y + L + H I K YT+R+ER L R ARL R+ + +SKS+E H+ + Sbjct: 64 TFYTDKWKAY-NILPNERHKIGKEYTRRVERTFLTFRNSCARLVRRGIRYSKSMEMHNII 122 Query: 157 IGHYLNIKH 165 I + + Sbjct: 123 IDLLVYFYN 131 >UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N0_SALRD Length = 158 Score = 127 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 41/154 (26%), Positives = 63/154 (40%), Gaps = 6/154 (3%) Query: 17 PPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFG 76 K SV ++P + E+DE W YV ++ +RWL+ A R + VVA V G Sbjct: 2 AQKKGRESDSVAEGLRPAEEG-DVLELDECWTYVRERANKRWLWVALCRRTRQVVAFVIG 60 Query: 77 ERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRL---KGKLHV-ISKRYTQRIERHNLN 131 +R+ T RL S + + +D W Y V S +ER Sbjct: 61 DRSARTCARLWSRIPEEYRQGRSFSDFWKSYRPVFAGDPSHRQVGKSSGEMAHVERFFGR 120 Query: 132 LRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKH 165 LRQ LAR R++ + S+S ++ + Sbjct: 121 LRQKLARYVRRTRAASESERMLHLTTKLFVEWYN 154 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 123 bits (308), Expect = 2e-27, Method: Composition-based stats. Identities = 47/132 (35%), Positives = 64/132 (48%) Query: 32 QPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS 91 QP E+DE Y+G+K WL YA D+ KTVV+ +RT TL R++ L Sbjct: 98 QPIISKCKTYEVDEMCTYIGSKQNFIWLVYALDKNSKTVVSFNVAKRTNKTLSRVLDTLK 157 Query: 92 PFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 + T Y L K+H + + T IER NL LR HL RL R+++ SKS+ Sbjct: 158 LSEAKKIFTGRLKNYRYLLDEKMHSVKRFGTNHIERKNLTLRTHLKRLNRRTICSSKSLL 217 Query: 152 QHDKVIGHYLNI 163 V+ Y I Sbjct: 218 IFTAVLKIYFWI 229 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 122 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 41/143 (28%), Positives = 69/143 (48%) Query: 20 KKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERT 79 K + + ++P + E+DE Y +K+ +RW+ AY R K V+ + G RT Sbjct: 80 LKKILKIASKVVKPPIPQNITIEIDELKTYTQSKTNERWVVAAYCRETKKVIDYKLGRRT 139 Query: 80 MATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARL 139 TL ++ L + +D +Y + LH +R T IER L+LR H+ RL Sbjct: 140 TKTLQCIIDTLLYANPKKIYSDRLNIYPKLIPKHLHSTKRRETNHIERKFLDLRTHIKRL 199 Query: 140 GRKSLSFSKSVEQHDKVIGHYLN 162 GRKS++ ++ + D ++ Y Sbjct: 200 GRKSINKAQRDKYTDAILRIYFW 222 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 118 bits (295), Expect = 7e-26, Method: Composition-based stats. Identities = 45/138 (32%), Positives = 68/138 (49%), Gaps = 3/138 (2%) Query: 31 IQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATL--GRLMS 88 I P C E+DE W +VG K+ ++WL YAY R +VA+V+G+R + T+ + Sbjct: 93 ITPKQRQYDCLEIDELWTFVGKKTNKQWLIYAYHRDTGEIVAYVWGKRDLNTVKKLKAKL 152 Query: 89 LLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK 148 +D W + + KG VI K +T IE +N +R + R R+S +FSK Sbjct: 153 KALGVSCARIASDTWDSFVTGFKGFTQVIGKFFTVGIEGNNCTIRHRVRRAFRRSCNFSK 212 Query: 149 SVEQHDKVIGH-YLNIKH 165 +E H K + I H Sbjct: 213 KLENHFKAFDLAFFYINH 230 >UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C5BB57_EDWI9 Length = 131 Score = 115 bits (287), Expect = 7e-25, Method: Composition-based stats. Identities = 40/107 (37%), Positives = 63/107 (58%) Query: 50 VGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESR 109 +G+K+RQ WL+YAY+ V+A+ FG +T + L+ L++PF++ + +D Sbjct: 1 MGSKARQHWLWYAYNTKTGGVLAYTFGPKTDESCRELLVLITPFNIGMITSDNRSSDGRE 60 Query: 110 LKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 + H+ K TQRI R+NL LR H+ RL RK++ FS+SV K Sbjct: 61 VPKDKHLTGKILTQRIVRNNLTLRTHIKRLARKTICFSRSVRSTKKS 107 >UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 Length = 138 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 36/120 (30%), Positives = 56/120 (46%), Gaps = 3/120 (2%) Query: 48 GYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYE 107 + + + WL+ AYDR+ ++ G R TL RL+ L+ + V + TD W Y+ Sbjct: 2 AFSSGQKNKLWLWKAYDRVTGRLIDWELGNRDSQTLSRLLERLAKWKVTVSCTDDWRPYQ 61 Query: 108 SRL---KGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIK 164 L H ISKR T IER+N + R LAR R + S+S + + + + Sbjct: 62 QLLDEHPDAFHGISKRETVGIERNNSDNRHWLARFHRPTKVISRSAHMVNITMAIFAKFR 121 >UniRef50_O67144 Putative uncharacterized protein n=1 Tax=Aquifex aeolicus RepID=O67144_AQUAE Length = 147 Score = 111 bits (278), Expect = 7e-24, Method: Composition-based stats. Identities = 32/145 (22%), Positives = 64/145 (44%), Gaps = 7/145 (4%) Query: 24 PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF-YAYDRLRKTVVAHVF-GERTMA 81 P+ + ++ D + DE W YVG K + W++ + T+ +F G+R++ Sbjct: 4 PEYGSEKVVKTEDNMENKPTDEMWSYVGTKGNEVWIWSVVVELKDGTIKKFLFAGDRSLR 63 Query: 82 TLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRY-TQRIERHNLNLRQHLARLG 140 T ++++ + + + TD + +Y L H++ K R E + LR L Sbjct: 64 TFLKILAKMPEAE--EYETDAYRVY-EWLPRDRHIVRKYGRVNRNEALHSKLRDKLVAFK 120 Query: 141 RKSLSFSKSVEQHDKVIGHYLNIKH 165 RK+ +F +S + + +I H Sbjct: 121 RKTKAFFRSFLYLRYALALF-SIHH 144 >UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis RepID=B2SG01_FRATM Length = 102 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 36/104 (34%), Positives = 50/104 (48%), Gaps = 2/104 (1%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPL 105 W ++G+K W+ AYDR + V G R AT RL + + TD W Sbjct: 1 MWNFIGSKK--CWIIKAYDRRVGKTIIWVTGGRDNATFRRLYKKVQHLTNCNFYTDDWVA 58 Query: 106 YESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + L K H+I K T IER N N R +LAR+ R++ S+S Sbjct: 59 FVEVLPKKRHIIGKSGTVAIERDNSNTRHNLARMTRRTKVISRS 102 >UniRef50_C0A223 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A223_9BACT Length = 269 Score = 107 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 53/174 (30%), Gaps = 48/174 (27%) Query: 41 AEMDEQWGYVGAKSRQR----------WLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL 90 + DE W +VG K + W + A D K V G R + + M L Sbjct: 63 IQCDEIWSFVGCKEKNVTNNGKRQGDTWTWIACDPDTKLVPCWFIGRRDSESAKKFMRRL 122 Query: 91 S---PFDVVIWMTDGWPLYESRLK-------------------GKLHVISKR-------- 120 + TDG Y + +K G H Sbjct: 123 ARHLSLGSTQITTDGLKAYINAIKEILWIETSYGMVEKKYDVSGDDHRTRYIGSEKTAIF 182 Query: 121 --------YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 T +ER NL +R + R RK+ +SK + H I + ++ Sbjct: 183 GNPDPDTMNTSIVERQNLTMRMSMRRFTRKTNGYSKKIANHRYAIALHFMYYNF 236 >UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=B0CCX7_ACAM1 Length = 196 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 34/139 (24%), Positives = 52/139 (37%), Gaps = 12/139 (8%) Query: 40 CAEMDEQWGYVGAKSR----------QRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSL 89 DE W V K + W+ + + V+ G+ T L+ Sbjct: 24 IINADELWSSVKKKQKHCEPEELSLGDCWIALSLAKDSGLVLTGRIGKHTDELAQELIEN 83 Query: 90 LS-PFDVVIWMTDGWPLYESRLKGK-LHVISKRYTQRIERHNLNLRQHLARLGRKSLSFS 147 W TDGW Y +L + +H +SK TQR+ER N LRQ R R+ F Sbjct: 84 TEGKTACHHWQTDGWEGYSRQLADEVIHHVSKALTQRLERTNGILRQQTGRWHRRQNKFG 143 Query: 148 KSVEQHDKVIGHYLNIKHY 166 K +Q + + ++ Sbjct: 144 KVWQQSAVTLRLVMAYFNW 162 >UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46GF8_METBF Length = 112 Score = 98.3 bits (243), Expect = 8e-20, Method: Composition-based stats. Identities = 29/95 (30%), Positives = 42/95 (44%) Query: 67 RKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIE 126 K + FG R T + L ++ MTD W Y L +H SK T +E Sbjct: 7 GKKFINCSFGSRGTETGQLIWEKLKQKEIGEVMTDHWRAYAEFLPENIHTQSKAETYTVE 66 Query: 127 RHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 +N LR LARL RK+ ++KS+E + + Sbjct: 67 GYNGILRHFLARLRRKTKCYTKSIEMLKYSVLLLM 101 >UniRef50_A9FJP3 Putative uncharacterized protein n=5 Tax=Proteobacteria RepID=A9FJP3_SORC5 Length = 349 Score = 98.3 bits (243), Expect = 8e-20, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 50/177 (28%), Gaps = 54/177 (30%) Query: 39 VCAEMDEQWGYVGAKSRQR-----------WLFYAYDRLRKTVVAHVFGERTMATLGRLM 87 A+ DE W YV K + + F K ++++ G+R + Sbjct: 61 HVAQCDEIWSYVQKKQSRVTASDPAEYGDAYTFVGMASASKLIISYRVGKRDEENTRAFV 120 Query: 88 SLLSP--FDVVIWMTDGWPLY------------------------------ESRLKGKLH 115 L + TDGW Y + Sbjct: 121 KDLRARLTTIPQLYTDGWQPYIGAVGASFTGGVDYCQVVKNYSRRPRRDDEVRYEPPRDP 180 Query: 116 VISKR-----------YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 I+K T +ER N +R H+ R R FS+ + H + ++ Sbjct: 181 FITKTPIFGIPDVEHASTSHVERQNWTIRMHIRRFTRLCNGFSRKLANHRAAVALHV 237 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 12/120 (10%) Query: 44 DEQWGYV----GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWM 99 DE W Y+ G+K W++ A + G+R T L++ L +V Sbjct: 173 DESWTYLRVRHGSKRENLWIWNAL---ADGLPFFTTGDRDYKTFSFLLNSLPKSEVN--Y 227 Query: 100 TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGH 159 TD + +Y+ HV SK+YT +E +N R HLARL R + + ++S D + Sbjct: 228 TDDYSVYQVL---DNHVASKKYTYTVESYNSYCRAHLARLARDTRAVNRSERMVDYSLAL 284 >UniRef50_A9GLP8 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLP8_SORC5 Length = 337 Score = 95.6 bits (236), Expect = 5e-19, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 52/176 (29%), Gaps = 54/176 (30%) Query: 38 IVCAEMDEQWGYVGAKSRQR-----------WLFYAYDRLRKTVVAHVFGERTMATLGRL 86 + +MDE W +V K + + + A D K ++ G+R Sbjct: 3 VHVIQMDEMWSFVQKKQARVTAEDPAEHGDAYFYVALDANTKLAISFHVGKRDGENTEAF 62 Query: 87 MSLLSP--FDVVIWMTDGWPLYE------------------------------SRLKGKL 114 + L V +DGW Y + Sbjct: 63 IKDLRSRLTVVPHITSDGWQPYIEAMATSFRGSADYAQCVKNYRGGPQRSPDHRYEPPRN 122 Query: 115 HVISKR-----------YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGH 159 ++K T +ER NL R + R R L+FSK++ H IG Sbjct: 123 PFVTKTPIFGAPKDELLSTSFVERFNLQTRHTVGRTRRLCLAFSKTLRGHRAAIGL 178 >UniRef50_A9GLN9 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLN9_SORC5 Length = 405 Score = 94.9 bits (234), Expect = 1e-18, Method: Composition-based stats. Identities = 38/181 (20%), Positives = 59/181 (32%), Gaps = 54/181 (29%) Query: 39 VCAEMDEQWGYVGAKSRQR-----------WLFYAYDRLRKTVVAHVFGERTMATLGRLM 87 + DE + YVG K + + F A D + V+A G+R M T G + Sbjct: 96 ELIQADEVFSYVGKKQARVTEKDAPGIGETYSFTALDTASRLVIAWRVGKRDMETCGPFI 155 Query: 88 SLLSPFDVVI--WMTDGWPLYESRLKGK-------------------------------- 113 + L +V+ TDG+ Y + + Sbjct: 156 ADLRSRLLVMPQITTDGFAPYIATVAEHFGLSVDYMQTVKNYRTGSYRGPDHRYEPPRDP 215 Query: 114 ---LHVI------SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIK 164 H I K T +ER N R L R+ R +FSK+ E H + + Sbjct: 216 FITKHTIYGAPDAKKASTSYVERLNGTTRHLLGRMRRLCYAFSKAPEHHRAAVALHYTYF 275 Query: 165 H 165 + Sbjct: 276 N 276 >UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ0_EDWI9 Length = 78 Score = 94.5 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 45/78 (57%), Positives = 56/78 (71%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + F++ +MTD WP+Y + L HV+SK+YTQRIERHNLNLR HL RL R+++ FS S Sbjct: 1 MRKFNIAFYMTDAWPVYRTLLDPAHHVVSKKYTQRIERHNLNLRTHLKRLTRRTICFSNS 60 Query: 150 VEQHDKVIGHYLNIKHYQ 167 E HDKVIG YL I HY Sbjct: 61 EEMHDKVIGWYLTINHYH 78 >UniRef50_Q9CJQ7 Putative uncharacterized protein n=2 Tax=Pasteurellaceae RepID=Q9CJQ7_PASMU Length = 181 Score = 94.1 bits (232), Expect = 1e-18, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 50/118 (42%), Gaps = 5/118 (4%) Query: 47 WGYV--GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVV--IWMTDG 102 W +V ++ ++ Y +VA V+G+R + T L L V D Sbjct: 62 WHFVPPNRIDQKYRIYIGYHAKTSEIVAFVWGKRDLQTALALKQRLKELKVSYERIAGDN 121 Query: 103 WPLYESRLKG-KLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGH 159 W + + + K++T+ IE +N +R L+R R+S FSKS+ H K Sbjct: 122 WDAFVNAFSDTGDQWVGKQHTKAIEGNNCRIRHRLSRAVRRSCCFSKSMFYHVKSFNI 179 >UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN Length = 111 Score = 94.1 bits (232), Expect = 1e-18, Method: Composition-based stats. Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 7/106 (6%) Query: 67 RKTVVAHVFGERTMATLGRLMSLLSP--FDVVIWMTDGWPLYESRLKGKLHV-ISK--RY 121 + ++ + G+R+ + +L + L + TD W Y++ + K H + K Sbjct: 2 QGKLLVAMRGDRSRQSAKKLWASLPGVYRQCAVAYTDFWESYKTVIPSKRHRPVGKETGQ 61 Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 T IER N RQ ++RL R+SLSFSK +E H + ++ I Y Sbjct: 62 TNPIERLNNTFRQRISRLVRESLSFSKKMENHVGAVWYF--IHDYN 105 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 91.8 bits (226), Expect = 8e-18, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 2/83 (2%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDV--IVCAEMDEQWGYVGAKSRQRWLFYAYDRLRK 68 W K P+++ + + +D +V E+DE W YVG+K+ +WL+ + Sbjct: 71 WLLEFIGELTKELPENLNAEVVSENDELEVVVLEVDELWSYVGSKANPQWLWLVMHSKTR 130 Query: 69 TVVAHVFGERTMATLGRLMSLLS 91 VVA G R T +L+ L Sbjct: 131 QVVAMQIGPRNKETAEKLLYKLP 153 >UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryochloris marina MBIC11017 RepID=B0CEC0_ACAM1 Length = 172 Score = 90.6 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 29/113 (25%), Positives = 48/113 (42%), Gaps = 2/113 (1%) Query: 56 QRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGKL 114 W+ + + V++ G+ T L+ W TDGW Y +L ++ Sbjct: 20 DCWIALSLAKESGLVLSGRIGKHTDELAQELIENTEGKTACHHWQTDGWEGYARQLPDEV 79 Query: 115 -HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 H +SK TQR+ER N +RQ R R+ F K +Q + L+ ++ Sbjct: 80 VHEVSKALTQRLERTNGIVRQQTGRWHRRQNKFGKVWQQSAMTLRLVLSYFNW 132 >UniRef50_B0CAP5 Putative uncharacterized protein n=3 Tax=Acaryochloris marina MBIC11017 RepID=B0CAP5_ACAM1 Length = 144 Score = 90.2 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 47/111 (42%), Gaps = 2/111 (1%) Query: 56 QRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGK- 113 + W+ + + V++ G+ T L+ W TDGW + + Sbjct: 5 ECWIALSLAKDSSLVLSGRIGKHTDELAQDLIENTEGKTTCHHWQTDGWEGSSRQPPDEV 64 Query: 114 LHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIK 164 +H +SK TQR++R N LRQ R ++ F K +QH + + + + Sbjct: 65 IHHVSKVLTQRLKRTNGILRQQTGRWHQRQNKFGKVWQQHAVTLTLFYHFR 115 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 87.5 bits (215), Expect = 2e-16, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 58/168 (34%), Gaps = 24/168 (14%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTV 70 W H F P+ K + + +++ SD +DE ++ K +L+ A D + V Sbjct: 153 WV-HKFAPYFKEKAKIFNAQLDLNSD---DWHVDETVVFISGKK--YYLWLAIDSETRFV 206 Query: 71 VAHVFGE-RTMATLGRLMSLLSPF-DVVIWMTDGWPLY----ESRLKGKLHV-----ISK 119 +A + R LM+ ++TD P Y ++ L H+ S Sbjct: 207 LAFHLTQARDSDAAFILMNQAKSMGKPNNFITDRLPSYNEAVKTVLNESTHIPVPPMSSD 266 Query: 120 RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 IE N + +K + + Y+ I HY Sbjct: 267 TNNNLIESFNKTFKAWYK--AKKGFNSFEKANNL-----IYMFIFHYN 307 >UniRef50_Q972H6 Putative uncharacterized protein ST1154 n=1 Tax=Sulfolobus tokodaii RepID=Q972H6_SULTO Length = 152 Score = 87.2 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 44/122 (36%), Gaps = 9/122 (7%) Query: 44 DEQWGYVGAKSRQRWLFYAYDR---LRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 DE W Y+ +R + + + + G+R T + L D W++ Sbjct: 27 DEMWTYLYRNTRAFYKWVFNCHVYTRLGLYIIYSVGDRDENTFREVKMYLP--DDGRWVS 84 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 D + +Y V+S E + +LR L R R + + ++S+ I Sbjct: 85 DDYNVY--FWLKNHTVVSLVNPN--ESFHSSLRDRLVRFKRATKAVNRSINMVKYSIALV 140 Query: 161 LN 162 L Sbjct: 141 LW 142 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 86.0 bits (211), Expect = 5e-16, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 52/155 (33%), Gaps = 9/155 (5%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD---RLR 67 W + + + + +V +DE W Y+ +R + + Sbjct: 86 WIKRYGRKKHEKLVELWGRAKELVKGKVVAKVVDEMWTYLYKNARAFYKWVFTCYVYTKL 145 Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 + + G+R +T + L W++D + LY V+S E Sbjct: 146 GVYLIYSVGDRDESTFLEVKKYLPDE--GRWVSDDYNLY--FWLKDHTVVSPVNPN--ES 199 Query: 128 HNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLN 162 + +LR L R R + + ++S+ I L Sbjct: 200 FHSSLRDRLIRFKRATKAINRSIRTMMYSIALVLW 234 >UniRef50_D1JFE2 Putative uncharacterized protein n=3 Tax=uncultured archaeon RepID=D1JFE2_9ARCH Length = 217 Score = 84.1 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 55/148 (37%), Gaps = 37/148 (25%) Query: 56 QRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF-------DVVIWMTDGWPLYES 108 W++ A K +AH G+R T L++L+ D +++DG Y Sbjct: 23 DCWIYTAIKSDTKLHLAHCTGKRVQETANALVALVKNRGKAPDTDDKATFVSDGNNQYTK 82 Query: 109 RL-----------------KGKLHVISK-------------RYTQRIERHNLNLRQHLAR 138 L + V+ K T +ER+NL LR +++ Sbjct: 83 ALFENFDVNAINYGQLVKERDNGRVVGKTRTIIFGSLEVDEIETVYVERYNLTLRHGISK 142 Query: 139 LGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 L RKSL FSK E D + Y ++ Sbjct: 143 LVRKSLCFSKCKEMLDDHLDLYQCYTNF 170 >UniRef50_B9K3D6 Transposase n=32 Tax=Bacteria RepID=B9K3D6_AGRVS Length = 243 Score = 83.7 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 45/134 (33%), Gaps = 11/134 (8%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFD---VVI 97 +DE +G K WL+ A D+ + V R RLM L + Sbjct: 85 WYLDEVVISIGGKK--HWLWRAVDQDGFVLDVLVQSRRNAKAAKRLMRKLLKGQGRSPRV 142 Query: 98 WMTDGWPLY----ESRLKGKLHVISKRYTQRIERHN--LNLRQHLARLGRKSLSFSKSVE 151 +TD Y + H K R E + + R+ + + + + + V Sbjct: 143 MITDKLRSYGAAKREIMPAVEHRSHKGLNNRAENSHQPIRRRERIMKRFKSARHLQRFVS 202 Query: 152 QHDKVIGHYLNIKH 165 HD + + +H Sbjct: 203 IHDPIANLFQIPRH 216 >UniRef50_Q2FSQ2 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSQ2_METHJ Length = 201 Score = 83.3 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 51/148 (34%), Gaps = 37/148 (25%) Query: 56 QRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL-------SPFDVVIWMTDGWPLYES 108 W + + R +A G+R + T ++ +P + + TDG Y Sbjct: 21 DCWSYTCFKRDSGLFLAFESGKRNIDTCADMLVRFFNRMELPTPENKISIFTDGNVQYSI 80 Query: 109 RLK--------GKLHVISKRYTQR----------------------IERHNLNLRQHLAR 138 L VI + + IE +N +RQ L+R Sbjct: 81 CLPELYCEPCLDYGQVIKVKEKNKLVYVIREKIMGNPDSKAISTSVIEGYNNKIRQRLSR 140 Query: 139 LGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 GRK+ SFSK + + + + + ++ Sbjct: 141 FGRKTASFSKKLNRFISALNIFQFVHNF 168 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 83.3 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 31/91 (34%), Gaps = 1/91 (1%) Query: 33 PGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSP 92 + I EMDE Y+G K + FG R T + L Sbjct: 88 KSENEISIVEMDEMHTYIGNKKNIAGSGLLL-IELGKFIHCSFGNRGTETGQLIWEKLKQ 146 Query: 93 FDVVIWMTDGWPLYESRLKGKLHVISKRYTQ 123 ++ MTD W Y L +H SK+ Q Sbjct: 147 KEIGEVMTDHWRAYAEFLPENIHTQSKKRIQ 177 >UniRef50_Q10VW0 ISSru3, transposase InsB n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VW0_TRIEI Length = 76 Score = 82.1 bits (201), Expect = 6e-15, Method: Composition-based stats. Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 2/73 (2%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGW 103 W +VG+K+ Q+W + A D K +VA GER +L + I TD W Sbjct: 1 MWSFVGSKNNQQWFWLAIDIETKEIVAFSLGERGEKGANQLWNSWPGIYRQCAICYTDFW 60 Query: 104 PLYESRLKGKLHV 116 Y+ + Sbjct: 61 SAYDVIFPHCRQL 73 >UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID=Q8VSP6_SHIFL Length = 67 Score = 79.1 bits (193), Expect = 5e-14, Method: Composition-based stats. Identities = 38/64 (59%), Positives = 46/64 (71%) Query: 104 PLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNI 163 P+Y + L HVISK+ TQRIERHNLNLR HL RL RK++ FSKS + H K+IG YL I Sbjct: 4 PVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSDDMHYKIIGWYLTI 63 Query: 164 KHYQ 167 H+ Sbjct: 64 NHHH 67 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 78.7 bits (192), Expect = 6e-14, Method: Composition-based stats. Identities = 21/86 (24%), Positives = 32/86 (37%), Gaps = 5/86 (5%) Query: 11 WPQHDFPPFKKLRPQ----SVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRL 66 W P+ VT + +V E+DE W +VG K +WL+ + Sbjct: 73 WLLDFINFIINDLPEDLNAQVTCCEKDELEVAK-LEVDELWNFVGNKKNDQWLWLILHKK 131 Query: 67 RKTVVAHVFGERTMATLGRLMSLLSP 92 + V+A G R T L + L Sbjct: 132 SRQVLAMQVGPRDKKTAELLFAKLPE 157 >UniRef50_C7DAC3 Transposase n=36 Tax=Rhodobacterales RepID=C7DAC3_9RHOB Length = 237 Score = 77.9 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 48/128 (37%), Gaps = 10/128 (7%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIW 98 +DE +V ++ +L+ A D + + A V R A + + L Sbjct: 83 WHVDEV--FVKVNGKRHYLWRAVDHEGEVLEAVVTKRRNKAAALKFLKKLMKRHGKAEEV 140 Query: 99 MTDGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVEQ 152 +TD + Y++ L+ + + R+E +L R+ + R+ S K Sbjct: 141 VTDRFAPYKAALRDLGALEKQSTGRWLNNRVENSHLPFRRRERAMQRFRRMRSLQKFAAV 200 Query: 153 HDKVIGHY 160 H V H+ Sbjct: 201 HSSVYNHF 208 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 76.0 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 14/72 (19%), Positives = 26/72 (36%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWM 99 E+ E +V K + L+ R+ ++ V G + T L + + + Sbjct: 106 VGELHELETFVSDKKNKVLLWTLVYHFRQGILGWVVGNHSGDTFQPLWQAIGFWKCYFQV 165 Query: 100 TDGWPLYESRLK 111 TDG P+ Sbjct: 166 TDGNPVASRLYP 177 >UniRef50_Q6MBQ1 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBQ1_PARUW Length = 138 Score = 75.6 bits (184), Expect = 6e-13, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 33/85 (38%), Gaps = 3/85 (3%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQ---PGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLR 67 W P+ + +++ + E+DE+W +VG K +WL+ + Sbjct: 45 WLLDFINLLINDLPEDLNTQVTCCEKDELEVARLEVDERWSFVGNKKNDQWLWLILHKKS 104 Query: 68 KTVVAHVFGERTMATLGRLMSLLSP 92 + V+A G R T L + L Sbjct: 105 RQVLAMQVGPRDKKTAELLFTKLPE 129 >UniRef50_B9K5F7 Transposase n=3 Tax=Bacteria RepID=B9K5F7_AGRVS Length = 196 Score = 74.4 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 43/136 (31%), Gaps = 11/136 (8%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFD---V 95 +DE + K WL+ A D+ + V R LM L Sbjct: 39 EKWHLDEAVVSIRGKK--HWLWRAVDQDGFVLDVLVQSRRNAKAARHLMRQLLKGQGRAP 96 Query: 96 VIWMTDGWPLYE----SRLKGKLHVISKRYTQRIERHN--LNLRQHLARLGRKSLSFSKS 149 + +TD Y G H K + R E + + R+ + + + + Sbjct: 97 RVMITDKLRSYGAAKWELTPGVEHRSHKGLSNRAENFHQPVRRRERIMKRFKSQRHLQRF 156 Query: 150 VEQHDKVIGHYLNIKH 165 V HD + + ++ Sbjct: 157 VSIHDPIANLFHIPRN 172 >UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SAB2_FERPL Length = 75 Score = 73.7 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 3/72 (4%) Query: 96 VIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDK 155 I+ TD W Y + K +I K T +ER L LR R RKS+ FSKS+E + Sbjct: 3 AIFYTDRWDAYN-LIPYKQRIIKKGGTNHVERLFLTLRNDNPRFARKSIRFSKSIEMLEN 61 Query: 156 VIGHYLNIKHYQ 167 + + I +Y Sbjct: 62 SLKLW--IHYYN 71 >UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFR2_MICAE Length = 122 Score = 72.9 bits (177), Expect = 4e-12, Method: Composition-based stats. Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 3/63 (4%) Query: 94 DVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + TD W Y++ + K H + K T IER N RQ ++RL R+SLSFSK + Sbjct: 25 QCAVAYTDCWESYKTGIPSKRHRPVGKETGQTNPIERLNNTFRQRISRLVRESLSFSKKM 84 Query: 151 EQH 153 E H Sbjct: 85 ENH 87 >UniRef50_Q64CQ0 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos1D1 RepID=Q64CQ0_9ARCH Length = 168 Score = 72.1 bits (175), Expect = 7e-12, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 45/133 (33%), Gaps = 37/133 (27%) Query: 71 VAHVFGERTMATLGRLMSLL-------SPFDVVIWMTDGWPLYESRLKG----------- 112 +A G++T + GR+M + SP + TDG Y L Sbjct: 1 MAFSVGKQTQESCGRMMKKVFGRTEQPSPQTKMEMFTDGNDDYTYVLPDYCADACIEYGQ 60 Query: 113 ------KLHVISK-------------RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQH 153 V+ K T +E +N LR+ + RL RK+ FSK Sbjct: 61 LVKIRENGRVVRKEKRIIYGNPDLGDIETTDVENYNGILRERIGRLVRKTKCFSKRKRML 120 Query: 154 DKVIGHYLNIKHY 166 + + + ++ Sbjct: 121 ECSLQVFQFYWNF 133 >UniRef50_Q6MCH2 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCH2_PARUW Length = 121 Score = 72.1 bits (175), Expect = 7e-12, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 33/74 (44%), Gaps = 3/74 (4%) Query: 86 LMSLLSPFDVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQHLARLGRK 142 L L + TD + +Y H +SK T IER N RQ ARL RK Sbjct: 33 LQKLPESLKKAFYFTDKFNVYYETNPWSQHQPVSKQSGQTSYIERFNCTRRQRCARLVRK 92 Query: 143 SLSFSKSVEQHDKV 156 +LSFSK + H + Sbjct: 93 TLSFSKKLTNHIGL 106 >UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methanosarcina RepID=Q46CV2_METBF Length = 75 Score = 71.0 bits (172), Expect = 2e-11, Method: Composition-based stats. Identities = 22/64 (34%), Positives = 32/64 (50%) Query: 98 WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVI 157 MTD W Y L +H SK T +E +N L+ LARL RK+ ++KS+E + Sbjct: 1 MMTDHWRAYAEFLPENIHTQSKAETYTVEGYNGILKHFLARLRRKTKCYTKSIEMLKYSV 60 Query: 158 GHYL 161 + Sbjct: 61 LLLM 64 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 69.4 bits (168), Expect = 5e-11, Method: Composition-based stats. Identities = 11/64 (17%), Positives = 26/64 (40%), Gaps = 2/64 (3%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTV 70 W Q+ P+ + + + E DE W +V +K+ + +++ DR + + Sbjct: 107 WLQNYVNNKLASVPRQIKVSDKLKGK--LVIECDEMWSFVFSKTIKVYIWRLIDRNTREI 164 Query: 71 VAHV 74 + Sbjct: 165 IGCY 168 >UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZKR3_ACAM1 Length = 241 Score = 69.1 bits (167), Expect = 6e-11, Method: Composition-based stats. Identities = 27/97 (27%), Positives = 37/97 (38%), Gaps = 5/97 (5%) Query: 19 FKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGER 78 Q + D I EMDE+ GYV K +Q W A D K ++ G R Sbjct: 104 KFGQHAQDFHEQEAQQLD-IDVLEMDERHGYVAIKQQQCWDAVAIDAASKFIIQVEVGPR 162 Query: 79 TMATLGRLM----SLLSPFDVVIWMTDGWPLYESRLK 111 + RLM L+ ++ MTDG Y + Sbjct: 163 NTNLIDRLMRATHKRLAHPRDLVLMTDGDASYRTLFP 199 >UniRef50_Q2G895 Transposase n=36 Tax=Alphaproteobacteria RepID=Q2G895_NOVAD Length = 238 Score = 67.9 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 10/133 (7%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIW 98 +DE +V + +L+ A D + + ++V R A + Sbjct: 85 WHLDEV--FVKINGERHYLWRAVDHEGEVLESYVTKTRDKAAALTFLKKALKRHGRAEAI 142 Query: 99 MTDGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVEQ 152 +TDG Y + ++ + + R+E +L R+ + R+ + K Sbjct: 143 VTDGLRSYPAAMRQLGNLDRRKMGRWLNNRVENSHLPFRRRERAMLRFRQMKTLQKFASV 202 Query: 153 HDKVIGHYLNIKH 165 H + H+ +H Sbjct: 203 HGSLHNHFSQDRH 215 >UniRef50_A4AD66 Transposase n=19 Tax=unclassified Gammaproteobacteria RepID=A4AD66_9GAMM Length = 227 Score = 67.5 bits (163), Expect = 2e-10, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 47/132 (35%), Gaps = 11/132 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS---PFDVVIWM 99 +DE +V +Q++L+ A D+ + V ++ +R A R L + + Sbjct: 75 IDEV--FVTINGKQQYLWRAVDQDGEVVDVYLQTKRDGAAAKRFFKRLLRSHGGEPRKIV 132 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLA--RLGRKSLSFSKSVEQH 153 TD Y + +H+ + R E+ + R R + + V H Sbjct: 133 TDKLRSYGVAHRELIPETVHITEQYENNRAEQSHETTRARERGMRRFKSVAQAQRFVAAH 192 Query: 154 DKVIGHYLNIKH 165 V + +H Sbjct: 193 AAVFNLFNLGRH 204 >UniRef50_A8GX98 Transposase and inactivated derivative n=2 Tax=Rickettsia bellii RepID=A8GX98_RICB8 Length = 99 Score = 67.1 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 41/82 (50%), Gaps = 1/82 (1%) Query: 77 ERTMATLGRLMSLLSP-FDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQH 135 R +++ + L +++ I +D + +Y + K H +K+ T +E N +R + Sbjct: 4 GRDISSYLPMALRLEENYEIDISCSDHYDVYGAYKIAKRHYFTKKETALVESFNSLIRNY 63 Query: 136 LARLGRKSLSFSKSVEQHDKVI 157 LAR RK+ +SK+++ I Sbjct: 64 LARFNRKTKRYSKAIDMIYNSI 85 >UniRef50_UPI00016C465A IS1 transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C465A Length = 88 Score = 67.1 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 32/74 (43%), Gaps = 5/74 (6%) Query: 97 IWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQH 153 TD P + + H + K T IER L LRQ AR RK+L+FSK H Sbjct: 13 TVYTDLLPACRAAIPRARHRAVRKVTGLTAHIERFWLTLRQRCARFVRKTLTFSKCPRNH 72 Query: 154 DKVIGHYLNIKHYQ 167 + ++ + Y Sbjct: 73 LGALWYFA--RRYN 84 >UniRef50_B0URB1 Putative uncharacterized protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0URB1_METS4 Length = 82 Score = 67.1 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 30/67 (44%) Query: 98 WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVI 157 + TD + Y + L H + K TQ +E +N R AR R++ SKSVE + + Sbjct: 4 FCTDNYAPYAAALPAGRHHVGKDQTQLVESNNARQRHWFARFRRRTCVVSKSVEMVEATM 63 Query: 158 GHYLNIK 164 + Sbjct: 64 ALFAFYH 70 >UniRef50_Q11MN9 Transposase n=37 Tax=Bacteria RepID=Q11MN9_MESSB Length = 237 Score = 67.1 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 29/135 (21%), Positives = 48/135 (35%), Gaps = 11/135 (8%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVI 97 +DE V K ++ WL+ A D + A + R A RLM L + Sbjct: 79 WHLDEMV--VTIKGKKYWLWRAVDTNGYVLDALLQSRRNKAAAMRLMRKLLKDQGTAPRV 136 Query: 98 WMTDGWPLYE----SRLKGKLHVISKRYTQRIERHNLNL--RQHLARLGRKSLSFSKSVE 151 +TD Y + G H K R E +L + R+ + + + V Sbjct: 137 MVTDKLRSYSAAKSQLMPGVEHRSHKGLNNRAENSHLPVRRRERRMMRFKSARQCQRFVS 196 Query: 152 QHDKVIGHYLNIKHY 166 H ++ +L + Y Sbjct: 197 AHGQIANLFLLHRKY 211 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 66.0 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 31/63 (49%), Gaps = 1/63 (1%) Query: 36 DVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDV 95 ++ A++DE +VG+K W++ + ++ V G+R++ T L ++ + Sbjct: 66 EIPEIAQIDELQTFVGSKKT-IWVWTVVNTKLPGILKFVIGDRSLLTFTTLWQMIQGWAC 124 Query: 96 VIW 98 ++ Sbjct: 125 FLY 127 >UniRef50_Q10ZU2 Putative uncharacterized protein n=3 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZU2_TRIEI Length = 79 Score = 65.6 bits (158), Expect = 6e-10, Method: Composition-based stats. Identities = 14/46 (30%), Positives = 23/46 (50%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLG 84 + + DE W +VG K+ ++WL+ A D + +V GER Sbjct: 34 LTIQCDEMWSFVGNKNNKQWLWLAIDIETQEIVGFYLGERGEKGAA 79 >UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis RepID=Q1CBA9_YERPA Length = 85 Score = 65.6 bits (158), Expect = 7e-10, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 34/78 (43%), Gaps = 1/78 (1%) Query: 51 GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRL 110 G + +T F T +L+ LLS F++V W TD + Y L Sbjct: 5 GQQKAATLALVCLGASPQTYYCSYFWSSEQKTFRQLLGLLSGFNIVFWCTDNFSAY-EML 63 Query: 111 KGKLHVISKRYTQRIERH 128 + H+ SK YTQRIER Sbjct: 64 PDEKHIRSKLYTQRIERE 81 >UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B7X2_EDWI9 Length = 99 Score = 64.8 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 21/69 (30%), Positives = 36/69 (52%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 L+ F++ + D W + + +TQ ER++L LR + RL RK + FS++ Sbjct: 3 LTAFNIGMITRDDWGNPIREVPWGKPLTGTIFTQHSERNSLMLRTRIKRLARKRIGFSRA 62 Query: 150 VEQHDKVIG 158 + H+KV G Sbjct: 63 IALHEKVTG 71 >UniRef50_A3W3Q5 Transposase n=1 Tax=Roseovarius sp. 217 RepID=A3W3Q5_9RHOB Length = 180 Score = 63.7 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 46/155 (29%), Gaps = 17/155 (10%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW P + + + +PG +DE V R+ WL+ A D+ Sbjct: 22 RWTAKFGPQIARNLRRR---QARPG----DVWHLDEVV--VKISGRKFWLWRAVDQHGVV 72 Query: 70 VVAHVFGERTMATLGRLMSLL--SPFDVVIWMTDGWPLY----ESRLKGKLHVISKRYTQ 123 + V +R R++ L +TD Y G H K Sbjct: 73 LEEIVQSKRDKRAAKRVLRRLIKCYGLPKRIVTDKLRAYGAAKREVAPGLDHWSHKDLNN 132 Query: 124 RIERHNLNLRQHLARL--GRKSLSFSKSVEQHDKV 156 R E +L R+ + R + H Sbjct: 133 RAENSHLPFRKRERAMQSFRSPGGLQRFGSIHSAT 167 >UniRef50_B9K4Q6 Transposase n=2 Tax=Alphaproteobacteria RepID=B9K4Q6_AGRVS Length = 232 Score = 63.7 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 40/129 (31%), Gaps = 15/129 (11%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 W F L + + +DE V K R+ WL+ A D Sbjct: 58 EWAAKFGSEFAALLRR------RSKGGFADKWHLDEMV--VTFKGRKYWLWRAVDAEGYM 109 Query: 70 VVAHVFGERTMATLGRLMSLL---SPFDVVIWMTDGWPLYES----RLKGKLHVISKRYT 122 + A + R +LM L + +TD Y++ + G H K Sbjct: 110 LEALLQSRRNKKAALKLMRKLLKGQGLTPRVMVTDKLRSYDAAKRDIMPGVEHRSHKGLN 169 Query: 123 QRIERHNLN 131 R E +L Sbjct: 170 NRAENSHLP 178 >UniRef50_A9FZD9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZD9_SORC5 Length = 216 Score = 62.9 bits (151), Expect = 4e-09, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 29/96 (30%), Gaps = 13/96 (13%) Query: 38 IVCAEMDEQWGYVGAKSRQR-----------WLFYAYDRLRKTVVAHVFGERTMATLGRL 86 +MDE W +V K + +L+ A D K ++ G+ Sbjct: 61 AHVIQMDEMWSFVQKKQARVTAKDPAEHGDAYLYVALDANTKPAISFHVGKCDGENTEMF 120 Query: 87 MSLLSP--FDVVIWMTDGWPLYESRLKGKLHVISKR 120 + L V +DGW Y + + Sbjct: 121 IKDLRGRLTVVPHVTSDGWQPYIEAMAASFRGSTDY 156 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 62.5 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 44/46 (95%), Positives = 44/46 (95%) Query: 19 FKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 64 KLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD Sbjct: 68 LNKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 113 >UniRef50_Q2G8C0 Transposase n=5 Tax=Alphaproteobacteria RepID=Q2G8C0_NOVAD Length = 166 Score = 62.1 bits (149), Expect = 6e-09, Method: Composition-based stats. Identities = 19/114 (16%), Positives = 36/114 (31%), Gaps = 8/114 (7%) Query: 60 FYAYDRLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMTDGWPLYESRL----KGK 113 + A D + + + V +R M +TDG Y + + Sbjct: 30 WRAVDHEGEVLESFVTRKRDKTAALTFMKKALKRHGKAEAIVTDGLRSYPAAMRELGNEG 89 Query: 114 LHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVEQHDKVIGHYLNIKH 165 + + R E +L R+ + R+ S K H + H+ +H Sbjct: 90 RREVGRHLNNRAENSHLPFRRRERAMLRFRQMKSLQKFASVHASIHNHFSQERH 143 >UniRef50_Q0RZ53 Transposase n=23 Tax=Bacteria RepID=Q0RZ53_RHOSR Length = 317 Score = 61.7 bits (148), Expect = 8e-09, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 44/164 (26%), Gaps = 24/164 (14%) Query: 13 QHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVA 72 Q ++ RP+ +DE + Q +L+ A D+ + Sbjct: 60 QDFANQLRRRRPRPG-----------DKWHLDEVV--IRMNGTQHYLWRAVDQDGNVLDV 106 Query: 73 HVFGERTMATLGRLMSLLSPFDVV---IWMTDGWPLY----ESRLKGKLHVISKRYTQRI 125 V R + L + +TD Y + H S+ R Sbjct: 107 LVQSRRNAVAAKKFFRKLLKRQCAVPRVLVTDKLGSYQVAHREVMPSVEHRRSRYLNNRA 166 Query: 126 ERHNLNLRQH----LARLGRKSLSFSKSVEQHDKVIGHYLNIKH 165 E + RL R + V +H + + H Sbjct: 167 ENSHQPAATRAGDETVRLARSGAAVPLGVRRHRRTLSGSATPAH 210 >UniRef50_A3XA77 Putative transposase n=1 Tax=Roseobacter sp. MED193 RepID=A3XA77_9RHOB Length = 154 Score = 60.2 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 46/133 (34%), Gaps = 10/133 (7%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIW 98 +DE + + + WL+ A D + + V R R +S L + Sbjct: 23 LHLDEVVIMI--RGVKHWLWRAMDSEGQVLDILVQSRRNARAAKRFISRLVARWGVPRVI 80 Query: 99 MTDGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQH--LARLGRKSLSFSKSVEQ 152 +TD Y + L+ H K RIE + R+ + + + + + Sbjct: 81 ITDRLRSYGAALRKLALGVDHRAHKGLNIRIEGTHRPTRKREKIQGRFKSARQAQRFLVV 140 Query: 153 HDKVIGHYLNIKH 165 HD+ + +H Sbjct: 141 HDEAANLFRPCRH 153 >UniRef50_A9EF44 Transposase n=2 Tax=Rhodobacteraceae RepID=A9EF44_9RHOB Length = 156 Score = 60.2 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 42/133 (31%), Gaps = 11/133 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMT 100 MDE + K WL+ A D + V R + R + L + + +T Sbjct: 1 MDEVVITIRGKK--HWLWRAIDADGDVLDILVQTRRNAKSAKRFLQRLVSQFGEPRVVIT 58 Query: 101 DGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVEQHD 154 D Y ++ H K IE + R+ + + + HD Sbjct: 59 DKLRSYLKPVKTLTPNADHRAHKGLNNAIEVSHRPTRKREKIFGKFKSHRQAHRFLAAHD 118 Query: 155 KVIGHYLNIKHYQ 167 + I + YQ Sbjct: 119 Q-INLLFRPRRYQ 130 >UniRef50_Q648U8 Putative uncharacterized protein n=6 Tax=environmental samples RepID=Q648U8_9ARCH Length = 173 Score = 60.2 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 40/116 (34%), Gaps = 30/116 (25%) Query: 80 MATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKL-----------------HVISKRYT 122 M T+ + + + + +DG Y S + V+ K T Sbjct: 9 MKTVRKRGKKPTKDEKATFASDGNVQYTSAILENFDVEAINYGQLVKEREGGRVVGKTRT 68 Query: 123 -------------QRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKH 165 IER+NL LR ++RL RKSL FSK D + Y + Sbjct: 69 IIFGEVDDVDIDTVYIERYNLTLRHGISRLVRKSLCFSKCKGMLDNHLDVYQCYNN 124 >UniRef50_Q32DI9 Iso-IS1 ORF2 n=2 Tax=Shigella RepID=Q32DI9_SHIDS Length = 94 Score = 59.8 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 14/52 (26%), Positives = 25/52 (48%) Query: 63 YDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKL 114 + +A+ FG RT T L++LL+PF++ + +D W Y + Sbjct: 32 ITPKQGGGLAYTFGPRTDETCRELLALLTPFNIGMITSDDWGSYGREVPKDK 83 >UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pestis RepID=C4GXL2_YERPN Length = 111 Score = 59.8 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 16/44 (36%), Positives = 25/44 (56%) Query: 46 QWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSL 89 W +VG K +QRWL+YA++ K ++AH+FG R+ Sbjct: 1 MWSFVGNKKQQRWLWYAWEPRLKRIIAHIFGRRSKRHFANYWGC 44 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 59.8 bits (143), Expect = 4e-08, Method: Composition-based stats. Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 2/61 (3%) Query: 79 TMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLAR 138 T +L+ L+ F+VV W TD + Y + L H K +TQ IER NL +R + R Sbjct: 55 NEKTFRKLLKKLASFNVVFWCTDNFKTY-NLLPKSQHRAGKIFTQHIERENL-MRTRIKR 112 Query: 139 L 139 L Sbjct: 113 L 113 >UniRef50_Q0RWC6 Transposase n=24 Tax=Bacteria RepID=Q0RWC6_RHOSR Length = 236 Score = 57.9 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 39/136 (28%), Gaps = 20/136 (14%) Query: 13 QHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVA 72 Q ++ R + +DE ++ + +L+ A D+ + Sbjct: 60 QAYANQLRRRRARPG-----------DKWHLDEV--FIRINGKLHYLWRAVDQGGNVLDV 106 Query: 73 HVFGERTMATLGRLMSLLSP---FDVVIWMTDGWPLY----ESRLKGKLHVISKRYTQRI 125 V R + L + + +TD Y L H SK R Sbjct: 107 LVQSRRNAKAAKKFFRKLLKGLRYVPRVIITDKLASYQVVHREMLASVEHRRSKYLNNRA 166 Query: 126 ERHNLNLRQHLARLGR 141 E + RQ + R Sbjct: 167 ENSHQPTRQRERAMKR 182 >UniRef50_Q649W7 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos34A6 RepID=Q649W7_9ARCH Length = 217 Score = 57.9 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 52/170 (30%), Gaps = 67/170 (39%) Query: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSP-----FDVVIWMTDGWPLYESRL----- 110 A + K V ++ G R L++ + ++ I+ TD W Y++ L Sbjct: 2 VAQEAKTKLVTSYHVGRRAFEDAVELLAEMESRRDKSTELPIFTTDDWDAYKNALVEVYG 61 Query: 111 -----------------------KGKLHVISKRYTQRI---------------------- 125 VI R + Sbjct: 62 VEEQPEYKGRGRPPNSKKVPPPDLKYGQVIKYREGNEVTDVKKRVVFGNEEEVLSALKLA 121 Query: 126 ---------ERHNLNLRQHLARLGRKSLSFSKSVE---QHDKVIGHYLNI 163 ER+NL +R ++RL RK+++FSK + H + + N+ Sbjct: 122 GNSINTSYIERNNLTVRNGVSRLIRKTINFSKRLNPLVMHLCLFFAWFNL 171 >UniRef50_Q7NJH9 Gsl1853 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJH9_GLOVI Length = 71 Score = 56.3 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 3/56 (5%) Query: 103 WPLYESRLKGKLHV---ISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDK 155 Y L K H + T IER N +RQ + RL RK+LSFSK + H+ Sbjct: 2 LKNYGQVLASKRHRAAGKATGTTSCIERFNNTVRQRVGRLVRKALSFSKCLSNHNA 57 >UniRef50_Q0W4E9 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4E9_UNCMA Length = 160 Score = 56.3 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 23/127 (18%), Positives = 37/127 (29%), Gaps = 31/127 (24%) Query: 71 VAHVFGERTMATLGRLMSLLSPF---DVVIWMTDGWPLYESRLKGKLHVISKR------- 120 + G T T ++S +S V +DG Y L + Sbjct: 1 MGFSVGRWTQGTCRVMLSQVSNSVQDGVFTVYSDGNDDYYYTLTDFFQEVRYGQLVKIRE 60 Query: 121 ---------------------YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGH 159 T +E N LR + RL RK+ +FSK E + Sbjct: 61 KGRVVGKEIRVLIGDVDSEQVETFNVENFNSILRGRVGRLVRKTKTFSKIPEMLYYSVAL 120 Query: 160 YLNIKHY 166 + ++ Sbjct: 121 FQFYWNF 127 >UniRef50_Q6MCX8 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX8_PARUW Length = 72 Score = 55.6 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 3/65 (4%) Query: 106 YESRLKGKLHV-ISKRY--TQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLN 162 Y + H + K+ T IER N L +R K+LSFSK + H +I ++ Sbjct: 2 YFESIPFGQHRPVGKQSDKTSYIERLNCTLGYRCSRFVGKTLSFSKKLINHIGMITSFIC 61 Query: 163 IKHYQ 167 + Sbjct: 62 DYNLH 66 >UniRef50_Q8PRQ0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PRQ0_METMA Length = 129 Score = 55.6 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 18/49 (36%), Positives = 25/49 (51%) Query: 118 SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 S T ER NL +R LAR RK ++FSK+ H K I + ++ Sbjct: 34 SYIGTSYAERINLTIRTSLARFIRKGMNFSKTKRMHQKAIDLFQAWYNF 82 >UniRef50_Q218S2 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisB18 RepID=Q218S2_RHOPB Length = 191 Score = 53.6 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 2/48 (4%) Query: 120 RYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 T +ER NL+LR R R + FSK ++ H + Y+ HY Sbjct: 54 ISTSYVERQNLSLRMGSRRFTRLTNGFSKKLDNHVAAVALYVA--HYN 99 >UniRef50_A8LAQ0 Integrase catalytic region n=1 Tax=Frankia sp. EAN1pec RepID=A8LAQ0_FRASN Length = 175 Score = 53.6 bits (127), Expect = 3e-06, Method: Composition-based stats. Identities = 29/125 (23%), Positives = 41/125 (32%), Gaps = 10/125 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS--PFDVVIWMT 100 +DE YV R +L+ A D+ + + R A R V T Sbjct: 25 IDE--TYVKVAGRWTYLYRAVDQHSQVIDVLASTRRDQAAARRFFVRALTHGRRPVKVTT 82 Query: 101 DGWPLYES----RLKGKLHVISKRYTQRIERHNLNLRQHLA--RLGRKSLSFSKSVEQHD 154 D P+Y L HV + R RIE + L+ L R ++ S H Sbjct: 83 DKAPVYPRILDELLPEACHVDAARENNRIEADHGRLKARLRPMRGLKRLRSVQTVSAGHA 142 Query: 155 KVIGH 159 V Sbjct: 143 LVQNI 147 >UniRef50_B4WST7 Putative uncharacterized protein n=3 Tax=Synechococcus sp. PCC 7335 RepID=B4WST7_9SYNE Length = 186 Score = 52.5 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 45/125 (36%), Gaps = 9/125 (7%) Query: 50 VGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVIWMTDGWPLY 106 + K Q +L+ A D+ + + R A R L + F + +TD Sbjct: 34 IKIKGEQFYLWGAVDQHGMVLDILMQRRRNTAAAYRFFRKLLKSTGFAPRVIITDKLKSC 93 Query: 107 ES----RLKGKLHVISKRYTQRIERHNLNLRQHLARLGR--KSLSFSKSVEQHDKVIGHY 160 + LKG H K R E + R R+GR + + + + + GH+ Sbjct: 94 GAAKKDILKGVEHRQHKGLNNRAENSHRPTRIRERRMGRFKSASHAQRFLSAFEPIRGHF 153 Query: 161 LNIKH 165 +H Sbjct: 154 HPHQH 158 >UniRef50_Q8TRX5 Predicted protein n=3 Tax=Methanosarcina acetivorans RepID=Q8TRX5_METAC Length = 221 Score = 52.1 bits (123), Expect = 7e-06, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 49/165 (29%), Gaps = 57/165 (34%) Query: 58 WLFYAYDRLRKTVVAHVFGERTM---ATLGRLMSLLSPFDVVIWMTDGWPLYESRLKG-- 112 W++ A+ + ++ V G R L++ + +++TDG Y L Sbjct: 10 WMWVAFVPGCRLILDFVIGPRKQYVADKFIELVNKHISDKIPVFVTDGLNFYREALLKQF 69 Query: 113 ----KLHVISKRY----------------------------------------------T 122 + KR T Sbjct: 70 GVLREFPRTGKRGRPKKPKIVPSEDLRYAQVVKTRVNGVLEKVEKKIIFGENIEQSEIST 129 Query: 123 QRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 +ER NL RQ R+ RK++ FSK E + + Y H+ Sbjct: 130 TLLERQNLTFRQDNNRVSRKTIGFSKMKEWLEIQMKLYCT--HFN 172 >UniRef50_B4WU12 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WU12_9SYNE Length = 228 Score = 51.7 bits (122), Expect = 9e-06, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 52/168 (30%), Gaps = 31/168 (18%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW Q P K + ++P +D +DE YV K ++L+ A D T Sbjct: 44 RWVQAYSPELDKRCRRY----LKPTND---SWRVDE--TYVKVKGVWKYLYRAVDSAGNT 94 Query: 70 VVAHVFGERTMATLGRLMSLLSP----FDVVIWMTDGWPLYE---------SRLKGK-LH 115 + + +R R + + + D Y L Sbjct: 95 LDFMLSAKRDAKAAKRFLRKVLNASHTIEPRAITVDKNAAYPPAINELKADEVLPEATKT 154 Query: 116 VISKRYTQRIERHNLNLRQHLA--------RLGRKSLSFSKSVEQHDK 155 S +E+ + +++ + R++L +++ K Sbjct: 155 RQSNYLNNTVEQDHRFIKRRVNPGLGFGSFNTARRTLKGYEAMNMIRK 202 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 51.7 bits (122), Expect = 1e-05, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 51/145 (35%), Gaps = 25/145 (17%) Query: 40 CAEMDEQWGYVGAKSRQR--------WLFYAYDRLRKTVVAHVF--GERTMATLGRLMSL 89 +DE YV K WL+ A D + ++ G RT+ ++ Sbjct: 179 VWSVDE--AYVNVKRSPVLENKGHGNWLWSAIDPRTRYLLCTRIAEGSRTLPDAESVIRE 236 Query: 90 LSPF--DVVIWMTDGWPLYESR----LKGKLHVISK----RYTQ-RIERHNLNLRQHLAR 138 + +TD Y + L H+ +K +T IER++ +R+ L Sbjct: 237 ARKMSEEPDYMITDSLRSYATAAAKCLPRTAHIKTKAIRDGFTNMAIERYHNEIREKLKS 296 Query: 139 LGRKSLSFSKSVEQHDKVIGHYLNI 163 + L + S + ++ + N Sbjct: 297 C--RGLHSADSAQIFMDLLRIHHNF 319 >UniRef50_A5N1B9 Transposase n=2 Tax=Clostridium kluyveri RepID=A5N1B9_CLOK5 Length = 127 Score = 50.9 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 32/109 (29%), Gaps = 15/109 (13%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGE-RTMATLGRLMSL---LSPFD 94 DE YV K +L+ D + +++ V R +L L+ Sbjct: 21 DEWHADE--TYVKIKGIDYYLWLILDSKTRVIISFVLSRFRNSTQAYKLFFYSSILTRTS 78 Query: 95 VVIWMTDGWPLYESRLK--GKLHVISKR-------YTQRIERHNLNLRQ 134 +TD W Y +K + K IE N + Sbjct: 79 PKKIVTDKWDAYNEAIKNLHCHTLHHKYSAFSEDLNNNFIESFNKTFKA 127 >UniRef50_UPI00018554DD transposase n=1 Tax=Francisella novicida FTG RepID=UPI00018554DD Length = 97 Score = 50.6 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 13/36 (36%), Positives = 20/36 (55%) Query: 35 SDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTV 70 D I E DE W ++G+K ++ W+ AYDR + Sbjct: 40 EDNISEIEFDEMWHFIGSKKKKCWIIKAYDRRVGKL 75 >UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8_SHIDS Length = 94 Score = 50.2 bits (118), Expect = 3e-05, Method: Composition-based stats. Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 4/49 (8%) Query: 112 GKLHVISKR----YTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 V K + +ER+NL LR + RL RK++ FS+SVE H+K Sbjct: 18 KDKQVTRKGIFIQHMLYLERNNLPLRTRIKRLARKTICFSRSVEIHEKS 66 >UniRef50_A9HNK8 Transposase, putative n=1 Tax=Roseobacter litoralis Och 149 RepID=A9HNK8_9RHOB Length = 175 Score = 49.8 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 42/139 (30%), Gaps = 12/139 (8%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW Q P K + + +DE Y+ A + R+L+ A D + Sbjct: 28 RWVQKFGPELAKRAEKH-------HKRSSLDWHVDE--TYIRAGGKWRYLWRAIDANDQL 78 Query: 70 VVAHVFGERTMATLG-RLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERH 128 V + R R + + V TD P Y + + ER Sbjct: 79 VEFRLTARRDAKAFLNRAIERVRLHRPVSICTDKAPTYRKAICAGDVSGDRDEKDTQERP 138 Query: 129 NLNLRQHLARLGRKSLSFS 147 + Q R R + S Sbjct: 139 HSQ--QAAPRQRRDCIHAS 155 >UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C135_9GAMM Length = 372 Score = 49.8 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 29/78 (37%), Gaps = 2/78 (2%) Query: 88 SLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFS 147 L I D L + V S+ T IER NL RQ RL R+S FS Sbjct: 219 GRLIEVKNKIIFGDENELASKL--AESPVRSEINTSFIERDNLTQRQSNRRLTRRSNGFS 276 Query: 148 KSVEQHDKVIGHYLNIKH 165 K + D + L H Sbjct: 277 KELSWFDSPLWLSLAYYH 294 Score = 44.4 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 12/91 (13%), Positives = 31/91 (34%), Gaps = 16/91 (17%) Query: 38 IVCAEMDEQWGYVGAKSRQR-------------WLFYAYDRLRKTVVAHVFGERTMATLG 84 + ++DE W ++ W++ A+ + + V+A V G Sbjct: 88 VTSLQLDELWSFILTLEHNCTEAKLYHESYGDAWVWLAFAPVWRVVLAFVIGSLPQKNAN 147 Query: 85 RLMSLLSPFD---VVIWMTDGWPLYESRLKG 112 L+ ++ + + +D + + L Sbjct: 148 LLLDRVAHVTDAHIPFFTSDQFSSSRTALLH 178 >UniRef50_A7HJ24 ISCpe7, transposase n=6 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJ24_FERNB Length = 316 Score = 49.0 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 48/141 (34%), Gaps = 13/141 (9%) Query: 31 IQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGE-RTMATLGRLMSL 89 + + DE + K ++ +++ D ++ + R M + L+ Sbjct: 157 PTFTIENVFSVHADE--TVLVFKEQKYYVWLLVDHETNLILCWHVSKYRDMGQVKVLLEK 214 Query: 90 L---SPFDVVIWMTDGWPLYESRLKG-----KLHVISKRYTQRIERHNLNLRQ--HLARL 139 S + +TDG YES +K V+ + E L+ L R Sbjct: 215 FFGNSKPRNIELITDGLGAYESAVKLLFRNINHVVVPLGKNNQCESKFSLLKDFFRLKRG 274 Query: 140 GRKSLSFSKSVEQHDKVIGHY 160 + + + +K ++ V + Sbjct: 275 LKNTKNLAKYIQGFCVVKNLW 295 >UniRef50_Q469A1 Putative uncharacterized protein n=1 Tax=Methanosarcina barkeri str. Fusaro RepID=Q469A1_METBF Length = 180 Score = 48.6 bits (114), Expect = 7e-05, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 38/115 (33%), Gaps = 17/115 (14%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQR--------WLFY 61 RW + K+ + P EMDE W + + W++ Sbjct: 56 RWLTRAAEQYDKVNDNMMKDLNTPK------IEMDELWIIIKKIVSRMKDYEDDGPWMWV 109 Query: 62 AYDRLRKTVVAHVFGERTMATLGRLMSLLSPF---DVVIWMTDGWPLYESRLKGK 113 A+ + ++ V G R +L+ + + +++TDG Y L Sbjct: 110 AFVPGCQLILGFVIGPRKQYVTDKLVESVKKHLSDKIPLFVTDGLNFYREALLKH 164 >UniRef50_C6GYT4 IS1216, transposase (Fragment) n=121 Tax=root RepID=C6GYT4_STRS4 Length = 234 Score = 48.3 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 53/162 (32%), Gaps = 21/162 (12%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW Q ++ + +MDE Y+ K + +L+ A D T Sbjct: 57 RWVQEYGKLLYQIWKK-------KNKKSFYSWKMDE--TYIKIKGKWHYLYRAIDADGLT 107 Query: 70 VVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWPLYESRLKG---------KLHVIS 118 + + +R + L + + +TD P S K H Sbjct: 108 LDIWLRKKRDTQAAYAFLKRLVKQFDEPKVVVTDKAPSITSAFKKLKEYGFYQGTEHRTI 167 Query: 119 KRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHY 160 K IE+ + +++ R + S +++ + + G Y Sbjct: 168 KYLNNLIEQDHRPVKRRNK-FYRSLRTASTTIKGMEAIRGLY 208 >UniRef50_C7S9U1 IS6100 transposase n=358 Tax=root RepID=C7S9U1_ECOLX Length = 266 Score = 48.3 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 45/139 (32%), Gaps = 23/139 (16%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW Q P +K P +DE YV + + +L+ A D+ T Sbjct: 61 RWVQCYAPEMEKRLRWFWRRGFDPS------WRLDE--TYVKVRGKWTYLYRAVDKRGDT 112 Query: 70 VVAHVFGERTMATLGRLMSL----LSPFD-VVIWMTDGWPLYESRL----------KGKL 114 + ++ R+ R + L ++ TD P Y + + + Sbjct: 113 IDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETA 172 Query: 115 HVISKRYTQRIERHNLNLR 133 H K IE + L+ Sbjct: 173 HRQVKYLNNVIEADHGKLK 191 >UniRef50_C5AG18 IS element transposase n=2 Tax=Burkholderia RepID=C5AG18_BURGB Length = 276 Score = 47.9 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 45/139 (32%), Gaps = 15/139 (10%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVV--I 97 +DE +V + L+ A D + + R A R + + V Sbjct: 76 TWHLDEL--FVKLRGEPYVLWRAVDEHGIELDVLLQKRRDKAAAKRFFRRILRANPVPRK 133 Query: 98 WMTDGWPLYES------RLKGKLHVISKRY---TQRIERHNLNLRQHLARL--GRKSLSF 146 +TD Y + L G HV K R E + R+ ++ R L Sbjct: 134 IVTDQLRSYPAAKAEVPELGGVKHVFVKACAKVNNRAENSHQPTRRRERQMQGFRDPLRT 193 Query: 147 SKSVEQHDKVIGHYLNIKH 165 S+ + + H+ +H Sbjct: 194 QASLSRFGPIRQHFALPRH 212 >UniRef50_Q10ZQ2 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=Q10ZQ2_TRIEI Length = 44 Score = 47.5 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 2/40 (5%) Query: 128 HNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 N LRQ ++RL RK+LSFSK + H + I HY Sbjct: 1 MNNTLRQRISRLVRKTLSFSKKLRSHLG--DIWYFINHYN 38 >UniRef50_A9VUP9 Integrase catalytic region n=149 Tax=Bacteria RepID=A9VUP9_BACWK Length = 235 Score = 47.1 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 44/151 (29%), Gaps = 27/151 (17%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW P ++ + S +DE Y+ K + +L+ A D T Sbjct: 52 RWVHQYGPQLEEKVRHHLKST-------NDSWRVDE--TYIKVKGQWMYLYRAVDSKGNT 102 Query: 70 VVAHVFGERTMATLGRLMSLLSPF----DVVIWMTDGWPLYE---SRLKGKLH------- 115 + H+ R F + D P Y LK + H Sbjct: 103 IDFHLSKSRDKQAAKCFFKKALAFSYVSKPRVITVDKNPAYPVAIQALKEEKHMPEGIKL 162 Query: 116 VISKRYTQRIERHNLNLRQHLARLGRKSLSF 146 + +E+ + + + + R L F Sbjct: 163 RQVRYLNNIVEQDH----RFIKKRVRSMLGF 189 >UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 RepID=B2TXL7_SHIB3 Length = 44 Score = 47.1 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 32/34 (94%), Positives = 32/34 (94%) Query: 134 QHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 HLARLGRKSLSFSKSVE HDKVIGHYLNIKHYQ Sbjct: 11 THLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 44 >UniRef50_A3NK27 IS6 family transposase n=29 Tax=Burkholderia RepID=A3NK27_BURP6 Length = 242 Score = 46.7 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 39/139 (28%), Gaps = 15/139 (10%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVI 97 +DE +V + L+ A D + + R A R + Sbjct: 75 TWHLDEM--FVNLRGEPWLLWRAVDEHGAELDILLQKRRDKAAAKRSFQRVLRSCPAPCN 132 Query: 98 WMTDGWPLYES------RLKGKLHVISKRY---TQRIERHNLNLRQHLARL--GRKSLSF 146 +TD Y + L HV K R E + R+ R+ R Sbjct: 133 IVTDQLRSYPAAKAGIPELANVKHVFVKAAARVNNRAENSHQPTRERERRMRGFRDPKRT 192 Query: 147 SKSVEQHDKVIGHYLNIKH 165 + + H+ +H Sbjct: 193 QAFLASFGPIRQHFALKRH 211 >UniRef50_A9EF82 Transposase, putative n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EF82_9RHOB Length = 158 Score = 46.3 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 29/138 (21%), Positives = 46/138 (33%), Gaps = 21/138 (15%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFD----VVIW 98 MDE YV R +L+ A D+ + + + R M S + Sbjct: 1 MDE--TYVRVNGRWCYLWRAVDQRGQLIDFRLTARRDANAARAFMRQASETARCYYPMTI 58 Query: 99 MTDGWPLYESRL--------KGK--LHVISKRYTQRIERHNLNLRQ--HLARLGRK---S 143 +TD Y + + HV K RIE + L+Q R RK + Sbjct: 59 VTDKAHSYAKVIEEMNLGNGPDERIRHVDRKYLNNRIEADHAALKQLLRPKRSFRKLTAA 118 Query: 144 LSFSKSVEQHDKVIGHYL 161 + K +E H + + Sbjct: 119 KNTLKGIETHRAIKKGHF 136 >UniRef50_B9K4C6 Transposase n=4 Tax=Proteobacteria RepID=B9K4C6_AGRVS Length = 346 Score = 45.9 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 60/172 (34%), Gaps = 31/172 (18%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW P +K Q +P + E YV + + R+++ A D+ Sbjct: 159 RWVLAYAPMIEKRLRQF----RRPHCGSVRVNE-----TYVKIRGKWRYVYRAIDKHGNP 209 Query: 70 VVAHVFGERTMATLGRLMSLLSP----FDVVIWMTDGWPLYESRL----------KGKLH 115 V + +R + R + TDG + S + +H Sbjct: 210 VDFLLTAKRDLDAAKRFFRKMLKDEPLLSPNKIGTDGANTFPSAIKTLVDSGLLHPDPVH 269 Query: 116 VISKRYTQRIERHNLNLRQHLARL--------GRKSLSFSKSVEQHDKVIGH 159 +K Q IE + L++++ ++ R++++ +++ K G+ Sbjct: 270 YATKHLQQGIESDHFRLKKNMPKIGGVQSFNTARRTIAGFQAMLWLRKGFGY 321 >UniRef50_Q6MAQ6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MAQ6_PARUW Length = 83 Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 20/52 (38%), Gaps = 3/52 (5%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDV---IVCAEMDEQWGYVGAKSRQRWL 59 W P+ + +++ + E+DE+W +V K +WL Sbjct: 16 WLLDFINFIINDLPEDLNAQVTCHEKNELEVAKLEVDERWSFVRNKENDQWL 67 >UniRef50_A9VUQ5 Integrase catalytic region n=24 Tax=Bacteria RepID=A9VUQ5_BACWK Length = 235 Score = 45.6 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 40/144 (27%), Gaps = 23/144 (15%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW P K + +DE Y+ K + +L+ A D T Sbjct: 52 RWVHQYGPELDKRIRSHLK-------QTNDSWRVDE--TYIKVKGQWMYLYRAVDSKGNT 102 Query: 70 VVAHVFGERTMATLGRLMSLLSP----FDVVIWMTDGWPLY---------ESRLKGKLH- 115 + ++ R + D P Y E + G + Sbjct: 103 IDFYLSKTRDQKAAKHFFKKALQSFHVSKPPVITVDKNPAYPIAIEQLKKEKSIPGGMRL 162 Query: 116 VISKRYTQRIERHNLNLRQHLARL 139 K +E+ + +++ + + Sbjct: 163 RQQKYLNNIVEQDHRFIKKRIRSM 186 >UniRef50_B6ET23 Transposase n=7 Tax=Gammaproteobacteria RepID=B6ET23_ALISL Length = 246 Score = 45.2 bits (105), Expect = 7e-04, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 53/175 (30%), Gaps = 30/175 (17%) Query: 4 NRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAY 63 NR RW P +K ++ ++DE YV K + +L+ A Sbjct: 45 NRSTIYRWFIEYSPKLRKKLRRNYQF-----IKTDSSWQLDE--TYVKVKGKWHYLYRAI 97 Query: 64 DRLRKTVVAHVFGERTMATLGRLMSLLSPF-----DVVIWMTDGWPLYESRLK------- 111 ++ +T+ + +R + + + TD Y + + Sbjct: 98 NKQGETLDFYFSHKRNKEAAYQFLKRCLRYYDIDNQPKTLNTDKHSSYANAIARLKKEGR 157 Query: 112 ---GKLHVISKRYTQRIERHNLNLRQ--------HLARLGRKSLSFSKSVEQHDK 155 K IE + +++ L + ++ +S+ +K Sbjct: 158 LREDVEQRQVKCLNNGIESDHAPIKKLIVAAGGFKLRKRAWSTIQGFESLRMLNK 212 >UniRef50_Q6MBH4 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBH4_PARUW Length = 82 Score = 45.2 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 24/69 (34%), Gaps = 4/69 (5%) Query: 77 ERTMATLGRLM-SLLSPFDVVIWMTDGWPLYESRLKGKLH-VISK--RYTQRIERHNLNL 132 R T L ++ TD + Y + H +SK T IE+ N L Sbjct: 8 PRDKKTAELLFAKRPESLKKALYFTDKFNAYYETILWSKHQAVSKLSGQTSYIEKFNFTL 67 Query: 133 RQHLARLGR 141 + + + + Sbjct: 68 KTKVCKFCK 76 >UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellular organisms RepID=Q64DF0_9ARCH Length = 337 Score = 45.2 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 53/190 (27%), Gaps = 62/190 (32%) Query: 39 VCAEMDEQWGYVGAK----SRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF- 93 + E DE + V W DR + + G++ + + L+ Sbjct: 115 LVIEGDEFYTKVDKNVPAEQSSGWTIVLMDRASRFIWELSCGKKDRSLFENAIETLAELV 174 Query: 94 ---DVVIWMTDGWPLYESRL---------------------------------------- 110 + +TDG Y L Sbjct: 175 VQTKDITLLTDGERRYGKILFEICHELLLTGKPGRPKKTLKKGVTVRVKNKGSQTHKKGR 234 Query: 111 ------------KGKLHVISKRYT--QRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 + IS + T +E +N +R+ + RK+ +++KS ++ Sbjct: 235 KKPKYQTTCPQHPETSNNISDKETHANHVEANNSAMRRKCSAYRRKTNTYAKSETGLQRI 294 Query: 157 IGHYLNIKHY 166 + Y I ++ Sbjct: 295 LNVYWVIHNF 304 >UniRef50_B8IVA0 Transposase and inactivated derivatives-like protein n=38 Tax=Bacteria RepID=B8IVA0_METNO Length = 346 Score = 44.8 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 57/166 (34%), Gaps = 25/166 (15%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW P ++ ++ +DE Y+ + + R+L+ A D+ + Sbjct: 63 RWVLAYAPAIERRLR-----MLRKPHCG--SVRVDE--TYICIRGQWRYLYRAIDKHGEP 113 Query: 70 VVAHVFGERTMATLGRLMSLLSPFD----VVIWMTDGWPLYESRLKGKL----------H 115 V + R + R + + TDG Y + H Sbjct: 114 VDFLLTAHRDLDAAKRFFRKMLKEEPLLAPDRIGTDGAGPYPPAIAESHEEGLLPRAPTH 173 Query: 116 VISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVEQHDKVIGH 159 ++K Q IE + +++ + R+ R + ++++ + ++ Sbjct: 174 HVTKHLQQGIESDHFRVKRPMPRVGGFRSFTTGRRTIQGFEAMLWL 219 >UniRef50_B9K4Z7 Transposase n=5 Tax=Bacteria RepID=B9K4Z7_AGRVS Length = 104 Score = 44.8 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 10/61 (16%), Positives = 20/61 (32%), Gaps = 2/61 (3%) Query: 107 ESRLKGKLHVISKRYTQRIERHN--LNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIK 164 + H K R E + R+ + + + + + V HD V + + Sbjct: 17 RDIMPDVEHRSHKGLNNRAENSHQPTRRRERIMKGFKSARHLQRFVSIHDPVANLFHIPR 76 Query: 165 H 165 H Sbjct: 77 H 77 >UniRef50_Q6MD18 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MD18_PARUW Length = 89 Score = 44.8 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 14/32 (43%), Positives = 21/32 (65%) Query: 130 LNLRQHLARLGRKSLSFSKSVEQHDKVIGHYL 161 L LR AR RK+LSFSK + H ++I +++ Sbjct: 46 LLLRHRYARFVRKTLSFSKKLTNHIELIKYFI 77 >UniRef50_Q64DD5 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos18F2 RepID=Q64DD5_9ARCH Length = 230 Score = 44.4 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 53/190 (27%), Gaps = 62/190 (32%) Query: 39 VCAEMDEQWGYVGAK----SRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF- 93 + E DE + V W DR + + G++ + + L+ Sbjct: 8 LVIEGDEFYTKVDKNVPAEQSSGWTIVLMDRASRFIWELSCGKKDRSLFENAIETLAELV 67 Query: 94 ---DVVIWMTDGWPLYESRL---------------------------------------- 110 + +TDG Y L Sbjct: 68 VQTKDITLLTDGERRYGKILFEICHELLLTGKPGRPKKTLKKGVTVRVKNKGSQTHKKGR 127 Query: 111 ------------KGKLHVISKRYT--QRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKV 156 + IS + T +E +N +R+ + RK+ +++KS ++ Sbjct: 128 KKPKYQTTCPQHPETSNNISDKETHANHVEANNSAMRRKCSAYRRKTNTYAKSETGLQRI 187 Query: 157 IGHYLNIKHY 166 + Y I ++ Sbjct: 188 LNVYWVIHNF 197 >UniRef50_C8SCF9 Integrase catalytic region n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCF9_FERPL Length = 357 Score = 44.4 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 43/137 (31%), Gaps = 19/137 (13%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTV 70 W + KL + V S D +D+ + K ++ D + + Sbjct: 172 WIKRYV----KLVNEFVKS---VNLDSSARWHVDD--SVIKVKGDHIRIWTLLDSETRFI 222 Query: 71 VAHVFGE-RTMATLGRLMSLLSPF--DVVIWMTDGWPLYESR----LKGKLHVISK---R 120 +A + R +L+ ++DG YE G +H+ S Sbjct: 223 LAIHISKSRGAEEALKLLKKGLEVSKKPEEIVSDGLKSYEKAISQKFSGVIHIQSSLREG 282 Query: 121 YTQRIERHNLNLRQHLA 137 R ER L++ + Sbjct: 283 LNNRAERFFKELKRRVK 299 >UniRef50_Q8PWV9 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PWV9_METMA Length = 150 Score = 44.0 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 16/49 (32%), Positives = 24/49 (48%) Query: 118 SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 S T +ER NL RQ R+ RK++ FSK ++ I Y ++ Sbjct: 54 SDISTSLLERQNLTFRQDNNRISRKTIGFSKKIKCLYNQIRLYSTYFNF 102 >UniRef50_A9EG16 Transposase n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EG16_9RHOB Length = 182 Score = 44.0 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 30/102 (29%), Gaps = 8/102 (7%) Query: 25 QSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLG 84 Q + + V +DE + K WL A D + + R Sbjct: 32 QYAKAIRRGRLKVADKWHLDEVVLPINVKK--YWLCRAVDSKGDVLDILIQSRRNKRAAI 89 Query: 85 RLMSLL--SPFDVVIWMTDGWPLYESRL----KGKLHVISKR 120 R L + + + +TD Y + L G H K Sbjct: 90 RFFRKLFKAFGEPRVIVTDKLKSYGAALKELAPGIEHRSHKG 131 >UniRef50_C6IUV9 Transposase n=4 Tax=Bacteroides RepID=C6IUV9_9BACE Length = 571 Score = 43.2 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 37/106 (34%), Gaps = 14/106 (13%) Query: 11 WPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQW--GYVGAKSRQRWLFYAYDRLRK 68 W KL P +Q G++V +DE W + K R+ +++ +R + Sbjct: 273 WADKGAMQLNKLIPALKKIALQDGANVN----VDETWLRYHAYNKKRKTYMWCLVNRKAR 328 Query: 69 TVVAHVFGERTMATLGR--------LMSLLSPFDVVIWMTDGWPLY 106 V+ + + L L + +DG+ +Y Sbjct: 329 IVIFFYEDTTDDEGVQKHGGRNRNVLKEFLGDAKIKSLQSDGYNVY 374 >UniRef50_A8LH39 Integrase catalytic region n=26 Tax=Bacteria RepID=A8LH39_FRASN Length = 262 Score = 42.9 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 47/158 (29%), Gaps = 17/158 (10%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW Q P V + +DE YV R +++ A D+ + Sbjct: 86 RWVQRF-------TPLLVEAARPCRHRSGDRWFVDE--TYVKVAGRWTYVYRAVDQHGQV 136 Query: 70 VVAHVFGERTMATLGRLMSLLS--PFDVVIWMTDGWPLYES----RLKGKLHVISKRYTQ 123 + R A R V TD P+Y L HV + R Sbjct: 137 IDVLASARRDQAAARRFFVRALSHGHRPVEVTTDKAPVYPRVLDEFLPEACHVDAARENN 196 Query: 124 RIERHNLNLRQHLA--RLGRKSLSFSKSVEQHDKVIGH 159 RIE + L+ L R ++ S H V Sbjct: 197 RIEADHGRLKARLRPMRGLKRLRSAQTISAGHAFVQNI 234 >UniRef50_A0YAP3 Transposase n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YAP3_9GAMM Length = 133 Score = 42.9 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 32/97 (32%), Gaps = 9/97 (9%) Query: 78 RTMATLGRLMSLL---SPFDVVIWMTDGWPLYES----RLKGKLHVISKRYTQRIERHNL 130 R A R L S ++ +TD Y + +H + R E+ + Sbjct: 2 RDGAAAKRFSKRLVRSSGTELRKIVTDTLQSYGVAHRGFIPDTIHSNQQYENNRAEQSHK 61 Query: 131 --NLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKH 165 +R+ R + + + + H V + +H Sbjct: 62 ATRVRERGMRKFKSAKQAQRFLGAHAAVSNLFNLGRH 98 >UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synechococcus sp. PCC 7335 RepID=B4WVD1_9SYNE Length = 298 Score = 42.9 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 33/115 (28%), Gaps = 10/115 (8%) Query: 30 RIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSL 89 R + V +DE ++ K +L+ D + + R M + Sbjct: 127 RTKRKGKVGNVWLIDE--TFIRVKGVWCYLYRGIDEDGNLMDVRLSKTRDMVGTKAFFAQ 184 Query: 90 LSPF---DVVIWMTDGWPLYESRLKGK-----LHVISKRYTQRIERHNLNLRQHL 136 TDG Y +K + H + +E+ + ++ Sbjct: 185 ALGLHEDAPEKIATDGLASYPRAIKEELGKNVEHEVRPCTANPVEQSHRRIKHRY 239 >UniRef50_A0NLN9 Putative IS6 family transposase n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NLN9_9RHOB Length = 127 Score = 42.5 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 27/92 (29%), Gaps = 8/92 (8%) Query: 77 ERTMATLGRLMSLL--SPFDVVIWMTDGWPLYESRL----KGKLHVISKRYTQRIERHNL 130 R R L + + +TD Y + H K R+E + Sbjct: 4 RRNAKAARRFFGALVTQFGEPRVVVTDKLRSYTKPVQVLAPDADHRAHKGLNNRVENSHR 63 Query: 131 NLRQHLARLGR--KSLSFSKSVEQHDKVIGHY 160 R+ GR + + +D++ + Sbjct: 64 PTRKREKIFGRFKSPRHAQRFLSANDQIKTIF 95 >UniRef50_A0LBE3 Putative uncharacterized protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0LBE3_MAGSM Length = 116 Score = 42.1 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 18/57 (31%), Positives = 27/57 (47%) Query: 110 LKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 L + V T +ER+N R A GRK+L FSK + H+ V+ I ++ Sbjct: 13 LLNRSTVSETIKTAFVERNNATDRHQNAHKGRKTLCFSKGWDVHNAVMVFVAYIYNF 69 >UniRef50_Q2GA88 Transposase, putative n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2GA88_NOVAD Length = 110 Score = 42.1 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 12/80 (15%), Positives = 25/80 (31%), Gaps = 6/80 (7%) Query: 87 MSLLSPFDVVIWMTDGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARL--G 140 +L T G Y + + I + R+E +L R+ + Sbjct: 3 KALQRHGSPERVTTGGLRSYRAAMTELGCEDEQEIGRWANNRVENSHLPFRRRERAMLRF 62 Query: 141 RKSLSFSKSVEQHDKVIGHY 160 R+ + + H + H+ Sbjct: 63 RQMKALQEFALMHASLHNHF 82 >UniRef50_UPI00016C51C4 hypothetical protein GobsU_02291 n=6 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C51C4 Length = 298 Score = 41.7 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 20/51 (39%) Query: 116 VISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 V + T +ERHN R +R RK FSK + H ++ Sbjct: 107 VSTVVNTCFVERHNGTDRNRCSRKVRKGYGFSKDWDTHRAATAFRYFSDNF 157 >UniRef50_A3W3V5 Putative IS6 family transposase n=1 Tax=Roseovarius sp. 217 RepID=A3W3V5_9RHOB Length = 211 Score = 41.7 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 32/128 (25%), Gaps = 27/128 (21%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW 98 +DE V R WL+ A D+ + Sbjct: 75 DVWRLDEVV--VKIAGRSFWLWRAVDQHGVVLE-------------------EILQPKRI 113 Query: 99 MTDGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVEQ 152 +TD Y G H K R E ++L R+ + R + + V Sbjct: 114 ITDKLRSYGAAKREVAPGLDHWSHKGLNNRAENNHLPFRKRERVMQGFRSPGALQRFVSI 173 Query: 153 HDKVIGHY 160 + Sbjct: 174 QSATRNCF 181 >UniRef50_B5WJN4 Integrase, catalytic region n=1 Tax=Burkholderia sp. H160 RepID=B5WJN4_9BURK Length = 239 Score = 41.7 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 51/170 (30%), Gaps = 28/170 (16%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW Q P F K + G +DE Y+ + R +L+ A DR +T Sbjct: 56 RWVQRYAPEFVKRWNRF-------GVPTGQSWRVDE--TYLKVRGRWVYLYRAVDRAGQT 106 Query: 70 VVAHVFGERTMATLGRLMSLLSPFD---VVIWMTDGWPLYESR--------LKGKLHVI- 117 V + + + DG+ L + + Sbjct: 107 VDFMLRAKGDVKAAKGFFRKALKHQGQPPKTITLDGYAASHRAVREMKEDGLPPEDTRVR 166 Query: 118 -SKRYTQRIERHNLNLRQHL------ARLGRKSLSFSKSVEQHDKVIGHY 160 SK IE+ + N++ + RL +++ + G + Sbjct: 167 SSKYLNDLIEQDHRNIKSRITVMLGFKRLRSATIALAGIELMLRIRKGQF 216 >UniRef50_B9LVP3 Transposase n=20 Tax=Halobacteriaceae RepID=B9LVP3_HALLT Length = 226 Score = 41.3 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 51/162 (31%), Gaps = 9/162 (5%) Query: 12 PQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVV 71 Q + + S P +DE V WL+ A D K ++ Sbjct: 54 VQRSHQAIFQWVHRVADSVPDPPEAQPKRVAVDE--TAVKINGEWSWLYAAIDLDTKLIL 111 Query: 72 AH-VFGERTMATLGRLMSLLSP---FDVVIWMTDGWPLYESRLKGKLHVISKRYTQR--I 125 +FG + LS +++ DG+ Y++ L + YT R I Sbjct: 112 GVDLFGSHGTDPAAAFLHRLSEKHDLSEAVFLVDGFG-YQTALARLGLSGRRDYTDRNLI 170 Query: 126 ERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHYQ 167 E+ L+ + R + SV HY N + Sbjct: 171 EKWFQTLKMRIDRFHNSWVGSRSSVRSWCSQFTHYYNRQRPH 212 >UniRef50_C2JSP7 IS431mec transposase n=2 Tax=Enterococcus faecalis RepID=C2JSP7_ENTFA Length = 229 Score = 40.9 bits (94), Expect = 0.017, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 35/109 (32%), Gaps = 14/109 (12%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIW 98 +DE YV K + R+L+ A D T+ + R + L Sbjct: 73 WRIDE--TYVKVKGQDRYLYRAIDSKGNTLDMWLRNHRDTVSTKAFFKRLIRVYGQPRSI 130 Query: 99 MTDGWPLYESRLK----------GKLHVISKRYTQRIERHNLNLRQHLA 137 +TD + +K H SK +E+ + L+ L Sbjct: 131 VTDKYAPSLKAIKELKEEGILYQKVKHWKSKYLNNILEQDHRQLKGKLP 179 >UniRef50_A7C324 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C324_9GAMM Length = 137 Score = 40.2 bits (92), Expect = 0.023, Method: Composition-based stats. Identities = 17/50 (34%), Positives = 25/50 (50%) Query: 117 ISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEQHDKVIGHYLNIKHY 166 + T IER NL LRQH++ L RK+L + K ++ L +Y Sbjct: 36 TKTQNTSFIERFNLTLRQHVSYLTRKTLGYCKKKANFKYILWINLYNYNY 85 >UniRef50_A5L0K3 Putative transposase n=1 Tax=Vibrionales bacterium SWAT-3 RepID=A5L0K3_9GAMM Length = 173 Score = 39.8 bits (91), Expect = 0.030, Method: Composition-based stats. Identities = 22/112 (19%), Positives = 39/112 (34%), Gaps = 13/112 (11%) Query: 4 NRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAY 63 NR RW H P K + +RI ++DE YV K + +L+ A Sbjct: 45 NRSTIYRWFIHYGPILHKKLRRYQFTRIDTS------WQLDE--TYVKVKGKWHYLYRAI 96 Query: 64 DRLRKTVVAHVFGERTMATLGRLMSLL-----SPFDVVIWMTDGWPLYESRL 110 ++ +T+ +R + + + F TD Y + + Sbjct: 97 NKRGETLDFFFSRKRNKEAAYQFLKRCLRRYKTSFHPQTLNTDKHSSYGNAI 148 >UniRef50_Q8R819 Transposase n=2 Tax=Thermoanaerobacter tengcongensis RepID=Q8R819_THETN Length = 455 Score = 39.8 bits (91), Expect = 0.033, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 49/146 (33%), Gaps = 30/146 (20%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGE-RTMATLGRLMSLLSPF---DVVIW 98 +DE Y+ K + +LF A D LR +++ R ++ + I Sbjct: 276 IDE--TYIKVKGKWHYLFTACDGLRGFIISQHLSPHRDALAALTILKEVIDRYNNREFIL 333 Query: 99 MTDGWPLYESRL----------------------KGKLHVISKRYTQRIERHNLNLRQHL 136 +TD P+Y+ + G + Y R+ER + + H Sbjct: 334 VTDKAPIYDVAVHFASVFFGANIRHRPVLGISPPPGGDSHTYRPYKNRLERLFGSYKAHY 393 Query: 137 ARLGRKSLSFSKSVEQHDKVIGHYLN 162 R KS S + H + Y N Sbjct: 394 KR--HKSFSSFEGAVAHALLYQLYYN 417 >UniRef50_C6KV49 Transposase n=2 Tax=Bacteria RepID=C6KV49_9BACT Length = 244 Score = 39.8 bits (91), Expect = 0.033, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 53/157 (33%), Gaps = 26/157 (16%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW Q P K + + V +DE + VG K R +L+ A D + Sbjct: 59 RWVQKFGPELTKRTEKHLR-------RASVDWHVDETYIRVGGKWR--YLWRAVDANGQM 109 Query: 70 VVAHVFGERTMAT----LGRLMSLLSPFDVVIWMTDGWPLYESRLKG-----------KL 114 + + R L + + + V +TD P Y ++ Sbjct: 110 IDFRLTARRDAKAAKAFLNKAIERVRLHRPVTIVTDKAPTYRRVIREINCRYDPHFDSIR 169 Query: 115 HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 H+ K IE + +++ L R+S +S + Sbjct: 170 HIDKKWRNNLIESDHAAMKRILG--YRQSFRSLRSAK 204 >UniRef50_Q1J2T9 Transposase n=3 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J2T9_DEIGD Length = 251 Score = 39.4 bits (90), Expect = 0.041, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 41/165 (24%), Gaps = 24/165 (14%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 +W P K +DE VG + WL+ A D Sbjct: 72 QWNMKFAPLLTKELRHR-------EPRRGSRWHLDEVCVKVGG--VKHWLWRAVDDRGDV 122 Query: 70 VVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWPLYE---SRLK------GKLHVIS 118 + + R L + TD Y L V + Sbjct: 123 LDILLQEHRDTEAAKSFFVRLLSEYDVPEVIHTDKLWSYGAAPRELPVLRTAEHVQVVST 182 Query: 119 KRYTQRIERHNLNLRQHLAR---LGRKSLSFSKSVEQHDKVIGHY 160 R +E + RQ R+ + + + H +V + Sbjct: 183 SRCNNLVEHSHRPTRQQERAQLGFKRRPRT-QEFLAPHARVSNLH 226 >UniRef50_P0C1L0 Transposase for insertion sequence-like element IS431mec n=404 Tax=Bacteria RepID=T431_STAA8 Length = 224 Score = 38.6 bits (88), Expect = 0.072, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 44/141 (31%), Gaps = 20/141 (14%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW Q P + + +DE Y+ K + +L+ A D T Sbjct: 49 RWVQEYAPILYQ-------IWKKKHKKAYYKWRIDE--TYIKIKGKWSYLYRAIDAEGHT 99 Query: 70 VVAHVFGERTMATLGRLMSLL--SPFDVVIWMTDGWPLYESR---------LKGKLHVIS 118 + + +R + + L +TD P + LK H S Sbjct: 100 LDIWLRKQRDNHSAYAFIKRLIKQFGKPQKVITDQAPSTKVAMAKVIKAFKLKPDCHCTS 159 Query: 119 KRYTQRIERHNLNLRQHLARL 139 K IE+ + +++ R Sbjct: 160 KYLNNLIEQDHRHIKVRKTRY 180 >UniRef50_C2SK84 Transposase for insertion sequence element IS257 in transposon Tn4003 n=24 Tax=Bacillus RepID=C2SK84_BACCE Length = 248 Score = 38.6 bits (88), Expect = 0.078, Method: Composition-based stats. Identities = 26/127 (20%), Positives = 40/127 (31%), Gaps = 17/127 (13%) Query: 32 QPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLS 91 + +DE Y+ K R+L+ A D+ T+ + +R M L Sbjct: 64 VKNKSARLSWHLDE--TYIKVKGEWRYLYRAIDKDGNTLDIQLRKKRDYRAAYAFMKRLV 121 Query: 92 PF--DVVIWMTDGWPLYESRLKG---------KLHVISKRYTQRIERHNLNLRQHLARLG 140 + TD P LK H K IE+ + +H+ R Sbjct: 122 KTFGGPTVLTTDKAPALLCALKKLKEQGFYKHTTHCTIKHLNNLIEQDH----RHVKRRF 177 Query: 141 RKSLSFS 147 KS F Sbjct: 178 AKSAGFQ 184 >UniRef50_Q1WCQ1 COG3316 n=1 Tax=Streptococcus thermophilus RepID=Q1WCQ1_STRTR Length = 119 Score = 38.6 bits (88), Expect = 0.079, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 43/121 (35%), Gaps = 15/121 (12%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMT 100 MDE Y+ K R +L++A D T+ + +R + L + +T Sbjct: 1 MDE--TYIKIKGRWHYLYHAIDADGLTLDIWLRKKRDTQAAYAFLKRLHKQFGQPRVIVT 58 Query: 101 DGWPLYESRLK---------GKLHVISKRYTQRIERHNLNLR--QHLARLGRKSLSFSKS 149 D P S + H K IE+ + ++ + R ++ F +S Sbjct: 59 DKAPSIGSAFRKLQCNGLYTKTEHRTVKYLNHLIEQDHRPIKLIEQDHRPIKRRNKFYRS 118 Query: 150 V 150 + Sbjct: 119 L 119 >UniRef50_B2JY75 Integrase catalytic region n=5 Tax=Burkholderia RepID=B2JY75_BURP8 Length = 236 Score = 38.6 bits (88), Expect = 0.086, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 43/139 (30%), Gaps = 22/139 (15%) Query: 10 RWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKT 69 RW + P F K + T +DE Y+ + R +L+ A DR +T Sbjct: 56 RWVRRYTPEFIKRWNRFAT-------PAGRSWRVDE--TYLKIRGRWVYLYRAVDRAGQT 106 Query: 70 VVAHVFGERTMATLGRLMSLLSPFD---VVIWMTDGWPLYESRLK----------GKLHV 116 V + R +A S DG+ ++ Sbjct: 107 VDFMLRARRDVAAAKAFFSQAIKRQGQPPETITLDGYAASHRAVREMKTDGLLPEDTKVR 166 Query: 117 ISKRYTQRIERHNLNLRQH 135 SK IE+ + +++ Sbjct: 167 SSKYLNNVIEQDHRHIKSR 185 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.306 0.124 0.355 Lambda K H 0.267 0.0379 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,007,243,028 Number of Sequences: 3077464 Number of extensions: 35581356 Number of successful extensions: 133358 Number of sequences better than 1.0e-01: 149 Number of HSP's better than 0.1 without gapping: 182 Number of HSP's successfully gapped in prelim test: 109 Number of HSP's that attempted gapping in prelim test: 132964 Number of HSP's gapped (non-prelim): 309 length of query: 167 length of database: 1,040,396,356 effective HSP length: 119 effective length of query: 48 effective length of database: 674,178,140 effective search space: 32360550720 effective search space used: 32360550720 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.7 bits) S2: 88 (38.6 bits)