BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (167 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=r...   338   6e-92
UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep...   139   4e-32
UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepI...   125   7e-28
UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Ta...   124   8e-28
UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ...   101   1e-20
UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone...   100   2e-20
UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO       94   2e-18
UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis...    79   4e-14
UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C...    79   6e-14
UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID...
78 9e-14 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 77 2e-13 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 76 4e-13 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 75 7e-13 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 72 7e-12 UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldoc... 72 9e-12 UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 Rep... 70 2e-11 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 66 3e-10 UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyan... 65 1e-09 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 65 1e-09 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 65 1e-09 UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis R... 56 4e-07 UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1... 56 5e-07 UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 55 1e-06 UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ru... 47 3e-04 UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis Rep... 46 6e-04 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 44 0.002 UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methano... 44 0.003 UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pest... 44 0.003 UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida... 43 0.004 UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryoc... 42 0.007 UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-14... 42 0.007 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 42 0.007 UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanoba... 42 0.009 UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8... 41 0.013 UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methan... 41 0.018 UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC... 
40 0.024 UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryoc... 40 0.026 UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN 40 0.033 UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus p... 39 0.049 >UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=root RepID=INSB4_ECOLI Length = 167 Score = 338 bits (866), Expect = 6e-92, Method: Compositional matrix adjust. Identities = 162/167 (97%), Positives = 163/167 (97%) Query: 1 MPGNSPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 MPGNSPHYGRWPQHDF KKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF Sbjct: 1 MPGNSPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 Query: 61 YAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 YAYD LRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR Sbjct: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 Query: 121 YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 YTQRIER+NLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ Sbjct: 121 YTQRIERYNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 139 bits (349), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 65/144 (45%), Positives = 97/144 (67%) Query: 18 SLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGE 77 +LKKL P+ +TS +DV E+DEQW YVG+K+RQ W++YAY++ V+A+ FG Sbjct: 82 TLKKLAPKRITSSPVTHADVAFICELDEQWSYVGSKARQHWIWYAYNTKTGGVLAYTFGP 141 Query: 78 RTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLA 137 RT T L++LL+PF++ + +D W Y + H+ K +TQ IER+NL LR + Sbjct: 142 RTDQTCRELLALLTPFNIGMLTSDDWGSYGREVPKNKHLTGKIFTQCIERNNLTLRTRIK 201 Query: 138 RLGRKSLSFSKSVELHDKVIGHYL 161 RLGRK++ FS+SVE+H+KVIG ++ Sbjct: 202 RLGRKTICFSRSVEIHEKVIGAFI 225 >UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepID=B2K0W2_YERPB Length = 122 Score = 125 bits (313), Expect = 7e-28, Method: Compositional matrix adjust. 
Identities = 60/120 (50%), Positives = 83/120 (69%), Gaps = 1/120 (0%) Query: 47 WGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLY 106 W +VG K +QRWL+YA++ K ++AHVFG R+ T +L+ LLS F++V W TD + Y Sbjct: 2 WSFVGNKKQQRWLWYAWEPRLKRIIAHVFGRRSKKTFRQLLGLLSGFNIVFWCTDNFSAY 61 Query: 107 ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHY 166 E L + H+ SK YTQRIER NLN+R L RL RK+L SKS E+HD++IG ++ +HY Sbjct: 62 EM-LPDEKHIRSKLYTQRIERENLNIRNRLKRLNRKTLGDSKSAEMHDRIIGTFIEREHY 120 >UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Tax=Gammaproteobacteria RepID=INBN_SHIDY Length = 131 Score = 124 bits (312), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 57/126 (45%), Positives = 87/126 (69%), Gaps = 1/126 (0%) Query: 37 VIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVV 96 ++C E+DEQW +VG+K+RQ WL+YAY++ V+A+ FG RT T L++LL+PF++ Sbjct: 2 ALIC-ELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCRELLALLTPFNIG 60 Query: 97 IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKV 156 + +D W Y + H+ K +TQRIER+NL LR + RL RK++ FS+SVE+H+KV Sbjct: 61 MLTSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRIKRLARKTICFSRSVEIHEKV 120 Query: 157 IGHYLN 162 IG ++ Sbjct: 121 IGTFIE 126 >UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ0_EDWI9 Length = 78 Score = 101 bits (251), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 45/78 (57%), Positives = 57/78 (73%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + F++ +MTD WP+Y + L HV+SK+YTQRIERHNLNLR HL RL R+++ FS S Sbjct: 1 MRKFNIAFYMTDAWPVYRTLLDPAHHVVSKKYTQRIERHNLNLRTHLKRLTRRTICFSNS 60 Query: 150 VELHDKVIGHYLNIKHYQ 167 E+HDKVIG YL I HY Sbjct: 61 EEMHDKVIGWYLTINHYH 78 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 100 bits (248), Expect = 2e-20, Method: Compositional matrix adjust. 
Identities = 45/46 (97%), Positives = 45/46 (97%) Query: 19 LKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 64 L KLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD Sbjct: 68 LNKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 113 >UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO Length = 138 Score = 93.6 bits (231), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 48/122 (39%), Positives = 73/122 (59%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 AE+D+ +V K +RWL++A D T++A+V G+RT +L ++L PF + + T Sbjct: 8 AEVDKMKIFVAKKEHERWLWHAIDHQTGTILAYVLGQRTDQMFLKLKTMLKPFGISEFYT 67 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 D W Y+ L + +SK Q+IER +L LR + RL RK++ FSK +HD VIG Y Sbjct: 68 DNWGSYKRHLSDEQRTVSKYKMQKIERKHLTLRTRIKRLQRKTICFSKISPMHDLVIGLY 127 Query: 161 LN 162 +N Sbjct: 128 IN 129 >UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1IXR6_DEIGD Length = 148 Score = 79.3 bits (194), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 53/132 (40%), Positives = 73/132 (55%), Gaps = 6/132 (4%) Query: 25 QSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLG 84 Q+V + P +V+V E+DE W +VG K + RWL+ A + + V+A V G+R+ T Sbjct: 3 QTVPVCLTPPEEVVV--ELDELWTFVGKKKQARWLWIALERSTRKVLAWVLGDRSEQTAF 60 Query: 85 RLMSLL--SPFDVV--IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLG 140 +L L SP + + TD W Y+ L G + K T +ER N LRQ L RL Sbjct: 61 KLWDRLPLSPEQRLKGTFCTDLWRAYDEPLLGVKRLTRKGETNHVERLNCTLRQRLGRLV 120 Query: 141 RKSLSFSKSVEL 152 RKSLSFSKS E+ Sbjct: 121 RKSLSFSKSDEM 132 >UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C5BB57_EDWI9 Length = 131 Score = 79.0 bits (193), Expect = 6e-14, Method: Compositional matrix adjust. 
Identities = 39/101 (38%), Positives = 63/101 (62%) Query: 50 VGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESR 109 +G+K+RQ WL+YAY++ V+A+ FG +T + L+ L++PF++ + +D Sbjct: 1 MGSKARQHWLWYAYNTKTGGVLAYTFGPKTDESCRELLVLITPFNIGMITSDNRSSDGRE 60 Query: 110 LKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + H+ K TQRI R+NL LR H+ RL RK++ FS+SV Sbjct: 61 VPKDKHLTGKILTQRIVRNNLTLRTHIKRLARKTICFSRSV 101 >UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID=Q8VSP6_SHIFL Length = 67 Score = 78.2 bits (191), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 38/64 (59%), Positives = 47/64 (73%) Query: 104 PLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNI 163 P+Y + L HVISK+ TQRIERHNLNLR HL RL RK++ FSKS ++H K+IG YL I Sbjct: 4 PVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSDDMHYKIIGWYLTI 63 Query: 164 KHYQ 167 H+ Sbjct: 64 NHHH 67 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 77.0 bits (188), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 50/145 (34%), Positives = 72/145 (49%), Gaps = 12/145 (8%) Query: 9 GRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRK 68 G W Q K R Q+V +VI EMDE W YVG+K ++ W+++A + Sbjct: 81 GEWIQAYHNQNKPKRRQAV--------EVI---EMDEMWHYVGSKKKKLWIWFALERSGG 129 Query: 69 TVVAHVFGERTMATLGRLMSLLSPFDV-VIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 +++ V G R +T RL + + TD WP Y + H +SK+ T IE Sbjct: 130 SILDFVTGSREASTGKRLWIKIKDIACRSFYATDHWPAYTQFINAHKHKVSKKQTTHIES 189 Query: 128 HNLNLRQHLARLGRKSLSFSKSVEL 152 HN N+R +LAR RK+ +SKS L Sbjct: 190 HNANVRHYLARFRRKTKCYSKSERL 214 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 75.9 bits (185), Expect = 4e-13, Method: Compositional matrix adjust. 
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 2/114 (1%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVI--WM 99 E+DE W +VG KS + WL YA+D + K ++++V+G+R T+ RL L + Sbjct: 104 EIDEFWTFVGRKSERVWLIYAFDRVSKKIISYVWGKRNSETVMRLKIQLCKSQISFRYVY 163 Query: 100 TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELH 153 +D W + KG H + ++YT IE ++ LR + R RKS +FSKS++ H Sbjct: 164 SDRWICFRKIFKGYPHYLGRKYTIGIEGNHCLLRHRVRRFFRKSCNFSKSLKYH 217 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 75.1 bits (183), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 44/120 (36%), Positives = 61/120 (50%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTD 101 E+DE Y+G+K WL YA D KTVV+ +RT TL R++ L + T Sbjct: 108 EVDEMCTYIGSKQNFIWLVYALDKNSKTVVSFNVAKRTNKTLSRVLDTLKLSEAKKIFTG 167 Query: 102 GWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 Y L K+H + + T IER NL LR HL RL R+++ SKS+ + V+ Y Sbjct: 168 RLKNYRYLLDEKMHSVKRFGTNHIERKNLTLRTHLKRLNRRTICSSKSLLIFTAVLKIYF 227 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 72.0 bits (175), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 45/141 (31%), Positives = 74/141 (52%), Gaps = 12/141 (8%) Query: 33 PGSDVI-VCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMA--------TL 83 P +VI E+DE +VG+K + WL+ A + + ++A V G+ ++ T Sbjct: 89 PEENVIPEVGELDELETFVGSKKTKIWLWTAVNHFTQGILAWVLGDHSLVLSEVEVAETF 148 Query: 84 GRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKS 143 L + + ++TDGW +Y S + ++SK Y R+E N LR +LARL RK+ Sbjct: 149 KPLWENIEKWKCYFYVTDGWKVYPSFIPDGDQIVSKTYMTRVENENTRLRHYLARLHRKT 208 Query: 144 LSFSKSVEL---HDKVIGHYL 161 L +SKS ++ K++ HYL Sbjct: 209 LCYSKSEQILRYSIKLLLHYL 229 >UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldococcus infernus ME RepID=C5U8R9_9EURY Length = 133 Score = 71.6 bits (174), Expect = 9e-12, Method: Compositional matrix adjust. 
Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 3/119 (2%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWM 99 E+DE +V +K + W++ A D ++AH G+R+ +L +L+ + D + Sbjct: 7 EIDEMHSFVRSKDNKVWIWIAVDKNTGLIIAHKTGDRSDKSLKKLLKEIPKKVLDKCTFY 66 Query: 100 TDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIG 158 TD W Y L + H I K YT+R+ER L R ARL R+ + +SKS+E+H+ +I Sbjct: 67 TDKWKAYNI-LPNERHKIGKEYTRRVERTFLTFRNSCARLVRRGIRYSKSMEMHNIIID 124 >UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 RepID=B2TXL7_SHIB3 Length = 44 Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 33/33 (100%), Positives = 33/33 (100%) Query: 135 HLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 HLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ Sbjct: 12 HLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 44 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 66.2 bits (160), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 43/120 (35%), Positives = 66/120 (55%), Gaps = 2/120 (1%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW- 98 C E+DE W +VG K+ ++WL YAY +VA+V+G+R + T+ +L + L V Sbjct: 102 CLEIDELWTFVGKKTNKQWLIYAYHRDTGEIVAYVWGKRDLNTVKKLKAKLKALGVSCAR 161 Query: 99 -MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVI 157 +D W + + KG VI K +T IE +N +R + R R+S +FSK +E H K Sbjct: 162 IASDTWDSFVTGFKGFTQVIGKFFTVGIEGNNCTIRHRVRRAFRRSCNFSKKLENHFKAF 221 >UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyanobacteria RepID=B4WT39_9SYNE Length = 243 Score = 64.7 bits (156), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 52/149 (34%), Positives = 70/149 (46%), Gaps = 26/149 (17%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYD-SLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIW 98 E DE W +VG+KS ++W++ A + R+T+ H+ G R L + L P + Sbjct: 93 ECDEAWSFVGSKSNKQWIWLAINRDTRETIGMHI-GGRNREGARSLWACLPPVYRQCAVC 151 Query: 99 MTDGWP-----------------LYESRLKGKLH-VISKR--YTQRIERHNLNLRQHLAR 138 TD W YE L K H +SK T IER N LRQ ++R Sbjct: 152 YTDFWERCDPASLCGARERAPRQAYEIVLPSKRHRAVSKNSGQTNHIERFNCTLRQRVSR 211 Query: 139 LGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 L RKSLSFSK +E H I ++ I HY Sbjct: 212 LVRKSLSFSKKLENHIGAIWYF--IHHYN 238 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 64.7 bits (156), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 39/131 (29%), Positives = 65/131 (49%) Query: 31 IQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL 90 ++P + E+DE Y +K+ +RW+ AY K V+ + G RT TL ++ L Sbjct: 91 VKPPIPQNITIEIDELKTYTQSKTNERWVVAAYCRETKKVIDYKLGRRTTKTLQCIIDTL 150 Query: 91 SPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + +D +Y + LH +R T IER L+LR H+ RLGRKS++ ++ Sbjct: 151 LYANPKKIYSDRLNIYPKLIPKHLHSTKRRETNHIERKFLDLRTHIKRLGRKSINKAQRD 210 Query: 151 ELHDKVIGHYL 161 + D ++ Y Sbjct: 211 KYTDAILRIYF 221 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 64.7 bits (156), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 36/113 (31%), Positives = 59/113 (52%), Gaps = 6/113 (5%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMS---LLSPFDVVIW 98 E+DE W ++G K W+ YA + +V+ G +T + L++ LL P + Sbjct: 97 EVDELWSFIGNKKNSTWITYAIEQKTGSVIDFFVGRKTKENIKPLINKVLLLQPTRI--- 153 Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 TD +Y S + ++H + T +IER NL LR H+ RL R+++ FS+ E Sbjct: 154 YTDRLNIYPSLIPKEMHKRFQYCTNKIERMNLTLRTHIKRLSRRTICFSRKQE 206 >UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis RepID=B2SG01_FRATM Length = 102 Score = 56.2 bits (134), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 2/103 (1%) Query: 47 WGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLY 106 W ++G+K + W+ AYD + V G R AT RL + + TD W + Sbjct: 2 WNFIGSK--KCWIIKAYDRRVGKTIIWVTGGRDNATFRRLYKKVQHLTNCNFYTDDWVAF 59 Query: 107 ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 L K H+I K T IER N N R +LAR+ R++ S+S Sbjct: 60 VEVLPKKRHIIGKSGTVAIERDNSNTRHNLARMTRRTKVISRS 102 >UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1Z5_ACAM1 Length = 130 Score = 55.8 bits (133), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 7/125 (5%) Query: 47 WGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWP 104 W +V KS ++W++ A D + + +V G R+ +L + L + TD W Sbjct: 2 WSFVNDKSNKQWIWLALDVITREIVGVYVGARSKQGARQLWNSLPGIYRQCAVAYTDFWD 61 Query: 105 LYESRLKGKLH-VISKRYTQR--IERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 Y + H + K Q IER N +RQ ++RL RK+LSFSK +E H I ++ Sbjct: 62 AYGCVFPKQRHQAVGKETGQTCYIERFNCTMRQRVSRLVRKTLSFSKKLENHIGAI--WM 119 Query: 162 NIKHY 166 + HY Sbjct: 120 FVHHY 124 >UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 Length = 138 Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 3/108 (2%) Query: 48 GYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYE 107 + + + WL+ AYD + ++ G R TL RL+ L+ + V + TD W Y+ Sbjct: 2 AFSSGQKNKLWLWKAYDRVTGRLIDWELGNRDSQTLSRLLERLAKWKVTVSCTDDWRPYQ 61 Query: 108 SRLK---GKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEL 152 L H ISKR T IER+N + R LAR R + S+S + Sbjct: 62 QLLDEHPDAFHGISKRETVGIERNNSDNRHWLARFHRPTKVISRSAHM 109 >UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N0_SALRD Length = 158 Score = 46.6 bits (109), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 6/129 (4%) Query: 26 SVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGR 85 SV ++P + V E+DE W YV ++ +RWL+ A + VVA V G+R+ T R Sbjct: 11 SVAEGLRPAEEGDVL-ELDECWTYVRERANKRWLWVALCRRTRQVVAFVIGDRSARTCAR 69 Query: 86 LMSLL-SPFDVVIWMTDGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARLG 140 L S + + +D W Y G + S +ER LRQ LAR Sbjct: 70 LWSRIPEEYRQGRSFSDFWKSYRPVFAGDPSHRQVGKSSGEMAHVERFFGRLRQKLARYV 129 Query: 141 RKSLSFSKS 149 R++ + S+S Sbjct: 130 RRTRAASES 138 >UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis RepID=Q1CBA9_YERPA Length = 85 Score = 45.8 bits (107), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 1/61 (1%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 +T F T +L+ LLS F++V W TD + YE L + H+ SK YTQRIER Sbjct: 22 QTYYCSYFWSSEQKTFRQLLGLLSGFNIVFWCTDNFSAYE-MLPDEKHIRSKLYTQRIER 80 Query: 128 H 128 Sbjct: 81 E 81 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 34/116 (29%), Positives = 56/116 (48%), Gaps = 12/116 (10%) Query: 43 MDEQWGYV----GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW 98 DE W Y+ G+K W+ +++L + G+R T L++ L +V Sbjct: 172 FDESWTYLRVRHGSKRENLWI---WNALADGLPFFTTGDRDYKTFSFLLNSLPKSEV--N 226 Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHD 154 TD + +Y+ HV SK+YT +E +N R HLARL R + + ++S + D Sbjct: 227 YTDDYSVYQVLDN---HVASKKYTYTVESYNSYCRAHLARLARDTRAVNRSERMVD 279 >UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46GF8_METBF Length = 112 Score = 43.5 bits (101), Expect = 0.003, Method: Compositional matrix adjust. Identities = 30/88 (34%), Positives = 42/88 (47%) Query: 65 SLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQR 124 L K + FG R T + L ++ MTD W Y L +H SK T Sbjct: 5 ELGKKFINCSFGSRGTETGQLIWEKLKQKEIGEVMTDHWRAYAEFLPENIHTQSKAETYT 64 Query: 125 IERHNLNLRQHLARLGRKSLSFSKSVEL 152 +E +N LR LARL RK+ ++KS+E+ Sbjct: 65 VEGYNGILRHFLARLRRKTKCYTKSIEM 92 >UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pestis RepID=C4GXL2_YERPN Length = 111 Score = 43.5 bits (101), Expect = 0.003, Method: Compositional matrix adjust. Identities = 16/33 (48%), Positives = 25/33 (75%) Query: 47 WGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERT 79 W +VG K +QRWL+YA++ K ++AH+FG R+ Sbjct: 2 WSFVGNKKQQRWLWYAWEPRLKRIIAHIFGRRS 34 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. 
Identities = 24/79 (30%), Positives = 39/79 (49%), Gaps = 2/79 (2%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDV--IVCAEMDEQWGYVGAKSRQRWLFYAYDSLRK 68 W L K P+++ + + +D +V E+DE W YVG+K+ +WL+ S + Sbjct: 71 WLLEFIGELTKELPENLNAEVVSENDELEVVVLEVDELWSYVGSKANPQWLWLVMHSKTR 130 Query: 69 TVVAHVFGERTMATLGRLM 87 VVA G R T +L+ Sbjct: 131 QVVAMQIGPRNKETAEKLL 149 >UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZKR3_ACAM1 Length = 241 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. Identities = 27/72 (37%), Positives = 36/72 (50%), Gaps = 6/72 (8%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLM-----SLLSPFDVV 96 EMDE+ GYV K +Q W A D+ K ++ G R + RLM L P D+V Sbjct: 126 EMDERHGYVAIKQQQCWDAVAIDAASKFIIQVEVGPRNTNLIDRLMRATHKRLAHPRDLV 185 Query: 97 IWMTDGWPLYES 108 + MTDG Y + Sbjct: 186 L-MTDGDASYRT 196 >UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B7X2_EDWI9 Length = 99 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. Identities = 26/75 (34%), Positives = 41/75 (54%), Gaps = 12/75 (16%) Query: 90 LSPFDVVIWMTDGW--PLYE----SRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKS 143 L+ F++ + D W P+ E L G + +TQ ER++L LR + RL RK Sbjct: 3 LTAFNIGMITRDDWGNPIREVPWGKPLTGTI------FTQHSERNSLMLRTRIKRLARKR 56 Query: 144 LSFSKSVELHDKVIG 158 + FS+++ LH+KV G Sbjct: 57 IGFSRAIALHEKVTG 71 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. 
Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 8/82 (9%) Query: 59 LFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVIS 118 L YAY R ++++ E+T +L+ L+ F+VV W TD + Y + L H Sbjct: 41 LEYAY---RACHCSYIWNEKT---FRKLLKKLASFNVVFWCTDNFKTY-NLLPKSQHRAG 93 Query: 119 KRYTQRIERHNLNLRQHLARLG 140 K +TQ IER NL +R + RL Sbjct: 94 KIFTQHIERENL-MRTRIKRLN 114 >UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=B0CCX7_ACAM1 Length = 196 Score = 41.6 bits (96), Expect = 0.009, Method: Compositional matrix adjust. Identities = 23/52 (44%), Positives = 29/52 (55%), Gaps = 1/52 (1%) Query: 98 WMTDGWPLYESRLKGK-LHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK 148 W TDGW Y +L + +H +SK TQR+ER N LRQ R R+ F K Sbjct: 93 WQTDGWEGYSRQLADEVIHHVSKALTQRLERTNGILRQQTGRWHRRQNKFGK 144 >UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8_SHIDS Length = 94 Score = 41.2 bits (95), Expect = 0.013, Method: Compositional matrix adjust. Identities = 17/31 (54%), Positives = 25/31 (80%) Query: 125 IERHNLNLRQHLARLGRKSLSFSKSVELHDK 155 +ER+NL LR + RL RK++ FS+SVE+H+K Sbjct: 35 LERNNLPLRTRIKRLARKTICFSRSVEIHEK 65 >UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methanosarcina RepID=Q46CV2_METBF Length = 75 Score = 40.8 bits (94), Expect = 0.018, Method: Compositional matrix adjust. Identities = 22/54 (40%), Positives = 31/54 (57%) Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEL 152 MTD W Y L +H SK T +E +N L+ LARL RK+ ++KS+E+ Sbjct: 2 MTDHWRAYAEFLPENIHTQSKAETYTVEGYNGILKHFLARLRRKTKCYTKSIEM 55 >UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFR2_MICAE Length = 122 Score = 40.4 bits (93), Expect = 0.024, Method: Compositional matrix adjust. 
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 3/63 (4%) Query: 94 DVVIWMTDGWPLYESRLKGKLH-VISKRY--TQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + TD W Y++ + K H + K T IER N RQ ++RL R+SLSFSK + Sbjct: 25 QCAVAYTDCWESYKTGIPSKRHRPVGKETGQTNPIERLNNTFRQRISRLVRESLSFSKKM 84 Query: 151 ELH 153 E H Sbjct: 85 ENH 87 >UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryochloris marina MBIC11017 RepID=B0CEC0_ACAM1 Length = 172 Score = 40.0 bits (92), Expect = 0.026, Method: Compositional matrix adjust. Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 1/52 (1%) Query: 98 WMTDGWPLYESRLKGKL-HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK 148 W TDGW Y +L ++ H +SK TQR+ER N +RQ R R+ F K Sbjct: 63 WQTDGWEGYARQLPDEVVHEVSKALTQRLERTNGIVRQQTGRWHRRQNKFGK 114 >UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN Length = 111 Score = 39.7 bits (91), Expect = 0.033, Method: Compositional matrix adjust. Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 5/91 (5%) Query: 76 GERTMATLGRLMSLLSPF--DVVIWMTDGWPLYESRLKGKLHVISKRYTQR---IERHNL 130 G+R+ + +L + L + TD W Y++ + K H + T + IER N Sbjct: 11 GDRSRQSAKKLWASLPGVYRQCAVAYTDFWESYKTVIPSKRHRPVGKETGQTNPIERLNN 70 Query: 131 NLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 RQ ++RL R+SLSFSK +E H + +++ Sbjct: 71 TFRQRISRLVRESLSFSKKMENHVGAVWYFI 101 >UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SAB2_FERPL Length = 75 Score = 39.3 bits (90), Expect = 0.049, Method: Compositional matrix adjust. 
Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 4/71 (5%) Query: 96 VIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHD- 154 I+ TD W Y + + K +I K T +ER L LR R RKS+ FSKS+E+ + Sbjct: 3 AIFYTDRWDAY-NLIPYKQRIIKKGGTNHVERLFLTLRNDNPRFARKSIRFSKSIEMLEN 61 Query: 155 --KVIGHYLNI 163 K+ HY N+ Sbjct: 62 SLKLWIHYYNL 72 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=r... 254 5e-67 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 209 3e-53 UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Ta... 189 3e-47 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 168 8e-41 UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepI... 160 1e-38 UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO 160 1e-38 UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis... 160 1e-38 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 153 2e-36 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 153 3e-36 UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyan... 149 3e-35 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 149 5e-35 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 147 1e-34 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 145 3e-34 UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1... 143 3e-33 UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldoc... 140 1e-32 UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ru... 139 3e-32 UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C... 136 2e-31 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 
                                                                     129   3e-29
UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3      128   8e-29
UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis R...  122   6e-27
UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ...  115   6e-25
UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI...  111   1e-23
UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID...   95   8e-19
UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone...   77   3e-13
UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis Rep...   74   2e-12
UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 Rep...   60   3e-08

Sequences not found previously or not previously below threshold:

UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN      92   7e-18
UniRef50_Q9CJQ7 Putative uncharacterized protein n=2 Tax=Pasteur...   90   4e-17
UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methano...   88   1e-16
UniRef50_Q10VW0 ISSru3, transposase InsB n=1 Tax=Trichodesmium e...   77   2e-13
UniRef50_O67144 Putative uncharacterized protein n=1 Tax=Aquifex...   76   4e-13
UniRef50_C0A223 Putative uncharacterized protein n=1 Tax=Opituta...   74   3e-12
UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida...   71   1e-11
UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC...   70   3e-11
UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum...   69   5e-11
UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus p...   69   6e-11
UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methan...   67   2e-10
UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl...   66   5e-10
UniRef50_Q32DI9 Iso-IS1 ORF2 n=2 Tax=Shigella RepID=Q32DI9_SHIDS      66   5e-10
UniRef50_Q6MCH2 Putative uncharacterized protein n=1 Tax=Candida...   64   2e-09
UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanoba...   64   2e-09
UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-14...   63   2e-09
UniRef50_D1JFE2 Putative uncharacterized protein n=3 Tax=uncultu...   63   4e-09
UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan...   63   4e-09
UniRef50_Q6MBQ1 Putative uncharacterized protein n=2 Tax=Candida...   62   7e-09
UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryoc...   62   7e-09
UniRef50_Q10ZU2 Putative uncharacterized protein n=3 Tax=Trichod...   62   1e-08
UniRef50_UPI00016C465A IS1 transposase n=1 Tax=Gemmata obscurigl...   61   1e-08
UniRef50_B0URB1 Putative uncharacterized protein n=1 Tax=Methylo...   61   1e-08
UniRef50_A8GX98 Transposase and inactivated derivative n=2 Tax=R...   60   3e-08
UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ...   60   4e-08
UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pest...   59   6e-08
UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros...   58   8e-08
UniRef50_A9GLN9 Putative uncharacterized protein n=2 Tax=Sorangi...   57   2e-07
UniRef50_A9GLP8 Putative uncharacterized protein n=1 Tax=Sorangi...   57   2e-07
UniRef50_Q2FSQ2 Putative uncharacterized protein n=1 Tax=Methano...   57   3e-07
UniRef50_A9FJP3 Putative uncharacterized protein n=5 Tax=Proteob...   56   6e-07
UniRef50_Q972H6 Putative uncharacterized protein ST1154 n=1 Tax=...   54   1e-06
UniRef50_Q7NJH9 Gsl1853 protein n=1 Tax=Gloeobacter violaceus Re...   54   1e-06
UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re...   54   2e-06
UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8...   53   3e-06
UniRef50_Q648U8 Putative uncharacterized protein n=6 Tax=environ...   53   5e-06
UniRef50_Q64CQ0 Putative uncharacterized protein n=1 Tax=uncultu...   52   9e-06
UniRef50_UPI00018554DD transposase n=1 Tax=Francisella novicida ...   51   2e-05
UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryoc...   51   2e-05
UniRef50_B0CAP5 Putative uncharacterized protein n=3 Tax=Acaryoc...   49   5e-05
UniRef50_Q8PRQ0 Putative uncharacterized protein n=1 Tax=Methano...   49   7e-05
UniRef50_Q649W7 Putative uncharacterized protein n=1 Tax=uncultu...   48   1e-04
UniRef50_Q6MCX8 Putative uncharacterized protein n=2 Tax=Candida...   47   2e-04
UniRef50_Q10ZQ2 Putative uncharacterized protein n=7 Tax=Cyanoba...   46   4e-04
UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae...   46   4e-04
UniRef50_Q6MD18 Putative uncharacterized protein n=2 Tax=Candida...   45   0.001
UniRef50_A4AD66 Transposase n=19 Tax=unclassified Gammaproteobac...   45   0.001
UniRef50_C7DAC3 Transposase n=36 Tax=Rhodobacterales RepID=C7DAC...   44   0.002
UniRef50_B9K3D6 Transposase n=32 Tax=Bacteria RepID=B9K3D6_AGRVS      43   0.003
UniRef50_Q218S2 Putative uncharacterized protein n=1 Tax=Rhodops...   43   0.003
UniRef50_Q0W4E9 Putative uncharacterized protein n=1 Tax=uncultu...   43   0.004
UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q...   42   0.005
UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiat...   42   0.009
UniRef50_Q469A1 Putative uncharacterized protein n=1 Tax=Methano...   42   0.010
UniRef50_B9K4Q6 Transposase n=2 Tax=Alphaproteobacteria RepID=B9...   41   0.017
UniRef50_UPI00016C5273 transposase, unclassified family protein ...   40   0.023
UniRef50_B9K5F7 Transposase n=3 Tax=Bacteria RepID=B9K5F7_AGRVS       40   0.028
UniRef50_A0LBE3 Putative uncharacterized protein n=1 Tax=Magneto...   40   0.030
UniRef50_Q11MN9 Transposase n=37 Tax=Bacteria RepID=Q11MN9_MESSB      40   0.031
UniRef50_A7C324 Putative uncharacterized protein n=3 Tax=Beggiat...   40   0.034
UniRef50_A9EF44 Transposase n=2 Tax=Rhodobacteraceae RepID=A9EF4...   40   0.040
UniRef50_Q2G895 Transposase n=36 Tax=Alphaproteobacteria RepID=Q...   39   0.049
UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=...   39   0.059
UniRef50_A5N1B9 Transposase n=2 Tax=Clostridium kluyveri RepID=A...   39   0.065
UniRef50_C6IUV9 Transposase n=4 Tax=Bacteroides RepID=C6IUV9_9BACE    38   0.082

>UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=root RepID=INSB4_ECOLI
Length = 167
Score = 254 bits (650), Expect = 5e-67, Method: Composition-based stats.
Identities = 162/167 (97%), Positives = 163/167 (97%) Query: 1 MPGNSPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 MPGNSPHYGRWPQHDF KKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF Sbjct: 1 MPGNSPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 Query: 61 YAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 YAYD LRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR Sbjct: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 Query: 121 YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 YTQRIER+NLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ Sbjct: 121 YTQRIERYNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 209 bits (532), Expect = 3e-53, Method: Composition-based stats. Identities = 65/150 (43%), Positives = 98/150 (65%) Query: 18 SLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGE 77 +LKKL P+ +TS +DV E+DEQW YVG+K+RQ W++YAY++ V+A+ FG Sbjct: 82 TLKKLAPKRITSSPVTHADVAFICELDEQWSYVGSKARQHWIWYAYNTKTGGVLAYTFGP 141 Query: 78 RTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLA 137 RT T L++LL+PF++ + +D W Y + H+ K +TQ IER+NL LR + Sbjct: 142 RTDQTCRELLALLTPFNIGMLTSDDWGSYGREVPKNKHLTGKIFTQCIERNNLTLRTRIK 201 Query: 138 RLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 RLGRK++ FS+SVE+H+KVIG ++ + Sbjct: 202 RLGRKTICFSRSVEIHEKVIGAFIEKHMFY 231 >UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Tax=Gammaproteobacteria RepID=INBN_SHIDY Length = 131 Score = 189 bits (481), Expect = 3e-47, Method: Composition-based stats. 
Identities = 56/129 (43%), Positives = 86/129 (66%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW 98 + E+DEQW +VG+K+RQ WL+YAY++ V+A+ FG RT T L++LL+PF++ + Sbjct: 3 LICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCRELLALLTPFNIGML 62 Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIG 158 +D W Y + H+ K +TQRIER+NL LR + RL RK++ FS+SVE+H+KVIG Sbjct: 63 TSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRIKRLARKTICFSRSVEIHEKVIG 122 Query: 159 HYLNIKHYQ 167 ++ + Sbjct: 123 TFIEKHMFY 131 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 168 bits (425), Expect = 8e-41, Method: Composition-based stats. Identities = 48/150 (32%), Positives = 71/150 (47%), Gaps = 12/150 (8%) Query: 9 GRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRK 68 G W Q K R Q+V EMDE W YVG+K ++ W+++A + Sbjct: 81 GEWIQAYHNQNKPKRRQAV-----------EVIEMDEMWHYVGSKKKKLWIWFALERSGG 129 Query: 69 TVVAHVFGERTMATLGRLMSLLSPFDV-VIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 +++ V G R +T RL + + TD WP Y + H +SK+ T IE Sbjct: 130 SILDFVTGSREASTGKRLWIKIKDIACRSFYATDHWPAYTQFINAHKHKVSKKQTTHIES 189 Query: 128 HNLNLRQHLARLGRKSLSFSKSVELHDKVI 157 HN N+R +LAR RK+ +SKS L + + Sbjct: 190 HNANVRHYLARFRRKTKCYSKSERLVELSL 219 >UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepID=B2K0W2_YERPB Length = 122 Score = 160 bits (406), Expect = 1e-38, Method: Composition-based stats. 
Identities = 60/121 (49%), Positives = 83/121 (68%), Gaps = 1/121 (0%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPL 105 W +VG K +QRWL+YA++ K ++AHVFG R+ T +L+ LLS F++V W TD + Sbjct: 1 MWSFVGNKKQQRWLWYAWEPRLKRIIAHVFGRRSKKTFRQLLGLLSGFNIVFWCTDNFSA 60 Query: 106 YESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKH 165 YE L + H+ SK YTQRIER NLN+R L RL RK+L SKS E+HD++IG ++ +H Sbjct: 61 YE-MLPDEKHIRSKLYTQRIERENLNIRNRLKRLNRKTLGDSKSAEMHDRIIGTFIEREH 119 Query: 166 Y 166 Y Sbjct: 120 Y 120 >UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO Length = 138 Score = 160 bits (405), Expect = 1e-38, Method: Composition-based stats. Identities = 48/127 (37%), Positives = 74/127 (58%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 AE+D+ +V K +RWL++A D T++A+V G+RT +L ++L PF + + T Sbjct: 8 AEVDKMKIFVAKKEHERWLWHAIDHQTGTILAYVLGQRTDQMFLKLKTMLKPFGISEFYT 67 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 D W Y+ L + +SK Q+IER +L LR + RL RK++ FSK +HD VIG Y Sbjct: 68 DNWGSYKRHLSDEQRTVSKYKMQKIERKHLTLRTRIKRLQRKTICFSKISPMHDLVIGLY 127 Query: 161 LNIKHYQ 167 +N + Sbjct: 128 INKYEFH 134 >UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1IXR6_DEIGD Length = 148 Score = 160 bits (405), Expect = 1e-38, Method: Composition-based stats. 
Identities = 53/148 (35%), Positives = 74/148 (50%), Gaps = 8/148 (5%) Query: 24 PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATL 83 Q+V + P +V+V E+DE W +VG K + RWL+ A + + V+A V G+R+ T Sbjct: 2 RQTVPVCLTPPEEVVV--ELDELWTFVGKKKQARWLWIALERSTRKVLAWVLGDRSEQTA 59 Query: 84 GRLMSLLSPFD----VVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARL 139 +L L + TD W Y+ L G + K T +ER N LRQ L RL Sbjct: 60 FKLWDRLPLSPEQRLKGTFCTDLWRAYDEPLLGVKRLTRKGETNHVERLNCTLRQRLGRL 119 Query: 140 GRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 RKSLSFSKS E+ + + L Y Sbjct: 120 VRKSLSFSKSDEMLEASLT--LAFHRYN 145 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 153 bits (387), Expect = 2e-36, Method: Composition-based stats. Identities = 45/142 (31%), Positives = 71/142 (50%), Gaps = 12/142 (8%) Query: 32 QPGSDVI-VCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERT--------MAT 82 P +VI E+DE +VG+K + WL+ A + + ++A V G+ + T Sbjct: 88 VPEENVIPEVGELDELETFVGSKKTKIWLWTAVNHFTQGILAWVLGDHSLVLSEVEVAET 147 Query: 83 LGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRK 142 L + + ++TDGW +Y S + ++SK Y R+E N LR +LARL RK Sbjct: 148 FKPLWENIEKWKCYFYVTDGWKVYPSFIPDGDQIVSKTYMTRVENENTRLRHYLARLHRK 207 Query: 143 SLSFSKSV---ELHDKVIGHYL 161 +L +SKS K++ HYL Sbjct: 208 TLCYSKSEQILRYSIKLLLHYL 229 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 153 bits (386), Expect = 3e-36, Method: Composition-based stats. 
Identities = 42/131 (32%), Positives = 67/131 (51%), Gaps = 2/131 (1%) Query: 33 PGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSP 92 P E+DE W +VG KS + WL YA+D + K ++++V+G+R T+ RL L Sbjct: 95 PHHCFYESIEIDEFWTFVGRKSERVWLIYAFDRVSKKIISYVWGKRNSETVMRLKIQLCK 154 Query: 93 FDVVI--WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + +D W + KG H + ++YT IE ++ LR + R RKS +FSKS+ Sbjct: 155 SQISFRYVYSDRWICFRKIFKGYPHYLGRKYTIGIEGNHCLLRHRVRRFFRKSCNFSKSL 214 Query: 151 ELHDKVIGHYL 161 + H + Sbjct: 215 KYHFSAFRLMI 225 >UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyanobacteria RepID=B4WT39_9SYNE Length = 243 Score = 149 bits (376), Expect = 3e-35, Method: Composition-based stats. Identities = 53/179 (29%), Positives = 75/179 (41%), Gaps = 27/179 (15%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTV 70 W Q + P+ + + G + E DE W +VG+KS ++W++ A + + Sbjct: 65 WLQQYASEEYADVPRQAKTSPKKGP---LTLECDEAWSFVGSKSNKQWIWLAINRDTRET 121 Query: 71 VAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWP-----------------LYESRLK 111 + G R L + L P + TD W YE L Sbjct: 122 IGMHIGGRNREGARSLWACLPPVYRQCAVCYTDFWERCDPASLCGARERAPRQAYEIVLP 181 Query: 112 GKLH-VISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 K H +SK T IER N LRQ ++RL RKSLSFSK +E H I ++ I HY Sbjct: 182 SKRHRAVSKNSGQTNHIERFNCTLRQRVSRLVRKSLSFSKKLENHIGAIWYF--IHHYN 238 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 149 bits (375), Expect = 5e-35, Method: Composition-based stats. 
Identities = 41/145 (28%), Positives = 71/145 (48%) Query: 17 TSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFG 76 T++ K + + ++P + E+DE Y +K+ +RW+ AY K V+ + G Sbjct: 77 TTVLKKILKIASKVVKPPIPQNITIEIDELKTYTQSKTNERWVVAAYCRETKKVIDYKLG 136 Query: 77 ERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL 136 RT TL ++ L + +D +Y + LH +R T IER L+LR H+ Sbjct: 137 RRTTKTLQCIIDTLLYANPKKIYSDRLNIYPKLIPKHLHSTKRRETNHIERKFLDLRTHI 196 Query: 137 ARLGRKSLSFSKSVELHDKVIGHYL 161 RLGRKS++ ++ + D ++ Y Sbjct: 197 KRLGRKSINKAQRDKYTDAILRIYF 221 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 34/120 (28%), Positives = 59/120 (49%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTD 101 E+DE W ++G K W+ YA + +V+ G +T + L++ + TD Sbjct: 97 EVDELWSFIGNKKNSTWITYAIEQKTGSVIDFFVGRKTKENIKPLINKVLLLQPTRIYTD 156 Query: 102 GWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 +Y S + ++H + T +IER NL LR H+ RL R+++ FS+ E + + Y Sbjct: 157 RLNIYPSLIPKEMHKRFQYCTNKIERMNLTLRTHIKRLSRRTICFSRKQEYLEAHLKIYF 216 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 145 bits (367), Expect = 3e-34, Method: Composition-based stats. 
Identities = 47/132 (35%), Positives = 64/132 (48%) Query: 32 QPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLS 91 QP E+DE Y+G+K WL YA D KTVV+ +RT TL R++ L Sbjct: 98 QPIISKCKTYEVDEMCTYIGSKQNFIWLVYALDKNSKTVVSFNVAKRTNKTLSRVLDTLK 157 Query: 92 PFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 + T Y L K+H + + T IER NL LR HL RL R+++ SKS+ Sbjct: 158 LSEAKKIFTGRLKNYRYLLDEKMHSVKRFGTNHIERKNLTLRTHLKRLNRRTICSSKSLL 217 Query: 152 LHDKVIGHYLNI 163 + V+ Y I Sbjct: 218 IFTAVLKIYFWI 229 >UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1Z5_ACAM1 Length = 130 Score = 143 bits (360), Expect = 3e-33, Method: Composition-based stats. Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 7/127 (5%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGW 103 W +V KS ++W++ A D + + +V G R+ +L + L + TD W Sbjct: 1 MWSFVNDKSNKQWIWLALDVITREIVGVYVGARSKQGARQLWNSLPGIYRQCAVAYTDFW 60 Query: 104 PLYESRLKGKLH-VISKRYTQR--IERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 Y + H + K Q IER N +RQ ++RL RK+LSFSK +E H I + Sbjct: 61 DAYGCVFPKQRHQAVGKETGQTCYIERFNCTMRQRVSRLVRKTLSFSKKLENHIGAI--W 118 Query: 161 LNIKHYQ 167 + + HY Sbjct: 119 MFVHHYN 125 >UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldococcus infernus ME RepID=C5U8R9_9EURY Length = 133 Score = 140 bits (354), Expect = 1e-32, Method: Composition-based stats. 
Identities = 40/123 (32%), Positives = 66/123 (53%), Gaps = 3/123 (2%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSP--FDVV 96 + E+DE +V +K + W++ A D ++AH G+R+ +L +L+ + D Sbjct: 4 IHLEIDEMHSFVRSKDNKVWIWIAVDKNTGLIIAHKTGDRSDKSLKKLLKEIPKKVLDKC 63 Query: 97 IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKV 156 + TD W Y L + H I K YT+R+ER L R ARL R+ + +SKS+E+H+ + Sbjct: 64 TFYTDKWKAYN-ILPNERHKIGKEYTRRVERTFLTFRNSCARLVRRGIRYSKSMEMHNII 122 Query: 157 IGH 159 I Sbjct: 123 IDL 125 >UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N0_SALRD Length = 158 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 41/150 (27%), Positives = 63/150 (42%), Gaps = 6/150 (4%) Query: 18 SLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGE 77 K SV ++P + V E+DE W YV ++ +RWL+ A + VVA V G+ Sbjct: 3 QKKGRESDSVAEGLRPAEEGDVL-ELDECWTYVRERANKRWLWVALCRRTRQVVAFVIGD 61 Query: 78 RTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGK----LHVISKRYTQRIERHNLNL 132 R+ T RL S + + +D W Y G S +ER L Sbjct: 62 RSARTCARLWSRIPEEYRQGRSFSDFWKSYRPVFAGDPSHRQVGKSSGEMAHVERFFGRL 121 Query: 133 RQHLARLGRKSLSFSKSVELHDKVIGHYLN 162 RQ LAR R++ + S+S + ++ Sbjct: 122 RQKLARYVRRTRAASESERMLHLTTKLFVE 151 >UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C5BB57_EDWI9 Length = 131 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 40/106 (37%), Positives = 64/106 (60%) Query: 50 VGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESR 109 +G+K+RQ WL+YAY++ V+A+ FG +T + L+ L++PF++ + +D Sbjct: 1 MGSKARQHWLWYAYNTKTGGVLAYTFGPKTDESCRELLVLITPFNIGMITSDNRSSDGRE 60 Query: 110 LKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDK 155 + H+ K TQRI R+NL LR H+ RL RK++ FS+SV K Sbjct: 61 VPKDKHLTGKILTQRIVRNNLTLRTHIKRLARKTICFSRSVRSTKK 106 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. 
Identities = 44/138 (31%), Positives = 67/138 (48%), Gaps = 3/138 (2%) Query: 31 IQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATL--GRLMS 88 I P C E+DE W +VG K+ ++WL YAY +VA+V+G+R + T+ + Sbjct: 93 ITPKQRQYDCLEIDELWTFVGKKTNKQWLIYAYHRDTGEIVAYVWGKRDLNTVKKLKAKL 152 Query: 89 LLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK 148 +D W + + KG VI K +T IE +N +R + R R+S +FSK Sbjct: 153 KALGVSCARIASDTWDSFVTGFKGFTQVIGKFFTVGIEGNNCTIRHRVRRAFRRSCNFSK 212 Query: 149 SVELHDKVIGH-YLNIKH 165 +E H K + I H Sbjct: 213 KLENHFKAFDLAFFYINH 230 >UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 Length = 138 Score = 128 bits (321), Expect = 8e-29, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 3/116 (2%) Query: 48 GYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYE 107 + + + WL+ AYD + ++ G R TL RL+ L+ + V + TD W Y+ Sbjct: 2 AFSSGQKNKLWLWKAYDRVTGRLIDWELGNRDSQTLSRLLERLAKWKVTVSCTDDWRPYQ 61 Query: 108 SRL---KGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 L H ISKR T IER+N + R LAR R + S+S + + + + Sbjct: 62 QLLDEHPDAFHGISKRETVGIERNNSDNRHWLARFHRPTKVISRSAHMVNITMAIF 117 >UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis RepID=B2SG01_FRATM Length = 102 Score = 122 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 2/104 (1%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPL 105 W ++G+K + W+ AYD + V G R AT RL + + TD W Sbjct: 1 MWNFIGSK--KCWIIKAYDRRVGKTIIWVTGGRDNATFRRLYKKVQHLTNCNFYTDDWVA 58 Query: 106 YESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + L K H+I K T IER N N R +LAR+ R++ S+S Sbjct: 59 FVEVLPKKRHIIGKSGTVAIERDNSNTRHNLARMTRRTKVISRS 102 >UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ0_EDWI9 Length = 78 Score = 115 bits (288), Expect = 6e-25, Method: Composition-based stats. 
Identities = 45/78 (57%), Positives = 57/78 (73%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + F++ +MTD WP+Y + L HV+SK+YTQRIERHNLNLR HL RL R+++ FS S Sbjct: 1 MRKFNIAFYMTDAWPVYRTLLDPAHHVVSKKYTQRIERHNLNLRTHLKRLTRRTICFSNS 60 Query: 150 VELHDKVIGHYLNIKHYQ 167 E+HDKVIG YL I HY Sbjct: 61 EEMHDKVIGWYLTINHYH 78 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 54/116 (46%), Gaps = 12/116 (10%) Query: 43 MDEQWGYV----GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW 98 DE W Y+ G+K W++ A + G+R T L++ L +V Sbjct: 172 FDESWTYLRVRHGSKRENLWIWNAL---ADGLPFFTTGDRDYKTFSFLLNSLPKSEV--N 226 Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHD 154 TD + +Y+ HV SK+YT +E +N R HLARL R + + ++S + D Sbjct: 227 YTDDYSVYQVLDN---HVASKKYTYTVESYNSYCRAHLARLARDTRAVNRSERMVD 279 >UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID=Q8VSP6_SHIFL Length = 67 Score = 95.1 bits (235), Expect = 8e-19, Method: Composition-based stats. Identities = 38/64 (59%), Positives = 47/64 (73%) Query: 104 PLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNI 163 P+Y + L HVISK+ TQRIERHNLNLR HL RL RK++ FSKS ++H K+IG YL I Sbjct: 4 PVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSDDMHYKIIGWYLTI 63 Query: 164 KHYQ 167 H+ Sbjct: 64 NHHH 67 >UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN Length = 111 Score = 92.0 bits (227), Expect = 7e-18, Method: Composition-based stats. 
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 7/105 (6%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWPLYESRLKGKLHV-ISK--RYT 122 ++ + G+R+ + +L + L + TD W Y++ + K H + K T Sbjct: 3 GKLLVAMRGDRSRQSAKKLWASLPGVYRQCAVAYTDFWESYKTVIPSKRHRPVGKETGQT 62 Query: 123 QRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 IER N RQ ++RL R+SLSFSK +E H + ++ I Y Sbjct: 63 NPIERLNNTFRQRISRLVRESLSFSKKMENHVGAVWYF--IHDYN 105 >UniRef50_Q9CJQ7 Putative uncharacterized protein n=2 Tax=Pasteurellaceae RepID=Q9CJQ7_PASMU Length = 181 Score = 89.7 bits (221), Expect = 4e-17, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 51/118 (43%), Gaps = 5/118 (4%) Query: 47 WGYV--GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVV--IWMTDG 102 W +V ++ ++ Y + +VA V+G+R + T L L V D Sbjct: 62 WHFVPPNRIDQKYRIYIGYHAKTSEIVAFVWGKRDLQTALALKQRLKELKVSYERIAGDN 121 Query: 103 WPLYESRLKG-KLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGH 159 W + + + K++T+ IE +N +R L+R R+S FSKS+ H K Sbjct: 122 WDAFVNAFSDTGDQWVGKQHTKAIEGNNCRIRHRLSRAVRRSCCFSKSMFYHVKSFNI 179 >UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46GF8_METBF Length = 112 Score = 88.2 bits (217), Expect = 1e-16, Method: Composition-based stats. Identities = 29/85 (34%), Positives = 41/85 (48%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 K + FG R T + L ++ MTD W Y L +H SK T +E Sbjct: 8 KKFINCSFGSRGTETGQLIWEKLKQKEIGEVMTDHWRAYAEFLPENIHTQSKAETYTVEG 67 Query: 128 HNLNLRQHLARLGRKSLSFSKSVEL 152 +N LR LARL RK+ ++KS+E+ Sbjct: 68 YNGILRHFLARLRRKTKCYTKSIEM 92 >UniRef50_Q10VW0 ISSru3, transposase InsB n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VW0_TRIEI Length = 76 Score = 77.0 bits (188), Expect = 2e-13, Method: Composition-based stats. 
Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 2/73 (2%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGW 103 W +VG+K+ Q+W + A D K +VA GER +L + I TD W Sbjct: 1 MWSFVGSKNNQQWFWLAIDIETKEIVAFSLGERGEKGANQLWNSWPGIYRQCAICYTDFW 60 Query: 104 PLYESRLKGKLHV 116 Y+ + Sbjct: 61 SAYDVIFPHCRQL 73 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 76.6 bits (187), Expect = 3e-13, Method: Composition-based stats. Identities = 45/46 (97%), Positives = 45/46 (97%) Query: 19 LKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 64 L KLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD Sbjct: 68 LNKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 113 >UniRef50_O67144 Putative uncharacterized protein n=1 Tax=Aquifex aeolicus RepID=O67144_AQUAE Length = 147 Score = 76.2 bits (186), Expect = 4e-13, Method: Composition-based stats. Identities = 33/145 (22%), Positives = 65/145 (44%), Gaps = 7/145 (4%) Query: 24 PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF-YAYDSLRKTVVAHVF-GERTMA 81 P+ + ++ D + DE W YVG K + W++ + T+ +F G+R++ Sbjct: 4 PEYGSEKVVKTEDNMENKPTDEMWSYVGTKGNEVWIWSVVVELKDGTIKKFLFAGDRSLR 63 Query: 82 TLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRY-TQRIERHNLNLRQHLARLG 140 T ++++ + + + TD + +YE L H++ K R E + LR L Sbjct: 64 TFLKILAKMPEAE--EYETDAYRVYE-WLPRDRHIVRKYGRVNRNEALHSKLRDKLVAFK 120 Query: 141 RKSLSFSKSVELHDKVIGHYLNIKH 165 RK+ +F +S + + +I H Sbjct: 121 RKTKAFFRSFLYLRYALALF-SIHH 144 >UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis RepID=Q1CBA9_YERPA Length = 85 Score = 73.9 bits (180), Expect = 2e-12, Method: Composition-based stats. 
Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 1/61 (1%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 +T F T +L+ LLS F++V W TD + YE L + H+ SK YTQRIER Sbjct: 22 QTYYCSYFWSSEQKTFRQLLGLLSGFNIVFWCTDNFSAYE-MLPDEKHIRSKLYTQRIER 80 Query: 128 H 128 Sbjct: 81 E 81 >UniRef50_C0A223 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A223_9BACT Length = 269 Score = 73.5 bits (179), Expect = 3e-12, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 50/169 (29%), Gaps = 48/169 (28%) Query: 41 AEMDEQWGYVGAKSRQR----------WLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL 90 + DE W +VG K + W + A D K V G R + + M L Sbjct: 63 IQCDEIWSFVGCKEKNVTNNGKRQGDTWTWIACDPDTKLVPCWFIGRRDSESAKKFMRRL 122 Query: 91 SP---FDVVIWMTDGWPLY-------------------ESRLKGKLHVISKRY------- 121 + TDG Y + + G H Sbjct: 123 ARHLSLGSTQITTDGLKAYINAIKEILWIETSYGMVEKKYDVSGDDHRTRYIGSEKTAIF 182 Query: 122 ---------TQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 T +ER NL +R + R RK+ +SK + H I + Sbjct: 183 GNPDPDTMNTSIVERQNLTMRMSMRRFTRKTNGYSKKIANHRYAIALHF 231 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 71.2 bits (173), Expect = 1e-11, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 2/83 (2%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDV--IVCAEMDEQWGYVGAKSRQRWLFYAYDSLRK 68 W L K P+++ + + +D +V E+DE W YVG+K+ +WL+ S + Sbjct: 71 WLLEFIGELTKELPENLNAEVVSENDELEVVVLEVDELWSYVGSKANPQWLWLVMHSKTR 130 Query: 69 TVVAHVFGERTMATLGRLMSLLS 91 VVA G R T +L+ L Sbjct: 131 QVVAMQIGPRNKETAEKLLYKLP 153 >UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFR2_MICAE Length = 122 Score = 69.7 bits (169), Expect = 3e-11, Method: Composition-based stats. 
Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 3/65 (4%) Query: 94 DVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSV 150 + TD W Y++ + K H + K T IER N RQ ++RL R+SLSFSK + Sbjct: 25 QCAVAYTDCWESYKTGIPSKRHRPVGKETGQTNPIERLNNTFRQRISRLVRESLSFSKKM 84 Query: 151 ELHDK 155 E H Sbjct: 85 ENHVG 89 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 69.3 bits (168), Expect = 5e-11, Method: Composition-based stats. Identities = 23/62 (37%), Positives = 31/62 (50%), Gaps = 2/62 (3%) Query: 79 TMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLAR 138 T +L+ L+ F+VV W TD + Y L H K +TQ IER NL +R + R Sbjct: 55 NEKTFRKLLKKLASFNVVFWCTDNFKTYN-LLPKSQHRAGKIFTQHIERENL-MRTRIKR 112 Query: 139 LG 140 L Sbjct: 113 LN 114 >UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SAB2_FERPL Length = 75 Score = 68.9 bits (167), Expect = 6e-11, Method: Composition-based stats. Identities = 25/72 (34%), Positives = 36/72 (50%), Gaps = 3/72 (4%) Query: 96 VIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDK 155 I+ TD W Y + K +I K T +ER L LR R RKS+ FSKS+E+ + Sbjct: 3 AIFYTDRWDAYN-LIPYKQRIIKKGGTNHVERLFLTLRNDNPRFARKSIRFSKSIEMLEN 61 Query: 156 VIGHYLNIKHYQ 167 + + I +Y Sbjct: 62 SLKLW--IHYYN 71 >UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methanosarcina RepID=Q46CV2_METBF Length = 75 Score = 67.0 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 22/54 (40%), Positives = 31/54 (57%) Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEL 152 MTD W Y L +H SK T +E +N L+ LARL RK+ ++KS+E+ Sbjct: 2 MTDHWRAYAEFLPENIHTQSKAETYTVEGYNGILKHFLARLRRKTKCYTKSIEM 55 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 65.8 bits (159), Expect = 5e-10, Method: Composition-based stats. 
Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 1/67 (1%) Query: 27 VTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRL 86 VT + +V E+DE W +VG K +WL+ + V+A G R T L Sbjct: 93 VTCCEKDELEVAKL-EVDELWNFVGNKKNDQWLWLILHKKSRQVLAMQVGPRDKKTAELL 151 Query: 87 MSLLSPF 93 + L Sbjct: 152 FAKLPES 158 >UniRef50_Q32DI9 Iso-IS1 ORF2 n=2 Tax=Shigella RepID=Q32DI9_SHIDS Length = 94 Score = 65.8 bits (159), Expect = 5e-10, Method: Composition-based stats. Identities = 14/47 (29%), Positives = 24/47 (51%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKL 114 +A+ FG RT T L++LL+PF++ + +D W Y + Sbjct: 37 GGGLAYTFGPRTDETCRELLALLTPFNIGMITSDDWGSYGREVPKDK 83 >UniRef50_Q6MCH2 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCH2_PARUW Length = 121 Score = 64.3 bits (155), Expect = 2e-09, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 34/81 (41%), Gaps = 4/81 (4%) Query: 80 MATLGRLMSLLSPF-DVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQH 135 L + L + TD + +Y H +SK T IER N RQ Sbjct: 26 KKPLSFFLQKLPESLKKAFYFTDKFNVYYETNPWSQHQPVSKQSGQTSYIERFNCTRRQR 85 Query: 136 LARLGRKSLSFSKSVELHDKV 156 ARL RK+LSFSK + H + Sbjct: 86 CARLVRKTLSFSKKLTNHIGL 106 >UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=B0CCX7_ACAM1 Length = 196 Score = 64.3 bits (155), Expect = 2e-09, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 45/117 (38%), Gaps = 12/117 (10%) Query: 44 DEQWGYVGAKSR----------QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLS-P 92 DE W V K + W+ + V+ G+ T L+ Sbjct: 28 DELWSSVKKKQKHCEPEELSLGDCWIALSLAKDSGLVLTGRIGKHTDELAQELIENTEGK 87 Query: 93 FDVVIWMTDGWPLYESRLKGKL-HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK 148 W TDGW Y +L ++ H +SK TQR+ER N LRQ R R+ F K Sbjct: 88 TACHHWQTDGWEGYSRQLADEVIHHVSKALTQRLERTNGILRQQTGRWHRRQNKFGK 144 >UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B7X2_EDWI9 Length = 99 Score = 63.5 bits (153), Expect = 2e-09, Method: Composition-based stats. 
Identities = 22/69 (31%), Positives = 37/69 (53%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 L+ F++ + D W + + +TQ ER++L LR + RL RK + FS++ Sbjct: 3 LTAFNIGMITRDDWGNPIREVPWGKPLTGTIFTQHSERNSLMLRTRIKRLARKRIGFSRA 62 Query: 150 VELHDKVIG 158 + LH+KV G Sbjct: 63 IALHEKVTG 71 >UniRef50_D1JFE2 Putative uncharacterized protein n=3 Tax=uncultured archaeon RepID=D1JFE2_9ARCH Length = 217 Score = 63.1 bits (152), Expect = 4e-09, Method: Composition-based stats. Identities = 36/141 (25%), Positives = 55/141 (39%), Gaps = 37/141 (26%) Query: 57 RWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF-------DVVIWMTDGWPLYESR 109 W++ A S K +AH G+R T L++L+ D +++DG Y Sbjct: 24 CWIYTAIKSDTKLHLAHCTGKRVQETANALVALVKNRGKAPDTDDKATFVSDGNNQYTKA 83 Query: 110 L-----------------KGKLHVISK-------------RYTQRIERHNLNLRQHLARL 139 L + V+ K T +ER+NL LR +++L Sbjct: 84 LFENFDVNAINYGQLVKERDNGRVVGKTRTIIFGSLEVDEIETVYVERYNLTLRHGISKL 143 Query: 140 GRKSLSFSKSVELHDKVIGHY 160 RKSL FSK E+ D + Y Sbjct: 144 VRKSLCFSKCKEMLDDHLDLY 164 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 62.7 bits (151), Expect = 4e-09, Method: Composition-based stats. Identities = 24/93 (25%), Positives = 32/93 (34%), Gaps = 9/93 (9%) Query: 35 SDVIVCAEMDEQWGYVGAKSR----QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL 90 + I EMDE Y+G K L + FG R T + L Sbjct: 90 ENEISIVEMDEMHTYIGNKKNIAGSGLLLI-----ELGKFIHCSFGNRGTETGQLIWEKL 144 Query: 91 SPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQ 123 ++ MTD W Y L +H SK+ Q Sbjct: 145 KQKEIGEVMTDHWRAYAEFLPENIHTQSKKRIQ 177 >UniRef50_Q6MBQ1 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBQ1_PARUW Length = 138 Score = 62.0 bits (149), Expect = 7e-09, Method: Composition-based stats. 
Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 1/67 (1%) Query: 27 VTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRL 86 VT + +V E+DE+W +VG K +WL+ + V+A G R T L Sbjct: 65 VTCCEKDELEVARL-EVDERWSFVGNKKNDQWLWLILHKKSRQVLAMQVGPRDKKTAELL 123 Query: 87 MSLLSPF 93 + L Sbjct: 124 FTKLPES 130 >UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryochloris marina MBIC11017 RepID=B0CEC0_ACAM1 Length = 172 Score = 62.0 bits (149), Expect = 7e-09, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 46/111 (41%), Gaps = 5/111 (4%) Query: 57 RWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGKL- 114 W+ + V++ G+ T L+ W TDGW Y +L ++ Sbjct: 21 CWIALSLAKESGLVLSGRIGKHTDELAQELIENTEGKTACHHWQTDGWEGYARQLPDEVV 80 Query: 115 HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSK---SVELHDKVIGHYLN 162 H +SK TQR+ER N +RQ R R+ F K + +++ Y N Sbjct: 81 HEVSKALTQRLERTNGIVRQQTGRWHRRQNKFGKVWQQSAMTLRLVLSYFN 131 >UniRef50_Q10ZU2 Putative uncharacterized protein n=3 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZU2_TRIEI Length = 79 Score = 61.6 bits (148), Expect = 1e-08, Method: Composition-based stats. Identities = 14/45 (31%), Positives = 23/45 (51%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATL 83 + + DE W +VG K+ ++WL+ A D + +V GER Sbjct: 34 LTIQCDEMWSFVGNKNNKQWLWLAIDIETQEIVGFYLGERGEKGA 78 >UniRef50_UPI00016C465A IS1 transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C465A Length = 88 Score = 61.2 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 22/68 (32%), Positives = 31/68 (45%), Gaps = 3/68 (4%) Query: 96 VIWMTDGWPLYESRLKGKLHVISKR---YTQRIERHNLNLRQHLARLGRKSLSFSKSVEL 152 V TD P + + H ++ T IER L LRQ AR RK+L+FSK Sbjct: 12 VTVYTDLLPACRAAIPRARHRAVRKVTGLTAHIERFWLTLRQRCARFVRKTLTFSKCPRN 71 Query: 153 HDKVIGHY 160 H + ++ Sbjct: 72 HLGALWYF 79 >UniRef50_B0URB1 Putative uncharacterized protein n=1 Tax=Methylobacterium sp. 
4-46 RepID=B0URB1_METS4 Length = 82 Score = 61.2 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 31/67 (46%) Query: 98 WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVI 157 + TD + Y + L H + K TQ +E +N R AR R++ SKSVE+ + + Sbjct: 4 FCTDNYAPYAAALPAGRHHVGKDQTQLVESNNARQRHWFARFRRRTCVVSKSVEMVEATM 63 Query: 158 GHYLNIK 164 + Sbjct: 64 ALFAFYH 70 >UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 RepID=B2TXL7_SHIB3 Length = 44 Score = 60.0 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 33/34 (97%), Positives = 33/34 (97%) Query: 134 QHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 HLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ Sbjct: 11 THLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 44 >UniRef50_A8GX98 Transposase and inactivated derivative n=2 Tax=Rickettsia bellii RepID=A8GX98_RICB8 Length = 99 Score = 59.6 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 42/82 (51%), Gaps = 1/82 (1%) Query: 77 ERTMATLGRLMSLLSP-FDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQH 135 R +++ + L +++ I +D + +Y + K H +K+ T +E N +R + Sbjct: 4 GRDISSYLPMALRLEENYEIDISCSDHYDVYGAYKIAKRHYFTKKETALVESFNSLIRNY 63 Query: 136 LARLGRKSLSFSKSVELHDKVI 157 LAR RK+ +SK++++ I Sbjct: 64 LARFNRKTKRYSKAIDMIYNSI 85 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 59.6 bits (143), Expect = 4e-08, Method: Composition-based stats. Identities = 14/72 (19%), Positives = 26/72 (36%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWM 99 E+ E +V K + L+ R+ ++ V G + T L + + + Sbjct: 106 VGELHELETFVSDKKNKVLLWTLVYHFRQGILGWVVGNHSGDTFQPLWQAIGFWKCYFQV 165 Query: 100 TDGWPLYESRLK 111 TDG P+ Sbjct: 166 TDGNPVASRLYP 177 >UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pestis RepID=C4GXL2_YERPN Length = 111 Score = 58.9 bits (141), Expect = 6e-08, Method: Composition-based stats. 
Identities = 16/44 (36%), Positives = 25/44 (56%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSL 89 W +VG K +QRWL+YA++ K ++AH+FG R+ Sbjct: 1 MWSFVGNKKQQRWLWYAWEPRLKRIIAHIFGRRSKRHFANYWGC 44 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 58.5 bits (140), Expect = 8e-08, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 32/63 (50%), Gaps = 1/63 (1%) Query: 36 DVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDV 95 ++ A++DE +VG+K W++ ++ ++ V G+R++ T L ++ + Sbjct: 66 EIPEIAQIDELQTFVGSKKT-IWVWTVVNTKLPGILKFVIGDRSLLTFTTLWQMIQGWAC 124 Query: 96 VIW 98 ++ Sbjct: 125 FLY 127 >UniRef50_A9GLN9 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLN9_SORC5 Length = 405 Score = 57.3 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 38/178 (21%), Positives = 59/178 (33%), Gaps = 54/178 (30%) Query: 39 VCAEMDEQWGYVGAKSRQR-----------WLFYAYDSLRKTVVAHVFGERTMATLGRLM 87 + DE + YVG K + + F A D+ + V+A G+R M T G + Sbjct: 96 ELIQADEVFSYVGKKQARVTEKDAPGIGETYSFTALDTASRLVIAWRVGKRDMETCGPFI 155 Query: 88 SLLSPFDVVI--WMTDGWPLYESRLKGK-------------------------------- 113 + L +V+ TDG+ Y + + Sbjct: 156 ADLRSRLLVMPQITTDGFAPYIATVAEHFGLSVDYMQTVKNYRTGSYRGPDHRYEPPRDP 215 Query: 114 ---LHVI------SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLN 162 H I K T +ER N R L R+ R +FSK+ E H + + Sbjct: 216 FITKHTIYGAPDAKKASTSYVERLNGTTRHLLGRMRRLCYAFSKAPEHHRAAVALHYT 273 >UniRef50_A9GLP8 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLP8_SORC5 Length = 337 Score = 57.0 bits (136), Expect = 2e-07, Method: Composition-based stats. 
Identities = 36/176 (20%), Positives = 56/176 (31%), Gaps = 58/176 (32%) Query: 40 CAEMDEQWGYVGAKSRQR-----------WLFYAYDSLRKTVVAHVFGERTMAT----LG 84 +MDE W +V K + + + A D+ K ++ G+R + Sbjct: 5 VIQMDEMWSFVQKKQARVTAEDPAEHGDAYFYVALDANTKLAISFHVGKRDGENTEAFIK 64 Query: 85 RLMSLLSPFDVVIWMTDGWPLY------------------------ESRLKGKLH----- 115 L S L+ V +DGW Y R + Sbjct: 65 DLRSRLTV--VPHITSDGWQPYIEAMATSFRGSADYAQCVKNYRGGPQRSPDHRYEPPRN 122 Query: 116 -VISKR-----------YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGH 159 ++K T +ER NL R + R R L+FSK++ H IG Sbjct: 123 PFVTKTPIFGAPKDELLSTSFVERFNLQTRHTVGRTRRLCLAFSKTLRGHRAAIGL 178 >UniRef50_Q2FSQ2 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSQ2_METHJ Length = 201 Score = 56.6 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 49/147 (33%), Gaps = 37/147 (25%) Query: 57 RWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL-------SPFDVVIWMTDGWPLYESR 109 W + + +A G+R + T ++ +P + + TDG Y Sbjct: 22 CWSYTCFKRDSGLFLAFESGKRNIDTCADMLVRFFNRMELPTPENKISIFTDGNVQYSIC 81 Query: 110 LK--------GKLHVISKRYTQR----------------------IERHNLNLRQHLARL 139 L VI + + IE +N +RQ L+R Sbjct: 82 LPELYCEPCLDYGQVIKVKEKNKLVYVIREKIMGNPDSKAISTSVIEGYNNKIRQRLSRF 141 Query: 140 GRKSLSFSKSVELHDKVIGHYLNIKHY 166 GRK+ SFSK + + + + ++ Sbjct: 142 GRKTASFSKKLNRFISALNIFQFVHNF 168 >UniRef50_A9FJP3 Putative uncharacterized protein n=5 Tax=Proteobacteria RepID=A9FJP3_SORC5 Length = 349 Score = 55.8 bits (133), Expect = 6e-07, Method: Composition-based stats. 
Identities = 31/176 (17%), Positives = 51/176 (28%), Gaps = 54/176 (30%) Query: 40 CAEMDEQWGYVGAKSRQR-----------WLFYAYDSLRKTVVAHVFGERTMATLGRLMS 88 A+ DE W YV K + + F S K ++++ G+R + Sbjct: 62 VAQCDEIWSYVQKKQSRVTASDPAEYGDAYTFVGMASASKLIISYRVGKRDEENTRAFVK 121 Query: 89 LLSP--FDVVIWMTDGWPLY------------------------------ESRLKGKLHV 116 L + TDGW Y + Sbjct: 122 DLRARLTTIPQLYTDGWQPYIGAVGASFTGGVDYCQVVKNYSRRPRRDDEVRYEPPRDPF 181 Query: 117 ISKR-----------YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 I+K T +ER N +R H+ R R FS+ + H + ++ Sbjct: 182 ITKTPIFGIPDVEHASTSHVERQNWTIRMHIRRFTRLCNGFSRKLANHRAAVALHV 237 >UniRef50_Q972H6 Putative uncharacterized protein ST1154 n=1 Tax=Sulfolobus tokodaii RepID=Q972H6_SULTO Length = 152 Score = 54.3 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 22/112 (19%), Positives = 43/112 (38%), Gaps = 9/112 (8%) Query: 44 DEQWGYVGAKSRQRWLFYAYDS---LRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 DE W Y+ +R + + + + G+R T + L D W++ Sbjct: 27 DEMWTYLYRNTRAFYKWVFNCHVYTRLGLYIIYSVGDRDENTFREVKMYLP--DDGRWVS 84 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEL 152 D + +Y V+S E + +LR L R R + + ++S+ + Sbjct: 85 DDYNVY--FWLKNHTVVS--LVNPNESFHSSLRDRLVRFKRATKAVNRSINM 132 >UniRef50_Q7NJH9 Gsl1853 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJH9_GLOVI Length = 71 Score = 54.3 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 22/56 (39%), Positives = 27/56 (48%), Gaps = 3/56 (5%) Query: 103 WPLYESRLKGKLH-VISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDK 155 Y L K H K T IER N +RQ + RL RK+LSFSK + H+ Sbjct: 2 LKNYGQVLASKRHRAAGKATGTTSCIERFNNTVRQRVGRLVRKALSFSKCLSNHNA 57 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 53.9 bits (128), Expect = 2e-06, Method: Composition-based stats. 
Identities = 23/111 (20%), Positives = 44/111 (39%), Gaps = 9/111 (8%) Query: 44 DEQWGYVGAKSRQRWLFYAYD---SLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 DE W Y+ +R + + + + + G+R +T + L D W++ Sbjct: 119 DEMWTYLYKNARAFYKWVFTCYVYTKLGVYLIYSVGDRDESTFLEVKKYLP--DEGRWVS 176 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 D + LY V+S E + +LR L R R + + ++S+ Sbjct: 177 DDYNLY--FWLKDHTVVSPVNPN--ESFHSSLRDRLIRFKRATKAINRSIR 223 >UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8_SHIDS Length = 94 Score = 53.1 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 4/48 (8%) Query: 112 GKLHVISKR----YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDK 155 V K + +ER+NL LR + RL RK++ FS+SVE+H+K Sbjct: 18 KDKQVTRKGIFIQHMLYLERNNLPLRTRIKRLARKTICFSRSVEIHEK 65 >UniRef50_Q648U8 Putative uncharacterized protein n=6 Tax=environmental samples RepID=Q648U8_9ARCH Length = 173 Score = 52.7 bits (125), Expect = 5e-06, Method: Composition-based stats. Identities = 19/39 (48%), Positives = 24/39 (61%) Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 T IER+NL LR ++RL RKSL FSK + D + Y Sbjct: 81 TVYIERYNLTLRHGISRLVRKSLCFSKCKGMLDNHLDVY 119 >UniRef50_Q64CQ0 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos1D1 RepID=Q64CQ0_9ARCH Length = 168 Score = 51.9 bits (123), Expect = 9e-06, Method: Composition-based stats. 
Identities = 26/127 (20%), Positives = 44/127 (34%), Gaps = 37/127 (29%) Query: 71 VAHVFGERTMATLGRLMSLL-------SPFDVVIWMTDGWPLYESRLK------------ 111 +A G++T + GR+M + SP + TDG Y L Sbjct: 1 MAFSVGKQTQESCGRMMKKVFGRTEQPSPQTKMEMFTDGNDDYTYVLPDYCADACIEYGQ 60 Query: 112 ------------GKLHVI------SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELH 153 + +I T +E +N LR+ + RL RK+ FSK + Sbjct: 61 LVKIRENGRVVRKEKRIIYGNPDLGDIETTDVENYNGILRERIGRLVRKTKCFSKRKRML 120 Query: 154 DKVIGHY 160 + + + Sbjct: 121 ECSLQVF 127 >UniRef50_UPI00018554DD transposase n=1 Tax=Francisella novicida FTG RepID=UPI00018554DD Length = 97 Score = 50.8 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 12/36 (33%), Positives = 19/36 (52%) Query: 35 SDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTV 70 D I E DE W ++G+K ++ W+ AYD + Sbjct: 40 EDNISEIEFDEMWHFIGSKKKKCWIIKAYDRRVGKL 75 >UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZKR3_ACAM1 Length = 241 Score = 50.8 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 4/78 (5%) Query: 38 IVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLM----SLLSPF 93 I EMDE+ GYV K +Q W A D+ K ++ G R + RLM L+ Sbjct: 122 IDVLEMDERHGYVAIKQQQCWDAVAIDAASKFIIQVEVGPRNTNLIDRLMRATHKRLAHP 181 Query: 94 DVVIWMTDGWPLYESRLK 111 ++ MTDG Y + Sbjct: 182 RDLVLMTDGDASYRTLFP 199 >UniRef50_B0CAP5 Putative uncharacterized protein n=3 Tax=Acaryochloris marina MBIC11017 RepID=B0CAP5_ACAM1 Length = 144 Score = 49.2 bits (116), Expect = 5e-05, Method: Composition-based stats. 
Identities = 25/108 (23%), Positives = 43/108 (39%), Gaps = 2/108 (1%) Query: 56 QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGKL 114 + W+ + V++ G+ T L+ W TDGW + ++ Sbjct: 5 ECWIALSLAKDSSLVLSGRIGKHTDELAQDLIENTEGKTTCHHWQTDGWEGSSRQPPDEV 64 Query: 115 -HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 H +SK TQR++R N LRQ R ++ F K + H + + Sbjct: 65 IHHVSKVLTQRLKRTNGILRQQTGRWHQRQNKFGKVWQQHAVTLTLFY 112 >UniRef50_Q8PRQ0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PRQ0_METMA Length = 129 Score = 48.9 bits (115), Expect = 7e-05, Method: Composition-based stats. Identities = 18/43 (41%), Positives = 24/43 (55%) Query: 118 SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 S T ER NL +R LAR RK ++FSK+ +H K I + Sbjct: 34 SYIGTSYAERINLTIRTSLARFIRKGMNFSKTKRMHQKAIDLF 76 >UniRef50_Q649W7 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos34A6 RepID=Q649W7_9ARCH Length = 217 Score = 47.7 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 3/45 (6%) Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVE---LHDKVIGHYLNI 163 T IER+NL +R ++RL RK+++FSK + +H + + N+ Sbjct: 127 TSYIERNNLTVRNGVSRLIRKTINFSKRLNPLVMHLCLFFAWFNL 171 >UniRef50_Q6MCX8 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX8_PARUW Length = 72 Score = 47.3 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 18/59 (30%), Positives = 27/59 (45%), Gaps = 3/59 (5%) Query: 106 YESRLKGKLHV-ISKR--YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 Y + H + K+ T IER N L +R K+LSFSK + H +I ++ Sbjct: 2 YFESIPFGQHRPVGKQSDKTSYIERLNCTLGYRCSRFVGKTLSFSKKLINHIGMITSFI 60 >UniRef50_Q10ZQ2 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=Q10ZQ2_TRIEI Length = 44 Score = 46.2 bits (108), Expect = 4e-04, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 2/40 (5%) Query: 128 HNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 N LRQ ++RL RK+LSFSK + H I ++ I HY Sbjct: 1 MNNTLRQRISRLVRKTLSFSKKLRSHLGDIWYF--INHYN 38 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 46.2 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 10/61 (16%), Positives = 26/61 (42%), Gaps = 2/61 (3%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTV 70 W Q+ + P+ + + + E DE W +V +K+ + +++ D + + Sbjct: 107 WLQNYVNNKLASVPRQIKVSDK--LKGKLVIECDEMWSFVFSKTIKVYIWRLIDRNTREI 164 Query: 71 V 71 + Sbjct: 165 I 165 >UniRef50_Q6MD18 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MD18_PARUW Length = 89 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 14/32 (43%), Positives = 21/32 (65%) Query: 130 LNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 L LR AR RK+LSFSK + H ++I +++ Sbjct: 46 LLLRHRYARFVRKTLSFSKKLTNHIELIKYFI 77 >UniRef50_A4AD66 Transposase n=19 Tax=unclassified Gammaproteobacteria RepID=A4AD66_9GAMM Length = 227 Score = 44.6 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 49/133 (36%), Gaps = 13/133 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF---DVVIWM 99 +DE + + K Q++L+ A D + V ++ +R A R L + + Sbjct: 75 IDEVFVTINGK--QQYLWRAVDQDGEVVDVYLQTKRDGAAAKRFFKRLLRSHGGEPRKIV 132 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEL--- 152 TD Y + +H+ + R E+ + R R R+ S +++ Sbjct: 133 TDKLRSYGVAHRELIPETVHITEQYENNRAEQSHETTRAR-ERGMRRFKSVAQAQRFVAA 191 Query: 153 HDKVIGHYLNIKH 165 H V + +H Sbjct: 192 HAAVFNLFNLGRH 204 >UniRef50_C7DAC3 Transposase n=36 Tax=Rhodobacterales RepID=C7DAC3_9RHOB Length = 237 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 26/126 (20%), Positives = 48/126 (38%), Gaps = 10/126 (7%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMT 100 +DE + V K +L+ A D + + A V R A + + L +T Sbjct: 85 VDEVFVKVNGKRH--YLWRAVDHEGEVLEAVVTKRRNKAAALKFLKKLMKRHGKAEEVVT 142 Query: 101 DGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVELHD 154 D + Y++ L+ + + R+E +L R+ + R+ S K +H Sbjct: 143 DRFAPYKAALRDLGALEKQSTGRWLNNRVENSHLPFRRRERAMQRFRRMRSLQKFAAVHS 202 Query: 155 KVIGHY 160 V H+ Sbjct: 203 SVYNHF 208 >UniRef50_B9K3D6 Transposase n=32 Tax=Bacteria RepID=B9K3D6_AGRVS Length = 243 Score = 43.5 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 46/132 (34%), Gaps = 11/132 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFD---VVIWM 99 +DE +G K + WL+ A D + V R RLM L + + Sbjct: 87 LDEVVISIGGK--KHWLWRAVDQDGFVLDVLVQSRRNAKAAKRLMRKLLKGQGRSPRVMI 144 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHN--LNLRQHLARLGRKSLSFSKSVELH 153 TD Y + H K R E + + R+ + + + + + V +H Sbjct: 145 TDKLRSYGAAKREIMPAVEHRSHKGLNNRAENSHQPIRRRERIMKRFKSARHLQRFVSIH 204 Query: 154 DKVIGHYLNIKH 165 D + + +H Sbjct: 205 DPIANLFQIPRH 216 >UniRef50_Q218S2 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisB18 RepID=Q218S2_RHOPB Length = 191 Score = 43.5 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 2/46 (4%) Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 T +ER NL+LR R R + FSK ++ H + Y + HY Sbjct: 56 TSYVERQNLSLRMGSRRFTRLTNGFSKKLDNHVAAVALY--VAHYN 99 >UniRef50_Q0W4E9 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4E9_UNCMA Length = 160 Score = 43.1 bits (100), Expect = 0.004, Method: Composition-based stats. 
Identities = 25/113 (22%), Positives = 36/113 (31%), Gaps = 31/113 (27%) Query: 71 VAHVFGERTMATLGRLMSLLSPF---DVVIWMTDGWPLYESRLKGKLH------------ 115 + G T T ++S +S V +DG Y L Sbjct: 1 MGFSVGRWTQGTCRVMLSQVSNSVQDGVFTVYSDGNDDYYYTLTDFFQEVRYGQLVKIRE 60 Query: 116 ---VISKR-------------YTQRIERHNLNLRQHLARLGRKSLSFSKSVEL 152 V+ K T +E N LR + RL RK+ +FSK E+ Sbjct: 61 KGRVVGKEIRVLIGDVDSEQVETFNVENFNSILRGRVGRLVRKTKTFSKIPEM 113 >UniRef50_Q0SUU8 ISCpe7, transposase n=22 Tax=Clostridium RepID=Q0SUU8_CLOPS Length = 340 Score = 42.3 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 47/136 (34%), Gaps = 20/136 (14%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAH-VFGERTMATLGRLMSLLSPF-DVVIWMT 100 +DE ++ K + +L+ A DS + V+A + R LM+ ++T Sbjct: 181 VDETVVFISGK--KYYLWLAIDSETRFVLAFHLTQARDSDAAFILMNQAKSMGKPNNFIT 238 Query: 101 DGWPLY----ESRLKGKLHV-----ISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVE 151 D P Y ++ L H+ S IE N + +K + + Sbjct: 239 DRLPSYNEAVKTVLNESTHIPVPPMSSDTNNNLIESFNKTFKAWYK--AKKGFNSFEKAN 296 Query: 152 LHDKVIGHYLNIKHYQ 167 Y+ I HY Sbjct: 297 N-----LIYMFIFHYN 307 >UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C135_9GAMM Length = 372 Score = 41.5 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 20/50 (40%), Positives = 24/50 (48%) Query: 116 VISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKH 165 V S+ T IER NL RQ RL R+S FSK + D + L H Sbjct: 245 VRSEINTSFIERDNLTQRQSNRRLTRRSNGFSKELSWFDSPLWLSLAYYH 294 Score = 39.2 bits (90), Expect = 0.050, Method: Composition-based stats. 
Identities = 15/100 (15%), Positives = 35/100 (35%), Gaps = 19/100 (19%) Query: 38 IVCAEMDEQWGYVGAKSRQR-------------WLFYAYDSLRKTVVAHVFGERTMATLG 84 + ++DE W ++ W++ A+ + + V+A V G Sbjct: 88 VTSLQLDELWSFILTLEHNCTEAKLYHESYGDAWVWLAFAPVWRVVLAFVIGSLPQKNAN 147 Query: 85 RLMSLLSPFD---VVIWMTDGWPLYESRLKGKLHVISKRY 121 L+ ++ + + +D + + L LH + Y Sbjct: 148 LLLDRVAHVTDAHIPFFTSDQFSSSRTAL---LHTYGQWY 184 >UniRef50_Q469A1 Putative uncharacterized protein n=1 Tax=Methanosarcina barkeri str. Fusaro RepID=Q469A1_METBF Length = 180 Score = 41.5 bits (96), Expect = 0.010, Method: Composition-based stats. Identities = 16/84 (19%), Positives = 31/84 (36%), Gaps = 11/84 (13%) Query: 41 AEMDEQWGYVGA--------KSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSP 92 EMDE W + + W++ A+ + ++ V G R +L+ + Sbjct: 81 IEMDELWIIIKKIVSRMKDYEDDGPWMWVAFVPGCQLILGFVIGPRKQYVTDKLVESVKK 140 Query: 93 F---DVVIWMTDGWPLYESRLKGK 113 + +++TDG Y L Sbjct: 141 HLSDKIPLFVTDGLNFYREALLKH 164 >UniRef50_B9K4Q6 Transposase n=2 Tax=Alphaproteobacteria RepID=B9K4Q6_AGRVS Length = 232 Score = 40.8 bits (94), Expect = 0.017, Method: Composition-based stats. Identities = 23/95 (24%), Positives = 34/95 (35%), Gaps = 9/95 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVIWM 99 +DE V K R+ WL+ A D+ + A + R +LM L + + Sbjct: 85 LDEM--VVTFKGRKYWLWRAVDAEGYMLEALLQSRRNKKAALKLMRKLLKGQGLTPRVMV 142 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNL 130 TD Y + G H K R E +L Sbjct: 143 TDKLRSYDAAKRDIMPGVEHRSHKGLNNRAENSHL 177 >UniRef50_UPI00016C5273 transposase, unclassified family protein n=18 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5273 Length = 481 Score = 40.4 bits (93), Expect = 0.023, Method: Composition-based stats. 
Identities = 20/95 (21%), Positives = 37/95 (38%), Gaps = 16/95 (16%) Query: 41 AEMDEQ-WGYVGAKSRQRWLFYAY-DSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW 98 A +DE W + WL+ A D L V+ R+ +++ L P + Sbjct: 223 ANVDETGW---REGRSRAWLWVAVADRLTGFVI-----RRSR--ARKVLGELIPGTPGVL 272 Query: 99 MTDGWPLYESRLKGKLHV----ISKRYTQRIERHN 129 TD + +Y+ + V + + + I+R N Sbjct: 273 TTDRYSVYDHLSPDRRQVCWAHLRRDFQAMIDRGN 307 >UniRef50_B9K5F7 Transposase n=3 Tax=Bacteria RepID=B9K5F7_AGRVS Length = 196 Score = 40.0 bits (92), Expect = 0.028, Method: Composition-based stats. Identities = 24/127 (18%), Positives = 42/127 (33%), Gaps = 11/127 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFD---VVIWM 99 +DE + K + WL+ A D + V R LM L + + Sbjct: 43 LDEAVVSIRGK--KHWLWRAVDQDGFVLDVLVQSRRNAKAARHLMRQLLKGQGRAPRVMI 100 Query: 100 TDGWPLYE----SRLKGKLHVISKRYTQRIERHN--LNLRQHLARLGRKSLSFSKSVELH 153 TD Y G H K + R E + + R+ + + + + V +H Sbjct: 101 TDKLRSYGAAKWELTPGVEHRSHKGLSNRAENFHQPVRRRERIMKRFKSQRHLQRFVSIH 160 Query: 154 DKVIGHY 160 D + + Sbjct: 161 DPIANLF 167 >UniRef50_A0LBE3 Putative uncharacterized protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0LBE3_MAGSM Length = 116 Score = 40.0 bits (92), Expect = 0.030, Method: Composition-based stats. Identities = 15/36 (41%), Positives = 22/36 (61%) Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVI 157 T +ER+N R A GRK+L FSK ++H+ V+ Sbjct: 25 TAFVERNNATDRHQNAHKGRKTLCFSKGWDVHNAVM 60 >UniRef50_Q11MN9 Transposase n=37 Tax=Bacteria RepID=Q11MN9_MESSB Length = 237 Score = 40.0 bits (92), Expect = 0.031, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 35/95 (36%), Gaps = 9/95 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVIWM 99 +DE V K ++ WL+ A D+ + A + R A RLM L + + Sbjct: 81 LDEM--VVTIKGKKYWLWRAVDTNGYVLDALLQSRRNKAAAMRLMRKLLKDQGTAPRVMV 138 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNL 130 TD Y + G H K R E +L Sbjct: 139 TDKLRSYSAAKSQLMPGVEHRSHKGLNNRAENSHL 173 >UniRef50_A7C324 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. 
PS RepID=A7C324_9GAMM Length = 137 Score = 39.6 bits (91), Expect = 0.034, Method: Composition-based stats. Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 2/46 (4%) Query: 122 TQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 T IER NL LRQH++ L RK+L + K ++ ++N+ +Y Sbjct: 41 TSFIERFNLTLRQHVSYLTRKTLGYCKKKANFKYIL--WINLYNYN 84 >UniRef50_A9EF44 Transposase n=2 Tax=Rhodobacteraceae RepID=A9EF44_9RHOB Length = 156 Score = 39.6 bits (91), Expect = 0.040, Method: Composition-based stats. Identities = 20/99 (20%), Positives = 34/99 (34%), Gaps = 8/99 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMT 100 MDE + K + WL+ A D+ + V R + R + L + + +T Sbjct: 1 MDEVVITIRGK--KHWLWRAIDADGDVLDILVQTRRNAKSAKRFLQRLVSQFGEPRVVIT 58 Query: 101 DGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQH 135 D Y ++ H K IE + R+ Sbjct: 59 DKLRSYLKPVKTLTPNADHRAHKGLNNAIEVSHRPTRKR 97 >UniRef50_Q2G895 Transposase n=36 Tax=Alphaproteobacteria RepID=Q2G895_NOVAD Length = 238 Score = 39.2 bits (90), Expect = 0.049, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 50/139 (35%), Gaps = 16/139 (11%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTM-ATLGRLMSLLSPFDVVI-WMT 100 +DE +V + +L+ A D + + ++V R A L L L +T Sbjct: 87 LDEV--FVKINGERHYLWRAVDHEGEVLESYVTKTRDKAAALTFLKKALKRHGRAEAIVT 144 Query: 101 DGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGR----KSL----SFSK 148 DG Y + + R+E +L R+ + R K+L S Sbjct: 145 DGLRSYPAAMRQLGNLDRRKMGRWLNNRVENSHLPFRRRERAMLRFRQMKTLQKFASVHG 204 Query: 149 SVELHDKVIGHYLNIKHYQ 167 S+ H H ++ K Y+ Sbjct: 205 SLHNHFSQDRHLIDRKTYK 223 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 38.8 bits (89), Expect = 0.059, Method: Composition-based stats. 
Identities = 20/99 (20%), Positives = 37/99 (37%), Gaps = 13/99 (13%) Query: 52 AKSRQRWLFYAYDSLRKTVVAHVF--GERTMATLGRLMSLLSPF--DVVIWMTDGWPLYE 107 K WL+ A D + ++ G RT+ ++ + +TD Y Sbjct: 197 NKGHGNWLWSAIDPRTRYLLCTRIAEGSRTLPDAESVIREARKMSEEPDYMITDSLRSYA 256 Query: 108 SR----LKGKLHVISK----RYTQ-RIERHNLNLRQHLA 137 + L H+ +K +T IER++ +R+ L Sbjct: 257 TAAAKCLPRTAHIKTKAIRDGFTNMAIERYHNEIREKLK 295 >UniRef50_A5N1B9 Transposase n=2 Tax=Clostridium kluyveri RepID=A5N1B9_CLOK5 Length = 127 Score = 38.8 bits (89), Expect = 0.065, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 36/99 (36%), Gaps = 13/99 (13%) Query: 48 GYVGAKSRQRWLFYAYDSLRKTVVAHVFGE-RTMATLGRLM---SLLSPFDVVIWMTDGW 103 YV K +L+ DS + +++ V R +L S+L+ +TD W Sbjct: 28 TYVKIKGIDYYLWLILDSKTRVIISFVLSRFRNSTQAYKLFFYSSILTRTSPKKIVTDKW 87 Query: 104 PLYESRLKG-KLHVISKRYT--------QRIERHNLNLR 133 Y +K H + +Y+ IE N + Sbjct: 88 DAYNEAIKNLHCHTLHHKYSAFSEDLNNNFIESFNKTFK 126 >UniRef50_C6IUV9 Transposase n=4 Tax=Bacteroides RepID=C6IUV9_9BACE Length = 571 Score = 38.5 bits (88), Expect = 0.082, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 37/112 (33%), Gaps = 14/112 (12%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQW--GYVGAKSRQRWLFYAYDSLRK 68 W L KL P +Q G++V V DE W + K R+ +++ + + Sbjct: 273 WADKGAMQLNKLIPALKKIALQDGANVNV----DETWLRYHAYNKKRKTYMWCLVNRKAR 328 Query: 69 TVVAHVFGERTMATLGR--------LMSLLSPFDVVIWMTDGWPLYESRLKG 112 V+ + + L L + +DG+ +Y Sbjct: 329 IVIFFYEDTTDDEGVQKHGGRNRNVLKEFLGDAKIKSLQSDGYNVYMYLDNE 380 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=r... 208 6e-53 UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae Rep... 181 7e-45 UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Ta... 166 2e-40 UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyan... 
161 8e-39 UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candida... 152 5e-36 UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis... 150 1e-35 UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J... 143 2e-33 UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202... 139 4e-32 UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO 139 4e-32 UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaen... 136 3e-31 UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepI... 135 5e-31 UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1... 132 3e-30 UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsonia... 131 1e-29 UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ru... 129 5e-29 UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldoc... 128 6e-29 UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum A... 128 7e-29 UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5W... 120 2e-26 UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C... 114 8e-25 UniRef50_O67144 Putative uncharacterized protein n=1 Tax=Aquifex... 113 2e-24 UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 112 3e-24 UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis R... 109 5e-23 UniRef50_C0A223 Putative uncharacterized protein n=1 Tax=Opituta... 104 1e-21 UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanoba... 100 3e-20 UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methano... 99 6e-20 UniRef50_A9FJP3 Putative uncharacterized protein n=5 Tax=Proteob... 99 7e-20 UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepI... 98 1e-19 UniRef50_A9GLN9 Putative uncharacterized protein n=2 Tax=Sorangi... 98 1e-19 UniRef50_Q9CJQ7 Putative uncharacterized protein n=2 Tax=Pasteur... 97 2e-19 UniRef50_A9GLP8 Putative uncharacterized protein n=1 Tax=Sorangi... 
96 5e-19 UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ... 95 7e-19 UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN 94 2e-18 UniRef50_B0CAP5 Putative uncharacterized protein n=3 Tax=Acaryoc... 92 8e-18 UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryoc... 91 2e-17 UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candida... 89 4e-17 UniRef50_Q972H6 Putative uncharacterized protein ST1154 n=1 Tax=... 88 1e-16 UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus Re... 86 3e-16 UniRef50_D1JFE2 Putative uncharacterized protein n=3 Tax=uncultu... 85 9e-16 UniRef50_Q2FSQ2 Putative uncharacterized protein n=1 Tax=Methano... 83 3e-15 UniRef50_Q10VW0 ISSru3, transposase InsB n=1 Tax=Trichodesmium e... 81 1e-14 UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methan... 81 1e-14 UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachl... 81 1e-14 UniRef50_A4AD66 Transposase n=19 Tax=unclassified Gammaproteobac... 79 4e-14 UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID... 79 6e-14 UniRef50_Q6MBQ1 Putative uncharacterized protein n=2 Tax=Candida... 77 2e-13 UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium ... 76 4e-13 UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus p... 74 2e-12 UniRef50_Q64CQ0 Putative uncharacterized protein n=1 Tax=uncultu... 74 2e-12 UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methan... 72 5e-12 UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC... 72 9e-12 UniRef50_Q6MCH2 Putative uncharacterized protein n=1 Tax=Candida... 71 2e-11 UniRef50_A8GX98 Transposase and inactivated derivative n=2 Tax=R... 70 3e-11 UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis ae... 69 4e-11 UniRef50_UPI00016C465A IS1 transposase n=1 Tax=Gemmata obscurigl... 69 6e-11 UniRef50_B0URB1 Putative uncharacterized protein n=1 Tax=Methylo... 67 2e-10 UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryoc... 
67 2e-10 UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthros... 67 3e-10 UniRef50_Q10ZU2 Putative uncharacterized protein n=3 Tax=Trichod... 65 9e-10 UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis Rep... 65 1e-09 UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-14... 64 1e-09 UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmone... 62 5e-09 UniRef50_Q648U8 Putative uncharacterized protein n=6 Tax=environ... 61 2e-08 UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus lum... 59 6e-08 UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pest... 58 8e-08 UniRef50_Q32DI9 Iso-IS1 ORF2 n=2 Tax=Shigella RepID=Q32DI9_SHIDS 58 1e-07 UniRef50_Q6MCX8 Putative uncharacterized protein n=2 Tax=Candida... 57 2e-07 UniRef50_Q8PRQ0 Putative uncharacterized protein n=1 Tax=Methano... 56 3e-07 UniRef50_Q7NJH9 Gsl1853 protein n=1 Tax=Gloeobacter violaceus Re... 55 1e-06 UniRef50_Q649W7 Putative uncharacterized protein n=1 Tax=uncultu... 53 3e-06 UniRef50_UPI00018554DD transposase n=1 Tax=Francisella novicida ... 51 1e-05 UniRef50_Q10ZQ2 Putative uncharacterized protein n=7 Tax=Cyanoba... 49 8e-05 UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8... 48 1e-04 UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 Rep... 47 3e-04 UniRef50_Q6MD18 Putative uncharacterized protein n=2 Tax=Candida... 46 4e-04 Sequences not found previously or not previously below threshold: UniRef50_A9FZD9 Putative uncharacterized protein n=1 Tax=Sorangi... 62 8e-09 UniRef50_Q0W4E9 Putative uncharacterized protein n=1 Tax=uncultu... 58 1e-07 UniRef50_Q218S2 Putative uncharacterized protein n=1 Tax=Rhodops... 54 2e-06 UniRef50_B9K3D6 Transposase n=32 Tax=Bacteria RepID=B9K3D6_AGRVS 54 2e-06 UniRef50_C7DAC3 Transposase n=36 Tax=Rhodobacterales RepID=C7DAC... 
54 2e-06 UniRef50_Q0RZ53 Transposase n=23 Tax=Bacteria RepID=Q0RZ53_RHOSR 53 4e-06 UniRef50_Q11MN9 Transposase n=37 Tax=Bacteria RepID=Q11MN9_MESSB 51 1e-05 UniRef50_B9K4Q6 Transposase n=2 Tax=Alphaproteobacteria RepID=B9... 51 1e-05 UniRef50_A8LAQ0 Integrase catalytic region n=1 Tax=Frankia sp. E... 51 2e-05 UniRef50_A9EF44 Transposase n=2 Tax=Rhodobacteraceae RepID=A9EF4... 51 2e-05 UniRef50_Q8TRX5 Predicted protein n=3 Tax=Methanosarcina acetivo... 49 4e-05 UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiat... 49 4e-05 UniRef50_Q0RWC6 Transposase n=24 Tax=Bacteria RepID=Q0RWC6_RHOSR 47 1e-04 UniRef50_B9K5F7 Transposase n=3 Tax=Bacteria RepID=B9K5F7_AGRVS 47 2e-04 UniRef50_A3XA77 Putative transposase n=1 Tax=Roseobacter sp. MED... 47 3e-04 UniRef50_Q2G895 Transposase n=36 Tax=Alphaproteobacteria RepID=Q... 47 3e-04 UniRef50_A3W3Q5 Transposase n=1 Tax=Roseovarius sp. 217 RepID=A3... 47 3e-04 UniRef50_B4WU12 Putative uncharacterized protein n=1 Tax=Synecho... 47 3e-04 UniRef50_A0YAP3 Transposase n=1 Tax=marine gamma proteobacterium... 46 4e-04 UniRef50_B9K4C6 Transposase n=4 Tax=Proteobacteria RepID=B9K4C6_... 46 5e-04 UniRef50_Q2G8C0 Transposase n=5 Tax=Alphaproteobacteria RepID=Q2... 46 5e-04 UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellula... 46 6e-04 UniRef50_Q64DD5 Putative uncharacterized protein n=1 Tax=uncultu... 45 0.001 UniRef50_Q469A1 Putative uncharacterized protein n=1 Tax=Methano... 45 0.001 UniRef50_Q6MAQ6 Putative uncharacterized protein n=1 Tax=Candida... 44 0.001 UniRef50_B8IVA0 Transposase and inactivated derivatives-like pro... 44 0.002 UniRef50_B4WST7 Putative uncharacterized protein n=3 Tax=Synecho... 43 0.003 UniRef50_C7S9U1 IS6100 transposase n=358 Tax=root RepID=C7S9U1_E... 43 0.003 UniRef50_Q8PWV9 Putative uncharacterized protein n=1 Tax=Methano... 43 0.004 UniRef50_Q6MBH4 Putative uncharacterized protein n=1 Tax=Candida... 43 0.005 UniRef50_A9VUP9 Integrase catalytic region n=149 Tax=Bacteria Re... 
42 0.005 UniRef50_A9HNK8 Transposase, putative n=1 Tax=Roseobacter litora... 42 0.005 UniRef50_A0LBE3 Putative uncharacterized protein n=1 Tax=Magneto... 42 0.005 UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=... 42 0.007 UniRef50_A5N1B9 Transposase n=2 Tax=Clostridium kluyveri RepID=A... 42 0.008 UniRef50_UPI00016C51C4 hypothetical protein GobsU_02291 n=6 Tax=... 42 0.009 UniRef50_A9EF82 Transposase, putative n=1 Tax=Oceanibulbus indol... 42 0.011 UniRef50_A9VUQ5 Integrase catalytic region n=24 Tax=Bacteria Rep... 41 0.011 UniRef50_C6IUV9 Transposase n=4 Tax=Bacteroides RepID=C6IUV9_9BACE 41 0.013 UniRef50_C6GYT4 IS1216, transposase (Fragment) n=121 Tax=root Re... 41 0.021 UniRef50_A7C324 Putative uncharacterized protein n=3 Tax=Beggiat... 40 0.027 UniRef50_C2JSP7 IS431mec transposase n=2 Tax=Enterococcus faecal... 40 0.028 UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synecho... 40 0.034 UniRef50_A3NK27 IS6 family transposase n=29 Tax=Burkholderia Rep... 39 0.042 UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae... 39 0.048 UniRef50_UPI00006CAE3F hypothetical protein n=1 Tax=Tetrahymena ... 39 0.076 UniRef50_B5WJN4 Integrase, catalytic region n=1 Tax=Burkholderia... 39 0.087 UniRef50_Q10UW1 Putative uncharacterized protein n=1 Tax=Trichod... 38 0.095 UniRef50_A8LH39 Integrase catalytic region n=26 Tax=Bacteria Rep... 38 0.098 >UniRef50_P57998 Insertion element IS1 4 protein insB n=553 Tax=root RepID=INSB4_ECOLI Length = 167 Score = 208 bits (529), Expect = 6e-53, Method: Composition-based stats. 
Identities = 162/167 (97%), Positives = 163/167 (97%) Query: 1 MPGNSPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 MPGNSPHYGRWPQHDF KKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF Sbjct: 1 MPGNSPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF 60 Query: 61 YAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 YAYD LRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR Sbjct: 61 YAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKR 120 Query: 121 YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 YTQRIER+NLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ Sbjct: 121 YTQRIERYNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 >UniRef50_Q0H058 IS1N transposase n=32 Tax=Enterobacteriaceae RepID=Q0H058_ECOLX Length = 231 Score = 181 bits (459), Expect = 7e-45, Method: Composition-based stats. Identities = 65/152 (42%), Positives = 98/152 (64%) Query: 16 FTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVF 75 +LKKL P+ +TS +DV E+DEQW YVG+K+RQ W++YAY++ V+A+ F Sbjct: 80 IRTLKKLAPKRITSSPVTHADVAFICELDEQWSYVGSKARQHWIWYAYNTKTGGVLAYTF 139 Query: 76 GERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQH 135 G RT T L++LL+PF++ + +D W Y + H+ K +TQ IER+NL LR Sbjct: 140 GPRTDQTCRELLALLTPFNIGMLTSDDWGSYGREVPKNKHLTGKIFTQCIERNNLTLRTR 199 Query: 136 LARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 + RLGRK++ FS+SVE+H+KVIG ++ + Sbjct: 200 IKRLGRKTICFSRSVEIHEKVIGAFIEKHMFY 231 >UniRef50_P03832 Insertion element iso-IS1n protein insB n=127 Tax=Gammaproteobacteria RepID=INBN_SHIDY Length = 131 Score = 166 bits (421), Expect = 2e-40, Method: Composition-based stats. 
Identities = 56/130 (43%), Positives = 86/130 (66%) Query: 38 IVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVI 97 + E+DEQW +VG+K+RQ WL+YAY++ V+A+ FG RT T L++LL+PF++ + Sbjct: 2 ALICELDEQWSFVGSKARQHWLWYAYNTKTGGVLAYTFGPRTDETCRELLALLTPFNIGM 61 Query: 98 WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVI 157 +D W Y + H+ K +TQRIER+NL LR + RL RK++ FS+SVE+H+KVI Sbjct: 62 LTSDDWGSYGREVPKDKHLTGKIFTQRIERNNLTLRTRIKRLARKTICFSRSVEIHEKVI 121 Query: 158 GHYLNIKHYQ 167 G ++ + Sbjct: 122 GTFIEKHMFY 131 >UniRef50_B4WT39 IS1 transposase subfamily, putative n=3 Tax=Cyanobacteria RepID=B4WT39_9SYNE Length = 243 Score = 161 bits (407), Expect = 8e-39, Method: Composition-based stats. Identities = 53/179 (29%), Positives = 75/179 (41%), Gaps = 27/179 (15%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTV 70 W Q + P+ + + G + E DE W +VG+KS ++W++ A + + Sbjct: 65 WLQQYASEEYADVPRQAKTSPKKG---PLTLECDEAWSFVGSKSNKQWIWLAINRDTRET 121 Query: 71 VAHVFGERTMATLGRLMSLLSP--FDVVIWMTDGWP-----------------LYESRLK 111 + G R L + L P + TD W YE L Sbjct: 122 IGMHIGGRNREGARSLWACLPPVYRQCAVCYTDFWERCDPASLCGARERAPRQAYEIVLP 181 Query: 112 GKLHV-ISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 K H +SK T IER N LRQ ++RL RKSLSFSK +E H I ++ I HY Sbjct: 182 SKRHRAVSKNSGQTNHIERFNCTLRQRVSRLVRKSLSFSKKLENHIGAIWYF--IHHYN 238 >UniRef50_B3ETP6 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ETP6_AMOA5 Length = 232 Score = 152 bits (383), Expect = 5e-36, Method: Composition-based stats. 
Identities = 48/156 (30%), Positives = 72/156 (46%), Gaps = 12/156 (7%) Query: 9 GRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRK 68 G W Q K R Q+V EMDE W YVG+K ++ W+++A + Sbjct: 81 GEWIQAYHNQNKPKRRQAV-----------EVIEMDEMWHYVGSKKKKLWIWFALERSGG 129 Query: 69 TVVAHVFGERTMATLGRLMSLLSPFDVVIWM-TDGWPLYESRLKGKLHVISKRYTQRIER 127 +++ V G R +T RL + + TD WP Y + H +SK+ T IE Sbjct: 130 SILDFVTGSREASTGKRLWIKIKDIACRSFYATDHWPAYTQFINAHKHKVSKKQTTHIES 189 Query: 128 HNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNI 163 HN N+R +LAR RK+ +SKS L + + + Sbjct: 190 HNANVRHYLARFRRKTKCYSKSERLVELSLYLLIYK 225 >UniRef50_Q1IXR6 IS1 transposase n=3 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1IXR6_DEIGD Length = 148 Score = 150 bits (379), Expect = 1e-35, Method: Composition-based stats. Identities = 51/148 (34%), Positives = 71/148 (47%), Gaps = 8/148 (5%) Query: 24 PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATL 83 Q+V + P +V E+DE W +VG K + RWL+ A + + V+A V G+R+ T Sbjct: 2 RQTVPVCLTPPEEV--VVELDELWTFVGKKKQARWLWIALERSTRKVLAWVLGDRSEQTA 59 Query: 84 GRLMSLLS----PFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARL 139 +L L + TD W Y+ L G + K T +ER N LRQ L RL Sbjct: 60 FKLWDRLPLSPEQRLKGTFCTDLWRAYDEPLLGVKRLTRKGETNHVERLNCTLRQRLGRL 119 Query: 140 GRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 RKSLSFSKS E+ + + Y Sbjct: 120 VRKSLSFSKSDEMLEASLTL--AFHRYN 145 >UniRef50_B2J1G2 IS1 transposase n=24 Tax=Cyanobacteria RepID=B2J1G2_NOSP7 Length = 236 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. 
Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 9/146 (6%) Query: 31 IQPGSDVI-VCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTM--------A 81 P +VI E+DE +VG+K + WL+ A + + ++A V G+ ++ Sbjct: 87 DVPEENVIPEVGELDELETFVGSKKTKIWLWTAVNHFTQGILAWVLGDHSLVLSEVEVAE 146 Query: 82 TLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGR 141 T L + + ++TDGW +Y S + ++SK Y R+E N LR +LARL R Sbjct: 147 TFKPLWENIEKWKCYFYVTDGWKVYPSFIPDGDQIVSKTYMTRVENENTRLRHYLARLHR 206 Query: 142 KSLSFSKSVELHDKVIGHYLNIKHYQ 167 K+L +SKS ++ I L+ YQ Sbjct: 207 KTLCYSKSEQILRYSIKLLLHYLKYQ 232 >UniRef50_C8KX67 IS1 transposase n=1 Tax=Actinobacillus minor 202 RepID=C8KX67_9PAST Length = 238 Score = 139 bits (349), Expect = 4e-32, Method: Composition-based stats. Identities = 43/140 (30%), Positives = 69/140 (49%), Gaps = 2/140 (1%) Query: 26 SVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGR 85 +V P E+DE W +VG KS + WL YA+D + K ++++V+G+R T+ R Sbjct: 88 NVRDVELPHHCFYESIEIDEFWTFVGRKSERVWLIYAFDRVSKKIISYVWGKRNSETVMR 147 Query: 86 LMSLLSPFDVVI--WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKS 143 L L + +D W + KG H + ++YT IE ++ LR + R RKS Sbjct: 148 LKIQLCKSQISFRYVYSDRWICFRKIFKGYPHYLGRKYTIGIEGNHCLLRHRVRRFFRKS 207 Query: 144 LSFSKSVELHDKVIGHYLNI 163 +FSKS++ H + Sbjct: 208 CNFSKSLKYHFSAFRLMIWF 227 >UniRef50_A3IXU4 Iso-IS1 ORF2 n=3 Tax=Bacteria RepID=A3IXU4_9CHRO Length = 138 Score = 139 bits (349), Expect = 4e-32, Method: Composition-based stats. 
Identities = 48/127 (37%), Positives = 74/127 (58%) Query: 41 AEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 AE+D+ +V K +RWL++A D T++A+V G+RT +L ++L PF + + T Sbjct: 8 AEVDKMKIFVAKKEHERWLWHAIDHQTGTILAYVLGQRTDQMFLKLKTMLKPFGISEFYT 67 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 D W Y+ L + +SK Q+IER +L LR + RL RK++ FSK +HD VIG Y Sbjct: 68 DNWGSYKRHLSDEQRTVSKYKMQKIERKHLTLRTRIKRLQRKTICFSKISPMHDLVIGLY 127 Query: 161 LNIKHYQ 167 +N + Sbjct: 128 INKYEFH 134 >UniRef50_A2TRD2 Putative transposase B n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TRD2_9FLAO Length = 219 Score = 136 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 59/123 (47%) Query: 42 EMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTD 101 E+DE W ++G K W+ YA + +V+ G +T + L++ + TD Sbjct: 97 EVDELWSFIGNKKNSTWITYAIEQKTGSVIDFFVGRKTKENIKPLINKVLLLQPTRIYTD 156 Query: 102 GWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 +Y S + ++H + T +IER NL LR H+ RL R+++ FS+ E + + Y Sbjct: 157 RLNIYPSLIPKEMHKRFQYCTNKIERMNLTLRTHIKRLSRRTICFSRKQEYLEAHLKIYF 216 Query: 162 NIK 164 Sbjct: 217 WGY 219 >UniRef50_B2K0W2 IS1 transposase n=30 Tax=Enterobacteriaceae RepID=B2K0W2_YERPB Length = 122 Score = 135 bits (340), Expect = 5e-31, Method: Composition-based stats. 
Identities = 59/121 (48%), Positives = 82/121 (67%), Gaps = 1/121 (0%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPL 105 W +VG K +QRWL+YA++ K ++AHVFG R+ T +L+ LLS F++V W TD + Sbjct: 1 MWSFVGNKKQQRWLWYAWEPRLKRIIAHVFGRRSKKTFRQLLGLLSGFNIVFWCTDNFSA 60 Query: 106 YESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKH 165 Y L + H+ SK YTQRIER NLN+R L RL RK+L SKS E+HD++IG ++ +H Sbjct: 61 Y-EMLPDEKHIRSKLYTQRIERENLNIRNRLKRLNRKTLGDSKSAEMHDRIIGTFIEREH 119 Query: 166 Y 166 Y Sbjct: 120 Y 120 >UniRef50_B0C1Z5 IS1 transposase n=3 Tax=Cyanobacteria RepID=B0C1Z5_ACAM1 Length = 130 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 39/127 (30%), Positives = 60/127 (47%), Gaps = 7/127 (5%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGW 103 W +V KS ++W++ A D + + +V G R+ +L + L + TD W Sbjct: 1 MWSFVNDKSNKQWIWLALDVITREIVGVYVGARSKQGARQLWNSLPGIYRQCAVAYTDFW 60 Query: 104 PLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 Y + H + K T IER N +RQ ++RL RK+LSFSK +E H I + Sbjct: 61 DAYGCVFPKQRHQAVGKETGQTCYIERFNCTMRQRVSRLVRKTLSFSKKLENHIGAIWMF 120 Query: 161 LNIKHYQ 167 + HY Sbjct: 121 --VHHYN 125 >UniRef50_A5FDL2 IS1 transposase n=4 Tax=Flavobacterium johnsoniae UW101 RepID=A5FDL2_FLAJ1 Length = 229 Score = 131 bits (328), Expect = 1e-29, Method: Composition-based stats. 
Identities = 51/155 (32%), Positives = 70/155 (45%) Query: 9 GRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRK 68 R + T+L K QP E+DE Y+G+K WL YA D K Sbjct: 75 ARILEISATTLLKRIVSIGRKINQPIISKCKTYEVDEMCTYIGSKQNFIWLVYALDKNSK 134 Query: 69 TVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERH 128 TVV+ +RT TL R++ L + T Y L K+H + + T IER Sbjct: 135 TVVSFNVAKRTNKTLSRVLDTLKLSEAKKIFTGRLKNYRYLLDEKMHSVKRFGTNHIERK 194 Query: 129 NLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNI 163 NL LR HL RL R+++ SKS+ + V+ Y I Sbjct: 195 NLTLRTHLKRLNRRTICSSKSLLIFTAVLKIYFWI 229 >UniRef50_Q2S4N0 ISSru3, transposase insB n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S4N0_SALRD Length = 158 Score = 129 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 40/153 (26%), Positives = 63/153 (41%), Gaps = 6/153 (3%) Query: 18 SLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGE 77 K SV ++P + E+DE W YV ++ +RWL+ A + VVA V G+ Sbjct: 3 QKKGRESDSVAEGLRPAEEG-DVLELDECWTYVRERANKRWLWVALCRRTRQVVAFVIGD 61 Query: 78 RTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRL---KGKLHV-ISKRYTQRIERHNLNL 132 R+ T RL S + + +D W Y V S +ER L Sbjct: 62 RSARTCARLWSRIPEEYRQGRSFSDFWKSYRPVFAGDPSHRQVGKSSGEMAHVERFFGRL 121 Query: 133 RQHLARLGRKSLSFSKSVELHDKVIGHYLNIKH 165 RQ LAR R++ + S+S + ++ + Sbjct: 122 RQKLARYVRRTRAASESERMLHLTTKLFVEWYN 154 >UniRef50_C5U8R9 IS1 transposase (Fragment) n=1 Tax=Methanocaldococcus infernus ME RepID=C5U8R9_9EURY Length = 133 Score = 128 bits (322), Expect = 6e-29, Method: Composition-based stats. 
Identities = 40/129 (31%), Positives = 69/129 (53%), Gaps = 3/129 (2%) Query: 39 VCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSP--FDVV 96 + E+DE +V +K + W++ A D ++AH G+R+ +L +L+ + D Sbjct: 4 IHLEIDEMHSFVRSKDNKVWIWIAVDKNTGLIIAHKTGDRSDKSLKKLLKEIPKKVLDKC 63 Query: 97 IWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKV 156 + TD W Y + L + H I K YT+R+ER L R ARL R+ + +SKS+E+H+ + Sbjct: 64 TFYTDKWKAY-NILPNERHKIGKEYTRRVERTFLTFRNSCARLVRRGIRYSKSMEMHNII 122 Query: 157 IGHYLNIKH 165 I + + Sbjct: 123 IDLLVYFYN 131 >UniRef50_C2KAD9 IS1 transposase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C2KAD9_9FLAO Length = 239 Score = 128 bits (321), Expect = 7e-29, Method: Composition-based stats. Identities = 41/147 (27%), Positives = 71/147 (48%) Query: 17 TSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFG 76 T++ K + + ++P + E+DE Y +K+ +RW+ AY K V+ + G Sbjct: 77 TTVLKKILKIASKVVKPPIPQNITIEIDELKTYTQSKTNERWVVAAYCRETKKVIDYKLG 136 Query: 77 ERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHL 136 RT TL ++ L + +D +Y + LH +R T IER L+LR H+ Sbjct: 137 RRTTKTLQCIIDTLLYANPKKIYSDRLNIYPKLIPKHLHSTKRRETNHIERKFLDLRTHI 196 Query: 137 ARLGRKSLSFSKSVELHDKVIGHYLNI 163 RLGRKS++ ++ + D ++ Y Sbjct: 197 KRLGRKSINKAQRDKYTDAILRIYFWG 223 >UniRef50_A5WC29 IS1 transposase n=30 Tax=Moraxellaceae RepID=A5WC29_PSYWF Length = 233 Score = 120 bits (300), Expect = 2e-26, Method: Composition-based stats. 
Identities = 45/152 (29%), Positives = 70/152 (46%), Gaps = 6/152 (3%) Query: 20 KKLRPQSVTSR---IQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFG 76 K ++ I P C E+DE W +VG K+ ++WL YAY +VA+V+G Sbjct: 79 KGKVLATLKKCHYPITPKQRQYDCLEIDELWTFVGKKTNKQWLIYAYHRDTGEIVAYVWG 138 Query: 77 ERTMATL--GRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQ 134 +R + T+ + +D W + + KG VI K +T IE +N +R Sbjct: 139 KRDLNTVKKLKAKLKALGVSCARIASDTWDSFVTGFKGFTQVIGKFFTVGIEGNNCTIRH 198 Query: 135 HLARLGRKSLSFSKSVELHDKVIGH-YLNIKH 165 + R R+S +FSK +E H K + I H Sbjct: 199 RVRRAFRRSCNFSKKLENHFKAFDLAFFYINH 230 >UniRef50_C5BB57 Iso-IS1 ORF2 n=24 Tax=Enterobacteriaceae RepID=C5BB57_EDWI9 Length = 131 Score = 114 bits (286), Expect = 8e-25, Method: Composition-based stats. Identities = 40/107 (37%), Positives = 64/107 (59%) Query: 50 VGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESR 109 +G+K+RQ WL+YAY++ V+A+ FG +T + L+ L++PF++ + +D Sbjct: 1 MGSKARQHWLWYAYNTKTGGVLAYTFGPKTDESCRELLVLITPFNIGMITSDNRSSDGRE 60 Query: 110 LKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKV 156 + H+ K TQRI R+NL LR H+ RL RK++ FS+SV K Sbjct: 61 VPKDKHLTGKILTQRIVRNNLTLRTHIKRLARKTICFSRSVRSTKKS 107 >UniRef50_O67144 Putative uncharacterized protein n=1 Tax=Aquifex aeolicus RepID=O67144_AQUAE Length = 147 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. 
Identities = 32/145 (22%), Positives = 64/145 (44%), Gaps = 7/145 (4%) Query: 24 PQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLF-YAYDSLRKTVVAHVF-GERTMA 81 P+ + ++ D + DE W YVG K + W++ + T+ +F G+R++ Sbjct: 4 PEYGSEKVVKTEDNMENKPTDEMWSYVGTKGNEVWIWSVVVELKDGTIKKFLFAGDRSLR 63 Query: 82 TLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRY-TQRIERHNLNLRQHLARLG 140 T ++++ + + + TD + +Y L H++ K R E + LR L Sbjct: 64 TFLKILAKMPEAE--EYETDAYRVY-EWLPRDRHIVRKYGRVNRNEALHSKLRDKLVAFK 120 Query: 141 RKSLSFSKSVELHDKVIGHYLNIKH 165 RK+ +F +S + + +I H Sbjct: 121 RKTKAFFRSFLYLRYALALF-SIHH 144 >UniRef50_P73781 Transposase n=5 Tax=Bacteria RepID=P73781_SYNY3 Length = 138 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 56/120 (46%), Gaps = 3/120 (2%) Query: 48 GYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYE 107 + + + WL+ AYD + ++ G R TL RL+ L+ + V + TD W Y+ Sbjct: 2 AFSSGQKNKLWLWKAYDRVTGRLIDWELGNRDSQTLSRLLERLAKWKVTVSCTDDWRPYQ 61 Query: 108 SRL---KGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIK 164 L H ISKR T IER+N + R LAR R + S+S + + + + + Sbjct: 62 QLLDEHPDAFHGISKRETVGIERNNSDNRHWLARFHRPTKVISRSAHMVNITMAIFAKFR 121 >UniRef50_B2SG01 IS1 transposase n=7 Tax=Francisella tularensis RepID=B2SG01_FRATM Length = 102 Score = 109 bits (271), Expect = 5e-23, Method: Composition-based stats. Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 2/104 (1%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPL 105 W ++G+K W+ AYD + V G R AT RL + + TD W Sbjct: 1 MWNFIGSKK--CWIIKAYDRRVGKTIIWVTGGRDNATFRRLYKKVQHLTNCNFYTDDWVA 58 Query: 106 YESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + L K H+I K T IER N N R +LAR+ R++ S+S Sbjct: 59 FVEVLPKKRHIIGKSGTVAIERDNSNTRHNLARMTRRTKVISRS 102 >UniRef50_C0A223 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A223_9BACT Length = 269 Score = 104 bits (258), Expect = 1e-21, Method: Composition-based stats. 
Identities = 35/174 (20%), Positives = 53/174 (30%), Gaps = 48/174 (27%) Query: 41 AEMDEQWGYVGAKSRQR----------WLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL 90 + DE W +VG K + W + A D K V G R + + M L Sbjct: 63 IQCDEIWSFVGCKEKNVTNNGKRQGDTWTWIACDPDTKLVPCWFIGRRDSESAKKFMRRL 122 Query: 91 SP---FDVVIWMTDGWPLYESRLK-------------------GKLHVISKRY------- 121 + TDG Y + +K G H Sbjct: 123 ARHLSLGSTQITTDGLKAYINAIKEILWIETSYGMVEKKYDVSGDDHRTRYIGSEKTAIF 182 Query: 122 ---------TQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHY 166 T +ER NL +R + R RK+ +SK + H I + ++ Sbjct: 183 GNPDPDTMNTSIVERQNLTMRMSMRRFTRKTNGYSKKIANHRYAIALHFMYYNF 236 >UniRef50_B0CCX7 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=B0CCX7_ACAM1 Length = 196 Score = 99.8 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 35/143 (24%), Positives = 52/143 (36%), Gaps = 16/143 (11%) Query: 40 CAEMDEQWGYVGAKSR----------QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSL 89 DE W V K + W+ + V+ G+ T L+ Sbjct: 24 IINADELWSSVKKKQKHCEPEELSLGDCWIALSLAKDSGLVLTGRIGKHTDELAQELIEN 83 Query: 90 LS-PFDVVIWMTDGWPLYESRLKGK-LHVISKRYTQRIERHNLNLRQHLARLGRKSLSFS 147 W TDGW Y +L + +H +SK TQR+ER N LRQ R R+ F Sbjct: 84 TEGKTACHHWQTDGWEGYSRQLADEVIHHVSKALTQRLERTNGILRQQTGRWHRRQNKFG 143 Query: 148 KSVEL----HDKVIGHYLNIKHY 166 K + V+ ++ I + Sbjct: 144 KVWQQSAVTLRLVMAYFNWIWRH 166 >UniRef50_Q46GF8 Putative uncharacterized protein n=3 Tax=Methanosarcina barkeri str. Fusaro RepID=Q46GF8_METBF Length = 112 Score = 98.7 bits (244), Expect = 6e-20, Method: Composition-based stats. 
Identities = 29/92 (31%), Positives = 42/92 (45%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 K + FG R T + L ++ MTD W Y L +H SK T +E Sbjct: 8 KKFINCSFGSRGTETGQLIWEKLKQKEIGEVMTDHWRAYAEFLPENIHTQSKAETYTVEG 67 Query: 128 HNLNLRQHLARLGRKSLSFSKSVELHDKVIGH 159 +N LR LARL RK+ ++KS+E+ + Sbjct: 68 YNGILRHFLARLRRKTKCYTKSIEMLKYSVLL 99 >UniRef50_A9FJP3 Putative uncharacterized protein n=5 Tax=Proteobacteria RepID=A9FJP3_SORC5 Length = 349 Score = 98.7 bits (244), Expect = 7e-20, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 60/208 (28%), Gaps = 59/208 (28%) Query: 17 TSLKKLRPQSVTSRIQPGSDVIV-----CAEMDEQWGYVGAKSRQR-----------WLF 60 ++ L + + + ++ A+ DE W YV K + + F Sbjct: 34 PTVLALLLRIGAGCERLHNRIVRGVTCHVAQCDEIWSYVQKKQSRVTASDPAEYGDAYTF 93 Query: 61 YAYDSLRKTVVAHVFGERTMATLGRLMSLLSP--FDVVIWMTDGWPLY------------ 106 S K ++++ G+R + L + TDGW Y Sbjct: 94 VGMASASKLIISYRVGKRDEENTRAFVKDLRARLTTIPQLYTDGWQPYIGAVGASFTGGV 153 Query: 107 ------------------ESRLKGKLHVISKR-----------YTQRIERHNLNLRQHLA 137 + I+K T +ER N +R H+ Sbjct: 154 DYCQVVKNYSRRPRRDDEVRYEPPRDPFITKTPIFGIPDVEHASTSHVERQNWTIRMHIR 213 Query: 138 RLGRKSLSFSKSVELHDKVIGHYLNIKH 165 R R FS+ + H + ++ + Sbjct: 214 RFTRLCNGFSRKLANHRAAVALHVAWYN 241 >UniRef50_C3NN20 IS1 transposase n=30 Tax=cellular organisms RepID=C3NN20_SULIN Length = 316 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. 
Identities = 34/121 (28%), Positives = 55/121 (45%), Gaps = 12/121 (9%) Query: 43 MDEQWGYV----GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIW 98 DE W Y+ G+K W++ A + G+R T L++ L +V Sbjct: 172 FDESWTYLRVRHGSKRENLWIWNALA---DGLPFFTTGDRDYKTFSFLLNSLPKSEV--N 226 Query: 99 MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIG 158 TD + +Y+ HV SK+YT +E +N R HLARL R + + ++S + D + Sbjct: 227 YTDDYSVYQVL---DNHVASKKYTYTVESYNSYCRAHLARLARDTRAVNRSERMVDYSLA 283 Query: 159 H 159 Sbjct: 284 L 284 >UniRef50_A9GLN9 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLN9_SORC5 Length = 405 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 59/180 (32%), Gaps = 54/180 (30%) Query: 38 IVCAEMDEQWGYVGAKSRQR-----------WLFYAYDSLRKTVVAHVFGERTMATLGRL 86 + DE + YVG K + + F A D+ + V+A G+R M T G Sbjct: 95 CELIQADEVFSYVGKKQARVTEKDAPGIGETYSFTALDTASRLVIAWRVGKRDMETCGPF 154 Query: 87 MSLLSPFDVVI--WMTDGWPLYESRLKGK------------------------------- 113 ++ L +V+ TDG+ Y + + Sbjct: 155 IADLRSRLLVMPQITTDGFAPYIATVAEHFGLSVDYMQTVKNYRTGSYRGPDHRYEPPRD 214 Query: 114 ----LHVI------SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNI 163 H I K T +ER N R L R+ R +FSK+ E H + + Sbjct: 215 PFITKHTIYGAPDAKKASTSYVERLNGTTRHLLGRMRRLCYAFSKAPEHHRAAVALHYTY 274 >UniRef50_Q9CJQ7 Putative uncharacterized protein n=2 Tax=Pasteurellaceae RepID=Q9CJQ7_PASMU Length = 181 Score = 97.1 bits (240), Expect = 2e-19, Method: Composition-based stats. 
Identities = 30/118 (25%), Positives = 51/118 (43%), Gaps = 5/118 (4%) Query: 47 WGYV--GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVV--IWMTDG 102 W +V ++ ++ Y + +VA V+G+R + T L L V D Sbjct: 62 WHFVPPNRIDQKYRIYIGYHAKTSEIVAFVWGKRDLQTALALKQRLKELKVSYERIAGDN 121 Query: 103 WPLYESRLKG-KLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGH 159 W + + + K++T+ IE +N +R L+R R+S FSKS+ H K Sbjct: 122 WDAFVNAFSDTGDQWVGKQHTKAIEGNNCRIRHRLSRAVRRSCCFSKSMFYHVKSFNI 179 >UniRef50_A9GLP8 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLP8_SORC5 Length = 337 Score = 96.0 bits (237), Expect = 5e-19, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 52/174 (29%), Gaps = 54/174 (31%) Query: 40 CAEMDEQWGYVGAKSRQR-----------WLFYAYDSLRKTVVAHVFGERTMATLGRLMS 88 +MDE W +V K + + + A D+ K ++ G+R + Sbjct: 5 VIQMDEMWSFVQKKQARVTAEDPAEHGDAYFYVALDANTKLAISFHVGKRDGENTEAFIK 64 Query: 89 LLSPFD--VVIWMTDGWPLYE------------------------------SRLKGKLHV 116 L V +DGW Y + Sbjct: 65 DLRSRLTVVPHITSDGWQPYIEAMATSFRGSADYAQCVKNYRGGPQRSPDHRYEPPRNPF 124 Query: 117 ISKR-----------YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGH 159 ++K T +ER NL R + R R L+FSK++ H IG Sbjct: 125 VTKTPIFGAPKDELLSTSFVERFNLQTRHTVGRTRRLCLAFSKTLRGHRAAIGL 178 >UniRef50_C5BAQ0 IS1 ORF n=14 Tax=Gammaproteobacteria RepID=C5BAQ0_EDWI9 Length = 78 Score = 95.2 bits (235), Expect = 7e-19, Method: Composition-based stats. Identities = 45/78 (57%), Positives = 57/78 (73%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + F++ +MTD WP+Y + L HV+SK+YTQRIERHNLNLR HL RL R+++ FS S Sbjct: 1 MRKFNIAFYMTDAWPVYRTLLDPAHHVVSKKYTQRIERHNLNLRTHLKRLTRRTICFSNS 60 Query: 150 VELHDKVIGHYLNIKHYQ 167 E+HDKVIG YL I HY Sbjct: 61 EEMHDKVIGWYLTINHYH 78 >UniRef50_B0JRY8 Transposase n=27 Tax=Bacteria RepID=B0JRY8_MICAN Length = 111 Score = 93.7 bits (231), Expect = 2e-18, Method: Composition-based stats. 
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 7/105 (6%) Query: 68 KTVVAHVFGERTMATLGRLMSLLSP--FDVVIWMTDGWPLYESRLKGKLHV-ISK--RYT 122 ++ + G+R+ + +L + L + TD W Y++ + K H + K T Sbjct: 3 GKLLVAMRGDRSRQSAKKLWASLPGVYRQCAVAYTDFWESYKTVIPSKRHRPVGKETGQT 62 Query: 123 QRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 IER N RQ ++RL R+SLSFSK +E H + ++ I Y Sbjct: 63 NPIERLNNTFRQRISRLVRESLSFSKKMENHVGAVWYF--IHDYN 105 >UniRef50_B0CAP5 Putative uncharacterized protein n=3 Tax=Acaryochloris marina MBIC11017 RepID=B0CAP5_ACAM1 Length = 144 Score = 91.7 bits (226), Expect = 8e-18, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 45/111 (40%), Gaps = 2/111 (1%) Query: 56 QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGK- 113 + W+ + V++ G+ T L+ W TDGW + + Sbjct: 5 ECWIALSLAKDSSLVLSGRIGKHTDELAQDLIENTEGKTTCHHWQTDGWEGSSRQPPDEV 64 Query: 114 LHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIK 164 +H +SK TQR++R N LRQ R ++ F K + H + + + + Sbjct: 65 IHHVSKVLTQRLKRTNGILRQQTGRWHQRQNKFGKVWQQHAVTLTLFYHFR 115 >UniRef50_B0CEC0 Putative uncharacterized protein n=6 Tax=Acaryochloris marina MBIC11017 RepID=B0CEC0_ACAM1 Length = 172 Score = 90.6 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 46/113 (40%), Gaps = 2/113 (1%) Query: 56 QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLS-PFDVVIWMTDGWPLYESRLKGKL 114 W+ + V++ G+ T L+ W TDGW Y +L ++ Sbjct: 20 DCWIALSLAKESGLVLSGRIGKHTDELAQELIENTEGKTACHHWQTDGWEGYARQLPDEV 79 Query: 115 -HVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHY 166 H +SK TQR+ER N +RQ R R+ F K + + L+ ++ Sbjct: 80 VHEVSKALTQRLERTNGIVRQQTGRWHRRQNKFGKVWQQSAMTLRLVLSYFNW 132 >UniRef50_Q6MCX7 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX7_PARUW Length = 163 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. 
Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 2/83 (2%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDV--IVCAEMDEQWGYVGAKSRQRWLFYAYDSLRK 68 W L K P+++ + + +D +V E+DE W YVG+K+ +WL+ S + Sbjct: 71 WLLEFIGELTKELPENLNAEVVSENDELEVVVLEVDELWSYVGSKANPQWLWLVMHSKTR 130 Query: 69 TVVAHVFGERTMATLGRLMSLLS 91 VVA G R T +L+ L Sbjct: 131 QVVAMQIGPRNKETAEKLLYKLP 153 >UniRef50_Q972H6 Putative uncharacterized protein ST1154 n=1 Tax=Sulfolobus tokodaii RepID=Q972H6_SULTO Length = 152 Score = 87.5 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 45/122 (36%), Gaps = 9/122 (7%) Query: 44 DEQWGYVGAKSRQRWLFYAYDS---LRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT 100 DE W Y+ +R + + + + G+R T + L D W++ Sbjct: 27 DEMWTYLYRNTRAFYKWVFNCHVYTRLGLYIIYSVGDRDENTFREVKMYLP--DDGRWVS 84 Query: 101 DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 D + +Y V+S E + +LR L R R + + ++S+ + I Sbjct: 85 DDYNVY--FWLKNHTVVS--LVNPNESFHSSLRDRLVRFKRATKAVNRSINMVKYSIALV 140 Query: 161 LN 162 L Sbjct: 141 LW 142 >UniRef50_C3NLB2 Insertion element protein n=50 Tax=Sulfolobus RepID=C3NLB2_SULIN Length = 244 Score = 86.3 bits (212), Expect = 3e-16, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 54/155 (34%), Gaps = 9/155 (5%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD---SLR 67 W + + + + +V +DE W Y+ +R + + + Sbjct: 86 WIKRYGRKKHEKLVELWGRAKELVKGKVVAKVVDEMWTYLYKNARAFYKWVFTCYVYTKL 145 Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIER 127 + + G+R +T + L D W++D + LY V+S E Sbjct: 146 GVYLIYSVGDRDESTFLEVKKYLP--DEGRWVSDDYNLY--FWLKDHTVVSPVNPN--ES 199 Query: 128 HNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLN 162 + +LR L R R + + ++S+ I L Sbjct: 200 FHSSLRDRLIRFKRATKAINRSIRTMMYSIALVLW 234 >UniRef50_D1JFE2 Putative uncharacterized protein n=3 Tax=uncultured archaeon RepID=D1JFE2_9ARCH Length = 217 Score = 84.8 bits (208), Expect = 9e-16, Method: Composition-based stats. 
Identities = 36/148 (24%), Positives = 57/148 (38%), Gaps = 37/148 (25%) Query: 56 QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF-------DVVIWMTDGWPLYES 108 W++ A S K +AH G+R T L++L+ D +++DG Y Sbjct: 23 DCWIYTAIKSDTKLHLAHCTGKRVQETANALVALVKNRGKAPDTDDKATFVSDGNNQYTK 82 Query: 109 RL-----------------KGKLHVISK-------------RYTQRIERHNLNLRQHLAR 138 L + V+ K T +ER+NL LR +++ Sbjct: 83 ALFENFDVNAINYGQLVKERDNGRVVGKTRTIIFGSLEVDEIETVYVERYNLTLRHGISK 142 Query: 139 LGRKSLSFSKSVELHDKVIGHYLNIKHY 166 L RKSL FSK E+ D + Y ++ Sbjct: 143 LVRKSLCFSKCKEMLDDHLDLYQCYTNF 170 >UniRef50_Q2FSQ2 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FSQ2_METHJ Length = 201 Score = 83.3 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 49/148 (33%), Gaps = 37/148 (25%) Query: 56 QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL-------SPFDVVIWMTDGWPLYES 108 W + + +A G+R + T ++ +P + + TDG Y Sbjct: 21 DCWSYTCFKRDSGLFLAFESGKRNIDTCADMLVRFFNRMELPTPENKISIFTDGNVQYSI 80 Query: 109 RLK--------GKLHVISKRYTQR----------------------IERHNLNLRQHLAR 138 L VI + + IE +N +RQ L+R Sbjct: 81 CLPELYCEPCLDYGQVIKVKEKNKLVYVIREKIMGNPDSKAISTSVIEGYNNKIRQRLSR 140 Query: 139 LGRKSLSFSKSVELHDKVIGHYLNIKHY 166 GRK+ SFSK + + + + ++ Sbjct: 141 FGRKTASFSKKLNRFISALNIFQFVHNF 168 >UniRef50_Q10VW0 ISSru3, transposase InsB n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10VW0_TRIEI Length = 76 Score = 81.3 bits (199), Expect = 1e-14, Method: Composition-based stats. Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 2/73 (2%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGW 103 W +VG+K+ Q+W + A D K +VA GER +L + I TD W Sbjct: 1 MWSFVGSKNNQQWFWLAIDIETKEIVAFSLGERGEKGANQLWNSWPGIYRQCAICYTDFW 60 Query: 104 PLYESRLKGKLHV 116 Y+ + Sbjct: 61 SAYDVIFPHCRQL 73 >UniRef50_Q466Y9 Putative uncharacterized protein n=13 Tax=Methanosarcina RepID=Q466Y9_METBF Length = 184 Score = 81.3 bits (199), Expect = 1e-14, Method: Composition-based stats. 
Identities = 27/116 (23%), Positives = 39/116 (33%), Gaps = 2/116 (1%) Query: 9 GRWPQHDFTSLKKLRPQSVTS-RIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLR 67 GR+ S++K + + I EMDE Y+G K Sbjct: 63 GRFLGVSHVSVQKWIKKFGQELEDLKSENEISIVEMDEMHTYIGNKKNIAGSGLLL-IEL 121 Query: 68 KTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQ 123 + FG R T + L ++ MTD W Y L +H SK+ Q Sbjct: 122 GKFIHCSFGNRGTETGQLIWEKLKQKEIGEVMTDHWRAYAEFLPENIHTQSKKRIQ 177 >UniRef50_Q6MD28 Putative uncharacterized protein n=3 Tax=Parachlamydiaceae RepID=Q6MD28_PARUW Length = 209 Score = 81.0 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 32/87 (36%), Gaps = 5/87 (5%) Query: 11 WPQHDFTSLKKLRPQ----SVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSL 66 W + P+ VT + +V E+DE W +VG K +WL+ Sbjct: 73 WLLDFINFIINDLPEDLNAQVTCCEKDELEVAK-LEVDELWNFVGNKKNDQWLWLILHKK 131 Query: 67 RKTVVAHVFGERTMATLGRLMSLLSPF 93 + V+A G R T L + L Sbjct: 132 SRQVLAMQVGPRDKKTAELLFAKLPES 158 >UniRef50_A4AD66 Transposase n=19 Tax=unclassified Gammaproteobacteria RepID=A4AD66_9GAMM Length = 227 Score = 79.4 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 48/133 (36%), Gaps = 13/133 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF---DVVIWM 99 +DE + + K + +L+ A D + V ++ +R A R L + + Sbjct: 75 IDEVFVTINGKQQ--YLWRAVDQDGEVVDVYLQTKRDGAAAKRFFKRLLRSHGGEPRKIV 132 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEL--- 152 TD Y + +H+ + R E+ + R R R+ S +++ Sbjct: 133 TDKLRSYGVAHRELIPETVHITEQYENNRAEQSHETTRAR-ERGMRRFKSVAQAQRFVAA 191 Query: 153 HDKVIGHYLNIKH 165 H V + +H Sbjct: 192 HAAVFNLFNLGRH 204 >UniRef50_Q8VSP6 Putative IS1 ORF n=1 Tax=Shigella flexneri RepID=Q8VSP6_SHIFL Length = 67 Score = 79.0 bits (193), Expect = 6e-14, Method: Composition-based stats. 
Identities = 38/64 (59%), Positives = 47/64 (73%) Query: 104 PLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNI 163 P+Y + L HVISK+ TQRIERHNLNLR HL RL RK++ FSKS ++H K+IG YL I Sbjct: 4 PVYRTLLSSTSHVISKKCTQRIERHNLNLRTHLKRLTRKTICFSKSDDMHYKIIGWYLTI 63 Query: 164 KHYQ 167 H+ Sbjct: 64 NHHH 67 >UniRef50_Q6MBQ1 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBQ1_PARUW Length = 138 Score = 77.1 bits (188), Expect = 2e-13, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 33/87 (37%), Gaps = 5/87 (5%) Query: 11 WPQHDFTSLKKLRPQ----SVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSL 66 W L P+ VT + +V E+DE+W +VG K +WL+ Sbjct: 45 WLLDFINLLINDLPEDLNTQVTCCEKDELEVAR-LEVDERWSFVGNKKNDQWLWLILHKK 103 Query: 67 RKTVVAHVFGERTMATLGRLMSLLSPF 93 + V+A G R T L + L Sbjct: 104 SRQVLAMQVGPRDKKTAELLFTKLPES 130 >UniRef50_Q10ZK1 Insertion element protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZK1_TRIEI Length = 177 Score = 75.9 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 14/72 (19%), Positives = 26/72 (36%) Query: 40 CAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWM 99 E+ E +V K + L+ R+ ++ V G + T L + + + Sbjct: 106 VGELHELETFVSDKKNKVLLWTLVYHFRQGILGWVVGNHSGDTFQPLWQAIGFWKCYFQV 165 Query: 100 TDGWPLYESRLK 111 TDG P+ Sbjct: 166 TDGNPVASRLYP 177 >UniRef50_C8SAB2 IS1 transposase (Fragment) n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SAB2_FERPL Length = 75 Score = 74.0 bits (180), Expect = 2e-12, Method: Composition-based stats. 
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 3/72 (4%) Query: 96 VIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDK 155 I+ TD W Y + + K +I K T +ER L LR R RKS+ FSKS+E+ + Sbjct: 3 AIFYTDRWDAY-NLIPYKQRIIKKGGTNHVERLFLTLRNDNPRFARKSIRFSKSIEMLEN 61 Query: 156 VIGHYLNIKHYQ 167 + + I +Y Sbjct: 62 SLKLW--IHYYN 71 >UniRef50_Q64CQ0 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos1D1 RepID=Q64CQ0_9ARCH Length = 168 Score = 73.6 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 46/133 (34%), Gaps = 37/133 (27%) Query: 71 VAHVFGERTMATLGRLMSLL-------SPFDVVIWMTDGWPLYESRLKG----------- 112 +A G++T + GR+M + SP + TDG Y L Sbjct: 1 MAFSVGKQTQESCGRMMKKVFGRTEQPSPQTKMEMFTDGNDDYTYVLPDYCADACIEYGQ 60 Query: 113 ------KLHVISK-------------RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELH 153 V+ K T +E +N LR+ + RL RK+ FSK + Sbjct: 61 LVKIRENGRVVRKEKRIIYGNPDLGDIETTDVENYNGILRERIGRLVRKTKCFSKRKRML 120 Query: 154 DKVIGHYLNIKHY 166 + + + ++ Sbjct: 121 ECSLQVFQFYWNF 133 >UniRef50_Q46CV2 Putative uncharacterized protein n=14 Tax=Methanosarcina RepID=Q46CV2_METBF Length = 75 Score = 72.5 bits (176), Expect = 5e-12, Method: Composition-based stats. Identities = 22/62 (35%), Positives = 32/62 (51%) Query: 98 WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVI 157 MTD W Y L +H SK T +E +N L+ LARL RK+ ++KS+E+ + Sbjct: 1 MMTDHWRAYAEFLPENIHTQSKAETYTVEGYNGILKHFLARLRRKTKCYTKSIEMLKYSV 60 Query: 158 GH 159 Sbjct: 61 LL 62 >UniRef50_A8YFR2 ImeAB protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFR2_MICAE Length = 122 Score = 71.7 bits (174), Expect = 9e-12, Method: Composition-based stats. 
Identities = 25/66 (37%), Positives = 34/66 (51%), Gaps = 3/66 (4%) Query: 93 FDVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 + TD W Y++ + K H + K T IER N RQ ++RL R+SLSFSK Sbjct: 24 RQCAVAYTDCWESYKTGIPSKRHRPVGKETGQTNPIERLNNTFRQRISRLVRESLSFSKK 83 Query: 150 VELHDK 155 +E H Sbjct: 84 MENHVG 89 >UniRef50_Q6MCH2 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCH2_PARUW Length = 121 Score = 70.6 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 34/81 (41%), Gaps = 4/81 (4%) Query: 80 MATLGRLMSLLSPF-DVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNLRQH 135 L + L + TD + +Y H +SK T IER N RQ Sbjct: 26 KKPLSFFLQKLPESLKKAFYFTDKFNVYYETNPWSQHQPVSKQSGQTSYIERFNCTRRQR 85 Query: 136 LARLGRKSLSFSKSVELHDKV 156 ARL RK+LSFSK + H + Sbjct: 86 CARLVRKTLSFSKKLTNHIGL 106 >UniRef50_A8GX98 Transposase and inactivated derivative n=2 Tax=Rickettsia bellii RepID=A8GX98_RICB8 Length = 99 Score = 70.2 bits (170), Expect = 3e-11, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 42/82 (51%), Gaps = 1/82 (1%) Query: 77 ERTMATLGRLMSLLSP-FDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQH 135 R +++ + L +++ I +D + +Y + K H +K+ T +E N +R + Sbjct: 4 GRDISSYLPMALRLEENYEIDISCSDHYDVYGAYKIAKRHYFTKKETALVESFNSLIRNY 63 Query: 136 LARLGRKSLSFSKSVELHDKVI 157 LAR RK+ +SK++++ I Sbjct: 64 LARFNRKTKRYSKAIDMIYNSI 85 >UniRef50_A8YEG9 Q6EVL0_MICAE ImeA protein n=4 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YEG9_MICAE Length = 171 Score = 69.4 bits (168), Expect = 4e-11, Method: Composition-based stats. 
Identities = 10/64 (15%), Positives = 26/64 (40%), Gaps = 2/64 (3%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTV 70 W Q+ + P+ + + + E DE W +V +K+ + +++ D + + Sbjct: 107 WLQNYVNNKLASVPRQIKVSDK--LKGKLVIECDEMWSFVFSKTIKVYIWRLIDRNTREI 164 Query: 71 VAHV 74 + Sbjct: 165 IGCY 168 >UniRef50_UPI00016C465A IS1 transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C465A Length = 88 Score = 68.6 bits (166), Expect = 6e-11, Method: Composition-based stats. Identities = 22/73 (30%), Positives = 32/73 (43%), Gaps = 3/73 (4%) Query: 96 VIWMTDGWPLYESRLKGKLHVISKR---YTQRIERHNLNLRQHLARLGRKSLSFSKSVEL 152 V TD P + + H ++ T IER L LRQ AR RK+L+FSK Sbjct: 12 VTVYTDLLPACRAAIPRARHRAVRKVTGLTAHIERFWLTLRQRCARFVRKTLTFSKCPRN 71 Query: 153 HDKVIGHYLNIKH 165 H + ++ + Sbjct: 72 HLGALWYFARRYN 84 >UniRef50_B0URB1 Putative uncharacterized protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0URB1_METS4 Length = 82 Score = 67.5 bits (163), Expect = 2e-10, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 31/67 (46%) Query: 98 WMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVI 157 + TD + Y + L H + K TQ +E +N R AR R++ SKSVE+ + + Sbjct: 4 FCTDNYAPYAAALPAGRHHVGKDQTQLVESNNARQRHWFARFRRRTCVVSKSVEMVEATM 63 Query: 158 GHYLNIK 164 + Sbjct: 64 ALFAFYH 70 >UniRef50_A8ZKR3 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=A8ZKR3_ACAM1 Length = 241 Score = 66.7 bits (161), Expect = 2e-10, Method: Composition-based stats. 
Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 4/78 (5%) Query: 38 IVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLM----SLLSPF 93 I EMDE+ GYV K +Q W A D+ K ++ G R + RLM L+ Sbjct: 122 IDVLEMDERHGYVAIKQQQCWDAVAIDAASKFIIQVEVGPRNTNLIDRLMRATHKRLAHP 181 Query: 94 DVVIWMTDGWPLYESRLK 111 ++ MTDG Y + Sbjct: 182 RDLVLMTDGDASYRTLFP 199 >UniRef50_B5VWL6 Putative uncharacterized protein n=2 Tax=Arthrospira maxima CS-328 RepID=B5VWL6_SPIMA Length = 153 Score = 66.7 bits (161), Expect = 3e-10, Method: Composition-based stats. Identities = 13/83 (15%), Positives = 37/83 (44%), Gaps = 1/83 (1%) Query: 16 FTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVF 75 ++ + T + ++ A++DE +VG+K W++ ++ ++ V Sbjct: 46 HNTILNWVRVAETHIDEENYEIPEIAQIDELQTFVGSKKT-IWVWTVVNTKLPGILKFVI 104 Query: 76 GERTMATLGRLMSLLSPFDVVIW 98 G+R++ T L ++ + ++ Sbjct: 105 GDRSLLTFTTLWQMIQGWACFLY 127 >UniRef50_Q10ZU2 Putative uncharacterized protein n=3 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZU2_TRIEI Length = 79 Score = 64.8 bits (156), Expect = 9e-10, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 23/48 (47%) Query: 37 VIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLG 84 + + DE W +VG K+ ++WL+ A D + +V GER Sbjct: 32 GKLTIQCDEMWSFVGNKNNKQWLWLAIDIETQEIVGFYLGERGEKGAA 79 >UniRef50_Q1CBA9 Putative transposase n=3 Tax=Yersinia pestis RepID=Q1CBA9_YERPA Length = 85 Score = 64.8 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 1/78 (1%) Query: 51 GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRL 110 G + + +T F T +L+ LLS F++V W TD + Y L Sbjct: 5 GQQKAATLALVCLGASPQTYYCSYFWSSEQKTFRQLLGLLSGFNIVFWCTDNFSAY-EML 63 Query: 111 KGKLHVISKRYTQRIERH 128 + H+ SK YTQRIER Sbjct: 64 PDEKHIRSKLYTQRIERE 81 >UniRef50_C5B7X2 Iso-IS1 ORF2 n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B7X2_EDWI9 Length = 99 Score = 64.4 bits (155), Expect = 1e-09, Method: Composition-based stats. 
Identities = 22/69 (31%), Positives = 37/69 (53%) Query: 90 LSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKS 149 L+ F++ + D W + + +TQ ER++L LR + RL RK + FS++ Sbjct: 3 LTAFNIGMITRDDWGNPIREVPWGKPLTGTIFTQHSERNSLMLRTRIKRLARKRIGFSRA 62 Query: 150 VELHDKVIG 158 + LH+KV G Sbjct: 63 IALHEKVTG 71 >UniRef50_UPI000190BD22 insertion element protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 RepID=UPI000190BD22 Length = 113 Score = 62.5 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 45/46 (97%), Positives = 45/46 (97%) Query: 19 LKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 64 L KLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD Sbjct: 68 LNKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYD 113 >UniRef50_A9FZD9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZD9_SORC5 Length = 216 Score = 61.7 bits (148), Expect = 8e-09, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 28/83 (33%), Gaps = 13/83 (15%) Query: 40 CAEMDEQWGYVGAKSRQR-----------WLFYAYDSLRKTVVAHVFGERTMATLGRLMS 88 +MDE W +V K + +L+ A D+ K ++ G+ + Sbjct: 63 VIQMDEMWSFVQKKQARVTAKDPAEHGDAYLYVALDANTKPAISFHVGKCDGENTEMFIK 122 Query: 89 LLSPFD--VVIWMTDGWPLYESR 109 L V +DGW Y Sbjct: 123 DLRGRLTVVPHVTSDGWQPYIEA 145 >UniRef50_Q648U8 Putative uncharacterized protein n=6 Tax=environmental samples RepID=Q648U8_9ARCH Length = 173 Score = 60.5 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 41/116 (35%), Gaps = 30/116 (25%) Query: 80 MATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKL-----------------HVISKRYT 122 M T+ + + + + +DG Y S + V+ K T Sbjct: 9 MKTVRKRGKKPTKDEKATFASDGNVQYTSAILENFDVEAINYGQLVKEREGGRVVGKTRT 68 Query: 123 -------------QRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKH 165 IER+NL LR ++RL RKSL FSK + D + Y + Sbjct: 69 IIFGEVDDVDIDTVYIERYNLTLRHGISRLVRKSLCFSKCKGMLDNHLDVYQCYNN 124 >UniRef50_Q7MZ59 Transposase, IS1 family n=1 Tax=Photorhabdus luminescens subsp. 
laumondii RepID=Q7MZ59_PHOLL Length = 115 Score = 59.0 bits (141), Expect = 6e-08, Method: Composition-based stats. Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 2/61 (3%) Query: 79 TMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLAR 138 T +L+ L+ F+VV W TD + Y + L H K +TQ IER NL +R + R Sbjct: 55 NEKTFRKLLKKLASFNVVFWCTDNFKTY-NLLPKSQHRAGKIFTQHIERENL-MRTRIKR 112 Query: 139 L 139 L Sbjct: 113 L 113 >UniRef50_C4GXL2 Insertion sequence protein n=5 Tax=Yersinia pestis RepID=C4GXL2_YERPN Length = 111 Score = 58.2 bits (139), Expect = 8e-08, Method: Composition-based stats. Identities = 16/44 (36%), Positives = 25/44 (56%) Query: 46 QWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSL 89 W +VG K +QRWL+YA++ K ++AH+FG R+ Sbjct: 1 MWSFVGNKKQQRWLWYAWEPRLKRIIAHIFGRRSKRHFANYWGC 44 >UniRef50_Q32DI9 Iso-IS1 ORF2 n=2 Tax=Shigella RepID=Q32DI9_SHIDS Length = 94 Score = 58.2 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 14/52 (26%), Positives = 25/52 (48%) Query: 63 YDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKL 114 + +A+ FG RT T L++LL+PF++ + +D W Y + Sbjct: 32 ITPKQGGGLAYTFGPRTDETCRELLALLTPFNIGMITSDDWGSYGREVPKDK 83 >UniRef50_Q0W4E9 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4E9_UNCMA Length = 160 Score = 58.2 bits (139), Expect = 1e-07, Method: Composition-based stats. 
Identities = 23/127 (18%), Positives = 39/127 (30%), Gaps = 31/127 (24%) Query: 71 VAHVFGERTMATLGRLMSLLSPF---DVVIWMTDGWPLYESRLKGKLHVIS--------- 118 + G T T ++S +S V +DG Y L + Sbjct: 1 MGFSVGRWTQGTCRVMLSQVSNSVQDGVFTVYSDGNDDYYYTLTDFFQEVRYGQLVKIRE 60 Query: 119 -------------------KRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGH 159 + T +E N LR + RL RK+ +FSK E+ + Sbjct: 61 KGRVVGKEIRVLIGDVDSEQVETFNVENFNSILRGRVGRLVRKTKTFSKIPEMLYYSVAL 120 Query: 160 YLNIKHY 166 + ++ Sbjct: 121 FQFYWNF 127 >UniRef50_Q6MCX8 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MCX8_PARUW Length = 72 Score = 57.1 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 3/65 (4%) Query: 106 YESRLKGKLHV-ISKRY--TQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLN 162 Y + H + K+ T IER N L +R K+LSFSK + H +I ++ Sbjct: 2 YFESIPFGQHRPVGKQSDKTSYIERLNCTLGYRCSRFVGKTLSFSKKLINHIGMITSFIC 61 Query: 163 IKHYQ 167 + Sbjct: 62 DYNLH 66 >UniRef50_Q8PRQ0 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PRQ0_METMA Length = 129 Score = 56.3 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 18/49 (36%), Positives = 26/49 (53%) Query: 118 SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHY 166 S T ER NL +R LAR RK ++FSK+ +H K I + ++ Sbjct: 34 SYIGTSYAERINLTIRTSLARFIRKGMNFSKTKRMHQKAIDLFQAWYNF 82 >UniRef50_Q7NJH9 Gsl1853 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJH9_GLOVI Length = 71 Score = 54.8 bits (130), Expect = 1e-06, Method: Composition-based stats. 
Identities = 22/56 (39%), Positives = 27/56 (48%), Gaps = 3/56 (5%) Query: 103 WPLYESRLKGKLHVI-SK--RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDK 155 Y L K H K T IER N +RQ + RL RK+LSFSK + H+ Sbjct: 2 LKNYGQVLASKRHRAAGKATGTTSCIERFNNTVRQRVGRLVRKALSFSKCLSNHNA 57 >UniRef50_Q218S2 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisB18 RepID=Q218S2_RHOPB Length = 191 Score = 54.0 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 2/48 (4%) Query: 120 RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 T +ER NL+LR R R + FSK ++ H + Y+ HY Sbjct: 54 ISTSYVERQNLSLRMGSRRFTRLTNGFSKKLDNHVAAVALYVA--HYN 99 >UniRef50_B9K3D6 Transposase n=32 Tax=Bacteria RepID=B9K3D6_AGRVS Length = 243 Score = 53.6 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 28/133 (21%), Positives = 46/133 (34%), Gaps = 13/133 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVIWM 99 +DE +G K WL+ A D + V R RLM L + + Sbjct: 87 LDEVVISIGGKKH--WLWRAVDQDGFVLDVLVQSRRNAKAAKRLMRKLLKGQGRSPRVMI 144 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLS---FSKSVEL 152 TD Y + H K R E + +R+ R+ ++ S + V + Sbjct: 145 TDKLRSYGAAKREIMPAVEHRSHKGLNNRAENSHQPIRRR-ERIMKRFKSARHLQRFVSI 203 Query: 153 HDKVIGHYLNIKH 165 HD + + +H Sbjct: 204 HDPIANLFQIPRH 216 >UniRef50_C7DAC3 Transposase n=36 Tax=Rhodobacterales RepID=C7DAC3_9RHOB Length = 237 Score = 53.6 bits (127), Expect = 2e-06, Method: Composition-based stats. 
Identities = 25/126 (19%), Positives = 49/126 (38%), Gaps = 10/126 (7%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMT 100 +DE +V ++ +L+ A D + + A V R A + + L +T Sbjct: 85 VDEV--FVKVNGKRHYLWRAVDHEGEVLEAVVTKRRNKAAALKFLKKLMKRHGKAEEVVT 142 Query: 101 DGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVELHD 154 D + Y++ L+ + + R+E +L R+ + R+ S K +H Sbjct: 143 DRFAPYKAALRDLGALEKQSTGRWLNNRVENSHLPFRRRERAMQRFRRMRSLQKFAAVHS 202 Query: 155 KVIGHY 160 V H+ Sbjct: 203 SVYNHF 208 >UniRef50_Q649W7 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos34A6 RepID=Q649W7_9ARCH Length = 217 Score = 53.2 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 54/170 (31%), Gaps = 67/170 (39%) Query: 61 YAYDSLRKTVVAHVFGERTMATLGRLMSLLSP-----FDVVIWMTDGWPLYESRL----- 110 A ++ K V ++ G R L++ + ++ I+ TD W Y++ L Sbjct: 2 VAQEAKTKLVTSYHVGRRAFEDAVELLAEMESRRDKSTELPIFTTDDWDAYKNALVEVYG 61 Query: 111 -----------------------KGKLHVISKRYTQ------------------------ 123 VI R Sbjct: 62 VEEQPEYKGRGRPPNSKKVPPPDLKYGQVIKYREGNEVTDVKKRVVFGNEEEVLSALKLA 121 Query: 124 -------RIERHNLNLRQHLARLGRKSLSFSKSVE---LHDKVIGHYLNI 163 IER+NL +R ++RL RK+++FSK + +H + + N+ Sbjct: 122 GNSINTSYIERNNLTVRNGVSRLIRKTINFSKRLNPLVMHLCLFFAWFNL 171 >UniRef50_Q0RZ53 Transposase n=23 Tax=Bacteria RepID=Q0RZ53_RHOSR Length = 317 Score = 53.2 bits (126), Expect = 4e-06, Method: Composition-based stats. 
Identities = 25/156 (16%), Positives = 42/156 (26%), Gaps = 24/156 (15%) Query: 13 QHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVA 72 Q L++ RP+ +DE + Q +L+ A D + Sbjct: 60 QDFANQLRRRRPRPGDK-----------WHLDEV--VIRMNGTQHYLWRAVDQDGNVLDV 106 Query: 73 HVFGERTMATLGRLMSLLSPFDVV---IWMTDGWPLY----ESRLKGKLHVISKRYTQRI 125 V R + L + +TD Y + H S+ R Sbjct: 107 LVQSRRNAVAAKKFFRKLLKRQCAVPRVLVTDKLGSYQVAHREVMPSVEHRRSRYLNNRA 166 Query: 126 ERHN----LNLRQHLARLGRKSLSFSKSVELHDKVI 157 E + RL R + V H + + Sbjct: 167 ENSHQPAATRAGDETVRLARSGAAVPLGVRRHRRTL 202 >UniRef50_UPI00018554DD transposase n=1 Tax=Francisella novicida FTG RepID=UPI00018554DD Length = 97 Score = 51.3 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 12/36 (33%), Positives = 19/36 (52%) Query: 35 SDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTV 70 D I E DE W ++G+K ++ W+ AYD + Sbjct: 40 EDNISEIEFDEMWHFIGSKKKKCWIIKAYDRRVGKL 75 >UniRef50_Q11MN9 Transposase n=37 Tax=Bacteria RepID=Q11MN9_MESSB Length = 237 Score = 51.3 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 35/95 (36%), Gaps = 9/95 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVIWM 99 +DE V K ++ WL+ A D+ + A + R A RLM L + + Sbjct: 81 LDEM--VVTIKGKKYWLWRAVDTNGYVLDALLQSRRNKAAAMRLMRKLLKDQGTAPRVMV 138 Query: 100 TDGWPLY----ESRLKGKLHVISKRYTQRIERHNL 130 TD Y + G H K R E +L Sbjct: 139 TDKLRSYSAAKSQLMPGVEHRSHKGLNNRAENSHL 173 >UniRef50_B9K4Q6 Transposase n=2 Tax=Alphaproteobacteria RepID=B9K4Q6_AGRVS Length = 232 Score = 50.9 bits (120), Expect = 1e-05, Method: Composition-based stats. 
Identities = 25/128 (19%), Positives = 40/128 (31%), Gaps = 15/128 (11%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 W + L + +DE V K R+ WL+ A D+ Sbjct: 58 EWAAKFGSEFAALLRRRSKGG------FADKWHLDEM--VVTFKGRKYWLWRAVDAEGYM 109 Query: 70 VVAHVFGERTMATLGRLMSLL---SPFDVVIWMTDGWPLYESR----LKGKLHVISKRYT 122 + A + R +LM L + +TD Y++ + G H K Sbjct: 110 LEALLQSRRNKKAALKLMRKLLKGQGLTPRVMVTDKLRSYDAAKRDIMPGVEHRSHKGLN 169 Query: 123 QRIERHNL 130 R E +L Sbjct: 170 NRAENSHL 177 >UniRef50_A8LAQ0 Integrase catalytic region n=1 Tax=Frankia sp. EAN1pec RepID=A8LAQ0_FRASN Length = 175 Score = 50.9 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 29/125 (23%), Positives = 40/125 (32%), Gaps = 10/125 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLS--PFDVVIWMT 100 +DE YV R +L+ A D + + R A R V T Sbjct: 25 IDE--TYVKVAGRWTYLYRAVDQHSQVIDVLASTRRDQAAARRFFVRALTHGRRPVKVTT 82 Query: 101 DGWPLYES----RLKGKLHVISKRYTQRIERHNLNLRQHLA--RLGRKSLSFSKSVELHD 154 D P+Y L HV + R RIE + L+ L R ++ S H Sbjct: 83 DKAPVYPRILDELLPEACHVDAARENNRIEADHGRLKARLRPMRGLKRLRSVQTVSAGHA 142 Query: 155 KVIGH 159 V Sbjct: 143 LVQNI 147 >UniRef50_A9EF44 Transposase n=2 Tax=Rhodobacteraceae RepID=A9EF44_9RHOB Length = 156 Score = 50.9 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 43/133 (32%), Gaps = 11/133 (8%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL--SPFDVVIWMT 100 MDE + K WL+ A D+ + V R + R + L + + +T Sbjct: 1 MDEVVITIRGKKH--WLWRAIDADGDVLDILVQTRRNAKSAKRFLQRLVSQFGEPRVVIT 58 Query: 101 DGWPLY----ESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKV 156 D Y ++ H K IE + R+ ++ K S ++ Sbjct: 59 DKLRSYLKPVKTLTPNADHRAHKGLNNAIEVSHRPTRKR-EKIFGKFKSHRQAHRFLAAH 117 Query: 157 --IGHYLNIKHYQ 167 I + YQ Sbjct: 118 DQINLLFRPRRYQ 130 >UniRef50_Q8TRX5 Predicted protein n=3 Tax=Methanosarcina acetivorans RepID=Q8TRX5_METAC Length = 221 Score = 49.4 bits (116), Expect = 4e-05, Method: Composition-based stats. 
Identities = 29/165 (17%), Positives = 53/165 (32%), Gaps = 57/165 (34%) Query: 58 WLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF---DVVIWMTDGWPLYESRL---- 110 W++ A+ + ++ V G R + + L++ + +++TDG Y L Sbjct: 10 WMWVAFVPGCRLILDFVIGPRKQYVADKFIELVNKHISDKIPVFVTDGLNFYREALLKQF 69 Query: 111 --------------KGKLHVI----------------------------------SKRYT 122 K ++ S+ T Sbjct: 70 GVLREFPRTGKRGRPKKPKIVPSEDLRYAQVVKTRVNGVLEKVEKKIIFGENIEQSEIST 129 Query: 123 QRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 +ER NL RQ R+ RK++ FSK E + + Y H+ Sbjct: 130 TLLERQNLTFRQDNNRVSRKTIGFSKMKEWLEIQMKLYCT--HFN 172 >UniRef50_A7C135 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C135_9GAMM Length = 372 Score = 49.4 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 29/78 (37%), Gaps = 2/78 (2%) Query: 88 SLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFS 147 L I D L + V S+ T IER NL RQ RL R+S FS Sbjct: 219 GRLIEVKNKIIFGDENELASKL--AESPVRSEINTSFIERDNLTQRQSNRRLTRRSNGFS 276 Query: 148 KSVELHDKVIGHYLNIKH 165 K + D + L H Sbjct: 277 KELSWFDSPLWLSLAYYH 294 Score = 41.7 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 12/89 (13%), Positives = 30/89 (33%), Gaps = 16/89 (17%) Query: 41 AEMDEQWGYVGAKSRQR-------------WLFYAYDSLRKTVVAHVFGERTMATLGRLM 87 ++DE W ++ W++ A+ + + V+A V G L+ Sbjct: 91 LQLDELWSFILTLEHNCTEAKLYHESYGDAWVWLAFAPVWRVVLAFVIGSLPQKNANLLL 150 Query: 88 SLLSPFD---VVIWMTDGWPLYESRLKGK 113 ++ + + +D + + L Sbjct: 151 DRVAHVTDAHIPFFTSDQFSSSRTALLHT 179 >UniRef50_Q10ZQ2 Putative uncharacterized protein n=7 Tax=Cyanobacteria RepID=Q10ZQ2_TRIEI Length = 44 Score = 48.6 bits (114), Expect = 8e-05, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 2/40 (5%) Query: 128 HNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 N LRQ ++RL RK+LSFSK + H I ++ I HY Sbjct: 1 MNNTLRQRISRLVRKTLSFSKKLRSHLGDIWYF--INHYN 38 >UniRef50_Q327E8 IS1 ORF, n=6 Tax=Enterobacteriaceae RepID=Q327E8_SHIDS Length = 94 Score = 48.2 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 4/49 (8%) Query: 112 GKLHVISKR----YTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKV 156 V K + +ER+NL LR + RL RK++ FS+SVE+H+K Sbjct: 18 KDKQVTRKGIFIQHMLYLERNNLPLRTRIKRLARKTICFSRSVEIHEKS 66 >UniRef50_Q0RWC6 Transposase n=24 Tax=Bacteria RepID=Q0RWC6_RHOSR Length = 236 Score = 47.4 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 39/136 (28%), Gaps = 20/136 (14%) Query: 13 QHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVA 72 Q L++ R + +DE ++ + +L+ A D + Sbjct: 60 QAYANQLRRRRARPGDK-----------WHLDEV--FIRINGKLHYLWRAVDQGGNVLDV 106 Query: 73 HVFGERTMATLGRLMSLLSP---FDVVIWMTDGWPLY----ESRLKGKLHVISKRYTQRI 125 V R + L + + +TD Y L H SK R Sbjct: 107 LVQSRRNAKAAKKFFRKLLKGLRYVPRVIITDKLASYQVVHREMLASVEHRRSKYLNNRA 166 Query: 126 ERHNLNLRQHLARLGR 141 E + RQ + R Sbjct: 167 ENSHQPTRQRERAMKR 182 >UniRef50_B9K5F7 Transposase n=3 Tax=Bacteria RepID=B9K5F7_AGRVS Length = 196 Score = 47.1 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 24/127 (18%), Positives = 40/127 (31%), Gaps = 16/127 (12%) Query: 45 EQWGY----VGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVI 97 E+W V + ++ WL+ A D + V R LM L + Sbjct: 39 EKWHLDEAVVSIRGKKHWLWRAVDQDGFVLDVLVQSRRNAKAARHLMRQLLKGQGRAPRV 98 Query: 98 WMTDGWPLYE----SRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELH 153 +TD Y G H K + R E + +R+ + R KS Sbjct: 99 MITDKLRSYGAAKWELTPGVEHRSHKGLSNRAENFHQPVRRRERIMKR-----FKSQRHL 153 Query: 154 DKVIGHY 160 + + + Sbjct: 154 QRFVSIH 160 >UniRef50_A3XA77 Putative transposase n=1 Tax=Roseobacter sp. 
MED193 RepID=A3XA77_9RHOB Length = 154 Score = 47.1 bits (110), Expect = 3e-04, Method: Composition-based stats. Identities = 25/119 (21%), Positives = 43/119 (36%), Gaps = 10/119 (8%) Query: 56 QRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWPLYESRL--- 110 + WL+ A DS + + V R R +S L + +TD Y + L Sbjct: 36 KHWLWRAMDSEGQVLDILVQSRRNARAAKRFISRLVARWGVPRVIITDRLRSYGAALRKL 95 Query: 111 -KGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVEL---HDKVIGHYLNIKH 165 G H K RIE + R+ ++ + S ++ HD+ + +H Sbjct: 96 ALGVDHRAHKGLNIRIEGTHRPTRKR-EKIQGRFKSARQAQRFLVVHDEAANLFRPCRH 153 >UniRef50_Q2G895 Transposase n=36 Tax=Alphaproteobacteria RepID=Q2G895_NOVAD Length = 238 Score = 46.7 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 38/105 (36%), Gaps = 8/105 (7%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMT 100 +DE +V + +L+ A D + + ++V R A + +T Sbjct: 87 LDEV--FVKINGERHYLWRAVDHEGEVLESYVTKTRDKAAALTFLKKALKRHGRAEAIVT 144 Query: 101 DGWPLYESRLKG----KLHVISKRYTQRIERHNLNLRQHLARLGR 141 DG Y + ++ + + R+E +L R+ + R Sbjct: 145 DGLRSYPAAMRQLGNLDRRKMGRWLNNRVENSHLPFRRRERAMLR 189 >UniRef50_B2TXL7 IS1 ORF2 n=1 Tax=Shigella boydii CDC 3083-94 RepID=B2TXL7_SHIB3 Length = 44 Score = 46.7 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 33/34 (97%), Positives = 33/34 (97%) Query: 134 QHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 167 HLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ Sbjct: 11 THLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ 44 >UniRef50_A3W3Q5 Transposase n=1 Tax=Roseovarius sp. 217 RepID=A3W3Q5_9RHOB Length = 180 Score = 46.7 bits (109), Expect = 3e-04, Method: Composition-based stats. 
Identities = 26/133 (19%), Positives = 39/133 (29%), Gaps = 17/133 (12%) Query: 10 RWPQHDFTSL-KKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRK 68 RW + + LR + D +V V R+ WL+ A D Sbjct: 22 RWTAKFGPQIARNLRRRQARPGDVWHLDEVV----------VKISGRKFWLWRAVDQHGV 71 Query: 69 TVVAHVFGERTMATLGRLMSLL--SPFDVVIWMTDGWPLY----ESRLKGKLHVISKRYT 122 + V +R R++ L +TD Y G H K Sbjct: 72 VLEEIVQSKRDKRAAKRVLRRLIKCYGLPKRIVTDKLRAYGAAKREVAPGLDHWSHKDLN 131 Query: 123 QRIERHNLNLRQH 135 R E +L R+ Sbjct: 132 NRAENSHLPFRKR 144 >UniRef50_B4WU12 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WU12_9SYNE Length = 228 Score = 46.7 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 50/168 (29%), Gaps = 31/168 (18%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW Q L K + + +DE YV K ++L+ A DS T Sbjct: 44 RWVQAYSPELDKRCRRYLK-------PTNDSWRVDE--TYVKVKGVWKYLYRAVDSAGNT 94 Query: 70 VVAHVFGERTMATLGRLMSLLSP----FDVVIWMTDGWPLYESRLKG----------KLH 115 + + +R R + + + D Y + Sbjct: 95 LDFMLSAKRDAKAAKRFLRKVLNASHTIEPRAITVDKNAAYPPAINELKADEVLPEATKT 154 Query: 116 VISKRYTQRIERHNLNLRQHLA--------RLGRKSLSFSKSVELHDK 155 S +E+ + +++ + R++L +++ + K Sbjct: 155 RQSNYLNNTVEQDHRFIKRRVNPGLGFGSFNTARRTLKGYEAMNMIRK 202 >UniRef50_A0YAP3 Transposase n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YAP3_9GAMM Length = 133 Score = 46.3 bits (108), Expect = 4e-04, Method: Composition-based stats. 
Identities = 19/98 (19%), Positives = 31/98 (31%), Gaps = 11/98 (11%) Query: 78 RTMATLGRLMSLL---SPFDVVIWMTDGWPLYES----RLKGKLHVISKRYTQRIERHNL 130 R A R L S ++ +TD Y + +H + R E+ + Sbjct: 2 RDGAAAKRFSKRLVRSSGTELRKIVTDTLQSYGVAHRGFIPDTIHSNQQYENNRAEQSHK 61 Query: 131 NLRQHLARLGRKSLSFSKSVEL---HDKVIGHYLNIKH 165 R R RK S ++ H V + +H Sbjct: 62 ATRVR-ERGMRKFKSAKQAQRFLGAHAAVSNLFNLGRH 98 >UniRef50_Q6MD18 Putative uncharacterized protein n=2 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MD18_PARUW Length = 89 Score = 45.9 bits (107), Expect = 4e-04, Method: Composition-based stats. Identities = 14/32 (43%), Positives = 21/32 (65%) Query: 130 LNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 L LR AR RK+LSFSK + H ++I +++ Sbjct: 46 LLLRHRYARFVRKTLSFSKKLTNHIELIKYFI 77 >UniRef50_B9K4C6 Transposase n=4 Tax=Proteobacteria RepID=B9K4C6_AGRVS Length = 346 Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 61/172 (35%), Gaps = 31/172 (18%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW ++K Q +P + E YV + + R+++ A D Sbjct: 159 RWVLAYAPMIEKRLRQF----RRPHCGSVRVNE-----TYVKIRGKWRYVYRAIDKHGNP 209 Query: 70 VVAHVFGERTMATLGRLMSLLSPFDVV----IWMTDGWPLYESRL----------KGKLH 115 V + +R + R + + + TDG + S + +H Sbjct: 210 VDFLLTAKRDLDAAKRFFRKMLKDEPLLSPNKIGTDGANTFPSAIKTLVDSGLLHPDPVH 269 Query: 116 VISKRYTQRIERHNLNLRQHLARL--------GRKSLSFSKSVELHDKVIGH 159 +K Q IE + L++++ ++ R++++ +++ K G+ Sbjct: 270 YATKHLQQGIESDHFRLKKNMPKIGGVQSFNTARRTIAGFQAMLWLRKGFGY 321 >UniRef50_Q2G8C0 Transposase n=5 Tax=Alphaproteobacteria RepID=Q2G8C0_NOVAD Length = 166 Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats. 
Identities = 19/114 (16%), Positives = 37/114 (32%), Gaps = 8/114 (7%) Query: 60 FYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWPLYESRL----KGK 113 + A D + + + V +R M +TDG Y + + Sbjct: 30 WRAVDHEGEVLESFVTRKRDKTAALTFMKKALKRHGKAEAIVTDGLRSYPAAMRELGNEG 89 Query: 114 LHVISKRYTQRIERHNLNLRQHLARL--GRKSLSFSKSVELHDKVIGHYLNIKH 165 + + R E +L R+ + R+ S K +H + H+ +H Sbjct: 90 RREVGRHLNNRAENSHLPFRRRERAMLRFRQMKSLQKFASVHASIHNHFSQERH 143 >UniRef50_Q64DF0 Putative uncharacterized protein n=6 Tax=cellular organisms RepID=Q64DF0_9ARCH Length = 337 Score = 45.5 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 62/190 (32%) Query: 39 VCAEMDEQWGYVGAK----SRQRWLFYAYDSLRKTVVAHVFGERTMATLGR----LMSLL 90 + E DE + V W D + + G++ + L L+ Sbjct: 115 LVIEGDEFYTKVDKNVPAEQSSGWTIVLMDRASRFIWELSCGKKDRSLFENAIETLAELV 174 Query: 91 SPFDVVIWMTDGWPLYESRL---------------------------------------- 110 + +TDG Y L Sbjct: 175 VQTKDITLLTDGERRYGKILFEICHELLLTGKPGRPKKTLKKGVTVRVKNKGSQTHKKGR 234 Query: 111 ------------KGKLHVISKRYT--QRIERHNLNLRQHLARLGRKSLSFSKSVELHDKV 156 + IS + T +E +N +R+ + RK+ +++KS ++ Sbjct: 235 KKPKYQTTCPQHPETSNNISDKETHANHVEANNSAMRRKCSAYRRKTNTYAKSETGLQRI 294 Query: 157 IGHYLNIKHY 166 + Y I ++ Sbjct: 295 LNVYWVIHNF 304 >UniRef50_Q64DD5 Putative uncharacterized protein n=1 Tax=uncultured archaeon GZfos18F2 RepID=Q64DD5_9ARCH Length = 230 Score = 45.1 bits (105), Expect = 0.001, Method: Composition-based stats. 
Identities = 26/190 (13%), Positives = 52/190 (27%), Gaps = 62/190 (32%) Query: 39 VCAEMDEQWGYVGAK----SRQRWLFYAYDSLRKTVVAHVFGERTMATLGR----LMSLL 90 + E DE + V W D + + G++ + L L+ Sbjct: 8 LVIEGDEFYTKVDKNVPAEQSSGWTIVLMDRASRFIWELSCGKKDRSLFENAIETLAELV 67 Query: 91 SPFDVVIWMTDGWPLYESRL---------------------------------------- 110 + +TDG Y L Sbjct: 68 VQTKDITLLTDGERRYGKILFEICHELLLTGKPGRPKKTLKKGVTVRVKNKGSQTHKKGR 127 Query: 111 ------------KGKLHVISKRYT--QRIERHNLNLRQHLARLGRKSLSFSKSVELHDKV 156 + IS + T +E +N +R+ + RK+ +++KS ++ Sbjct: 128 KKPKYQTTCPQHPETSNNISDKETHANHVEANNSAMRRKCSAYRRKTNTYAKSETGLQRI 187 Query: 157 IGHYLNIKHY 166 + Y I ++ Sbjct: 188 LNVYWVIHNF 197 >UniRef50_Q469A1 Putative uncharacterized protein n=1 Tax=Methanosarcina barkeri str. Fusaro RepID=Q469A1_METBF Length = 180 Score = 45.1 bits (105), Expect = 0.001, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 37/115 (32%), Gaps = 17/115 (14%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQR--------WLFY 61 RW K+ + P EMDE W + + W++ Sbjct: 56 RWLTRAAEQYDKVNDNMMKDLNTPK------IEMDELWIIIKKIVSRMKDYEDDGPWMWV 109 Query: 62 AYDSLRKTVVAHVFGERTMATLGRLMSLLSPF---DVVIWMTDGWPLYESRLKGK 113 A+ + ++ V G R +L+ + + +++TDG Y L Sbjct: 110 AFVPGCQLILGFVIGPRKQYVTDKLVESVKKHLSDKIPLFVTDGLNFYREALLKH 164 >UniRef50_Q6MAQ6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MAQ6_PARUW Length = 83 Score = 44.4 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 21/52 (40%), Gaps = 3/52 (5%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDV---IVCAEMDEQWGYVGAKSRQRWL 59 W + P+ + +++ + E+DE+W +V K +WL Sbjct: 16 WLLDFINFIINDLPEDLNAQVTCHEKNELEVAKLEVDERWSFVRNKENDQWL 67 >UniRef50_B8IVA0 Transposase and inactivated derivatives-like protein n=38 Tax=Bacteria RepID=B8IVA0_METNO Length = 346 Score = 43.6 bits (101), Expect = 0.002, Method: Composition-based stats. 
Identities = 23/166 (13%), Positives = 56/166 (33%), Gaps = 25/166 (15%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW ++++ ++ +DE Y+ + + R+L+ A D + Sbjct: 63 RWVLAYAPAIERRLR-----MLRKPHCG--SVRVDE--TYICIRGQWRYLYRAIDKHGEP 113 Query: 70 VVAHVFGERTMATLGRLMSLLSPFD----VVIWMTDGWPLYESRLKGKL----------H 115 V + R + R + + TDG Y + H Sbjct: 114 VDFLLTAHRDLDAAKRFFRKMLKEEPLLAPDRIGTDGAGPYPPAIAESHEEGLLPRAPTH 173 Query: 116 VISKRYTQRIERHNLNLRQHLAR--LGRKSLSFSKSVELHDKVIGH 159 ++K Q IE + +++ + R R + ++++ + ++ Sbjct: 174 HVTKHLQQGIESDHFRVKRPMPRVGGFRSFTTGRRTIQGFEAMLWL 219 >UniRef50_B4WST7 Putative uncharacterized protein n=3 Tax=Synechococcus sp. PCC 7335 RepID=B4WST7_9SYNE Length = 186 Score = 43.2 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 37/113 (32%), Gaps = 8/113 (7%) Query: 50 VGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLL---SPFDVVIWMTDGWPLY 106 + K Q +L+ A D + + R A R L + F + +TD Sbjct: 34 IKIKGEQFYLWGAVDQHGMVLDILMQRRRNTAAAYRFFRKLLKSTGFAPRVIITDKLKSC 93 Query: 107 ESR----LKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDK 155 + LKG H K R E + R R+GR S S + Sbjct: 94 GAAKKDILKGVEHRQHKGLNNRAENSHRPTRIRERRMGR-FKSASHAQRFLSA 145 >UniRef50_C7S9U1 IS6100 transposase n=358 Tax=root RepID=C7S9U1_ECOLX Length = 266 Score = 43.2 bits (100), Expect = 0.003, Method: Composition-based stats. 
Identities = 25/139 (17%), Positives = 44/139 (31%), Gaps = 23/139 (16%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW Q ++K P +DE YV + + +L+ A D T Sbjct: 61 RWVQCYAPEMEKRLRWFWRRGFDPSWR------LDE--TYVKVRGKWTYLYRAVDKRGDT 112 Query: 70 VVAHVFGERTMATLGRLMSL----LSPFDV-VIWMTDGWPLYESRL----------KGKL 114 + ++ R+ R + L ++ TD P Y + + + Sbjct: 113 IDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETA 172 Query: 115 HVISKRYTQRIERHNLNLR 133 H K IE + L+ Sbjct: 173 HRQVKYLNNVIEADHGKLK 191 >UniRef50_Q8PWV9 Putative uncharacterized protein n=1 Tax=Methanosarcina mazei RepID=Q8PWV9_METMA Length = 150 Score = 42.8 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 16/49 (32%), Positives = 24/49 (48%) Query: 118 SKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHY 166 S T +ER NL RQ R+ RK++ FSK ++ I Y ++ Sbjct: 54 SDISTSLLERQNLTFRQDNNRISRKTIGFSKKIKCLYNQIRLYSTYFNF 102 >UniRef50_Q6MBH4 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MBH4_PARUW Length = 82 Score = 42.8 bits (99), Expect = 0.005, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 24/67 (35%), Gaps = 4/67 (5%) Query: 77 ERTMATLGRLMSLLSPF-DVVIWMTDGWPLYESRLKGKLHV-ISK--RYTQRIERHNLNL 132 R T L + ++ TD + Y + H +SK T IE+ N L Sbjct: 8 PRDKKTAELLFAKRPESLKKALYFTDKFNAYYETILWSKHQAVSKLSGQTSYIEKFNFTL 67 Query: 133 RQHLARL 139 + + + Sbjct: 68 KTKVCKF 74 >UniRef50_A9VUP9 Integrase catalytic region n=149 Tax=Bacteria RepID=A9VUP9_BACWK Length = 235 Score = 42.4 bits (98), Expect = 0.005, Method: Composition-based stats. 
Identities = 27/166 (16%), Positives = 51/166 (30%), Gaps = 24/166 (14%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW L++ + S +DE Y+ K + +L+ A DS T Sbjct: 52 RWVHQYGPQLEEKVRHHLKST-------NDSWRVDE--TYIKVKGQWMYLYRAVDSKGNT 102 Query: 70 VVAHVFGERTMATLGRLMSLLSPF----DVVIWMTDGWPLYE---SRLKGKLH------- 115 + H+ R F + D P Y LK + H Sbjct: 103 IDFHLSKSRDKQAAKCFFKKALAFSYVSKPRVITVDKNPAYPVAIQALKEEKHMPEGIKL 162 Query: 116 VISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYL 161 + +E+ + +++ + R SF + + V ++ Sbjct: 163 RQVRYLNNIVEQDHRFIKKRV-RSMLGFKSFGTATSILAGVEAMHM 207 >UniRef50_A9HNK8 Transposase, putative n=1 Tax=Roseobacter litoralis Och 149 RepID=A9HNK8_9RHOB Length = 175 Score = 42.4 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 42/139 (30%), Gaps = 12/139 (8%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW Q L K + +DE Y+ A + R+L+ A D+ + Sbjct: 28 RWVQKFGPELAKRAEKHHKRSSLDWH-------VDE--TYIRAGGKWRYLWRAIDANDQL 78 Query: 70 VVAHVFGERTMATLG-RLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERH 128 V + R R + + V TD P Y + + ER Sbjct: 79 VEFRLTARRDAKAFLNRAIERVRLHRPVSICTDKAPTYRKAICAGDVSGDRDEKDTQERP 138 Query: 129 NLNLRQHLARLGRKSLSFS 147 + Q R R + S Sbjct: 139 HSQ--QAAPRQRRDCIHAS 155 >UniRef50_A0LBE3 Putative uncharacterized protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0LBE3_MAGSM Length = 116 Score = 42.4 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 16/47 (34%), Positives = 25/47 (53%) Query: 120 RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHY 166 T +ER+N R A GRK+L FSK ++H+ V+ I ++ Sbjct: 23 IKTAFVERNNATDRHQNAHKGRKTLCFSKGWDVHNAVMVFVAYIYNF 69 >UniRef50_A0RXS8 Transposase n=1 Tax=Cenarchaeum symbiosum RepID=A0RXS8_CENSY Length = 436 Score = 42.0 bits (97), Expect = 0.007, Method: Composition-based stats. 
Identities = 20/99 (20%), Positives = 37/99 (37%), Gaps = 13/99 (13%) Query: 52 AKSRQRWLFYAYDSLRKTVVAHVF--GERTMATLGRLMSLLSP--FDVVIWMTDGWPLYE 107 K WL+ A D + ++ G RT+ ++ + +TD Y Sbjct: 197 NKGHGNWLWSAIDPRTRYLLCTRIAEGSRTLPDAESVIREARKMSEEPDYMITDSLRSYA 256 Query: 108 SR----LKGKLHVISK----RYTQ-RIERHNLNLRQHLA 137 + L H+ +K +T IER++ +R+ L Sbjct: 257 TAAAKCLPRTAHIKTKAIRDGFTNMAIERYHNEIREKLK 295 >UniRef50_A5N1B9 Transposase n=2 Tax=Clostridium kluyveri RepID=A5N1B9_CLOK5 Length = 127 Score = 41.7 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 24/108 (22%), Positives = 39/108 (36%), Gaps = 8/108 (7%) Query: 33 PGSDVIVCAEMDEQW---GYVGAKSRQRWLFYAYDSLRKTVVAHVFGE-RTMATLGRLM- 87 +I DE YV K +L+ DS + +++ V R +L Sbjct: 10 KILCLIPILSSDEWHADETYVKIKGIDYYLWLILDSKTRVIISFVLSRFRNSTQAYKLFF 69 Query: 88 --SLLSPFDVVIWMTDGWPLYESRLKGKL-HVISKRYTQRIERHNLNL 132 S+L+ +TD W Y +K H + +Y+ E N N Sbjct: 70 YSSILTRTSPKKIVTDKWDAYNEAIKNLHCHTLHHKYSAFSEDLNNNF 117 >UniRef50_UPI00016C51C4 hypothetical protein GobsU_02291 n=6 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C51C4 Length = 298 Score = 41.7 bits (96), Expect = 0.009, Method: Composition-based stats. Identities = 13/47 (27%), Positives = 18/47 (38%) Query: 120 RYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHY 166 T +ERHN R +R RK FSK + H ++ Sbjct: 111 VNTCFVERHNGTDRNRCSRKVRKGYGFSKDWDTHRAATAFRYFSDNF 157 >UniRef50_A9EF82 Transposase, putative n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EF82_9RHOB Length = 158 Score = 41.7 bits (96), Expect = 0.011, Method: Composition-based stats. 
Identities = 29/138 (21%), Positives = 45/138 (32%), Gaps = 21/138 (15%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFD----VVIW 98 MDE YV R +L+ A D + + + R M S + Sbjct: 1 MDE--TYVRVNGRWCYLWRAVDQRGQLIDFRLTARRDANAARAFMRQASETARCYYPMTI 58 Query: 99 MTDGWPLYESRL--------KGK--LHVISKRYTQRIERHNLNLRQ--HLARLGRK---S 143 +TD Y + + HV K RIE + L+Q R RK + Sbjct: 59 VTDKAHSYAKVIEEMNLGNGPDERIRHVDRKYLNNRIEADHAALKQLLRPKRSFRKLTAA 118 Query: 144 LSFSKSVELHDKVIGHYL 161 + K +E H + + Sbjct: 119 KNTLKGIETHRAIKKGHF 136 >UniRef50_A9VUQ5 Integrase catalytic region n=24 Tax=Bacteria RepID=A9VUQ5_BACWK Length = 235 Score = 41.3 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 40/142 (28%), Gaps = 23/142 (16%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW L K + +DE Y+ K + +L+ A DS T Sbjct: 52 RWVHQYGPELDKRIRSHLKQT-------NDSWRVDE--TYIKVKGQWMYLYRAVDSKGNT 102 Query: 70 VVAHVFGERTMATLGRLMSLLSP----FDVVIWMTDGWPLYE---------SRLKGKLHV 116 + ++ R + D P Y + G + + Sbjct: 103 IDFYLSKTRDQKAAKHFFKKALQSFHVSKPPVITVDKNPAYPIAIEQLKKEKSIPGGMRL 162 Query: 117 -ISKRYTQRIERHNLNLRQHLA 137 K +E+ + +++ + Sbjct: 163 RQQKYLNNIVEQDHRFIKKRIR 184 >UniRef50_C6IUV9 Transposase n=4 Tax=Bacteroides RepID=C6IUV9_9BACE Length = 571 Score = 41.3 bits (95), Expect = 0.013, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 37/112 (33%), Gaps = 14/112 (12%) Query: 11 WPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQW--GYVGAKSRQRWLFYAYDSLRK 68 W L KL P +Q G++V +DE W + K R+ +++ + + Sbjct: 273 WADKGAMQLNKLIPALKKIALQDGANVN----VDETWLRYHAYNKKRKTYMWCLVNRKAR 328 Query: 69 TVVAHVFGERTMATLGR--------LMSLLSPFDVVIWMTDGWPLYESRLKG 112 V+ + + L L + +DG+ +Y Sbjct: 329 IVIFFYEDTTDDEGVQKHGGRNRNVLKEFLGDAKIKSLQSDGYNVYMYLDNE 380 >UniRef50_C6GYT4 IS1216, transposase (Fragment) n=121 Tax=root RepID=C6GYT4_STRS4 Length = 234 Score = 40.5 bits (93), Expect = 0.021, Method: Composition-based stats. 
Identities = 29/162 (17%), Positives = 57/162 (35%), Gaps = 21/162 (12%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW Q L ++ + +MDE Y+ K + +L+ A D+ T Sbjct: 57 RWVQEYGKLLYQIWKKKNKKSFYS-------WKMDE--TYIKIKGKWHYLYRAIDADGLT 107 Query: 70 VVAHVFGERTMATLGRLMSLLSPF--DVVIWMTDGWPLYESRLK---------GKLHVIS 118 + + +R + L + + +TD P S K G H Sbjct: 108 LDIWLRKKRDTQAAYAFLKRLVKQFDEPKVVVTDKAPSITSAFKKLKEYGFYQGTEHRTI 167 Query: 119 KRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY 160 K IE+ + +++ + R + S +++ + + G Y Sbjct: 168 KYLNNLIEQDHRPVKRRN-KFYRSLRTASTTIKGMEAIRGLY 208 >UniRef50_A7C324 Putative uncharacterized protein n=3 Tax=Beggiatoa sp. PS RepID=A7C324_9GAMM Length = 137 Score = 40.1 bits (92), Expect = 0.027, Method: Composition-based stats. Identities = 17/50 (34%), Positives = 25/50 (50%) Query: 117 ISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHY 166 + T IER NL LRQH++ L RK+L + K ++ L +Y Sbjct: 36 TKTQNTSFIERFNLTLRQHVSYLTRKTLGYCKKKANFKYILWINLYNYNY 85 >UniRef50_C2JSP7 IS431mec transposase n=2 Tax=Enterococcus faecalis RepID=C2JSP7_ENTFA Length = 229 Score = 40.1 bits (92), Expect = 0.028, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 36/107 (33%), Gaps = 14/107 (13%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMT 100 +DE YV K + R+L+ A DS T+ + R + L +T Sbjct: 75 IDE--TYVKVKGQDRYLYRAIDSKGNTLDMWLRNHRDTVSTKAFFKRLIRVYGQPRSIVT 132 Query: 101 DGWPLYESRLK----------GKLHVISKRYTQRIERHNLNLRQHLA 137 D + +K H SK +E+ + L+ L Sbjct: 133 DKYAPSLKAIKELKEEGILYQKVKHWKSKYLNNILEQDHRQLKGKLP 179 >UniRef50_B4WVD1 Putative uncharacterized protein n=7 Tax=Synechococcus sp. PCC 7335 RepID=B4WVD1_9SYNE Length = 298 Score = 39.7 bits (91), Expect = 0.034, Method: Composition-based stats. 
Identities = 14/101 (13%), Positives = 30/101 (29%), Gaps = 10/101 (9%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF---DVVIWM 99 +DE ++ K +L+ D + + R M + Sbjct: 140 IDE--TFIRVKGVWCYLYRGIDEDGNLMDVRLSKTRDMVGTKAFFAQALGLHEDAPEKIA 197 Query: 100 TDGWPLYESRLKGK-----LHVISKRYTQRIERHNLNLRQH 135 TDG Y +K + H + +E+ + ++ Sbjct: 198 TDGLASYPRAIKEELGKNVEHEVRPCTANPVEQSHRRIKHR 238 >UniRef50_A3NK27 IS6 family transposase n=29 Tax=Burkholderia RepID=A3NK27_BURP6 Length = 242 Score = 39.4 bits (90), Expect = 0.042, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 39/133 (29%), Gaps = 18/133 (13%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPF--DVVIWMT 100 +DE +V + L+ A D + + R A R + +T Sbjct: 78 LDEM--FVNLRGEPWLLWRAVDEHGAELDILLQKRRDKAAAKRSFQRVLRSCPAPCNIVT 135 Query: 101 DGWPLYES------RLKGKLHVISKRY---TQRIERHNLNLRQHLARLG-----RKSLSF 146 D Y + L HV K R E + R+ R+ +++ +F Sbjct: 136 DQLRSYPAAKAGIPELANVKHVFVKAAARVNNRAENSHQPTRERERRMRGFRDPKRTQAF 195 Query: 147 SKSVELHDKVIGH 159 S + Sbjct: 196 LASFGPIRQHFAL 208 >UniRef50_C5A9A4 Putative transposase n=1 Tax=Burkholderia glumae BGR1 RepID=C5A9A4_BURGB Length = 284 Score = 39.4 bits (90), Expect = 0.048, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 49/148 (33%), Gaps = 29/148 (19%) Query: 43 MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVF--------------GERTMATLG--RL 86 MDE Y+ A+ Q + A V+ + L ++ Sbjct: 128 MDELETYIHARWAQVGVPIAVRVKTGHVLGFGIALLSSNMKKGRLAGWTKDTRHLVVPKV 187 Query: 87 MSLLSPFDV--VIWMTDGWPLYE----SRLKGKLHVISKRY-------TQRIERHNLNLR 133 ++ L P TDG Y L G H I + LR Sbjct: 188 LNALRPVMKPGGTLATDGEASYPKWIARALPGVRHERRASGEPGEFDPLFTINLTHAKLR 247 Query: 134 QHLARLGRKSLSFSKSVELHDKVIGHYL 161 LARLGR+S + +K+++ D + ++ Sbjct: 248 NDLARLGRRSWATTKTMKALDDHLWLWV 275 >UniRef50_UPI00006CAE3F hypothetical protein n=1 Tax=Tetrahymena thermophila SB210 RepID=UPI00006CAE3F Length = 369 Score = 38.6 bits (88), Expect = 0.076, Method: Composition-based stats. 
Identities = 26/121 (21%), Positives = 41/121 (33%), Gaps = 17/121 (14%) Query: 54 SRQRWLFYAYDSLRKTVVAHVFG-ERTMATLGRLMSLLSPFDVVI---WMTDGWPLYESR 109 Q W Y+ + V G R L L L+P + + MTDGW Y Sbjct: 203 ESQIWAVGLYERGTEDFRVVVVGSNRNEDVLRNLFERLAPRNNNVLQRIMTDGWRGYSFL 262 Query: 110 L-KGKLHVI---------SKRYTQRIERHNLNLRQHLARLGR--KSLSFSKSVELHDKVI 157 G +H I + T IE ++ + + + K S ++ D+V Sbjct: 263 EGAGYVHDIINHDKGFGSGRYTTNHIENLWSRIK-SVGQFNKGWKCNSSEQAQLYVDEVC 321 Query: 158 G 158 Sbjct: 322 W 322 >UniRef50_B5WJN4 Integrase, catalytic region n=1 Tax=Burkholderia sp. H160 RepID=B5WJN4_9BURK Length = 239 Score = 38.6 bits (88), Expect = 0.087, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 51/174 (29%), Gaps = 28/174 (16%) Query: 10 RWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKT 69 RW Q K + G +DE Y+ + R +L+ A D +T Sbjct: 56 RWVQRYAPEFVKRWNRF-------GVPTGQSWRVDE--TYLKVRGRWVYLYRAVDRAGQT 106 Query: 70 VVAHVFGERTMATLGRLMSLL---SPFDVVIWMTDGWPLYESR--------LKGKLHVI- 117 V + + + DG+ L + + Sbjct: 107 VDFMLRAKGDVKAAKGFFRKALKHQGQPPKTITLDGYAASHRAVREMKEDGLPPEDTRVR 166 Query: 118 -SKRYTQRIERHNLNLRQHL------ARLGRKSLSFSKSVELHDKVIGHYLNIK 164 SK IE+ + N++ + RL +++ + + G + +K Sbjct: 167 SSKYLNDLIEQDHRNIKSRITVMLGFKRLRSATIALAGIELMLRIRKGQFNLVK 220 >UniRef50_Q10UW1 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10UW1_TRIEI Length = 138 Score = 38.2 bits (87), Expect = 0.095, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 18/41 (43%) Query: 119 KRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGH 159 K ++ER N +RQH R R+ F K + + + Sbjct: 3 KTIVLQLERINGIIRQHSGRWHRRQNKFGKLWQQTEVTVRL 43 >UniRef50_A8LH39 Integrase catalytic region n=26 Tax=Bacteria RepID=A8LH39_FRASN Length = 262 Score = 38.2 bits (87), Expect = 0.098, Method: Composition-based stats. 
 Identities = 24/101 (23%), Positives = 34/101 (33%), Gaps = 8/101 (7%)

Query: 43  MDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLS--PFDVVIWMT 100
           +DE   YV    R  +++ A D   + +       R  A   R            V   T
Sbjct: 112 VDE--TYVKVAGRWTYVYRAVDQHGQVIDVLASARRDQAAARRFFVRALSHGHRPVEVTT 169

Query: 101 DGWPLYES----RLKGKLHVISKRYTQRIERHNLNLRQHLA 137
           D  P+Y       L    HV + R   RIE  +  L+  L
Sbjct: 170 DKAPVYPRVLDEFLPEACHVDAARENNRIEADHGRLKARLR 210

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.310    0.127    0.376

Lambda     K      H
   0.267   0.0389    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,004,516,675
Number of Sequences: 3077464
Number of extensions: 34984925
Number of successful extensions: 94385
Number of sequences better than 1.0e-01: 138
Number of HSP's better than 0.1 without gapping: 183
Number of HSP's successfully gapped in prelim test: 96
Number of HSP's that attempted gapping in prelim test: 94026
Number of HSP's gapped (non-prelim): 298
length of query: 167
length of database: 1,040,396,356
effective HSP length: 119
effective length of query: 48
effective length of database: 674,178,140
effective search space: 32360550720
effective search space used: 32360550720
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 88 (38.6 bits)
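
The search-space and cutoff figures in the trailer above can be checked by hand. The following is a minimal sketch, not part of the BLAST output, and it rests on two assumptions: that the first "Lambda K H" block holds the ungapped Karlin-Altschul parameters and the second the gapped ones, and that the usual relations apply (effective length = length minus effective HSP length, bit score = (lambda * S - ln K) / ln 2, E = search space * 2^-bits). All function and constant names are illustrative.

```python
import math

# Values copied from the report trailer.
QUERY_LEN = 167
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 119  # "effective HSP length"

# First and second "Lambda K H" blocks; treating them as ungapped and
# gapped parameters respectively is an assumption, the report does not label them.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.310, 0.127
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0389


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    # The length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
s1_bits = bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED)  # report: S1: 41 (21.3 bits)
s2_bits = bit_score(88, LAMBDA_GAPPED, K_GAPPED)      # report: S2: 88 (38.6 bits)
e_at_s2 = expect_value(s2_bits, space)

print(eff_query, eff_db, space)
print(round(s1_bits, 1), round(s2_bits, 1))
print(e_at_s2)
```

Under these assumptions the three search-space figures and both bit scores agree with the trailer. The E-value computed at the S2 cutoff is only in the same range as the values listed for individual 38.6-bit hits (0.076 and 0.087), which is expected if composition-based statistics adjust the scoring parameters per subject sequence.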